[WIP][feature][spark] Support streaming #7476

Draft

CheneyYin wants to merge 2 commits into dev from support-spark-streaming

Conversation

@CheneyYin (Contributor) commented Aug 23, 2024

Purpose of this pull request

Support streaming for the Spark engine.
Related:

Does this PR introduce any user-facing change?

How was this patch tested?

Check list

@CheneyYin CheneyYin marked this pull request as draft August 23, 2024 12:10
@github-actions github-actions bot added core SeaTunnel core module Spark labels Aug 23, 2024
@hailin0 (Member) commented Aug 23, 2024

cc @Carl-Zhou-CN

@CheneyYin (Contributor, Author) commented:

public class SeaTunnelMicroBatchPartitionReader implements PartitionReader<InternalRow> {

    private final ParallelBatchPartitionReader partitionReader;

    public SeaTunnelMicroBatchPartitionReader(ParallelBatchPartitionReader partitionReader) {
        this.partitionReader = partitionReader;
    }

    @Override
    public boolean next() throws IOException {
        return partitionReader.next();
    }

    @Override
    public InternalRow get() {
        return partitionReader.get();
    }

    @Override
    public void close() throws IOException {
        partitionReader.close();
    }
}

// Blocks until data arrives or the reader is stopped; returns false only when
// `running` is false and the handover buffer has been drained.
public boolean next() throws IOException {
    prepare();
    while (running && handover.isEmpty()) {
        try {
            Thread.sleep(INTERVAL);
        } catch (InterruptedException e) {
            throw new RuntimeException(e);
        }
    }
    return running || !handover.isEmpty();
}

The PartitionReader is never closed in streaming mode.

@CheneyYin CheneyYin force-pushed the support-spark-streaming branch 8 times, most recently from 1b1d744 to 6392a37 Compare August 27, 2024 11:30
@github-actions github-actions bot added the api label Aug 27, 2024
@Carl-Zhou-CN (Member) commented:

> The PartitionReader is never closed in streaming mode.

Hi @CheneyYin, it seems that after a checkpoint it will be closed.

@CheneyYin (Contributor, Author) commented:

> Hi @CheneyYin, it seems that after a checkpoint it will be closed.

Yes. If the reader does not receive new data for a long time, Spark will end the current micro-batch. Spark's micro-batch mechanism does not fully meet the requirements of long-term streaming computation. First, creating a new reader for the next batch incurs some overhead. Second, the granularity of fault recovery is too coarse: the Spark micro-batch mechanism cannot restore the reader from the latest snapshot of the SeaTunnel reader.
I am looking for strategies to alleviate these problems while still ensuring fault recovery. Currently, I add metadata to the SeaTunnel row and use a special identifier to represent a checkpoint event. After the source completes a checkpoint, it creates a checkpoint record and sends it downstream. After receiving the checkpoint record, the sink saves its snapshot and confirms the checkpoint prepared by the source. These checkpoint operations are performed in a directory space on the file system.
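
For illustration, here is a minimal sketch of how such a checkpoint marker could be carried in row metadata. The CHECKPOINT_MARKER key and the CheckpointRecords helper are hypothetical names for this sketch, not code from this PR.

import java.util.HashMap;
import java.util.Map;

// Hypothetical sketch: a checkpoint event travels downstream as metadata on the
// row stream instead of as ordinary data. All names are illustrative only.
public final class CheckpointRecords {

    // Special metadata key that marks a record as a checkpoint event, not data.
    public static final String CHECKPOINT_MARKER = "seatunnel.checkpoint.id";

    // The source emits this after it completes a checkpoint.
    public static Map<String, String> checkpointRecord(long checkpointId) {
        Map<String, String> metadata = new HashMap<>();
        metadata.put(CHECKPOINT_MARKER, Long.toString(checkpointId));
        return metadata;
    }

    // The sink inspects each record; on a marker it snapshots its state and
    // acknowledges the checkpoint that the source prepared.
    public static boolean isCheckpoint(Map<String, String> metadata) {
        return metadata != null && metadata.containsKey(CHECKPOINT_MARKER);
    }
}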

@CheneyYin (Contributor, Author) commented:

The checkpoint space looks like this:

./
├── commits
│   ├── ...
│   ├── 10
│   ├── 11
│   ├── 12
│   ├── 13
│   ├── 14
│   ├── 15
│   └── ...
├── metadata
├── offsets
│   ├── ...
│   ├── 10
│   ├── 11
│   ├── 12
│   ├── 13
│   ├── 14
│   ├── 15
│   ├── 16
│   └── ...
└── sources
    └── 0
        ├── ...
        ├── 11
        │   ├── 0
        │   │   └── 0.committed
        │   └── 1
        │       └── 0.committed
        ├── 12
        │   ├── 0
        │   │   └── 0.committed
        │   └── 1
        │       └── 0.committed
        ├── 13
        │   ├── 0
        │   │   └── 0.committed
        │   └── 1
        │       └── 0.committed
        ├── 14
        │   ├── 0
        │   │   └── 0.committed
        │   └── 1
        │       └── 0.committed
        ├── 15
        │   ├── 0
        │   │   └── 0.committed
        │   └── 1
        │       └── 0.committed
        └── ...
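
For illustration, a minimal sketch of how a source subtask could publish one of the .committed markers shown above. The path layout sources/<sourceId>/<checkpointId>/<subtaskId>/<attempt>.committed follows the tree; the CheckpointSpace class and markCommitted method are hypothetical, not code from this PR.

import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;

// Hypothetical sketch: commit markers are empty files whose existence confirms
// a checkpoint in the file-system directory space. Names are illustrative only.
public final class CheckpointSpace {

    private final Path root; // e.g. the streaming query's checkpoint location

    public CheckpointSpace(Path root) {
        this.root = root;
    }

    // Writes sources/<sourceId>/<checkpointId>/<subtaskId>/<attempt>.committed.
    public void markCommitted(int sourceId, long checkpointId, int subtaskId, int attempt)
            throws IOException {
        Path dir = root.resolve(Paths.get(
                "sources",
                String.valueOf(sourceId),
                String.valueOf(checkpointId),
                String.valueOf(subtaskId)));
        Files.createDirectories(dir);
        // An empty marker is enough; createFile fails if it already exists,
        // which surfaces a duplicate commit attempt.
        Files.createFile(dir.resolve(attempt + ".committed"));
    }
}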

@CheneyYin (Contributor, Author) commented:

> Hi @CheneyYin, it seems that after a checkpoint it will be closed.

next() never returns false unless close() is called. However, Spark's MicroBatchExecution calls close() only after next() returns false, so the reader never stops and never commits the batch.
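
One possible way out, sketched under assumptions: give the reader a boundary signal so that next() can return false on its own, letting Spark reach close(). The checkpointReached flag and whatever sets it are hypothetical, not part of this PR.

// Hypothetical sketch: next() also terminates at a checkpoint boundary instead
// of waiting for close() to flip `running`. `checkpointReached` is illustrative.
public boolean next() throws IOException {
    prepare();
    while (running && !checkpointReached && handover.isEmpty()) {
        try {
            Thread.sleep(INTERVAL);
        } catch (InterruptedException e) {
            Thread.currentThread().interrupt();
            throw new RuntimeException(e);
        }
    }
    // Drain buffered rows first; once the buffer is empty and either the boundary
    // was reached or the reader was stopped, return false so Spark calls close().
    return !handover.isEmpty() || (running && !checkpointReached);
}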

@Carl-Zhou-CN (Member) commented:

> Yes. If the reader does not receive new data for a long time, Spark will end the current micro-batch. [...]

Yes, the micro-batch process cannot meet the requirements of streaming.

@CheneyYin CheneyYin force-pushed the support-spark-streaming branch 5 times, most recently from 2f11b11 to d3d78b3 Compare August 29, 2024 12:47
@Carl-Zhou-CN (Member) commented:

> next() never returns false unless close() is called. However, Spark's MicroBatchExecution calls close() only after next() returns false, so the reader never stops and never commits the batch.

Yes, you're right.

@Carl-Zhou-CN (Member) commented:

> Yes. If the reader does not receive new data for a long time, Spark will end the current micro-batch. [...]

I think this pattern would be more like Spark's continuous streaming mode, but it seems to completely lack fault tolerance.

@CheneyYin (Contributor, Author) commented Aug 30, 2024

> I think this pattern would be more like Spark's continuous streaming mode, but it seems to completely lack fault tolerance.

It can ensure end-to-end at-least-once semantics. If the sink is idempotent when handling reprocessed data, it can ensure exactly-once. At present, Spark's continuous streaming execution mode is still experimental and itself guarantees only at-least-once fault tolerance.
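
As a minimal illustration of the idempotence point: a sink that writes each record under a deterministic key derived from the batch and partition makes replayed records overwrite themselves instead of duplicating. The table, columns, and PostgreSQL-style upsert below are hypothetical, not part of this PR.

import java.sql.Connection;
import java.sql.PreparedStatement;
import java.sql.SQLException;

// Hypothetical sketch: an upsert keyed by (batch_id, partition_id, row_index)
// turns at-least-once replay into an exactly-once effect. Names are illustrative.
public final class IdempotentSinkWriter {

    private static final String UPSERT_SQL =
            "INSERT INTO sink_table (batch_id, partition_id, row_index, payload) "
                    + "VALUES (?, ?, ?, ?) "
                    + "ON CONFLICT (batch_id, partition_id, row_index) "
                    + "DO UPDATE SET payload = EXCLUDED.payload";

    public void write(Connection conn, long batchId, int partitionId, long rowIndex,
                      String payload) throws SQLException {
        try (PreparedStatement ps = conn.prepareStatement(UPSERT_SQL)) {
            ps.setLong(1, batchId);
            ps.setInt(2, partitionId);
            ps.setLong(3, rowIndex);
            ps.setString(4, payload);
            // Replaying the same (batchId, partitionId, rowIndex) rewrites the row.
            ps.executeUpdate();
        }
    }
}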

@CheneyYin CheneyYin force-pushed the support-spark-streaming branch 2 times, most recently from 52bb0ae to f0eced2 Compare September 3, 2024 11:24
@hailin0 (Member) commented Sep 10, 2024

❤️

@CheneyYin CheneyYin force-pushed the support-spark-streaming branch 5 times, most recently from 9630cd0 to 37b6533 Compare September 10, 2024 06:00