You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Please vote on this issue by adding a 👍 reaction to the original issue to help the community and maintainers prioritize this request
If you are interested in working on this issue or have submitted a pull request, please leave a comment
Problem
I'm trying to build pipeline Logs -> Vector sink GCS -> Google Cloud storage and then process these GCS files again by vector (on demand) via Vector source File -> Logs.
The problem is the fact log entries loss is constantly happening. The last line of each log file is omitted.
With reference to #18341, I've understood this is expected behavior of File source.
However, the file(s) / chunks are produced by vector, and therefore I believe there's a bug in GCS sink.
INFO vector::sources::file_descriptors: Capturing stdin.
INFO vector: Vector has started. debug="false" version="0.35.0" arch="x86_64" revision="e57c0c0 2024-01-08 14:42:10.103908779"
foo
baz
bar
A note for the community
Problem
I'm trying to build pipeline Logs -> Vector sink GCS -> Google Cloud storage and then process these GCS files again by vector (on demand) via Vector source File -> Logs.
The problem is the fact log entries loss is constantly happening. The last line of each log file is omitted.
With reference to #18341, I've understood this is expected behavior of File source.
However, the file(s) / chunks are produced by vector, and therefore I believe there's a bug in GCS sink.
Configuration
Version
0.35.0
Debug Output
No response
Example Data
INFO vector::sources::file_descriptors: Capturing stdin.
INFO vector: Vector has started. debug="false" version="0.35.0" arch="x86_64" revision="e57c0c0 2024-01-08 14:42:10.103908779"
foo
baz
bar
laptop$ gsutil cp -r 'gs://test/**' /tmp/logs
laptop$ cat -vE /tmp/logs/14:11:00.log
{"host":"laptop","message":"foo","source_type":"stdin","timestamp":"2024-01-17T14:11:00.270743111Z"}$
{"host":"laptop","message":"baz","source_type":"stdin","timestamp":"2024-01-17T14:11:01.549495036Z"}$
{"host":"laptop","message":"bar","source_type":"stdin","timestamp":"2024-01-17T14:11:02.518769996Z"}laptop$
2024-01-17T14:12:48.472079Z INFO vector: Vector has started. debug="false" version="0.35.0" arch="x86_64" revision="e57c0c0 2024-01-08 14:42:10.103908779"
2024-01-17T14:12:48.472139Z INFO source{component_kind="source" component_id=logs component_type=file}: vector::sources::file: Starting file server. include=["/tmp/logs/*"] exclude=[]
2024-01-17T14:12:48.472724Z INFO source{component_kind="source" component_id=logs component_type=file}:file_server: file_source::checkpointer: Attempting to read legacy checkpoint files.
2024-01-17T14:12:48.483353Z INFO source{component_kind="source" component_id=logs component_type=file}:file_server: vector::internal_events::file::source: Found new file to watch. file=/tmp/logs/14:1100.log
{"file":"/tmp/logs/14:11:00.log","host":"laptop","message":"{"host":"laptop","message":"foo","source_type":"stdin","timestamp":"2024-01-17T14:11:00.270743111Z"}","source_type":"file","timestamp":"2024-01-17T14:12:48.483668135Z"}
{"file":"/tmp/logs/14:11:00.log","host":"laptop","message":"{"host":"laptop","message":"baz","source_type":"stdin","timestamp":"2024-01-17T14:11:01.549495036Z"}","source_type":"file","timestamp":"2024-01-17T14:12:48.483709234Z"}
Additional Context
No response
References
No response
The text was updated successfully, but these errors were encountered: