Sftp refactor #3073

ooesili · 2024-12-11T21:02:48Z

Before this commit, when a file was exuasted the ReadBatch method
returned ErrNotConnected which cause the engine to call Connect again.
Aside from being awkward, this causes the connection status to
incorrectly be reported as disconnected during normal operation.

This commit moves the logic to advance to the next file when the current
file is exhuasted into a the ReadBatch method.

Builds on top of #2435

This commit reduces the scope of critical sections guarded by scannerMut to remove a deadlock that causes the last file to not be deleted when the SFTP input is used with watching enabled.

`(*watcherPathProvider).Next()` currently uses recursion to loop until a path is found. This commit refactors that function to use a for loop instead which is more straight forward to read.

This integration test makes sure that when `delete_on_finish` is true and watching is enabled that we delete every file.

Before this commit, when a file was exuasted the `ReadBatch` method returned ErrNotConnected which cause the engine to call `Connect` again. Aside from being awkward, this causes the connection status to incorrectly be reported as disconnected during normal operation. This commit moves the logic to advance to the next file when the current file is exhuasted into a the ReadBatch method.

The v2 suffix was added to some functions during the recent refactor and they were accidentally left in place.

mihaitodor

Hey @ooesili, thanks for bearing with me for a review on this! Really awesome job man 🏆 This does make it easier to go through the code. I think I saw a few potential issues, but should be good otherwise.

I think @rockwotj also left a few comments in #3037. Please have a look and then I'm happy to merge both PRs.

internal/impl/sftp/input.go

internal/impl/sftp/integration_test.go

ReadBatch was holding the state lock the while it polled for new files, which blocked AckFns from cleaning up successfully processed files when deleteOnFinish is set to true.

Jeffail

Found a couple of concurrency issues, I don't think either would actually be encountered in practice, but it's better to hold the locks for longer and be sure.

Jeffail · 2025-01-08T17:02:53Z

internal/impl/sftp/input.go

+	s.stateLock.Lock()
+	defer s.stateLock.Unlock()
+
+	parts, codecAckFn, err := s.scanner.NextBatch(ctx)


I'm not sure it's actually possible to be hit in practice but there's a race condition here, as the lock around s.scanner is yielded within initScanner before the lock is re-acquired, so there's opportunity for another goroutine to set s.scanner to nil in that gap. If you want to avoid this then have initScanner return the scanner pointer and use that reference.

Wow good catch. I'll do this

Jeffail · 2025-01-08T17:32:50Z

internal/impl/sftp/input.go

+			return fmt.Errorf("creating scanner: %w", err)
+		}
+
+		s.stateLock.Lock()


Re-acquiring the lock here means there's a period of time where another goroutine could potentially have used the client to open a file, create a scanner and assign it, and now this goroutine is going to overwrite s.scanner, which means it's lost. I think in terms of performance you should be fine holding the lock for the entire method call.

The problem holding the lock for the entire call is that in watch mode it will hold the lock open while s.pathProvider.Next() blocks and waits for a new file, which will cause AckFns to block because they need to grab the lock for a bit too to access s.pathProvider

mihaitodor

Nice job @ooesili! 🏆 Thanks for bearing with us on this review! The pool implementation looks like a great idea!

I left a small comment and please don't forget to add a note in the Changelog.

Feel free to merge if the stuff Ash mentioned is sorted.

internal/impl/sftp/input.go

This prevents a race condition between two calls to ReadBatch clobbering each other.

ooesili added 6 commits December 3, 2024 14:33

fix(sftp): fix polling logic in watcher

a36f25e

fix(sftp): fix deadlock so last file is deleted

68eba81

This commit reduces the scope of critical sections guarded by scannerMut to remove a deadlock that causes the last file to not be deleted when the SFTP input is used with watching enabled.

refactor(sftp): use for loop in watcher provider

259d12b

`(*watcherPathProvider).Next()` currently uses recursion to loop until a path is found. This commit refactors that function to use a for loop instead which is more straight forward to read.

fix(sftp): reduce mutex scope even further

18b29aa

test(sftp): add test for delete-on-finish bug

ab133f4

This integration test makes sure that when `delete_on_finish` is true and watching is enabled that we delete every file.

ooesili added the inputs Any tasks or issues relating specifically to inputs label Dec 11, 2024

ooesili requested review from mihaitodor, Jeffail and rockwotj December 11, 2024 21:02

ooesili self-assigned this Dec 11, 2024

fix(sftp): work around shutdown concurrency issue

d7a6609

ooesili marked this pull request as ready for review December 12, 2024 17:04

chore(sftp): remove v2 suffix from refactored code

0142589

The v2 suffix was added to some functions during the recent refactor and they were accidentally left in place.

mihaitodor reviewed Dec 17, 2024

View reviewed changes

internal/impl/sftp/input.go Show resolved Hide resolved

internal/impl/sftp/input.go Show resolved Hide resolved

internal/impl/sftp/input.go Outdated Show resolved Hide resolved

mihaitodor reviewed Dec 17, 2024

View reviewed changes

internal/impl/sftp/integration_test.go Outdated Show resolved Hide resolved

fix(sftp): reduce criticl sections of mutexes

b22a741

ReadBatch was holding the state lock the while it polled for new files, which blocked AckFns from cleaning up successfully processed files when deleteOnFinish is set to true.

Jeffail reviewed Jan 8, 2025

View reviewed changes

refactor(sftp): create clientPool to manage connection status

a2838f1

mihaitodor approved these changes Jan 9, 2025

View reviewed changes

internal/impl/sftp/input.go Outdated Show resolved Hide resolved

fix(sftp): return scanner from (*sftpReader).initScanner

9cf4382

This prevents a race condition between two calls to ReadBatch clobbering each other.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Sftp refactor #3073

Sftp refactor #3073

ooesili commented Dec 11, 2024

mihaitodor left a comment •

edited

Loading

Jeffail left a comment

Jeffail Jan 8, 2025

ooesili Jan 8, 2025

Jeffail Jan 8, 2025

ooesili Jan 8, 2025

mihaitodor left a comment

Sftp refactor #3073

Are you sure you want to change the base?

Sftp refactor #3073

Conversation

ooesili commented Dec 11, 2024

mihaitodor left a comment • edited Loading

Choose a reason for hiding this comment

Jeffail left a comment

Choose a reason for hiding this comment

Jeffail Jan 8, 2025

Choose a reason for hiding this comment

ooesili Jan 8, 2025

Choose a reason for hiding this comment

Jeffail Jan 8, 2025

Choose a reason for hiding this comment

ooesili Jan 8, 2025

Choose a reason for hiding this comment

mihaitodor left a comment

Choose a reason for hiding this comment

mihaitodor left a comment •

edited

Loading