Data loss when flume-ng-redis stops or restarts #13
Comments
Currently, there is no perfect solution for this case. Flume can be configured with multiplexing, but a multiplexing configuration replicates the data. Once the plugin is implemented using the Redis list structure, we can solve this problem.
https://github.com/fengpeiyuan/flumeng-plugins-redis
Yes, using the Redis list structure can solve the problem, and I have just implemented the plugin using the list structure as well. You can use it from the master branch. With the pub/sub implementation, you can also solve the problem by using multiple subscribers, but multiple subscriptions produce duplicated records. You can deal with the duplication in the stage before the collected data is used, known as ETL. I think pub/sub is faster than a list, but the list structure is enough in the common case. I have attached an article about this: https://davidmarquis.wordpress.com/2013/01/03/reliable-delivery-message-queues-with-redis/
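To illustrate the deduplication step mentioned above, here is a minimal sketch in Python. It assumes every record carries a unique `id` field assigned by the publisher; that field name and the `deduplicate` helper are hypothetical and are not part of flume-ng-redis.

```python
def deduplicate(records, key=lambda record: record["id"]):
    """Keep the first occurrence of each record, preserving arrival order.

    Assumption: `key` extracts a publisher-assigned unique identifier.
    """
    seen = set()
    unique = []
    for record in records:
        identifier = key(record)
        if identifier in seen:
            continue  # the same record arrived again from another subscriber
        seen.add(identifier)
        unique.append(record)
    return unique


# Two subscribers each delivered overlapping copies of the same events.
collected = [
    {"id": 1, "body": "a"},
    {"id": 2, "body": "b"},
    {"id": 1, "body": "a"},
    {"id": 3, "body": "c"},
    {"id": 2, "body": "b"},
]
cleaned = deduplicate(collected)
```

The set of seen identifiers grows with the input, so for a long-running pipeline this would probably need a bounded window or an external store.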
I have tested the plugin using the list structure: 2M/s, 11,700 records/s. Maybe we should make it faster.
Hi! Thanks for your effort in testing. :) But I don't fully understand your suggestion. I'll add this feature as an option. However, I'm preparing for my final exams at school right now, so I can probably implement this feature within 2-3 weeks. Thank you.
publish ---->redis---->flume-ng-redis(source)
When the flume-ng-redis source stops for some reason, the data published during that period is lost.
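The difference between the two delivery models can be sketched with an in-memory simulation in Python. It does not talk to a real Redis server: `PubSubChannel` and `ListQueue` are hypothetical stand-ins that only mimic the relevant behavior (pub/sub delivers to currently connected subscribers only, while a list keeps items until they are popped).

```python
from collections import deque


class PubSubChannel:
    """Stand-in for a pub/sub channel: messages go only to current subscribers."""

    def __init__(self):
        self.subscribers = []

    def publish(self, message):
        for inbox in self.subscribers:
            inbox.append(message)
        return len(self.subscribers)  # number of receivers, 0 means dropped


class ListQueue:
    """Stand-in for a list used as a queue: push on the left, pop on the right."""

    def __init__(self):
        self.items = deque()

    def lpush(self, message):
        self.items.appendleft(message)

    def rpop(self):
        return self.items.pop() if self.items else None


def simulate(total=6, down=range(2, 4)):
    """Publish `total` events while the source is stopped for the steps in `down`."""
    channel, queue = PubSubChannel(), ListQueue()
    inbox = []
    from_pubsub, from_list = [], []
    for step in range(total):
        running = step not in down
        channel.subscribers = [inbox] if running else []
        message = f"event-{step}"
        channel.publish(message)
        queue.lpush(message)
        if running:
            # The source drains whatever it can see while it is up.
            from_pubsub.extend(inbox)
            inbox.clear()
            while (item := queue.rpop()) is not None:
                from_list.append(item)
    return from_pubsub, from_list


from_pubsub, from_list = simulate()
```

In this run the pub/sub consumer never sees `event-2` and `event-3`, while the list consumer picks them up after the restart.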