add script to automatically update source primary_language #11

Open
rahulbot opened this issue Jan 14, 2025 · 0 comments
Labels
enhancement New feature or request

Comments

@rahulbot

Our sources each have a primary_language field. We should fill that in regularly, for example by pulling 1000 random stories from the last year and taking the most commonly detected language. Perhaps skip sources that have fewer than 100 articles, since that's an indicator that we don't get regular data from them? This should probably be a cron-based task run once every few months. It's a simple data science script, so I'm logging it here, but it's probably best implemented as a Django management command like update-stories-per-week.
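A minimal sketch of the selection logic described above, kept free of Django so it can be tested on its own. The function name `pick_primary_language`, its parameters, and the choice to apply the 100-article threshold to the stories seen in the last year are assumptions made for illustration; fetching each source's stories and saving the result to `primary_language` would live in the management command and are not shown.

```python
import random
from collections import Counter
from typing import Optional, Sequence

SAMPLE_SIZE = 1000  # "pulling 1000 random stories" from the issue
MIN_STORIES = 100   # skip sources below this many articles


def pick_primary_language(
    story_languages: Sequence[Optional[str]],
    sample_size: int = SAMPLE_SIZE,
    min_stories: int = MIN_STORIES,
    rng: Optional[random.Random] = None,
) -> Optional[str]:
    """Return the most common detected language, or None to leave the field alone.

    `story_languages` is assumed to hold one detected language code per story
    published by the source in the last year (None where detection failed).
    """
    # Too few stories suggests we do not get regular data from this source.
    if len(story_languages) < min_stories:
        return None

    stories = list(story_languages)
    rng = rng or random.Random()
    # Sample only when there are more stories than the sample size.
    if len(stories) > sample_size:
        stories = rng.sample(stories, sample_size)

    # Ignore stories with no detected language.
    detected = [lang for lang in stories if lang]
    if not detected:
        return None

    return Counter(detected).most_common(1)[0][0]
```

A management command could call this once per source and update `primary_language` only when the result is not None, so sparse sources keep whatever value they already have.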

@rahulbot rahulbot added the enhancement New feature or request label Jan 14, 2025