Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Canisterize logger and maybe database connection #10

Open
moseshll opened this issue May 7, 2024 · 0 comments
Open

Canisterize logger and maybe database connection #10

moseshll opened this issue May 7, 2024 · 0 comments

Comments

@moseshll
Copy link
Contributor

moseshll commented May 7, 2024

The logger part is easy, the database connection maybe not so much.

moseshll added a commit that referenced this issue Sep 6, 2024
* DEV-1335 Make hathifiles_database Date Independent
- Add `Log` class for recording file + timestamp in `hf_log` table.
- Add `Hathifiles` class for producing agenda of files to load.
- Add `exe/hathifiles_database_full_update` script for bringing database up to date.
- Update README with `exe/` inventory and notes.
- The existing code was getting too many false changes on ISSNs in monthly delta
  - Loosened restriction on input file format to accommodate more database-like values (allow 0/1 for `access`)
  - Add more tests for round-trip data fidelity -- one should be able to load any hathifile, and the delta with itself should be empty.
- Address Dependabot #10 REXML denial of service vulnerability
- TIDY
  - Remove dead code after __END__ blocks
  - Address issue #11 Remove wait-for and use healthchecks
- Address #8 add prometheus / pushgateway
  - Batch up the calls to milemarker instead of calling for each INSERT
- Monthly update bucket chain must `sort` after `cut` to keep `comm` happy.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant