Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Fix logs in oai-backend and cassandra-oai #602

Open
1 task
eloiferrer opened this issue Dec 17, 2024 · 3 comments
Open
1 task

Fix logs in oai-backend and cassandra-oai #602

eloiferrer opened this issue Dec 17, 2024 · 3 comments
Assignees
Labels
enhancement New feature or request

Comments

@eloiferrer
Copy link
Member

eloiferrer commented Dec 17, 2024

Issue description:
It looks like the oai-backend service is currently saving almost 200 GB of logs in a backend-logs volume:

- backend-logs:/usr/local/tomcat/logs

Also cassandra-oai has a volume named cassandra-data

- cassandra-data:/var/lib/cassandra/
with more than 30 GB. I am not sure if these are also just logs.

Could someone check if this is necessary and change the configuration if it is not?

For context, these two volumes were taking >50GB and >60GB in staging and causing space-related problems.

TODOS:

  • decrease log level of tomcat once development of oai-importer has been completed.
@eloiferrer eloiferrer added the enhancement New feature or request label Dec 17, 2024
@physikerwelt
Copy link
Member

The configuration looks good to me. 30GB of data seems okay, and 200GB of log storage is not unexpected overall. Once the development is done we can decrease the log-level.

@eloiferrer
Copy link
Member Author

I might be wrong, but I thought the development for this was taking place in staging. These logs currently take half the size of our database, if they are not being actively used it would be helpful if we can deactivate them in production @physikerwelt @Mazztok45

@Mazztok45
Copy link
Contributor

I didn’t deactivate the logs, but removed them since they weren’t crucial to our current work. I recognize this isn’t a sustainable long-term approach. As a future step, many .txt logs could be stored in Parquet or Arrow formats to compress and secure them, reducing disk usage.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants