System architecture questions #69

Closed
lars-t-hansen opened this issue Sep 15, 2023 · 1 comment
Labels
discussion Further information is requested

Comments

@lars-t-hansen
Collaborator

We're starting to have a lot of moving parts and we're going to need to integrate them at some point. See also #19, #53, and #57. Summarizing:

  • sonar runs (by cron) on the ML nodes and writes output to files on a disk, call it D, which is available from the ML nodes.
  • sonar will likewise run on every node on Fox and write output to a disk, call it F, that is available from Fox.
  • (and maybe more systems)
  • sonalyze runs on some machine that has access to D and/or F. It is started manually, by cron (to run the bughunt/cpuhog scripts), or by naicreport (to produce load data); naicreport is in turn run by cron. D and F could be the same disk or separate disks, and sonalyze could run in multiple places on different logs.
  • some of those sonalyze runs are a little expensive, and it may or may not be a good idea to run them directly on the HPC/ML hardware
  • the raw output data produced by sonalyze and naicreport are consumed by email systems and web servers
  • in particular a web server needs to serve data produced by naicreport

Questions:

  • where do we store and how do we share log data (shared disk or some kind of rsync)?
  • where do we run the various scripts and programs that process log data and generate other data?
  • where do we run the web server that will serve some of the generated data?
  • are these places the same for the ML nodes and Fox?

There are probably many more questions about the use of the system, so this is just the start of a conversation.

@lars-t-hansen added the discussion and design labels Sep 15, 2023
@lars-t-hansen
Collaborator Author

The questions have been answered:

  • we store and share the log data (at least for the ML nodes) on /itf-fi-ml/shared/users/sonarmon, probably
  • scripts run on moneypenny
  • the web server runs on a VM with a static, public IP
  • we'll get to Fox when we get to it, but I believe there could be a completely parallel setup for Fox without this impacting any of the infrastructure at all
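
For reference, the answers above could be written down as data, so that a parallel Fox setup is just a second record. This is only a sketch: the `ClusterSetup` class, its field names, and the `undecided` helper are hypothetical and not part of sonar, sonalyze, or naicreport. The log directory was given as "probably", and no hostname was given for the web server VM, so that field holds a description rather than a real name.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class ClusterSetup:
    """Where each stage runs for one cluster (hypothetical model)."""
    name: str
    log_dir: Optional[str]        # where sonar output is stored and shared
    analysis_host: Optional[str]  # where the sonalyze/naicreport scripts run
    web_host: Optional[str]       # where the generated data is served from


# Values for the ML nodes come from the answers in this thread.
ML_NODES = ClusterSetup(
    name="ml",
    log_dir="/itf-fi-ml/shared/users/sonarmon",  # stated as "probably"
    analysis_host="moneypenny",
    web_host="VM with a static, public IP",  # actual hostname not given
)

# Fox has not been decided yet; None marks a location as still open.
FOX = ClusterSetup(name="fox", log_dir=None, analysis_host=None, web_host=None)


def undecided(setup: ClusterSetup) -> List[str]:
    """Return the names of the locations that have no value yet."""
    fields = ("log_dir", "analysis_host", "web_host")
    return [f for f in fields if getattr(setup, f) is None]
```

Calling `undecided(FOX)` lists every location, which matches the state of this thread: nothing about Fox is settled, and filling in its record would not touch the ML record.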
