Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Integrate inspector dashboard json #3

Open
gsxryan opened this issue Jul 16, 2019 · 7 comments
Open

Integrate inspector dashboard json #3

gsxryan opened this issue Jul 16, 2019 · 7 comments
Labels
enhancement New feature or request postpone delay until

Comments

@gsxryan
Copy link
Owner

gsxryan commented Jul 16, 2019

image
image

@gsxryan
Copy link
Owner Author

gsxryan commented Jul 18, 2019

image

Append -p 7777:7777 to the run command

gsxryan pushed a commit that referenced this issue Jul 19, 2019
@gsxryan gsxryan added the enhancement New feature or request label Jul 19, 2019
@JeromeGillard
Copy link
Contributor

Any idea where this /app/config/inspector might be?

@gsxryan
Copy link
Owner Author

gsxryan commented Jul 19, 2019

It's inside the storagenode container.

@JeromeGillard
Copy link
Contributor

They seem to build the GO inspector from Storj source code. I guess they run it in the SNO container by binding the inspector binary from the host machine. (Correct me if I am wrong, that's just how I would do that).

As my Telegraf container doesn't have golang binaries, I can't do that and keep being portable with official SNO and Telegraf Docker images.

I've heard the SNO dashboard (web interface) will be issued soon. I hope we will be able to get more easily many information we strive to compute.

I would wait for that release.

@gsxryan
Copy link
Owner Author

gsxryan commented Jul 20, 2019

I imagined capturing within the container, and dumping it somewhere outside the container for access. I do agree with you, this should be tabled until checking out the SNO board.

@gsxryan gsxryan added the postpone delay until label Jul 27, 2019
@gsxryan
Copy link
Owner Author

gsxryan commented Aug 2, 2019

I think the most critical attribute to follow here would be the "Time since last online" metric. Some SNOs report that they get high values here, and the only way to fix it is to reboot the node. If we can monitor this, we could schedule proactive reboots to remedy it, and alerts to monitor how often it occurs.

Alternatively, it likely goes along with low throughput, so that could be monitored instead.

@gsxryan
Copy link
Owner Author

gsxryan commented Aug 31, 2019

@gsxryan gsxryan mentioned this issue Nov 24, 2019
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request postpone delay until
Projects
None yet
Development

No branches or pull requests

2 participants