Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

DOCS-3499: Improve capture edge data #3956

Merged

Conversation

sguequierre
Copy link
Contributor

Working with Jack/Devin

@viambot viambot added the safe to build This pull request is marked safe to build from a trusted zone label Jan 31, 2025
Copy link

netlify bot commented Jan 31, 2025

Deploy Preview for viam-docs ready!

Name Link
🔨 Latest commit 0613d85
🔍 Latest deploy log https://app.netlify.com/sites/viam-docs/deploys/67a234b8f055700008000dc5
😎 Deploy Preview https://deploy-preview-3956--viam-docs.netlify.app
📱 Preview on mobile
Toggle QR Code...

QR Code

Use your smartphone camera to open QR code link.
Lighthouse
Lighthouse
1 paths audited
Performance: 29 (🟢 up 7 from production)
Accessibility: 98 (no change from production)
Best Practices: 100 (no change from production)
SEO: 92 (no change from production)
PWA: 70 (no change from production)
View the detailed breakdown and full score reports

To edit notification comments on pull requests, go to your Netlify site configuration.

- A data management {{< glossary_tooltip term_id="service" text="service" >}} that writes captured data to local edge device storage and syncs that data with the cloud.
- Individual {{< glossary_tooltip term_id="resource" text="resource" >}} configurations that specify what data to capture and how often.

## How data capture and data sync works
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Please make sure there is at least an intro sentence when you have bullet points.

Worth considering if the explanation of how it works is actually what's needed this early in the page. Do we expect people to come here primarily for setup or for explanation?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

IMO the old page was very setup oriented so I think more of an explanation is worth trying and presenting to eliot

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't view them as mutually exclusive; imo a quick overview / explanation of the system before getting into specific set up steps helps contextualize the set up work

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sounds good. It's always a trade off between context and pushing the content people may be trying to get to below the fold. If you both in the future want to check data on this you can always look at clarity data.

only other consideration is whether you want to link to the more detailed how capture & sync works page. Please feel free to resolve once you've considered this

- Data is synced to the Viam cloud at a configured sync interval using encrypted gRPC calls and deleted from the disk once synced.
- You can capture and sync data independently, one can run without the other.

## Set up data capture and data sync
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Headings should never be followed by another heading like this without text in between. This generally indicates a content design issue.

In this case I wouldn't be surprised it you'd also find that people accidentally skip the step to add the data mgmt service. There is nothing helping the reader to avoid this from happening so when they're just skimming, they could easily miss it.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thank you for the explanation!

Comment on lines 63 to 66
The data management service is responsible for:

- Writing data from your configured Viam resources to local storage on your edge device
- Syncing data from the edge device to the cloud
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is explanatory content that you've put with a heading where readers would expect pragmatic content. This info should likely be with the "How data capture and data sync works" section.

To start, configure the data management service to capture and sync data.
Data capture and sync in Viam involves two key pieces:

- A data management {{< glossary_tooltip term_id="service" text="service" >}} that writes captured data to local edge device storage and syncs that data with the cloud.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Unless this has changed, there can only be one data management service, that's why it should be a definite article.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

good point


## Configure data capture
Configure data capture for individual resources by:
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think you need to add where they'd do this though.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think those instructions are right below in the tabs. Do we need to repeat them twice?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

still adding

Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thx


The following components and services support data capture and cloud sync:
### Direct to MongoDB capture
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I don't understand this headline - maybe "Capture directly to MongoDB"?

You can configure direct capture of tabular data to a MongoDB instance alongside disk storage on your edge device.
This can be useful for powering real-time dashboards before data is synced from the edge to the cloud.

Configure using `mongo_capture_config` attributes in your data manager service.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

grammar?

## Stop data capture

If this is a test project, make sure you stop data capture to avoid charges for a large amount of unwanted data.
If you don't need to capture data, for instance in a test scenario, you can turn off data capture to reduce unnecessary storage.
Alternatively, see [advanced data capture and sync configurations](/data-ai/capture-data/capture-sync/#advanced-data-capture-and-sync-configurations) for other ways to control data usage, such as conditional sync or retention policies.
Copy link
Collaborator

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

nice

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@jdamon96 did you want to keep this section above? I'm in favor of keeping it along with the text you added below

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

yes - along w/ the text added - agree!

@npentrel npentrel dismissed their stale review February 3, 2025 16:29

Dismissing for now - will re-review once finalized

@@ -705,25 +671,102 @@ To view all the captured data you have access to, go to the [**DATA** tab](https

You can also access data from a resource or machine part menu.
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Can we lift this above the 'Stop data capture' section? imo that order makes more sense

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

integrating Devin's feedback re also informing user how to stop data sync, perhaps we should update the header to below & then add the equivalent language of how to toggle off data sync (+ intro comma phrases now that we're describing both), e.g.:

Stop data capture and sync

To turn off data capture for a specific resource's capture method (e.g., a Camera component capturing via the GetImage capture method) navigate to the Data capture section of your resource's configuration card and toggle the configured capture method's switch to Off . You can also globally turn off data capture on the data manager service configuration card by toggling the Capturing switch to Off.

To turn off data sync, navigate to the data manager service configuration card and toggle the Syncing switch to Off.

Click the Save button in the top right corner of the page to save your config.


If you have captured data that you do not want to sync, delete the data on the machine before resuming cloud sync.
To delete the data locally, `ssh` into your machine and delete the data in the directory where you capture data.
- **Retain data up to a certain size or for a specific length of time:** Set `retention_policies` at the resource level.
Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

i think it'd be helpful to add quick parenthetical examples here:

  • Retain data up to a certain size (e.g., 100GB) or for a specific length of time (e.g., 14 days):

Copy link
Collaborator

@npentrel npentrel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM from my end. If there are more major changes, please re-tag

@sguequierre sguequierre merged commit a615a71 into viamrobotics:main Feb 4, 2025
12 checks passed
@sguequierre sguequierre deleted the DOCS-3499/improve-capture-edge-data branch February 4, 2025 17:51
Copy link

github-actions bot commented Feb 4, 2025

🔎💬 Inkeep AI search and chat service is syncing content for source 'Viam Docs (https://docs.viam.com)'

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
safe to build This pull request is marked safe to build from a trusted zone
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants