Skip to content

Commit

Permalink
Merge pull request #664 from geobtaa/resource-lifecycle
Browse files Browse the repository at this point in the history
Update resource-lifecycle.md
  • Loading branch information
karenmajewicz authored Jul 19, 2024
2 parents 34b4795 + 3a1ea06 commit 391f77e
Showing 1 changed file with 16 additions and 16 deletions.
32 changes: 16 additions & 16 deletions docs/resource-lifecycle.md
Original file line number Diff line number Diff line change
@@ -1,14 +1,5 @@
# Resource Lifecycle

**5 Stages of the Resource Lifecycle**

``` mermaid
flowchart LR
I((1.<br> IDENTIFY)) --> H[/2. <br> HARVEST/] --> P[3. <br> EDIT] --> X[4. <br>INDEX] --> M{{5. <br>MAINTAIN}}--> H[/2. <br>HARVEST/]
```

## 1. Identify

:fontawesome-solid-user: BTAA-GIN Team Members and Product Manager
Expand All @@ -29,35 +20,44 @@ Here are the most common ways that we obtain the metadata:
4. we manually copy and paste the metadata into a spreadsheet
5. a combination of one or more of the above

This step also involves using a crosswalk to convert the metadata into the schema needed for the Geoportal. Our goal is to end up with a spreadsheet containing columns matching our [metadata template](https://z.umn.edu/b1g-template).
## 3. Crosswalk

This step involves using a crosswalk to convert the metadata into the schema needed for the Geoportal. Our goal is to end up with a spreadsheet containing columns matching our [metadata template](https://z.umn.edu/b1g-template).

!!! info "Why do we rely on CSV?"

CSV (Comma Separated Values) files organize tabular data in plain text format, where each row of data is separated by a line break, and each column of data is separated by a delimiter.

We have found this tabular format to be the most human-readable way to batch create, edit, and troubleshoot metadata records. We can visually scan large numbers of records at once and normalize the values in ways that would be difficult with native nested formats, like JSON or XML. Therefore, many of our workflow processes involve transforming things to and from CSV.

## 3. Edit
## 4. Edit

:fontawesome-solid-user: Graduate Research Assistants and Product Manager

When working with metadata, it is common to come across missing or corrupted values, which require troubleshooting and manual editing in our spreadsheets. Refer to the [Collections Project Board](https://github.com/orgs/geobtaa/projects/4) for examples of this work.

After compiling the metadata, we run a validation and cleaning script to ensure the records conform to the required elements of our schema. Finally, we upload the completed spreadsheet to GBL Admin, which serves as the administrative interface for the Geoportal. If GBL Admin detects any formatting errors, it will issue a warning and may reject the upload.
## 5. Validate

After compiling the metadata, we run a validation and cleaning script to ensure the records conform to the required elements of our schema.

## 4. Index
## 6. Index

:fontawesome-solid-user: Product Manager

We upload the completed spreadsheet to GBL Admin, which serves as the administrative interface for the Geoportal. If GBL Admin detects any formatting errors, it will issue a warning and may reject the upload.

Once the metadata is successfully uploaded to GBL Admin, we can publish the records to the Geoportal. The technology that actually stores the records and enables searching is called [Solr](https://solr.apache.org). The action of adding records is known as "Indexing."

Periodically, we need to remove records from the Geoportal. To do this, we use GBL Admin to either delete them or change their status to "unpublished."

## 5. Maintain

## 7. Maintain

:fontawesome-solid-user: BTAA-GIN Team Members, Graduate Research Assistants, and Product Manager

The Geoportal is programmatically checked for broken links on a monthly basis. The are fixed either by manually repairing them or by reharvesting from the source.
Periodically, we need to remove records from the Geoportal. To do this, we use GBL Admin to either delete them or change their status to "unpublished."


We monitor currentness and re-harvest based on how frequently sources update their content. See the [Collections Dashboard](https://github.com/orgs/geobtaa/projects/4) for this schedule.


## Sequence diagram of Resource Lifecycle
Expand Down

0 comments on commit 391f77e

Please sign in to comment.