Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Converters for Austria and Brazil #49

Draft
wants to merge 5 commits into
base: main
Choose a base branch
from

Conversation

aninda-ghosh
Copy link

@aninda-ghosh aninda-ghosh commented Jun 14, 2024

Added converters for

  1. Austria
  2. Brazil

@@ -0,0 +1,140 @@
# TEMPLATE FOR A FIBOA CONVERTER
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I propose to rename to fs_at.py

@@ -0,0 +1,136 @@
# TEMPLATE FOR A FIBOA CONVERTER
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I propose to rename to fs_br.py

# Title of the collection
TITLE = "Field boundaries for Austria (Fieldscapes)"
# Description of the collection. Can be multiline and include CommonMark.
DESCRIPTION = """ The dataset contains field boundaries for the Austria."""
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
DESCRIPTION = """ The dataset contains field boundaries for the Austria."""
DESCRIPTION = "The dataset contains field boundaries for the Austria."

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Why does the template contain three apostrophes, if one is desired here? Is it that three lets you do multiline? But multiline isn't needed for this one?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, """ is multiline, " is single line.
I provided """ as default to make like simpler for implementors as I had hoped for a bit longer, usually multiline descriptions.

# Add columns with constant values.
# The key is the column name, the value is a constant value that's used for all rows.
ADD_COLUMNS = {
"determination_datetime": "2021-01-01T00:00:00Z"
Copy link
Contributor

@m-mohr m-mohr Jun 16, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Where does this information come from? If it's unknown, better remove the line.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I did a similar thing in the France dataset. These datasets from the government are released once a year, and so all the fields in it are clearly for that year. But they don't have any more specificity on a per field level.

Putting in the first of the year is obviously the safest. I think with France I saw some 'release date', so I used that.

It does seem very useful to know that it's from that year, so I lean towards including it in some way, like is done here.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, but yesterday we realized that people put things like the upload date ofthe source data there. That's not the determination time.
We should start on the timestamps extension now, otherwise determination_datetime ends up being just whatever time the creator finds/has available.
As I've asked before in the meeting notes doc: Do you organize the meeting for the timestamps extension discussion or shall someone else take the lead? I can also organize it, but it sounded like you'd lead it, so don't want to interfere with you ;-) @cholmes

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yes, but yesterday we realized that people put things like the upload date ofthe source data there. That's not the determination time.

Agreed, that's not good.

Do you organize the meeting for the timestamps extension discussion or shall someone else take the lead? I can also organize it, but it sounded like you'd lead it, so don't want to interfere with you ;-) @cholmes

Yeah, sorry I've been negligent. I just sent out one on the deforestation regulation stuff. Will do a smaller timestamps / core one now.

# Add columns with constant values.
# The key is the column name, the value is a constant value that's used for all rows.
ADD_COLUMNS = {
"determination_datetime": "2020-01-01T00:00:00Z"
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Where does this information come from?`If it's unknown, better remove the line.


# Schemas for the fields that are not defined in fiboa
# Keys must be the values from the COLUMNS dict, not the keys
MISSING_SCHEMAS = {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

schemas missing for properties that are not defined by fiboa core spec.

Copy link
Contributor

@m-mohr m-mohr left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

see comments

# Title of the collection
TITLE = "Field boundaries for Brazil (Fieldscapes)"
# Description of the collection. Can be multiline and include CommonMark.
DESCRIPTION = """ The dataset contains field boundaries for the Brazil."""
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
DESCRIPTION = """ The dataset contains field boundaries for the Brazil."""
DESCRIPTION = "The dataset contains field boundaries for the Brazil."

@m-mohr
Copy link
Contributor

m-mohr commented Jun 17, 2024

Please rebase the PR to the latest CLI version, including renaming the URI constant to SOURCES and renaming the parameter cache_file to cache. I tried to push it upstream, but you didn't permit changes to this PR so you need to do these changes yourself. Thanks.

@m-mohr m-mohr marked this pull request as draft July 10, 2024 15:27
@m-mohr
Copy link
Contributor

m-mohr commented Jul 25, 2024

@aninda-ghosh What's the difference to the existing converter for Austria (at)? Can we discard this converter instead of the existing one?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
Status: In Progress
Development

Successfully merging this pull request may close these issues.

3 participants