Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create Estonia (from eurocrops) #22

Open
1 of 6 tasks
cholmes opened this issue May 3, 2024 · 12 comments
Open
1 of 6 tasks

Create Estonia (from eurocrops) #22

cholmes opened this issue May 3, 2024 · 12 comments
Assignees
Labels

Comments

@cholmes
Copy link

cholmes commented May 3, 2024

Use public data from Estonia and:

  • Add it to the data survey (optional)
  • Make the template for the CLI (and contribute it back)
  • Run the conversion and validate
  • Upload to Source Cooperative
  • Get STAC, STAC Browser and PMTiles
  • Make nice readme for source cooperative.

Instructions available at https://github.com/fiboa/data/blob/main/HOWTO.md

@m-mohr m-mohr added this to fiboa May 3, 2024
@github-project-automation github-project-automation bot moved this to Backlog in fiboa May 3, 2024
@PowerChell PowerChell self-assigned this May 8, 2024
@cholmes
Copy link
Author

cholmes commented May 9, 2024

This should be based on the eurocrops dataset - information at https://github.com/maja601/EuroCrops/wiki/Estonia and download at https://zenodo.org/records/8229128/files/EE_2021.zip?download=1

In the converter use ec_ee for the name, as @m-mohr pointed out it'd be good to group the eurocrops ones.

Also I do think we eventually want a converter for the actual source data from the Estonia government, but we can make a new ticket for that.

@cholmes cholmes changed the title Create Estonia Create Estonia (from eurocrops) May 9, 2024
@cholmes cholmes added the dataset label May 9, 2024
@cholmes cholmes added this to the First conversions milestone May 9, 2024
@PowerChell PowerChell moved this from Backlog to In Progress in fiboa May 27, 2024
@m-mohr
Copy link
Contributor

m-mohr commented Aug 13, 2024

Any updates on this @PowerChell ?

@PowerChell
Copy link
Contributor

Hey sorry got pulled into other work and dropped off this. I can give @kyle-rasch what I have and have him finish it off!

@m-mohr
Copy link
Contributor

m-mohr commented Nov 7, 2024

EE has a self-intersecting polygon which makes the validator fail. Not sure how to handle this case.

@kyle-rasch
Copy link
Collaborator

kyle-rasch commented Nov 7, 2024 via email

@PowerChell PowerChell removed their assignment Nov 7, 2024
@cholmes
Copy link
Author

cholmes commented Nov 7, 2024

EE has a self-intersecting polygon which makes the validator fail. Not sure how to handle this case.

The 'ideal' to me for this is what I've been saying where a converter should be as simple as possible, and then we have 'tools' to clean it up / make it valid / add interesting stuff.

In the short term can we just put in custom code that fixes or removes the self-intersecting polygon? And we just document what we did to clean up the dataset.

@cholmes
Copy link
Author

cholmes commented Nov 7, 2024

A similar one that doesn't break validation but is clearly off is that the France eurocrops one has like 6 fields who have at least one of their points in Africa. It'd be nice to have a little tool that can find and fix things like that.

@m-mohr
Copy link
Contributor

m-mohr commented Nov 7, 2024

I'm just wondering whether it's our task to fix the data. Do we just make data follow the fiboa spec or do we actually want to "fix" the data (whatever that means)? We can implement something like fiboa fix ee.parquet (or a converter option) that just runs https://shapely.readthedocs.io/en/2.0.6/reference/shapely.make_valid.html But which file would we publish to Source?

@cholmes
Copy link
Author

cholmes commented Nov 7, 2024

Do we just make data follow the fiboa spec

Does the fiboa spec currently allow self-intersecting polygons? If it doesn't I think I'd argue it should in some way- the specification aims to make field boundary data more usable and interoperable. I do think it should be easy to get data into 'not-quite valid fiboa' easily, like I do think a first converter should not do validation of self-intersecting polygons. Perhaps there is a second level of fiboa validation, that makes sure there is self intersections, perhaps also no intersection between polygons (at least ones that are on the same time range).

I think the experience we want of people downloading fiboa data is that it's 'always good'. So if we just have to publish one on source I'd put the fixed one. But I think ideally we'd put both, to make it clear that something was done to the data past what the original was published as.

In the short term I think it's also fine to just fix the data and put the one fixed version up...

@m-mohr
Copy link
Contributor

m-mohr commented Nov 7, 2024

We don't have requirements for the geometries yet, see also fiboa/specification#28

@ivorbosloper
Copy link
Collaborator

This has been implemented, right? fiboa/cli#94

@m-mohr
Copy link
Contributor

m-mohr commented Nov 12, 2024

The converter is complete, yes. There are still some minor topics open with regards to validity and Source publication.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
Status: In Progress
Development

No branches or pull requests

5 participants