-
Notifications
You must be signed in to change notification settings - Fork 25
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
support parquet files in catalog #50
Comments
Yes! We should totally be able to do this. We need to map the stac type to the intake-parquet driver. Here's where that would go: intake-stac/intake_stac/catalog.py Lines 351 to 363 in d71b2d2
Are you up for adding this feature? |
I think I can handle adding one line to your drivers :) But I'd think this would also require adding ingest-parquet as a dependency somewhere. Your top level requirements.txt, I assume? And I'd need to add something to https://github.com/intake/intake-stac/blob/d71b2d2b0ea2f8c89cb0310706c4de6d19406e17/intake_stac/tests/test_catalog.py |
Was looking over this during STAC sprint 6, currently updating types based on STAC Types
|
@wildintellect - if you are up for it, let's just do one PR where we update all the media types. I think |
As I said in #48, I was recently involved in group trying to use intake-stac with some data we have sitting in s3. This data is in parquet format. I've used intake-parquet on this data with no problem to get a dask data frame. But when I try with intake-stac,
I get the error:
I assume that intake-stac is keying off the "type" field in the item field. Parquet doesn't have a mime-type, so I tried 'parquet' without success. I then re-read your Readme and realized that if intake-stac is built on top of intake-xarray, then you probably can't read in parquet regardless of what I put in the "type" field.
Would it be possible to add parquet via the intake-parquet library?
I'm wondering if parquet is beyond the scope of the STAC catalog spec? I don't see parquet in STAC's list of media types here. But then I don't see zarr either and I'm guessing that you support zarr with intake-stac because it's your favored data type for pangeo.
The text was updated successfully, but these errors were encountered: