-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ENH: output format geoparquet #95
Comments
thanks for starting the discussion @jsignell 🙏 This is definitely something we could/should support. |
Which endpoints should have GeoParquet? Lines 1065 to 1067 in be04e60
It might be easiest to use the existing https://github.com/stac-utils/stac-geoparquet library for this |
yeah exactly! I think |
@kylebarron yes, I think it makes senses to enable GeoParquet output for
stac-geoparquet, depends on pandas and geopandas (thus shapely), this would be quite heavy dependencies |
Unfortunately GeoParquet currently requires rather heavy dependencies to read and write from Python. For one, the primary way to read and write Parquet in Python is via
Additionally, the GeoParquet spec says to store geometries in WKB, so you need some way to convert your existing geometries into WKB, and Shapely seems like the easiest to reach for. People have been discussing making pyarrow more modular so that the bundle size is smaller, but nothing has happened yet. When my Rust geoarrow library and its Python bindings are more stable (not imminently) it might be a good choice for stuff like this that intends to be able to be deployed on lambda. |
🤯 I don't think this feature is extremely needed right now so we can wait especially if this can help for your library to be ready :-) FYI: we already have pyproj dependency (via morecantile) Note: we could still add an heavy dependency and make the whole thing optional if this is really something user/customers want |
Definitely agree with making it an optional dependency if we add it. |
Yes, definitely would want to add this as optional dependency. We would probably want to implement this starting from a query like |
I'm not too familiar with the tipg internals but happy to help implement this |
This came up in the STAC meeting today. Currently it looks like the supported output formats (available with the
f
query param) are: 'geojson', 'html', 'json', 'csv', 'geojsonseq', 'ndjson'. I got that list by naively trying https://firenrt.delta-backend.com/collections/public.eis_fire_lf_perimeter_nrt/items?f=geoparquetIt would be neat to add 'geoparquet' as an option.
Not sure if this is the right place to capture the request so feel free to close, just wanted to increase the visibility of that conversation.
The text was updated successfully, but these errors were encountered: