You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Firstly, snappydata is amazing! We were having issues with joining large datasets and having the min/max column statistics gave us an incredible boost. I had some follow up questions regarding two things:
Is it valuable to pre-sort my parquet set columns that I'm joining on hopefully organizing the column buffers to allow for a smaller number of mix/max ranges making querying and joining more efficient?
Is it possible to partition by two columns on two different tables AND co-locate the two tables? We have two columns on both tables: an id and a type and we want to evenly distribute those across nodes but it seems like we get an error trying to colocate.
Thanks!
The text was updated successfully, but these errors were encountered:
Hi
Firstly, snappydata is amazing! We were having issues with joining large datasets and having the min/max column statistics gave us an incredible boost. I had some follow up questions regarding two things:
Is it valuable to pre-sort my parquet set columns that I'm joining on hopefully organizing the column buffers to allow for a smaller number of mix/max ranges making querying and joining more efficient?
Is it possible to partition by two columns on two different tables AND co-locate the two tables? We have two columns on both tables: an
id
and atype
and we want to evenly distribute those across nodes but it seems like we get an error trying to colocate.Thanks!
The text was updated successfully, but these errors were encountered: