You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
UDF interface is record at a time. With R is vectorize or bust. There is somebody working on this for SparkR but it's unclear, actually, unlikely the goodness will trickle to SQL. UDAF is a better opportunity. Imagine grouping by a col then fitting a linear model on each group
subtasks
support binary cols in backend. There is a problem in RJDBC preventing this. Could us ASCII instead.
support automatic serde of cols. Question is where do we store the metadata that says that? Or do we autodetect?
The text was updated successfully, but these errors were encountered:
we could use renjin to define R UDFS UDAFs etc.
Some obstacles
subtasks
The text was updated successfully, but these errors were encountered: