Description
Checks
- I have checked that this enhancement has not already been requested
How would you categorize this request? You can select multiple if not sure.
Summary stats
Enhancement Description
In polars, structs and arrays are just different ways of including extra series in a dataframe. For summary stats, the basic stats can be run on the constituent series of these data types.
Pseudo Code Implementation
The summary stats code should be fairly straightforward. In fact, for future caching and for simplicity, summary stats would be better off dealing with a completely flat dataframe, with no structs or arrays... but would it?
Take a representation of 2D points as a struct of {x: int64, y: int64} or an array [int64, int64]. Mean and median could be computed from each constituent series. Unique counts, though, are meaningful both for the constituent series and for the combined points. I'm not quite sure how to handle that; maybe a combined_unique summary stat.
Work would also be needed to display this properly in the nested column UI.
Prior Art
N/A