Skip to content

add polars struct and array based summary stats #428

@paddymul

Description

@paddymul

Checks

  • I have checked that this enhancement has not already been requested

How would you categorize this request. You can select multiple if not sure

Summary stats

Enhancement Description

In polars structs and arrays are just different ways of including extra series into a dataframe. For summary stats, the basic summary stats can be run on the core series of these data types.

Pseudo Code Implementation

The summary stats code should be fairly straight forward. In fact for future caching and for simplicity, summary stats would be better of dealing with a completely flat dataframe - no structs or arrays... but would it

take a representation of 2d points as struct of {x:int64, y:int64} or [int64, int64]. Mean and median could be computed from each constituent series. Unique counts though, is meaningful for the constiutent series, and the combined points. I'm not quite sure how to handle that, maybe a combined_unique summary stat.

Work would need to be done properly display this in the nested column UI

Prior Art

N/A

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions