You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm wondering if there's any interest in supporting Arrow's float16 type in Parquet.
There seem to be one or two float16 / halffloat tickets here (e.g., PARQUET-1403) but nothing that speaks to adding half-float support to Parquet in-general.
PLANS
I'm able to spend some time on this, if someone points me in the right direction.
Ben Harkins / @benibus:
I'm currently working on this, so feel free to assign me. Although it's probably worth mentioning that the current plan is to implement this as a logical type in accordance with the proposal PR for PARQUET-758, which deviates from some of the plan in this issue's description.
Freddy Fostvedt:
Thanks for putting effort into this @benibus , this is a very valuable piece at my place of work. It will save very significant costs on data processing / training cost if we can reduce memory usage.
DESCRIPTION
I'm wondering if there's any interest in supporting Arrow's
float16
type in Parquet.There seem to be one or two
float16
/halffloat
tickets here (e.g., PARQUET-1403) but nothing that speaks to adding half-float support to Parquet in-general.PLANS
I'm able to spend some time on this, if someone points me in the right direction.
Add the
HALFFLOAT
orFLOAT16
enum (any preferred naming convention?) to https://github.com/apache/parquet-format/blob/master/src/main/thrift/parquet.thrift#L32Add
HALFFLOAT
toorg.apache.parquet.schema.PrimitiveType
Add
HALFFLOAT
support toorg.apache.parquet.arrow.schema.SchemaConverter
Add encoding for new type at
org.apache.parquet.column.Encoding
??
If anyone has any interest in this, pointers, or comments, they would be greatly appreciated!
Reporter: The Alchemist
Assignee: Jiashen Zhang / @zhangjiashen
Related issues:
PRs and other links:
Note: This issue was originally created as PARQUET-1647. Please see the migration documentation for further details.
The text was updated successfully, but these errors were encountered: