Issue with "Flight.split" method, likely related to "PyArrow-backed" DataFrame #466
Comments
Could you please provide the code for a failing example?
Code to reproduce the issue is attached: SplitTestB.py - a "contrasting" example using the Belevingsvlucht dataset. The Flight.split method works fine in this case.
Let me start a branch to fix all these pyarrow-related issues, with a related PR.
Thanks! I hope these small pieces of code will be useful.
Let me close the issue; let's confirm in #468 that everything works.
Fresh install of the Traffic library v2.10.2 in a separate conda environment (from conda-forge).
Contains:
Python 3.12.7, numpy 1.26.4, pandas 2.2.3, trino-python-client 0.330.0
pyarrow, pyarrow-core 17.0.0
The "_split" method in flight.py doesn't work correctly:
diff = data.timestamp.diff().values
…
max_ = np.nanmax(diff)
The last statement returns max_ = NaT, most likely because the timestamp values have the "timestamp[ns, tz=UTC][pyarrow]" type
and, as a result, the "diff" values have the "duration[ns][pyarrow]" type.
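
For illustration, here is a minimal sketch of one way to compute the largest gap without passing the differences through NumPy. It is not the fix adopted in the library; the helper name largest_gap and the sample timestamps are hypothetical. It relies only on Series.diff() and Series.max(), which skip missing values by default. The demo below uses the default NumPy-backed dtype; the behaviour on a PyArrow-backed column is assumed to match and should be checked against the installed pandas/pyarrow versions.

```python
import pandas as pd


def largest_gap(timestamps: pd.Series) -> pd.Timedelta:
    """Return the largest gap between consecutive timestamps.

    Hypothetical helper, not part of the traffic library.
    """
    # Series.diff() leaves a missing value in the first position;
    # Series.max() ignores missing values by default (skipna=True),
    # so no NumPy nan-aware reduction is needed.
    return pd.Timedelta(timestamps.diff().max())


# Made-up sample data: a 1-second step followed by a 10-minute gap.
ts = pd.Series(
    pd.to_datetime(
        [
            "2024-01-01 00:00:00",
            "2024-01-01 00:00:01",
            "2024-01-01 00:10:01",
        ],
        utc=True,
    )
)

gap = largest_gap(ts)

# Assumption: with pyarrow installed, the same helper can be tried on an
# Arrow-backed column, e.g.
#   largest_gap(ts.astype("timestamp[ns, tz=UTC][pyarrow]"))
```

Staying within pandas reductions avoids depending on how a particular NumPy function treats the array that an extension dtype converts to.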