-
Notifications
You must be signed in to change notification settings - Fork 99
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Create abstraction to split into multiple columns easily #85
Comments
I can do it. Should we add an option to have a default value or not? I mean, if there is a string that does not fit with the pattern, should we raise the "IndexOutOfBound" exception, or should we return default value? |
@SemyonSinchenko - we should probably give the user both options. Perhaps we should have a |
@SemyonSinchenko & @MrPowers : It seems to be interesting feature and mostly used transformation in the ETL projects.
|
* Added files for schema append functionality * Update test_append_if_schema_identical.py * Made the changes as per the review comments * Made the changes as per the review comments & added comments for better readability. * Made the changes as per the review comments & added comments for better readability. * Added function to handle the splitting of column. * Made changes to include split_col function. * Made changes to default mode as 'strict'. * Added test cases to test the functionality. * Additional functionality as per review comments. --------- Co-authored-by: Matthew Powers <[email protected]>
Suppose you have this DataFrame:
Here is how to clean the DataFrame:
It'd be nice to have a function that would do this automatically:
The current syntax is tedious.
The text was updated successfully, but these errors were encountered: