Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

json_zip stage assumes JSON Lines input files but they actually gzipped #22

Open
radix0000 opened this issue May 24, 2023 · 0 comments
Open

Comments

@radix0000
Copy link
Collaborator

The output json.zip file contains a {source}.json file which should contain a concatenated JSON Lines from input files. Howvere it assumes those input files are JSON Lines when in fact they have been gzipped. So the output {source}.json file actually contains concatenated contents of the gzip files.

radix0000 pushed a commit that referenced this issue May 24, 2023
json_zip assuming inputs are JSON lines files when in fact they are gzipped so output is mass of concatenated gzip data rather than JSON lines

#22
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant