Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Refactor tweetset_loader.py for greater flexibility #134

Open
dolsysmith opened this issue Aug 3, 2021 · 1 comment
Open

Refactor tweetset_loader.py for greater flexibility #134

dolsysmith opened this issue Aug 3, 2021 · 1 comment

Comments

@dolsysmith
Copy link
Contributor

  • Move from argparse to click
  • Move less-commonly used options to config
  • Add logic for creating extracts without loading
@dolsysmith dolsysmith added this to the 2.2 milestone Aug 3, 2021
@dolsysmith dolsysmith self-assigned this Aug 3, 2021
@dolsysmith dolsysmith changed the title Refactor tweetset_loader.py Refactor tweetset_loader.py for greater flexibility Aug 6, 2021
@lwrubel lwrubel modified the milestones: 2.2, 2.3 Aug 26, 2021
@lwrubel
Copy link
Collaborator

lwrubel commented Sep 13, 2021

Add option to specify concatenating JSON files up to a size.

Also move parse_size() from spark_utils.py to utils.py since it will need to be called to supply to_size value.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants