Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Documenting the data ingestion process #9

Open
Zircoz opened this issue Oct 10, 2022 · 7 comments
Open

Documenting the data ingestion process #9

Zircoz opened this issue Oct 10, 2022 · 7 comments
Assignees
Labels
documentation Improvements or additions to documentation

Comments

@Zircoz
Copy link
Owner

Zircoz commented Oct 10, 2022

  1. Describe the function's Inputs, Outputs & working
  2. Document the output df's schema and relation between them
@Zircoz Zircoz added the documentation Improvements or additions to documentation label Oct 10, 2022
@unclebinary1001
Copy link

Hi @Zircoz, Can I work on this?

@Zircoz
Copy link
Owner Author

Zircoz commented Oct 13, 2022 via email

@unclebinary1001
Copy link

@Zircoz Do you have sample dummy data that I can use to run? I use amazon.com as a customer, so I can only get order history, not retail history.

@Zircoz
Copy link
Owner Author

Zircoz commented Oct 16, 2022

@unclebinary1001 That's an interesting problem!
This file has the df info after converting: https://github.com/Zircoz/AmazonSummarizer/blob/main/AmazonSummarizer.ipynb
I'll also make a dummy file and share for reference in that case.

Besides that, can you help us find what files does amazon US provide (and their format) ? (We can open a different issue for this "research")

@Zircoz
Copy link
Owner Author

Zircoz commented Oct 16, 2022

Hey @unclebinary1001 !
I've added my csv file to hashed csv files. I've also added a PII Obfuscator notebook to help you (and anyone else) hide their data if/when they share their file(s). (If you need any help in understanding how to run that notebook, please feel free to LMK) !

@unclebinary1001
Copy link

Hello @Zircoz , I just wanted to let you know that I am still working on this. I will give you an update soon.

Then I will run my retail history using the Obfuscator notebook. I will share it in the hashed csv files folder.

@Zircoz
Copy link
Owner Author

Zircoz commented Oct 21, 2022

Thanks for letting me know @unclebinary1001 , take your time & LMK if you need anything!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
documentation Improvements or additions to documentation
Projects
None yet
Development

No branches or pull requests

2 participants