We need to establish a workflow for creating chants given a CSV file #1259

jacobdgm · 2024-01-16T15:43:48Z

Debra recently gave us a .csv file with information for a bunch of chants that need created on cantusdatabase.org.

We need a system to create chants based on CSV files. Debra said that this used to happen on OldCantus, and will continue to need to happen from time to time in NewCantus.

We could create a fully automated system with a specification - "your file should include all these columns and exactly these columns", and so on. It might, however, make sense to have a more flexible system - perhaps a management command that can be adapted by a developer to accommodate whichever individual spreadsheets are sent to us by the musicologists. If we adopt this second approach, however, we will need to attend to these .csv files promptly.

Thoughts on how best to approach this?

annamorphism · 2024-01-16T19:07:47Z

I think the best approach would be the first one, in an interface only accessible to admins. The second approach means a lot more work for developers to work out what's what, and I would anticipate such files to be mediated through a Debra anyway.

ahankinson · 2024-01-16T19:47:08Z

My 2c, FWIW. I would suggest doing a combination of 1 and 2: A strict CSV format that is uploaded by admins on the command line.

My reasons:

Error handling with data import is really hard. Communicating the errors with processing and uploading spreadsheets takes a lot of forethought and effort. The command-line, on the other hand, is quite easy -- an exception thrown on the command line doesn't need to be reported anywhere else.
My experience is that users always want to modify their spreadsheets, either intentionally ("Oh, I thought it would import this new column automatically") or unintentionally ("No, I must have deleted that header by accident."). A validation step, followed by an import, is probably the best approach for all involved.
If something goes wrong ("OH, shoot -- I didn't mean to overwrite those!") it's much easier to see that happen on the command line
It's easier for devs to test a CSV upload on a staging system and then run it on the production system, than it is to expect users to run it on staging first.

You might approach it in a way that you develop a sort of import module, which is initially called by the management scripts but, then when it matures, can move to a UI system.

jacobdgm · 2024-01-16T20:23:31Z

If we follow @ahankinson's advice, perhaps the best approach is: set up a management command that expects a CSV file with a specific format. A developer copies the CSV into the container and runs the management command on Staging, and makes any necessary changes to the CSV (reordering/renaming columns, etc.) in case of error messages. If/when the command runs to completion, upload the working CSV to Production and run it there. Unless something unforseen arises, this would take maybe 5-10 minutes of developer time per CSV.

Does this make sense?

jacobdgm · 2024-01-16T21:11:47Z

I've begun work on this, but for the specific CSV at hand, progress is blocked until we figure out what's going on with #1261.

jacobdgm · 2024-01-23T14:04:28Z

if it's true that Sources (rather than Chants) should have a fragmentarium_id (see #1261 (comment)), then work on this can proceed - after creating the source and all the chants, we can just add the proper value for the Fragmentarium ID on the source once the field has been created.

annamorphism · 2024-07-05T19:43:02Z

curious if there's been any progress on this, since it came up today in passing...

jacobdgm added priority: medium SPECIAL DEBRA REQUEST labels Jan 16, 2024

jacobdgm self-assigned this Jan 16, 2024

jacobdgm mentioned this issue Jan 16, 2024

Remove fragmentarium_id field comment from chant model #1261

Open

dchiller added priority: high and removed priority: medium labels Oct 25, 2024

dchiller mentioned this issue Oct 25, 2024

feature request: upload CSV #1099

Closed

dchiller assigned dchiller and unassigned jacobdgm Dec 16, 2024

dchiller linked a pull request Feb 28, 2025 that will close this issue

Create workflow to add chants to a source from a csv file. #1770

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

We need to establish a workflow for creating chants given a CSV file #1259

We need to establish a workflow for creating chants given a CSV file #1259

jacobdgm commented Jan 16, 2024

annamorphism commented Jan 16, 2024

ahankinson commented Jan 16, 2024

jacobdgm commented Jan 16, 2024 •

edited

Loading

jacobdgm commented Jan 16, 2024 •

edited

Loading

jacobdgm commented Jan 23, 2024 •

edited

Loading

annamorphism commented Jul 5, 2024

We need to establish a workflow for creating chants given a CSV file #1259

We need to establish a workflow for creating chants given a CSV file #1259

Comments

jacobdgm commented Jan 16, 2024

annamorphism commented Jan 16, 2024

ahankinson commented Jan 16, 2024

jacobdgm commented Jan 16, 2024 • edited Loading

jacobdgm commented Jan 16, 2024 • edited Loading

jacobdgm commented Jan 23, 2024 • edited Loading

annamorphism commented Jul 5, 2024

jacobdgm commented Jan 16, 2024 •

edited

Loading

jacobdgm commented Jan 16, 2024 •

edited

Loading

jacobdgm commented Jan 23, 2024 •

edited

Loading