-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
ML Pipeline v2 #684
Merged
Merged
ML Pipeline v2 #684
Conversation
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
…ub.com/RolnickLab/ami-platform into feat/ml-pipeline-registry
…o feat/ml-pipeline-registry
✅ Deploy Preview for ami-dev canceled.
|
…enna into feat/ml-pipeline-v2
This was referenced Jan 26, 2025
17 tasks
Suspect IssuesThis pull request was deployed and Sentry observed the following issues:
Did you find this useful? React with a 👍 or 👎 |
This was referenced Jan 27, 2025
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Summary
Bringing our ML pipeline out of beta.
List of Major Changes
New Processing Services section. Allows users to add an endpoint themselves and serve machine learning pipelines from their own compute infrastructure or local machine. When a Service is added, the pipelines it has available are registered as options that the project's users can choose for processing images.
Begin storing label lists / category maps that the algorithms use. This model is called
AlgorithmCategoryMap
and maps the class index from the last model layer to the actual categories they represent. The categories have both a simple text label "Species name" as well as an object of metadata with an optional GBIF key, taxon rank, etc. These are used to create taxon entries in the Antenna database as well as show the top 3,5,10, N number of predictions from a model rather than just the top 1 which is saved as the occurrence's determination.Moves job logs to their own field on the job model to reduce DB writes and overwriting the status field when writing logs.
Job status and progress fields have been fixed so that the status & progress (# images done) are not toggling between states, or ending on a strange value.
Update what is considered a job failure: If more than 50% of images in a job fail, or if any result saving fails.
Add "task_type" field to algorithms so we can determine if its a classification model vs. a detection model and so on.
Handle results from multiple algorithms. The best score from all (non-intermediate) algorithms is used to determine the species determination of an occurrence (not just the latest prediction). Identifiers can now see and Agree with the result from any classifier, including an intermediate classifier like Moth/Non-Moth.
Faster & more stable saving of results. Results from the ML backend are now saved more efficiently after each batch of images and error handling is greatly improved.
Related Issues
Closes #607
Closes #310
Unblocks several tickets coming up. Including:
Can use confidence score algorithms other than softmax on the Antenna side (e.g. temperature calibrated)
Screenshots
Deployment Notes
Needs to be deployed with corresponding ML backend changes here:
RolnickLab/ami-data-companion#67
Checklist