-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Size of Prediction Sets using APS Different Than Reported in RAPS Paper #8
Comments
It's probably due to the lack of randomization! |
Hey @kevinkasa, have you had a chance to follow up here? Just wondering if this answers your question. |
Hey @aangelopoulos thanks for the quick response! I was just slightly confused since both your paper and the APS paper seemed to suggest that randomization should affect the sets by at most one element, so it was surprising that APS lead to considerably larger sets without it. I suppose that algorithm is just super sensitive without it then? Was planning on trying to add randomization to the notebook implementations but haven't had a chance yet. I am trying out the other RAPS repository in the meantime as well. Thanks! |
Good question. Randomization at test time only changes the set by one element. |
I see, thank you for the clarification! |
Hello,
Thank you so much for providing the conformal prediction tutorial & corresponding notebooks, they are super helpful!
I had a question regarding the size of the prediction sets returned using the APS methods. In the implementation provided in the notebooks, the prediction sets are far larger than reported on your paper than introduced RAPS. The notebook implementation returns sets that are on average >200 labels, whereas the paper reports an average set size of 10.4, on ResNet152.
I have not done extensive evaluation on RAPS, but it seems the notebook implementation also returns slightly larger sets (set size of ~3).
I was wondering if you have any ideas as to what might be causing this discrepancy, and what the best way to replicate the results in the paper might be.
Also, I wasn't sure which repo this issue should be opened in, so apologies if it doesn't fit here. Thanks in advance!
The text was updated successfully, but these errors were encountered: