-
Notifications
You must be signed in to change notification settings - Fork 13
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Pagination bug for IA IIIF manifests displayed in Mirador puts spine on outside edge #50
Comments
The Manifest generation process for Internet Archive's IIIF server is pretty vanilla -- The manifest is generated here: And in this sub-section, we consider logic relevant to dealing with Internet Archive books/texts: Furthermore, this section considers the individual pages of a text by making an http request to an existing archive.org Manifest service (used by our BookReader) which returns its own formatted (non-iiif) "manifest": The json this IA Manifest service returns looks like: I don't see an easy way from this data to identify / filter out color-cards. In order to do this filtering, we'd need to add another http request to another/different endpoint to fetch its "scandata.xml". Every archive.org book item has a "scandata.xml" file. It "usually" (there are weird exceptions, e.g. multiple books living in the same item; such edge cases I don't have time to fix) lives at... https://archive.org/download/:identifier/:identifier_scandata.xml so, for instance, if the ID is So, fixing this issue isn't exactly trivial (but not difficult):
|
Thanks so much for your attention on this and the super thorough reply. Our backender on the project (@lucasmoeskops) just figured out a work-around by telling Mirador to move the first page to the end, but since we'd really like a cleaner solution than that, this is super helpful. Plus, I imagine there are other Mirador-based projects out there that would appreciate the fix too. |
Yes, @amandelman + @lucasmoeskops careful as not every book item on archive.org is/was digitized by the Internet Archive and therefore it is actually likely an item will not have a color card. In these cases, moving the first page to the end won't work. Coupling this approach with some heuristic such as.... We already fetch this metadata as json in the code:
Therefore, after line https://github.com/ArchiveLabs/iiif.archivelab.org/blob/master/iiify/resolver.py#L138 you may wish to check....
This would probably be useful to all patrons and partners, so feel free to open a PR with this patch if it works reliably for you :) Thank you! |
Hi, I just wanted to comment that we've also just recently noticed that some volumes (not all) have an additional first canvas. I think it's the same issue that you are discussing here. In our work this seems to throw off the relationship between the canvas/page number and the OCR canvas/page number on those volumes. We've been working around it, but I'm happy to see a discussion about it coming up. Thanks! |
Similar to an email I received today -- some partners reporting that some manifests being generated erroneously with a page 0 $0 (this may be because of color cards or just an off-by-one) In Mirador: https://iiif.archivelab.org/iiif/floragraecasive3sibt |
When loading IA IIIF manifests into Mirador 3's book view, a pagination error results in displaying the pages with the spine on the outside edge instead of in the middle. It looks like this could be because IA inserts an extra cover page showing calibration marks, which then throws off the pagination sequence, but I'm not sure.
Example manifests:
https://iiif.archivelab.org/iiif/americabeinglate00mont/manifest.json
https://iiif.archivelab.org/iiif/siguenseunosbrev01gilb/manifest.json
https://iiif.archivelab.org/iiif/arteyvocabulario00unkn/manifest.json
See what the Mirador team has to say about it: ProjectMirador/mirador#3244
The text was updated successfully, but these errors were encountered: