-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Primary and Alternate Titles #49
Comments
"I am not currently constraining the alternative titles and the dcterms:title. I think these should be constrained where the "I don't know which of these would be primary titles. But I do think concatenating them is less helpful." Agreed. "How many dc:title triples should we allow in the EDM triples?" It's a repeating field in DC. In EDM the maximum is unbounded. "The same query is used to populate dc:title, dcterms:alternative, and dcterms:title. This is the current way the titles are derived, some changes should be applied from the above discussion." Agreed. "For OCHRE, it would be preferable to have a singular, unique title that we could create, and use the other forms of the titles as alternatives. I don't know if there is a need to create a heuristic for defining what a primary title is, there are only a 111 titles to worry about." Agreed. "But some of them are problematic. itemId Title ItemID Title I would put these in dc:description (mapping that to dcterms:description as you describe above for dcterms:title). If that leaves us without a title, use a controlled vocabulary word from DC kernel metadata. Perhaps for this use "(:unas)" for "unassigned". EDM stipulates dc:title or dc:description (or both), but the lack of a dc:title for EDM is not problematic. Also, because I am finicky about such things, if any of the above goes into output, please change sh:message "dc:titles are required." ; to sh:message "dc:title is required." ; Thanks. |
I am not sure I understand. I think we should only have one dc:title. Though looking at the SHACL shape, I said a ProvidedCHO can have either a But for item 14360. Are we going to say the triples should be: ex:someArkID
dc:title "(:unas) (14360)" ;
dc:description "Balochi7-edit" , "Baluchi7-edit" ;
... If it isn't required, it makes things easier to have a unique title for OCHRE. And whatever title we can come up with that should probably be mapped to the value for I think what I will do in my export script is: If there is only one primary title: that title will become the title. If there are two primary titles: both will become descriptions, and the title will be (Thank you for pointing out the type in the sh:message. I fixed it in the Shapes). |
I misunderstood the question, "How many dc:title triples should we allow in the EDM triples?", to mean how many are allowed by the relevant standards. I see, now, that that wasn't its intent. Yes to your proposed solution. Thanks. |
Titles
I have looked at the data regarding titles in the database, and there are a few issues recording titles that I would like to document and then provide a suggestions on how we should move forward with creating titles for the items.
Some of these issues may also exist for collections. But thankfully, there are fewer collections.
Number of Titles
I have constrained the predicate dc:title to have only one value.
I am not currently constraining the alternative titles and the
dcterms:title
. I thinkthese should be constrained where the
dcterms:title
has to agree with thedc:title
, and thedcterms:alternative
should not match thedcterms:title
.For OCHRE, it would be preferable to have a singular, unique title that we could create, and use the other forms of
the titles as alternatives. I don't know if there is a need to create a heuristic for defining what a primary title is, there
are only a 111 titles to worry about.
But some of them are problematic.
Or:
I don't know which of these would be primary titles. But I do think concatenating them is less helpful.
How many
dc:title
triples should we allow in the EDM triples?How Titles are currently derived
Item titles are currently pulled from the ItemTitle table with following query:
query derived from here
query populated with this input
and
and
The same query is used to populate
dc:title
,dcterms:alternative
, anddcterms:title
.This is the current way the titles are derived, some changes should be applied from the above discussion.
Problems with Primary and Alternate Titles
There are 201 alternate titles:
Several have more than one "Alternate" Title, but not if you filter by distinct titles.
50 items have two distinct primary titles:
"10167" "10168" "10169" "10170" "10171" "10181" "10182" "10183" "10184" "10185" "10428" "10806" "11942" "12576" "12594" "13066" "13430" "14360" "14361" "15046" "15047" "15174" "15705" "17462" "17994" "18091" "18273" "19340" "19465" "20899" "21655" "22093" "22178" "22179" "22181" "24183" "24184" "24185" "24186" "24187" "24188" "24189" "5720" "5721" "5722" "5723" "6945" "6947" "6954" "6973"
Problems with No Titles
There are 22 items that don't have a title:
"29503" "29767" "29899" "37457" "37503" "38496" "38546" "38558" "38571" "38576" "38586" "38626" "38776" "39170" "39417" "39518" "39622" "40534" "40535" "40537" "40563" "40570"
These don't have an entry in ItemTitles either.
Two items have a Item.Title_t but not a Item.ItemTitle_list
"40548" Doesn't have an ARKID
"33618" Has an ARKID:
ark:61001/b28d1mz4g50b
But these are also not in the ItemTitles table.
Do these items that don't have titles matter? Are these part of the list of records that are filtered out?
Suggestions
Every Item should have only one Primary Title. We could create a json object of primary and alternate titles based on sql query that is then updated manually for the 100 or so items that need to be corrected. This object would then be used as the source of truth for titles that are imported into OCHRE
In order to make sure that titles are unique, I would suggest item titles are created as
Primary Title (itemId)
.The text was updated successfully, but these errors were encountered: