Releases: caltechlibrary/irdmtools
interim progress and bug fixes
This release features updated Go package requirements and the beginning of a demo for building a feeds.library.caltech.edu-like static site using irdmtools, dataset, datatools and Pandoc. It also includes some bug fixes. This is just an interim release as I work through some of the critical bugs while building out the feeds demo.
improved mapping
This release fixes some of the issues discussed in issue #55. It fixes editor handling, corrects series handling, fixes imprint handling, and adds handling of other numbering systems to rdm2eprint.
new tools, rdm2eprint, ep3util
This release is focused on adding two new tools to back port our RDM content to our feeds system. ep3util mirrors rdmutil in irdmtools. It uses the same command structure but is limited to read-only functionality. rdm2eprint reads an RDM record via RDM's JSON API and returns an EPrint-compatible structure for metadata (i.e. it excludes support for .documents and .documents.files). It can be used to process lists of RDM ids or individual RDM ids. It can render EPrint XML for comparison and debugging purposes. It includes a harvest option which retrieves records from RDM, converts them to an EPrint JSON structure, then saves them in a dataset collection.
Also fixed a JSON encoding/decoding bug by using custom JSON encoders/decoders across irdmtools.
Upgrade crossrefapi dependency
This release features crossrefapi v1.0.6 which fixes some oddness regarding HTML entities getting encoded as Unicode code points.
improved ROR retrieve in doi2rdm
This release trims the https://ror.org/ prefix from the ROR retrieved in doi2rdm for funders (issue #50). The func lookupROR now takes a trimPrefix parameter. It also includes an additional Python script, get_eprint_documents.py, which fetches the documents for an eprint id from a remote EPrints server via scp and places them in a simplified directory called "problem_files".
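The prefix trimming described above can be sketched in Go as follows. This is only an illustration under stated assumptions: normalizeROR is a hypothetical helper, not the actual lookupROR function in irdmtools (whose full signature is not given here), and the identifier in main is just an example value.

```go
package main

import (
	"fmt"
	"strings"
)

// rorPrefix is the URL prefix named in the release note.
const rorPrefix = "https://ror.org/"

// normalizeROR is a hypothetical helper, NOT the irdmtools lookupROR func.
// When trimPrefix is true it strips a leading https://ror.org/ from the
// identifier; otherwise the value is returned unchanged.
func normalizeROR(ror string, trimPrefix bool) string {
	if trimPrefix {
		return strings.TrimPrefix(ror, rorPrefix)
	}
	return ror
}

func main() {
	// Example identifier used purely for illustration.
	fmt.Println(normalizeROR("https://ror.org/05dxps055", true))
	fmt.Println(normalizeROR("https://ror.org/05dxps055", false))
}
```

strings.TrimPrefix leaves the value untouched when the prefix is absent, so an identifier that is already bare passes through safely.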
improvements to doi2rdm
This release features updates to doi2rdm. It includes changes indicated by issues #40, #48 and #50.
Full Changelog: v0.0.49...v0.0.50
fixes and eprintrest tool added
This release includes a new web service called eprintrest which replicates the EPrints 3.3 REST API for the dataset EPrint for those eprints with a status of "archive". It is minimal but very fast. It requires access to the MySQL eprints database as well as the archives directory for the repository you want to access locally. See the man page for eprintrest in this release for details.
Includes fix for issue #53, problem with duplicate subjects in eprint2rdm.
Fixed blocking migration issues
This release contains the fixes related to eprint2rdm needed for the migration project in the caltechauthors repository. They relate to metadata crosswalks from our customized legacy EPrints repository.
bug fixes, metadata fixes
This release features two bug fixes in eprint2rdm discovered in our migration project. Rights are now assigned appropriately (had title and description swapped) and EPrint's eprint.date_type with an empty string or NULL is now treated as a "published" date when eprint.date is not NULL.
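The date_type rule above can be sketched in Go roughly as follows. This is a minimal sketch, not the eprint2rdm code: effectiveDateType is a hypothetical helper, and nil pointers are assumed to stand in for SQL NULL values.

```go
package main

import "fmt"

// effectiveDateType is a hypothetical helper, NOT the eprint2rdm code.
// nil pointers stand in for SQL NULL (an assumption of this sketch).
// A NULL or empty date_type is treated as "published" when date is
// not NULL; an explicit date_type is returned unchanged.
func effectiveDateType(date, dateType *string) string {
	if dateType != nil && *dateType != "" {
		return *dateType
	}
	if date != nil {
		return "published"
	}
	// No date and no date_type: nothing to infer.
	return ""
}

func main() {
	date := "2023-05-01" // example value for illustration
	empty := ""
	fmt.Println(effectiveDateType(&date, nil))
	fmt.Println(effectiveDateType(&date, &empty))
}
```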
mid-migration release
This release includes the changes made based on migrating the majority of records from EPrints to RDM. mk_eprint_redirects.py now uses the Postgres database to generate a CSV file of eprintid, rdmid and record status, which is then processed into a NginX redirects-map.conf file.
Full Changelog: v0.0.45...v0.0.46