This project uses semantic versioning. This change log uses principles from keep a changelog.
- Descriptive dtool CLI documentation
- Writing of mimetype overlay to dtool new dataset, dtool markup dataset, and dtool manifest update
- Made code work with dtoolcore API
- dtool.DescriptiveMetadata -> dtool.metadata.DescriptiveMetadata
- dtool.metadata_from_path -> dtool.metadata.metadata_from_path
- dtool.Project -> dtool.project.Project
- dtool.DataSet class now in dtoolcore
- dtool.Collection class now in dtoolcore
- dtool.Manifest class now in dtoolcore
- dtool.filehasher module now in dtoolcore
- mimetype from structural metadata in manifest
- --hash-function option from dtool new dataset and dtool markup CLI
- dtool.Manifest.from_path method to be able to fix dtool.DataSet.from_path
- dtool.DataSet.from_path now sets DataSet._structural_metadata property correctly
- dtool info CLI command
- --hash-function option to dtool new dataset and dtool markup CLI
- dtool.Manifest class
- dtool.DataSet.identifiers property
- dtool.DataSet.overlays property
- dtool.DataSet.empty_overlay method
- dtool.DataSet.persist_overlay method
- dtool.DataSet.item_from_hash method
- dtool.DataSet.item_path_from_hash method
- dtool.filehasher.md5sum function
- dtool.clickutils.info_from_path function
- Update dtool.Dataset to use dtool.Manifest for structural metadata
- dtool.manifest module
- Manifest helper functions (now provided by dtool.Manifest class)
- dtool.log - Fluentd logging
- dtool.slurm module moved into arctool package
- arctool - this now lives in its own repository: github.com/JIC-CSB/arctool.
- datatool CLI renamed to dtool
- dtool.datatool.cli module renamed dtool.cli
- The arctool CLI
- The dtool.archive and dtool.arctool modules
- Extra variables from collections/projects are now propagated to the descriptive metadata of the dataset/archive when using the datatool/arctool CLI
- datatool markup CLI command
- dtool.metadata_from_path helper function
- datatool and arctool CLIs now use utility functions for new dataset and markup
- Reading of project level metadata in arctool new dataset
- Pretty printing of manifest.json (indent 2)
- Remove cookiecutter dependency
- datatool README_SCHEMA
- arctool manifest create (points at dataset directory)
- datatool new dataset now derives descriptive metadata defaults from parent collections/projects
- Recursive build up of descriptive metadata from parent directories
- Ability to create projects using the datatool
- dtool.DescriptiveMetadata.persist_to_path method
- dtool.archive.ArchiveFileBuilder
- Now need to create datatool manifest update at dataset dir
- Move summarise_archive function into dtool.archive.ArchiveFile.summarise method
- Move verify_file and verify_all into dtool.archive.ArchiveFile class
- dtool.archive.ArchiveFile no longer used to build archives
- Ability to extract files from (gzipped) tarball using the arctool cli
- arctool.create_manifest function
- arctool.rel_paths_for_archiving
- datatool manifest create
- Fixed command line tools; broken because they still used create_manifest function
- DescriptiveMetadata class
- Full DataSet class
- ArchiveDataSet class
- ArchiveFile class
- Collection class in dtool module
- new metadata file structure (.dtool/dtool and .dtool/manifest.json)
- major API breaking changes
- moved away from functional towards OO paradigm
- dtool.arctool.new_archive_dataset now uses DataSet class, always takes descriptive metadata as a parameter and returns both the dataset and the path to which it is persisted
- Datatool command line tool implementation
- DataSet initialisation and persistence
- manifest_root in .dtool-dataset
- Add libmagic1 to packages installed in Docker image
- Added missing package to setup.py
- Add Project class to arctool
- Add API call is_collection to test if path is collection
- Document tab completion (now enabled due to using entry point for CLI)
- Add API call icreate_collection for idempotently creating new collection.
- Change location of CLI script to use entry point.
- Change new_archive to new_archive_dataset in arctool API
- Add CLI commands for file verification
- Add API calls for single file verification and all files verification
- Add logging of dataset UUID to CLI operations
- Add logging of full command line invocation to CLI
- Archive class loaded from .tar or .tar.gz file
- DataSet class loaded from path
- Progress bar for tar creation
- API documentation
- Documentation now built using Sphinx
- Split package into modules: archive, arctool, filehasher, manifest and slurm
- Added dtool_version to manifest.json
- Removed arctool_version from manifest.json
- Removed arctool_version from README.yml
- Fixed defect where arctool create archive could not be run from arbitrary location
- Create .dtool-dataset with UUID, user and version
- Include name of hash function in manifest.json
- Added logging of API version to CLI
- Added --version argument to CLI
- Added CLI integration tests
- Added output of next command and outside-tool actions to CLI
- Added README.yml validation upon archive creation
- Added extract_file() method to dtool package
- Added "confidential" key to README.yml metadata (defaults to False)
- Added "personally_identifiable_information" key to README.yml metadata (defaults to False)
- Changed manifest creation to include file mimetypes from python-magic
- Changed tar creation to force README.yml and manifest.json to be first two files
- Initial tagged release