This project provides a Python utility package to extract specific components (year, month, day, and time) from ISO 8601 date strings. The input can either be in a string format or a pandas series. These functions can be applied individually or integrated with data analysis workflows in Pandas, simplifying date manipulation and analysis.
First, Python version 3.9 or above needs to be installed.
It can be downloaded here.
Poetry is the tool used to manage dependencies. You need to install Poetry globally.
Run the following command to install Poetry:
curl -sSL https://install.python-poetry.org | python3 -
In the root of the folder, run in terminal:
poetry install
Run the follow commands sequentially to check that the tests pass, and to check test coverage:
poetry run pytest
poetry run pytest --cov=src/date_extractor_mds
poetry run pytest --cov-branch --cov=src/date_extractor_mds
$ pip install date_extractor_mds
- extract_year: Extracts the year as a four-digit integer from an ISO 8601 date string.
- extract_month: Retrieves the month as an integer (1-12) from the ISO 8601 date.
- extract_day: Captures the day as an integer (1-31) from the ISO date.
- extract_time: Returns the time component as a string in hh:mm:ss format.
This package complements existing Python libraries like datetime and pandas by offering specialized, lightweight utilities focused solely on ISO 8601 string manipulation. While datetime provides similar functionality, this package simplifies usage by bypassing full date parsing for basic extraction tasks, increasing performance in large-scale data analysis.
Interested in contributing? Check out the Contributing Guidelines. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.
Rashid Mammadov, Derek Rodgers, Yibin Long, Fazeeia Mohammed.
The Date Extractor Python Package was created by Rashid Mammadov, Derek Rodgers, Yibin Long, and Fazeeia Mohammed. It is licensed under the terms of the MIT license, linked here.
date_extractor_mds
was created with cookiecutter
and the py-pkgs-cookiecutter
template.