Skip to content

UBC-MDS/DSCI524_Group28_date_extractor_mds

Repository files navigation

Documentation Status

Date Extractor Python Package

This project provides a Python utility package to extract specific components (year, month, day, and time) from ISO 8601 date strings. The input can either be in a string format or a pandas series. These functions can be applied individually or integrated with data analysis workflows in Pandas, simplifying date manipulation and analysis.

Setup Instructions

1. Install Python

First, Python version 3.9 or above needs to be installed.

It can be downloaded here.

2. Install Poetry

Poetry is the tool used to manage dependencies. You need to install Poetry globally.

Run the following command to install Poetry:

curl -sSL https://install.python-poetry.org | python3 -

Running Tests

1. Run Poetry Install

In the root of the folder, run in terminal:

poetry install

2. Run Tests

Run the follow commands sequentially to check that the tests pass, and to check test coverage:

poetry run pytest
poetry run pytest --cov=src/date_extractor_mds
poetry run pytest --cov-branch --cov=src/date_extractor_mds

Package Installation

$ pip install date_extractor_mds

Usage

  • extract_year: Extracts the year as a four-digit integer from an ISO 8601 date string.
  • extract_month: Retrieves the month as an integer (1-12) from the ISO 8601 date.
  • extract_day: Captures the day as an integer (1-31) from the ISO date.
  • extract_time: Returns the time component as a string in hh:mm:ss format.

Position in Python Ecosystem:

This package complements existing Python libraries like datetime and pandas by offering specialized, lightweight utilities focused solely on ISO 8601 string manipulation. While datetime provides similar functionality, this package simplifies usage by bypassing full date parsing for basic extraction tasks, increasing performance in large-scale data analysis.

Contributing

Interested in contributing? Check out the Contributing Guidelines. Please note that this project is released with a Code of Conduct. By contributing to this project, you agree to abide by its terms.

Contributors

Rashid Mammadov, Derek Rodgers, Yibin Long, Fazeeia Mohammed.

License

The Date Extractor Python Package was created by Rashid Mammadov, Derek Rodgers, Yibin Long, and Fazeeia Mohammed. It is licensed under the terms of the MIT license, linked here.

Credits

date_extractor_mds was created with cookiecutter and the py-pkgs-cookiecutter template.