Hi! Thanks for your interest in contributing to 3aransia. You'll be joining a long list of contributors. In this document we'll try to summarize everything that you need to know to do a good job.
We use GitHub to host our code repositories and issues. The 3aransia organization on GitHub has many repositories, so we can manage better the issues and development. The most important are:
- 3aransia/3aransia, the main repository with code related to the library;
- 3aransia/3aransia.github.com, 3aransia website with information about the library, documentation, link for downloading 3aransia etc.;
3aransia consists of the functionality that the Python/NLP community is motivated to contribute. Some priority areas for development are listed in the 3aransia Wiki
We use Git as our version control system, so the best way to contribute is to learn how to use it and put your changes on a Git repository. There's a plenty of documentation about Git -- you can start with the Pro Git book.
To set up your local development environment for contributing to the main repository 3aransia/3aransia:
- Fork the 3aransia/3aransia repository on GitHub to your account;
- Clone your forked repository locally
(
git clone https://github.com/<your-github-username>/3aransia.git
); - Run
cd 3aransia
to get to the root directory of the3aransia
code base; - Install the dependencies (
pip install -r requirements.txt
); - Download the datasets for running tests
(
python -m 3aransia.downloader all
); - Create a remote link from your local repository to the
upstream
3aransia/3aransia
on GitHub (git remote add 3aransia https://github.com/3aransia/3aransia.git
) -- you will need to use this3aransia
link when updating your local repository with all the latest contributions.
We use the famous gitflow to manage our branches.
Summary of our git branching model:
- Go to the
develop
branch (git checkout develop
); - Get all the latest work from the upstream
3aransia/3aransia
repository (git pull upstream develop
); - Create a new branch off of
develop
with a descriptive name (for example:feature/latin_digit_moroccan-arabic-mapping
,hotfix/bug-on-downloader
). You can do it switching todevelop
branch (git checkout develop
) and then creating a new branch (git checkout -b name-of-the-new-branch
); - Do many small commits on that branch locally (
git add files-changed
,git commit -m "Add some change"
); - Run the tests to make sure nothing breaks
- Add your name to the
AUTHORS.md
file as a contributor; - Push to your fork on GitHub (with the name as your local branch:
git push origin branch-name
); - Create a pull request using the GitHub Web interface (asking us to pull the
changes from your new branch and add to our
develop
branch); - Wait for comments.
- Write helpful commit messages.
- Anything in the
develop
branch should be deployable (no failing tests). - Never use
git add .
: it can add unwanted files; - Avoid using
git commit -a
unless you know what you're doing; - Check every change with
git diff
before adding them to the index (stage area) and withgit diff --cached
before commiting; - Make sure you add your name to our list of contributors;
- If you have push access to the main repository, please do not commit directly
to
develop
: your access should be used only to accept pull requests; if you want to make a new feature, you should use the same process as other developers so you code will be reviewed.
- Use PEP8;
- Write tests for your new features (please see "Tests" topic below);
- Always remember that commented code is dead code;
- Name identifiers (variables, classes, functions, module names) with readable
names (
x
is always wrong); - When manipulating strings, use Python's new-style
formatting
(
'{} = {}'.format(a, b)
instead of'%s = %s' % (a, b)
); - All
#TODO
comments should be turned into issues (use our GitHub issue system); - Run all tests before pushing so you will know if your changes broke something;
See also our developer's guide.
You should write tests for every feature you add or bug you solve in the code. Having automated tests for every line of our code let us make big changes without worries: there will always be tests to verify if the changes introduced bugs or lack of features. If we don't have tests we will be blind and every change will come with some fear of possibly breaking something.
For a better design of your code, we recommend using a technique called test-driven development, where you write your tests before writing the actual code that implements the desired feature.
We have three disscussion plateforms:
- 3aransia Messenger for real-time discussions and annoucements;
- 3aransia Facebook Group for annoucements and community support;
- 3aransia Slack for feature developments and thread support in channels.
Please feel free to contact us through the [email protected] mail if you have any questions or suggestions. Every contribution is very welcome!
Happy hacking! (;