- This is a Python 3 implementation
- We need the
nltk
module to run this implementation, so we need topip install nltk
. - It takes a while to load the necessary files for SORTA to run. There should be some output in red in the console, and then "Ready for Matching" will be printed. At this point, our custom SORTA is ready to begin matching.
These are the outputs and scores produced by Molgenis' SORTA for the following input:
- "congenital nose anomalies" matches to "Congenital hip dislocation", 79.07%
- "other specified congenital malformations of skull and face bone" matches to "Congenital malformation of the great arteries", 50.70%
In Example.py we run our custom SORTA algorithm with the same inputs.