Skip to content

v1.2.2: Word Segmenter Cascades, Unigram word segmenter

Compare
Choose a tag to compare
@ruanchaves ruanchaves released this 12 Feb 10:15
· 68 commits to master since this release
a49e144

Features:

  • Introduces word segmenter cascades that allow us to chain rerankers ( ad infinitum ).
  • Replaces ekphrasis by an unigram segmenter based on wordfreq. It can run on all languages supported by the wordfreq library.

Breaking changes:

  • WordSegmenter has been renamed to TransformerWordSegmenter.