This folder includes benchmark scripts for vtext
To run the benchmarks download an (adapted) copy of the 20 newsgroup dataset
here, then extract
the contents under vtext/data/
.
Scikit-learn (Python) benchmark can be run with,
python benchmarks/bench_scikit-learn.py
To run the equivalent processing with vtext (Rust), run,
cargo run --release