Skip to content

Bot to scrap our Websites for semantic purposes, indexing, training, etc.

Notifications You must be signed in to change notification settings

EuropeanRespiratorySociety/journals-scrapper

Repository files navigation

#Quick export to csv scrapy crawl article --output=../../Projects/scrapy-journals/data/erj-test.csv

Saves state of the spider

scrapy crawl article -s JOBDIR=./data

source venv/bin/activate

.deleteMany({"canonical":{$regex : ".*.DC1"}})

Build

docker build -t <name> . docker run -a <name>

About

Bot to scrap our Websites for semantic purposes, indexing, training, etc.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published