This project is a web scraping application that utilizes Puppeteer and Crawler to extract data from various websites. It provides a modular and scalable structure for web scraping, allowing customization for different websites.
- 🌐 Environment Setup
- 🚀 Installation
- 📦 Cloning the Repository
- 🏃 Running the Application
- 🧪 Running Tests
- 🧹 Linting
Before getting started with the project, ensure you have the following prerequisites installed:
- Node.js: https://nodejs.org/
- npm (Node Package Manager): https://www.npmjs.com/
- nvm (Node Version Manager) (optional but recommended): https://github.com/nvm-sh/nvm
To install project dependencies, run the following command in your project directory:
npm install
To clone the project repository and set up Node.js using nvm, follow these steps:
-
Install nvm (Node Version Manager) if you haven't already. You can use the following command:
curl -o- https://raw.githubusercontent.com/nvm-sh/nvm/v0.38.0/install.sh | bash
-
Close and reopen your terminal or run:
source ~/.nvm/nvm.sh
-
Navigate to the project directory where you cloned the repository.
-
Run the following command to set up the required Node.js version using nvm:
nvm install
-
Set the installed Node.js version as the default for this project:
nvm use
Now, you have Node.js installed and configured for this project using nvm.
To run the web scraping application, use the following command:
npm start
To run tests, use the following command:
npm test
To run the linter and check your code for style issues, use the following command:
npm run lint