A simple web crawler built using Node.js, Axios, and Cheerio.
-
Clone the repository:
git clone https://github.com/yourusername/WebCrawler.git cd WebCrawler -
Install the dependencies:
npm install
To start the web crawler, run the following command:
node index.jsThe crawler will start from the URL specified in the startUrl variable and will crawl up to the depth specified in the maxDepth variable. The results will be saved in results.json.
You can configure the following variables in the index.js file:
startUrl: The URL to start crawling from.maxDepth: The maximum depth to crawl.
Contributions are welcome! Please open an issue or submit a pull request.