This is a Python-based web scraping application that allows you to extract data from websites in a simple and efficient way. It provides functionality to handle links and paginations, making it easier to scrape multiple pages or follow links within a website.
- Extract data from websites using specified element names, class names, and ID names.
- Handle links and paginations to scrape multiple pages or follow links within a website.
- Save the scraped data to a CSV file for further analysis or processing.
- Customizable user agent selection to mimic different web browsers or devices.
- Modern GUI interface for easy input and interaction.
- Clone the repository:
git clone https://github.com/pi22by7/scraper.git
- Navigate to the project directory:
cd web-scraper
- Install the required dependencies:
pip install -r requirements.txt
- Run the application:
python gui.py
- Enter the URL to scrape, element name, class name (optional), and ID name (optional) in the GUI.
- [WIP] Optionally, select a user agent from the dropdown menu to mimic different web browsers or devices.
- Click the "Scrape" button to start the scraping process.
- The scraped data will be saved to a CSV file in your chosen directory.
Contributions are welcome! If you have any suggestions, bug reports, or feature requests, please open an issue or submit a pull request.