Skip to content

This project involves developing a machine learning model to analyze second-hand car listings gathered from Divar.ir, a popular Iranian online marketplace.

Notifications You must be signed in to change notification settings

AliNikoo73/Divar-Crawler-SecondHand-Cars-Listings

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 

Repository files navigation

🚗 Web Scraping and Machine Learning on Second-Hand Car Listings from Divar.ir


📜 Summary

This project involves developing a machine learning model to analyze second-hand car listings gathered from Divar.ir, a popular Iranian online marketplace.

  • Web scraping is done using Selenium to extract data, including car models, prices, and additional features.
  • The data is stored in MongoDB and processed using Pandas and NumPy for further analysis.
  • The machine learning model focuses on price prediction, demand analysis, and market trends.

This project aims to provide actionable insights for car buyers and dealerships by predicting car prices, analyzing model trends, and offering a data-driven approach to understanding the second-hand car market.


🎯 Objective

To develop a data-driven machine learning model using web-scraped data from Divar.ir to analyze second-hand car listings and provide actionable insights such as price predictions and market trends.


🛠 Skills Required

Technical Skills

  • Python (for web scraping and machine learning)
  • Selenium (for web scraping automation)
  • MongoDB (for storing scraped data)
  • Pandas & NumPy (for data preprocessing and analysis)
  • Scikit-learn (for building machine learning models)
  • Data Visualization (Matplotlib, Seaborn)
  • HTML/CSS/JavaScript Knowledge (for handling dynamic web content)

Soft Skills

  • 🔍 Problem-Solving Skills
  • 🎯 Attention to Detail
  • ⏱️ Time Management & Project Organization
  • 💬 Effective Communication of Findings

📊 Deliverables

Key Outputs

  • 🖥️ Web Scraper: A functional scraper to collect second-hand car listings from Divar.ir.
  • 💾 MongoDB Database: All extracted listings (car model, price, features, etc.) stored for further analysis.
  • 🧹 Cleaned and Preprocessed Dataset: Ready for machine learning tasks.
  • 🤖 Predictive Machine Learning Models: Price estimation and market trend analysis.
  • 📈 Visualizations: Insights like price distribution and model trends visualized.
  • 📝 Final Report: Summarizing findings and actionable insights from the data.

🔍 Additional Information

Tools/Libraries Used

  • Selenium for web scraping.
  • MongoDB for data storage.
  • Pandas & NumPy for data cleaning and preprocessing.
  • Scikit-learn for machine learning.
  • Matplotlib & Seaborn for data visualization.

Constraints

Handling the dynamic nature of Divar.ir’s content (e.g., pagination, AJAX loading) was addressed using stealth techniques in Selenium to avoid blocking or throttling.

Dataset Features

The dataset includes attributes such as car brand, model, year, price, mileage, and condition to provide a comprehensive analysis of the second-hand car market.

About

This project involves developing a machine learning model to analyze second-hand car listings gathered from Divar.ir, a popular Iranian online marketplace.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages