yeaung276/web-scraper

Web scraper using LLM

How to run

  1. Create a virtual environment with python -m venv .venv and activate it with source .venv/bin/activate
  2. Create a .env file and add your secret keys:
BROWSERLESS_API_KEY = 
SERP_API_KEY = 
OPENAI_API_KEY = 
  3. Install the dependencies: pip install -r requirements.txt
  4. Run the server: python main.py
  5. Go to localhost:8000/docs for the OpenAPI documentation.
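
Before starting the server, it can help to confirm that all three keys in .env are filled in. The following is a minimal sketch of such a check, not part of this repository: parse_env and missing_keys are hypothetical helper names, and the parser only assumes the simple KEY = value layout shown in the steps above.

```python
# Hypothetical pre-flight check; not taken from this repository.
REQUIRED_KEYS = ("BROWSERLESS_API_KEY", "SERP_API_KEY", "OPENAI_API_KEY")


def parse_env(text):
    """Parse simple `KEY = value` lines into a dict, skipping blanks and comments."""
    values = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        values[key.strip()] = value.strip()
    return values


def missing_keys(env):
    """Return the required keys that are absent or empty, in declaration order."""
    return [key for key in REQUIRED_KEYS if not env.get(key)]


if __name__ == "__main__":
    # Assumes the .env file sits in the current working directory.
    with open(".env", encoding="utf-8") as handle:
        missing = missing_keys(parse_env(handle.read()))
    if missing:
        print("Missing keys:", ", ".join(missing))
    else:
        print("All required keys are set.")
```

Run as a script from the project root, it prints which keys still need a value.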

API Documentation

GET /search

Request

  1. query: (string, required) the search query to run
  2. stream: (bool, optional, default: false) whether the server streams the scraped results instead of returning them as a list
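
As an illustration of the two parameters above, here is a small sketch that builds the request URL with the standard library. The helper name build_search_url is hypothetical, the base URL assumes the server is running locally on port 8000 as in the run steps, and sending the boolean as lowercase true/false is an assumption about how the server parses it.

```python
from urllib.parse import urlencode


def build_search_url(query, stream=False, base="http://localhost:8000"):
    """Build the URL for GET /search (hypothetical helper, not from the repository)."""
    params = {
        "query": query,
        # Assumption: the server accepts lowercase "true"/"false" for booleans.
        "stream": "true" if stream else "false",
    }
    return f"{base}/search?{urlencode(params)}"


if __name__ == "__main__":
    print(build_search_url("wireless headphones", stream=True))
```

The resulting URL can be opened in a browser or passed to any HTTP client.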

Response

A list of dictionaries, or a stream of dictionaries when stream is true. Each dictionary contains:

  1. description: (string) Description of the product extracted from the website
  2. url: (string) Source URL of the website
  3. reduced: (bool) Whether the description was extracted using map-reduce
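
The fields above could be modelled on the client side roughly as follows. This is a sketch for the non-streaming case only, assuming the response body is a JSON array of objects. The wire format of the streamed response is not described here, so it is not handled. SearchResult and parse_results are hypothetical names, and the sample payload is made up for illustration.

```python
import json
from dataclasses import dataclass


@dataclass
class SearchResult:
    description: str  # extracted description of the product
    url: str          # source URL of the website
    reduced: bool     # whether map-reduce was used for the extraction


def parse_results(body):
    """Parse a JSON array of result dictionaries (assumed non-streaming format)."""
    return [
        SearchResult(
            description=item["description"],
            url=item["url"],
            reduced=bool(item["reduced"]),
        )
        for item in json.loads(body)
    ]


if __name__ == "__main__":
    # Illustrative payload, not real server output.
    sample = '[{"description": "Example text", "url": "https://example.com", "reduced": false}]'
    for result in parse_results(sample):
        print(result.url, result.reduced)
```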
