Screen Finder

A Python tool that takes a screenshot of your screen, sends it to ChatGPT's vision API, and returns the coordinates of objects you're looking for.

Installation

Install dependencies:

pip install -r requirements.txt

Set your OpenAI API key:

export OPENAI_API_KEY="your-api-key-here"

Or create a .env file (not included in git) with:

OPENAI_API_KEY=your-api-key-here

Usage

As a Python function:

from screen_finder import find_object_coordinates

# Find coordinates of a button
coords = find_object_coordinates("submit button")
if coords:
    x, y = coords
    print(f"Found at: ({x}, {y})")

As a command-line tool:

python screen_finder.py "submit button"
python screen_finder.py "login button"
python screen_finder.py "close icon"

How it works

Takes a screenshot of your entire screen using pyautogui
Encodes the screenshot as base64
Sends it to ChatGPT's vision API (gpt-4o) with a prompt asking for coordinates
Parses the response to extract the (x, y) coordinates
Returns the coordinates as a tuple

Notes

The function returns the center coordinates of the object
If the object is not found, it returns None
Make sure you have a valid OpenAI API key with access to vision models
The default model is gpt-4o which supports vision

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
.DS_Store		.DS_Store
.gitignore		.gitignore
Commands.py		Commands.py
README.md		README.md
client.py		client.py
requirements.txt		requirements.txt
server.py		server.py
testspeech.py		testspeech.py
triage.py		triage.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Screen Finder

Installation

Usage

As a Python function:

As a command-line tool:

How it works

Notes

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Screen Finder

Installation

Usage

As a Python function:

As a command-line tool:

How it works

Notes

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages