Selecting Icons #32
Comments
Hey @llermaly, because Tarsier is typically for text parsing, we currently don't support icons. (Not sure how we'd best go about it in the future either.) For images it is quite straightforward. There is a …
If you still want to go with the text approach, you can manually find out which of those elements may be related to a search icon (through the image name, or some other tag in the HTML itself) and provide that information in the prompt as well.
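A minimal sketch of that prompt-side workaround, assuming you have already worked out by hand which tag ID is the search icon. The function name `add_icon_hints` and the wording of the hint lines are hypothetical and not part of Tarsier; only the `[ @ 18 ]` marker format is taken from the issue text.

```python
def add_icon_hints(page_text: str, icon_hints: dict[int, str]) -> str:
    """Append one explanatory line per icon-only element to the page text.

    page_text  -- the text representation of the page (as produced by Tarsier)
    icon_hints -- hand-made mapping: tag ID -> what the icon represents
    """
    lines = []
    for tag_id, description in sorted(icon_hints.items()):
        marker = f"[ @ {tag_id} ]"
        # Only mention markers that the model can actually see in the text.
        if marker in page_text:
            lines.append(f"{marker} is an icon without text: {description}")
    if not lines:
        return page_text
    return page_text + "\n\nNotes on icon-only elements:\n" + "\n".join(lines)


if __name__ == "__main__":
    text = "Products [ @ 17 ]   [ @ 18 ]"
    print(add_icon_hints(text, {18: "search (magnifying glass)"}))
```

The resulting string can be passed to the agent in place of the raw page text, so the model has a reason to pick `[ @ 18 ]` when asked to open the search bar.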
@llermaly I'm trying a combination of text-based extraction with Tarsier and direct HTML parsing. Using Playwright, one can do something like
And you can then use …
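The snippet from this comment is not available here. As a rough sketch of the general idea (not the commenter's code), the following uses Playwright's sync API to collect a few attributes of every clickable element and keeps the ones that have no visible text but look search-related. The selector, the keyword list, and the names `COLLECT_JS`, `is_search_icon`, and `find_search_icons` are all assumptions made for illustration, and the sketch does not map matches back to Tarsier's tag IDs.

```python
SEARCH_KEYWORDS = ("search", "magnif")  # assumed keywords; extend per site
CLICKABLE = "a, button, [role=button]"  # assumed selector for clickable elements

# JavaScript run in the page: summarize each element matched by CLICKABLE.
COLLECT_JS = """
els => els.map((el, index) => ({
    index,
    tag: el.tagName.toLowerCase(),
    text: (el.innerText || "").trim(),
    ariaLabel: el.getAttribute("aria-label") || "",
    title: el.getAttribute("title") || "",
    className: typeof el.className === "string" ? el.className : "",
    html: el.outerHTML.slice(0, 300),
}))
"""


def is_search_icon(record: dict) -> bool:
    """Heuristic: no visible text, but a search keyword in attributes or markup."""
    if record.get("text"):
        return False
    fields = ("ariaLabel", "title", "className", "html")
    haystack = " ".join(str(record.get(k, "")) for k in fields).lower()
    return any(keyword in haystack for keyword in SEARCH_KEYWORDS)


def find_search_icons(url: str) -> list[dict]:
    """Open the page and return records for elements that look like search icons."""
    # Imported lazily so the heuristic above works without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch()
        page = browser.new_page()
        page.goto(url)
        records = page.eval_on_selector_all(CLICKABLE, COLLECT_JS)
        browser.close()
    return [r for r in records if is_search_icon(r)]


if __name__ == "__main__":
    for record in find_search_icons("https://elastic.co"):
        print(record["index"], record["tag"], record["ariaLabel"] or record["html"][:80])
```

To click a match, the page would need to stay open; within the same session something like `page.locator(CLICKABLE).nth(record["index"]).click()` should target the same element, since both follow document order.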
Cool stuff! We might expose that information directly as a part of Tarsier. @tvatter, happy to take a PR if that would be of interest to you.
Hi!
I'm trying to automate using the search bar on a list of unknown sites.
In most cases the bar is not visible, and there is an icon I must click first to display it.
In this example, I want to detect and click the magnifying glass:
The problem is that it shows up this way in the text:
[ @ 18 ]
so GPT cannot pick it (I'm using the LlamaIndex agent). The website is https://elastic.co
I read that @asim-shrestha mentioned a GPT-V mode in another issue, but I'm not sure how to activate it; I've been following the docs without success.
Any advice? Thanks.