You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm trying to run WebArena evaluation following this guide. However, it only shows how to run it using OpenAI API. Now I am trying to evaluate WebArena using Azure OpenAI API. Are there any instructions for me to follow?
Thanks.
The text was updated successfully, but these errors were encountered:
Yes, this is a bit annoying and I encountered a similar issue when trying to use LiteLLM proxy for VisualWebArena evaluation. There is 2-step dependency here - OpenHands relies on BrowserGym for evaluation on WebArena benchmark and BrowserGym internally relies on WebArena functions to compute resolve rates. In case the model name mentioned in the code linked above matches your Azure OpenAI model, you can try setting OPENAI_BASE_URL environment variable to Azure API base URL in your sandbox here.
I'm trying to run WebArena evaluation following this guide. However, it only shows how to run it using OpenAI API. Now I am trying to evaluate WebArena using Azure OpenAI API. Are there any instructions for me to follow?
Thanks.
The text was updated successfully, but these errors were encountered: