Skip to content

csv610/SophoSet

Repository files navigation

SophoSet

SophoSet is a comprehensive tool for benchmarking Vision and Language models, designed to facilitate research and development in AI. With a user-friendly interface powered by Streamlit, SophoSet allows users to view benchmark datasets, interact with models for generating answers and explanations, curate problem sets, and conduct thorough evaluations.

Features

1. Viewing Benchmark Data on Browser

  • Interactive Data Viewing: Easily browse through benchmark datasets directly in your web browser.
  • Dataset Compatibility: Supports a variety of datasets, making it versatile for different research needs.

2. Getting Answers and Explanations Using Language and Vision Models

  • Model Integration: Utilize state-of-the-art Language and Vision models, supported by Ollama, to generate answers and explanations for benchmark questions.
  • Real-Time Interaction: Interact with models in real-time to understand their reasoning and improve model interpretability.

3. Curating Problem Sets

  • Custom Problem Sets: Create and manage custom problem sets tailored to specific research objectives.
  • Flexible Curation: Easily add, remove, or modify problems within the sets.

4. Evaluation

  • Model Performance Evaluation: Evaluate the performance of different models on curated problem sets.
  • Detailed Analytics: Gain insights into model strengths and weaknesses through detailed analysis and reporting.

About

Streamlit Apps for LLM and VLM Benchmarks

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages