Skip to content

Conversation

@sebastiand-cerebras
Copy link

This PR refactors the demo to use only the Cerebras API, making the deployment more lightweight and easier to maintain.

Changes:

  • Unified all agents to use gpt-oss-120b model via Cerebras API
  • Removed pyproject.toml and uv.lock in favor of requirements.txt only
  • Consolidated environment variables to use CEREBRAS_API_KEY, CEREBRAS_BASE_URL, and CEREBRAS_CHAT_MODEL consistently across all agents
  • Removed litellm dependency conflicts by using LiteLlm with cerebras/ prefix
  • Updated documentation to reflect API-only architecture
  • Simplified Docker build process with single dependency management system

Benefits:

  • Faster Docker builds
  • Smaller repository size (removed 2000+ lines of lock files)
  • Easier to understand and maintain
  • Consistent API configuration
  • Reduced dependency conflicts

Testing:

  • Clean rebuild from scratch verified
  • All containers start successfully
  • Application runs without errors

- Unified all agents to use gpt-oss-120b model via Cerebras API
- Removed pyproject.toml and uv.lock in favor of requirements.txt only
- Consolidated environment variables to use CEREBRAS_* consistently across all agents
- Removed litellm dependency conflicts by using LiteLlm with cerebras/ prefix
- Updated documentation to reflect API-only architecture
- Simplified Docker build process with single dependency management system
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant