A comprehensive AI agent management platform with specialized research categories, document processing pipeline, and integrated LLM backends.
- Unique Differentiator: Beautiful gradient-based category containers
- Three Specialized Workspaces:
- π Text Generation - Content creation, summaries, reports
- π Technical Analysis - Data analysis, comparisons, insights
- π Document Processing - Batch processing, organization, synthesis
- Multi-format Support: PDF, DOCX, TXT, MD, JSON, CSV, HTML
- Batch Processing: Extract text, summarize, analyze, categorize
- File Management: Upload, organize, retrieve, delete
- Storage Analytics: Usage statistics and optimization
- Ollama Backend: Local LLM inference with privacy
- Model Management: Import and manage models from
/mnt/llm/LLM-Models - Multiple Models: Support for Llama, Dolphin, CodeLlama, and more
- Real-time Generation: Streaming and batch text generation
- FastAPI Backend: High-performance async API
- Agent Orchestrator: Task management and coordination
- WebSocket Support: Real-time updates and monitoring
- Modular Design: Pluggable components and backends
- Python 3.12+
- Ollama installed
- Access to model files
- docs/setup.md β VantaBlack + split UI (local dashboard, remote API), networking, Git
- docs/integration.md β Integration and architecture
- docs/README.md β Doc index
- docs/archive/ β Older troubleshooting notes
-
Clone the repository:
git clone <repository-url> cd AI_Agent_Ecosystem
-
Create virtual environment:
python3 -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate
-
Install dependencies:
pip install -r requirements.txt
-
Start Ollama server:
ollama serve --host 0.0.0.0
-
Start the API:
python api/main.py
-
Access the web interface: Open
http://localhost:8000in your browser
- Navigate to the Research section
- Choose from three specialized categories:
- Text Generation: For content creation and writing assistance
- Technical Analysis: For data analysis and insights
- Document Processing: For batch document workflows
- Drag and drop files into any workspace
- Supported formats: PDF, DOCX, TXT, MD, JSON, CSV, HTML
- Files are automatically processed and analyzed
- Discover available models:
GET /models/discover - Import GGUF models:
POST /models/import - List Ollama models:
GET /models/ollama
AI_Agent_Ecosystem/
βββ api/ # FastAPI backend
βββ agents/ # AI agent implementations
βββ orchestrator/ # Task coordination
βββ llm_backends/ # Ollama and other LLM adapters
βββ features/ # Feature modules (file_management, document_analysis, β¦)
βββ web/ # Dashboard (static HTML/JS/CSS)
βββ monitoring/ # Status tracking, realtime monitor
βββ scripts/ # start/stop helpers, fix DB, remote API starter
βββ tests/ # Ad-hoc / integration test scripts
βββ docs/ # Setup, integration, archived notes
βββ config/ # YAML/JSON settings
βββ logs/ # Runtime logs (gitignored)
GET /health- System health checkPOST /tasks- Create new tasksGET /tasks/{task_id}- Get task status
POST /files/upload- Upload filesGET /files- List uploaded filesPOST /files/batch-process- Batch process files
POST /generate-text- Generate text with LLMPOST /analyze- Analyze contentPOST /process-batch- Process batch requests
GET /models/discover- Discover available modelsPOST /models/import- Import model to OllamaGET /models/ollama- List Ollama models
- Research Category Interface: Beautiful, intuitive workspace organization
- Local LLM Focus: Privacy-first with Ollama integration
- Document Pipeline: Comprehensive file processing capabilities
- Model Flexibility: Easy import and management of local models
- Real-time Updates: WebSocket-based live system monitoring
- Modular Architecture: Each feature is a separate module
- Async Design: Built on FastAPI and asyncio
- Type Safety: Full type hints throughout
- Error Handling: Comprehensive error management
- Logging: Structured logging for debugging
- Create feature module in
features/ - Add API routes in
api/ - Update orchestrator if needed
- Add frontend components in
web/
- RAM: 8GB minimum, 16GB recommended
- Storage: 50GB for models and data
- GPU: Optional but recommended for faster inference
- Network: For model downloads and updates
- Local Processing: All LLM inference runs locally
- File Isolation: User files are properly sandboxed
- Input Validation: All inputs are validated and sanitized
- Error Handling: Secure error messages without data leakage
This project is licensed under the MIT License - see the LICENSE file for details.
- Fork the repository
- Create a feature branch
- Make your changes
- Add tests if applicable
- Submit a pull request
For support and questions:
- Create an issue on GitHub
- See docs/setup.md and the docs/ folder
- With the API running, open http://localhost:8000/docs (Swagger) or /redoc
Built with β€οΈ for the AI community