- Backend: FastAPI, MariaDB, SQLModel, OpenAI completions API, Tavily Search API
- Frontend: Jinja2, JS, HTMX, Bootstrap
- Security: OAuth2 password grant (ROPC)
- Infrastructure: AWS ECS/Fargate and DigitalOcean
This is a work in progress, with multiple updates planned each week.
- Added multiuser capability
- Added Tavily Extract API
- Switched from gpt-3.5-turbo to gpt-4o
- Added text to speech using gpt-4o-mini-tts for completion.choices[0].message.content (that means the agent now has a voice)
- Added speech to text and text to speech using Whisper and gpt-4o-mini-transcribe
- Added UTC date and time tools so the model can be time-aware
- Added database persistence for conversation history, combined with an in-memory Python dict
- Employed a hybrid HTMX/JS solution to play TTS in the browser for both text and voice requests
- Added some error handling
- Added HTMX to avoid full page refreshes
- Cleaned up UI
- Switched to gpt-4o-mini
- Containerized with Podman
- Deployed to AWS ECS/Fargate: SENTyENT.com
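
The date and time tools mentioned in the list above could be exposed to the model roughly as follows. This is a minimal sketch rather than the project's actual code: the function names `get_utc_date` and `get_utc_time`, the `TOOLS` list, and the `run_tool` dispatcher are all hypothetical, and the tool definitions assume the function-calling format of the OpenAI Chat Completions API.

```python
from datetime import datetime, timezone


# Hypothetical tool functions; the project's real names are not documented here.
def get_utc_date() -> str:
    """Return the current UTC date as YYYY-MM-DD."""
    return datetime.now(timezone.utc).strftime("%Y-%m-%d")


def get_utc_time() -> str:
    """Return the current UTC time as HH:MM:SS."""
    return datetime.now(timezone.utc).strftime("%H:%M:%S")


# Tool definitions in the Chat Completions function-calling format (assumed).
# Neither tool takes arguments, so the parameter schema is an empty object.
TOOLS = [
    {
        "type": "function",
        "function": {
            "name": "get_utc_date",
            "description": "Get the current date in UTC (YYYY-MM-DD).",
            "parameters": {"type": "object", "properties": {}},
        },
    },
    {
        "type": "function",
        "function": {
            "name": "get_utc_time",
            "description": "Get the current time in UTC (HH:MM:SS).",
            "parameters": {"type": "object", "properties": {}},
        },
    },
]

# Maps the tool name requested by the model to the local function.
TOOL_REGISTRY = {
    "get_utc_date": get_utc_date,
    "get_utc_time": get_utc_time,
}


def run_tool(name: str) -> str:
    """Run the named tool and return its result as a string for the model."""
    if name not in TOOL_REGISTRY:
        return f"Unknown tool: {name}"
    return TOOL_REGISTRY[name]()
```

In a setup like this, `TOOLS` would be passed with the chat request, and whenever the model asks for a tool call, the result of `run_tool` would be sent back as a tool message so the model can answer with the current date or time.
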
Will refine here and there, but this phase of the project, in the main, is complete.
Take the lessons learned from the previous phase and apply them to an off-grid, multi-agent, multi-user, sensor-fusion project
- Fine-tune an SLM (either a phi-4 variant or gpt-oss 20B)
- Purchase hardware
- Start making things work together
- Deploy in the field
- Create a demonstration video