Implemented data versioning, model experimentation & CI using DVC, DVClive, and GitHub Action.
-
Updated
May 7, 2024 - Python
Implemented data versioning, model experimentation & CI using DVC, DVClive, and GitHub Action.
This project demonstrates a robust, modular, and reproducible end-to-end machine learning pipeline for text classification (spam detection), leveraging DVC for data and experiment versioning, and AWS S3 for scalable remote storage. The pipeline is designed for extensibility, transparency, and ease of collaboration.
Add a description, image, and links to the dvclive topic page so that developers can more easily learn about it.
To associate your repository with the dvclive topic, visit your repo's landing page and select "manage topics."