Log Classification With Hybrid Classification Framework

This project implements a hybrid log classification system, combining three complementary approaches to handle varying levels of complexity in log patterns. The classification methods ensure flexibility and effectiveness in processing predictable, complex, and poorly-labeled data patterns.

Classification Approaches

Regular Expression (Regex):
- Handles the most simplified and predictable patterns.
- Useful for patterns that are easily captured using predefined rules.
Sentence Transformer + Logistic Regression:
- Manages complex patterns when there is sufficient training data.
- Utilizes embeddings generated by Sentence Transformers and applies Logistic Regression as the classification layer.
LLM (Large Language Models):
- Used for handling complex patterns when sufficient labeled training data is not available.
- Provides a fallback or complementary approach to the other methods.

Folder Structure

datasets/:
- This folder contains resource files such as test CSV files, output files, etc.

Setup Instructions

Install Dependencies: Make sure you have Python installed on your system. Install the required Python libraries by running the following command:
```
pip install -r requirements.txt
```

Setup Google Colab If running the notebook in Google Colab, mount your Google Drive to access the dataset

from google.colab import drive
drive.mount('/content/drive')

Prepare Dataset Place your synthetic_logs(2).csv dataset in the appropriate directory.

Update file paths in the notebook or scripts if necessary.

Usage

Upload a CSV file containing logs for classification. Ensure the file has the following columns:

source
log_message

The output will be a CSV file with an additional column target_label, which represents the classified label for each log entry.

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
datasets		datasets
.gitignore		.gitignore
README.md		README.md
log-classification-system.ipynb		log-classification-system.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Log Classification With Hybrid Classification Framework

Classification Approaches

Folder Structure

Setup Instructions

Usage

About

Uh oh!

Releases

Packages

Languages

yashwanthjack/log-classifier

Folders and files

Latest commit

History

Repository files navigation

Log Classification With Hybrid Classification Framework

Classification Approaches

Folder Structure

Setup Instructions

Usage

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages