Skip to content
#

data-exploration-and-preprocessing

Here are 22 public repositories matching this topic...

vector-io

Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, backup, re-embed (using any model) or access your vector data from any vector databases or repository.

  • Updated Dec 23, 2024
  • Jupyter Notebook

Successfully developed a machine learning model which can accurately predict whether a firm will become bankrupt or not, depending on various features such as net value growth rate, borrowing dependency, cash/total assets, etc.

  • Updated Oct 22, 2023
  • Jupyter Notebook

The Employee Attrition Control project uses data analysis and predictive modeling to understand and address employee turnover. It provides insights and recommendations to reduce attrition and improve employee satisfaction and retention.

  • Updated Jun 16, 2023
  • Jupyter Notebook

Successfully developed a machine learning model which can accurately predict up to 100% accuracy whether a credit card application of a given applicant would be approved or not, based on several demographic features such as applicant age, total income, marital status, total years of work experience, etc.

  • Updated Oct 27, 2023
  • Jupyter Notebook

Successfully created a machine learning model which can accurately predict the fare of a taxi trip based on several features such as trip duration, tip amount, etc.

  • Updated Oct 26, 2023
  • Jupyter Notebook

Successfully established a text summarization model using Seq2Seq modeling with Luong Attention, which can give a short and concise summary of the global news headlines.

  • Updated May 6, 2024
  • Jupyter Notebook

Successfully established a machine learning model which can accurately predict whether an employee of a given company will leave it in the impending future or not, based on several employee details and employment metrics.

  • Updated Oct 15, 2023
  • Jupyter Notebook

Successfully established a machine learning model using PySpark which can accurately classify whether a bank customer will churn or not up to an accuracy of more than 86% on the test set.

  • Updated Aug 4, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to the data-exploration-and-preprocessing topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the data-exploration-and-preprocessing topic, visit your repo's landing page and select "manage topics."

Learn more