lsh-algorithm

The assignment comprises two main tasks: implementing LSH to identify similar businesses based on user ratings and developing various collaborative filtering recommendation systems to predict user ratings for businesses.

spark collaborative-filtering hybrid-recommendation lsh-algorithm model-based-recommendation item-based-recommendation

Updated Feb 25, 2024
Python

DevPhamPham / NCKH_PySpark

Star

pyspark flask-application mobilenetv2 lsh-algorithm

Updated Apr 3, 2024
Python

Lefteris-Souflas / Movie-Rating-User-Similarity

Star

Explored Jaccard distance, Min-Hashing, and LSH for user similarity in a movie rating dataset. Tasks involve dataset preprocessing, exact Jaccard Similarity computation, Min-Hash signatures, and LSH implementation. Results and observations are documented in code, output files, and a report

jaccard-similarity lsh-algorithm min-hashing

Updated Apr 18, 2024
Jupyter Notebook

shaltielshmid / MinHashSharp

Star

A Robust Library in C# for Similarity Estimation

statistics lsh minhash deduplication lsh-implementation lsh-algorithm deduplication-filter

Updated Nov 30, 2023
C#

SwamiKannan / Natural-Language-Processing-Specialization

Star

Coursera's Natural Language Processing specialization

viterbi-algorithm natural-language-processing coursera n-grams locality-sensitive-hashing part-of-speech-tagger specialization stochastic-gradient-descent word2vec-algorithm lsh-algorithm

Updated Oct 25, 2022
HTML

LM1997610 / ADM_HW4

Star

Homework_4 for Algorithmic Methods for Data Mining (ADM), MSc in Data Science at La Sapienza University of Rome

pca-analysis dimensionality-reduction minhash-lsh-algorithm k-means-clustering lsh-algorithm

Updated Mar 9, 2023
Jupyter Notebook

JaiJaveria / Data_Mining

Star

Projects involving Frequent Itemset Mining and analysis of hierarchical space partitioning techniques

r-tree fp-tree apriori-algorithm lsh-algorithm

Updated Nov 18, 2021
HTML

kochlisGit / Big-Data-Algorithms

Star

Implementation of algorithms for big data using python, numpy, pandas.

python bloom-filter lsh streams frequent-itemset-mining pcy frequent-itemsets stream-mining shingling big-data-processing lsh-algorithm min-hasing similar-items a-priori multistage-pcy multihash-pcy

Updated Apr 27, 2020
Python

Sitaras / Software-Development-for-Algorithmic-Problems_Project-2

Star

📈|Time Series - Nearest neighbor search and Clustering using LSH, Hypercube (and Lloyd's only at the clustering) algorithms with metrics: L2, Discrete and Continuous Fréchet.

time-series clustering lsh nearest-neighbor-search nearest-neighbors knn fr hypercube frechet-distance lsh-implementation lsh-algorithm

Updated Apr 15, 2022
C

Muvels / LSHEngine

Star

This repo aims to implement an modular engine for Locality-Sensitive Hashing (LSH).

python database huffman-coding lsh-algorithm

Updated May 5, 2023
Python

AlessandraMonaco / Data-Mining

Star

This repository contains simple and funny Data Mining projects in Python.

data-science data-mining clustering inverted-index feature-engineering lstm-neural-networks bert-model lsh-algorithm

Updated Feb 22, 2021
Jupyter Notebook

mark-antal-csizmadia / finding-similar-items-textually-similar-documents

Star

Finding Similar Items: Textually Similar Documents

python data-mining textual-similarity shingling lsh-algorithm similar-items min-hashing

Updated Sep 14, 2022
Jupyter Notebook

eduardosantoshf / most-frequent-itemsets

Star

MDLE First Assignment - The objective of this project was to implement the A-Priori algorithm to obtain the most frequent itemsets for a list of conditions for a large set of patients, obtaining then associations between conditions by extracting some rules, and also to implement and apply LSH to identify similar news articles from a dataset.

data-mining-algorithms apriori-algorithm lsh-algorithm