Skip to content
View leschiffres's full-sized avatar
Block or Report

Block or report leschiffres

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
leschiffres/README.md

Hi there 👋

My name is Matthaios Letsios and in the past years I have been working in the field of Data Engineering and Data Science. List of my responsibilities in my previous roles have been to:

  • Maintain, develop and enhance airflow pipelines focusing proper data collection, transformation and storage in the most efficient manner.
  • Develop python services using technologies like gRPC, postgreSQL, clickhouse, redis and kafka.
  • Develop and maintain NLP services.
  • Developing different machine learning models (mainly anomaly detection).
  • Ensure reliable software development by appropriate unit and regression tests as well as automated linting as part of Github workflows.

Research

In the past I have also worked as a researcher in Telecom ParisTech & Universite Pierre Marie Curie.

I'm also proud of being co-author to the following publications:

  • Finding Heaviest k-Subgraphs and Events in Social Media. ICDM Workshops 2016 link
  • Scheduling under Uncertainty: A Query-based Approach. IJCAI 2018 link

Fun Projects

Understats Scraping

understat.com is a website providing advanced data for football matches e.g. xGoals, xAssists, xGChain etc. In this project I set up an airflow instance to collect the data for the Premier League football matches, transform them and then in the end visualize them in a jupyter notebook. The motivation behind this is the Fantasy Premier League game, where I use those data as a basis for the weekly player selection.

Book Recommender

This is an web app that makes use of sentence embeddings to be able to find appropriate books, based on the given desired description. The webapp was build using Fastapi.

Google Haschode

In the past I have participated in many google hashcode competitions. Unfortunately the competition no longer exists, but I hope Google will bring it back in the future. Here is the code for some of my participations.

Google Hashcode 2021 - Qualification Round

Problem: Given a city plan and all car itineraries in that city, the goal is to schedule all traffic lights, to help as many cars as possible to reach their destination on time.

Approach: Our solution takes into account the in & out degree of each traffic light and assigns proportinally the time to each traffic light. Also we find out that assigning to all traffic lights 1 second of traffic time provided efficient solutions in some instances. This solution allowed us to reach top 20% of the leaderboard during the competition.

Pinned Loading

  1. shiny_clustering_app shiny_clustering_app Public

    This is a basic shiny app that uses the shiny dashboard and applies a clustering algorithm to a given dataset.

    R 1

  2. book-recommender book-recommender Public

    Python

  3. Cache_Memory_Implementation Cache_Memory_Implementation Public

    This repository is about an efficient cache memory implementation. It was implemented as a project of the course Data Structures, during my bachelor studies.

    Java

  4. hashcode-2021-qualification-round hashcode-2021-qualification-round Public

    This is a repo containing our solution for the qualification round of google hashcode 2021. This solution allowed us to reach top 20% of the leaderboard during the competition.

    Jupyter Notebook

  5. understats-fpl understats-fpl Public

    Understats scraping to get player and team form for the Fantasy Premier League.

    Jupyter Notebook

  6. hashcode-2020-qualification-round hashcode-2020-qualification-round Public

    Python