Data Intensive Computing

Description

Repository of the project developed for the "Data Intensive Computing" course, part of the Master of Science in Distributed Systems and Data Mining for Big Data at KTH Royal Institute of Technology.

This course aims at providing students with the knowledge and skills needed to understand, design and develop complex pipelines to process Big Data. Relevant frameworks like Spark, Flink and Kafka are all introduced and studied during the course, with an heavy focus on hands-on implementation.

This repository refers to the 2019 edition of the course. The implementation consists in a Big Data system to retrieve live-streaming tweets from featured hashtags on Twitter, process them and extract the keywords that represent each hashtag. Finally, all data is presented using a Word Cloud visualization in a Web Application deployed on Heroku.

Website

Trend Analyser

The Kafka Consumer and Producer are available under /Spark. Kafka broker is deployed on a Google Cloud instance, now powered off.

The project has been developed with the following technologies:

Big Data: Spark, Spark Streaming, Kafka
Backend: Node.js
Database: PostgreSQL
Frontend: HTML5, CSS3, jQuery, Bootstrap

Group

First name	Last Name	Email address
Vittorio Maria Enrico	Denti	[email protected]
Francesco Vito	Lorenzo	[email protected]

Name		Name	Last commit message	Last commit date
Latest commit History 25 Commits
.idea		.idea
Spark		Spark
api		api
controllers		controllers
public		public
service		service
utils		utils
.DS_Store		.DS_Store
.gitignore		.gitignore
README.md		README.md
index.js		index.js
package-lock.json		package-lock.json
package.json		package.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Data Intensive Computing

Description

Website

Group

About

Releases

Packages

Languages

vittorio96/DIC

Folders and files

Latest commit

History

Repository files navigation

Data Intensive Computing

Description

Website

Group

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages