Big Data - Exploratory Data Analysis

Analyzing Road Crash Data

The Department of Planning, Transport and Infrastructure (DPTI), South Australia, collects data on road crashes for analysis aimed at improving road safety. The dataset grows over time, and the rising number of vehicles on the road adds to its volume. Extending the analysis across multiple states would produce a considerably larger dataset. This project uses Apache Spark to run a range of operations on the dataset and answer different queries about it.

This was an individual assignment where I scored the highest grade.

Setup

Clone this repository to your machine

git clone https://github.com/akale1994/Big-Data-Exploratory-Data-Analysis.git

Make sure your Apache Spark clusters are running
Open and run the notebook

Root directory > Assignment 1.ipynb
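
Before opening the notebook, it can help to confirm that Spark is actually reachable. The following is a minimal sketch, assuming PySpark is installed in the notebook's Python environment. The function name `spark_is_running`, the app name `spark-smoke-test`, and the `local[*]` master are illustrative assumptions, not taken from the notebook, which may configure its session differently.

```python
def spark_is_running(master: str = "local[*]") -> bool:
    """Return True if a SparkSession can be created and run a trivial job.

    Hypothetical helper for illustration; it is not part of the repository.
    """
    try:
        from pyspark.sql import SparkSession
    except ImportError:
        # PySpark is not installed in this environment.
        return False

    spark = None
    try:
        # "spark-smoke-test" is a placeholder app name.
        spark = (
            SparkSession.builder
            .master(master)
            .appName("spark-smoke-test")
            .getOrCreate()
        )
        # spark.range(5) builds a 5-row DataFrame; count() forces a real job.
        return spark.range(5).count() == 5
    except Exception:
        # Any startup or execution failure (for example, a missing Java
        # runtime or an unreachable master) is treated as "not running".
        return False
    finally:
        if spark is not None:
            spark.stop()


if __name__ == "__main__":
    print("Spark is running" if spark_is_running() else "Spark is not available")
```

To check a standalone cluster instead of a local session, pass its master URL as the `master` argument. The check returns `False` rather than raising, so it can be used as a simple guard at the top of a notebook.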
