Programming languages Compilation

Hila Calderon edited this page Nov 9, 2020 · 1 revision

Data Mining Compilation

  1. https://github.com/hicala/news-classifier

    News Classifier App

    Overview

    In this research project we took a political news dataset from the 2016 US presidential election and built a machine learning model in Python that classifies news as REAL or FAKE. We vectorized the text with a TfidfVectorizer, initialized a PassiveAggressiveClassifier, and fit the model. Finally, we evaluated the model to measure its accuracy.
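
The pipeline described above could be sketched roughly as follows with scikit-learn. This is a minimal illustration, not the project's actual code: the toy headlines, the train/test split, and the parameter values (`test_size`, `max_iter`, `random_state`) are assumptions chosen only to make the example run.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import PassiveAggressiveClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split

# Hypothetical toy corpus standing in for the real news dataset.
texts = [
    "senate passes budget bill after long debate",
    "candidate outlines tax plan at town hall",
    "court schedules hearing on voting rules",
    "governor signs infrastructure funding law",
    "secret memo proves aliens rigged the ballots",
    "miracle pill lets candidate read voters minds",
    "shocking leak reveals moon base campaign office",
    "celebrity clone replaces senator overnight",
]
labels = ["REAL", "REAL", "REAL", "REAL", "FAKE", "FAKE", "FAKE", "FAKE"]

# Hold out part of the data so accuracy is measured on unseen text.
x_train, x_test, y_train, y_test = train_test_split(
    texts, labels, test_size=0.25, random_state=7, stratify=labels
)

# Turn raw text into TF-IDF feature vectors (fit on training data only).
vectorizer = TfidfVectorizer(stop_words="english")
tfidf_train = vectorizer.fit_transform(x_train)
tfidf_test = vectorizer.transform(x_test)

# Fit the passive-aggressive linear classifier.
classifier = PassiveAggressiveClassifier(max_iter=50, random_state=7)
classifier.fit(tfidf_train, y_train)

# Score the model on the held-out texts.
predictions = classifier.predict(tfidf_test)
accuracy = accuracy_score(y_test, predictions)
print(f"Accuracy: {accuracy:.2%}")
```

With a corpus this small the reported accuracy is not meaningful; the point is only the order of the steps: vectorize, fit, predict, score.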

  2. https://github.com/hicala/prj_911_kaggle

    Data analytical review of the 911 call incidents in 2016: Pennsylvania, US

    Overview

    Data Source: Kaggle.

    The data contains the following fields (all are declared as string variables):

  • lat: Latitude
  • lng: Longitude
  • desc: Description of the Emergency
  • zip: Zipcode
  • title: Title
  • timeStamp: YYYY-MM-DD HH:MM:SS
  • twp: Township
  • addr: Address
  • e: Dummy variable (always 1)
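
Because every field arrives as a string, a first step is usually to convert the numeric and date fields to proper types. The sketch below shows one way to do that with the standard library; the two sample rows are made up for illustration, and `parse_call` is a hypothetical helper, not a function from the project.

```python
import csv
import io
from collections import Counter
from datetime import datetime

# Hypothetical sample rows using the field names listed above.
SAMPLE_CSV = """lat,lng,desc,zip,title,timeStamp,twp,addr,e
40.10,-75.30,Example description,19401,Example title,2016-01-10 17:40:00,EXAMPLE TOWNSHIP,EXAMPLE RD,1
40.20,-75.40,Another description,19402,Another title,2016-02-03 08:15:30,OTHER TOWNSHIP,SAMPLE AVE,1
"""


def parse_call(row):
    """Convert one all-string CSV row into typed values."""
    return {
        "lat": float(row["lat"]),
        "lng": float(row["lng"]),
        "desc": row["desc"],
        "zip": row["zip"],  # kept as text so leading zeros survive
        "title": row["title"],
        # timeStamp follows the YYYY-MM-DD HH:MM:SS layout described above
        "timeStamp": datetime.strptime(row["timeStamp"], "%Y-%m-%d %H:%M:%S"),
        "twp": row["twp"],
        "addr": row["addr"],
        "e": int(row["e"]),
    }


calls = [parse_call(row) for row in csv.DictReader(io.StringIO(SAMPLE_CSV))]

# Example aggregation: number of calls per calendar month.
calls_per_month = Counter(call["timeStamp"].month for call in calls)
print(calls_per_month)
```

Once `timeStamp` is a real `datetime`, grouping by month, weekday, or hour becomes a one-line aggregation.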
  1. https://github.com/hicala/gdp_python-data-mining

    GDP Analysis in the context of the economic development and income growth

    Overview

    I gathered GDP data and examined how it relates to GDP per capita. Per capita GDP is a metric that breaks down a country’s GDP per person and is calculated by dividing the GDP of a country by its population. Gross domestic product (GDP) is the market value of all final goods and services produced by a nation in a given year. Countries are sorted by nominal GDP estimates from financial and statistical institutions, which are calculated at market or official government exchange rates.

    I use Python to carry out data mining on worldwide GDP data for three years (2018, 2019, 2020).
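
The per capita calculation described above is a single division. The sketch below applies it to the three years covered; the figures belong to a made-up country and are assumptions for illustration only, and `gdp_per_capita` is a hypothetical helper name.

```python
def gdp_per_capita(gdp_usd: float, population: int) -> float:
    """Per capita GDP = GDP of the country / its population."""
    if population <= 0:
        raise ValueError("population must be positive")
    return gdp_usd / population


# Hypothetical (GDP in USD, population) pairs for one made-up country.
figures = {
    2018: (1_000_000_000_000, 50_000_000),
    2019: (1_050_000_000_000, 50_500_000),
    2020: (1_020_000_000_000, 51_000_000),
}

per_capita = {
    year: gdp_per_capita(gdp, population)
    for year, (gdp, population) in figures.items()
}

for year in sorted(per_capita):
    print(f"{year}: {per_capita[year]:,.2f} USD per person")
```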

  2. https://github.com/hicala/piracy_reporting_centre_app

    The 21st Century Piracy Phenomenon. Exploring Contemporary Sea Piracy

    Overview

    The main goal of this study is to evaluate where modern piracy incidents are concentrated around the world. Modern-day pirates share the legal designation of their historic brethren as “enemies of all mankind” because they disrupt and hinder the safe navigation of maritime vessels carrying goods and people.

    Piracy still exists in the 21st century. It is a global crime that impedes the free movement of ships carrying people and goods, with attendant economic ramifications. The perpetrators are usually heavily armed with sophisticated weapons, which enable them to hijack one or more vessels and redirect them to a chosen location in exchange for an expected ransom.

  3. https://github.com/hicala/nba_roster_analytic

    An NBA Statistical Analysis: Atlanta Hawks Roster, 2019-2020

    Overview

    This study is part of a series of statistical analyses of roster composition and the salaries earned by key players in the NBA.

    This Python app uses Beautiful Soup, a Python library for parsing data out of HTML and XML files (such as web pages). It works with your preferred parser to provide idiomatic ways of navigating, searching, and modifying the parse tree.

    The data comes from the Atlanta Hawks roster for the 2019-2020 season.
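
As a rough illustration of how Beautiful Soup walks a parse tree, the sketch below extracts player names and salaries from an inline HTML table. The HTML snippet, the `roster` table id, the player names, and the salary values are all hypothetical; the real roster page will have a different structure, so the selectors would need adjusting.

```python
from bs4 import BeautifulSoup

# Hypothetical HTML standing in for a downloaded roster page.
SAMPLE_HTML = """
<table id="roster">
  <tr><th>Player</th><th>Salary</th></tr>
  <tr><td>Player A</td><td>$1,000,000</td></tr>
  <tr><td>Player B</td><td>$2,500,000</td></tr>
</table>
"""


def parse_roster(html):
    """Return a list of {'player', 'salary'} dicts from the roster table."""
    # "html.parser" is the parser bundled with Python's standard library.
    soup = BeautifulSoup(html, "html.parser")
    table = soup.find("table", id="roster")
    rows = []
    for tr in table.find_all("tr")[1:]:  # skip the header row
        name, salary = [td.get_text(strip=True) for td in tr.find_all("td")]
        # Strip the currency symbol and thousands separators before converting.
        rows.append(
            {"player": name, "salary": int(salary.replace("$", "").replace(",", ""))}
        )
    return rows


roster = parse_roster(SAMPLE_HTML)
print(roster)
```

With salaries stored as integers, summary statistics such as totals or averages can be computed directly from `roster`.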