Skip to content

An assignment to explore data analysis using the pandas module in Python.

License

Notifications You must be signed in to change notification settings

nyu-database-design/pandas-exploration

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

pandas exploration

In this assignment you will select a data set and do some munging, analysis, and visualization of it using pandas, Jupyter Notebooks, and associated Python-centric data science tools.

Data selection and retrieval

Select a data source

First, you will need to select a datafile to work from. For this assignment, please select any reputable data source that is of interest to you. Download the data in a plain text data format, not a spreadsheet-specific file format.

Where to find data

There are many data sources available at NYU Libraries' Data Services division. Use all available resources to identify a data set of interest to yourself.

Save the data

Save the original raw data file of your choice into the data directory.

Jupyter Notebook

Use JupyterLab to open the Jupyter Notebook named analysis.ipynb. You will import the data file and do all the data munging, analysis, and visualization within this notebook.

Submit your work

Use Visual Studio Code to perform git stage, commit and push actions to submit. These actions are all available as menu items in Visual Studio Code's Source Control panel.

  1. Type a short note about what you have done to the files in the Message area, and then type Command-Enter (Mac) or Control-Enter (Windows) to perform git stage and commit actions.
  2. Click the ... icon next to the words, "Source Control" and select "Push" to perform the git push action. This will upload your work to your repository on GitHub.com.

Pushing work in Visual Studio Code

About

An assignment to explore data analysis using the pandas module in Python.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published