diff --git a/01_materials/slides/04_choosing_the_right_visualization.pdf b/01_materials/slides/04_choosing_the_right_visualization.pdf index 02c4dcbd2..8bf01c0ef 100644 Binary files a/01_materials/slides/04_choosing_the_right_visualization.pdf and b/01_materials/slides/04_choosing_the_right_visualization.pdf differ diff --git a/02_activities/assignments/assignment_1.md b/02_activities/assignments/assignment_1.md index 74d1323aa..025f544bd 100644 --- a/02_activities/assignments/assignment_1.md +++ b/02_activities/assignments/assignment_1.md @@ -34,7 +34,7 @@ * You should make a commit after each session with that lesson's code and notes. Your PR should have the same number of commits as there are sessions. It is important to make the commits to your branch in a timely manner right after each class. ### Submission Parameters: -* Submission Due Date: `ongoing` +* Submission Due Date: 23:59 - 02/02/2026 * The branch name for your repo should be: `assignment-1` * What to submit for this assignment: * The `participation` folder/directory should be populated with the above mentioned .py/.ipynb files along with any written notes or comments (preferably in .md or .txt format). diff --git a/02_activities/assignments/assignment_2.md b/02_activities/assignments/assignment_2.md index 22cf8bdea..e0244e4b9 100644 --- a/02_activities/assignments/assignment_2.md +++ b/02_activities/assignments/assignment_2.md @@ -51,7 +51,7 @@ 🚨 **Please review our [Assignment Submission Guide](https://github.com/UofT-DSI/onboarding/blob/main/onboarding_documents/submissions.md)** 🚨 for detailed instructions on how to format, branch, and submit your work. Following these guidelines is crucial for your submissions to be evaluated correctly. ### Submission Parameters: -* Submission Due Date: `23:59 - 10/26/2025` +* Submission Due Date: `23:59 - 01/26/2026` * The branch name for your repo should be: `assignment-2` * What to submit for this assignment: * This markdown file (assignment_2.md) should be populated and should be the only change in your pull request. diff --git a/02_activities/assignments/assignment_3.md b/02_activities/assignments/assignment_3.md index 99341bc82..91b64a4d2 100644 --- a/02_activities/assignments/assignment_3.md +++ b/02_activities/assignments/assignment_3.md @@ -48,7 +48,7 @@ 🚨 **Please review our [Assignment Submission Guide](https://github.com/UofT-DSI/onboarding/blob/main/onboarding_documents/submissions.md)** 🚨 for detailed instructions on how to format, branch, and submit your work. Following these guidelines is crucial for your submissions to be evaluated correctly. ### Submission Parameters: -* Submission Due Date: `23:59 - 11/02/2025` +* Submission Due Date: `23:59 - 02/02/2026` * The branch name for your repo should be: `assignment-3` * What to submit for this assignment: * A folder/directory containing: diff --git a/03_instructional_team/markdown_slides/01_course_intro.md b/03_instructional_team/markdown_slides/01_course_intro.md index 6092eed1d..bb1ed48ab 100644 --- a/03_instructional_team/markdown_slides/01_course_intro.md +++ b/03_instructional_team/markdown_slides/01_course_intro.md @@ -1,4 +1,4 @@ ---- +--- marp: true theme: dsi_certificates_theme paginate: true @@ -12,9 +12,20 @@ $ echo "Data Sciences Institute" --- +# Acknowledgement + +We wish to acknowledge this land on which the University of Toronto operates. For thousands of years it has been the traditional land of the Huron-Wendat, the Seneca, and most recently, the Mississaugas of the Credit River. Today, this meeting place is still the home to many Indigenous people from across Turtle Island and we are grateful to have the opportunity to work on this land. + +-------- + + # Welcome! -Introductions and housekeeping + +- TF: krystal Wang (she/her) fw2400@cumc.columbia.edu +- LS: Tianyi tianyi21e@gmail.com +- LS: Anjali Deshpande deshpande2013@gmail.com +- LS: Vishakh Patel Vishakh8128@gmail.com --- @@ -24,7 +35,7 @@ Introductions and housekeeping - Question what makes ‘good’ data visualization - Introduce a range of software and tools that are used for data visualization ---- +------- # Case Study: Why should we care about data visualization? diff --git a/03_instructional_team/markdown_slides/02_getting_started_matplotlib.md b/03_instructional_team/markdown_slides/02_getting_started_matplotlib.md index f078dacfb..0e34831ea 100644 --- a/03_instructional_team/markdown_slides/02_getting_started_matplotlib.md +++ b/03_instructional_team/markdown_slides/02_getting_started_matplotlib.md @@ -168,6 +168,7 @@ ax.hist(y) ax.set_title('Total growth over time') ax.set_ylabel('Total growth') ax.set_xlabel('Years since start') + fig.tight_layout() ``` --- @@ -192,6 +193,7 @@ ax.hist(y) ax.set_title('Total growth over time', fontdict = font1) ax.set_ylabel('Total growth', fontdict = font2) ax.set_xlabel('Years since start', fontdict = font2) + fig.tight_layout() ``` ---- @@ -203,7 +205,6 @@ ax.hist(y) ``` fig, ax = plt.subplots(figsize=(5, 3)) ax.plot(x,y) - ax.set_title('Total growth over time', fontdict = font1, loc = 'left') ax.set_ylabel('Total growth', fontdict = font2) ax.set_xlabel('Years since start', fontdict = font2) ``` @@ -220,8 +221,8 @@ ax.hist(y) ``` fig, ax = plt.subplots(figsize=(5, 3)) - ax.scatter( x, - y, + ax.scatter(x, + y, marker='*', # NEW ADDITIONS color = "indigo") # NEW ADDITIONS @@ -271,7 +272,7 @@ ax.plot(x,y,marker='*', linestyle = '--', linewidth = 2, markeredgecolor = '#fa9359', # NEW ADDITIONS - markerfacecolor = '#fa9359' ) # NEW ADDITIONS + markerfacecolor = '#000000' ) # NEW ADDITIONS ``` --- @@ -325,6 +326,13 @@ ax.plot(x,y,marker='*', --- +# Feedback! + + + + +----- + # In the next session, we'll continue with... - How do we choose the right data visualization for a given situation? diff --git a/03_instructional_team/markdown_slides/04_choosing_the_right_visualization.md b/03_instructional_team/markdown_slides/04_choosing_the_right_visualization.md index af8ea01ee..30c159e8f 100644 --- a/03_instructional_team/markdown_slides/04_choosing_the_right_visualization.md +++ b/03_instructional_team/markdown_slides/04_choosing_the_right_visualization.md @@ -32,7 +32,73 @@ REFERENCES D’Ignazio, C., & Klein, L. (2020). 3. On Rational, Scientific, Objective Viewpoints from Mythical, Imaginary, Impossible Standpoints. In Data Feminism. MIT Press. https://data-feminism.mitpress.mit.edu/pub/5evfe9yd/release/5 --> ---- +----- +# Agenda for today + +- Go through slide deck #4: Choosing the right visualization +- Discuss assignment 2 + +------ + +# Review - How does matplotlib work? + +- A **figure** is like a container that holds a set of **axes** + +- The axes is our actual plot or graph + +- A figure can hold multiple axes (like subplots) + +- Every visual element of our plots – colour, legends, axis titles and scales, text – is called an **artist** and belongs to an axes (not to a figure) + +------ + +# Review - Making a figure with matplotlib + +- Create a **Figure** container and one **Axes** (subplot) inside it; store them in fig and ax + + +``` + fig, ax = plt.subplots(figsize=(5, 3)) +``` + +- If we color the `fig` object red and the `ax` object green, we get the following result: + +![w:500](./images/fig_axes.png) + +------- + +# Review - Making a figure with matplotlib + +- Make our basic scatterplot: + +``` +fig, ax = plt.subplots(figsize=(5, 3)) +ax.scatter(x,y) +fig.show() + +``` +![w:450](./images/fig_axes.png) + + +------ + +# Review - Example + +![w:500](./images/fig_axes2.png) + +- Figure = the entire canvas (in red) that contains everything + +- Axes = the plotting area inside a figure (the green boxes with x/y ticks where data is drawn). + +---- + +# Assignment Deadlines + +- Assignment 2: January 26, 2026, 11:59 PM (Monday) +- Assignments 1 and 3: February 2, 2026, 11:59 PM (Monday) + +----- + ### - So far, we have learned how to make and modify different types of data visualizations ### - How do we decide which of these types of data visualization to use, and when?