Nanodegree: AI Product Management

This repository tracks my journey as I progress through the Udacity Nanodegree in AI Product Management.

The course provides a comprehensive overview of how product management for AI technologies differ from other project. It covers everything from creating datasets and building models, to measuring impact and continuously updating models.

Course Layout 📚

🌟 Introduction to AI in Business

In this module, we delve into the fascinating world of AI and its applications in business. We learn how AI technologies can analyse and learn from data, leading to improved decision-making and more effective product management.

📝 Creating Datasets

In this part of the course, we explore how to build custom datasets, which form the foundation of machine learning models. We also learn how to annotate datasets effectively to ensure accurate and meaningful results.

skills:

Dataset building
Data fit & annotation
Data cleaning
Job design
CML
Result auditing
Planning for failure & longevity

🧠 Building Models

This module introduces us to the process of training and evaluating neural networks, which are at the heart of many AI technologies. We also learn about automated machine learning tools that can simplify and accelerate the model-building process.

skills:

Modeling
Neural networks
Neural architecture search
Activation functions
Back propagation
Multi-layer perceptrons
Convolutional neural networks
Long-short term memory (LSTM) networks
Training data / validation data / test data
Model evaluation
precision / recall / F1 score
Transfer learning
Automated Machine learning (AutoML)
Cloud Solutions

📈 Measuring Impact and Updating Models

In the final part of the course, we learn how to improve machine learning models and AI products. We discuss strategies to mitigate bias and explore how to scale AI/ML products. The module also covers the importance of continuously updating models to ensure they remain effective and relevant.

skills:

Measure business impact
Testing & versioning
Monitoring
Mitigating bias (model bias / data bias / annotation bias)
Continuous learning
Data security & privacy
Data laws
Proof of concent
Ethics & compliance
Scaling
AI product development
Define business cases
Prototype, test & refine
Release management

Course Projects 🏗️

🩻 Create a Medical Image Annotation Job

In this project, we designed a product aimed at assisting doctors in quickly identifying cases of pneumonia in children. The goal was to develop a classification system that could:

Flag serious cases
Quickly identify healthy cases
Act as a diagnostic aid for doctors

We started by building a labeled dataset capable of distinguishing between healthy and pneumonia x-ray images. This dataset would later serve as a foundation for machine learning engineers to construct a classification product.

We used the Chest x-ray dataset, with labels removed, and every data point is a chest x-ray image. The images vary in size and exposure times.

A typical, labeled image is shown below:

The challenge in this project lies in the fact that it is not always clear when pneumonia symptoms are present in an image. Thus, the system is not intended to replace a doctor, but rather to aid in quickly identifying healthy patients and highlighting potential cases of pneumonia.

To address this challenge, we designed a data annotation job suitable for a non-expert to identify more noticeable cases of pneumonia. We planned for uncertainty in data labels and incorporated test questions to capture this uncertainty.

Below are some visual examples demonstrating the characteristics of a healthy image and symptoms of pneumonia:

Characteristics of a healthy image: a clear lung area.

Examples of pneumonia symptoms: (Left) a concentrated, opaque area in the lungs, (Right) multiple, smaller opaque areas throughout the lung area and any diaphragm shadow is obscured:

Our main deliverables for this project were an HTML file of a complete job Preview, which includes instructions for annotation and example test questions, and a Proposal document discussing the job's design and quality assurance steps.

In our annotator instructions, we acknowledged the challenging nature of the classification task and provided clear examples and instructions to potential annotators. We offered an 'Unknown' or 'Other' option to account for uncertainty in an annotation and allowed annotators to indicate their confidence level in the presence of pneumonia symptoms on a numerical scale.

This project was a valuable experience in dataset creation, annotation job design, handling uncertainty, and planning for quality assurance.

feedback we received from the reviewer on this project:

🩺 Develop & Train Classification Models for Pneumonia Detection

In this extension of the project, we aimed to use the curated medical image dataset (supra) to build different machine learning models capable of discerning pneumonia in chest x-rays. The ultimate goals are:

To swiftly identify uncomplicated cases.
To serve as a reliable red-flag mechanism for serious cases.
To act as an adjunct to a doctor's diagnostic process.

1. Binary Classifier for Pneumonia Detection

Utilizing a balanced dataset "normal" and "pneumonia" images, we trained a basic binary classifier. Key areas of evaluation included:

Train/test split: Understanding the partition between training and validation sets.
Confusion matrix: Interpreting each cell in the matrix.
Precision & recall: Understanding and calculating these key metrics.

2. Unbalanced Binary Classifier

To explore the effects of data imbalance, we trained a model on a dataset of 1/3 of "normal" than "pneumonia" images. This exercise aimed to understand:

Confusion matrix: Insights into the behavior of skewed data.
Precision & recall: How these metrics are affected in an unbalanced setting.
Unbalanced Classes: The impact on the machine learning model.

3. Binary Classifier with Dirty Data

In this model, we introduced data noise by mislabeling 1/10 of the images in each category. This aimed to assess:

Confusion matrix: How dirty data affects the matrix.
Precision & recall: Changes in these metrics due to data noise.
Dirty data: The overall impact on the machine learning model.

4. Tri-Class Model for Pneumonia Types

For a more detailed diagnosis, we trained a model to differentiate between "normal," "bacterial pneumonia," and "viral pneumonia". Evaluation included:

Confusion matrix: The behavior in a multi-class setting.
Precision & recall: Calculation and evaluation of these metrics for multiple classes.
More Data: Exploration of data augmentation techniques to reach an 85% threshold for precision and recall, and assessing the volume of data needed.

After intensive model training, all findings, analyses, and visual representations are documented in the AutoML Modeling Report. We recommend referring to this report for detailed insights

Feedback received from a modeling expert:

This project was an invaluable experience in not just model development but also in understanding the complexities of medical diagnostics, striving for models that are versatile and resilient across diverse scenarios.

Course Context

I am currently pursuing these courses to broaden and deepen my skill set as a Data Scientist/AI Expert, enhancing my T-shaped profile. As I find myself taking up more responsibities towards tasks entailing more project and product management, I have curated a path of learning to equip myself with the relevant skills and knowledge. This learning journey is outlined as follows:

✅ Product Manager Nanodegree

The Product Manager Nanodegree programme will equip you with the foundational skills to assume entry-level product manager roles. You’ll learn directly from experienced Product Managers at Uber and Google, who have constructed this Nanodegree program to equip you with the most in-demand and relevant industry skills. This Nanodegree program teaches the core skill set required in all Product Manager roles, which is the foundation for more specialized roles like Growth Product Manager, Data Product Manager, AI Product Manager, and more.

Prerequisite knowledge:

General understanding of the product development lifecycle
Familiarity with different roles required to build a product
Familiarity with Google Workspace and/or Microsoft Office Applications

✅ AI Product Manager Nanodegree

The AI Product Manager Nanodegree programme is meant for product managers that are responsible for building and deploying AI products. The AI PM Nanodegree program is focused on the hands-on tasks of scoping a data set, training a model, and evaluating the performance of the model.

Prerequisite knowledge:

Product management
Data analysis
AI/ML algorithms

✅ Data Product Manager Nanodegree

The Data Product Manager Nanodegree programme is meant for experienced Product Managers who are looking to specialize their skills in product management and be equipped to fill data-focused roles in the development and strategy behind data products. You'll learn how to build an MVP launch strategy for a new service product that utilizes market insights extracted from extensive data analyses and visualizations, develop a data model with corresponding data pipelines and transformations to evaluate user activity of a product, and identify key behavioral and descriptive attributes of users to construct hypotheses for new product features and experiments to validate these hypotheses.

Prerequisite knowledge:

Data Analysis
Product Management
Big data
Database
AI/ML Algorithms

⏳ Growth Product Manager Nanodegree (in consideration)

The Growth Product Manager Nanodegree programme is meant for experienced Product Managers who are looking to specialize their skills in product management and be equipped to fill growth-focused roles. You’ll learn how to grow the user base of your product, get customers engaged and activated as quickly as possible, and monetize your product to have it generate revenue.

Prerequisite knowledge:

Experience as a Product Manager
Experience scoping business requirements
KPIs
Data/statistical analysis
Excel/Spreadsheets

Name		Name	Last commit message	Last commit date
Latest commit History 60 Commits
Study Material		Study Material
images		images
project-1_annotate-medical-data-set		project-1_annotate-medical-data-set
.gitignore		.gitignore
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Nanodegree: AI Product Management

Course Layout 📚

🌟 Introduction to AI in Business

📝 Creating Datasets

skills:

🧠 Building Models

skills:

📈 Measuring Impact and Updating Models

skills:

Course Projects 🏗️

🩻 Create a Medical Image Annotation Job

🩺 Develop & Train Classification Models for Pneumonia Detection

1. Binary Classifier for Pneumonia Detection

2. Unbalanced Binary Classifier

3. Binary Classifier with Dirty Data

4. Tri-Class Model for Pneumonia Types

Course Context

✅ Product Manager Nanodegree

Prerequisite knowledge:

✅ AI Product Manager Nanodegree

Prerequisite knowledge:

✅ Data Product Manager Nanodegree

Prerequisite knowledge:

⏳ Growth Product Manager Nanodegree (in consideration)

Prerequisite knowledge:

About

Releases

Packages

Languages

MarieLynneBlock/nanodegree_AI-product-management

Folders and files

Latest commit

History

Repository files navigation

Nanodegree: AI Product Management

Course Layout 📚

🌟 Introduction to AI in Business

📝 Creating Datasets

skills:

🧠 Building Models

skills:

📈 Measuring Impact and Updating Models

skills:

Course Projects 🏗️

🩻 Create a Medical Image Annotation Job

🩺 Develop & Train Classification Models for Pneumonia Detection

1. Binary Classifier for Pneumonia Detection

2. Unbalanced Binary Classifier

3. Binary Classifier with Dirty Data

4. Tri-Class Model for Pneumonia Types

Course Context

✅ Product Manager Nanodegree

Prerequisite knowledge:

✅ AI Product Manager Nanodegree

Prerequisite knowledge:

✅ Data Product Manager Nanodegree

Prerequisite knowledge:

⏳ Growth Product Manager Nanodegree (in consideration)

Prerequisite knowledge:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages