📺 Suggestion Path Mapping

🎯 Project Overview

This project explores how YouTube’s suggestion system can lead viewers — particularly younger users — from family-friendly content to videos with potentially inappropriate language or themes. We start from a video known to be child-appropriate and recursively collect metadata, transcripts, and comments from suggested videos.

Each video is evaluated using a dictionary of inappropriate words/phrases curated with input from mental health professionals and concerned parents. We flag matches found in titles, descriptions, transcripts, and comments, then use a graph structure to map how videos are connected through YouTube’s recommendations.

🛠️ How It Works

🧠 Graph Structure (C++):
Videos are nodes; suggested links are directed edges.
🐍 Data Collection (Python):
Gathers metadata, comments, and transcripts using the YouTube API and stores them in a local SQLite database.
🚩 Flagging System:
Compares video content against a growing dictionary of inappropriate language. Re-analysis is supported when the dictionary is updated.
🌌 3D Visualization:
The graph is exported as graph.json and rendered with 3d-force-graph.
🔧 Technologies Used:
Python · SQLite · C++ · VSCode · Git · JavaScript + Three.js

🧠 Traversal Algorithms (Complete & Visualized)

We've designed traversal algorithms to simulate how a viewer might progress from safe to unsafe content, in order to compare their behavior and evaluate risk within the graph structure:

🔍 A*: Simulates the shortest weighted path to a flagged node.
🧭 Dijkstra’s: Captures efficient exploratory paths through high-volume content.
🎲 Random Walks: Mimics a naive user clicking randomly — often showing unintended risk chains.
🧵 DFS: Used as the base traversal and first fully visualized path overlay.

We’re also designing the structure so that future semantic analysis (via LLMs or classifiers) can be plugged in easily.

📄 Final Report & Dataset

🧾 [Report_152 - FINAL.pdf](docs/Report_152 - FINAL.pdf) — Full project documentation and findings
🗃️ youtube_data.db — Final dataset with video metadata, links, flags, and stats
📑 README_data.md — Table schema and ethical usage disclaimer

📂 Project Structure

data/           ← SQLite DB, flag dictionary
scripts/        ← Python scripts for collection & flagging
cpp/            ← Graph + algorithm implementations (C++)
visualization/  ← 3D rendering using ForceGraph3D + JSON export
backups/        ← Daily zipped backups of the DB
archive/        ← Deprecated or legacy scripts
docs/           ← Dev notes, algorithm summaries, and HowToRun instructions

🔍 Key Features

✅ Modular, recursive data collection
✅ Real-time flagging with an updatable dictionary
✅ Multiple traversal algorithms (testable + swappable)
✅ JSON graph export for visualization
✅ Auto-rotation of DB backups
✅ Clean Git structure for collaboration

📝 Extended Notes

For design decisions, collection strategies, and team-specific workflows, see docs/dev_notes.md. This file serves as a running log of project milestones and pivots.

🚀 Getting Started

✅ Prerequisites

Python 3.10+
C++17 or later
SQLite3
YouTube Data API v3 key
VSCode with Python & C++ extensions recommended

🔧 Setup

$ git clone https://github.com/YOUR_USERNAME/suggestion-path-mapping
$ cd suggestion-path-mapping
$ cp .env.example .env      # Add your YouTube API key
$ python scripts/run_all.py # Populate the database (see dev_notes for options)

🚦 Full Run Instructions

To compile and run the graph, launch the interactive CLI, and view results in the 3D visualization:

$ python compile_graph.py      # Compiles C++ files, runs the program, and starts the server (auto opens browser)

✅ Supports both PowerShell and MSYS2 UCRT64 terminals (unknown support for MinGW)

📘 Full run instructions and terminal compatibility table available in:

docs/HowToRun.md

🙌🏻 Acknowledgements

Built for COP3530: Data Structures & Algorithms
Mental health professionals who contributed to the inappropriate word list
YouTube API for enabling this kind of research
Our teammates for bringing their skills and interests to this project:
- Yepeth Berhie (@Y-Berhie)
- Carrie Ruble (@CouldBeYourMom)
- Adam Schwartz (@schwartza-afs)
- Kevin Yu (@kevinyu0)
Visualization powered by 3d-force-graph

🔒 This repository is now frozen and read-only for archival purposes.
All collaborators have been removed; all contributions are preserved and documented.
Please fork the project if you'd like to use it as a foundation.

See docs/team_acknowledgments.md for detailed contribution credits.

🪪 License

This project is licensed under the GNU GPLv3.
You are free to use and modify it for academic or non-commercial use, provided you maintain this license and include attribution.

Name		Name	Last commit message	Last commit date
Latest commit History 126 Commits
.github		.github
archive		archive
backups		backups
cpp		cpp
data		data
docs		docs
scripts		scripts
visualization		visualization
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
compile_graph.py		compile_graph.py
requirements.txt		requirements.txt
run_SPM.bat		run_SPM.bat

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📺 Suggestion Path Mapping

🎯 Project Overview

🛠️ How It Works

🧠 Traversal Algorithms (Complete & Visualized)

📄 Final Report & Dataset

📂 Project Structure

🔍 Key Features

📝 Extended Notes

🚀 Getting Started

✅ Prerequisites

🔧 Setup

🚦 Full Run Instructions

🙌🏻 Acknowledgements

🪪 License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 7

Uh oh!

Languages

License

CouldBeYourMom/suggestion-path-mapping

Folders and files

Latest commit

History

Repository files navigation

📺 Suggestion Path Mapping

🎯 Project Overview

🛠️ How It Works

🧠 Traversal Algorithms (Complete & Visualized)

📄 Final Report & Dataset

📂 Project Structure

🔍 Key Features

📝 Extended Notes

🚀 Getting Started

✅ Prerequisites

🔧 Setup

🚦 Full Run Instructions

🙌🏻 Acknowledgements

🪪 License

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 7

Uh oh!

Languages

Packages