Skip to content

Latest commit

 

History

History
15 lines (12 loc) · 1.54 KB

README.md

File metadata and controls

15 lines (12 loc) · 1.54 KB

Chemicals in Cosmetics

Overview

Consumers are generally not aware of the potentially hazardous ingredients in the cosmetics that they use on a daily basis. In order to make such information available to the public, The California Safe Cosmetics Program in the California Department of Public Health has been collecting information on these hazardous ingredients in cosmetic products sold in the state. Under the California Safe Cosmetics Act, manufacturers, packers, and/or distributors named on the product label are required to report a list of all cosmetic products that contain any ingredients which may cause cancer, birth defects, or issues related to development and reproduction.

Data

Raw data about the cosmetic companies that have reported hazardous ingredients in their products is aggregated by the CDPH and made available by HealthData.gov. Free-text columns are cleaned using the Python pandas library in an effort to reduce redundant company and brand names that have been recorded in different formats.

Files

  • cscpopendata.csv: raw dataset
  • cscopendata_clean.csv: cleaned dataset
  • cali_chemicals_cosmetics.ipynb: Jupyter notebook for cleaning free-text columns

Analysis and Visualization

Exploratory analysis and visualization are done in Tableau. The final visualization has been submitted for Tableau's 2020 Iron Viz competition and can be found here.