Basic tour of Cloudera Data Science Workbench.
There are five scripts provided which walk through the interactive capabilities of Cloudera Data Science Workbench.
- Basic Python visualizations (Python 2). Demonstrates (see the visualization sketch after this list):
  - Markdown via comments
  - Jupyter-compatible visualizations
  - Simple console sharing
- PySpark (Python 2). Demonstrates (see the PySpark sketch after this list):
  - Easy connectivity to (Kerberized) Spark in YARN client mode
  - Access to the Hadoop HDFS CLI (e.g. `hdfs dfs -ls /`)
- TensorFlow (Python 2). Demonstrates:
  - Ability to install and use custom packages (e.g. `pip search tensorflow`)
- R on Spark via sparklyr (R). Demonstrates:
  - Use of the familiar dplyr interface with Spark via sparklyr
- Advanced visualization with Shiny (R). Demonstrates:
  - Use of the 'shiny' package to provide interactive graphics inside CDSW
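For orientation, here is a minimal sketch of the kind of content the basic visualization script contains; the data and plot below are made up for illustration and are not taken from the project script:

```python
# In a CDSW Python session, comments beginning with "#" are rendered as Markdown
# in the console output, so a script can interleave narrative and code.

# ## Departure delays by carrier
# A quick bar chart, rendered inline in the session just as in Jupyter.

import pandas as pd
import seaborn as sns

# Hypothetical example data, used only for this sketch.
delays = pd.DataFrame({
    "carrier":   ["AA", "AA", "UA", "UA", "DL", "DL"],
    "dep_delay": [5, 12, 3, 20, 7, 1],
})

# seaborn (and matplotlib) figures display inline in the session output.
sns.barplot(x="carrier", y="dep_delay", data=delays)
```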
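Likewise, a minimal sketch of what the PySpark connectivity looks like from a Python 2 session; the application name and the sanity check are assumptions for illustration, not taken from the project script:

```python
from pyspark.sql import SparkSession

# Connect to (Kerberized) Spark in YARN client mode. CDSW sessions pick up the
# cluster's Spark configuration, so no extra wiring is needed here.
spark = SparkSession.builder \
    .master("yarn") \
    .appName("cdsw-tour-sketch") \
    .getOrCreate()

# Quick sanity check that work is actually running on the cluster.
print(spark.range(1000).count())

spark.stop()
```

The HDFS CLI mentioned above is available from the same session via a shell escape, e.g. `!hdfs dfs -ls /`.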
We recommend setting up a "Nightly Analysis" job to illustrate how data scientists can easily automate their projects.
Note: the following setup only needs to be done once.
- In a Python 2 session:

      ! pip2 install --upgrade dask keras matplotlib pandas_highcharts protobuf tensorflow seaborn

  Note: you must then stop the session and start a new Python 2 session for all of the packages to be picked up (an optional import check appears after this list).
- In an R session:

      install.packages('sparklyr')
      install.packages('plotly')
      install.packages('nycflights13')
      install.packages('Lahman')
      install.packages('mgcv')
      install.packages('shiny')
- Stop all sessions, then proceed.
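If you want to confirm the Python installs, start a new Python 2 session and run a quick import check; this is a minimal sketch that simply imports the packages installed above, using their published import names:

```python
# Confirm the newly installed Python packages are visible in the fresh session.
import dask
import keras
import matplotlib
import pandas_highcharts
import google.protobuf
import seaborn
import tensorflow

print(tensorflow.__version__)
```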