process.md

Tasks prep batch: create a new folder, in the same location, for each project.

  1. Ubuntu
     a. Install Ubuntu Server (on VirtualBox)
     b. Install Java 8
     c. Install Python 3
     d. Install sbt (optional)
     e. Set up SSH

  2. Netcat
     a. Send a message from the producer port (terminal)
     b. Receive the message on the consumer port (terminal)
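The netcat producer/consumer exchange can be mimicked in plain Python sockets; this is a hedged sketch, with the port number (9999) an arbitrary choice rather than anything the tasks specify:

```python
import socket
import threading

# Arbitrary local port, analogous to `nc -l 9999` / `nc localhost 9999`.
HOST, PORT = "127.0.0.1", 9999
ready = threading.Event()
received = []

def consumer():
    """Listen on the consumer port and record one received message."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as srv:
        srv.setsockopt(socket.SOL_SOCKET, socket.SO_REUSEADDR, 1)
        srv.bind((HOST, PORT))
        srv.listen(1)
        ready.set()  # tell the producer it is safe to connect
        conn, _ = srv.accept()
        with conn:
            received.append(conn.recv(1024).decode())

def producer(message):
    """Connect to the consumer port and send one message."""
    with socket.socket(socket.AF_INET, socket.SOCK_STREAM) as cli:
        cli.connect((HOST, PORT))
        cli.sendall(message.encode())

t = threading.Thread(target=consumer)
t.start()
ready.wait()
producer("hello from producer")
t.join()
print(received[0])  # hello from producer
```

The `Event` makes the producer wait until the consumer is actually listening, the same ordering the terminal exercise enforces by starting the `nc -l` side first.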

  3. Python Basic
     a. Complete map-reduce tasks in Python
     b. Store output data to CSV
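One possible shape for the Python map-reduce task is a word count; the input lines below are a stand-in for the real data set, which the checklist does not specify:

```python
import csv
from collections import Counter
from functools import reduce

# Sample input standing in for the real data set (an assumption).
lines = [
    "big data big pipelines",
    "data pipelines scale",
]

# Map: turn each line into (word, 1) pairs.
mapped = [[(word, 1) for word in line.split()] for line in lines]

# Reduce: merge all per-line pairs into one word -> count table.
def merge(acc, pairs):
    for word, n in pairs:
        acc[word] += n
    return acc

counts = reduce(merge, mapped, Counter())

# Store the output data to CSV.
with open("wordcount.csv", "w", newline="") as f:
    writer = csv.writer(f)
    writer.writerow(["word", "count"])
    for word, n in sorted(counts.items()):
        writer.writerow([word, n])

print(dict(counts))  # {'big': 2, 'data': 2, 'pipelines': 2, 'scale': 1}
```

The same map/merge/write structure carries over to the Scala and Java versions of the task.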

  4. Scala Basic
     a. Complete map-reduce tasks in Scala
     b. Store output data to CSV

  5. Java Basic
     a. Complete map-reduce tasks in Java
     b. Store output data to CSV

  6. Bash automation
     a. Automate tasks 1 to 4
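The checklist calls for Bash here; the equivalent orchestration can also be sketched in Python with `subprocess`. The script names below are hypothetical placeholders, not files the project defines:

```python
import subprocess

# Hypothetical wrapper scripts for tasks 1-4; adjust to the real layout.
TASK_SCRIPTS = [
    "task1_ubuntu_setup.sh",
    "task2_netcat_demo.sh",
    "task3_python_mapreduce.py",
    "task4_scala_mapreduce.sh",
]

def run_task(cmd):
    """Run one task command, returning (exit_code, stdout)."""
    result = subprocess.run(cmd, shell=True, capture_output=True, text=True)
    return result.returncode, result.stdout

# Demonstrated with a trivially safe command instead of the real scripts:
code, out = run_task("echo automation ok")
print(code, out.strip())  # 0 automation ok
```

Looping `run_task` over `TASK_SCRIPTS` and stopping on a non-zero exit code reproduces what a `set -e` Bash driver script would do.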

  7. Airflow
     a. Automate tasks 1 to 4

  8. Git
     a. Create git repo precursors
     b. Push tasks 1 to 6 to GitHub

  9. HDFS – install
     a. Install HDFS
     b. Create directories: data, tmp, and user
     c. Write installation notes – ptg

  10. Flume – install
      a. Install Flume
      b. Make sure the Flume agent is available everywhere

  11. Flume-1
      a. Flume reads from file/terminal
      b. Write to file

  12. Flume-2
      a. Flume reads from source 2
      b. Write to HDFS

  13. Kafka – install
      a. Install Kafka
      b. Make sure the Kafka binaries are available everywhere
      c. Start 3 Kafka servers

  14. Kafka – 0
      a. Fully automate Kafka server startup, based on the number of Kafka servers wanted
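The startup automation can begin by generating one broker config per server. This is a hedged sketch: the port and log-directory conventions below are assumptions, not Kafka defaults, and only `broker.id`, `listeners`, and `log.dirs` are shown out of the many `server.properties` settings a real broker needs:

```python
# Assumed convention: broker i listens on BASE_PORT + i and logs to its own dir.
BASE_PORT = 9092

def broker_config(broker_id):
    """Return a minimal per-broker server.properties fragment."""
    return "\n".join([
        f"broker.id={broker_id}",
        f"listeners=PLAINTEXT://:{BASE_PORT + broker_id}",
        f"log.dirs=/tmp/kafka-logs-{broker_id}",
    ])

def make_configs(n):
    """Return a dict of filename -> config text for n brokers."""
    return {f"server-{i}.properties": broker_config(i) for i in range(n)}

configs = make_configs(3)
for name, text in sorted(configs.items()):
    print(f"# {name}")
    print(text)
```

The startup script would then write each fragment to disk and launch `kafka-server-start.sh` once per generated file.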

  15. Kafka – 1 – py
      a. Kafka producer in Python
      b. Kafka consumer in Python
      c. Read from the Twitter API, write to terminal
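A hedged sketch of the Python producer/consumer pair, assuming the `kafka-python` package (`pip install kafka-python`); the broker address and topic name are assumptions, and the `kafka` imports are kept inside the functions so the module loads even where the package is absent:

```python
BROKER = "localhost:9092"  # assumed broker address
TOPIC = "tweets"           # assumed topic name

def make_producer():
    from kafka import KafkaProducer  # lazy import: needs kafka-python installed
    return KafkaProducer(bootstrap_servers=BROKER)

def send_messages(producer, messages):
    """Publish each message to the topic as UTF-8 bytes."""
    for msg in messages:
        producer.send(TOPIC, value=msg.encode("utf-8"))
    producer.flush()

def consume_forever():
    """Print every message on the topic to the terminal (blocks)."""
    from kafka import KafkaConsumer  # lazy import, as above
    consumer = KafkaConsumer(TOPIC, bootstrap_servers=BROKER,
                             auto_offset_reset="earliest")
    for record in consumer:
        print(record.value.decode("utf-8"))
```

Reading from the Twitter API needs credentials and a client library, so it is left out; the fetched tweets would simply be fed to `send_messages()`. The same producer/consumer shape carries over to the Scala version.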

  16. Kafka – 1 – scala
      a. Kafka producer in Scala
      b. Kafka consumer in Scala
      c. Read from the Twitter API, write to terminal

  17. Hive – Hortonworks
      a. Load employees data into Hive
      b. Full CRUD operations
      c. Hive queries

  18. MySQL – Hortonworks
      a. Load employees data into MySQL
      b. Full CRUD operations
      c. MySQL queries

  19. Sqoop – Hortonworks
      a. Sqoop CSV to MySQL
      b. Sqoop MySQL to Hive
      c. Sqoop Hive files to MySQL

  20. Hive – local
      a. Install Hive
      b. Load employees data into Hive
      c. Full CRUD operations
      d. Hive queries

  21. HBase – Cloudera
      a. Load employees data into HBase
      b. Full CRUD operations
      c. HBase queries

  22. HBase – local
      a. Install HBase
      b. Load employees data into HBase
      c. Full CRUD operations
      d. HBase queries

  23. Spark (Hortonworks) – Scala
      a. Read/write text file
      b. Read/write CSV file
      c. Read/write JSON file
      d. Map-reduce to Hive

  24. Spark (Hortonworks) – Python
      a. Read/write text file
      b. Read/write CSV file
      c. Read/write JSON file
      d. Map-reduce to Hive

  25. Kafka – Spark
      a. Stream data from the Kafka producer API
      b. Set up the Spark consumer

  26. Kafka – Spark – Hive
      a. Stream data from the Kafka producer API
      b. Set up the Spark consumer
      c. Write to Hive

  27. Kafka – Spark – HBase
      a. Stream data from the Kafka producer API
      b. Set up the Spark consumer
      c. Write to HBase

  28. Capstone Project
      a. Complete the requested project