##Hadoop Tutorials
This repo contains a set of tutorials for Hadoop designed to work inside or alongside the Hortonworks Sandbox. Sandbox is a single-node Hadoop cluster that runs in a Virtual Machine (Virtual Box, VMWare, Hyper-V).
Please feel free to recommend edits to existing tutorials, and contribute your own. New contributions should be placed in the 'Community' folder.
Current tutorials are as follows:
###Sandbox Tutorials (Embedded in Sandbox and tested with v2.0)
- 1 - Hello World - Using Sandbox with Hive and Pig
- 2 - Data Processing with Pig - Processing Baseball Stats
- 3 - Data Processing with Hive - Processing Baseball Stats
- 4 - HCatalog, Basic Pig and Hive Commands
- 5 - Using Basic Pig Commands
- 6 - Loading Data into the Hadoop
- 7 - Installing the Hortonworks ODBC driver on Windows 7
- 8 - Using Excel 2013 to Access Hadoop Data
- 9 - Using Excel 2013 to Analyze Hadoop Data
- 10 - Visualizing Website Clickstream Data
- 11 - Installing the Hortonworks ODBC driver on Mac OS X
- 12 - Refining and Visualizing Server Log Data
- 13 - Refining and Visualizing Sentiment Data
- 14 - Analyzing Machine and Sensor Data
###Community Tutorials
Feel free to contribute tutorials and help the community harness Hadoop!
- 1 - Using RHadoop to Predict Web Visitors from mwacc
- 2 - Using Spring XD to stream Tweets to Hadoop for Sentiment Analysis from mehzer
- 3 - Word Counting with Pig from flacrosse
- 4 - Import from Microsoft SQL Server into the Hortonworks Sandbox using Sqoop from flacrosse
- 5 - Use HDFS Explorer to Manage Files on Hortonworks Sandbox from RedGate
- 6 - Java-Hive Integration - Salary Delta Analysis and Visualization Using Hive, Java and JFreeChart from vivganes
- 7 - elasticsearch-Hadoop integration from yannbrrd
- 8 - Java/HBase Integration from vivganes