Loading…
DataWeek Conference + Expo has ended
Back To Schedule
Monday, September 15 • 10:00am - 12:00pm
Apache Pig Workshop

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Apache Pig is a high level data flow language for Hadoop eco system.
Pig facilitates defining simple to complex workflows that can operate
on data sizes ranging from gigabytes  to petabytes.   The simplicity
of Pig scripting is big plus compared to Java Map Reduce.  Pig was
oringally developed at Yahoo; now Pig is heavily used by companies
like Netflix, LInkedin and Yahoo.

This workshop will introduce Apache Pig to students.   We will go
through the Pig concepts and learn Pig Latin language.  Students will
learn by working on hands-on labs using Hadoop and Pig.  The workshop
will focus on solving practical, real world problems (no toy labs)

This is a HANDS-ON workshop.   Estimated run time 2 hrs.

Note to attendees:
Attendees *must* have a working Apache Hadoop + Pig environment
pre-installed on their laptop.  We recommend using Hadoop virtual
machines offered by Cloudera or HortonWorks.  Since these are *BIG*
downloads, please download and install them well in advance.

Cloudera VM : http://www.cloudera.com/content/support/en/downloads/quickstart_vms/cdh-5-1-x1.html
Hortonworks VM : http://hortonworks.com/products/hortonworks-sandbox/

Speakers
SM

Sujee Maniyam

Founder / Principal, ElephantScale
Sujee has been developing software for 15 years. He is a hands-on expert on Hadoop, NoSQL and Cloud technologies. He consults and teaches Big Data technologies. Sujee has authored a few open source projects and has contributed to Hadoop project. He is an author of open source Hadoop... Read More →


Monday September 15, 2014 10:00am - 12:00pm PDT
Hotel Kabuki

Attendees (0)