May 28, 2015 – DAMA Day: Demystifying Big Data

Click Here to Register for this Event

LOCATION:

Pointe Hilton Squaw Peak

7677 N 16th St. Phoenix, AZ 85020

DATE:

Thursday May 28, 2015

7:30 AM: Registration Opens

7:45 AM: Light Breakfast

8:15 AM: Welcome to DAMA Day

8:30 AM: Introduction to Demystifying Big Data – David Marco

8:45 AM: “Understanding the Modern Data Architecture” – Mike Lamble

10:00 AM: Synopsis of Big Data Technologies – David Marco

10:30 AM: Big Data Technologies Part 1 – Lynn Hedegard

11:30 AM: Lunch

12:30 PM: Afternoon Opening Remarks

12:35 PM: Big Data Technologies Part 2 – Lynn Hedegard

1:30 PM: “Demystifying Big Data” – David Marco

3:45 PM: Panel – Big Data Examples Local Practioners

4:45 PM: Session Closing Remarks – David Marco

5:15 PM: DAMA Day Meeting Close

5:30 PM: No Host Happy Hour Networking

 

DavidMarco
David Marco, President EWSolutions

LynneHedegardLynn Hedegard, IBM

MikeLamble

Mike Lamble, CEO Clarity Solutions

Map & Directions

Thanks to our Silver Sponsor ….StateFarm

SESSION SUMMARY    (Click for Session PDF)

Demystifying Big Data

  • Level set on Big Data technologies
  • Examples of Big data solutions
  • Understanding the Modern Data Architecture
  • Local Big Data examples / discussions 

Big Data Technologies

This session is intended to provide an overview of the Hadoop environment from the ground up.  We will probe why Data Lakes became such a popular topic and why they are getting a bad rap recently from the industry analysts.  We will discuss a couple of use cases and how organizations are able to successfully introduce Hadoop into their current environments.  Finally we will wrap up discussions with what the recent announcement of the Open Data Platform means and where we see this leading with respect to businesses, vendors, and architectures.
  • Why Hadoop?
    • Why HDFS?/Why MapReduce?/Why schema on read?
  • Why Spark?
    • What is TEZ?
  • Why Yarn?
    • What is MESOS and Platform Symphony?
  • Data manipulation
    • Pig/Flume/Python/Scala/Java
    • ETL tools
    • Analytics
    • Text analytics
    • Streams/Storm/Spark
  • Popular data storage options in Hadoop
    • JSON/Parquet/Avro
  • SQL and NoSQL
    • Hive/Hbase/HCatalog
    • Impala/Stinger/Big SQL/SQL-H/HAWQ
    • Cassandra/mongoDB/etc
  • Data Lakes and Governance
  • Apache and ODP
  • Use Cases
  • Where is this all going
    • Architecture perspective
      • z/Linux
      • clouds
    • Vendor perspective
    • Business perspective

SPEAKERS BIOGRAPHIES

David Marco – President EW Solutions

Best known as the world’s foremost authority on meta data management, he is an internationally recognized expert in the fields of data warehousing, data governance and enterprise information management. In 2004 David Marco was named the “Melvil Dewey of Metadata” by Crain’s Chicago Business as he was selected to their very prestigious “Top 40 Under 40” list. He followed up this honor in 2007 by being named one of DePaul University’s “Top 14 Alumni Under 40”. In 2008 David Marco was enshrined into the very select DAMA Data Management Hall of Fame (Professional Achievement Award).

Lynn Hedegard – Technical Specialist, IBM

Lynn Hedegard has been involved in the Big Data / Data Warehouse / Business Intelligence industry for over 35 years, contributing to numerous DW implementations in both the commercial sector as well as State & Federal government organizations.  Lynn’s areas of expertise include Big Data, Streams, Data Science, Enterprise Architecture, Data Warehousing, Business Intelligence, MetaData Management, Master Data Management, High Availability, Fault Tolerance, Disaster Recovery, Event Processing, Data Mining, Quantitative Analytics, and Business Automation.
Lynn joined IBM in October of 2012, where he works as a Technical Specialist supporting IBM sales teams, and consults with IBM clients and prospects.  Activities include; business discovery & analysis, development of proposals, project planning, project scoping, and project management.
Lynn holds a patent for; “Automated Application Fail-Over for Coordinating Applications with DBMS Availability”; United States US6202149 B1.  Lynn earned a bachelor’s degree in Computer Science from Michigan Technological University in 1978.

Mike LambleCEO Clarity Solution Group

Mike Lamble is CEO of Clarity Solution Group, the largest on-shore data and analytics consulting company in the US and a veteran leader in the field of enterprise-class data and analytic solutions. Prior to joining Clarity Solution Group, Mike was president of XtremeData, providers of massively scalable DBMS. Previously, he was a practice leader in IBM’s Business Analytics and Optimization business unit. Mr. Lamble also served as Vice President of Worldwide Services for Greenplum and held executive management positions with Knightsbridge Solutions, which was acquired by HP in 2007. Mike earned his MBA at the University of Chicago and has a BA in Economics from Columbia University.