Programming Hive

Edward Capriolo, Dean Wampler

Need to move a relational database application to Hadoop? This comprehensive guide introduces you to Apache Hive, Hadoop’s data warehouse infrastructure. You’ll quickly learn how to use Hive’s SQL dialect, HiveQL, to summarize, query, and analyze large datasets stored in Hadoop’s distributed filesystem.

This example-driven guide shows you how to set up and configure Hive in your environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates how Hive works within the Hadoop ecosystem. You’ll also find real-world case studies that describe how companies have used Hive to solve specific problems involving petabytes of data.

  • Use Hive to create, alter, and drop databases, tables, views, functions, and indexes
  • Customize data formats and storage options, from files to external databases
  • Load and extract data from tables, and use queries, grouping, filtering, joining, and other conventional query methods
  • Gain best practices for creating user-defined functions (UDFs)
  • Learn Hive patterns you should use and anti-patterns you should avoid
  • Integrate Hive with other data processing programs
  • Use storage handlers for NoSQL databases and other datastores
  • Learn the pros and cons of running Hive on Amazon’s Elastic MapReduce
