Programming Hive

Data Warehouse and Query Language for Hadoop

Nonfiction, Computers, Internet, Web Development, Java, Programming, Programming Languages
Cover of the book Programming Hive by Edward Capriolo, Dean Wampler, Jason Rutherglen, O'Reilly Media
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart
Author: Edward Capriolo, Dean Wampler, Jason Rutherglen ISBN: 9781449326975
Publisher: O'Reilly Media Publication: September 19, 2012
Imprint: O'Reilly Media Language: English
Author: Edward Capriolo, Dean Wampler, Jason Rutherglen
ISBN: 9781449326975
Publisher: O'Reilly Media
Publication: September 19, 2012
Imprint: O'Reilly Media
Language: English

Need to move a relational database application to Hadoop? This comprehensive guide introduces you to Apache Hive, Hadoop’s data warehouse infrastructure. You’ll quickly learn how to use Hive’s SQL dialect—HiveQL—to summarize, query, and analyze large datasets stored in Hadoop’s distributed filesystem.

This example-driven guide shows you how to set up and configure Hive in your environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates how Hive works within the Hadoop ecosystem. You’ll also find real-world case studies that describe how companies have used Hive to solve unique problems involving petabytes of data.

  • Use Hive to create, alter, and drop databases, tables, views, functions, and indexes
  • Customize data formats and storage options, from files to external databases
  • Load and extract data from tables—and use queries, grouping, filtering, joining, and other conventional query methods
  • Gain best practices for creating user defined functions (UDFs)
  • Learn Hive patterns you should use and anti-patterns you should avoid
  • Integrate Hive with other data processing programs
  • Use storage handlers for NoSQL databases and other datastores
  • Learn the pros and cons of running Hive on Amazon’s Elastic MapReduce
View on Amazon View on AbeBooks View on Kobo View on B.Depository View on eBay View on Walmart

Need to move a relational database application to Hadoop? This comprehensive guide introduces you to Apache Hive, Hadoop’s data warehouse infrastructure. You’ll quickly learn how to use Hive’s SQL dialect—HiveQL—to summarize, query, and analyze large datasets stored in Hadoop’s distributed filesystem.

This example-driven guide shows you how to set up and configure Hive in your environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates how Hive works within the Hadoop ecosystem. You’ll also find real-world case studies that describe how companies have used Hive to solve unique problems involving petabytes of data.

More books from O'Reilly Media

Cover of the book Client-Server Web Apps with JavaScript and Java by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book Intellectual Property and Open Source by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book Design and Prototyping for Drupal by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book Windows Server 2012: Up and Running by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book CSS Pocket Reference by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book Managing RAID on Linux by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book Production-Ready Microservices by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book Blockchain by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book Think Perl 6 by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book The Canon EOS Digital Rebel XS/1000D Companion by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book Publishing with iBooks Author by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book Open Sources 2.0 by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book Natural Language Processing with Python by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book Kindle Fire: Out of the Box by Edward Capriolo, Dean Wampler, Jason Rutherglen
Cover of the book ActionScript 3.0 Programming: Overview, Getting Started, and Examples of New Concepts by Edward Capriolo, Dean Wampler, Jason Rutherglen
We use our own "cookies" and third party cookies to improve services and to see statistical information. By using this website, you agree to our Privacy Policy