Apache Sqoop Cookbook

by Jarek Jarcec Cecho
94 Pages · 2013 · 2.4 MB · 3,202 Downloads · New!
Actionable Intelligence
by Clifford Siegel
224 Pages · 2014 · 7.5 MB · 1,042 Downloads · New!
Actionable Intelligence: A Guide to Delivering Business Results with Big Data Fast! is a comprehensive guide to achieving the goal that business intelligence practitioners have been chasing since the concept came into being. Written by an IT visionary with extensive global supply chain experience and insight, this book describes what happens when team members have accurate, reliable, usable, and timely information at their fingertips. With a focus on leveraging big data, the book provides expert guidance on developing an analytical ecosystem to effectively manage and use internal and external information to deliver business results.
Agile Data Science
by Russell Jurney
178 Pages · 2013 · 11.5 MB · 4,183 Downloads · New!
Mining big data requires a deep investment in people and time. How can you be sure you’re building the right models? With this hands-on book, you’ll learn a flexible toolset and methodology for building effective analytics applications with Hadoop.
Apache Accumulo for Developers
by Guðmundur Jón Halldórsson
120 Pages · 2013 · 4.8 MB · 4,244 Downloads · New!
Accumulo is a sorted, distributed key/value store designed to handle large amounts of data. It is highly robust and scalable, and its performance makes it well suited to real-time data storage. Apache Accumulo is based on Google’s Bigtable design and is built on top of Apache Hadoop, ZooKeeper, and Thrift.
Apache Hadoop YARN
by Arun C. Murthy
400 Pages · 2014 · 7.4 MB · 1,523 Downloads · New!
Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. In Apache Hadoop YARN, two Hadoop technical leaders show you how to develop new applications and adapt existing code to fully leverage these advances.
Apache Hive Essentials
by Dayong Du
313 Pages · 2015 · 1.8 MB · 1,994 Downloads · New!
In this book, we prepare you for your journey into big data by first introducing the background of the big data domain, along with the process of setting up and getting familiar with your Hive working environment. Next, the book guides you through discovering and transforming the value of big data with the help of examples. It also hones your skill in using the Hive language efficiently. Towards the end, the book focuses on advanced topics such as performance, security, and extensions in Hive.
Apache Oozie Essentials
by Jagat Jasjit Singh
175 Pages · 2016 · 7.14 MB · 4,819 Downloads · New!
As more organizations discover the uses of big data analytics, interest in platforms that provide storage, computation, and analytic capabilities is growing rapidly. This calls for data management, a need that Hadoop caters to. Oozie fills the need for a scheduler for Hadoop jobs, acting like cron so that data can be analyzed more effectively.