Faster previews. Personalized experience. Get started with a FREE account.
Apache Oozie Essentials

Apache Oozie Essentials

by Jagat Jasjit Singh
175 Pages · 2016 · 7.14 MB · 4,819 Downloads · New!
Scraped from this link
" Happiness doesn't result from what we get, but from what we give. ” ― Ben Carson
Actionable Intelligence
by Clifford Siegel
224 Pages · 2014 · 7.5 MB · 1,042 Downloads · New!
Actionable Intelligence: A Guide to Delivering Business Results with Big Data Fast! is the comprehensive guide to achieving the dream that business intelligence practitioners have been chasing since the concept itself came into being. Written by an IT visionary with extensive global supply chain experience and insight, this book describes what happens when team members have accurate, reliable, usable, and timely information at their fingertips. With a focus on leveraging big data, the book provides expert guidance on developing an analytical ecosystem to effectively manage, use the internal and external information to deliver business results.
Agile Data Science
by Russell Jurney
178 Pages · 2013 · 11.5 MB · 4,183 Downloads · New!
Mining big data requires a deep investment in people and time. How can you be sure you’re building the right models? With this hands-on book, you’ll learn a flexible toolset and methodology for building effective analytics applications with Hadoop.
Apache Accumulo for Developers
by Guomundur Jon Halldorsson
120 Pages · 2013 · 4.8 MB · 4,244 Downloads · New!
Accumulo is a sorted and distributed key/value store designed to handle large amounts of data. Being highly robust and scalable, its performance makes it ideal for real-time data storage. Apache Accumulo is based on Google’s BigTable design and is built on top of Apache Hadoop, Zookeeper, and Thrift.
Apache Hadoop YARN
by Arun C. Murthy
400 Pages · 2014 · 7.4 MB · 1,523 Downloads · New!
Apache Hadoop is helping drive the Big Data revolution. Now, its data processing has been completely overhauled: Apache Hadoop YARN provides resource management at data center scale and easier ways to create distributed applications that process petabytes of data. And now in Apache Hadoop YARN, two Hadoop technical leaders show you how to develop new applications and adapt existing code to fully leverage these revolutionary advances.
Apache Hive Essentials
by Dayong Du
313 Pages · 2015 · 1.8 MB · 1,994 Downloads · New!
In this book, we prepare you for your journey into big data by firstly introducing you to backgrounds in the big data domain along with the process of setting up and getting familiar with your Hive working environment. Next, the book guides you through discovering and transforming the values of big data with the help of examples. It also hones your skill in using the Hive language in an efficient manner. Towards the end, the book focuses on advanced topics such as performance, security, and extensions in Hive, which will guide you on exciting adventures on this worthwhile big data journey.
Apache Sqoop Cookbook
by Jarek Jarcec Cecho
94 Pages · 2013 · 2.4 MB · 3,202 Downloads · New!
Integrating data from multiple sources is essential in the age of big data, but it can be a challenging and time-consuming task. This handy cookbook provides dozens of ready-to-use recipes for using Apache Sqoop, the command-line interface application that optimizes data transfers between relational databases and Hadoop.