Hadoop in Practice, Second Edition
The first edition of my book went to press in November 2012, just over a year ago! That’s not long, but in Hadoop years it’s a generation. There have been many exciting developments in Hadoop and its ecosystem since then, especially YARN and its promise of a general-purpose distributed platform that can support computing models beyond MapReduce.
I’m excited to announce that I’ve started work on the second edition of the book, which will bring the existing coverage up to date and add new chapters on topics such as:
- An overview of YARN and how it works
- How MapReduce 2 works as a YARN application
- Recipes for writing your own YARN applications
- Pulling data out of Kafka into HDFS
- Running Storm on YARN and using it to perform aggregations
- Using Spark for in-memory, iterative data processing
The book is currently in MEAP, which is Manning’s early access program. The benefit of this program is that you get new content as it’s being written, and at the end you’ll get the full production-polished version of the book.
I welcome any suggestions or ideas for improving the book; please post them on the forum.
About the author
Alex Holmes works on tough big-data problems. He is a software engineer, author, speaker, and blogger specializing in large-scale Hadoop projects. He is the author of Hadoop in Practice, a book published by Manning Publications. He has presented multiple times at JavaOne, and is a JavaOne Rock Star.
If you want to see what Alex is up to, you can check out his work on GitHub or follow him on Twitter or Google+.