Big Data: Principles and best practices of scalable realtime data systems

Big Data: Principles and best practices of scalable realtime data systems Author Nathan Marz
ISBN-10 1617290343
ISBN-13 9781617290343
Year 2015-05-10
Pages 328
Language English
Publisher Manning Publications
Download Link Click Here


SummaryBig Data teaches you to build big data systems using an architecture that takes advantage of clustered hardware along with new tools designed specifically to capture and analyze web scale data It describes a scalable easy to understand approach to big data systems that can be built and run by a small team Following a realistic example this book guides readers through the theory of big data systems how to implement them in practice and how to deploy and operate them once they re built Purchase of the print book includes a free eBook in PDF Kindle and ePub formats from Manning Publications About the BookWeb scale applications like social networks real time analytics or e commerce sites deal with a lot of data whose volume and velocity exceed the limits of traditional database systems These applications require architectures built around clusters of machines to store and process data of any size or speed Fortunately scale and simplicity are not mutually exclusive Big Data teaches you to build big data systems using an architecture designed specifically to capture and analyze web scale data This book presents the Lambda Architecture a scalable easy to understand approach that can be built and run by a small team You ll explore the theory of big data systems and how to implement them in practice In addition to discovering a general framework for processing big data you ll learn specific technologies like Hadoop Storm and NoSQL databases This book requires no previous exposure to large scale data analysis or NoSQL tools Familiarity with traditional databases is helpful What s InsideIntroduction to big data systemsReal time processing of web scale dataTools like Hadoop Cassandra and StormExtensions to traditional database skillsAbout the AuthorsNathan Marz is the creator of Apache Storm and the originator of the Lambda Architecture for big data systems James Warren is an analytics architect with a background in machine learning and scientific computing Table of ContentsA new paradigm for Big DataPART 1 BATCH LAYERData model for Big DataData model for Big Data IllustrationData storage on the batch layerData storage on the batch layer IllustrationBatch layerBatch layer IllustrationAn example batch layer Architecture and algorithmsAn example batch layer ImplementationPART 2 SERVING LAYERServing layerServing layer IllustrationPART 3 SPEED LAYERRealtime viewsRealtime views IllustrationQueuing and stream processingQueuing and stream processing IllustrationMicro batch stream processingMicro batch stream processing IllustrationLambda Architecture in depth

Big Data

Big Data Author Nathan Marz
ISBN-10 1617290343
Release 2015
Pages 328