Nathan Marz Books

1 book · ~10 min total read

Nathan Marz is a software engineer known for creating Apache Storm, a distributed real-time computation system. He has worked at BackType and Twitter, where he developed large-scale data processing frameworks.

Known for: Big Data: Principles and Best Practices of Scalable Real-Time Data Systems

Books by Nathan Marz

Big Data: Principles and Best Practices of Scalable Real-Time Data Systems

Data science · 10 min read

This book provides a comprehensive guide to building scalable real-time data systems using big data technologies. It introduces the Lambda Architecture, a design pattern for processing massive quantities of data by leveraging both batch and real-time processing. The author explains principles for reliability, scalability, and maintainability, offering practical insights for engineers and architects working with distributed systems.

Key Insights from Nathan Marz

1

The Lambda Architecture: Foundations for Scalable Real-Time Systems

At the heart of this book lies the Lambda Architecture — my response to the contradictions of real-time big data processing. Traditional systems forced an impossible choice: either process in batch for correctness or in real time for speed. But with the Lambda Architecture, I propose that we can hav...

From Big Data: Principles and Best Practices of Scalable Real-Time Data Systems
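
As a rough illustration of the idea that a query can draw on both batch and real-time processing, here is a minimal Python sketch. It assumes a simple page-view counting use case. The names batch_view, realtime_view, and query_pageviews, and the sample numbers, are hypothetical and do not come from the book; a real system would keep these views in dedicated stores rather than in-memory counters.

```python
from collections import Counter

# Hypothetical batch view: page-view counts precomputed from all data
# available at the time of the last batch run.
batch_view = Counter({"/home": 100, "/about": 40})

# Hypothetical realtime view: counts for events that arrived after
# the last batch run and are not yet reflected in the batch view.
realtime_view = Counter({"/home": 3, "/pricing": 1})


def query_pageviews(url: str) -> int:
    """Answer a query by combining the batch and realtime views."""
    # Counter returns 0 for missing keys, so unseen URLs are handled.
    return batch_view[url] + realtime_view[url]


if __name__ == "__main__":
    for url in ("/home", "/about", "/pricing"):
        print(url, query_pageviews(url))
```

In this sketch the batch view supplies the bulk of the answer and the realtime view covers only the most recent events, so a query reflects both.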

2

Inside the Batch and Serving Layers: Building the Foundation of Truth

The batch layer, in my experience, is the intellectual core of the Lambda Architecture. It holds the immutable master dataset — a complete, append-only record of everything that has happened. By keeping this dataset uncorrupted by updates or deletions, we remove the complexity inherent in most mutab...

From Big Data: Principles and Best Practices of Scalable Real-Time Data Systems
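
The append-only master dataset described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the PageView record, the MasterDataset class, and compute_batch_view are hypothetical names, and an in-memory list stands in for what would normally be distributed storage.

```python
from collections import Counter
from dataclasses import dataclass


@dataclass(frozen=True)  # frozen: a record cannot be modified once created
class PageView:
    url: str
    timestamp: int


class MasterDataset:
    """Hypothetical append-only store: records are added, never updated or deleted."""

    def __init__(self) -> None:
        self._records: list[PageView] = []

    def append(self, record: PageView) -> None:
        self._records.append(record)

    def records(self) -> tuple[PageView, ...]:
        # Return a tuple so callers cannot mutate the underlying list.
        return tuple(self._records)


def compute_batch_view(dataset: MasterDataset) -> Counter:
    """Recompute the view from scratch as a function of all records."""
    return Counter(record.url for record in dataset.records())


if __name__ == "__main__":
    master = MasterDataset()
    master.append(PageView("/home", 1))
    master.append(PageView("/home", 2))
    master.append(PageView("/about", 3))
    print(compute_batch_view(master))
```

Because the view is recomputed from the full set of records, a mistake in the view logic can be corrected by fixing the function and running it again over the same data.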

About Nathan Marz

Nathan Marz is a software engineer known for creating Apache Storm, a distributed real-time computation system. He has worked at BackType and Twitter, where he developed large-scale data processing frameworks. His work focuses on building robust and scalable data architectures for real-time analytics.
