LIBRISTO
LIBROAMANTO
povinné
Staňte se součástí komunity milovníků knih z celého světa a získejte hromadu výhod. Založit účet zdarma
0
Doprava zdarma se Zásilkovnou nad 1 499 Kč
Kurýr DPD 69 PPL shop 49 Balíkovna 69 PPL kurýr 74 PPL box 39 Balíkovna 49 Výdejní místo DPD 49 Zásilkovna 39

Doprava zdarma při nákupu nad 1 499 Kč přes Zásilkovnu nebo PPL Box.

Data Engineering & Analytics

Parallel Data Pipelines in Production: Design and Scale Real-Time Data Systems Using Modern Distributed Frameworks

Jazyk AngličtinaAngličtina
Kniha Brožovaná
Kniha Data Engineering & Analytics M.T Holbrook
Libristo kód: 52288924
Nakladatelství Independently published, květen 2026
Build production-grade data pipelines that scale - from your first ETL workflow to distributed syste... Celý popis
? points 93 b Připravujeme Připravujeme Nové Nové
925
Očekávané naskladnění Naskladnění 10. 05. 2026

30 dní na vrácení zboží

Build production-grade data pipelines that scale - from your first ETL workflow to distributed systems handling real-time data under production pressure
Data engineering isn't about scripts that work once. It's about systems that process massive volumes of data continuously, survive failures, and deliver results when it matters. As datasets grow and systems become distributed, the real challenge is no longer writing code - it's designing pipelines that scale, perform, and remain reliable in production.

This book takes you from zero to production-ready data systems with a practical, no-nonsense approach. You'll start by understanding how distributed processing actually works - why single machines fail at scale, and how parallelism, latency, and throughput define system performance. Then you'll build a complete pipeline from scratch, implementing extraction, transformation, and loading while adding logging, monitoring, and debugging practices used in real-world systems.
As your pipeline grows, you'll move beyond basics into the problems that break most systems. You'll learn how to partition large datasets correctly, eliminate bottlenecks caused by skewed data, and process streaming data in real time. You'll integrate message brokers to decouple services and build pipelines that don't collapse under load.

You'll design systems that tolerate failure by default, implement checkpointing and recovery mechanisms, and optimise performance using profiling and resource tuning. Security is treated as a core requirement, not an afterthought, with practical approaches to encryption, access control, and audit logging.
You'll then step into operating data systems at scale - building monitoring and observability pipelines, setting up alerting, managing infrastructure costs, and testing systems under real-world conditions. The book concludes with deployment strategies using CI/CD, zero-downtime updates, and advanced architectures like Lambda, Kappa, and event-driven systems used in modern data platforms.

Key Features

  • Build scalable data pipelines using parallel and distributed processing for both batch and real-time systems
  • Design high-performance pipelines with efficient partitioning, resource optimisation, and bottleneck elimination
  • Implement production-grade reliability with fault tolerance, monitoring, logging, and secure data handling
What you will learn
  • Understand how distributed data systems work and why scalability, latency, and throughput matter
  • Build end-to-end ETL pipelines with logging, monitoring, and debugging built in from the start
  • Design partitioning strategies that prevent data skew and maximise parallel performance
  • Process real-time data streams using event-time semantics, windowing, and aggregation techniques
  • Integrate message brokers to decouple systems and handle high-throughput data flows and more
Who this book is for
This book is for developers and engineers who want to build serious data systems - not demos. You should be comfortable writing code and understand basic data processing concepts.
If you've built pipelines that work locally but break at scale, this book will show you how to fix that. Backend developers moving into data engineering, data analysts stepping into engineering roles, and DevOps engineers managing data infrastructure will find this especially valuable.

No prior experience with distributed systems is required, but this is not a beginner's walkthrough. It's a practical guide for engineers who want to build systems that actually run in production - reliably, efficiently, and at scale.

Herečka & Polyglotka
EWA KASP pro
Přehrát video
Ewa Kasp
Libristo má největší výběr cizojazyčné literatury. Proto své knihy kupuji tady.

Informace o knize

Plný název Data Engineering & Analytics
Autor M.T Holbrook
Jazyk Angličtina
Vazba Kniha - Brožovaná
Datum vydání 2026
Počet stran 374
EAN 9798195654290
Libristo kód 52288924
Nakladatelství Independently published
Váha 867
Rozměry 216 x 280 x 20
Darujte tuto knihu ještě dnes
Je to snadné
1 Přidejte knihu do košíku a zvolte doručit jako dárek 2 Obratem vám zašleme poukaz 3 Kniha dorazí na adresu obdarovaného

Přihlášení

Přihlaste se ke svému účtu. Ještě nemáte Libristo účet? Vytvořte si ho nyní!

 
povinné
povinné

Nemáte účet? Získejte výhody Libristo účtu!

Díky Libristo účtu budete mít vše pod kontrolou.

Vytvořit Libristo účet