Sign Up for Fishpond's Best Deals Delivered to You Every Day
Go
Modern Data Architectures ­with Python
A practical guide to building and deploying data pipelines, data warehouses, and data lakes with Python

Rating
Format
Paperback, 318 pages
Published
United Kingdom, 1 September 2023

Build scalable and reliable data ecosystems using Data Mesh, Databricks Spark, and Kafka

Key Features

Develop modern data skills used in emerging technologies
Learn pragmatic design methodologies such as Data Mesh and data lakehouses
Gain a deeper understanding of data governance
Purchase of the print or Kindle book includes a free PDF eBook

Book DescriptionModern Data Architectures with Python will teach you how to seamlessly incorporate your machine learning and data science work streams into your open data platforms. You’ll learn how to take your data and create open lakehouses that work with any technology using tried-and-true techniques, including the medallion architecture and Delta Lake.
Starting with the fundamentals, this book will help you build pipelines on Databricks, an open data platform, using SQL and Python. You’ll gain an understanding of notebooks and applications written in Python using standard software engineering tools such as git, pre-commit, Jenkins, and Github. Next, you’ll delve into streaming and batch-based data processing using Apache Spark and Confluent Kafka. As you advance, you’ll learn how to deploy your resources using infrastructure as code and how to automate your workflows and code development. Since any data platform's ability to handle and work with AI and ML is a vital component, you’ll also explore the basics of ML and how to work with modern MLOps tooling. Finally, you’ll get hands-on experience with Apache Spark, one of the key data technologies in today’s market.
By the end of this book, you’ll have amassed a wealth of practical and theoretical knowledge to build, manage, orchestrate, and architect your data ecosystems.What you will learn

Understand data patterns including delta architecture
Discover how to increase performance with Spark internals
Find out how to design critical data diagrams
Explore MLOps with tools such as AutoML and MLflow
Get to grips with building data products in a data mesh
Discover data governance and build confidence in your data
Introduce data visualizations and dashboards into your data practice

Who this book is forThis book is for developers, analytics engineers, and managers looking to further develop a data ecosystem within their organization. While they’re not prerequisites, basic knowledge of Python and prior experience with data will help you to read and follow along with the examples.

Show more

Our Price
$102
Ships from UK Estimated delivery date: 18th Apr - 25th Apr from UK
  Include FREE SHIPPING on a Fishpond Premium Trial

Already Own It? Sell Yours
Buy Together
+
Buy together with The Data Wrangling Workshop, Second Edition at a great price!
Buy Together
$178.52

Product Description

Build scalable and reliable data ecosystems using Data Mesh, Databricks Spark, and Kafka

Key Features

Develop modern data skills used in emerging technologies
Learn pragmatic design methodologies such as Data Mesh and data lakehouses
Gain a deeper understanding of data governance
Purchase of the print or Kindle book includes a free PDF eBook

Book DescriptionModern Data Architectures with Python will teach you how to seamlessly incorporate your machine learning and data science work streams into your open data platforms. You’ll learn how to take your data and create open lakehouses that work with any technology using tried-and-true techniques, including the medallion architecture and Delta Lake.
Starting with the fundamentals, this book will help you build pipelines on Databricks, an open data platform, using SQL and Python. You’ll gain an understanding of notebooks and applications written in Python using standard software engineering tools such as git, pre-commit, Jenkins, and Github. Next, you’ll delve into streaming and batch-based data processing using Apache Spark and Confluent Kafka. As you advance, you’ll learn how to deploy your resources using infrastructure as code and how to automate your workflows and code development. Since any data platform's ability to handle and work with AI and ML is a vital component, you’ll also explore the basics of ML and how to work with modern MLOps tooling. Finally, you’ll get hands-on experience with Apache Spark, one of the key data technologies in today’s market.
By the end of this book, you’ll have amassed a wealth of practical and theoretical knowledge to build, manage, orchestrate, and architect your data ecosystems.What you will learn

Understand data patterns including delta architecture
Discover how to increase performance with Spark internals
Find out how to design critical data diagrams
Explore MLOps with tools such as AutoML and MLflow
Get to grips with building data products in a data mesh
Discover data governance and build confidence in your data
Introduce data visualizations and dashboards into your data practice

Who this book is forThis book is for developers, analytics engineers, and managers looking to further develop a data ecosystem within their organization. While they’re not prerequisites, basic knowledge of Python and prior experience with data will help you to read and follow along with the examples.

Show more
Product Details
EAN
9781801070492
ISBN
1801070490
Dimensions
23.5 x 19.1 x 1.7 centimetres (0.55 kg)

Table of Contents

Table of Contents

  • Modern Data Processing Architectures
  • Basics of Data Analytics Engineering
  • Cloud Storage and Processing Concepts
  • Python Batch and Stream Processing with Spark
  • Streaming Data with Kafka
  • Python MLOps
  • Python and SQL based Visualizations
  • Integrating CI into your workflow
  • Data Orchestration
  • Data Governance
  • Introduction to Saturn Insurance, Deploying CI and ELT
  • Data Governance and Dashboards
  • About the Author

    Brian Lipp is a Technology Polyglot, Engineer, and Solution Architect with a wide skillset in many technology domains. His programming background has ranged from R, Python, and Scala, to Go and Rust development. He has worked on Big Data systems, Data Lakes, data warehouses, and backend software engineering. Brian earned a Master of Science, CSIS from Pace University in 2009. He is currently a Sr. Data Engineer working with large Tech firms to build Data Ecosystems.

    Show more
    Review this Product
    What our customers have to say
    Ask a Question About this Product More...
     
    Look for similar items by category
    How Fishpond Works
    Fishpond works with suppliers all over the world to bring you a huge selection of products, really great prices, and delivery included on over 25 million products that we sell. We do our best every day to make Fishpond an awesome place for customers to shop and get what they want — all at the best prices online.
    Webmasters, Bloggers & Website Owners
    You can earn a 8% commission by selling Modern Data Architectures with Python: A practical guide to building and deploying data pipelines, data warehouses, and data lakes with Python on your website. It's easy to get started - we will give you example code. After you're set-up, your website can earn you money while you work, play or even sleep! You should start right now!
    Authors / Publishers
    Are you the Author or Publisher of a book? Or the manufacturer of one of the millions of products that we sell. You can improve sales and grow your revenue by submitting additional information on this title. The better the information we have about a product, the more we will sell!
    Item ships from and is sold by Fishpond World Ltd.

    Back to top