{"product_id":"a-practical-guide-for-building-an-enterprise-data-lake-9789365891430","title":"A Practical Guide for Building an Enterprise Data Lake","description":"\u003cp\u003e • Author(s): Sai Srinivas Sriparasa\u003cbr\u003e • Publisher: Bpb Publications\u003cbr\u003e • Publisher Imprint: Bpb Publications\u003cbr\u003e • BISAC: Data Science - Data Warehousing\u003c\/p\u003e\u003cp\u003e\u003c\/p\u003e\u003cp\u003e\u003cspan class=\"metafield-multi_line_text_field\"\u003eData lakes are the essential technology for tackling the explosive growth of big data volume, velocity, and variety, moving beyond traditional data warehousing to unlock advanced analytics and machine learning. \u003cbr\u003e\n\u003cbr\u003e\nThis comprehensive book begins by clearly defining the differences between the data lake, lake house, and data mesh architectures and immediately addresses critical governance pitfalls and required upskilling before diving into technical implementation. You will learn the discovery process to define data zones and master ingestion using bulk methods and streaming via Apache Kafka to build Lambda architectures. We then detail ad-hoc data discovery and cataloguing with tools like AWS Glue Data Catalog, followed by practical data transformation using PySpark ETL and orchestration tools to ensure data quality rules. The book concludes by showing you how to enable consumption layers for OLAP engines and machine learning, and finally, how to secure the entire platform with strong security, networking, and budget governance.\u003cbr\u003e\n\u003cbr\u003e\nUpon completing this practical book, you will possess the competency to not only architect and build a scalable data lake but also to strategically expand its value by treating data as a product, making you a highly effective and confident enterprise data lake professional ready for real-world application.\u003cbr\u003e\n\u003cbr\u003e\nWHAT YOU WILL LEARN\u003cbr\u003e\n● Differentiate Data Lake, Lake House, Data Mesh, and Data Fabric semantics.\u003cbr\u003e\n● Design data zones and cost allocation during the discovery process.\u003cbr\u003e\n● Implement streaming ingestion using Apache Kafka for Lambda architecture.\u003cbr\u003e\n● Build PySpark ETL\/SQL ELT pipelines with orchestration tools for quality.\u003cbr\u003e\n● Implement security, networking, and monitoring requirements for governance.\u003cbr\u003e\n\u003cbr\u003e\nWHO THIS BOOK IS FOR\u003cbr\u003e\nThis practical book is ideal for business\/product leaders, architects, and solution engineers. Readers should have foundational knowledge of open-source technologies and major cloud environments like AWS, GCP, or Azure.\u003cbr\u003e\n\u003c\/span\u003e\u003c\/p\u003e","brand":"Bpb Publications","offers":[{"title":"Paperback","offer_id":46556106358935,"sku":"9789365891430","price":800.0,"currency_code":"INR","in_stock":false}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0666\/3471\/1191\/files\/9789365891430.webp?v=1768035269","url":"https:\/\/atlanticbooks.com\/products\/a-practical-guide-for-building-an-enterprise-data-lake-9789365891430","provider":"Atlantic Books","version":"1.0","type":"link"}