{"product_id":"advanced-analytics-with-pyspark-patterns-for-learning-from-data-at-scale-using-python-and-spark-9781098103651","title":"Advanced Analytics with Pyspark: Patterns for Learning from Data at Scale Using Python and Spark","description":"\u003cp\u003e • Author(s): Akash Tandon\u003cbr\u003e • Publisher: O'Reilly Media\u003cbr\u003e • Publisher Imprint: O'Reilly Media\u003cbr\u003e • BISAC: Data Science - Data Analytics\u003c\/p\u003e\u003cp\u003e\u003c\/p\u003e\u003cp\u003eThe amount of data being generated today is staggering and growing. Apache Spark has emerged as the de facto tool to analyze big data and is now a critical part of the data science toolbox. Updated for Spark 3.0, this practical guide brings together Spark, statistical methods, and real-world datasets to teach you how to approach analytics problems using PySpark, Spark's Python API, and other best practices in Spark programming. \u003c\/p\u003e\u003cp\u003e Data scientists Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, and Josh Wills offer an introduction to the Spark ecosystem, then dive into patterns that apply common techniques-including classification, clustering, collaborative filtering, and anomaly detection, to fields such as genomics, security, and finance. This updated edition also covers NLP and image processing. \u003c\/p\u003e\u003cp\u003e If you have a basic understanding of machine learning and statistics and you program in Python, this book will get you started with large-scale data analysis. \u003c\/p\u003e\u003cul\u003e \u003cli\u003eFamiliarize yourself with Spark's programming model and ecosystem \u003c\/li\u003e\n\u003cli\u003eLearn general approaches in data science \u003c\/li\u003e\n\u003cli\u003eExamine complete implementations that analyze large public datasets \u003c\/li\u003e\n\u003cli\u003eDiscover which machine learning tools make sense for particular problems \u003c\/li\u003e\n\u003cli\u003eExplore code that can be adapted to many uses \u003c\/li\u003e\n\u003c\/ul\u003e","brand":"O'Reilly Media","offers":[{"title":"Paperback","offer_id":45031219593367,"sku":"9781098103651","price":4526.0,"currency_code":"INR","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0666\/3471\/1191\/files\/9781098103651.webp?v=1767896727","url":"https:\/\/atlanticbooks.com\/products\/advanced-analytics-with-pyspark-patterns-for-learning-from-data-at-scale-using-python-and-spark-9781098103651","provider":"Atlantic Books","version":"1.0","type":"link"}