{"product_id":"data-science-solutions-with-python-fast-and-scalable-models-using-keras-pyspark-mllib-h2o-xgboost-and-scikit-learn-9781484277614","title":"Data Science Solutions with Python: Fast and Scalable Models Using Keras, PySpark MLlib, H2O, XGBoost, and Scikit-Learn","description":"\u003cp\u003e • Author(s): Tshepo Chris Nokeri\u003cbr\u003e • Publisher: Apress\u003cbr\u003e • Publisher Imprint: Apress\u003cbr\u003e • BISAC: Probability \u0026amp; Statistics - General\u003c\/p\u003e\u003cp\u003e\u003c\/p\u003e\u003cp\u003e\u003cb\u003eFrom the Back Cover\u003c\/b\u003e\u003cbr\u003eApply supervised and unsupervised learning to solve practical and real-world big data problems. This book teaches you how to engineer features, optimize hyperparameters, train and test models, develop pipelines, and automate the machine learning (ML) process. \u003cbr\u003eThe book covers an in-memory, distributed cluster computing framework known as PySpark, machine learning framework platforms known as scikit-learn, PySpark MLlib, H2O, and XGBoost, and a deep learning (DL) framework known as Keras.\u003cbr\u003e\u003c\/p\u003e\u003cp\u003eThe book starts off presenting supervised and unsupervised ML and DL models, and then it examines big data frameworks along with ML and DL frameworks. Author Tshepo Chris Nokeri considers a parametric model known as the Generalized Linear Model and a survival regression model known as the Cox Proportional Hazards model along with Accelerated Failure Time (AFT). Also presented is a binary classification model (logistic regression) and an ensemble model (Gradient Boosted Trees). The book introduces DL and an artificial neural network known as the Multilayer Perceptron (MLP) classifier. A way of performing cluster analysis using the K-Means model is covered. Dimension reduction techniques such as Principal Components Analysis and Linear Discriminant Analysis are explored. And automated machine learning is unpacked.\u003c\/p\u003e\u003cp\u003eThis book is for intermediate-level data scientists and machine learning engineers who want to learn how to apply key big data frameworks and ML and DL frameworks. You will need prior knowledge of the basics of statistics, Python programming, probability theories, and predictive analytics. \u003c\/p\u003eWhat You Will Learn\u003cbr\u003e\u003cul\u003e\n\u003cli\u003eUnderstand widespread supervised and unsupervised learning, including key dimension reduction techniques\u003c\/li\u003e\n\u003cli\u003eKnow the big data analytics layers such as data visualization, advanced statistics, predictive analytics, machine learning, and deep learning\u003c\/li\u003e\n\u003cli\u003eIntegrate big data frameworks with a hybrid of machine learning frameworks and deep learning frameworks\u003c\/li\u003e\n\u003cli\u003eDesign, build, test, and validate skilled machine models and deep learning models\u003c\/li\u003e\n\u003cli\u003eOptimize model performance using data transformation, regularization, outlier remedying, hyperparameter optimization, and data split ratio alteration\u003c\/li\u003e\n\u003c\/ul\u003e\u003cp\u003e\u003c\/p\u003e\u003cp\u003e \u003c\/p\u003e \u003cp\u003e\u003c\/p\u003e","brand":"Apress","offers":[{"title":"Paperback","offer_id":45127401832599,"sku":"9781484277614","price":2614.0,"currency_code":"INR","in_stock":true}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0666\/3471\/1191\/files\/9781484277614.webp?v=1767903583","url":"https:\/\/atlanticbooks.com\/products\/data-science-solutions-with-python-fast-and-scalable-models-using-keras-pyspark-mllib-h2o-xgboost-and-scikit-learn-9781484277614","provider":"Atlantic Books","version":"1.0","type":"link"}