{"product_id":"small-language-models-for-mobile-devices-a-guide-to-on-device-ai-model-optimization-and-edge-computing-for-android-and-ios-9798259071360","title":"Small Language Models for Mobile Devices: A Guide to On-Device AI, Model Optimization, and Edge Computing for Android and iOS","description":"\u003cp\u003e • Author(s): Thomas O. Greene\u003cbr\u003e • Publisher: Independently Published\u003cbr\u003e • Publisher Imprint: Independently Published\u003cbr\u003e • BISAC: Data Science - Neural Networks\u003c\/p\u003e\u003cp\u003eStop Renting Intelligence. Start Owning It. \u003c\/p\u003e\u003cp\u003e\u003c\/p\u003e\u003cb\u003eThe Cloud is hitting a wall. Latency is killing your user experience. Privacy is becoming a legal minefield. And API costs are bleeding your startup dry.\u003c\/b\u003e\u003cbr\u003eNow, the \"God Models\" have moved from massive data centers into the palm of your hand.\u003cbr\u003eIn \u003ci\u003e\u003cb\u003eSmall Language Models for Mobile Devices\u003c\/b\u003e\u003c\/i\u003e, visionary developer and engineer Thomas O. Greene reveals the blueprint for the most significant shift in computing since the smartphone itself: The Silicon Sovereignty.\u003cbr\u003eWe are moving away from \"Intelligence-as-a-Service\" and toward\u003cb\u003e \"Intelligence-as-a-Utility.\"\u003c\/b\u003e This book is your technical manifesto and hands-on guide to building, optimizing, and deploying high-performance AI that runs 100% offline, with sub-50ms latency, on standard Android and iOS hardware. \u003cp\u003e\u003c\/p\u003e\u003cb\u003eWhat's Inside the Engine Room?\u003c\/b\u003e\u003cul\u003e\n\u003cli\u003e\n\u003cb\u003eThe Architecture of Efficiency: \u003c\/b\u003eDeep-dives into \u003cb\u003ePhi-4, Gemma, and Llama-3-Mobile\u003c\/b\u003e. 
Learn why \"small\" doesn't mean \"weak\" when you master \u003cb\u003eGrouped-Query Attention (GQA) \u003c\/b\u003eand \u003cb\u003eRotary Embeddings\u003c\/b\u003e.\u003c\/li\u003e\n\u003cli\u003e\n\u003cb\u003eThe Magic of Quantization: \u003c\/b\u003eStep-by-step techniques to squeeze 7B-parameter models into 4GB of RAM using \u003cb\u003eINT4, NF4, and the 1.58-bit Binary Frontier.\u003c\/b\u003e\n\u003c\/li\u003e\n\u003cli\u003e\n\u003cb\u003eNext-Gen Frameworks: \u003c\/b\u003eMaster \u003cb\u003eExecuTorch (PyTorch Edge), Apple MLX, \u003c\/b\u003eand \u003cb\u003eAndroid AICore\u003c\/b\u003e to talk directly to the NPU silicon.\u003c\/li\u003e\n\u003cli\u003e\n\u003cb\u003eBeyond Text: \u003c\/b\u003eDeploy \u003cb\u003eMulti-Modal SLMs\u003c\/b\u003e that \"see\" through the camera and \"hear\" through the mic with native audio-to-audio processing.\u003c\/li\u003e\n\u003cli\u003e\n\u003cb\u003eThe Agentic Revolution: \u003c\/b\u003eBuild \u003cb\u003eLarge Action Models (LAMs)\u003c\/b\u003e that navigate mobile UIs, booking rides and sending messages without a single cloud request.\u003c\/li\u003e\n\u003cli\u003e\n\u003cb\u003eThe Future is Liquid: \u003c\/b\u003eAn exclusive look at \u003cb\u003eLiquid Neural Networks (LNNs)\u003c\/b\u003e, the breakthrough for infinite context and constant memory footprints.\u003c\/li\u003e\n\u003c\/ul\u003eWhy This Book is Essential: \u003cbr\u003eWhether you are a Mobile Developer tired of \"Cloud Fatigue,\" a Machine Learning Engineer fighting the \"Memory Wall,\" or a Tech Leader demanding \"Privacy-First\" AI, this book provides \u003cb\u003ethe code, the math, and the strategy to win\u003c\/b\u003e. \u003cp\u003e\u003c\/p\u003e\u003cb\u003eThe era of the \"Frozen Snapshot\" LLM is over. The era of the Fluid, Private, and Autonomous Mobile Agent has begun.\u003cbr\u003eStop sending your users' data to a third-party server. 
Take the red pill of Data Sovereignty and build the private, powerful, and portable future today.\u003c\/b\u003e","brand":"Independently Published","offers":[{"title":"Paperback","offer_id":47883272880279,"sku":"9798259071360","price":1832.0,"currency_code":"INR","in_stock":false}],"thumbnail_url":"\/\/cdn.shopify.com\/s\/files\/1\/0666\/3471\/1191\/files\/9798259071360.webp?v=1781100740","url":"https:\/\/atlanticbooks.com\/products\/small-language-models-for-mobile-devices-a-guide-to-on-device-ai-model-optimization-and-edge-computing-for-android-and-ios-9798259071360","provider":"Atlantic Books","version":"1.0","type":"link"}