Text Data Management and Analysis: A Practical Introduction to Information Retrieval and Text Mining
Imported Edition - Ships in 18-21 Days
Free Shipping in India on orders above Rs. 500
Imported Edition - Ships in 18-21 Days
Free Shipping in India on orders above Rs. 500
Recent years have seen a dramatic growth of natural language text data, including web pages, news articles, scientific literature, emails, enterprise documents, and social media such as blog articles, forum posts, product reviews, and tweets. This has led to an increasing demand for powerful software tools to help people analyze and manage vast amounts of text data effectively and efficiently. Unlike data generated by a computer system or sensors, text data are usually generated directly by humans, and are accompanied by semantically rich content. As such, text data are especially valuable for discovering knowledge about human opinions and preferences, in addition to many other kinds of knowledge that we encode in text. In contrast to structured data, which conform to well-defined schemas (thus are relatively easy for computers to handle), text has less explicit structure, requiring computer processing toward understanding of the content encoded in text. The current technology of natural language processing has not yet reached a point to enable a computer to precisely understand natural language text, but a wide range of statistical and heuristic approaches to analysis and management of text data have been developed over the past few decades. They are usually very robust and can be applied to analyze and manage text data in any natural language, and about any topic.
This book provides a systematic introduction to all these approaches, with an emphasis on covering the most useful knowledge and skills required to build a variety of practically useful text information systems. The focus is on text mining applications that can help users analyze patterns in text data to extract and reveal useful knowledge. Information retrieval systems, including search engines and recommender systems, are also covered as supporting technology for text mining applications. The book covers the major concepts, techniques, and ideas in text data mining and information retrieval from a practical viewpoint, and includes many hands-on exercises designed with a companion software toolkit (i.e., MeTA) to help readers learn how to apply techniques of text mining and information retrieval to real-world text data and how to experiment with and improve some of the algorithms for interesting application tasks. The book can be used as a textbook for a computer science undergraduate course or a reference book for practitioners working on relevant problems in analyzing and managing text data.
Massung, Sean: - Sean Massung is a Ph.D. candidate in computer science at the University of Illinois at Urbana-Champaign, where he also received both his B.S. and M.S. degrees. He is a co-founder of META and uses it in all of his research. He has been instructor for CS 225: Data Structures and Programming Principles, CS 410: Text Information Systems, and CS 591txt: Text Mining Seminar. He is included in the 2014 List of Teachers Ranked as Excellent at the University of Illinois and has received an Outstanding Teaching Assistant Award and CS@Illinois Outstanding Research Project Award. He has given talks at Jump Labs Champaign and at UIUC for Data and Information Systems Seminar, Intro to Big Data, and Teaching Assistant Seminar. His research interests include text mining applications in information retrieval, natural language processing, and education.
Zhai, Chengxiang: - ChengXiang Zhai is a Professor of Computer Science and Willett Faculty Scholar at the University of Illinois at Urbana-Champaign, where he is also affiliated with the Graduate School of Library and Information Science, Institute for Genomic Biology, and Department of Statistics. He received a Ph.D. in Computer Science from Nanjing University in 1990, and a Ph.D. in Language and Information Technologies from Carnegie Mellon University in 2002. He worked at Clairvoyance Corp. as a Research Scientist and then Senior Research Scientist from 1997-2000. His research interests include information retrieval, text mining, natural language processing, machine learning, biomedical and health informatics, and intelligent education information systems. He has published over 200 research papers in major conferences and journals. He served as an Associate Editor for Information Processing and Management, as an Associate Editor of ACM Transactions on Information Systems, and on the editorial board of Information Retrieval Journal. He was a conference program co-chair of ACM CIKM 2004, NAACL HLT 2007, ACM SIGIR 2009, ECIR 2014, ICTIR 2015, and WWW 2015, and conference general co-chair for ACM CIKM 2016. He is an ACM Distinguished Scientist and a recipient of multiple awards, including the ACM SIGIR 2004 Best Paper Award, the ACM SIGIR 2014 Test of Time Paper Award, Alfred P. Sloan Research Fellowship, IBM Faculty Award, HP Innovation Research Program Award, Microsoft Beyond Search Research Award, and the Presidential Early Career Award for Scientists and Engineers (PECASE).
• Author(s): Clear | James • Publisher: Penguin • Publisher Imprint: Penguin Random House • Subject: General Books
• Author(s): Jeff Kinney • Publisher: Penguin Random House Children's UK • Publisher Imprint: Penguin Random House Children's UK • BISAC: Comics & Graphic Novels - Humorous
• Author(s): Ichiro Kishimi • Publisher: GROVE ATLANTIC • Publisher Imprint: Allen & Unwin • BISAC: Personal Growth - SuccessIchiro Kishimi lives in Kyoto. He writes, lectures and teaches in psychiatric clinics as a certified counsellor and c...
View full details• Author(s): Chetan Bhagat • Publisher: HarperCollins Publishers India • Publisher Imprint: HarperCollins Publishers India • BISAC: GeneralFrom India's top-selling writer Chetan Bhagat comes a powerful new love story that will make you laugh, cry...
View full details• Author(s): Brianna Wiest • Publisher: Manjul Publishing • Publisher Imprint: Amaryllis • BISAC: Body Mind And SpiritThis is a book about self-sabotage. Why we do it, when we do it, and how to stop doing it—for good. Coexisting but conflicting n...
View full details• Author(s): Morgan Housel • Publisher: Pan Macmillan • Publisher Imprint: Pan Macmillan • BISAC: Finance - Wealth ManagementA third book from the International bestselling author of The Psychology of Money and Same as Ever, lessons on harnessing...
View full details• Author(s): Arundhati Roy• Publisher: PRH INDIA LOCAL PRINT• Publisher Imprint: Penguin Hamish Hamilton• BISAC: Literary FiguresArundhati Roy’s first work of memoir, this is a soaring account, both intimate and inspiring, of how the author became...
View full details• Author(s): Acharya Prashant • Publisher: HarperCollins Publishers India • Publisher Imprint: HarperCollins Publishers India • BISAC: GeneralIn a world where vagueness is mistaken for depth and obscurity passes for wisdom, Truth without Apology ...
View full details• Author(s): Sudha Murthy • Publisher: India Puffin • Publisher Imprint: India Puffin • BISAC: Short StoriesWho can resist a good story, especially when it's being told by Grandma? From her bag emerges tales of kings and cheats, monkeys and mic...
View full details• Author(s): Satoshi Yagisawa • Publisher: Bonnier Books Ltd • Publisher Imprint: Bonnier Books Ltd
• Author(s): Newport, Cal • Publisher: Little, Brown Book Group • Publisher Imprint: Piatkus
• Author(s): Shrijeet Shandilya • Publisher: Ebury Press • Publisher Imprint: Ebury Press • BISAC: Romance - GeneralIn the electric haze of college life, three friends are bound by laughter, late-night talks and unspoken promises. But when two of...
View full details• Author(s): Dan Brown • Publisher: Transworld Publishers Ltd • Publisher Imprint: Transworld Publishers Ltd • BISAC: Thrillers - EspionageDan Brown is the bestselling author of Digital Fortress, Deception Point, Angels and Demons, The Da Vinci C...
View full details• Author(s): Sudha Murty • Publisher: India Puffin • Publisher Imprint: India Puffin • BISAC: Action & Adventure - General
Rich Dad Poor Dad: What the Rich Teach Their Kids about Money That the Poor and Middle Class Do Not!
• Publisher: Penguin • Publisher Imprint: Penguin Random House • Subject: General Books • BISAC: Personal Finance - GeneralApril of 2022 marks a 25-year milestone for the personal finance classic Rich Dad Poor Dad that still ranks as the #1 Pers...
View full details• Author(s): Dale Carnegie | Napoleon Hill • Publisher: Fingerprint • Publisher Imprint: Fingerprint • Subject: General Books
• Author(s): Freida Mcfadden • Publisher: Penguin Select Print • Publisher Imprint: Penguin Select Publishing"Multi-Million Copy Bestselling Series •Now Being Made Into a Major Motion Picture Starring Sydney Sweeney and Amanda Seyfried #1 New Yor...
View full details• Author(s): Wonder House Books • Publisher: Wonder House Books • Publisher Imprint: Wonder House Books • BISAC: Comics & Graphic Novels - Fairy Tales, Folklore, Legends & MTimeless Wisdom, Talking Animals & Life Lessons for Young Min...
View full details• Author(s): Viktor E. Frankl • Publisher: Random House • Publisher Imprint: Random Hou • Subject: Medical, Nursing and Health Sciences
• Author(s): Madhavi Bharadwaj • Publisher: PRH India • Publisher Imprint: Penguin Ebury Press • BISAC: Parenting - MotherhoodWelcome to the wild, messy, wonderful world of parenting--where the nights are long, the diapers are explosive, and unso...
View full details• Author(s): Vir Das • Publisher: HarperCollins Publishers India • Publisher Imprint: HarperCollins Publishers India • BISAC: Entertainment & Performing ArtsComedian and actor Vir Das is beloved (by some, tolerated by others, blocked by a few...
View full details• Author(s): Eric Carle • Publisher: Penguin Books, Limited (UK) • Publisher Imprint: Penguin Books, Limited (UK) • BISAC: Animals - Butterflies, Moths & CaterpillarsEric Carle's The Very Hungry Caterpillar is a perennial favourite with child...
View full details• Author(s): Prajakta Koli • Publisher: Harper Fiction India • Publisher Imprint: Harper Fiction India • BISAC: Romance - ContemporaryWinner of the Amazon India Popular Choice Debut Book 2025 Award. From one of India's most-loved creators comes s...
View full details