Spoken Language Processing: A Guide to Theory, Algorithm and System Development - Hardcover

Huang, Xuedong; Acero, Alex; Hon, Hsiao-Wuen

 
9780130226167: Spoken Language Processing: A Guide to Theory, Algorithm and System Development

Synopsis

This will be the definitive book on spoken language systems, written by the people at Microsoft Research who have developed the voice-activated technologies that will be embedded in Windows 2000 and other key Microsoft products of the future. This is not a Microsoft book, however; it is a book on the science and linguistics of this technology and how to use it in developing and building hardware and software products.


About the Author

XUEDONG HUANG is founder and head of the Speech Technology Group at Microsoft Research. He received his Ph.D. from the University of Edinburgh. He is an IEEE Fellow.

ALEX ACERO and HSIAO-WUEN HON are Senior Researchers at Microsoft Research and Senior Members of IEEE. Both received doctorates from Carnegie Mellon University.

Foreword by Dr. Raj Reddy, Carnegie Mellon University

From the Back Cover

  • New advances in spoken language processing: theory and practice
  • In-depth coverage of speech processing, speech recognition, speech synthesis, spoken language understanding, and speech interface design
  • Many case studies from state-of-the-art systems, including examples from Microsoft's advanced research labs

Spoken Language Processing draws on the latest advances and techniques from multiple fields: computer science, electrical engineering, acoustics, linguistics, mathematics, psychology, and beyond. Starting with the fundamentals, it presents all this and more:

  • Essential background on speech production and perception, probability and information theory, and pattern recognition
  • Extracting information from the speech signal: useful representations and practical compression solutions
  • Modern speech recognition techniques: hidden Markov models, acoustic and language modeling, improving resistance to environmental noise, search algorithms, and large vocabulary speech recognition
  • Text-to-speech: analyzing documents, pitch and duration controls, trainable synthesis, and more
  • Spoken language understanding: dialog management, spoken language applications, and multimodal interfaces

To illustrate the book's methods, the authors present detailed case studies based on state-of-the-art systems, including Microsoft's Whisper speech recognizer, Whistler text-to-speech system, Dr. Who dialog system, and the MiPad handheld device. Whether you're planning, designing, building, or purchasing spoken language technology, this is the state of the art, from algorithms through business productivity.
