Syntax-based Statistical Machine Translation (Synthesis Lectures on Human Language Technologies) - Softcover

Philip Williams ; Rico Sennrich ; Matt Post ; Philipp Koehn

9781627059008: Syntax-based Statistical Machine Translation (Synthesis Lectures on Human Language Technologies)

Softcover

ISBN 10: 1627059008 ISBN 13: 9781627059008

Publisher: Morgan & Claypool Publishers, 2016

View all copies of this ISBN edition

0 Used

1 New

From � 16

This unique book provides a comprehensive introduction to the most popular syntax-based statistical machine translation models, filling a gap in the current literature for researchers and developers in human language technologies. While phrase-based models have previously dominated the field, syntax-based approaches have proved a popular alternative, as they elegantly solve many of the shortcomings of phrase-based models. The heart of this book is a detailed introduction to decoding for syntax-based models.

The book begins with an overview of synchronous-context free grammar (SCFG) and synchronous tree-substitution grammar (STSG) along with their associated statistical models. It also describes how three popular instantiations (Hiero, SAMT, and GHKM) are learned from parallel corpora. It introduces and details hypergraphs and associated general algorithms, as well as algorithms for decoding with both tree and string input. Special attention is given to efficiency, including search approximations such as beam search and cube pruning, data structures, and parsing algorithms. The book consistently highlights the strengths (and limitations) of syntax-based approaches, including their ability to generalize phrase-based translation units, their modeling of specific linguistic phenomena, and their function of structuring the search space.

"synopsis" may belong to another edition of this title.

About the Author

University of Edinburgh|Heidelberg University, Germany|Johns Hopkins University|Johns Hopkins University

"About this title" may belong to another edition of this title.

Publisher: Morgan & Claypool Publishers
Publication date: 2016
Language: English
ISBN 10: 1627059008
ISBN 13: 9781627059008
Binding: Paperback
Number of pages: 208

Other Popular Editions of the Same Title

9783031010361: Syntax-based Statistical Machine Translation (Synthesis Lectures on Human Language Technologies)

Featured Edition

ISBN 10: 3031010361 ISBN 13: 9783031010361
Publisher: Springer, 2016
Softcover

Search results for Syntax-based Statistical Machine Translation (Synthesis...

Seller Image