Statistical Methods for Annotation Analysis (Synthesis Lectures on Human Language Technologies)

Paun, Silviu; Artstein, Ron; Poesio, Massimo

ISBN 10: 3031037537 ISBN 13: 9783031037535
Published by Springer, 2022
New Soft cover

From Ria Christie Collections, Uxbridge, United Kingdom Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

AbeBooks Seller since 25 March 2015

This specific item is no longer available.

About this Item

Description:

In English. Seller Inventory # ria9783031037535_new

Report this item

Synopsis:

Labelling data is one of the most fundamental activities in science, and has underpinned practice, particularly in medicine, for decades, as well as research in corpus linguistics since at least the development of the Brown corpus. With the shift towards Machine Learning in Artificial Intelligence (AI), the creation of datasets to be used for training and evaluating AI systems, also known in AI as corpora, has become a central activity in the field as well. Early AI datasets were created on an ad-hoc basis to tackle specific problems. As larger and more reusable datasets were created, requiring greater investment, the need for a more systematic approach to dataset creation arose to ensure increased quality. A range of statistical methods were adopted, often but not exclusively from the medical sciences, to ensure that the labels used were not subjective, or to choose among different labels provided by the coders. A wide variety of such methods is now in regular use. This book is meantto provide a survey of the most widely used among these statistical methods supporting annotation practice. As far as the authors know, this is the first book attempting to cover the two families of methods in wider use. The first family of methods is concerned with the development of labelling schemes and, in particular, ensuring that such schemes are such that sufficient agreement can be observed among the coders. The second family includes methods developed to analyze the output of coders once the scheme has been agreed upon, particularly although not exclusively to identify the most likely label for an item among those provided by the coders. The focus of this book is primarily on Natural Language Processing, the area of AI devoted to the development of models of language interpretation and production, but many if not most of the methods discussed here are also applicable to other areas of AI, or indeed, to other areas of Data Science.

About the Author: Silviu Paun got his Ph.D. from the University of Essex in 2017 with a thesis on topic models. Since then he has been at Queen Mary University of London. His research focuses on models of annotation, probabilistic and neural, for creating resources and to more efficiently train machine learning models. His models have been deployed to create the Phrase Detectives coreference corpus, one of the largest crowdsourced NLP corpora, created using the Phrase Detectives Game-With-A-Purpose.Ron Artstein received his Ph.D. in Linguistics from Rutgers University in 2002, held positions at the Technion–Israel Institute of Technology and the University of Essex, and is presently a research scientist at the Institute for Creative Technologies, University of Southern California. His current research focuses on the collection, annotation, and management of linguistic data for human–machine interaction, analysis of corpora, and the evaluation of implemented dialogue systems; he has published work on theoretical and computational linguistics, conversational dialogue systems, and human–agent and human–robot interaction.Massimo Poesio received his Ph.D. from the University of Rochester in 1994. He is a Professor in Computational Linguistics at Queen Mary University of London and a Turing Institute Fellow. His main interests are in anaphora resolution, disagreements in language interpretation, the use of games-with-a-purpose for creating NLP resources, and semantic interpretation in dialogue.

"About this title" may belong to another edition of this title.

Bibliographic Details

Title: Statistical Methods for Annotation Analysis ...
Publisher: Springer
Publication Date: 2022
Binding: Soft cover
Condition: New

Top Search Results from the AbeBooks Marketplace

Stock Image

Paun, Silviu
Published by Springer 2022-01, 2022
ISBN 10: 3031037537 ISBN 13: 9783031037535
New PF

Seller: Chiron Media, Wallingford, United Kingdom

Seller rating 4 out of 5 stars 4-star rating, Learn more about seller ratings

PF. Condition: New. Seller Inventory # 6666-IUK-9783031037535

Contact seller

Buy New

£ 53
£ 15.49 shipping
Ships from United Kingdom to U.S.A.

Quantity: 10 available

Add to basket

Seller Image

Paun, Silviu|Artstein, Ron|Poesio, Massimo
ISBN 10: 3031037537 ISBN 13: 9783031037535
New Kartoniert / Broschiert
Print on Demand

Seller: moluna, Greven, Germany

Seller rating 4 out of 5 stars 4-star rating, Learn more about seller ratings

Kartoniert / Broschiert. Condition: New. Dieser Artikel ist ein Print on Demand Artikel und wird nach Ihrer Bestellung fuer Sie gedruckt. Labelling data is one of the most fundamental activities in science, and has underpinned practice, particularly in medicine, for decades, as well as research in corpus linguistics since at least the development of the Brown corpus. With the shift towards Ma. Seller Inventory # 608129662

Contact seller

Buy New

£ 54.19
£ 42.92 shipping
Ships from Germany to U.S.A.

Quantity: Over 20 available

Add to basket

Seller Image

Paun, Silviu; Artstein, Ron; Poesio, Massimo
Published by Springer, 2022
ISBN 10: 3031037537 ISBN 13: 9783031037535
New Softcover

Seller: GreatBookPricesUK, Woodford Green, United Kingdom

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: New. Seller Inventory # 44801478-n

Contact seller

Buy New

£ 55.48
£ 15 shipping
Ships from United Kingdom to U.S.A.

Quantity: Over 20 available

Add to basket

Seller Image

Silviu Paun (u. a.)
ISBN 10: 3031037537 ISBN 13: 9783031037535
New Taschenbuch

Seller: preigu, Osnabrück, Germany

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Taschenbuch. Condition: Neu. Statistical Methods for Annotation Analysis | Silviu Paun (u. a.) | Taschenbuch | xix | Englisch | 2022 | Springer International Publishing | EAN 9783031037535 | Verantwortliche Person für die EU: Springer Verlag GmbH, Tiergartenstr. 17, 69121 Heidelberg, juergen[dot]hartmann[at]springer[dot]com | Anbieter: preigu. Seller Inventory # 121975945

Contact seller

Buy New

£ 56.35
£ 61.32 shipping
Ships from Germany to U.S.A.

Quantity: 5 available

Add to basket

Seller Image

Paun, Silviu; Artstein, Ron; Poesio, Massimo
Published by Springer, 2022
ISBN 10: 3031037537 ISBN 13: 9783031037535
Used Softcover

Seller: GreatBookPricesUK, Woodford Green, United Kingdom

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: As New. Unread book in perfect condition. Seller Inventory # 44801478

Contact seller

Buy Used

£ 61.27
£ 15 shipping
Ships from United Kingdom to U.S.A.

Quantity: Over 20 available

Add to basket

Seller Image

Paun, Silviu; Artstein, Ron; Poesio, Massimo
Published by Springer, 2022
ISBN 10: 3031037537 ISBN 13: 9783031037535
Used Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: As New. Unread book in perfect condition. Seller Inventory # 44801478

Contact seller

Buy Used

£ 62.01
£ 1.97 shipping
Ships within U.S.A.

Quantity: Over 20 available

Add to basket

Seller Image

Silviu Paun
ISBN 10: 3031037537 ISBN 13: 9783031037535
New Taschenbuch

Seller: AHA-BUCH GmbH, Einbeck, Germany

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Taschenbuch. Condition: Neu. Druck auf Anfrage Neuware - Printed after ordering - Labelling data is one of the most fundamental activities in science, and has underpinned practice, particularly in medicine, for decades, as well as research in corpus linguistics since at least the development of the Brown corpus. With the shift towards Machine Learning in Artificial Intelligence (AI), the creation of datasets to be used for training and evaluating AI systems, also known in AI as corpora, has become a central activity in the field as well. Early AI datasets were created on an ad-hoc basis to tackle specific problems. As larger and more reusable datasets were created, requiring greater investment, the need for a more systematic approach to dataset creation arose to ensure increased quality. A range of statistical methods were adopted, often but not exclusively from the medical sciences, to ensure that the labels used were not subjective, or to choose among different labels provided by the coders. A wide variety of such methods is now in regular use. This book is meantto provide a survey of the most widely used among these statistical methods supporting annotation practice. As far as the authors know, this is the first book attempting to cover the two families of methods in wider use. The first family of methods is concerned with the development of labelling schemes and, in particular, ensuring that such schemes are such that sufficient agreement can be observed among the coders. The second family includes methods developed to analyze the output of coders once the scheme has been agreed upon, particularly although not exclusively to identify the most likely label for an item among those provided by the coders. The focus of this book is primarily on Natural Language Processing, the area of AI devoted to the development of models of language interpretation and production, but many if not most of the methods discussed here are also applicable to other areas of AI, or indeed, to other areas of Data Science. Seller Inventory # 9783031037535

Contact seller

Buy New

£ 62.75
£ 54.40 shipping
Ships from Germany to U.S.A.

Quantity: 1 available

Add to basket

Seller Image

Silviu Paun
ISBN 10: 3031037537 ISBN 13: 9783031037535
New Taschenbuch

Seller: buchversandmimpf2000, Emtmannsberg, BAYE, Germany

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Taschenbuch. Condition: Neu. Neuware -Labelling data is one of the most fundamental activities in science, and has underpinned practice, particularly in medicine, for decades, as well as research in corpus linguistics since at least the development of the Brown corpus. With the shift towards Machine Learning in Artificial Intelligence (AI), the creation of datasets to be used for training and evaluating AI systems, also known in AI as corpora, has become a central activity in the field as well. Early AI datasets were created on an ad-hoc basis to tackle specific problems. As larger and more reusable datasets were created, requiring greater investment, the need for a more systematic approach to dataset creation arose to ensure increased quality. A range of statistical methods were adopted, often but not exclusively from the medical sciences, to ensure that the labels used were not subjective, or to choose among different labels provided by the coders. A wide variety of such methods is now in regular use. This book is meantto provide a survey of the most widely used among these statistical methods supporting annotation practice. As far as the authors know, this is the first book attempting to cover the two families of methods in wider use. The first family of methods is concerned with the development of labelling schemes and, in particular, ensuring that such schemes are such that sufficient agreement can be observed among the coders. The second family includes methods developed to analyze the output of coders once the scheme has been agreed upon, particularly although not exclusively to identify the most likely label for an item among those provided by the coders. The focus of this book is primarily on Natural Language Processing, the area of AI devoted to the development of models of language interpretation and production, but many if not most of the methods discussed here are also applicable to other areas of AI, or indeed, to other areas of Data Science.Springer Verlag GmbH, Tiergartenstr. 17, 69121 Heidelberg 220 pp. Englisch. Seller Inventory # 9783031037535

Contact seller

Buy New

£ 62.75
£ 52.56 shipping
Ships from Germany to U.S.A.

Quantity: 2 available

Add to basket

Seller Image

Silviu Paun
ISBN 10: 3031037537 ISBN 13: 9783031037535
New Taschenbuch
Print on Demand

Seller: BuchWeltWeit Ludwig Meier e.K., Bergisch Gladbach, Germany

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Taschenbuch. Condition: Neu. This item is printed on demand - it takes 3-4 days longer - Neuware -Labelling data is one of the most fundamental activities in science, and has underpinned practice, particularly in medicine, for decades, as well as research in corpus linguistics since at least the development of the Brown corpus. With the shift towards Machine Learning in Artificial Intelligence (AI), the creation of datasets to be used for training and evaluating AI systems, also known in AI as corpora, has become a central activity in the field as well. Early AI datasets were created on an ad-hoc basis to tackle specific problems. As larger and more reusable datasets were created, requiring greater investment, the need for a more systematic approach to dataset creation arose to ensure increased quality. A range of statistical methods were adopted, often but not exclusively from the medical sciences, to ensure that the labels used were not subjective, or to choose among different labels provided by the coders. A wide variety of such methods is now in regular use. This book is meantto provide a survey of the most widely used among these statistical methods supporting annotation practice. As far as the authors know, this is the first book attempting to cover the two families of methods in wider use. The first family of methods is concerned with the development of labelling schemes and, in particular, ensuring that such schemes are such that sufficient agreement can be observed among the coders. The second family includes methods developed to analyze the output of coders once the scheme has been agreed upon, particularly although not exclusively to identify the most likely label for an item among those provided by the coders. The focus of this book is primarily on Natural Language Processing, the area of AI devoted to the development of models of language interpretation and production, but many if not most of the methods discussed here are also applicable to other areas of AI, or indeed, to other areas of Data Science. 220 pp. Englisch. Seller Inventory # 9783031037535

Contact seller

Buy New

£ 62.75
£ 20.15 shipping
Ships from Germany to U.S.A.

Quantity: 2 available

Add to basket

Seller Image

Paun, Silviu; Artstein, Ron; Poesio, Massimo
Published by Springer, 2022
ISBN 10: 3031037537 ISBN 13: 9783031037535
New Softcover

Seller: GreatBookPrices, Columbia, MD, U.S.A.

Seller rating 5 out of 5 stars 5-star rating, Learn more about seller ratings

Condition: New. Seller Inventory # 44801478-n

Contact seller

Buy New

£ 69.53
£ 1.97 shipping
Ships within U.S.A.

Quantity: Over 20 available

Add to basket

There are 4 more copies of this book

View all search results for this book