img Leseprobe Leseprobe

Building and Using Comparable Corpora for Multilingual Natural Language Processing

Pierre Zweigenbaum, Serge Sharoff, Reinhard Rapp, et al.

PDF
ca. 42,79
Amazon iTunes Thalia.de Hugendubel Bücher.de ebook.de kobo Osiander Google Books Barnes&Noble bol.com Legimi yourbook.shop Kulturkaufhaus ebooks-center.de
* Affiliatelinks/Werbelinks
Hinweis: Affiliatelinks/Werbelinks
Links auf reinlesen.de sind sogenannte Affiliate-Links. Wenn du auf so einen Affiliate-Link klickst und über diesen Link einkaufst, bekommt reinlesen.de von dem betreffenden Online-Shop oder Anbieter eine Provision. Für dich verändert sich der Preis nicht.

Springer International Publishing img Link Publisher

Naturwissenschaften, Medizin, Informatik, Technik / Informatik

Beschreibung

This book provides a comprehensive overview of methods to build comparable corpora and of their applications, including machine translation, cross-lingual transfer, and various kinds of multilingual natural language processing. The authors begin with a brief history on the topic followed by a comparison to parallel resources and an explanation of why comparable corpora have become more widely used. In particular, they provide the basis for the multilingual capabilities of pre-trained models, such as BERT or GPT. The book then focuses on building comparable corpora, aligning their sentences to create a database of suitable translations, and using these sentence translations to produce dictionaries and term banks. Then, it is explained how comparable corpora can be used to build machine translation engines and to develop a wide variety of multilingual applications.

Weitere Titel in dieser Kategorie
Cover AI Glossary
Richard Khan
Cover AI Glossary
Richard Khan
Cover Some Future Day
Marc Beckman
Cover AI in Disease Detection
Shaik Vaseem Akram
Cover Network Models in Finance
Gueorgui S. Konstantinov

Kundenbewertungen

Schlagwörter

Natural Language Processing, Cross-lingual Models, Machine Translation, Vector Space Model, Comparable Corpora, Parallel Corpora, Multilingual Natural Language Processing