Lizenz
Bitte beziehen Sie sich beim Zitieren dieses Dokumentes immer auf folgende
URN: urn:nbn:de:kobv:517-opus-27115
URL: http://opus.kobv.de/ubp/volltexte/2008/2711/
Barbaiani, Mădălina ;
Cancedda, Nicola ;
Dance , Chris ;
Fazekas, Szilárd ;
Gaál, Tamás ;
Gaussier, Éric
Asymmetric term alignment with selective contiguity constraints by multi-tape automata
Kurzfassung in Deutsch
This article describes a HMM-based word-alignment method that can selectively enforce a contiguity constraint. This method has a direct application in the extraction of a bilingual terminological lexicon from a parallel corpus, but can also be used as a preliminary step for the extraction of phrase pairs in a Phrase-Based Statistical Machine Translation system. Contiguous source words composing terms are aligned to contiguous target language words. The HMM is transformed into a Weighted Finite State Transducer (WFST) and contiguity constraints are enforced by specific multi-tape WFSTs. The proposed method is especially suited when basic linguistic resources (morphological analyzer, part-of-speech taggers and term extractors) are available for the source language only.
|
Collection: |
|
Universität Potsdam / Tagungen / Finite-state methods and natural language processing : 6th International Workshop, FSMNLP 2007 / II Regular Papers |
|
Institut: |
|
Extern |
|
DDC-Sachgruppe: |
|
Sprachwissenschaft, Linguistik |
|
Dokumentart: |
|
c InProceedings (Aufsatz / Paper einer Konferenz etc.) |
|
Sprache: |
|
Englisch |
|
Erstellungsjahr: |
|
2008 |
|
Publikationsdatum: |
|
11.12.2008 |
|
Bemerkung: |
|
The complete edition of the proceedings "Finite-state methods and natural language processing : 6th International Workshop, FSMNLP 2007 ; Revised Papers" is available:
URN urn:nbn:de:kobv:517-opus-23812 |
|
Lizenz: |
|
Diese Nutzungsbedingung gilt nicht, wenn in den Metadaten eine modifizierende Lizenz genannt ist.
Keine Nutzungslizenz vergeben - es gilt das deutsche Urheberrecht
|