Abstract
This paper describes how translations are represented in the Finnish wordnet, FinnWordNet (FiWN), and how the FiWN database was constructed. FiWN was created by translating all the word senses of the Princeton WordNet (PWN) into Finnish and by joining the translations with the semantic and lexical relations of PWN, which had been extracted into a relational database format. The approach naturally resulted in a translation relation between PWN and FiWN. Unlike in many other multilingual wordnets, the translation relation in FiWN is defined primarily at the level of individual word senses rather than synsets, which allows more precise translation correspondences. This relation can easily be projected onto a synset-level translation relation, which is used for linking with other wordnets via Core WordNet. Synset-level translations are also used as a default in the absence of word sense translations. The FiWN data in the relational database can be converted to other formats. In the PWN database format, translations are attached to source-language words, which allows a Web search interface to double as a bilingual dictionary. Another representation encodes the translation relation as a finite-state transducer.
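The relationship between sense-level and synset-level translations can be illustrated with a minimal Python sketch. It is not the FiWN implementation or schema: the synset identifiers, the dictionary layout, and the function names `project_to_synsets` and `translate` are all hypothetical, and the word pairs are toy data rather than entries taken from FiWN. The sketch only shows the two ideas named in the abstract: projecting word-sense translations onto synsets, and falling back to the synset-level translation when a sense has no translation of its own.

```python
from collections import defaultdict

# Hypothetical toy data: (synset id, English lemma) -> Finnish translations.
# The identifiers and pairs are illustrative only, not taken from FiWN.
SENSE_TRANSLATIONS = {
    ("syn-dog-n-1", "dog"): ["koira"],
    ("syn-dog-n-1", "domestic dog"): ["kotikoira"],
    ("syn-car-n-1", "car"): ["auto"],
}


def project_to_synsets(sense_translations):
    """Collapse sense-level translations into a synset-level relation."""
    projected = defaultdict(list)
    for (synset_id, _lemma), targets in sense_translations.items():
        for target in targets:
            # Keep insertion order and avoid duplicates within a synset.
            if target not in projected[synset_id]:
                projected[synset_id].append(target)
    return dict(projected)


def translate(synset_id, lemma, sense_translations, synset_translations):
    """Prefer the word-sense translation; default to the synset level."""
    direct = sense_translations.get((synset_id, lemma))
    if direct:
        return list(direct)
    return list(synset_translations.get(synset_id, []))


SYNSET_TRANSLATIONS = project_to_synsets(SENSE_TRANSLATIONS)

# "automobile" has no sense-level entry, so the synset default is used.
print(translate("syn-car-n-1", "automobile",
                SENSE_TRANSLATIONS, SYNSET_TRANSLATIONS))
```

Keeping the sense-level table as the primary store and deriving the synset-level table from it means the coarser relation never has to be maintained separately.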
Original language | English |
---|---|
Title of host publication | The eighth international conference on Language Resources and Evaluation (LREC) : LREC 2012 |
Number of pages | 8 |
Volume | 8 |
Place of publication | Istanbul |
Publisher | European Language Resources Association (ELRA) |
Publication date | 23 May 2012 |
ISBN (print) | 978-2-9517408-7-7 |
Status | Published - 23 May 2012 |
MoE publication type | A4 Article in conference proceedings |
Event | International conference on Language Resources and Evaluation - Istanbul, Turkey; Duration: 23 May 2012 → 25 May 2012; Conference number: 8 |
Fields of science
- 6121 Languages