Wiktextract: A utility for extracting data from Wiktionary

Forskningsoutput: Icke-textbaserad outputProgramvaraVetenskaplig

Sammanfattning

This tool extracts glosses, parts-of-speech, declension/conjugation information when available, translations for all languages when available, pronunciations (including audio file links), qualifiers including usage notes, word forms, links between words including hypernyms, hyponym, holonyms, meronyms, related words, derived terms, compounds, alternative forms, etc. For many classes of words, a word sense is annotated with specific information such as what ward it is a form of, what is the RGB value of the color it represents, what is the numeric value of the number, what SI unit it represents, etc.
Originalspråkengelska
Utgivningsortgithub
FörlagTatu Ylonen
StatusPublicerad - 1 jan. 2019
MoE-publikationstypI2 ICT-programvara

Vetenskapsgrenar

  • 6121 Språkvetenskaper

Citera det här