Lingua-Align: An Experimental Toolbox for Automatic Tree-to-Tree Alignment

Forskningsoutput: Kapitel i bok/rapport/konferenshandlingKonferensbidragVetenskapligPeer review

Sammanfattning

In this paper we present an experimental toolbox for automatic tree-to-tree alignment based on local classification and alignment inference. The aligner implements a recurrent architecture for structural prediction using history features and a sequential classification procedure. The discriminative base classifier uses a log-linear model which enables simple integration of various features extracted from the data. The Lingua-Align toolbox provides a flexible framework for feature extraction including contextual properties and implements several alignment inference procedures. Various settings and constraints can be controlled via a simple frontend or called from external scripts. Lingua-Align supports different treebank formats and includes additional tools for conversion and evaluation. In our experiments we can show that our tree aligner produces results with high quality and outperforms unsupervised techniques proposed otherwise. It also integrates well with another existing tool for manual tree alignment which makes it possible to quickly integrate additional training material and to run semi-automatic alignment strategies.
Originalspråkengelska
Titel på värdpublikationProceedings of the International Conference on Language Resources and Evaluation, LREC 2010, 17-23 May 2010, Valletta, Malta
RedaktörerNicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, Daniel Tapias
Antal sidor8
FörlagEuropean Language Resources Association (ELRA)
Utgivningsdatum1 maj 2010
Sidor736-743
ISBN (tryckt)2-9517408-6-7
StatusPublicerad - 1 maj 2010
Externt publiceradJa
MoE-publikationstypA4 Artikel i en konferenspublikation
EvenemangLREC 2010 - Malta, Malta
Varaktighet: 17 maj 201023 maj 2010

Citera det här