Lingua-Align: An Experimental Toolbox for Automatic Tree-to-Tree Alignment

Jörg Tiedemann

Research output: Chapter in Book/Report/Conference proceedingConference contributionScientificpeer-review

Abstract

In this paper we present an experimental toolbox for automatic tree-to-tree alignment based on local classification andalignment in-ference. The aligner implements a recurrent architecture for structural prediction using history features and a sequential classificationprocedure. The discriminative base classifier uses a log-linear model which enables simple integration of various features extracted fromthe data. The Lingua-Align toolbox provides a flexible framework for feature extraction including contextual properties and implementsseveral alignment inference procedures. Various settingsand constraints can be controlled via a simple frontend or called from externalscripts. Lingua-Align supports different treebank formats and includes additional tools for conversion and evaluation. In our experimentswe can show that our tree aligner produces results with high quality and outperforms unsupervised techniques proposed otherwise. Italso integrates well with another existing tool for manual tree alignment which makes it possible to quickly integrate additional trainingmaterial and to run semi-automatic alignment strategies
Original languageEnglish
Title of host publicationProceedings of the International Conference on Language Resources and Evaluation, LREC 2010, 17-23 May 2010, Valletta, Malta
EditorsNicoletta Calzolari (Conference Chair), Khalid Choukri, Bente Maegaard, Joseph Mariani, Jan Odijk, Stelios Piperidis, Mike Rosner, Daniel Tapias
Number of pages8
PublisherEuropean Language Resources Association (ELRA)
Publication date1 May 2010
Pages736-743
ISBN (Print)2-9517408-6-7
Publication statusPublished - 1 May 2010
Externally publishedYes
MoE publication typeA4 Article in conference proceedings
EventLREC 2010 - Malta, Malta
Duration: 17 May 201023 May 2010

Cite this