Projects per year
Abstract
We introduce Semstem, a new method for the
reconstruction of so called stemmatic trees, i.e., trees encoding
the copying relationships among a set of textual variants.
Our method is based on a structural expectation-maximization
(structural EM) algorithm. It is the first computer-based
method able to estimate general latent tree structures, unlike
earlier methods that are usually restricted to bifurcating trees
where all the extant texts are placed in the leaf nodes. We
present experiments on two well known benchmark data
sets, showing that the new method outperforms current stateof-
the-art both in terms of a numerical score as well as
interpretability.
reconstruction of so called stemmatic trees, i.e., trees encoding
the copying relationships among a set of textual variants.
Our method is based on a structural expectation-maximization
(structural EM) algorithm. It is the first computer-based
method able to estimate general latent tree structures, unlike
earlier methods that are usually restricted to bifurcating trees
where all the extant texts are placed in the leaf nodes. We
present experiments on two well known benchmark data
sets, showing that the new method outperforms current stateof-
the-art both in terms of a numerical score as well as
interpretability.
Original language | English |
---|---|
Title of host publication | 2011 IEEE 11th International Conference on Data Mining (ICDM 2011) |
Editors | Diane Cook, Jian Pei, Wei Wang, Osmar Zaïane, Xindong Wu |
Number of pages | 10 |
Publisher | IEEE Computer Society |
Publication date | 11 Dec 2011 |
Pages | 567-576 |
ISBN (Print) | 9781457720758 |
DOIs | |
Publication status | Published - 11 Dec 2011 |
MoE publication type | A4 Article in conference proceedings |
Event | Unknown host publication - , Canada Duration: 1 Jan 1800 → … |
Fields of Science
- 113 Computer and information sciences
Projects
- 1 Finished
-
Stemmatologian algoritmiset menetelmät
Roos, T., Myllymäki, P., Heikkilä, T. & Zou, Y.
01/01/2009 → 31/12/2011
Project: Research project