A Derivational Model of Discontinuous Parsing

Mark-Jan Nederhof, Anssi Mikael Yli-Jyrä

Research output: Chapter in Book/Report/Conference proceedingChapterScientificpeer-review

Abstract

The notion of latent-variable probabilistic context-free derivation of syntactic structures is enhanced to allow heads and unrestricted discontinuities. The chosen formalization covers both constituent parsing and dependency parsing. The derivational model is accompanied by an equivalent probabilistic automaton model. By the new framework, one obtains a probability distribution over the space of all discontinuous parses. This lends itself to intrinsic evaluation in terms of perplexity, as shown in experiments.
Translated title of the contributionEpäjatkuvan jäsennyspuun johtoon perustuva tilastollinen malli
Original languageEnglish
Title of host publicationLanguage and Automata Theory and Applications : LATA 2017
EditorsCarlos Martín-Vide
Number of pages12
Place of PublicationBerlin
PublisherSpringer-Verlag
Publication date6 Mar 2017
Pages299-310
ISBN (Print)978-3-319-53732-0
ISBN (Electronic)978-3-319-53733-7
DOIs
Publication statusPublished - 6 Mar 2017
MoE publication typeA3 Book chapter

Publication series

NameLecture Notes in Computer Science
PublisherSpringer
ISSN (Print)0302-9743
ISSN (Electronic)1611-3349

Fields of Science

  • 111 Mathematics
  • grammars
  • perplexity
  • 113 Computer and information sciences
  • weighted automata
  • context-free derivation
  • discontinuity
  • derivation
  • 6121 Languages
  • parsing
  • syntactic structures

Cite this