Abstract
Transition-based parsing of natural language uses transition systems to build directed annotation graphs (digraphs) for sentences. In this paper, we define, for an arbitrary ordered digraph, a unique decomposition and a corresponding linear encoding that are associated bijectively with each other via a new transition system. These results give us an efficient and succinct representation for digraphs and sets of digraphs. Based on the system and our analysis of its syntactic properties, we give structural bounds under which the set of encoded digraphs is restricted and becomes a context-free or a regular string language. The context-free restriction is essentially a superset of the encodings used previously to characterise properties of noncrossing digraphs and to solve maximal subgraph problems. The regular restriction with a tight bound is shown to capture the Universal Dependencies v2.4 treebanks in linguistics.
Translated title of the contribution | Järjestettyjen verkkojen siirtymäpohjainen koodaus ja formaalien kielten teoria |
---|---|
Original language | English |
Title of host publication | The 14th International Conference on Finite-State Methods and Natural Language Processing : Proceedings of the Conference |
Editors | Heiko Vogler, Andreas Maletti |
Number of pages | 14 |
Place of Publication | Stroudsburg |
Publisher | The Association for Computational Linguistics |
Publication date | 23 Sept 2019 |
Pages | 118–131 |
ISBN (Electronic) | 978-1-950737-96-3 |
Publication status | Published - 23 Sept 2019 |
MoE publication type | A4 Article in conference proceedings |
Event | International Conference on Finite State Methods and Natural Language Processing - Dresden, Germany; Duration: 23 Sept 2019 → 25 Sept 2019; Conference number: 14 |
Publication series
Name | Proceedings of the International Conference on Finite-State Methods and Natural Language Processing |
---|---|
Publisher | Association for Computational Linguistics |
Bibliographical note
The ISBN of the host publication can be found on the web site of the conference (https://wwwtcs.inf.tu-dresden.de/fsmnlp2019/accepted_papers/).
Fields of Science
- 113 Computer and information sciences
- 6121 Languages
Datasets
- Universal Dependencies version 2.4
  Rueter, J. (Creator), Tyers, F. M. (Contributor) & Zeman, D. (Contributor), Universal Dependencies Consortium, 15 May 2019
  http://hdl.handle.net/11234/1-2988
  Dataset
Activities
- The Rachel and Selim Benin School of Engineering and Computer Science, The Hebrew University of Jerusalem, Israel
  Anssi Yli-Jyrä (Visiting researcher), Mirjam Ronning (Other role), Halvor Ronning (Other role) & Omri Abend (Other role)
  1 Jan 2020 → 17 Jan 2020
  Activity: Visiting an external institution types › Academic visit to other institution