ArchiMob corpus Release 1

  • Yves Scherrer (Creator)
  • Tanja Samardžić (Creator)
  • Elvira Glaser (Creator)

Dataset

Description

The ArchiMob corpus represents German varieties spoken on the territory of Switzerland. It is the first electronic resource containing long samples of transcribed text in Swiss German, intended to be used for studying spatial distribution of morphosyntactic features and for natural language processing. The size of the current version of the corpus is 528 381 tokens.
Date made available12 Aug 2016
PublisherUniversity Zurich
Date of data production2006 - 2016
Geographical coverageGerman-speaking Switzerland

Cite this

Scherrer, Y. (Creator), Samardžić, T. (Creator), Glaser, E. (Creator) (12 Aug 2016). ArchiMob corpus Release 1. University Zurich. 10.5281/zenodo.1158572