ArchiMob corpus Release 1

  • Yves Scherrer (Skapad av)
  • Tanja Samardžić (Skapad av)
  • Elvira Glaser (Skapad av)

Datauppsättning

Beskrivning

The ArchiMob corpus represents German varieties spoken on the territory of Switzerland. It is the first electronic resource containing long samples of transcribed text in Swiss German, intended to be used for studying spatial distribution of morphosyntactic features and for natural language processing. The size of the current version of the corpus is 528 381 tokens.
Datum som det gjorts tillgängligt12 aug 2016
FörlagUniversity Zurich
Datum för dataproduktion2006 - 2016
Geografisk täckningGerman-speaking Switzerland

Citera det här

Scherrer, Y. (Skapad av), Samardžić, T. (Skapad av), Glaser, E. (Skapad av) (12 aug 2016). ArchiMob corpus Release 1. University Zurich. 10.5281/zenodo.1158572