Task 1: Text Normalization for Swiss German


We invite you to participate in the 2021 shared task on Text Normalization for Swiss German which will be held at SwissText 2021.

Written Swiss German is not standardized and varies across authors and their dialects and its use is almost exclusively constrained to communication on social media or via text messaging.  Many corpora will therefore contain many distinct surface forms for the same word which can make their analysis challenging. It is therefore desirable to be able to normalize them to a single common surface form.

We collected Swiss German utterances from social media and two annotators mapped every token to a corresponding form in Standard German (see examples below). The task is to build models that can perform such a mapping automatically. This is different from translation since the resulting normalized utterance will in general not be grammatically correct Standard German as word order is preserved.

More details can be found here.

Organizers