TU Berlin

Quality and Usability LabReviewed Conference Papers

Page Content

to Navigation

Reviewed Conference Papers

go back to overview

A Dataset for Subjective Assessment of German Text Complexity
Citation key mohtaj2019b
Author Mohtaj, Salar and Naderi, Babak and Ensikat, Kaspar and Möller, Sebastian
Title of Book Dialog for Good (DiGo), Workshop on Speech and Language Technology Serving Society, Stockholm, 10 September, 2019.
Pages 1–6
Year 2019
Month sep
Note online
How Published Fullpaper
Abstract This paper presents TextComplexityDE, a text readability assessment dataset consisting of 1000 sentences in German that taken from 23 Wikipedia articles. The corpus could be used for developing text complexity predictors and automatic German text simplification. Text complexity predictor models have diverse applications such as choosing appropriate reading materials for people with intellectual disabilities. The dataset includes subjective assessment of different text-complexity aspects provided by German learners in level A to C. In addition, it contains manual simplification of 250 of those sentences provided by native speakers and subjective assessment of the simplified sentences by participants from the target group.
Link to publication Link to original publication Download Bibtex entry

go back to overview


Quick Access

Schnellnavigation zur Seite über Nummerneingabe