Inhalt des Dokuments
Gabriel Mittag
Biography
Gabriel Mittag received his B.Sc. and M.Sc. degree in electrical and electronic engineering at the Technische Universität Berlin. During his master's degree he spent two semesters at the RMIT University in Melbourne, Australia and focused primarily on biomedical and speech signal processing. Since 2016 he is employed as research assistant at the Quality and Usability Lab at the TU Berlin, where he works towards a PhD in the field of Quality of Experience (QoE) of speech communication services. His main interests are in psychoacoustics, quality evaluation, signal processing, and machine learning.
Research Fields
- Perceived Quality of Speech
- Speech and Signal Processing
Research Topic
- Diagnostic Quality of Transmitted Speech
Projects
- Qualitätsattribute und Gesamtqualität übertragener Sprache: Erkenntnistransfer-Projekt (DFG)
- User-Friendly Estimation of Speech Quality for Telecommunications and Network Carriers (BMBF)
- PIUS: Problemidentifkation übertragener Sprache (BMBF)
Prizes
- DAGA-Posterpreis: G. Mittag, F. Köster, S. Möller, "Non-intrusive Estimation of the Perceptual Dimension Coloration", DAGA 2016.
- Best Paper Award: F. Köster, G. Mittag, T. Polzehl, S. Möller, "Non-intrusive Estimation of Noisiness as a Perceptual Quality Dimension of Transmitted Speech", PQS 2016.
Teaching
- Speech Communication (Exercise)
Open Bachelor/Master theses:
Contact
Email: gabriel.mittag@tu-berlin.de
Publications
Zitatschlüssel | zequeirajimenez2018d |
---|---|
Autor | Zequeira Jiménez, Rafael and Mittag, Gabriel and Möller, Sebastian |
Buchtitel | Proceedings of IEEE International Symposium on Multimedia (ISM) |
Seiten | 175–179 |
Jahr | 2018 |
ISBN | 978-1-5386-6857-3 |
DOI | 10.1109/ISM.2018.00-16 |
Adresse | Piscataway, NJ |
Monat | dec |
Notiz | electronic |
Verlag | IEEE |
Serie | ISM |
Wie herausgegeben | Fullpaper |
Zusammenfassung | Crowdsourcing (CS) has established as a powerful tool to collect human input for data acquisition and labeling. However, it remains the question about the validity of the data collected in a CS platform. Sometimes, the users work carelessly or they try to tweak the system to maximize their profits. This paper reports on whether the number of speech stimuli presented to the listeners has an impact on the user perception of certain degradation conditions applied to the speech signal. To this end, a crowdsourcing study has been conducted with 209 listeners that were divided in three non-overlapping user groups, each of which was presented with tasks containing a different number of stimuli: 10, 20, or 40. Listeners were asked to rate speech stimuli with respect to their overall quality and the ratings were collected on a 5-point scale in accordance with ITU-T Rec. P.800. Workers assessed the speech stimuli of the database 501 from ITU-T Rec. P.863. Additionally, the influence of certain speech signal characteristics, such as interruptions and bandwidth, on the quality perception of the workers was investigated. |