Inhalt des Dokuments
Building a Speaker Clustering Tool by Employing Speaker Recognition Techniques
LOCATION: TEL, Auditorium 3 (20th floor),
Ernst-Reuter-Platz 7, 10587 Berlin
Date/Time: 23.10.2017, 14:15-15:00
SPEAKER: Renzo Verastegui (TU Berlin)
The analysis of voice similarity is useful in many research fields, such as speaker diarization, speech synthesis, and voice casting. Applications that measure the similarity of voice are extremely expensive to administer and also require a significant amount of expert listening. For these reasons, it is of main interest to be able to automate the detection of speech similarities in voice segments.
The aim of this Master Thesis is to build a speech clustering detection tool based
on speaker recognition state-of-the-art techniques: i-vectors and deep neural networks, among others. Finally, the relevance of the generated clusters is determined by a comparison to a previously conducted study of subjective similarity.