Reviewed Journal Papers

Schönweiler, R., Kaese, S., Möller, S., Rinscheid, A. and Ptok, M. (1996). Neuronal Networks and Self-Organizing Maps: New Computer Techniques in the Acoustic Evaluation of the Infant Cry. Int. J. of Pediatric Otorhinolaryngology. Elsevier, 1–11.

Gierlich, H.-W. and Perkins, M. (1999). Speech Communication: Speech Quality in Telecommunications. Journal of the Acoustical Society of America (JASA). AIP Publ. and ASA, 974.

Möller, S. and Schönweiler, R. (1999). Analysis of Infant Cries for the Early Detection of Hearing Impairment. Speech Communication. Elsevier Science Publishers B. V., 175–193.

Wang, G., Rabenstein, R., Strobel, N. and Spors, S. (2000). Object Localization by Joint Audio-Video Signal Processing. In Vision, Modelling and Visualization, 97–104.


Möller, S., Jekosch, U., Mersdorf, J. and Kraft, V. (2001). Auditory Assessment of Synthesized Speech in Application Scenarios: Two Case Studies. Speech Communication. Elsevier Science Publishers B. V., 229–246.

Spors, S., Rabenstein, R. and Strobel, N. (2001). A Multi-Sensor Object Localization System. In Vision, Modelling and Visualization (VMV). infix, 19–26.

Strobel, N., Spors, S. and Rabenstein, R. (2001). Joint audio-video object localization and tracking. IEEE Signal Processing Magazine, 22–31.


Möller, S. and Berger, J. (2002). Describing Telephone Speech Codec Quality Degradations by Means of Impairment Factors. Journal of the Audio Engineering Society. Audio Engineering Society, 667–680.

Möller, S. and Raake, A. (2002). Telephone Speech Quality Prediction: Towards Network Planning and Monitoring Models for Modern Network Scenarios. Speech Communication. Elsevier Science Publishers B. V., 47–75.

Möller, S. (2002). Analytic Assessment of Telephone Transmission Impact on ASR Performance Using a Simulation Model. Speech Communication. Elsevier Science Publishers B. V., 441–459.

Spors, S., Teutsch, H. and Rabenstein, R. (2002). High-Quality Acoustic Rendering with Wave Field Synthesis. In Vision, Modelling and Visualization (VMV). infix, 101–108.


Sieger, H. (2003). X11 and NEXTSTEP. NeXTeZine. Markus Schmidt and Joacim Melin, 10–17.

Kuntz, A., Spors, S. and Teutsch, H. (2004). Die akustische Mensch-Maschine-Schnittstelle. Elektronik, 60–65.

Möller, S. and Skowronek, J. (2004). An Analysis of Quality Prediction Models for Telephone-Based Spoken Dialogue Systems. Acta Acustica united with Acustica. Hirzel, 1112–1130.

Möller, S. (2004). Telephone Transmission Impact on Synthesized Speech: Quality Assessment and Prediction. Acta Acustica united with Acustica. Hirzel, 121–136.

Möller, S., Krebber, J. and Smeele, P. (2005). Evaluating the Speech Output Component of a Smart-Home System. Speech Communication. Elsevier Science Publishers B. V., 1–27.

Velisavljevic, V., Beferull-Lozano, B., Vetterli, M. and Dragotti, P. L. (2006). Directionlets: anisotropic multi-directional representation with separable filtering. IEEE Transactions on Image Processing. IEEE.

Möller, S., Raake, A., Kitawaki, N., Takahashi, A. and Wältermann, M. (2006). Impairment Factor Framework for Wideband Speech Codecs. IEEE Trans. Audio, Speech and Language Processing. IEEE, 1969–1976.

Rabenstein, R., Steffen, P. and Spors, S. (2006). Representation of Two-Dimensional Wave Fields by Multidimensional Signals. EURASIP Signal Processing Magazine, 1341–1351.

Ballagas, R., Rohs, M., Sheridan, J. G. and Borchers, J. (2006). The Smart Phone: A Ubiquitous Input Device. IEEE Pervasive Computing, 70–77.

Hußlein, S., Hurtienne, J., Israel, J. H., Mohs, C., Kindsmüller, M. C., Meyer, H. A., Naumann, A. and Pohlmeyer, A. (2007). Intuitive Benutzung - nur ein Schlagwort? design report, 26–27.

Mohs, C., Naumann, A. and Kindsmüller, M. C. (2007). Mensch-Technik-Interaktion: intuitiv erwartungskonform oder vertraut? MMI-Interaktiv Journal, 25–35.

Naumann, A., Brunstein, A. and Krems, J. F. (2007). DEWEX: A System for Designing and Conducting Web based Experiments. Behavior Research Methods, 248–258.

Spors, S., Rabenstein, R., Buchner, H. and Herbordt, W. (2007). Active listening room compensation for massive multichannel sound reproduction systems using Wave-Domain Adaptive Filtering. Journal of the Acoustical Society of America (JASA), 354–369.

Velisavljevic, V., Beferull-Lozano, B. and Vetterli, M. (2007). Space-frequency quantization for image compression with directionlets. IEEE Trans. on Image Processing. IEEE.

Rohs, M. (2007). Marker-Based Embodied Interaction for Handheld Augmented Reality Games. Journal of Virtual Reality and Broadcasting.

Möller, S., Smeele, P., Boland, H. and Krebber, J. (2007). Evaluating Spoken Dialogue Systems According to De-Facto Standards: A Case Study. Computer Speech and Language, 26–53.

Falk, T. and Möller, S. (2008). Towards Signal-Based Instrumental Quality Diagnosis for Text-to-Speech Systems. IEEE Signal Processing Letters. IEEE, 781–784.

Möller, S., Engelbrecht, K.-P. and Schleicher, R. (2008). Predicting the Quality and Usability of Spoken Dialogue Services. Speech Communication. Elsevier Science Publishers B. V., 730–744.

Rath, M. and Schleicher, R. (2008). On the Relevance of Auditory Feedback for Quality of Control in a Balancing Task. Acta Acustica united with Acustica. S. Hirzel Verlag, 12–20.

Schleicher, R., Galley, N., Briest, S. and Galley, L. (2008). Blinks and saccades as indicators of fatigue in sleepiness warnings: looking tired? Ergonomics, 982–1010.

Möller, S., Kim, D.-S. and Malfait, L. (2008). Estimating the Quality of Synthesized and Natural Speech Transmitted Through Telephone Networks Using Single-ended Prediction Models. Acta Acustica united with Acustica. Hirzel, Stuttgart, 21–31.

Rohs, M. and Essl, G. (2008). Sensing-based Interaction for Information Navigation on Handheld Displays. Journal on Advances in Human-Computer Interaction (AHCI), 1–11.

Ahrens, J. and Spors, S. (2008). An Analytical Approach to Sound Field Reproduction using Circular and Spherical Loudspeaker Distributions. Acta Acustica united with Acustica. S. Hirzel, 988–999.

Essl, G. and Rohs, M. (2009). Interactivity for Mobile Music Making. Organised Sound. Cambridge University Press, 197–207.

Möller, S., Cote, N., Gautier-Turbin, V., Kitawaki, N. and Takahashi, A. (2009). Instrumental Estimation of Equipment Impairment Factors for Wideband Speech Codecs. Speech Communication. Elsevier, 1–27.

Weiss, B., Möller, S., Raake, A., Berger, J. and Ullmann, R. (2009). Modeling Call Quality for Time-Varying Transmission Characteristics Using Simulated Conversational Structures. Acta Acustica united with Acustica, 1140–1151.

(2009). Enhanced Phone Posteriors for Improving Speech Recognition Systems. IEEE Transactions on Speech and Audio Processing.

Engelbrecht, K.-P., Quade, M. and Möller, S. (2009). Analysis of a New Simulation Approach to Dialogue System Evaluation. Speech Communication. Elsevier Science Publishers B. V., 1234–1252.

Kray, C., Rohs, M., Hook, J. and Kratz, S. (2009). Bridging the Gap between the Kodak and the Flickr Generations: A Novel Interaction Technique for Collocated Photo Sharing. International Journal of Human-Computer Studies (IJHCS).

Velisavljevic, V. (2009). Low-complexity iris coding and recognition based on directionlets. IEEE Trans. on Information Forensics and Security. IEEE, 410–417.

Rohs, M., Schleicher, R., Schöning, J., Essl, G., Naumann, A. and Krüger, A. (2009). Impact of Item Density on the Utility of Visual Context in Magic Lens Interactions. Personal and Ubiquitous Computing. Springer, 633–646.

Huber, S., Schulz, M., Wittbrodt, N. and Urvoy, C. (2009). Die Beurteilung der Informationsdarstellung auf Airport Moving Maps nach DIN EN ISO 9241-12. MMI-Interaktiv, Sonderausgabe.


Strohmeier, D., Jumisko-Pyykkö, S. and Kunze, K. (2010). Open Profiling of Quality: A Mixed Method Approach to Understanding Multimodal Quality Perception. Advances in Multimedia. Hindawi, 28.

Wältermann, M., Raake, A. and Möller, S. (2010). Quality Dimensions of Narrowband and Wideband Speech Transmission. Acta Acustica united with Acustica.

Weinland, D., Ronfard, R. and Boyer, E. (2010). A Survey of Vision-Based Methods for Action Representation, Segmentation and Recognition. Computer Vision and Image Understanding. Elsevier.

Weiss, B., Kühnel, C., Wechsung, I., Fagel, S. and Möller, S. (2010). Quality of Talking Heads in Different Interaction and Media Contexts. Speech Communication, 481–492.

Wolters, M., Engelbrecht, K.-P., Gödde, F., Möller, S., Naumann, A. and Schleicher, R. (2010). Making it easier for older people to talk to smart homes: the effect of early help prompts. Universal Access in the Information Society. Springer Berlin / Heidelberg, 311–326.

Cobos, M., Lopez, J. J. and Spors, S. (2010). A Sparsity-Based Approach to 3-D Binaural Sound Synthesis Using Time-Frequency Array Processing. EURASIP Journal on Advances in Signal Processing, 1–13.

Engelbrecht, K.-P. and Möller, S. (2010). Sequential Classifiers for the Prediction of User Judgments about Spoken Dialog Systems. Speech Communication. Elsevier, 816–833.

