Verbal and Nonverbal Communication Behaviours: COST Action 2102 International Workshop, Vietri sul Mare, Italy, March 29-31, 2007, Revised Selected and Invited Papers 3540764410, 9783540764410


111 79 34MB

English Pages [338] Year 2007

Report DMCA / Copyright

DOWNLOAD PDF FILE

Table of contents :
Title Page
Preface
organization
Table of Contents
COST 2102: Cross-Modal Analysis of Verbal and Nonverbal Communication (CAVeNC)
Prelude
Background
Related European Initiatives
Related Cost Actions
Objectives and Benefits
Main Outcomes and Benefits
Milestones (in Bold) and Outputs of the Action
Scientific Programme
Working Groups
Organisation
Timetable
Dimensions
Dissemination Plans
Annotation Schemes for Verbal and Non-Verbal Communication: Some General Issues
Introduction
What Is a Coding Scheme?
Theory and Completeness
Coding Scheme Semantics, Criteria
Meta-data
Consolidated Coding Schemes
Previous Work
Current Trends
Increasing Coding Activity
Project Examples
Towards Consolidation and Standards
Future Challenges
References
Presenting in Style by Virtual Humans
Introduction
Related Work
Styled Virtual Humans
What Decides the Style?
Style Displayed in Nonverbal Behavior
The GESTYLE Language to Declare Style
The Definition of Multimodal Gestures
Gesture Dictionaries
Different and Varied Nondeterministic Behavior
Implementation and Examples
Summary and Further Work
References
Analysis of Nonverbal Involvement in Dyadic Interactions
Nonverbal Involvement and Interaction Modelling
Dynamic Patterns of Prosody in Patient-Therapist-Interactions
Intrapersonal Conflicts Expressed Through Emotional Overinvolvement
Design and Method
Conflict Regulation in Children's Interactions Being Close Friends
Nonverbal Involvement and Interaction Conflicts
Design and Method
Pilot Studies
Conflict Episode in Dyadic Child Interaction
Method
Results
Summary and Outlook
Children’s Perception of Musical Emotional Expressions
Introduction
Materials and Methods
Participants
Stimuli
Procedure
Results
Conclusions
References
Emotional Style Conversion in the TTS System with Cepstral Description
Introduction
Emotional Style Conversion Method
Building of the ESP Database
Application of ESP in TTS Synthesis
Experiments and Results
Conclusion
References
Meaningful Parameters in Emotion Characterisation
Introduction
Databases
Database Description
Subjective Evaluation
Meaningful Parameters to Represent Emotion
Classification Experiments
Conclusion
References
Prosodic and Gestural Expression of Interactional Agreement
Introduction
Agreement in Family Relationships and in Organizational Structures
Chronic Aggressions and Conflicts
Support and Agreement
Agreement in Dyadic Interaction: State of Research in Social and Clinical Psychology
Prosodic and Visual Measures of Conversational Interaction Parameters
Prosodic Measures
Visual Measures
Conclusion
References
Gesture, Prosody and Lexicon in Task-Oriented Dialogues: Multimedia Corpus Recording and Labelling
Introduction
Project Overview
The Design of the Tasks
Video and Audio Recordings
Labelling System
Orthographic Transcription
Dialogue Acts
Lexicon, Derivational System and Syntax
Rhythm
Intonation
Gestures
Applications of the Corpus
References
Egyptian Grunts and Transportation Gestures
Introduction
The First Egyptian Grunts Dictionary
Hand Gestures and Transportation in Cairo
Conclusions
References
On the Use of NonVerbal Speech Sounds in Human Communication
Introduction
A Database for Paralinguistic Research
Prosody of Paralinguistic Speech
Voice Quality and Paralinguistic Speech
Synthesis of Paralinguistic Speech
Recognition/Classification of Paralinguistic Properties of Speech
Analysis of Paralinguistic Speech
Assessment and Perception of Paralinguistic Speech
Typology of Paralinguistic Speech
Applications
Conclusions
Speech Spectrum Envelope Modeling
Introduction
Spectral Smoothing by Hidden Homomorphic Processing
Spectral Envelope Estimation
Conclusion
References
Using Prosody in Fixed Stress Languages for Improvement of Speech Recognition
Introduction
Usage of Prosody in Speech Technology
Underlined Role of Prosody in Agglutinating Languages
Exploiting Prosodic Information
Acoustic Prosodic Features
Acoustic Prosodic Pre-processing
Prosodic Speech Material
Automatic Prosodic Segmentation
Speech Recognition Using Automatic Prosodic Segmentation
First Pass Recognition
Lattice Rescoring
Second Pass Recognition
Experimental Tests
Speech Recognition Experiment Using Prosody
Reliability of Prosodic Segmentation
Conclusion
Single-Channel Noise Suppression by Wavelets in Spectral Domain
Introduction
Modulation Spectrum and Its Analysis
Enhancing the Estimate of Power Spectral Density
Application of Enhanced Noise Estimation in the Spectral Subtraction Method, Using the Wavelet Transform
Experimental Results
Conclusions
References
Voice Source Change During Fundamental Frequency Variation
Introduction
Voice Source Characterisation
Measures from the Voice Source Waveform
Measures from the Speech Spectrum
Analysis Studies of Glottal Source Variation with Changing Fundamental Frequency
Method
Analysis
Results
Discussion
Conclusion
References
A Gesture-Based Concept for Speech Movement Control in Articulatory Speech Synthesis
Introduction
Three-Dimensional Model of the Vocal Tract and Acoustic Simulation
Vocal Tract Model
Acoustic Simulation
The Gesture-Based Control Concept
Learning Gesture-Based Control Using Sensory Feedback and External Speech Signals
Discussion
References
A Novel Psychoacoustically Motivated Multichannel Speech Enhancement System
Introduction
Problem Formulation and GSC Beamforming
Proposed Framework Overview
Multichannel Signal Detection
Perceptually Optimal Spectral Amplitude (PO-SA) Estimation
Simulations and Results
Conclusion
References
Analysis of Verbal and Nonverbal Acoustic Signals with the Dresden UASR System
Introduction
Unified Approach to Speech Synthesis and Recognition (UASR)
System Concept
Structure Learning as Core Problem
FSM Implementation
Some Experiments and Applications
Speech Server
Parametric Speech Synthesis
Applying Speech Models and Pronunciation Dictionaries in Spontaneous Speech Synthesis
Classification of Non-verbal Acoustic Signals
Non-speech Signals
Speech Prosody
Interaction Research
Conclusion
VideoTRAN: A Translation Framework for Audiovisual Face-to-Face Conversations
Introduction
The VideoTRAN Audiovisual Translation Framework
Target Language Audiovisual Alignment
AudioVisual Speech Synthesis
The VideoTRAN Videophone Client
Conclusions
References
Spoken and Multimodal Communication Systems in Mobile Settings
Introduction
Speech-Based and Multimodal Mobile Applications
Mobile Service Infrastructure
Models for Distributed Mobile Spoken Dialogue Systems
Generic Model for Distributed Mobile Multimodal Speech Systems
Mobile Multimodal Timetable Service
Dialogue Markup
Server System Architecture
Distribution in the Stopman System
Conclusions
References
Multilingual Augmentative Alternative Communication System
Introduction
MACS – PES (Multilingual Augmentative Communication System)
The Basic Features of MACS
Interface
Personalization
Keyboard Layout
Speech Production
Prediction
Scanning
Vocabulary
Acronymic Writing
Backup Module
Advantages of the System
Additional Features
What Is New in the Software
Future Enhancements of MACS
References
Analysis and Synthesis of Multimodal Verbal and Non-verbal Interaction for Animated Interface Agents
Introduction
KTH Multimodal Speech Synthesis
Data Collection and Data-Driven Visual Synthesis
Improving Intelligibility and Information Presentation
Analyzing and Modelling Visual Cues for Prominence
Perception Experiments
Production Studies with Prominence in Several Expressive Modes
Visual Cues for Feedback
Future Challenges
References
Generating Nonverbal Signals for a Sensitive Artificial Listener
Introduction
Background on Listening Behavior for Embodied Agents
Data Collection and Analysis in AMI and Humaine
Annotation of Listening Behavior
Impression Management
Design of a Listening Behavior Model
A Remote Sensitive Artificial Listening System
Conclusions
References
Low-Complexity Algorithms for Biometric Recognition
Introduction
Face Recognition
Walsh Hadamard Transform
Experimental Results
Database
Conditions of the Experiments
Reduction of Dimensionality Using DCT, WHT and Eigenfaces
Practical Implementation
Conclusions
References
Towards to Mobile Multimodal Telecommunications Systems and Services
Introduction
Multimodal Communicator Architecture
Servers of the Communicator
PDA-Side Services
Examples of Multimodal Services
Railway Scheduler Service
Weather Service
Conclusion and Future Work
References
Embodied Conversational Agents in Wizard-of-Oz and Multimodal Interaction Applications
Introduction
Image Synthesis for Embodied Conversational Agent
Speech Recognition
Off-Line Viseme Recognition
On-Line Viseme Recognition
Speech Synthesis
Speech-to-Speech Translation (S2ST) as an Interface in Multilingual Virtual Collaborative Environments
Speech Technology for Those Embodied Conversational Agents Used for Human-Computer Interaction in the Slovenian Language
Speech Recognition System
PLATTOS TTS System
BABILON Speech-to-Speech Translation System
Implementation Examples
Wizard-of-Oz
Major-Domo
Conversational Agent Lili
Conclusion
References
Telling Stories with a Synthetic Character: Understanding Inter-modalities Relations
Introduction
The Synthetic Storyteller
Method
Design
Participants
Material
Procedure
Results
Discussion
Analysis
Study Limitations
Future Work
Annex
Author Index

Verbal and Nonverbal Communication Behaviours: COST Action 2102 International Workshop, Vietri sul Mare, Italy, March 29-31, 2007, Revised Selected and Invited Papers
 3540764410, 9783540764410

  • 0 0 0
  • Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up
File loading please wait...
Recommend Papers