Laughter Synthesis using Mass-spring Model and Excitation Source Characteristics

TitleLaughter Synthesis using Mass-spring Model and Excitation Source Characteristics
Publication TypeConference Paper
Year of Publication2018
AuthorsJuhitha, K.., S.. Vekkot, M.J.. Yogesh, S.. Tripathi, and R.. Shashank
Conference Name7th International Conference on Advances in Computing, Communications and Informatics, ICACCI 2018
Date Published2018
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN Number9781538653142 (ISBN)
KeywordsExcitation source information, Excitation sources, Fundamental frequencies, Human computer interaction, Information filtering, Linear prediction analysis, Mass-spring model, Mass-spring models, Metals, Microphones, Prosody, Speech synthesis, Synthesized speech, Zero frequency filtering, Zero-frequency filtering

With growing interest, an amalgamation of the field of Human-Computer Interaction and Speech Synthesis is redefining the way humans interact with technology by exploring several facets of interaction and thus incorporating it in synthesized speech. One such facet is laughter - an important form of interaction that expresses emotion like humor, joy, amusement as well as sarcasm. The most significant excitation of a laughter signal occurs at the instant of glottal closure, and determination of these instants, known as epochs, along with instantaneous fundamental frequency is used for deriving the excitation source information - a method known as Zero Frequency Filtering. In the existing literature, modification of the characteristics of the excitation source information along with the LP residuals and prosody that involves changing the pitch and duration of the signal, form the rudimentary step for synthesizing laughter calls. In order to emulate the naturalness, these modifications are carried out by first examining the features of natural laughter. This paper proposes a method to generate laughter signal by concatenation of laughter calls using mass-spring model, as laughter calls make up a laughter signal. Using the mass-spring model, a fair CMOS is achieved making the synthesized laughter sound more natural. The synthesized laughter can be used to improve the perceivability of a synthesized speech in Human-Computer Interaction © 2018 IEEE.