The perception of emotional prosody in Mandarin Chinese words and sentences
Abstract
I Introduction
II The perception of emotional prosody
1 Effects of emotion type and syllable length
2 Second language experience effect
3 Some methodological limitations in previous research
III The current study
IV Methods
1 Participants
2 Stimuli
Example | Monosyllabic word | Disyllabic word | Trisyllabic word |
---|---|---|---|
Pinyin | xiū | shōu yīn | zhāng zhōng bīn |
IPA | ɕəu1 | ʂəu1 jin1 | ʈʂaŋ1 ʈʂʷuŋ1 pʲin1 |
Chinese Character(s) | 修 | 收 音 | 张 中 斌 |
English translation | repair | Receive sound | Zhang Zhongbin |
Example | Sentence | ||
Pinyin | zhāng zhōng bīn | xīng qī tiān xiū | shōu yīn jī |
IPA | ʈʂaŋ1 ʈʂʷuŋ1 pʲin1 | ɕəŋ1 tɕʰiː1 tʰʲæn1 ɕəu1 | ʂəu1 jin1 tɕiː1 |
Chinese characters | 张 中 斌 | 星 期 天 修 | 收 音 机 |
English translation | Zhang Zhongbin repairs radio on Sunday. |
F0 (Hz) | Intensity (dB SPL) | Duration (ms) | |
---|---|---|---|
Syllable length: | |||
monosyllable | 268.377 (56.206) | 54.322 (4.655) | 619.269 (140.522) |
disyllable | 292.143 (59.764) | 54.246 (3.744) | 717.842 (151.506) |
trisyllable | 295.402 (60.607) | 55.005 (3.293) | 898.905 (231.538) |
sentence | 277.690 (46.567) | 55.906 (3.516) | 2461.395 (412.817) |
Emotion type: | |||
neutral | 239.988 (32.043) | 52.212 (2.660) | 1050.859 (552.316) |
joy | 346.544 (43.131) | 55.546 (2.440) | 897.286 (543.131) |
anger | 327.620 (29.362) | 58.650 (2.895) | 719.172 (454.840) |
sadness | 241.779 (22.770) | 52.359 (2.327) | 1,176.570 (680.702) |
3 Procedure
4 Analysis
V Results
1 Descriptive statistical results



2 Inferential statistical results
Fixed effects: | Estimate | SE | z | Pr (>|z|) |
---|---|---|---|---|
(Intercept) | 2.999 | 0.170 | 17.624 | < .001*** |
Group: native | 0.660 | 0.211 | 3.121 | .002** |
Group: L2 | 0.740 | 0.213 | 3.475 | .001*** |
Emo_type: joy | –0.709 | 0.085 | –8.288 | < .001*** |
Emo_type: anger | 0.414 | 0.109 | 3.786 | < .001*** |
Emo_type: sadness | 0.179 | 0.105 | 1.705 | .088 |
Syll_length: monosyllable | –0.900 | 0.171 | –5.265 | < .001*** |
Syll_length: disyllable | –0.006 | 0.117 | –0.047 | .963 |
Syll_length: trisyllable | 0.255 | 0.127 | 2.012 | .044* |
joy × monosyllable | 0.232 | 0.136 | 1.701 | .089 |
anger × monosyllable | 0.905 | 0.180 | 5.027 | < .001*** |
sadness × monosyllable | –0.593 | 0.149 | –3.990 | < .001*** |
joy × disyllable | 0.081 | 0.101 | 0.803 | .422 |
anger × disyllable | –0.384 | 0.119 | –3.222 | .001** |
sadness × disyllable | 0.239 | 0.124 | 1.917 | .055 |
joy × trisyllable | –0.004 | 0.111 | –0.033 | .974 |
anger × trisyllable | –0.116 | 0.135 | –0.859 | .390 |
sadness × trisyllable | 0.150 | 0.138 | 1.091 | .275 |
native × joy | –0.499 | 0.107 | –4.674 | < .001*** |
L2 × joy | 0.317 | 0.114 | 2.777 | .006** |
native × anger | 0.256 | 0.144 | 1.773 | .076 |
L2 × anger | –0.214 | 0.134 | –1.603 | .109 |
native × sadness | –0.242 | 0.129 | –1.882 | .060 |
L2 × sadness | 0.019 | 0.134 | 0.144 | .885 |
Random effects | Variance | SD | ||
ID | 1.082 | 1.040 | ||
Item | 0.128 | 0.357 |
Group contrast | diff | lwr | upr | P adj |
---|---|---|---|---|
L2–non-native | 0.172 | 0.153 | 0.190 | <.001*** |
Native–non-native | 0.160 | 0.142 | 0.178 | <.001*** |
Native–L2 | –0.011 | –0.030 | 0.007 | .298 |
Estimate | SE | z | Pr(>|z|) | |
---|---|---|---|---|
(Intercept) | 3.581 | 0.294 | 12.168 | < .001*** |
Emo_type: joy | –1.255 | 0.164 | –7.651 | < .001*** |
Emo_type: anger | 0.663 | 0.255 | 2.596 | .009** |
Emo_type: sadness | 0.140 | 0.240 | 0.583 | .560 |
Syll_length: monosyllable | –0.794 | 0.283 | –2.810 | .005** |
Syll_length: disyllable | 0.362 | 0.200 | 1.807 | .071 |
Syll_length: trisyllable | 0.267 | 0.210 | 1.274 | .203 |
joy × monosyllable | –0.423 | 0.295 | –1.433 | .152 |
anger × monosyllable | 1.548 | 0.598 | 2.591 | .009** |
sadness × monosyllable | –0.701 | 0.360 | –1.947 | .052 |
joy × disyllable | 0.187 | 0.227 | 0.825 | .409 |
anger × disyllable | –0.325 | 0.334 | –0.974 | .330 |
sadness × disyllable | –0.250 | 0.307 | –0.816 | .414 |
joy × trisyllable | 0.231 | 0.240 | 0.964 | .335 |
anger × trisyllable | –0.041 | 0.364 | –0.113 | .910 |
sadness × trisyllable | –0.162 | 0.323 | –0.503 | .615 |
Random effects | Variance | SD | ||
ID | 1.118 | 1.057 | ||
Item | 0.121 | 0.348 |
Contrast | Estimate | SE | z | p |
---|---|---|---|---|
Emotion type: | ||||
joy–anger | –1.918 | 0.343 | –5.583 | < .001*** |
joy–sadness | –1.395 | 0.320 | –4.363 | < .001*** |
joy–neutral | –1.707 | 0.294 | –5.804 | < .001*** |
anger–sadness | 0.523 | 0.423 | 1.236 | .604 |
anger–neutral | 0.210 | 0.404 | 0.521 | .954 |
sadness–neutral | –0.313 | 0.384 | –0.814 | .848 |
Syllable length: | ||||
monosyllable–disyllable | –1.156 | 0.392 | –2.947 | .017* |
monosyllable–trisyllable | –1.061 | 0.402 | –2.642 | .041* |
monosyllable–sentence | –0.958 | 0.513 | –1.867 | .242 |
disyllable–trisyllable | 0.095 | 0.285 | 0.332 | .987 |
disyllable–sentence | 0.198 | 0.430 | 0.460 | .968 |
trisyllable–sentence | 0.103 | 0.438 | 0.234 | .996 |
Estimate | SE | z | Pr(>|z|) | |
---|---|---|---|---|
(Intercept) | 4.128 | 0.379 | 10.898 | < .001*** |
Emo_type: joy | –0.210 | 0.228 | –0.921 | .357 |
Emo_type: anger | 0.372 | 0.276 | 1.348 | .178 |
Emo_type: sadness | 0.033 | 0.252 | 0.133 | .894 |
Syll_length: monosyllable | –0.897 | 0.325 | –2.761 | .006** |
Syll_length: disyllable | –0.413 | 0.235 | –1.761 | .078 |
Syll_length: trisyllable | 0.518 | 0.280 | 1.849 | .064 |
joy × monosyllable | 0.949 | 0.405 | 2.342 | .019* |
anger × monosyllable | 0.367 | 0.434 | 0.845 | .398 |
sadness × monosyllable | –0.710 | 0.350 | –2.028 | .043* |
joy × disyllable | –0.414 | 0.268 | –1.545 | .122 |
anger × disyllable | –0.690 | 0.314 | –2.194 | .028* |
sadness × disyllable | 0.655 | 0.321 | 2.042 | .041* |
joy × trisyllable | –0.204 | 0.339 | –0.602 | .547 |
anger × trisyllable | 0.515 | 0.468 | 1.099 | .272 |
sadness × trisyllable | –0.092 | 0.373 | –0.246 | .806 |
Random effects | Variance | SD | ||
ID | 1.700 | 1.304 | ||
Item | 0.323 | 0.568 |
Contrast | Estimate | SE | z | p |
---|---|---|---|---|
Emotion type: | ||||
joy–anger | –0.582 | 0.415 | –1.402 | .498 |
joy–sadness | –0.244 | 0.383 | –0.636 | .921 |
joy–neutral | –0.014 | 0.375 | –0.038 | 1.000 |
anger–sadness | 0.339 | 0.442 | 0.767 | .869 |
anger–neutral | 0.568 | 0.435 | 1.306 | .559 |
sadness–neutral | 0.229 | 0.404 | 0.568 | .942 |
Syllable length: | ||||
monosyllable–disyllable | –0.484 | 0.426 | –1.137 | .667 |
monosyllable–trisyllable | –1.416 | 0.478 | –2.959 | .016* |
monosyllable–sentence | –1.690 | 0.657 | –2.572 | .049* |
disyllable–trisyllable | –0.931 | 0.357 | –2.610 | .045* |
disyllable–sentence | –1.205 | 0.575 | –2.097 | .154 |
trisyllable–sentence | –0.274 | 0.614 | –0.446 | .970 |
Estimate | SE | z | Pr(>|z|) | |
---|---|---|---|---|
(Intercept) | 1.641 | 0.240 | 6.828 | < .001*** |
Emo_type: joy | –0.519 | 0.101 | –5.128 | < .001*** |
Emo_type: anger | 0.394 | 0.118 | 3.335 | .001*** |
Emo_type: sadness | 0.351 | 0.118 | 2.981 | .003** |
Syll_length: monosyllable | –0.894 | 0.213 | –4.204 | < .001*** |
Syll_length: disyllable | –0.031 | 0.145 | –0.215 | .830 |
Syll_length: trisyllable | 0.169 | 0.156 | 1.086 | .277 |
joy × monosyllable | 0.389 | 0.188 | 2.071 | .038* |
anger × monosyllable | 0.806 | 0.222 | 3.637 | < .001*** |
sadness × monosyllable | –0.608 | 0.197 | –3.094 | .002** |
joy × disyllable | 0.085 | 0.133 | 0.635 | .525 |
anger × disyllable | –0.356 | 0.149 | –2.383 | .017* |
sadness × disyllable | 0.285 | 0.156 | 1.833 | .067 |
joy × trisyllable | –0.092 | 0.144 | –0.641 | .521 |
anger × trisyllable | –0.248 | 0.163 | –1.519 | .129 |
sadness × trisyllable | 0.320 | 0.172 | 1.854 | .064 |
Random effects | Variance | SD | ||
ID | 0.898 | 0.947 | ||
Item | 0.191 | 0.437 |
Contrast | Estimate | SE | z | p |
---|---|---|---|---|
Emotion type: | ||||
joy–anger | –0.913 | 0.177 | –5.147 | < .001*** |
joy–sadness | –0.870 | 0.177 | –4.921 | < .001*** |
joy–neutral | –0.293 | 0.171 | –1.716 | .315 |
anger–sadness | 0.043 | 0.196 | 0.220 | .996 |
anger–neutral | 0.620 | 0.192 | 3.232 | .007** |
sadness–neutral | 0.576 | 0.191 | 3.016 | .014* |
Syllable length: | ||||
monosyllable–disyllable | –0.863 | 0.290 | –2.976 | .016* |
monosyllable–trisyllable | –1.063 | 0.301 | –3.530 | .002** |
monosyllable–sentence | –1.650 | 0.388 | –4.250 | < .001*** |
disyllable–trisyllable | –0.200 | 0.206 | –0.974 | .764 |
disyllable–sentence | –0.787 | 0.319 | –2.465 | .066 |
trisyllable–sentence | –0.587 | 0.330 | –1.780 | .283 |
VI Discussion
VII Conclusions
Acknowledgments
Declaration of conflicting interests
Funding
ORCID iD
Footnotes
Data availability statement
References
Cite article
Cite article
Cite article
Download to reference manager
If you have citation software installed, you can download article citation data to the citation manager of your choice
Information, rights and permissions
Information
Published In
Keywords
Data availability statement
Authors
Metrics and citations
Metrics
Article usage*
Total views and downloads: 633
*Article usage tracking started in December 2016
Altmetric
See the impact this article is making through the number of times it’s been read, and the Altmetric Score.
Learn more about the Altmetric Scores
Articles citing this one
Receive email alerts when this article is cited
Web of Science: 0
Crossref: 1
- Gender Differences in Acoustic-Perceptual Mapping of Emotional Prosody in Mandarin Speech
Figures and tables
Figures & Media
Tables
View Options
View options
PDF/EPUB
View PDF/EPUBAccess options
If you have access to journal content via a personal subscription, university, library, employer or society, select from the options below:
loading institutional access options
Alternatively, view purchase options below:
Purchase 24 hour online access to view and download content.
Access journal content via a DeepDyve subscription or find out more about this option.