PSYCH OpenIR  > 社会与工程心理学研究室
基于语音的抑郁症识别
其他题名Depression recognition based on speech analysis
潘玮1,2; 汪静莹1,2; 刘天俐3; 刘晓倩1; 刘明明1,2; 胡斌4; 朱廷劭1
第一作者潘玮
通讯作者邮箱tszhu@psych.ac.cn
心理所单位排序1
摘要

抑郁症是世界范围内常见的精神疾病之一,抑郁症患者往往长期伴随情绪低落,如悲伤内疚、低自尊、兴趣丧失、功能减退等,对个人、家庭及社会造成了巨大损失.抑郁症的发病原因复杂,临床诊断存在一定的困难,有必要寻找一种更加便捷、客观、高效的方式来辅助抑郁症的快速识别.语音作为一个相对客观且容易获得的变量,具有其潜在的价值.本研究旨在构建基于语音的抑郁症识别模型,探究语音与抑郁症之间的关系.收集了103名被试(45名抑郁症患者,58名健康人)的语音数据,实验组为临床确诊的抑郁症患者,年龄在23.8~44.6岁之间,控制组为健康人,年龄为20.1~41.7岁.我们采用了3(情绪状态:正性、中性、负性)×3(任务类型:语言问答、文本朗读、图片描述)的实验设计,运用机器学习的分类算法——逻辑回归(LR)来构建抑郁识别模型.实验结果表明,语音的抑郁识别精度可以达到82.9%.本文采用机器学习方法,基于语音变量建立有效的抑郁症自动识别模型,为抑郁症的辅助识别提供客观的指标和依据.

其他摘要

Depression is one of the common mental diseases. Patients with depression often have depressed moods such as sadness, guilty, low self-esteem, loss of interest, hypofunction and so on. They suffer from serious emotional problems, unexplained suffering, which has caused enormous losses to individuals, families and society. According to the World Health Organization, there are aproximately 322 million people suffering from depression in the whole world in 2017. While there are about 54 million depressive patients in China.

Depression can be cured effciently. However, due to the complexity of the pathogenesis of depression, clinical diagnosis is accompanied with many difficulties. Firstly, the mental disease, especially depression, are not getting enough attention and even being misinterpreted by other people. Secondly, the depression patients are less willing to ask for help. Thirdly, it is hard to select and dignose the potential depression patients precisely, as well as there are limited medical resource for depression diagnosis.

It is necessary to find a more convenient, objective and efficient way to assist the fast identification of depression. As a relatively objective and easily accessible variable, speech has its potential value. The speech of patient is easy to acquire, and also, it has been proved that the sound of depressed patients have special charcteristics such as slow speech rate, lack of cadence and so on. The purpose of this paper is to explore the relationship between speech and depression by establishing classification models of voice feature and depression prediction. In this research, 3(emotion mood: positive, neutral, negative)×3(task type: question answering, text reading, picture description) experimental design was employed, and the voice data was collected from the speech of individuals recorded during different tasks. 103 participants were inculded in this study, including 45 depression patients (age: 23.8–44.6, M=34.2, SD=10.4, males=22, females=23) and 58 healthy ones (age: 20.1–41.7, M=30.9, SD=10.8, males=27, females=31). The former were recruited in the hospital in Beijing Anding Hospital and Huilongguan Hospital, while the latter were recruited by advertisement. All of them were diagnosed by specialist with DSM-IV and MINI interview. All participants did not have substance abuse, substance dependence, personality disorders and other mental diseases, no serious physical illness or suicidal behavior. The education level of subjects are all above the elementary school. 988 Voice features were extracted from the speech data using open SMILE software. Logistic regression, a machine learning method, was used to train the predicting models. Results showed that the precision rate of predicting can reach to 82.9%. Based on machine learning methods, this paper employed voice features to establish predicting models of depression. Results show the speech of depression patients has certain predicting effect, which paves the way for the further identification of depression in a more thorough way.

关键词抑郁症 语音特征 分类算法 逻辑回归
2018
语种中文
发表期刊科学通报
ISSN0023-074X
卷号63期号:20页码:2081 - 2092
收录类别CSCD ; 中文核心期刊要目总览
CSCD记录号CSCD:6332375
引用统计
被引频次:6[CSCD]   [CSCD记录]
文献类型期刊论文
条目标识符http://ir.psych.ac.cn/handle/311026/27047
专题社会与工程心理学研究室
通讯作者朱廷劭
作者单位1.中国科学院心理研究所
2.中国科学院大学心理学系
3.北京大学人口研究所
4.兰州大学信息科学与工程学院
第一作者单位中国科学院心理研究所
通讯作者单位中国科学院心理研究所
推荐引用方式
GB/T 7714
潘玮,汪静莹,刘天俐,等. 基于语音的抑郁症识别[J]. 科学通报,2018,63(20):2081 - 2092.
APA 潘玮.,汪静莹.,刘天俐.,刘晓倩.,刘明明.,...&朱廷劭.(2018).基于语音的抑郁症识别.科学通报,63(20),2081 - 2092.
MLA 潘玮,et al."基于语音的抑郁症识别".科学通报 63.20(2018):2081 - 2092.
条目包含的文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
基于语音的抑郁症识别_潘玮.pdf(1608KB)期刊论文出版稿限制开放CC BY-NC-SA请求全文
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[潘玮]的文章
[汪静莹]的文章
[刘天俐]的文章
百度学术
百度学术中相似的文章
[潘玮]的文章
[汪静莹]的文章
[刘天俐]的文章
必应学术
必应学术中相似的文章
[潘玮]的文章
[汪静莹]的文章
[刘天俐]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。