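Journal of Changsha University of Science & Technology (Natural Science)
Speech emotion recognition with a multi-level random forest network based on importance scores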
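Corresponding author:

YE Ji-xiang (born 1963), male, from Changsha, Hunan; professor at Changsha University of Science & Technology. His research covers speech emotion processing and unmanned aerial vehicle (UAV) communication. E-mail: huyebowen@163.com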

Chinese Library Classification (CLC) number:

H107

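Funding:

National Natural Science Foundation of China (61702052); Changsha Science and Technology Plan Project (KQ1703018); Key Projects of the Education Department of Hunan Province (17A007, 16A008)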


Multi-level random forest speech emotional recognition based on importance score
    Abstract:

    When source data are insufficient or imbalanced, deep learning methods struggle to achieve satisfactory speech emotion recognition on small sample sets. This study therefore constructs a three-level random forest emotion recognition network that separates out the easily distinguishable emotion categories at each level. Using an importance scoring method, a feature set for recognizing specific categories is built for each level of the network, and every feature in the set is weighted according to its contribution, so that features contributing more to the classification have a greater influence on the result. On small-sample speech emotion data, the overall recognition rate of the proposed multi-level network is 5% and 7% higher than that of a single-layer random forest network and a support vector machine, respectively, and 12% higher than that of a convolutional neural network (CNN), a popular deep learning method. Experimental results and theoretical analysis show that, compared with other methods, the multi-level random forest network based on importance scores achieves higher recognition accuracy when the source data are few and partially imbalanced, which gives it practical value for speech emotion recognition.
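    The cascade described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the abstract does not say how the importance score is computed or how the weights are applied, so the sketch assumes scikit-learn's impurity-based `feature_importances_` as the score and applies the weights by sampling features for each tree in proportion to their importance. The class names `WeightedLevel` and `CascadeForest`, the peeling order, and the synthetic 12-feature data are all hypothetical.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.tree import DecisionTreeClassifier


class WeightedLevel:
    """One cascade level: a forest that separates `target` from the remaining classes."""

    def __init__(self, target, n_trees=50, n_feats=4, seed=0):
        self.target = target
        self.n_trees = n_trees
        self.n_feats = n_feats
        self.seed = seed

    def fit(self, X, y):
        rng = np.random.default_rng(self.seed)
        is_target = (y == self.target).astype(int)
        # Scoring pass (assumed): impurity-based importance of each feature for this level.
        scorer = RandomForestClassifier(n_estimators=100, random_state=self.seed)
        importance = scorer.fit(X, is_target).feature_importances_
        # A small floor keeps every feature drawable; weights are normalised to sum to 1.
        self.weights_ = (importance + 1e-6) / (importance + 1e-6).sum()
        self.trees_ = []
        for _ in range(self.n_trees):
            rows = rng.integers(0, len(X), size=len(X))  # bootstrap sample
            # Assumed weighting scheme: important features are drawn more often.
            cols = rng.choice(X.shape[1], size=self.n_feats, replace=False, p=self.weights_)
            tree = DecisionTreeClassifier(random_state=self.seed)
            tree.fit(X[rows][:, cols], is_target[rows])
            self.trees_.append((cols, tree))
        return self

    def is_target(self, X):
        # Majority vote over the trees, each looking only at its own feature subset.
        votes = np.mean([tree.predict(X[:, cols]) for cols, tree in self.trees_], axis=0)
        return votes >= 0.5


class CascadeForest:
    """Peels off one class per level; the last class in `order` is what remains."""

    def __init__(self, order, **level_kwargs):
        self.order = list(order)
        self.level_kwargs = level_kwargs

    def fit(self, X, y):
        self.levels_ = []
        keep = np.ones(len(y), dtype=bool)
        for target in self.order[:-1]:
            level = WeightedLevel(target, **self.level_kwargs)
            self.levels_.append(level.fit(X[keep], y[keep]))
            keep &= (y != target)  # later levels never see a class already peeled off
        return self

    def predict(self, X):
        labels = np.asarray(self.order)
        pred = np.full(len(X), labels[-1], dtype=labels.dtype)
        pending = np.ones(len(X), dtype=bool)
        for level in self.levels_:
            idx = np.flatnonzero(pending)
            if idx.size == 0:
                break
            hit = level.is_target(X[idx])
            pred[idx[hit]] = level.target
            pending[idx[hit]] = False
        return pred


# Synthetic stand-in for acoustic features: 4 classes, 12 features, 40 samples each.
rng = np.random.default_rng(0)
y = np.repeat(np.arange(4), 40)
centers = 5.0 * np.eye(4, 12)  # class k is marked by a raised feature k
X = centers[y] + rng.normal(size=(160, 12))
perm = rng.permutation(160)
train, test = perm[:120], perm[120:]

model = CascadeForest(order=[0, 1, 2, 3]).fit(X[train], y[train])
pred = model.predict(X[test])
acc = float(np.mean(pred == y[test]))
print(f"held-out accuracy: {acc:.2f}")
print("top-weighted feature at the first level:", int(np.argmax(model.levels_[0].weights_)))
```

    With four classes, peeling one class per level gives the three levels mentioned in the abstract. Plain rescaling of features would not change a tree's splits, which is why the sketch expresses the weights through the sampling probabilities instead; the paper may use a different mechanism.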

Cite this article

YE Ji-xiang, TU Qing-yu, CHEN Yuan-tao. Multi-level random forest speech emotional recognition based on importance score[J]. Journal of Changsha University of Science & Technology (Natural Science), 2019, 16(3): 77-83.

History
  • Published online: 2022-04-24