武夷山国家公园两栖动物鸣声分析与声纹识别技术

Study of amphibian vocalizations and voiceprint recognition technology in Wuyishan National Park

  • 摘要: 近年来, 生物多样性日益成为全球关注的环境与发展议题。生物多样性调查是摸清我国生物多样性资源本底、监测生物多样性动态变化的有效途径。然而, 传统的生物多样性调查方法耗时耗力, 受到时间和空间上的诸多限制。蛙类、昆虫和鸟类等动物十分依赖鸣声通讯, 基于动物鸣声的生物多样性监测可以克服传统调查方法的诸多困难。在武夷山国家公园开展多年两栖动物多样性调查, 录制、分析代表性物种的鸣声特征, 尝试使用人工智能技术进行两栖动物声纹的自动识别。结果显示, 武夷山国家公园已记录无尾两栖类37种, 数据库收录了81.1%物种的鸣声; 声纹识别模型能够对数据库中的30个物种实现超过80%的识别准确率。基于声纹识别的两栖动物多样性监测技术是实现动物多样性智慧监测的有效方案, 声纹数据的人工智能识别技术未来将在生物多样性监测与保护中发挥重要作用。

     

    Abstract: In recent years, biodiversity has gained increasing attention as a crucial topic within global environmental and development discussions. To understand the baseline and dynamic changes of biodiversity resources in China, investigation and monitoring efforts are essential. However, traditional methods typically require significant time and human resources are constrained by spatial and temporal conditions. Although infrared camera technology has considerably improved the efficiency of monitoring wild mammals, it is less applicable to ectothermic animals such as amphibians and insects. Since animals like frogs, insects, and birds rely extensively on vocal communication, biodiversity monitoring methods can effectively address the limitations of traditional approaches. Based on years of amphibian diversity surveys conducted in Wuyishan National Park, a vocalization database for amphibians in the area was established. This study analyzed the acoustic characteristics of representative anuran species and explored the application of artificial intelligence technology for species identification through amphibian vocalizations. The results indicate that 37 anuran species have been documented in Wuyishan National Park, with vocalization data collected for 30 species, accounting for 81.1% in total. The accuracy rate for species recognition with voiceprint recognition model exceeded 80% for these species. Passive voiceprint recording equipment supports continuous long-term monitoring and can be deployed in areas challenging for personnel to access. Furthermore, the ResNet50 artificial intelligence model can automatically identify the amphibian voiceprint data, significantly enhancing the efficiency of species identification. To sum up, amphibian diversity monitoring technology based on voiceprint recognition presents an effective solution for intelligent monitoring. In the future, the artificial intelligence recognition technology for voiceprint data will play an increasingly important role in biodiversity monitoring and animal conservation.

     

/

返回文章
返回