欢迎访问《图学学报》 分享到:

图学学报 ›› 2025, Vol. 46 ›› Issue (6): 1274-1280.DOI: 10.11996/JG.j.2095-302X.2025061274

• 图像处理与计算机视觉 • 上一篇    下一篇

基于图像分块交互的咖啡果实成熟度预测模型

张馨匀(), 张力文, 周李, 罗笑南()   

  1. 桂林电子科技大学人工智能交叉研究院广西 桂林 541004
  • 收稿日期:2025-03-13 接受日期:2025-04-23 出版日期:2025-12-30 发布日期:2025-12-27
  • 通讯作者:罗笑南(1963-),男,教授,博士。主要研究方向为图形图像处理和计算机视觉等。E-mail:luoxn@guet.edu.cn
  • 第一作者:张馨匀(1997-),女,硕士研究生。主要研究方向为数字图像处理与语音处理。E-mail:wxqys178@163.com
  • 基金资助:
    广西科技重大专项(桂科AA24263013)

Coffee fruit maturity prediction model based on image blocking interaction

ZHANG Xinyun(), ZHANG Liwen, ZHOU Li, LUO Xiaonan()   

  1. Institute of Artificial Intelligence Cross Research, Guilin University of Electronic Science and Technology, Guilin Guangxi 541004, China
  • Received:2025-03-13 Accepted:2025-04-23 Published:2025-12-30 Online:2025-12-27
  • First author:ZHANG Xinyun (1997-), master student. Her main research interests cover digital image processing and speech processing. E-mail:wxqys178@163.com
  • Supported by:
    Guangxi Science and Technology Major Special Project(桂科AA24263013)

摘要:

随着咖啡文化的普及和消费需求的增长,咖啡果实的成熟度成为决定品质和市场价值的关键因素。然而,不合理采收导致品质参差不齐,影响经济效益。通过先进成熟度检测技术,可提升采摘精准度,为农户提供数据化决策支持,但在复杂背景下现有方法的鲁棒性和高密度小目标检测方面仍存在技术挑战。因此,提出一种基于图像分块交互的咖啡树果实成熟度预测模型,通过引入空间分块交互注意力机制(SBIAM)实现局部特征和全局特征信息的互补融合,使得模型既能聚焦果实区域,又能有效抑制背景干扰,增强模型对关键特征的关注能力。此外,引入归一化Wasserstein距离(NWD)损失函数解决咖啡果实分类较多出现预测位置偏差等问题,提升复杂场景下咖啡果实成熟度检测的精度和鲁棒性。实验结果表明,改进模型不仅提升了检测精度,还实现了性能与效率的良好平衡。

关键词: 咖啡果实, 成熟度预测模型, 空间分块交互, 注意力机制, NWD损失函数

Abstract:

With the popularity of coffee culture and growing consumer demand, the maturity of coffee fruits has become a key determinant of quality and market value. However, irrational harvesting leads to uneven quality and impacts economic benefits. Through advanced ripeness detection techniques, harvesting accuracy can be improved to provide data-based decision support for farmers; however, the existing methods still have technical challenges in terms of robustness in complex backgrounds and high-density small-target detection. Therefore, a coffee tree fruit ripeness prediction model based on image-chunking interaction was proposed, which achieved the complementary fusion of local and global feature information by introducing a spatial-blocking interaction attention mechanism (SBIAM), so that the model can focus on the fruit region as well as effectively inhibit the background interference, enhancing the model's ability to pay attention to key features. In addition, a normalized Wasserstein distance (NWD) loss function was introduced to solve the problems such as the prediction-position deviation common in coffee-fruit classification, thereby improving the accuracy and robustness of coffee-fruit ripeness detection in complex scenes. Experimental results demonstrated that the proposed improved model not only enhanced the detection accuracy, but also achieved a good balance between performance and efficiency.

Key words: coffee fruit, maturity prediction model, spatial block interaction, attention mechanism, NWD loss function

中图分类号: