欢迎访问《图学学报》 分享到:

图学学报 ›› 2026, Vol. 47 ›› Issue (1): 223-233.DOI: 10.11996/JG.j.2095-302X.2026010223

• 工业设计 • 上一篇    下一篇

机器人如何吸引儿童?外观、动作和声音多感官特征在早期互动中的作用

李奕1,2(), 曹成才3, 宋章通4, 李祚奇5, 黎晓1,2, 李和森1,2   

  1. 1 湖北美术学院工业设计学院湖北 武汉 430205
    2 湖北美术学院现代公共视觉艺术设计研究中心湖北 武汉 430205
    3 武汉工程大学艺术设计学院湖北 武汉 430205
    4 武汉商学院艺术学院湖北 武汉 430056
    5 中国地质大学(武汉)计算机学院湖北 武汉 430078
  • 收稿日期:2025-03-24 接受日期:2025-09-02 出版日期:2026-02-28 发布日期:2026-03-16
  • 通讯作者:李奕,E-mail:liyi@hifa.edu.cn
  • 基金资助:
    湖北省人文社会科学重点研究基地-湖北美术学院现代公共视觉艺术设计研究中心重大项目(JD-2025-01)

How do robots attract children? The role of appearance, motion, and voice as multisensory features in early-stage interactions

LI Yi1,2(), CAO Chengcai3, SONG Zhangtong4, LI Zuoqi5, LI Xiao1,2, LI Hesen1,2   

  1. 1 School of Industrial Design, Hubei Institute of Fine Arts, Wuhan Hubei 430205, China
    2 Research Center for Modern Public Visual Arts and Design, Hubei Institute of Fine Arts, Wuhan Hubei 430205, China
    3 School of Art and Design, Wuhan Institute of Technology, Wuhan Hubei 430205, China
    4 School of Art, Wuhan Business University, Wuhan Hubei 430056, China
    5 School of Computer Science, China University of Geosciences, Wuhan Hubei 430078, China
  • Received:2025-03-24 Accepted:2025-09-02 Published:2026-02-28 Online:2026-03-16
  • Supported by:
    Major Projects of Key Research Institute of Humanities and Social Sciences at Universities in Hubei Province-Research Center for Modern Public Visual Arts and Design of Hubei Institute of Fine Arts(JD-2025-01)

摘要:

随着人工智能技术的快速发展,多模态机器人在学前儿童的教育、娱乐及日常生活中扮演着日益重要的角色。现有研究多关注机器人单一感官线索对儿童感知的作用,但关于多感官整合效应仍缺乏系统性研究。为探究机器人多模态特征如何共同影响儿童的情感偏好和视觉注意力,招募318名4~6岁儿童参与1项眼动追踪实验。实验采用2(外观特征:拟人型、拟动物型)×3(声音引导:男声、女声、无)×2(手势引导:有、无)的三因素混合实验设计,以机器人外观特征(拟人型与拟动物型)和行为特征(声音、手势引导)为自变量,以儿童情感偏好和眼动指标为因变量,系统考察了多感官特征对儿童用户的影响。结果表明,在外观特征方面,拟人型机器人和拟动物型机器人在主观偏好评分上没有显著差异,但拟人型机器人吸引了更长的总注视时长、更多的注视次数和更短的首次注视时间,拟人型机器人在注意力指标上显著优于拟动物型机器人,儿童在视觉接触的初始阶段更容易被拟人型机器人所吸引,且拟人化设计在持续吸引儿童注意力方面更具优势。在行为特征方面,具有手势引导的机器人在主观偏好评分上显著高于无手势机器人,且吸引了更长的总注视时长和更多的注视次数。女声机器人在主观偏好评分上略高于男声,且两者均显著高于无声机器人。男声机器人在总注视时长上略高于女声,且两者均显著高于无声机器人。而女声和男声机器人在注视次数的差异不显著,但两者均显著高于无声机器人。具有手势引导和声音(尤其是女声)的机器人在主观评分和视觉注意力分配上表现更好,说明行为特征在增强儿童的情感偏好和互动体验中具有重要作用。此外,外观和行为特征对儿童情感偏好和视觉注意力的影响相对独立,未观察到显著交互作用。该研究揭示了机器人外观和行为特征对学前儿童情感偏好和视觉注意力的影响机制,为设计满足用户情感偏好的儿童机器人提供了科学依据。

关键词: 学龄前儿童, 机器人, 外观特征, 行为特征, 情感偏好, 视觉注意力, 多感官整合

Abstract:

With the rapid development of artificial intelligence technology, multimodal robots are playing an increasingly important role in preschool children’s education, entertainment, and daily life. Existing studies have primarily focused on the effects of single sensory cues of robots on children’s perception, while systematic research on multisensory integration effects remains limited. To explore how robots’ multimodal features jointly influence children’s emotional preferences and visual attention, 318 children aged 4-6 years were recruited to participate in an eye-tracking experiment. The experiment adopted a 2 (appearance features: humanoid vs. animal-like) × 3 (voice guidance: male voice, female voice, none) × 2 (gesture guidance: present vs. absent) mixed factorial design, with robot appearance features (humanoid vs. animal-like) and behavioral features (voice and gesture guidance) as independent variables, and children’s emotional preferences and eye-tracking indicators as dependent variables, thereby systematically examining the effects of multimodal features on child users. The results showed that, in terms of appearance features, no significant difference was observed in subjective preference ratings between humanoid and animal-like robots. However, humanoid robots attracted longer total fixation duration, more fixation counts, and shorter first-fixation latency, indicating superior attention-related performance compared with animal-like robots. Children were more readily attracted to humanoid robots during the initial stage of visual contact, and anthropomorphic design showed greater advantages in sustaining children’s attention. In terms of behavioral features, robots with gesture guidance received significantly higher subjective preference ratings than those without gestures, and also elicited longer total fixation duration and more fixation counts. Robots with female voices received slightly higher subjective preference ratings than those with male voices, and both were significantly preferred over robots without voices. Robots with male voices had slightly longer total fixation duration than those with female voices, and both significantly outperformed robots without voices. The difference in fixation counts between male- and female-voice robots was not significant, but both attracted significantly more fixations than robots without voices. Robots with gesture guidance and voice (especially female voice) performed better in subjective ratings and visual attention allocation, suggesting that behavioral features substantially enhanced children’s emotional preferences and interactive experiences. Furthermore, the effects of appearance and behavioral features on children’s emotional preferences and visual attention were relatively independent, and no significant interaction effects were observed. This study revealed the mechanisms through which robot appearance and behavioral features influenced preschool children’s emotional preferences and visual attention, thereby providing scientific evidence for designing child-oriented robots that align with users’ emotional needs.

Key words: preschool children, robot, appearance features, behavioral features, emotional preference, visual attention, multisensory integration

中图分类号: