TLDR: MathBuddy is an innovative AI-powered math tutor that uses both conversational text and facial expressions to detect a student’s emotional state. By categorizing emotions as positive, neutral, or negative, the system dynamically adjusts its teaching strategy—challenging engaged students and motivating those experiencing negative emotions. This multimodal approach significantly enhances student engagement and learning effectiveness, as demonstrated by substantial performance gains in automatic evaluations and a user study showing longer positive emotional states in students.
The world of educational technology is rapidly changing with the rise of AI-powered conversational systems. However, many of these advanced learning models often overlook a crucial aspect of human learning: emotions. Studies in educational psychology consistently show that a student’s emotional state, whether positive or negative, significantly impacts their ability to learn.
To address this important gap, researchers have developed MathBuddy, an innovative, emotionally aware AI math tutor. MathBuddy dynamically models a student’s emotions and uses this understanding to apply relevant teaching strategies, making the interaction between the tutor and student much more empathetic and effective.
How MathBuddy Understands and Responds to Emotions
MathBuddy gathers insights into a student’s emotions from two main sources: their conversational text and their facial expressions. These emotional cues are then combined and categorized into one of three states: Positive, Neutral, or Negative. This aggregated emotional state then guides the AI tutor’s response. If a student is in a positive or neutral emotional state, MathBuddy challenges them to further their learning. If the student is experiencing a negative emotion, the system shifts to a motivational approach, offering encouragement and support.
The system’s design involves several key components. For text-based emotion recognition, MathBuddy uses fine-tuned BERT-based models to analyze student utterances. For facial emotion recognition, it employs the face-api.js package, which can detect emotions like Happy, Sad, Angry, Surprised, Fearful, Disgusted, and Neutral from webcam data. These two modalities are then combined through a multimodal aggregation process, which prioritizes non-neutral emotions, assuming that any hint of a strong emotion is more informative than neutrality.
Evaluating MathBuddy’s Effectiveness
The effectiveness of MathBuddy was rigorously evaluated through both automatic metrics and real-time user studies. In automatic evaluations, MathBuddy demonstrated a significant performance gain, including a 23-point increase in win rate and a 3-point gain in overall DAMR (Desired Annotation Match Rate) scores compared to baseline models. These results strongly support the hypothesis that modeling student emotions improves the pedagogical abilities of AI tutors.
A user study involving 30 participants (aged 15–55) further highlighted MathBuddy’s impact. Participants experienced two tutoring sessions, one with emotion-aware adaptation (Emotion ON) and one without (Emotion OFF). The study found that empathy is a highly desired quality in a tutor, and empathetic responses significantly enhanced user learning. Participants in the Emotion ON condition displayed a higher frequency and longer average durations of positive facial expressions, suggesting a more engaging and satisfying learning experience. While the system achieved an overall accuracy of 60% in detecting emotions in real-time usage, it showed particularly high recall for negative emotions, though it sometimes struggled with neutral or positive affect.
Also Read:
- RoboBuddy: Empowering Teachers with LLM-Powered Social Robots for Engaging Classroom Learning
- Next-Generation AI for Education: Combining Social and Technical Learning Support
Contributions and Future Vision
MathBuddy represents a significant step forward as one of the first emotionally-aware LLM tutoring systems grounded in educational theory. It adapts its responses based on a student’s affective state and includes an automatic evaluation system for comprehensive assessment. The researchers also annotated a dataset of 224 student utterances with emotion labels, which will be made publicly available. You can learn more about this research in the full paper: MathBuddy: A Multimodal System for Affective Math Tutoring.
While current modalities (text and facial expressions) show promise, future work aims to explore additional modalities like spoken audio, handwritten notes, or biometric sensors to further enhance emotion recognition accuracy. The goal is to move towards more human-centered, emotionally intelligent learning systems that can truly personalize the educational experience.


