TLDR: The TCM-Tongue research paper introduces the first large-scale, standardized dataset of 6,719 high-quality, expert-annotated tongue images for AI-assisted Traditional Chinese Medicine (TCM) diagnosis. Addressing challenges like diagnostic subjectivity and data scarcity, the dataset features images captured under controlled conditions and annotated with 20 pathological categories by licensed TCM practitioners. It supports various AI annotation formats and has been benchmarked with deep learning models, demonstrating its utility for developing objective and scalable AI tools in TCM, thereby bridging ancient diagnostic wisdom with modern technology.
Traditional Chinese Medicine (TCM) has a rich history spanning millennia, rooted in a holistic philosophy that views the human body in harmony with nature. A cornerstone of TCM diagnosis is tongue examination, where practitioners subjectively interpret visual features like color, texture, and coating to understand a patient’s health. However, this traditional method faces challenges due to its subjective nature and inconsistencies in imaging, making it difficult to standardize and scale.
The integration of Artificial Intelligence (AI), particularly deep learning for image analysis, offers a promising path to modernize TCM diagnostics. AI can enhance objectivity, consistency, and scalability in tongue assessment. Yet, this integration has been hampered by critical issues: a scarcity of standardized, large-scale datasets, inconsistencies in how tongue images are acquired, and the complexity of labeling features according to TCM theory, which focuses on ‘symptom patterns’ rather than specific diseases.
To address these significant gaps, researchers have introduced a groundbreaking resource: the TCM-Tongue dataset. This is the first specialized dataset designed specifically for AI-driven TCM tongue diagnosis. It comprises 6,719 high-quality tongue images, all captured under standardized conditions and meticulously annotated with 20 distinct pathological symptom categories. Each image averages 2.54 clinically validated labels, all verified by licensed TCM practitioners, ensuring the data’s clinical authenticity and adherence to traditional principles.
The creation of the TCM-Tongue dataset involved a rigorous, dual-pronged approach. First, a standardized image acquisition protocol was developed. This included a dedicated hardware system with a synchronized dual-camera array and precision-calibrated illumination to ensure consistent lighting and angle alignment. The system even incorporates intelligent facial proximity detection and real-time demographic analysis to ensure subject eligibility and optimal positioning. This meticulous process minimizes environmental variability and enhances the dataset’s reliability for AI applications.
Second, an expert-annotated, AI-ready labeling framework was implemented. Renowned TCM physicians guided the curation of diagnostic labels, ensuring they captured essential TCM-specific nuances while remaining compatible with modern deep learning frameworks. The annotations preserve classical TCM diagnostic markers, such as tongue coating texture, color gradations, and fissure patterns, and are structured as multi-label classifications and segmentation masks. The dataset supports multiple annotation formats, including COCO, TXT, and XML, for broad usability.
To demonstrate its utility, the dataset has been benchmarked using nine different deep learning models, including variants of YOLOv5, YOLOv7, YOLOv8, SSD, and MobileNetV2. The technical validation revealed that while increasing model depth doesn’t always improve accuracy (likely due to the dataset size being smaller than general benchmarks, leading to overfitting in very deep models), mid-sized models like YOLOv7 and YOLOv8m offer an optimal balance between accuracy and computational efficiency. These findings are crucial for developing practical AI tools for TCM, especially considering the computational constraints of intelligent tongue diagnosis robots.
The TCM-Tongue dataset represents a critical foundation for advancing reliable computational tools in TCM. It bridges the data shortage that has hindered progress in the field and facilitates the integration of AI into both research and clinical practice through standardized, high-quality diagnostic data. This work not only accelerates the digitization of TCM, enhancing objectivity and scalability while preserving its holistic principles, but also expands the scope of AI in medicine beyond Western-centric approaches, showcasing how deep learning can embrace diverse diagnostic traditions. For more detailed information, you can refer to the original research paper.
Also Read:
- AI Model FundusExpert Enhances Eye Disease Diagnosis Through Clinical Reasoning
- Enhancing Sign Language Handshape Recognition Using Generative Models
Future applications stemming from this dataset could include personalized TCM diagnostics, telemedicine platforms, and hybrid AI-human decision systems, opening new pathways for interdisciplinary innovation in global healthcare by leveraging machine learning to decode ancient diagnostic wisdom.


