TLDR: A research paper investigates why ChatGPT models outperform humans in providing data visualization design advice. Through a comparative analysis of responses from ChatGPT-3.5, ChatGPT-4, and human experts, the study found that AI models are generally preferred due to their comprehensive coverage, breadth of knowledge, and structured explanations. ChatGPT-4, in particular, blends human-like rhetorical styles with extensive AI knowledge, excelling in clarity and actionability, while humans still offer unique value in domain-specific insights and theoretical references.
A recent study delves into the fascinating question of why advanced AI models, specifically ChatGPT, are proving to be more effective than humans when it comes to offering advice on data visualization design. This research, conducted by Yongsu Ahn and Nam Wook Kim, provides a systematic comparison of responses from different ChatGPT versions and human experts, shedding light on the underlying reasons for AI’s superior performance.
Many individuals involved in data visualization learn their skills informally, often relying on intuition or seeking feedback from peers. While previous studies hinted at AI’s potential in this area, they didn’t fully explain the ‘why’ behind its success. This paper aims to fill that gap by examining the rhetorical structure, breadth of knowledge, and perceived quality of advice provided by both AI and humans.
To conduct their analysis, the researchers gathered real-world data visualization questions from the VisGuides forum. They then fed these questions to ChatGPT-3.5 and ChatGPT-4 (with vision capabilities), comparing their responses to those provided by human forum users. The methodology involved extracting various features related to rhetorical styles (like text length, use of examples, or references), knowledge coverage (how well visualization concepts were addressed), and perceived quality (rated by human participants on metrics such as clarity, depth, and actionability).
The findings reveal several key distinctions. Interestingly, ChatGPT-4, the more advanced model, demonstrated a hybrid of characteristics, aligning more closely with human rhetorical styles and knowledge coverage than ChatGPT-3.5. For instance, while ChatGPT-3.5 responses were often much longer, ChatGPT-4 and human responses showed greater lexical diversity and more complex sentence structures. Humans tended to use more external references and sequential markers, whereas both ChatGPT versions frequently employed contrastive language to highlight differences.
In terms of knowledge, all sources covered a similar span of visualization concepts. However, humans often emphasized understanding the underlying data characteristics (domain problems), while ChatGPT models more frequently addressed generic interaction techniques. Crucially, ChatGPT-4’s knowledge distribution was found to be closer to that of humans.
Perhaps the most significant finding was that both ChatGPT models were generally preferred over human responses across all quality metrics. Participants noted that while human advice sometimes offered valuable personal experience, it often lacked depth, sufficient information, and a user-friendly structure. They also mentioned that human responses could sometimes stray from the main topic.
Between the two AI models, ChatGPT-3.5 was highly rated for its coverage, breadth, and depth, often providing extensive and detailed explanations. ChatGPT-4, on the other hand, excelled in overall quality, topicality, clarity, and actionability, being praised for its focus, better structure, and ability to suggest alternative visual solutions.
The study also identified what factors collectively influence users’ perception of quality. Coverage and topicality emerged as the most impactful factors, indicating that users value feedback that is direct and comprehensive. The inclusion of examples and analogies also contributed to higher perceived usefulness. For more details on the methodology and specific findings, you can read the full research paper here.
Also Read:
- DeepVIS: Making Data Visualizations Clearer with Step-by-Step AI Reasoning
- Unpacking AI Trust: Why Better Explanations Can Lead to Less Belief
In conclusion, this research highlights that large language models are becoming increasingly capable in providing sophisticated visualization feedback, even combining strengths from both human and AI approaches. While AI models like ChatGPT-4 are demonstrating extensive knowledge and human-like text generation, humans still hold an edge in areas like domain-specific problem knowledge, step-by-step guidance, and referencing theoretical frameworks. This suggests a future where human and AI capabilities could complement each other for even better outcomes in data visualization design.


