TLDR: Perovskite-R1 is a specialized AI model (Large Language Model) designed to accelerate the discovery and design of precursor additives for perovskite solar cells. By analyzing a vast amount of scientific literature and material data, Perovskite-R1 can intelligently propose solutions for improving solar cell performance and stability. Experimental validation showed that additives suggested by Perovskite-R1 significantly outperformed those chosen manually by researchers, highlighting the potential of AI in materials science to overcome traditional trial-and-error methods and speed up research and development.
Perovskite solar cells (PSCs) are a promising technology for generating clean energy, known for their impressive ability to convert sunlight into electricity and their favorable material properties. However, their widespread use is currently limited by challenges such as long-term stability, environmental concerns, and the difficulty of large-scale manufacturing. One effective approach to tackle these issues is through precursor additive engineering, which involves adding specific ingredients to the initial solution used to create the solar cells. This method can significantly improve both the performance and durability of PSCs.
The field of PSC research is growing rapidly, with an explosion of scientific literature. This makes it increasingly difficult for researchers to efficiently find, organize, and use the vast amount of knowledge available. To bridge this gap, a new specialized large language model (LLM) called Perovskite-R1 has been developed. This AI model is specifically designed with advanced reasoning capabilities to assist in the discovery and design of PSC precursor additives.
Perovskite-R1 was built by systematically analyzing and curating 1,232 high-quality scientific publications and integrating a comprehensive library of 33,269 potential materials. This extensive data was used to create a specialized training dataset through automated question-answer generation and a process called chain-of-thought reasoning, where the AI learns to show its step-by-step thinking. By fine-tuning the QwQ-32B model on this dataset, Perovskite-R1 gained the ability to intelligently synthesize insights from existing literature and generate innovative, practical solutions for fixing material flaws (defect passivation) and selecting the best precursor additives.
Experimental tests have confirmed the effectiveness of several strategies proposed by Perovskite-R1. For instance, when comparing additives recommended by the AI (3,5-difluoropyridine-2-carboxylic acid, or AI-DFCA, and 5-hydroxy-2-methylbenzoic acid, or AI-HMBA) against those chosen manually by researchers (gallic acid, or Manual-GA, and caffeic acid, or Manual-CA), the AI-selected additives significantly improved device performance. In contrast, the manually chosen additives led to inferior results. This stark difference highlights the benefits of data-driven screening over traditional, experience-based methods in the complex field of materials discovery.
The development of Perovskite-R1 involved four key stages: building a high-quality instruction fine-tuning dataset, fine-tuning the LLM using an efficient technique called LoRA, designing structured prompts to guide the model, and finally, validating the model’s recommendations through laboratory experiments. The model’s ability to reason and generate solutions is further enhanced by its comprehensive dataset, which covers both foundational and cutting-edge knowledge in perovskite precursor additive strategies.
Perovskite-R1 consistently outperformed other leading LLMs, such as DeepSeek-R1 and Gemini-2.5-Flash-Thinking, on a benchmark dataset focused on perovskite research, especially excelling in challenging, domain-specific tasks. This demonstrates its superior capability and adaptation to the perovskite field.
While Perovskite-R1 shows great promise, the researchers acknowledge areas for future improvement. Currently, the depth of its output can be limited by the question’s phrasing, often providing concise summaries without detailed explanations. Future enhancements could include multi-turn dialogue mechanisms for more in-depth analysis and better control over structured experimental design tasks, such as precisely modeling parameter settings like molar concentration or spin-coating speed. The long-term vision for Perovskite-R1 involves expanding its application to a wider range of tasks, like interface engineering and solvent optimization, and integrating it with automated synthesis platforms to evolve into an intelligent research assistant that can actively propose hypotheses, validate feedback, and self-optimize.
Also Read:
- AI Framework Accelerates New Battery Material Development
- DiffuMeta: Designing Metamaterials with AI’s Language of Shapes
This work demonstrates the significant potential of domain-adapted LLMs in accelerating materials discovery and provides a closed-loop framework for intelligent, data-driven advancements in perovskite photovoltaic research. For more details, you can refer to the original research paper.


