TLDR: BPMN Assistant is a novel tool that uses Large Language Models (LLMs) to enable the creation and editing of Business Process Model and Notation (BPMN) diagrams through natural language. It introduces a specialized JSON-based representation that significantly enhances reliability, speeds up processing, and achieves higher success rates in editing tasks compared to direct XML manipulation. This approach aims to make business process modeling more accessible to non-technical users, bridging the communication gap between IT and business stakeholders and leading to more accurate and timely process documentation.
Business Process Model and Notation (BPMN) has long been the standard for visualizing and optimizing business workflows. However, its inherent complexity often creates significant hurdles for individuals without specialized training. This complexity leads to inefficiencies, a heavy reliance on experts, increased operational costs, and delays in decision-making. Furthermore, a persistent communication gap exists between IT departments, who are comfortable with formal modeling, and business stakeholders, who typically describe processes in natural language. This disconnect frequently results in misunderstandings and implementation delays.
The challenge of extracting process models from unstructured text is substantial, with many current methods struggling with real-world document complexity. Traditional process elicitation methods often fail to capture the nuances of human-centric activities. The dynamic nature of modern business environments also demands frequent updates to processes, but the formal requirements of BPMN and the expertise needed for modifications often create bottlenecks, leading to a gap between actual operations and their formal documentation.
Introducing BPMN Assistant
A new tool, BPMN Assistant, aims to bridge these gaps by leveraging Large Language Models (LLMs) for the natural language-based creation and editing of BPMN diagrams. This system is designed to make business process modeling more accessible, allowing users to describe processes in everyday language rather than requiring knowledge of specialized tools or technical formats like XML.
The BPMN Assistant system is composed of three main parts: a Python-based backend, a BPMN layout server, and a Vue.js frontend. The backend handles user inputs, interacts with the LLM, and manages BPMN diagrams. It uses a specialized JSON-based representation for diagrams, which is then converted to BPMN XML for visualization. The layout server adds graphical information to the diagrams, ensuring they are visually presentable. The frontend provides an intuitive graphical user interface with a chat interface on the left for natural language queries and a BPMN canvas on the right to display the diagrams.
The system supports a variety of advanced LLMs from different providers, including OpenAI’s GPT-4o, Anthropic’s Claude 3.5 Sonnet, and Google’s Gemini 2.0 Flash, among others. It facilitates operations on common BPMN elements such as general tasks, user tasks, service tasks, exclusive gateways, parallel gateways, start events, and end events. Crucially, it also supports a set of specialized functions for modifying diagrams, allowing users to delete, redirect, add, move, or update elements through natural language instructions.
Also Read:
- AutoPlay: Automating Task Generation for Interactive AI Agents
- Empowering AI to Recognize Its Own Limits in Complex Reasoning
Why JSON Makes a Difference
A core innovation of BPMN Assistant is its use of a specialized JSON-based representation for BPMN diagrams. This structured approach offers significant advantages over directly manipulating BPMN XML. In evaluations, the JSON-based representation achieved similar quality scores for diagram generation compared to direct XML generation, but it demonstrated greater reliability with fewer failures. More importantly, JSON-based generation was significantly faster, with a mean latency of 14.41 seconds compared to 24.07 seconds for XML.
The benefits of JSON are even more pronounced in editing tasks. Models consistently achieved higher success rates when using the JSON-based approach for interpreting natural language editing instructions. This is likely due to JSON’s structured nature, which provides a clearer representation for modifications. While JSON requires more input tokens on average, it produces more concise outputs and achieves faster processing times. This trade-off is often cost-effective, as input tokens typically cost less than output tokens.
By reducing the need for technical expertise, BPMN Assistant empowers business professionals to directly contribute to process design and updates. This can lead to more accurate and timely documentation, fostering better communication between technical and non-technical stakeholders. The natural language approach also opens possibilities for process modeling in educational contexts, allowing students to focus on understanding process concepts rather than mastering complex technical interfaces.
While BPMN Assistant shows great promise, it currently supports only a subset of BPMN elements, limiting the complexity of processes that can be modeled. Future work will focus on expanding support for additional elements, incorporating semantic evaluation techniques, and conducting usability studies to further enhance its practical applicability. You can learn more about this research in the full paper available here.


