TLDR: Tafo is a new neuro-symbolic AI system that automatically suggests conditional formatting rules for tables without requiring user input or examples. It learns both the rule conditions and visual formatting properties by combining symbolic and neural AI techniques and analyzing a large corpus of spreadsheets. Tafo significantly outperforms existing methods, automating over 50% of manual formatting tasks and improving accuracy in matching user-defined rules, making spreadsheet data analysis more accessible and efficient.
Spreadsheets are powerful tools for managing and analyzing data, but creating conditional formatting (CF) rules can be a daunting task. These rules, which automatically apply visual styles based on data values, often require technical expertise and can be challenging to set up. Existing solutions typically rely on users providing examples or natural language instructions, which can still be difficult and prone to inaccuracies.
Introducing Tafo: Smart Formatting for Your Tables
A new research paper, Tabularis Formatus: Predictive Formatting for Tables, introduces Tafo, an innovative system designed to simplify and automate conditional formatting. Tafo stands out because it can predict and suggest conditional formatting rules without any explicit input or examples from the user. It automatically learns both the conditions that trigger a format and the visual properties (like colors or bolding) to apply, making the entire process predictive and automated.
How Tafo Works Its Magic
Tafo employs a sophisticated “neuro-symbolic” approach, combining the strengths of different artificial intelligence techniques. It uses a symbolic generator that systematically explores possible rules, a neural generator powered by large language models (LLMs) that understand the semantics of data, and a unique neuro-symbolic generator that blends these two. This combination allows Tafo to generate a diverse range of relevant suggestions, even complex ones that might involve calculations between different columns, like comparing ‘Budget’ and ‘Cost’ to highlight profits or losses.
A key aspect of Tafo is its ability to learn not just the rule, but also the most appropriate visual format. It does this by analyzing a vast collection of existing spreadsheets and conditional formatting rules. If your table has a certain style or uses specific shades of colors, Tafo can even adapt its suggestions to match your existing preferences, ensuring the formatting looks natural and consistent.
Impressive Results and Real-World Impact
The researchers evaluated Tafo using a massive dataset of 1.8 million public Excel workbooks containing conditional and manual formatting. The results show that Tafo significantly outperforms other symbolic and neural systems designed for or adapted to table formatting. It generates more accurate, diverse, and complete formatting suggestions, improving performance by 15.6% to 26.5% in matching user-added rules.
Beyond conditional formatting, Tafo also proved highly effective in automating manual formatting tasks. For tables where users had manually applied formatting without defining rules, Tafo could have automated over 50% of these tasks with just three suggestions. This highlights Tafo’s potential to save users significant time and effort, making data analysis and presentation much more efficient.
Also Read:
- Measuring AI’s Ability to Write Surveys: Introducing SGSimEval
- Advancing GUI Automation with Difficulty-Aware Reinforcement Learning
Looking Ahead
Tafo represents a significant step forward in intelligent spreadsheet automation. By removing the need for users to specify their formatting intent, it makes advanced data visualization accessible to a wider audience. This research opens doors for similar predictive capabilities in other spreadsheet tasks, such as data cleaning, transformation, and querying, further enhancing the intelligence of data management software.


