TextOnly: A Unified Text Box for Smarter Smartphone Interactions

TLDR: TextOnly is a new smartphone interface that allows users to access various text-related functions across different applications by simply typing into one unified text box. It leverages a combination of large language models (LLM) for general knowledge and a BERT model for personalized learning, achieving high accuracy (71.35% top-1) and continuously improving over time. User studies demonstrate its effectiveness, user preference over manual execution, and its ability to support a greater range of functions with more concise inputs compared to traditional voice assistants and built-in text portals.

Navigating the myriad of applications on our smartphones to perform simple text-related tasks can often feel like a chore. Whether you’re searching for a restaurant, sending a quick message, or jotting down a note, the process typically involves multiple steps: unlocking your phone, finding the right app, opening it, and then locating the specific text box for your input. This common frustration is what a new research paper, TextOnly: A Unified Function Portal for Text-Related Functions on Smartphones, aims to solve.

Authored by Minghao Tu, Chun Yu, Xiyuan Shen, Zhi Zheng, Li Chen, and Yuanchun Shi, primarily from Tsinghua University, this paper introduces TextOnly, a novel unified function portal designed to simplify how we interact with text-related functions on our mobile devices. Imagine typing a restaurant name into a single, central text box, and TextOnly instantly suggests opening Google Maps to search for it. Or, typing a greeting and having it prompt a conversation in WhatsApp. This is the core idea behind TextOnly: to interpret user intentions from concise text inputs and directly launch the desired function across various applications.

TextOnly achieves this impressive feat by integrating two powerful AI models: a large language model (LLM) and a BERT model. The LLM provides a broad base of general knowledge, helping TextOnly understand diverse inputs and handle new users or functions effectively. Complementing this, the BERT model continuously learns user-specific preferences and habits, enabling quicker and more personalized predictions over time. This dual-model approach ensures both robust initial performance and continuous improvement.

The researchers conducted extensive real-world user studies, demonstrating TextOnly’s effectiveness. It achieved a top-1 accuracy of 71.35%, meaning the correct function was the first recommendation over 70% of the time. Even more impressively, its top-5 accuracy reached 89.94%. Participants in the study found TextOnly to have satisfactory usability and expressed a clear preference for it over manually executing tasks. They also noted that TextOnly’s accuracy and inference speed improved significantly with continued use, highlighting its personalized learning capabilities.

Compared to existing solutions, TextOnly offers distinct advantages. While voice assistants are popular, they often require more elaborate commands and can be inconvenient in public or noisy environments. TextOnly, on the other hand, supports a wider range of text-related functions and allows for much more concise inputs. For instance, typing just a number could trigger a payment function, a scenario not typically handled by traditional voice assistants. It also outperforms built-in text portals on smartphones in terms of the number of supported functions and user satisfaction.

The user interface of TextOnly is designed for simplicity. It appears as a floating ball on the screen, which users can tap to reveal a single text input box. After typing, TextOnly displays a list of predicted functions. Users simply tap their desired option, and TextOnly, using Robotic Process Automation (RPA), automatically executes the function within the relevant application. This seamless execution significantly reduces the steps involved in daily smartphone tasks.

The research also delved into user behavior, collecting over 5,000 data entries from participants. Findings showed that people frequently use a wide variety of text-related functions, with searching being the most common intention. The length and nature of text inputs varied greatly depending on the action (e.g., short inputs for payments, longer for reviews), providing valuable data for TextOnly’s predictive models. Crucially, the study emphasized the need for user personalization, as individual habits and app usage patterns differ significantly.

While TextOnly represents a significant step forward, the authors acknowledge certain limitations and areas for future work. Current challenges include the inference time, which can be affected by network latency when querying the LLM, and supporting functions that require more parameters than just text input. Future enhancements could involve using offline or lightweight LLMs, dynamically capturing text boxes for a broader range of functions, and automating the recording of new RPA scripts to make adding new functions easier for users. As large language models continue to evolve, TextOnly is expected to become even more powerful and efficient.

Also Read:

In conclusion, TextOnly offers a compelling vision for a more intuitive and efficient smartphone experience. By unifying access to text-related functions through a single, intelligent portal, it not only streamlines daily tasks but also opens up new possibilities for human-computer interaction, making our devices truly more responsive to our intentions.

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

Financial Sector Fortifies Against Surging AI-Powered Scams

Deloitte’s 2025 Outlook: Navigating Escalating AI Challenges in Human Capital

Salesforce Study Reveals Data Quality is Pivotal for Employee Trust in AI Adoption

Top Executives Sidestep Company AI Guidelines, Fueling Shadow AI Risks

Intel’s Evolving IP Strategy: A Calculated Shift Towards Core AI Innovation

Generative AI Prompts Increased Workforce Surveillance in Indian IT Sector

TextOnly: A Unified Text Box for Smarter Smartphone Interactions

Gen AI News and Updates

Vesl AI Recognized for AI Infrastructure Innovation with ASOCIO Digital Summit Award

AT&T Unleashes Agentic AI Across Business Operations for Enhanced Efficiency and Innovation

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

Boosting Business Efficiency: A New AI and Big Data Model for Process Optimization

AlphaCast: A New Approach to Time Series Prediction Through Human-AI Collaboration

New Graph Neural Networks Improve Reasoning in Assumption-Based Argumentation

Enhancing AI Reasoning: How Recursive Refinement and Multi-Agent Systems Improve Language Model Performance

ARGUS: A Proactive Framework for Enhancing Autonomous Driving Safety

Generative AI Powers Next-Gen Autonomous Emergency Response

OR-R1: Advancing Automated Optimization with Smart, Data-Efficient AI

Enhancing GUI Agents with Memory: A New Framework for History-Aware Reasoning

ProBench: A Deeper Look into How We Evaluate AI Agents for Mobile Apps

Enhancing Large Language Model Reasoning with Concise Outputs

Ensuring Trust in Autonomous AI: A Two-Layered Monitoring Approach for Agentic Systems

MedFuse: A Multiplicative Approach to Understanding Irregular Clinical Time Series Data

HyperD: A New Framework for More Accurate and Robust Traffic Predictions

Beyond Training: Researchers Propose ‘Model Raising’ for AI with Intrinsic Values

Bridging the Divide: Why AI Needs a Qualitative Revolution

Language Models Enhance Safety Certificate Synthesis for Dynamic Systems

Subscribe to get the latest news and updates