TLDR: PREFINE is a novel AI framework for personalized story generation that extends the Critique-and-Refine paradigm. It constructs a pseudo-user agent from a user’s interaction history and generates user-specific rubrics (evaluation criteria). This agent then critiques and refines story outputs on the user’s behalf based on these tailored rubrics, achieving personalized generation without requiring parameter updates or direct user feedback. Evaluations show PREFINE outperforms baselines in personalization while maintaining general story quality, with potential applications beyond storytelling.
Large Language Models (LLMs) have made incredible strides in creative text generation, but truly personalized stories that cater to individual tastes remain a significant challenge. Current methods often demand explicit user feedback or costly model fine-tuning, which can be burdensome for users and computationally expensive. This is where a new framework called PREFINE steps in, offering a clever solution for generating stories that genuinely reflect a user’s unique preferences without needing constant input or retraining.
PREFINE, which stands for Persona-and-Rubric Guided Critique-and-Refine, builds upon existing “Critique-and-Refine” approaches. These methods typically involve an LLM critiquing and improving its own output based on a set of rules. PREFINE takes this a step further by introducing two crucial elements for personalization: a pseudo-user agent and user-specific rubrics.
How PREFINE Works
Imagine you want a story tailored just for you. PREFINE starts by generating an initial story. Then, it creates a “pseudo-user agent” – essentially, an AI that learns to mimic your preferences by analyzing your past interactions, like stories you’ve liked or disliked. This agent then generates a “user-specific rubric,” which is a set of personalized evaluation criteria based on what it understands about your tastes. For example, if you prefer stories with strong, independent characters, the rubric might include criteria like “Characters demonstrate autonomy and self-determination.”
With this personalized rubric in hand, the pseudo-user agent critiques the initial story, providing feedback and suggestions for improvement. This feedback isn’t generic; it’s specifically designed to align the story with your inferred preferences. The story is then refined based on this feedback in an iterative cycle, gradually becoming more and more personalized. This entire process happens without requiring you to explicitly tell the system what you like or dislike during generation, and without the need to update the underlying language model.
Key Advantages and Findings
The researchers, Kentaro Ueda and Takehiro Takayanagi, conducted extensive evaluations using two story datasets, PerDOC and PerMPST, comparing PREFINE against several baseline methods. These baselines included generating stories with no user information (Zero-Persona), directly inserting user preferences into the prompt (Prompt-Persona), and a general Critique-and-Refine method (Self-Refine) that focuses on overall story quality rather than personalization.
In automatic evaluations, PREFINE consistently achieved higher win rates and statistically significant scores compared to all baselines, demonstrating its superior ability to personalize stories. What’s more, human evaluations confirmed these findings, with participants showing a strong preference for stories generated by PREFINE. Interestingly, PREFINE not only excels at personalization but also maintains general story quality at a level comparable to methods specifically designed for quality enhancement. This suggests that the framework successfully improves stories along two distinct dimensions: general quality and individual user preference.
The study also highlighted the importance of both the pseudo-user agent and the user-specific rubrics. When these components were removed or simplified in variant models, the personalization performance decreased, confirming their crucial role in PREFINE’s success.
Also Read:
- ProfileXAI: Tailoring AI Explanations for Every User
- Jarvis: A New Framework for Personalized AI Assistants with Smart Memory Retrieval
Beyond Storytelling
While PREFINE was validated in the context of personalized story generation, its underlying principles hold significant promise for other applications requiring personalized systems. This includes areas such as dialogue systems, educational tools, and recommendation engines, where adapting to individual user preferences can greatly enhance user experience and effectiveness.
For more in-depth technical details, you can read the full research paper here.


