TLDR: This research paper details a novel approach to predicting life satisfaction using machine learning and explainable AI (XAI). By analyzing a Danish government survey of 19,000 people, the study identified 27 key questions that accurately predict contentment. An ensemble machine learning model achieved 93.80% accuracy, while BioBERT, a large language model, also performed strongly (93.74% accuracy) after converting tabular data to natural language. The integration of XAI ensures transparency in predictions, and the study also reveals how determinants of life satisfaction vary across different age groups, with health being a consistent primary factor. The findings have significant implications for mental health assessment and policy-making, with an interactive app developed for practical application.
Life satisfaction, a cornerstone of human well-being, has traditionally been measured through methods that are often complex, time-consuming, and prone to errors. These conventional approaches have made it challenging to understand how individuals truly experience their lives and to develop effective interventions for mental health and well-being. However, a recent study by Alif Elham Khan, Mohammad Junayed Hasan, Humayra Anjum, Nabeel Mohammed, and Sifat Momen from North South University, Bangladesh, explores a groundbreaking approach: using machine learning and explainable AI to predict life satisfaction with remarkable accuracy.
The researchers leveraged a comprehensive dataset from a Danish government survey of 19,000 people aged 16-64. This rich dataset, originally containing 243 questions, was meticulously processed using advanced feature learning techniques. The goal was to identify the most significant determinants of contentment, ultimately distilling the survey down to a concise set of 27 crucial questions. This streamlined questionnaire makes the assessment process simpler, highly reproducible, and easier to interpret for both academics and policymakers.
Machine Learning Unlocks High Accuracy
The study demonstrated the immense potential of machine learning algorithms to predict life satisfaction. An ensemble model, combining various tree-based and boosting algorithms like Random Forest, Gradient Boosting, and XGBoost, achieved an impressive accuracy of 93.80% and a macro F1-score of 73.00%. These metrics are particularly important for imbalanced datasets, ensuring a balanced measure of the model’s ability to correctly identify both satisfied and dissatisfied individuals. The robust performance of these models is attributed to their ability to learn complex patterns and relationships within the data, effectively reducing prediction errors.
Large Language Models Offer New Perspectives
Beyond traditional machine learning, the research also ventured into the realm of large language models (LLMs). The tabular survey data was transformed into natural language sentences, allowing models like BERT, BioBERT, and ClinicalBERT to analyze textual representations of individuals’ responses. Among these, BioBERT, a model pre-trained on extensive biomedical literature, emerged as the top performer, achieving an accuracy of 93.74% and a macro F1-score of 73.21%. This intriguing finding suggests that predicting life satisfaction is more closely aligned with the biomedical domain than the clinical domain, highlighting the broad impact of physical and mental health on overall well-being.
Explainable AI: Building Trust and Understanding
A critical aspect of this research is the integration of Explainable AI (XAI). While AI models can make highly accurate predictions, understanding *why* a model makes a particular decision is crucial for building trust and enabling informed action. XAI techniques were employed to unveil the underlying decision processes, providing clear justifications for predictions. For instance, if a model predicts someone is ‘content,’ XAI can show which specific answers contributed positively or negatively to that prediction, making the AI’s reasoning transparent and reliable. This transparency is vital for decision-makers, such as policymakers, who need to comprehend and apply these insights effectively.
Life’s Determinants Evolve with Age
The study also provided fascinating insights into how the primary determinants of life satisfaction shift across different age groups. While health consistently remains the most important factor across all ages, other concerns vary significantly. For younger individuals (16-21), factors like depression, tension, mood, and worry play crucial roles. In early adulthood (22-34), primary sources of income, employment status, and emotional stability become key. For middle-aged (35-44) and older adults (45-64), physical and mental health, worrying, depression, and long-term health problems are dominant, with physical health being more prominent in middle age and mental health in old age.
Also Read:
- Understanding Heart Health: How AI’s Language Skills Are Transforming Cardiology
- Large AI Models Reshape Neuroscience Research and Clinical Practice
Broader Implications and Future Directions
The findings of this research have significant social, economic, cultural, physical, and mental health implications. The concise 27-question survey, coupled with AI-driven predictions, offers a scalable tool for mental healthcare, potentially assisting individuals who may not have access to traditional therapy. The study highlights the importance of social support, relationship status, and cultural engagement (like traveling or attending events) in fostering well-being. Physical factors such as self-rated health, chronic conditions, height, and weight also play a role, as do mental health indicators like stress management, perseverance, and neuroticism.
To make this research accessible, an interactive application has been developed, allowing individuals to answer the questionnaire and receive real-time predictions of their contentment. This tool aims to expand the reach of the research and encourage the use of AI to understand and enhance subjective well-being. You can explore the full research paper here: Predicting life satisfaction using machine learning and explainable AI.
While the study presents promising results, the authors acknowledge limitations, primarily the generalizability of the models due to the dataset’s specific cultural and socioeconomic context (Denmark). Future work aims to validate and enhance the models’ robustness by incorporating data from a diverse array of countries and exploring deeper neural network architectures to capture more complex temporal dependencies in life satisfaction data.


