spot_img
HomeResearch & DevelopmentMM-Food-100K: A New Dataset for Smarter Food AI, Powered...

MM-Food-100K: A New Dataset for Smarter Food AI, Powered by Community and Blockchain

TLDR: MM-Food-100K is a 100,000-sample multimodal food dataset designed to improve AI models for food intelligence. It features real-world photos with detailed annotations, sourced from a community and reviewed by AI for quality. The accompanying Codatta Protocol uses blockchain to track data origin and offers royalty-based payments to contributors, shifting data acquisition costs to usage-based models and demonstrating significant performance gains for AI in food classification and nutrition prediction.

A new research paper introduces MM-Food-100K, a groundbreaking dataset comprising 100,000 multimodal food samples. This dataset is specifically designed to enhance the capabilities of Artificial Intelligence models, particularly for tasks related to food intelligence. The creators, including Yi Dong, Yusuke Muraoka, Scott Shi, and Yi Zhang, highlight its potential to improve how AI understands and processes information about food.

The core innovation behind MM-Food-100K lies in its unique data sourcing approach. It combines contributions from a large community of users with an advanced, automated quality review system powered by Large Vision-Language Models (LVLMs). This ensures that the dataset contains high-quality, real-world photos accompanied by rich, multi-level annotations. These annotations go beyond simple labels, including details like dish name, food type (homemade, restaurant, packaged, raw), ingredients, portion size, nutritional profile (calories, protein, fat, carbs), cooking method, and even authenticity indicators to verify if the photo is from a camera or phone versus an online download.

A significant aspect of this project is the introduction of the Codatta Protocol. This novel framework addresses common issues in traditional data markets, such as high costs for businesses to acquire verifiable data and inadequate compensation or attribution for data contributors. The Codatta Protocol utilizes a privacy-preserving blockchain system to meticulously track data provenance, meaning it records the origin and history of each data sample. This system also enables a royalty-based reward mechanism, where contributors can earn ongoing payments based on the commercial use of their data, rather than just a one-time fee. This innovative model aims to align incentives between data providers and data consumers, fostering a more sustainable and equitable data ecosystem.

The utility of MM-Food-100K has been demonstrated through experiments where AI models fine-tuned on this dataset consistently outperformed original Large Vision-Language Models. These improvements were observed in critical food-related tasks such as food classification and regression, particularly in predicting nutritional values like kilocalories. For instance, fine-tuning significantly reduced the Mean Absolute Error (MAE) for calorie prediction in models like Qwen-Max and ChatGPT-4o, showing the dataset’s value in enhancing predictive accuracy.

The data collection process itself is a two-stage, AI-augmented workflow. It begins with human data providers submitting images and annotations. An initial quality review by an Automated LVM quickly filters out low-quality submissions. Submissions that pass this stage then undergo a more thorough Final Quality Review by a Programmed LVM, ensuring data accuracy and integrity. Once validated, the data is curated and split into two sets: a 10% Public Access Subset for research and a 90% Commercial Access Subset. Both subsets retain wallet-linked provenance, setting the stage for future royalty distributions.

The Codatta Protocol’s royalty model transforms data acquisition from a large upfront capital expense into a usage-based operating expense for buyers. This reduces risk and allows for smaller pilot projects before scaling. For contributors, it offers recurring royalties, rewarding durable quality and sustained participation, creating a positive feedback loop where better data leads to better products and higher payouts. The blockchain-based ledger ensures privacy-preserving transparency, allowing verification of data existence and authorship without exposing sensitive content. This makes each contributed record a traceable digital asset, enabling fair attribution and accurate royalty allocation.

Also Read:

The MM-Food-100K dataset is publicly available for research, and the Codatta Protocol represents a new paradigm for building and sustaining high-quality, community-sourced datasets. For more in-depth information, you can refer to the full research paper available at this link.

Nikhil Patel
Nikhil Patelhttps://blogs.edgentiq.com
Nikhil Patel is a tech analyst and AI news reporter who brings a practitioner's perspective to every article. With prior experience working at an AI startup, he decodes the business mechanics behind product innovations, funding trends, and partnerships in the GenAI space. Nikhil's insights are sharp, forward-looking, and trusted by insiders and newcomers alike. You can reach him out at: [email protected]

- Advertisement -

spot_img

Gen AI News and Updates

spot_img

- Advertisement -