Apple Releases 400K Image Dataset to Improve AI Editing

Apple Pico-Banana-400K AI Image Editing Dataset

Apple researchers have released Pico-Banana-400K, a comprehensive dataset containing 400,000 curated images designed to improve how artificial intelligence systems edit photos based on text prompts, the company announced in a research paper published this week.​

The massive dataset aims to address what Apple describes as a critical gap in current AI image editing training, where progress has been constrained by inadequate datasets built from real photographs. While systems like GPT-4o can make impressive edits, researchers say the lack of large-scale, high-quality training data has limited advancement in the field.​

Systematic Approach to Quality and Diversity

What distinguishes Pico-Banana-400K from previous datasets is Apple’s systematic approach to quality control and comprehensive coverage. The images are organized into 35 different edit types across eight categories, ranging from basic adjustments like color changes to complex transformations such as converting people into Pixar-style characters or LEGO figures.​

Apple built the dataset using Google’s Gemini-2.5-Flash-Image model, also known as Nano-Banana, to generate the edits, while Gemini-2.5-Pro served as an automated quality control system to evaluate results based on instruction compliance and technical quality. Each image in the set underwent this rigorous AI-powered screening process before inclusion.​

The dataset includes three specialized subsets: 258,000 single-edit examples for basic training, 56,000 preference pairs comparing successful and failed edits, and 72,000 multi-turn sequences showing how images evolve through multiple consecutive edits.

Apple Pico-Banana-400K AI Image Editing Dataset

Revealing Current AI Limitations

Apple’s research revealed significant limitations in current image editing models. While global style changes succeeded 93% of the time, precise tasks like relocating objects or editing text struggled with success rates below 60%. These findings provide valuable insights into where AI image editing still falls short of user expectations.​

The complete Pico-Banana-400K dataset is freely available for non-commercial research use on GitHub, allowing developers and researchers to use it for training more capable image editing AI systems. According to the researchers, the dataset establishes “a robust foundation for training and benchmarking the next generation of text-guided image editing models”.​


Subscribe to Our Newsletter

Related Articles

Top Trending

On This Day May 10
On This Day May 10: History, Famous Birthdays, Deaths & Global Events
Motherhood Penalty
Modern Motherhood Penalty: Why Mother’s Day 2026 is the Global Breaking Point for Working Mothers 
Mother's Day Tribute- Lessons From My Mother
The Five Lessons My Mother Never Taught Me — She Just Lived Them
best canadian travel credit cards 2026
8 Best Canadian Credit Cards for Travel Rewards Compared in 2026
London Stock Exchange vs US Markets
6 Eye-Opening Facts About How London Stock Exchange vs US Markets: Key Facts You Must Know

Fintech & Finance

best canadian travel credit cards 2026
8 Best Canadian Credit Cards for Travel Rewards Compared in 2026
How to Use a Balance Transfer to Pay Off Debt Faster
Pay Off Debt Faster with a Smart Balance Transfer
Best High-Yield Savings Accounts Now
Best High-Yield Savings Accounts Of 2026
Best Australian Credit Cards 2026
8 Best Australian Credit Cards for Points and Cashback in 2026
Klarna global expansion
12 Key Facts About Klarna's Global Expansion

Sustainability & Living

Solar Panels Increase Home Resale Value
How Solar Panels Affect Your Home's Resale Value
Solar vs Coal
How Solar Energy Is Becoming Cheaper Than Coal
UK Blockchain Food Traceability Startups
12 UK Blockchain Solutions Ensuring Complete Farm-to-Fork Traceability
EV Adoption in Australia
13 Critical Facts About EV Adoption in Australia
Non-Toxic Home Finishes UK
10 UK Startups Revolutionizing Home Renovations with Non-Toxic Finishes

GAMING

How Cloud Gaming Is Changing Mobile Experiences
How Cloud Gaming Is Changing Mobile Experiences
The Rise of Hyper-Casual Games What's Driving Downloads
Hyper-Casual Games Growth: Key Drivers Behind Massive Downloads
M&A in Gaming
Top 10 SMEs Specializing in M&A in Gaming in USA
Top 10 SMEs Specializing in Game Engines
Top 10 SMEs Specializing in Game Engines in the United States of America
Gaming Audio Design & Music
Top 10 SMEs Specializing in Gaming Audio Design & Music in US

Business & Marketing

Investing in Nordic stock exchanges
10 Practical Tips for Investing in Nordic Stock Exchanges
Best High-Yield Savings Accounts Now
Best High-Yield Savings Accounts Of 2026
How To Conduct Performance Reviews That Actually Motivate
How To Conduct Performance Reviews That Actually Motivate
Why American Football Still Dominates Sports Culture Across The United States
Why American Football Still Dominates Sports Culture Across The United States
How To Run Effective Team Meetings That Don't Waste Time
How To Run Effective Team Meetings That Don't Waste Time: Maximize Your Productivity!

Technology & AI

GDPR compliant web design
15 Practical Tips for GDPR-Compliant Web Design
How to Build a Scalable App Architecture from Day One
Scalable App Architecture Strategies for Modern Startups
Why Most SaaS Startups Have a Strategy Gap and the Tools Closing It
Why Most SaaS Startups Have a Strategy Gap — and the Tools Closing It
Aya vs Google Translate
Aya vs Google Translate in 2026: Which AI Actually Understands Your Language
Mobile Game Psychology: How Developers Hook Players Fast
How Mobile Game Developers Hook Players With Psychology

Fitness & Wellness

Digital Fitness Apps in Germany
Digital Fitness Apps in Germany: 15 Startups Turning Phones Into Personal Trainers 
modern therapy misconceptions
Why Therapy Is Still Misunderstood And How To Find The Right Help
Physical Symptoms of Grieving: How It Works
Physical Symptoms of Grieving: How It Works And Why There's No Shortcut Through It
Gamified Fitness Startups in UK
15 UK’s Most Influential Gamified Fitness Startups and SMEs 
Mindful Handwriting
Ink Against the Algorithm: Why Writing by Hand Is the New Wellness Tech