Apple Releases 400K Image Dataset to Improve AI Editing

Apple Pico-Banana-400K AI Image Editing Dataset

Apple researchers have released Pico-Banana-400K, a comprehensive dataset containing 400,000 curated images designed to improve how artificial intelligence systems edit photos based on text prompts, the company announced in a research paper published this week.​

The massive dataset aims to address what Apple describes as a critical gap in current AI image editing training, where progress has been constrained by inadequate datasets built from real photographs. While systems like GPT-4o can make impressive edits, researchers say the lack of large-scale, high-quality training data has limited advancement in the field.​

Systematic Approach to Quality and Diversity

What distinguishes Pico-Banana-400K from previous datasets is Apple’s systematic approach to quality control and comprehensive coverage. The images are organized into 35 different edit types across eight categories, ranging from basic adjustments like color changes to complex transformations such as converting people into Pixar-style characters or LEGO figures.​

Apple built the dataset using Google’s Gemini-2.5-Flash-Image model, also known as Nano-Banana, to generate the edits, while Gemini-2.5-Pro served as an automated quality control system to evaluate results based on instruction compliance and technical quality. Each image in the set underwent this rigorous AI-powered screening process before inclusion.​

The dataset includes three specialized subsets: 258,000 single-edit examples for basic training, 56,000 preference pairs comparing successful and failed edits, and 72,000 multi-turn sequences showing how images evolve through multiple consecutive edits.

Apple Pico-Banana-400K AI Image Editing Dataset

Revealing Current AI Limitations

Apple’s research revealed significant limitations in current image editing models. While global style changes succeeded 93% of the time, precise tasks like relocating objects or editing text struggled with success rates below 60%. These findings provide valuable insights into where AI image editing still falls short of user expectations.​

The complete Pico-Banana-400K dataset is freely available for non-commercial research use on GitHub, allowing developers and researchers to use it for training more capable image editing AI systems. According to the researchers, the dataset establishes “a robust foundation for training and benchmarking the next generation of text-guided image editing models”.​


Subscribe to Our Newsletter

Related Articles

Top Trending

who cancelled more shows in 2025 featured image
Netflix Vs. Disney+ Vs. Max: Who Cancelled More Shows In 2025?
global Netflix cancellations 2026 featured image
The Global Axe: Korean, European, and Latin American Netflix Shows Cancelled in 2026
why Netflix removes original movies featured image
Deleted Forever? Why Netflix Removes Original Movies And Where The “Tax Break” Theory Comes From
can fans save a Netflix show featured image
Can Fans Save A Netflix Show? The Real History Of Petitions, Pickups, And Comebacks
Netflix shows returning in 2026 featured image
Safe For Now: Netflix Shows Returning In 2026 That Are Officially Confirmed

LIFESTYLE

Travel Sustainably Without Spending Extra featured image
How Can You Travel Sustainably Without Spending Extra? Save On Your Next Trip!
Benefits of Living in an Eco-Friendly Community featured image
Go Green Together: 12 Benefits of Living in an Eco-Friendly Community!
Happy new year 2026 global celebration
Happy New Year 2026: Celebrate Around the World With Global Traditions
dubai beach day itinerary
From Sunrise Yoga to Sunset Cocktails: The Perfect Beach Day Itinerary – Your Step-by-Step Guide to a Day by the Water
Ford F-150 Vs Ram 1500 Vs Chevy Silverado
The "Big 3" Battle: 10 Key Differences Between the Ford F-150, Ram 1500, and Chevy Silverado

Entertainment

who cancelled more shows in 2025 featured image
Netflix Vs. Disney+ Vs. Max: Who Cancelled More Shows In 2025?
global Netflix cancellations 2026 featured image
The Global Axe: Korean, European, and Latin American Netflix Shows Cancelled in 2026
why Netflix removes original movies featured image
Deleted Forever? Why Netflix Removes Original Movies And Where The “Tax Break” Theory Comes From
can fans save a Netflix show featured image
Can Fans Save A Netflix Show? The Real History Of Petitions, Pickups, And Comebacks
Netflix shows returning in 2026 featured image
Safe For Now: Netflix Shows Returning In 2026 That Are Officially Confirmed

GAMING

Pocketpair Aetheria
“Palworld” Devs Announce New Open-World Survival RPG “Aetheria”
Styx Blades of Greed
The Goblin Goes Open World: How Styx: Blades of Greed is Reinventing the AA Stealth Genre.
Resident Evil Requiem Switch 2
Resident Evil Requiem: First Look at "Open City" Gameplay on Switch 2
High-performance gaming setup with clear monitor display and low-latency peripherals. n Improve Your Gaming Performance Instantly
Improve Your Gaming Performance Instantly: 10 Fast Fixes That Actually Work
Learning Games for Toddlers
Learning Games For Toddlers: Top 10 Ad-Free Educational Games For 2026

BUSINESS

Quiet Hiring Trend
The “Quiet Hiring” Trend: Why Companies Are Promoting Internally Instead of Hiring in Q1
Pharmaceutical Consulting Strategies for Streamlining Drug Development Pipelines
Pharmaceutical Consulting: Strategies for Streamlining Drug Development Pipelines
IMF 2026 Outlook Stable But Fragile
Global Economic Outlook: IMF Predicts 3.1% Growth but "Downside Risks" Remain
India Rice Exports
India’s Rice Dominance: How Strategic Export Shifts are Reshaping South Asian Trade in 2026
Mistakes to Avoid When Seeking Small Business Funding featured image
15 Mistakes to Avoid As New Entrepreneurs When Seeking Small Business Funding

TECHNOLOGY

Netflix shows returning in 2026 featured image
Safe For Now: Netflix Shows Returning In 2026 That Are Officially Confirmed
Grok AI Liability Shift
The Liability Shift: Why Global Probes into Grok AI Mark the End of 'Unfiltered' Generative Tech
GPT 5 Store leaks
OpenAI’s “GPT-5 Store” Leaks: Paid Agents for Legal and Medical Advice?
Pocketpair Aetheria
“Palworld” Devs Announce New Open-World Survival RPG “Aetheria”
The Shift from Co-Pilot to Autopilot The Rise of Agentic SaaS
The Shift from "Co-Pilot" to "Autopilot": The Rise of Agentic SaaS

HEALTH

Polylaminin Breakthrough
Polylaminin Breakthrough: Can This Brazilian Discovery Finally Reverse Spinal Cord Injury?
Bio Wearables For Stress
Post-Holiday Wellness: The Rise of "Bio-Wearables" for Stress
ChatGPT Health Medical Records
Beyond the Chatbot: Why OpenAI’s Entry into Medical Records is the Ultimate Test of Public Trust in the AI Era
A health worker registers an elderly patient using a laptop at a rural health clinic in Africa
Digital Health Sovereignty: The 2026 Push for National Digital Health Records in Rural Economies
Digital Detox for Kids
Digital Detox for Kids: Balancing Online Play With Outdoor Fun [2026 Guide]