Apple Launches 8 Compact AI Models for Enhanced On-Device Use

Apple launches ai language models for on-device use

Apple just unveiled these nifty little AI language models called OpenELM. They’re pretty compact, so you can run them right on your smartphone instead of needing some beefy cloud servers.

It’s all part of this growing trend of “small language models” that are gaining traction in the AI world.

OpenELM models are still in the research phase, but they could be the foundation for some seriously cool on-device AI features from Apple down the line. Microsoft’s doing something similar with their Phi-3 models, aiming to pack a punch in terms of language understanding and processing, all while keeping things local.

Some of these OpenELM models are super tiny, ranging from just 270 million to 3 billion parameters. That’s nothing compared to behemoths like Meta’s Llama 3 or OpenAI’s GPT-3, which have billions upon billions of parameters. But here’s the thing: recent research is all about making these smaller models as capable as their larger predecessors.

Apple’s giving us eight different OpenELM models to play with. Four of them are “pretrained,” which is basically the raw, next-token version. The other four are instruction-tuned, making them perfect for building AI assistants and chatbots.

  1. OpenELM-270M
  2. OpenELM-450M
  3. OpenELM-1_1B
  4. OpenELM-3B
  5. OpenELM-270M-Instruct
  6. OpenELM-450M-Instruct
  7. OpenELM-1_1B-Instruct
  8. OpenELM-3B-Instruct

OpenELM models can handle a whopping 2048 tokens at a time, which is pretty impressive. Apple trained them on some seriously massive datasets, like RefinedWeb, a cleaned-up version of PILE, and chunks of RedPajama and Dolma v1.6. All in all, we’re talking about 1.8 trillion tokens of data. That’s a mind-boggling amount of information for these AI models to chew on.

Now, here’s where it gets really interesting. Apple’s using this nifty “layer-wise scaling strategy” for OpenELM. In summary, this clever strategy distributes parameters across each layer of the model, thereby reducing computational resources and enhancing performance. Get this: OpenELM managed to beat Allen AI’s OLMo 1B by 2.36 percent in accuracy, and it only needed half the pre-training tokens to do it. That’s some seriously efficient AI right there.

But wait, there’s more! Apple’s not just giving us the OpenELM models; they’re also sharing the code for CoreNet, the library they used to train these bad boys. Plus, they’ve included step-by-step recipes to recreate the model weights, which is pretty rare for a big tech company. Apple’s all about transparency with this release, aiming to “empower and enrich the open research community.”

Of course, Apple’s not naive. They know that since these models were trained on public datasets, there’s a chance they might spit out some inaccurate, harmful, or biased stuff. But hey, that’s part of the learning process.

We haven’t seen Apple integrate this cutting-edge AI into their devices yet, but rumor has it that iOS 18 might come packed with some nifty on-device AI features. And who knows, maybe they’ll even team up with Google or OpenAI to give Siri a much-needed upgrade.

The Information is Taken from Ars Technica, Times of India and FirstPost


Subscribe to Our Newsletter

Related Articles

Top Trending

tax efficiency for Canadian high earners
10 Things Every Reader Must Know About How Canadian High-Earners Use Holding Companies for Tax Efficiency
FSCA crypto licensing requirements
10 Things Most People Don't Know About South Africa's FSCA Crypto Licensing Rules
Accredited Employer Work Visa
10 Key Facts About New Zealand's Accredited Employer Work Visa
how NHI will work in South Africa
7 Things Most People Don't Know About How NHI Will Change South Africa's Healthcare Landscape for Everyone
Australian dividend imputation strategies
The Franking Advantage: 10 Practical Tips for Australian Dividend Imputation

Fintech & Finance

Smart Contracts
Smart Contracts Explained: Real-World Applications Beyond Crypto
Tokenization Of Real-World Assets
Tokenization Of Real-World Assets: The Next Big Crypto Trend!
how to spot Crypto Scam
How to Spot a Crypto Scam Before It's Too Late: Protect Your Investment!
Crypto in South Africa
17 Things Worth Knowing About How South Africans Are Using Crypto to Beat Rand Volatility
Institutional Adoption Of Bitcoin
Institutional Adoption of Bitcoin: What It Means For Retail Investors?

Sustainability & Living

Green Building Certifications For Schools
Green Building Certifications For Schools: Boost Learning Environments!
Smart Water Management
Revolutionize Smart Water Management In Cities: Unlock the Future!
Homesteading’s Comeback Story, Why Americans Are Turning Back To Self Reliance In Record Numbers
Homesteading’s Comeback Story: Why Americans are Turning Back to Self Reliance In Record Numbers
Direct Air Capture_ The Machines Sucking CO2
Meet the Future with Direct Air Capture: Machines Sucking CO2!
Microgrid Energy Resilience
Embracing Microgrids: Decentralizing Energy For Resilience [Revolutionize Your World]

GAMING

Best Way to Play Arknights on PC
The Best Way to Play Arknights on PC - Beginner’s Guide for Emulators
online gaming
Why Sign-Up Bonuses Are So Popular in Online Entertainment
How Online Gaming Platforms Build Trust
How Online Gaming Platforms Build Trust With New Users
Free-to-Play Casino Games and the Shift Toward Frictionless Digital Entertainment
Frictionless Digital Entertainment: The Rise of Free-to-Play Gaming
High-Risk and High-Reward Tactics in Modern Apps
Shooting the Moon: A Guide to High-Risk, High-Reward Tactics in Modern Apps

Business & Marketing

Marketing Agency Mistakes
The Most Common Mistakes New Agency Owners Make
How to Systematize Your Agency for Scalable Growth
How to Systematize Your Agency for Scalable Growth
agency branding strategy
Building an Agency Brand That Attracts Premium Clients
Generative AI Strategy
How to Build a Generative AI Strategy for Your Business in 2026
Marketing Agency Case Studies
How to Use Case Studies to Win More Agency Clients

Technology & AI

School Supply List
School Supply List Has Changed: Free Lifeline Phones, Laptops and Internet Are Leveling the Playing Field 
GPT-5.4 Security Risks
The Security Risks of GPT-5.4 Computer Use [And How To Protect Your Data]
Unified AI Tools for Content Creation & Multimedia
Top 10 Unified AI Tools for Content Creation: Master Your Multimedia Workflow on a Single Platform
Gemini Advanced Performance in Various Tasks Write Prompts For Gemini Advanced
How To Write Perfect Prompts For Gemini Advanced: Transform Your Skills!
How No-Code Platforms Empower Non-Developers
How No-Code SaaS Platforms are Empowering Non-Developers

Fitness & Wellness

Regenerative Baseline
Regenerative Baseline: The 2026 Mandatory Standard for Organic Luxury [Part 5]
Purposeful Walk Spaziergang
Mastering the Spaziergang: How a Purposeful Walk Can Reset Your Entire Week
Avtub
Avtub: The Ultimate Hub For Lifestyle, Health, Wellness, And More
Integrated Value Chain
The Resilience Framework: A Collaborative Integrated Value Chain Is Changing the Way We Eat [Part 4]
Nutrient Density Scoring
Beyond the Weight: Why Nutrient Density Scoring is the New Gold Standard for Food Value in 2026 [Part 3]