Wednesday, July 9, 2025

Hugging Face Unveils Enhanced Small Language Model with Advanced Reasoning Abilities

Hugging Face has unveiled SmolLM3, a 3-billion-parameter language model featuring long-context reasoning, multilingual support, and dual-mode inference. Released under the Apache 2.0 license and trained on 11.2 trillion tokens, SmolLM3 surpasses similarly sized competitors such as Llama-3.2-3B and Qwen2.5-3B and challenges larger models such as Gemma3 and Qwen3.

The model supports six languages (English, French, Spanish, German, Italian, and Portuguese) and handles context lengths of up to 128k tokens using the NoPE and YaRN techniques. It ships in both a base and an instruction-tuned version, with the instruction-tuned model letting users toggle between an extended-reasoning mode and a direct-answer mode, as in the sketch below.
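As a minimal sketch of that dual-mode behavior, the snippet below runs the same prompt with reasoning on and off via the Transformers library. It assumes the instruct checkpoint is published as `HuggingFaceTB/SmolLM3-3B` and that its chat template accepts an `enable_thinking` flag, as several dual-mode models do; consult the model card for the exact interface.

```python
# Sketch: toggling SmolLM3's reasoning mode (assumed model id and
# enable_thinking flag; verify both against the official model card).
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "HuggingFaceTB/SmolLM3-3B"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, device_map="auto")

messages = [{"role": "user", "content": "Summarize YaRN context extension in two sentences."}]

for thinking in (True, False):
    # enable_thinking is forwarded to the chat template and switches between
    # the extended-reasoning and direct-answer modes.
    prompt = tokenizer.apply_chat_template(
        messages,
        add_generation_prompt=True,
        enable_thinking=thinking,
        tokenize=False,
    )
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    outputs = model.generate(**inputs, max_new_tokens=256)
    new_tokens = outputs[0][inputs["input_ids"].shape[-1]:]
    print(f"--- thinking={thinking} ---")
    print(tokenizer.decode(new_tokens, skip_special_tokens=True))
```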

SmolLM3's training drew on web, code, and math datasets, with post-training methods such as Anchored Preference Optimization (APO) used to refine its performance. The model ranks highly across 12 benchmarks covering multilingual tasks and coding, and Hugging Face has published the full training process on GitHub. Following SmolLM2's success, the company continues to iterate on its small language models.
