Pioneering the Next Milestone for

In July 2025, Skywork announced the release of its second-generation reward model series, Skywork-Reward-V2, expanding on its successful open-sourced models from September 2024. These models, available on HuggingFace and GitHub, have been downloaded over 750,000 times, achieving top rankings in seven evaluation benchmarks. The Skywork-Reward-V2 features eight models with parameters ranging from 600 million to 8 billion, demonstrating exceptional performance in tasks requiring human-aligned preferences. The underlying innovation, Skywork-SynPref-40M, is a hybrid dataset containing 40 million preference pairs, developed through a unique human-machine collaboration for data screening. This approach enhances the model’s efficiency and effectiveness, showcasing significant advancements in Reinforcement Learning from Human Feedback (RLHF). With strong generalization capabilities and superior performance compared to larger models, Skywork forecasts a pivotal role for unified reward systems in AI’s future infrastructure, guiding intelligent systems to align with human values. Visit Skywork.AI to explore the latest developments.

Source link

News

Company:

Join our community of SUBSCRIBERS and be part of the conversation.

Delhivery Limited Unveils AI-Powered Support Tool: Delhivery One SmartAssist

US Government Shifts Focus: State, Treasury, and HHS Depart Anthropic for OpenAI

OpenAI Modifies Pentagon Agreement to Prohibit NSA from Using AI

What Is the Intriguing Metallic Device Being Used by US Chief Design Officer Joe Gebbia?

Huawei Introduces Agentic MBB to Maximize Network Potential for Mobile AI

A Comprehensive Guide to Google’s Gemini CLI: Everything You Should Know

Show HN: Introducing Open-Source Article 12 Logging Infrastructure for Compliance with the EU AI Act

LLMs: Harnessing the Moral and Intellectual Legacy of a Pre-AI Era

Ask HN: What’s the Best Way to Report a Vulnerability When AI Responds to Company Emails?

Why Your AI DevOps Engineer Will Eventually Rely on Human Expertise

Pioneering the Next Milestone for

Counting AI: Understanding Individuation and Liability in Artificial Agents

China’s Innovative Companion: The Rise of AI Chatbots in Education

State Department Adopts OpenAI Chatbot as U.S. Agencies Transition Away from Anthropic — TradingView News

Apple to Enhance Siri with Google Gemini AI for Improved Privacy and Performance – Moneycontrol.com

Making the Move to Anthropic: Claude Now Imports Your ChatGPT, Gemini, and Copilot Memories – Fast Company

Local News

Delhivery Limited Unveils AI-Powered Support Tool: Delhivery One SmartAssist

A Comprehensive Guide to Google’s Gemini CLI: Everything You Should Know

US Government Shifts Focus: State, Treasury, and HHS Depart Anthropic for OpenAI

Show HN: Introducing Open-Source Article 12 Logging Infrastructure for Compliance with the EU AI Act

Delhivery Limited Unveils AI-Powered Support Tool: Delhivery One SmartAssist

A Comprehensive Guide to Google’s Gemini CLI: Everything You Should Know

US Government Shifts Focus: State, Treasury, and HHS Depart Anthropic for OpenAI