Wednesday, December 10, 2025
Tag: Benchmark

Hugging Face Unveils mmBERT: A Multilingual Encoder Supporting Over 1,800 Languages

Hugging Face has introduced mmBERT, a multilingual encoder trained on over 3 trillion tokens across 1,833 languages. Building on the ModernBERT architecture, mmBERT outperforms...

Baidu Launches PP-OCRv5 on Hugging Face, Surpassing VLMs in OCR Performance Benchmarks

Baidu has launched PP-OCRv5 on Hugging Face. PP-OCRv5 is a specialized optical character recognition (OCR) model designed for superior performance in text recognition compared to large...

Evaluating AI Agents in Research: Insights from the Deep Research Bench Report

As large language models (LLMs) advance, they are increasingly marketed as powerful research assistants capable of undertaking complex tasks involving multi-step reasoning and data...

Google Unveils LMEval: An Open-Source Tool for Evaluating Cross-Provider LLMs

LMEval is a tool designed to help AI researchers and developers compare the performance of various large language models (LLMs) efficiently and accurately. Given...