Wednesday, October 15, 2025

IIT Delhi Study Reveals AI Models Excel at Basic Tasks Yet Struggle with Scientific Reasoning

A recent study by researchers from IIT Delhi and Friedrich Schiller University, published in Nature Computational Science, highlights significant limitations in current AI models when it comes to deeper reasoning. While these AI systems excel in basic scientific tasks, achieving high accuracy in identifying laboratory equipment, they struggle with critical skills like spatial reasoning and multistep logical inference essential for authentic scientific discovery. The study introduces MaCBench, a benchmark designed to assess AI performance in chemistry and materials science, revealing alarming disparities in safety evaluations—77% accuracy in recognition versus only 46% in safety reasoning. This suggests AI relies more on pattern matching than genuine understanding, raising concerns if implemented unsupervised in research. The researchers emphasize the need for human oversight in AI applications, particularly in safety-critical environments, and suggest future developments should prioritize comprehension over pattern recognition for effective human-AI collaboration.

Source link

Share

Read more

Local News