In synthesis labs, chemists often face challenges in determining the ideal reaction conditions for creating a desired product. To streamline this process, researchers like Timothy Newhouse and Victor S. Batista have developed MOSAIC (Multiple Optimized Specialists for AI-assisted Chemical Prediction), a novel framework utilizing large language models (LLMs). Instead of relying on a single LLM, MOSAIC consists of 2,498 specialized models trained on over one million reaction procedures sourced from patent literature.
This innovative system can analyze starting materials using SMILES notation and query relevant LLMs for recommendations on solvents, reagents, and reaction conditions. The framework significantly enhances prediction accuracy while minimizing computational requirements. In tests, MOSAIC achieved a match rate near 50% for known reactions and successfully predicted protocols for uncharted reactions in 35 out of 37 cases. As the platform evolves, it will likely integrate further developments in chemical synthesis and lab automation, promising a more intelligent and automated future for chemistry.