

For decades, the path from a conceptual molecule to a physical substance has been paved with trial, error, and the hard-earned intuition of master chemists. Designing a synthesis route—the step-by-step "recipe" to build a molecule—is a high-stakes puzzle where one wrong move can result in months of wasted lab time.
However, a breakthrough from researchers at the École Polytechnique Fédérale de Lausanne (EPFL) is changing the game. By combining the reasoning power of Large Language Models (LLMs) with traditional chemical software, a new framework called Synthegy is allowing scientists to "talk" their way to better chemistry.
Building a molecule isn't just about knowing the final structure; it’s about the sequence. Chemists must decide which bonds to form first, which sensitive parts of the molecule need "protection" from intermediate reactions, and which starting materials are most cost-effective.
Traditionally, software used for "retrosynthesis" (working backward from a target molecule to simple building blocks) produces hundreds of potential pathways. Sifting through these to find the one that fits a specific lab's goals—such as "minimising toxic waste" or "forming a specific ring structure early on"—has historically required a human expert to manually review every single option.
Led by Philippe Schwaller, the EPFL team developed Synthegy to bridge the gap between automated data and human intent. Unlike previous AI tools that attempted to "hallucinate" new molecules, Synthegy acts as a sophisticated filter and critic.
The workflow of Synthegy is elegant in its simplicity:
To test if the AI actually "thinks" like a chemist, the researchers conducted a double-blind study involving 36 independent chemists and 368 evaluations. The results were staggering: Synthegy’s rankings aligned with expert human judgment 71.2% of the time.
Interestingly, the study found that senior researchers and professors agreed with Synthegy even more frequently than PhD students. This suggests that the AI is successfully capturing the "strategic intuition" that usually takes decades of lab experience to develop.
Synthegy isn't just a librarian for recipes; it’s also a student of the reactions themselves. The framework can assist in "mechanism elucidation"—the study of exactly how electrons move during a chemical reaction. By breaking reactions down into elementary steps, the AI can assess the chemical plausibility of a proposed transformation. In tests involving simple reactions, the top-performing models achieved near-perfect accuracy.
The implications for the pharmaceutical and materials science industries are vast. In drug discovery, speed is everything. According to the study, Synthegy can evaluate 60 candidate routes in roughly 12 minutes for a cost of only $2–3 in API fees. This is a fraction of the time and cost it would take for a senior chemist to perform the same task.
While the system is currently most effective for routes under 20 steps and requires high-performing models (like Gemini-2.5-pro or GPT-4o) to be accurate, it represents a massive leap forward. By making the code and benchmarks publicly available on GitHub, the EPFL team has invited the global scientific community to build upon this foundation.
We are entering an era where the bottleneck in chemistry is no longer the complexity of the data, but the speed at which we can communicate our goals to the machines helping us build the future.
For more information, read the original article here:
👉 https://decrypt.co/367048/ai-chemistry-instructions-build-molecule
Disclaimer: This article is provided for informational purposes only, mistakes may be made, and it's not offered or intended to be used as legal, tax, investment, financial, or any other advice.
