Published Date : 11/06/2025
In 2024, artificial intelligence (AI), for the first time, helped win a Nobel Prize. DeepMind’s AlphaFold cracked one of biology’s hardest puzzles: protein folding, the challenge of predicting how a chain of amino acids twists into the intricate 3D shape that determines its function. Scientists had struggled with this problem for decades. It was crucial for medicine and drug discovery but seemed unsolvable due to the astronomical number of possible protein structures. Then AI delivered the answer.
A game-changer, no doubt. But it also raises the question: what does this mean for science and for scientists? Is traditional scientific inquiry becoming obsolete? Are we approaching a future where algorithms are the primary drivers of discovery, relegating humans to the sidelines?
Throughout history, every breakthrough technology has redefined how discoveries were made, marking four distinct eras of science. The first, the empirical era, relied on direct observation, as Copernicus challenged the Earth-centered view of the universe by observing the skies. The second, the theoretical era, introduced mathematics to predict nature, like Newton’s equations of motion that shaped physics for centuries. The third, the computational era, which began in the 1950s, harnessed computers to simulate complex systems, leading to Kohn and Pople’s quantum chemistry Nobel Prize. The fourth, the data-driven era of our 21st century, uses machine learning to extract patterns from vast datasets, with AlphaFold solving protein structures by learning from the protein data bank.
Today, we stand at the doorstep of the fifth era of science—the artificial scientific intelligence era—where companies like Google, Lila Sciences, and Sakana are unveiling AI scientists that not only assist research but drive discoveries, generate hypotheses, and test them on their own. Hence, why not let AI run the show from here?
In some fields, perhaps we can. In chemistry, organic synthesis—the process of assembling complex drug-like molecules from basic building blocks—is now guided by interpretable AI models that help scientists plan each step. In materials science, generative AI can design novel inorganic compounds with tailored mechanical, electronic, and magnetic properties, accelerating innovation with minimal human tuning. These are domains where experimental feedback is relatively tractable, simulations are mature, and the data is plentiful and structured. In short, these fields provide ideal conditions for autonomous AI exploration.
But in many other areas, letting an AI run the show today would be like sending a self-driving car down a dirt road with half a map and no GPS. AI might have the horsepower, but it still needs humans to steer it around the pitfalls of specialized scientific data. Nowhere is this clearer than in biomedical imaging, where highly curated datasets are nothing like what traditional large vision models are trained on.
First, biomedical imaging datasets are often tiny by AI standards, and for good reason: collecting them requires technical equipment and trained professionals; labeling them demands significant time and expert input; and strict privacy regulations often limit access. MedPix, a leading medical imaging database, contains just 59,000 images and the Allen Cell Feature Explorer, one of the largest publicly available collections of high-resolution 3D images of human stem cells, only around 32,000 images. That is about a thousand times fewer than what is needed for AI to perform. This is where scientists step in.
Scientists are redefining AI to do more with less, helping algorithms find meaning in images even when data are scarce. One clever approach involves using mathematical insights to redesign the core building blocks of neural networks. Traditional models fall apart when you strip away layers or parameters, but these new architectures stay strong—even with just a single layer and two convolutional filters—precisely because they are built to thrive on small data. And, scientists do not just bend the design of the model to fit the lack of data, they also reimagine the data ecosystem to power the model; they decide what data to collect, how to collect it, and how to cleverly weave together existing, but fragmented, specialized datasets to train AI models for a wide variety of tasks, including brain tumor classification or diabetic retinopathy grading.
But scientific data is not just scarce, it is often noisy. Cryo-electron microscopy (cryo-EM), a Nobel Prize-winning technology that lets us see the invisible—revealing molecules at the tiniest scale—produces incredibly blurry images, where the important details are 100 times weaker than the noise. It is like trying to recognize a friend in a crowd while wearing someone else’s prescription glasses. This stands in stark contrast to the crisp, high-resolution images—like street scenes, faces, or everyday objects—that traditional AI vision models are trained on.
Yet scientists have techniques to extract meaning from even the noisiest images. In cryo-EM, they can reconstruct the 3D shapes of molecules buried in noise; for example, providing the first high-resolution images of SARS-CoV-2 during the COVID-19 pandemic. Today, they are combining that hard-won expertise with the power of AI.
One breakthrough pairs a powerful denoising module with a foundation model, enabling AI to tackle the notoriously difficult processing steps of cryo-EM images. Crucially, this was only possible because scientists also applied their domain expertise to curate a high-quality dataset by cleaning, annotating, and aggregating 529 verified cryo-EM datasets into one large training set that AI could learn from.
It is clear that AI presents an enormous opportunity for science, potentially the most powerful tool we have ever had in our arsenal. But the fifth era of artificial scientific intelligence is not void of human scientists: quite the opposite. In many ways, the future of revolutionary discoveries lies in this synergy: human expertise guiding AI, and AI augmenting human expertise. It is as if we have hired the most overachieving and wildly enthusiastic intern; one who works at superhuman speed, never sleeps, and eagerly devours mountains of data. They hold exceptional potential, but without proper guidance anchored in scientific knowledge, they are more likely to set the lab on fire than to push science forward.
Instead of hoping AI will magically handle limited, noisy, specialized data, we need experts to tailor algorithms to the realities of fields like biology and medicine, and to tailor data to the new requirements of the AI technology. To enter the fifth era of science, we need to equip researchers with AI expertise, AI experts with domain knowledge, and universities with interdisciplinary programs. The labs that thrive will be those where domain experts and AI specialists work in sync or where scientists master both. The next scientific revolution will come from teams who can judiciously steer AI, knowing when to trust it, when to adjust its course, and when to drive it into uncharted territory.
Q: What is the fifth era of science?
A: The fifth era of science, also known as the artificial scientific intelligence era, is marked by the integration of AI into scientific research, where AI not only assists but drives discoveries, generates hypotheses, and tests them independently.
Q: How did DeepMind's AlphaFold contribute to the fifth era of science?
A: DeepMind's AlphaFold solved the complex problem of protein folding, a challenge that had eluded scientists for decades. This breakthrough marked the first time AI helped win a Nobel Prize and demonstrated the potential of AI in scientific discovery.
Q: What are the four previous eras of science?
A: The four previous eras of science are: 1) the empirical era, which relied on direct observation; 2) the theoretical era, which introduced mathematics to predict nature; 3) the computational era, which began in the 1950s and harnessed computers to simulate complex systems; and 4) the data-driven era, which uses machine learning to extract patterns from vast datasets.
Q: Why is human guidance still important in the fifth era of science?
A: Human guidance is crucial in the fifth era of science because AI still needs to be tailored to the specific conditions of specialized scientific data, which can be scarce, noisy, and highly curated. Scientists provide the domain expertise and data curation necessary for AI to perform effectively in fields like biology and medicine.
Q: What are some challenges in using AI for biomedical imaging?
A: Some challenges in using AI for biomedical imaging include the scarcity of high-quality datasets, the noise in images produced by techniques like cryo-electron microscopy, and the need for specialized domain knowledge to curate and annotate data. Scientists are developing new AI models and techniques to overcome these challenges.