Sitting through a biology lecture in 2026 causes a subtle kind of vertigo. Heatmaps of 100 million cells, each disturbed by a distinct gene knockout and sorted and clustered by a model that has, in a sense, never seen a cell, now appear on slides that previously displayed Western blots and gel images. Sometimes the biologist in charge of the slides is also a biologist. They are frequently computer scientists who just learned how to pronounce “transcriptome” three years ago. It’s difficult to ignore the fact that the field has moved beneath everyone’s feet and that nobody is completely certain of the location of the new floor.
A large portion of that change can be attributed to AlphaFold, which in 2024 won its inventors the Chemistry Nobel Prize—a result that even experts in the field describe as happening more quickly than anticipated. Prior to AlphaFold, a postdoc could spend the majority of their career figuring out a protein’s structure. More than 200 million predicted structures are currently available for free download, indexing, and searching in an open database. The same atomic coordinates can be queried by researchers at MIT and Lahore or Lagos. A science is usually altered by that kind of knowledge redistribution. This one is already being altered.
| Topic Snapshot | Details |
|---|---|
| Field of Convergence | Artificial intelligence applied to molecular, cellular, and clinical biology |
| Defining Breakthrough | AlphaFold’s prediction of protein structures, awarded the 2024 Nobel Prize in Chemistry |
| Open Resource | More than 200 million protein structures freely accessible through AlphaFold DB |
| Active Research Hub | Arc Institute, Palo Alto — building “virtual cell” models from datasets covering 300+ million cells |
| Primary Applications | Drug discovery, single-cell analysis, cancer recurrence prediction, pandemic response |
| Core Limitation | Most current models learn correlation, not cause and effect |
| Key Players | Google DeepMind, MIT Lincoln Laboratory, Tsinghua University, Arc Institute |
| Future Direction | Causal-aware, multi-modal, theory-guided AI systems for precision medicine |
However, the more profound alteration is occurring at the cell level, one layer below. Organizations like the Arc Institute in Palo Alto are working to create what they refer to as a “virtual cell”—a model that can forecast how a real cell will react when a specific gene is turned off or a specific medication is administered to it. Just the data is astounding: thousands of perturbations, dozens of species, and hundreds of millions of single-cell measurements. It is still genuinely unclear whether any model can learn the logic of a cell from this, or if it can only learn to imitate its surface. It appears that investors think it can. There is greater division among scientists.
Almost all sincere discussions about AI in biology today revolve around this conflict between correlation and cause. A model can indicate that pancreatic tumors exhibit a particular pattern of gene expression. It can’t explain why on its own. Some researchers feel that the field is moving faster than its own interpretability, a feeling that is rarely expressed on conference posters. The response “the model said so” won’t be sufficient when a clinician eventually needs to defend a treatment choice influenced by a neural network.

The wider ramifications are beginning to become apparent outside of the labs. Using generative chemistry, pharmaceutical companies have completely rebuilt their discovery pipelines. Hospitals are testing systems that read patient histories more quickly than a senior physician would in order to prioritize cases of rare diseases. The same tools that are lowering the barrier to therapeutic design may also lower the barrier to something darker: pathogens created with chatbot assistance, according to national security analysts. On the same desk are both options.
Walking through any of this, it’s striking how incomplete everything feels. The instruments are amazing. What constitutes a cell, what constitutes a disease, and what constitutes health are questions that predate all technological advancements. These questions have not been addressed by AI. It has made them feel accountable for the first time in a long time. It will likely take this decade to see if that promise comes true or if it quietly fades like so many tech revolutions do.




