Building a virtual cell capable of accurately simulating cellular behaviors in silico has long been a dream in computational biology. We introduce CellFlux, an image-generative model that simulates cellular morphology changes induced by chemical and genetic perturbations using flow matching.
Building a virtual cell capable of accurately simulating cellular behaviors in silico has long been a dream in computational biology.
We introduce CellFlux, an image-generative model that simulates cellular morphology changes induced by chemical and genetic perturbations using flow matching.
Unlike prior methods, CellFlux models distribution-wise transformations from unperturbed to perturbed cell states, effectively distinguishing actual perturbation effects from experimental artifacts such as batch effects—a major challenge in biological data.
When evaluated on chemical (BBBC021), genetic (RxRx1), and combined perturbation (JUMP) datasets, CellFlux generates biologically meaningful cell images that faithfully capture perturbation-specific morphological changes, achieving a 35% improvement in FID scores and a 12% increase in mode-of-action prediction accuracy over existing methods. Additionally, CellFlux enables continuous interpolation between cellular states, offering a promising tool for studying perturbation dynamics. These capabilities represent a substantial step toward advancing virtual cell modeling in biomedical research.
Click to jump to each section.
Our method makes a meaningful contribution to computational biology, providing researchers with new tools for virtual cell modeling and the study of in-silico drug screening and personalized therapy development. The open-source implementation enables the community to build upon this work and accelerate progress in this field.
Our method aims to predict changes in cell morphology induced by chemical or gene perturbations in silico. The approach addresses the fundamental challenge of distinguishing true biological effects from experimental artifacts through innovative distribution-to-distribution modeling.
Key components of CellFlux include:
Our key innovation lies in formulating cellular morphology prediction as a distribution-to-distribution learning problem. Traditional methods often ignore control cells and treat perturbation prediction as a noise-to-image generation problem. In contrast, our approach recognizes that:
This formulation enables our method to generate more accurate cellular responses while maintaining robustness across diverse experimental conditions.
Our method leverages flow matching—a principled generative modeling approach designed for distribution-to-distribution tasks—to learn continuous transformations between cellular states.
During training, our model learns a velocity field by fitting trajectories between control cell images (x₀ ~ p₀) and perturbed cell images (x₁ ~ p₁). At each training step, intermediate states are sampled along linear interpolations, and the network minimizes the difference between predicted and true velocities.
At inference, the trained velocity field guides the transformation of control cell states into perturbed states by solving an ordinary differential equation iteratively using numerical integration steps.
Key Technical Components include:
Our method achieves strong performance across multiple cellular imaging datasets, demonstrating significant improvements in both image generation quality and biological fidelity.
Our results demonstrate significant improvements:
Our method enables new capabilities of batch effect correction and trajectory modeling, advancing the field of virtual cell modeling.
Key capabilities include:
These capabilities advance our understanding of cellular behavior and provide powerful tools for drug discovery and personalized therapy development.
We introduce CellFlux, a method that leverages flow matching to generate cell images under various perturbations while capturing their trajectories, paving the way for the development of a virtual cell framework for biomedical research. In future work, we plan to scale up CellFlux to process terabytes of imaging data encompassing diverse cell types and a wide range of perturbations, enabling the full potential of virtual cell modeling.
@inproceedings{CellFlux,
title={{CellFlux: Simulating Cellular Morphology Changes via Flow Matching}},
author={Zhang, Yuhui and Su, Yuchang and Wang, Chenyu and Li, Tianhong and Wefers, Zoe and Nirschl, Jeffrey and Burgess, James and Ding, Daisy and Lozano, Alejandro and Lundberg, Emma and Yeung-Levy, Serena},
booktitle={International Conference on Machine Learning (ICML)},
year={2025}
}
We thank Cambrian for providing this elegant project page template, which is available here.