System Biology Center, NHLBI, National Institutes of Health (NIH), Bethesda, Maryland, US.
Introduction and Research Overview
I am a Research Fellow at the National Heart, Lung, and Blood Institute (NHLBI) at NIH, specializing in Computational Biology. My work is at the vanguard of epigenomics and 3D genomics, focusing on the intricate chromatin regulatory landscape that governs immunity, leukemia, and aging.
Trained in both bioengineering and computational biology, my research seamlessly integrates cutting-edge computational methods with innovative experimental techniques. I develop robust tools and pioneering protocols—such as high-resolution enhancer-promoter interaction mapping and single-cell nucleosome analysis—to decipher complex biological mechanisms.
My translational approach fosters cross-disciplinary partnerships with biologists, physicians, and data scientists to bridge fundamental science with clinical applications, delivering both mechanistic understanding and user-friendly software for the scientific community.
Academic Journey
| 2024–Curr. |
Research Fellow
System Biology Center, NHLBI, NIH, Bethesda, MD, USA.
Mentor: Dr. Keji Zhao
|
| 2019–2024 |
Visiting Fellow
NHLBI/NIH, Bethesda, MD, USA.
Mentor: Dr. Keji Zhao
|
| 2012–2018 |
Ph.D. in Computational Biology
Chinese Academy of Sciences (CAS)-Max Planck Society (MPG) Partner Institute for Computational Biology.
Mentor: Prof. Jing-Dong Jackie Han
|
| 2008–2012 |
B.S. in Bioengineering
East China University of Science and Technology, Shanghai, China.
|
Technical & Computational Expertise
- ✓ Epigenomic & 3D Genome Analysis: Expertise in Hi-C, HiChIP, ATAC-seq, ChIP-seq, MNase-seq, Hi-TrAC, RNA-seq, Micro-C, RRBS, EM-seq, and WES.
- ✓ Single-Cell Technologies: Proficient in 10x single-cell RNA-seq, 10x single-cell multiomics, and scPCOR-seq analysis.
- ✓ Programming & Software Development: Highly proficient in Python programming and package development (cLoops2, cLoops).
- ✓ Bioinformatics Infrastructure: Experienced in CPU/GPU server setup and maintenance, and web-application development (astroBoy, glitter).
- ✓ Machine Learning: Application and development of deep-learning and machine-learning algorithms.
Selected Publications
(* co-first author; # corresponding author) | Full List on Google Scholar
High-Impact First/Co-First Author Papers
-
Zhen, T. *, Cao, Y. *, et al. (2025). CBFB-SMMHC-driven leukemogenesis requires enhanced RUNX1-DNA binding affinity in mice. J Clin Invest. 10.1172/jci192923. LINK
-
Ma, F. *, Cao, Y. *, et al. (2024) Three-dimensional chromatin reorganization regulates B cell development during ageing. Nat Cell Biol. 1-12. LINK
-
Liu, S. *, Cao, Y. *, Cui, K*., et al. (2024) Regulation of T helper cell differentiation by the interplay between histone modification and chromatin interaction. Immunity. 57(5), 987-1004. LINK
-
Cao, Y. *, Liu, S. *, Cui, K. *, et al. (2023). Hi-TrAC detects active sub-TADs and reveals internal organizations of super-enhancers. Nucleic Acids Research, 51(12), 6172-6189. LINK
-
Cui, K. *, Chen, Z. *, Cao, Y. *, Liu, S. *, et al. (2023). Restraint of IFN-y expression through a distal silencer CNS-28 for tissue homeostasis. Immunity, 56(5), 944-958. (Selected as Cover Story & Preview). LINK
-
Cao, Y., Liu, S. *, Ren, G. *, et al. (2022). cLoops2: a full-stack comprehensive analytical tool for chromatin interactions. Nucleic Acids Research, 50(1), 57-71. LINK
-
Liu, S. *, Cao, Y. *, Cui, K., et al. (2022). Hi-TrAC reveals division of labor of transcription factors in organizing chromatin loops. Nature communications, 13(1), 1-17. LINK
-
Cao, Y. *, Chen, Z. *, Chen, X. *, Ai, D. *, et al. (2020). Accurate loop calling for 3D genomic data with cLoops. Bioinformatics, 36(3), 666-675. LINK
-
Zhen, T. *, Cao, Y. *, et al. (2020). RUNX1 and CBFB-SMMHC transactivate target genes together in abnormal myeloid progenitors for leukemia development. Blood, 136(21), 2373-2385. (Selected as Plenary Paper). LINK
-
Han, M. *, Li, J. *, Cao, Y. *, et al. (2020). A role for LSH in facilitating DNA methylation by DNMT1 through enhancing UHRF1 chromatin association. Nucleic Acids Research, 48(21), 12116-12134. LINK
-
Zeng, Y. *, Cao, Y. *, Halevy, R. S. *, et al. (2020). Characterization of functional transposable element enhancers in acute myeloid leukemia. Science China Life Sciences, 1-13. LINK
-
Huang, F. *, Cao, Y. *, et al. (2020). BMP2 signaling activation enhances bone metastases of non-small cell lung cancer. Journal of Cellular and Molecular Medicine, 24(18), 10768-10784. LINK
-
Cao, Y. *, Chen, G. *, Wu, G. *, et al. (2019). Widespread roles of enhancer-like transposable elements in cell identity and long-range genomic interactions. Genome research, 29(1), 40-52. LINK
-
Wang, L. *, Xu, X. *, Cao, Y. *, et al. (2016). Activin/Smad2-induced H3K27me3 reduction is crucial to initiate mesendoderm differentiation of human embryonic stem cells. Journal of Biological Chemistry, jbc-M116. LINK
Software Development
-
astroBoy (2022–Curr.): An automated, AI-assisted platform for large-scale NGS data analysis, supporting over 8,000 samples and 370 billion reads. Deployed at NHLBI/NIH.
-
cLoops2 (2021): A full-stack comprehensive analytical Python package for chromatin interaction data (e.g., Hi-TrAC, HiChIP, Hi-C). Nucleic Acids Research, 2022. GitHub
-
cLoops (2017): A Python package for accurate and flexible loop calling in 3D genomic data. Bioinformatics, 2020. GitHub
-
VINCE (2023–Curr.): A de novo Python package for single-cell MNase-seq nucleosome data analysis (Active Development).
-
Ryder (2025): Python package for epigenome data normalization with internal reference and variable feature detection (Manuscript Submitting).
-
glitter (2023–Curr.): A custom-built, in-house web application for the analysis and visualization of single-cell RNA-seq data.
-
TOWN (2019): A Python package for high-throughput, deep learning-augmented bright-field microscopy to automatically count live C. elegans worms.
Other Contributing Publications (Full List)
- Zhu, X., Chen, X., Cao, Y., et al. (2025). Optimal CXCR5 expression during Tfh maturation involves the Bhlhe40-Pou2af1 axis. Cell Reports, 44(11), 116470.
- Roy, S., Ren, M., Li, P., Cui, K., Cao, Y., et al. (2025). BLIMP1 negatively regulates IL-2 signaling in T cells. Science Advances, 11(29), eadx8105.
- Khateb, M., Jung, R., Leibou, S., Hadley, P., Yu, Z., Dinerman, A. J., Dulemba, V., Gasmi, B., Levin, N., Kim, P., Bhasin, A., Bhat, D., Sindiri, S., Gartner, J. J., Prickett, T. D., Benzine, T., Farid, S. S., Parkhurst, M. R., Halas, H., Cao, Y., et al. (2025). Rapid enrichment of progenitor exhausted neoantigen-specific CD8 T cells from peripheral blood. bioRxiv, 2025-05.
- Zhang, X., Jiang, Q., Li, J., Zhang, S., Cao, Y., et al. (2022). KCNQ1OT1 promotes genome-wide transposon repression by guiding RNA–DNA triplexes and HP1 binding. Nature Cell Biology, 1-13.
- Lin, X., Liu, Y., Liu, S., Zhu, X., Wu, L., Zhu, Y., Zhao, D., Xu, X., Chemparathy, A., Wang, H., Cao, Y., et al. (2022). Nested epistasis enhancer networks for robust genome regulation. Science, 377(6610), 1077-1085.
- Ren, G., Lai, B., Harly, C., Baek, S., Ding, Y., Zheng, M., Cao, Y., et al. (2022). Transcription factors TCF-1 and GATA3 are key factors for the epigenetic priming of early innate lymphoid progenitors toward distinct cell fates. Immunity, 55(8), 1402-1413.
- Pan, L., Ku, W. L., Tang, Q., Cao, Y., et al. (2022). scPCOR-seq enables co-profiling of chromatin occupancy and RNAs in single cells. Communications Biology, 5(1), 1-9.
- Fang, D., Cui, K., Cao, Y., et al. (2022). Differential regulation of transcription factor T-bet induction during NK cell development and T helper-1 cell differentiation. Immunity, 55(4), 639-655.
- Ku, W. L., Pan, L., Cao, Y., et al. (2021). Profiling single-cell histone modifications using indexing chromatin immunocleavage sequencing. Genome Research, 31(10), 1831-1842.
- Huang, F., Cao, Y., et al. (2021). PNMA5 promotes bone metastasis of non-small-cell lung cancer as a target of BMP2 signaling. Frontiers in Cell and Developmental Biology, 1400.
- Qiu, X., Ma, F., Zhao, M., Cao, Y., et al. (2020). Altered 3D chromatin structure permits inversional recombination at the IgH locus. Science Advances, 6(33), eaaz8850.
- Xia, X., Chen, X., Wu, G., Li, F., Wang, Y., Chen, Y., Chen, M., Wang, X., Chen, W., Xian, B., Chen, W., Cao, Y., et al. (2020). Three-dimensional facial-image analysis to predict heterogeneity of the human ageing rate and the impact of lifestyle. Nature Metabolism, 2(9), 946-957.
- Chen, X., Xu, C., Hong, S., Xia, X., Cao, Y., et al. (2019). Immune cell types and secreted factors contributing to inflammation-to-cancer transition and immune therapy response. Cell Reports, 26(7), 1965-1977.
- Xu, C., Ai, D., Shi, D., Suo, S., Chen, X., Yan, Y., Cao, Y., et al. (2018). Accurate drug repositioning through non-tissue-specific core signatures from cancer transcriptomes. Cell Reports, 25(2), 523-535.
- Chen, W., Qian, W., Wu, G., Chen, W., Xian, B., Chen, X., Cao, Y., et al. (2015). Three-dimensional human facial morphologies as robust aging markers. Cell Research, 25(5), 574-587.
- Huang, Y., Yu, X., Sun, N., Qiao, N., Cao, Y., et al. (2015). Single-cell-level spatial gene expression in the embryonic neural differentiation niche. Genome Research, 25(4), 570-581.
- Hong, S., Huang, Y., Cao, Y., et al. (2014). Approaches to uncovering cancer diagnostic and prognostic molecular signatures. Molecular & Cellular Oncology, 1(2), e957981.
Professional Service
-
Editorial Board: Frontiers in Medicine.
-
Peer Reviewer for Prestigious Journals: Nature Communications, Bioinformatics, Communications Biology, BMC Genomics, Genes, Cells, etc..