Computational Systems Biology

Research Overview

Molecular regulations in cellular systems are central to health and disease. The Computational Systems Biology Unit, led by A/Prof. Pengyi Yang, focuses on developing computational methods to reconstruct molecular networks underlying cell identity and model their regulations in determining cell-fate decisions during stem cell differentiation and development.

Lab Head

Pengyi Yang

Unit Head, Computational Systems Biology

Available for Student Supervision

Unit Head, Computational Systems Biology

View full bio

Team Members

Katie Zyner

Senior Research Officer

View full bio

Chunlei Liu

Research Officer

View full bio

Di Xiao

Research Officer

[email protected]

View full bio

Carissa Chen

PhD Student

View full bio

Daniel Kim

PhD Student

View full bio

Research Projects

Themes

Theme I: Mapping and modelling trans-regulatory networks

Molecular trans-regulatory networks (TRNs) comprised of cell signalling, transcriptional, translational, and (epi)genomic regulations are central to health and disease. A major initiative in our group is to integrate trans-omic datasets generated by state‐of‐the‐art mass spectrometer (MS) and next-generation sequencer (NGS) from various cell systems for reconstructing TRNs and understand how different regulatory machineries (e.g. signalling, transcription, and epigenomics) co-operate to define cell states, functions, and fates.

We have previously developed various computational methods to integrate the multi-layered trans‐omic datasets generated during naive to formative pluripotency transition in embryonic stem cells (ESCs) (Yang et al. Cell Systems, 2019). Our current research project aims to further this study by developing methods to characterise signaling cascades, transcriptional networks, and protein networks and their cross‐talks with the aim of answering the following questions:

How do different layers of regulations talk to each other in controlling stem cell fate?
Can we accurately predict stem cell differentiation trajectories based on their TRNs?
What are the key mechanisms of stem/progenitor cells in establishing identities and making cell fate decisions.

Theme II: Single-cell biology and omics

Single-cell based omics are becoming the next wave of development in biotechnologies. promising to revolutionise our ability to study biological systems at an unprecedented resolution. Our group is working on multiple methodological development and lab experiment projects with the goal of characterising cellular systems and diseases at the single-cell level.

On the methodology front, we have recently developed a computational method together with Prof. Jean Yang's group for multiple single-cell RNA-seq data integration (Lin et al. PNAS, 2019). Our current research project aims to extend on this work by developing a suite of data processing, cell type characterisation, and network reconstruction methods and tools for single-cell omic data. In parallel, we are planning to conduct experiments to profile single cells in ESC populations and during their differentiation to multiple cell lineages. Research findings from these projects will directly contribute to our aim in addressing the three questions raised in Theme I.

Theme III: Computational methodology innovation in bioinformatics

Computational and statistical methods are at the core of our research. To tackle complex biological questions by utilising heterogenous omic data generated from various biotechnology, our group is specialised in developing novel computational methods for analysing (i) MS-based proteomic and phosphoproteomic data, and (ii) NGS-based RNA-seq, ChIP-seq, and Hi-C data.

Build on our long-term success in computational methodology innovation, the group is developing various machine learning and deep learning methods with targeted application to biological questions and omic data types. Example of our recent developments include a knowledge-based unsupervised learning method for kinase identification (Yang et al. PLoS Computational Biology, 2015) and a semi-supervised learning method for kinase-substrate prediction (Yang et al. Bioinformatics, 2016) from phosphoproteomic data. Continued innovation in computational and statistical methods will be a key force of our group in answering fundamental biological questions.

Note on publications below:

Bold: CSB group member

✢: Co-first author
†: Corresponding/Co-corresponding author

Key Publications

Full NCBI Bibliography.

View all publications by Pengyi Yang.

Connecting cilium, stress response, and proteostasis abnormalities inform variant and therapy assessment in RPGRIP1 retinal organoids

To Ha Loi, Anson Cheng, Hani Jieun Kim, Milan Fernando, Benjamin M. Nash, Nader Aryamanesh, John R. Grigg, Pengyi Yang, Anai Gonzalez-Cordero and Robyn V. Jamieson. Stem Cell Reports, Volume 20, Issue 12, 102717

DNA methylation and telomere length in 2-5 year olds with intrauterine preeclampsia exposure: a P4 sub-study

Hilary SY Toh, Lisheng Xu, Carissa Chen, Pengyi Yang, Alfred X Sun and John F Ouyang. Science Advances 11, eadu7944 (2025)

CLUEY enables knowledge-guided clustering and cell type detection from single-cell omics data.

Daniel Kim, Carissa Chen, Lijia Yu, Jean Yee Hwa Yang and Pengyi Yang. Bioinformatics, 41 (10), October 2025

Multi-view gene panel characterization for spatially resolved omics.

Daniel Kim, Wenze Ding, Akira Nguyen Shaw, Marni Torkel, Cameron J Turtle6, Pengyi Yang, and Jean Yang (2025). Briefings in Bioinformatics.

Multi-task benchmarking of single-cell multimodal omics integration methods.

Chunlei Liu, Sichang Ding, Hani Jieun Kim, Siqu Long, Di Xiao, Shila Ghazanfar and Pengyi Yang (2025). Nature Methods.

Trans-omic profiling uncovers molecular controls of the early human cerebral organoid formation.

Chen, C., Lee, S., Zyner, K., Fernando, M., Nemeruck, V., Wong, E., Marshall, L., Wark, J., Aryamanesh, N., Tam, P., Graham, M.^†, Gonzalez-Cordero, A.^† & Yang, P.^† (2024). Cell Reports, 43(5), 114219. [Repo]

PhosR enables processing and functional analysis of phosphoproteomic data.

Kim, H., Kim, T., Hoffman, N., Xiao, D., James, D., Humphrey S., Yang, P.^† (2021) Cell Reports, 34(8), 108771. [BioC R package]

Uncovering cell identity through differential stability with Cepo.

Kim, H., Wang, K., Chen, C., Lin, Y., Tam, PPL., Lin, D., Yang, J. & Yang, P.^† (2021) Nature Computational Science, 1, 784-790.

Transcriptional network dynamics during the progression of pluripotency revealed by integrative statistical learning.

Kim, H., Osteil, P., Humphrey, S., Cinghu, S., Oldfield, A., Patrick, E., Wilkie, E., Peng, G., Suo, S., Jothi, R., Tam, P. & Yang, P.^† (2020) Nucleic Acids Research, 48(4), 1828-1842.

Multi-omic profiling reveals dynamics of the phased progression of pluripotency

Yang, P.^✢†, Humphrey, S.^✢†, Cinghu, S.^✢, Pathania, R., Oldfield, A., Kumar, D., Perera, D., Yang, J., James, D., Mann, M. & Jothi, R.^† (2019) Cell Systems, 8(5), 427-445. [The Stem Cell Atlas]

scMerge leverages factor analysis, stable expression, and pseudoreplication to merge multiple single-cell RNA-seq datasets.

Lin, Y., Ghazanfar, S., Wang, K., Gagnon-Bartsch, J., Lo, K., Su, X., Han, Z., Ormerod, J., Speed, T., Yang, P.^† & Yang, J.^† (2019) Proceedings of the National Academy of Sciences of the United States of America, 116(20), 9775-9784. [BioC R package]

Intragenic enhancers attenuate host gene expression.

Cinghu, S.^✢, Yang, P.^✢, Kosak, J., Conway, A., Kumar, D., Oldfield, A., Adelman, K. & Jothi, R. (2017) Molecular Cell, 68(1), 104–117. [PDF]

Histone-fold domain protein NF-Y promotes chromatin accessibility for cell type-specific master transcription factors.

Oldfield, A.^✢, Yang, P.^✢, Conway, A., Cinghu, S., Freudenberg, J., Yellaboina, S. & Jothi, R. (2014). Molecular Cell, 55(5), 708-722. [PDF]

Major Achievements

2014

Discovered a master regulator for enhancerosome assembly in regulating stem cell identity

2017

Discovered intragenic enhancers in regulating stem cell identity

2019

Studied stem cell transition in naïve-formative pluripotency through trans-omic profiling

2021

Developed a computational method based on differential stability for identifying cell identity genes

2023

Characterised the fidelity of various eye organoids in recapitulating fetal and mature human eye

Developed a multi-task deep learning framework for integrative analysis of multimodal single-cell omics data

2024

Identified molecular controls during early human brain organoid formation using trans-omic profiling

What's next?

To develop a cell-fate engineering framework by computational modelling of multimodal single-cell omics data from stem cells and organoids

Computational Systems Biology

On this page:

Research Overview

Lab Head

Pengyi Yang

Team Members

Research Projects

Themes

Theme I: Mapping and modelling trans-regulatory networks

Theme II: Single-cell biology and omics

Theme III: Computational methodology innovation in bioinformatics

Note on publications below:

Key Publications

Major Achievements

2014

2017

2019

2021

2023

2024

What's next?