Researchers use novel machine-learning algorithm to create atlas of paediatric cancer with potential as universal diagnostic platform

In the first broad comparison of paediatric and adult cancer, researchers at The Hospital for Sick Children (SickKids) have analyzed 13,000 individual cancers and built an ‘atlas’ of paediatric cancer using a novel machine-learning algorithm.

The diagnosis of cancer is, for an estimated 18.1 million people worldwide per year, mostly reliant on the microscopic examination and detection of specific proteins. The accuracy of these methods is variable, and improvements are not easily shared between institutes. This is especially true for paediatric cancer, which is the most frequent cause of death-by-disease in children past infancy in the developed world.

Dr Adam Shlien, Senior Scientist in the Genetics & Genome Biology programme and Associate Director in the Department of Paediatric Laboratory Medicine, SickKids

Dr Adam Shlien, Senior Scientist in the Genetics & Genome Biology programme and Associate Director in the Department of Paediatric Laboratory Medicine, SickKids

“As the burden of cancer increases worldwide, the complexity of cancer diagnostics is expected to grow unless new methods are developed,” explains Dr Adam Shlien, a Senior Scientist in the Genetics & Genome Biology program whose team developed this algorithm. “Our platform can be used at any hospital to increase the speed and accuracy of diagnosing cancer, even for rare types.”

Described in a new study published in Nature Medicine [1], this machine-learning algorithm classifies every known major type of childhood cancer and can refine, or match, a given cancer diagnosis for 85 per cent of paediatric cancer patients.

Unlike other tools for detection and diagnosis, such as a cancer panel test which looks for mutations in specific genes or other methods which may analyze the genome alone, this machine-learning algorithm analyzes a person’s entire transcriptome. While the genome is made up of all the DNA in a cell, only a portion of this genetic code is copied into RNA molecules, known as the transcriptome. “Just because you have a very busy cancer genome, doesn’t mean that everything is being acted upon,” says Dr Federico Comitani, a Research Associate in the Genetics & Genome Biology program and first author on the study. “By analyzing the full transcriptome, we can find each tumour’s core features and collect a clearer picture of cancer activity specific to each individual.”

In addition to identifying significant differences between cancer types, the large amount of data collected by the research team and magnification provided by the platform allowed researchers to identify 455 subtypes of cancer. This large number of subtypes lends support to the idea that most childhood cancers share a common ancestry and then differentiate into a multitude of specific tumour subtypes. “We were able to see, for the first time, subtle differences within cancer subtypes. Childhood cancers display more transcriptional variability – the number of the genes expressed in a cell – than adult cancers,” says Shlien, who holds a Canada Research Chair in Childhood Cancer Genomics and is an Associate Director in the Department of Paediatric Laboratory Medicine. “This gives us a radically new way to look at cancer and potentially identify the prognosis of cancers, and the possibility of changing our understanding of cancer.”

SickKids Cancer Sequencing programme

The tool is already playing an important role in the faster and more accurate diagnosis of cancer as part of the SickKids Cancer Sequencing programme (KiCS) [2], which provides comprehensive genetic sequencing for children with cancer [3].

In cases of neuroblastoma, the most common extra-cranial solid tumour in children, the subtypes identified by this tool predicted significant differences in tumour differentiation and patient survival. Similarly, findings from the platform explained the inconsistent response of sarcomas, tumours of the bone and soft tissue, to immunotherapy by uncovering an imbalance of immune cells, informing potential new therapeutic approaches.

“As we add more samples to this growing atlas and validate it with even larger data sets and sample types, our classifier has the potential to become a universal test for diagnosing paediatric cancer,” says Shlien.

This RNA platform is currently being used on a research-use only basis by a number of early adopter cancer centres worldwide, allowing physicians to compare their patient’s diagnosis to cancer types identified by the platform and receive a digital diagnosis. With support from Industry Partnerships & Commercialization (IP&C) at SickKids, work is also underway to bring this tool to the broader community as a platform to enable diagnostic testing and the acceleration of cancer drug product development.

References
1. Comitani, F., Nash, J.O., Cohen-Gogo, S. et al. Diagnostic classification of childhood cancer using multiscale transcriptomics. Nat Med 29, 656–666 (2023). doi: https://doi.org/10.1038/s41591-023-02221-x
2. https://kicsprogram.com/
3. https://www.sickkids.ca/en/news/archive/2023/sickkids-study-demonstrates-how-comprehensive-genetic-sequencing-informs-a-new-standard-of-cancer-care/