Verification of prognostic expression biomarkers is improved by examining enriched leukemic blasts rather than mononuclear cells from acute myeloid leukemia patients

Pogosova-Agadjanyan, Era L.; Hua, Xing; Othus, Megan; Appelbaum, Frederick R.; Chauncey, Thomas R.; Erba, Harry P.; Fitzgibbon, Matthew P.; Jenkins, Isaac C.; Fang, Min; Lee, Stanley C.; Moseley, Anna; Naru, Jasmine; Radich, Jerald P.; Smith, Jenny L.; Willborg, Brooke E.; Willman, Cheryl L.; Wu, Feinan; Meshinchi, Soheil; Stirewalt, Derek L.

doi:10.1186/s40364-023-00461-0

Research
Open access
Published: 16 March 2023

Verification of prognostic expression biomarkers is improved by examining enriched leukemic blasts rather than mononuclear cells from acute myeloid leukemia patients

Era L. Pogosova-Agadjanyan¹,
Xing Hua²,
Megan Othus²,
Frederick R. Appelbaum^1,3,
Thomas R. Chauncey^3,4,
Harry P. Erba⁵,
Matthew P. Fitzgibbon⁶,
Isaac C. Jenkins^1,7,
Min Fang¹,
Stanley C. Lee¹,
Anna Moseley²,
Jasmine Naru¹,
Jerald P. Radich^1,3,
Jenny L. Smith^1,8,
Brooke E. Willborg¹,
Cheryl L. Willman⁹,
Feinan Wu⁶,
Soheil Meshinchi^1,8 &
…
Derek L. Stirewalt^1,3

Biomarker Research volume 11, Article number: 31 (2023) Cite this article

2228 Accesses
1 Citations
1 Altmetric
Metrics details

Abstract

Background

Studies have not systematically compared the ability to verify performance of prognostic transcripts in paired bulk mononuclear cells versus viable CD34-expressing leukemic blasts from patients with acute myeloid leukemia. We hypothesized that examining the homogenous leukemic blasts will yield different biological information and may improve prognostic performance of expression biomarkers.

Methods

To assess the impact of cellular heterogeneity on expression biomarkers in acute myeloid leukemia, we systematically examined paired mononuclear cells and viable CD34-expressing leukemic blasts from SWOG diagnostic specimens. After enrichment, patients were assigned into discovery and validation cohorts based on availability of extracted RNA. Analyses of RNA sequencing data examined how enrichment impacted differentially expressed genes associated with pre-analytic variables, patient characteristics, and clinical outcomes.

Results

Blast enrichment yielded significantly different expression profiles and biological pathways associated with clinical characteristics (e.g., cytogenetics). Although numerous differentially expressed genes were associated with clinical outcomes, most lost their prognostic significance in the mononuclear cells and blasts after adjusting for age and ELN risk, with only 11 genes remaining significant for overall survival in both cell populations (CEP70, COMMD7, DNMT3B, ECE1, LNX2, NEGR1, PIK3C2B, SEMA4D, SMAD2, TAF8, ZNF444). To examine the impact of enrichment on biomarker verification, these 11 candidate biomarkers were examined by quantitative RT/PCR in the validation cohort. After adjusting for ELN risk and age, expression of 4 genes (CEP70, DNMT3B, ECE1, and PIK3CB) remained significantly associated with overall survival in the blasts, while none met statistical significance in mononuclear cells.

Conclusions

This study provides insights into biological information gained/lost by examining viable CD34-expressing leukemic blasts versus mononuclear cells from the same patient and shows an improved verification rate for expression biomarkers in blasts.

Background

AML is one of the most common and deadly hematopoietic malignancies. Like many cancers, the incidence of AML increases with age, such that the median age at diagnosis is 68 years. Patients with AML can receive a variety of different therapies, ranging from disease modifying agents to myeloablative allogeneic transplants. When deciding optimal care, physicians and patients must consider multiple factors, including age, performance status, and likelihood of a favorable response to therapy. Over the last three decades, many prognostic biomarkers have been identified for adult patients with AML. These prognostic biomarkers have been incorporated into the European LeukemiaNet (ELN) risk classification [1], which remains the gold standard for prognostication of patients with AML [2, 3].

ELN risk is currently determined by a combination of cytogenetics, selective mutations, and preceding predisposition to the development of AML (i.e., history of myelodysplasia, etc.). Gene expression forms one of the major cornerstones supporting the intrinsic biology of leukemic cells, with transcription being regulated by multiple genetic and epigenetic regulators. Although multiple expression biomarkers and profiles have been shown to correlate with clinical outcome, none are currently utilized in ELN risk stratification for AML [4,5,6,7,8,9,10,11,12,13,14,15,16,17,18]. There are likely multiple reasons for the lack of expression biomarkers translating into clinical practice, ranging from problems with reproducibility to technical issues for implementing assays in clinical setting. Nevertheless, given the importance of transcription, it seems that transcript biomarkers would hold the potential promise to inform and improve upon ELN risk classification in the clinical setting – especially if we could identify means to make these transcript biomarkers more reliable and reproducible.

Current biomarker assays primarily examine either total nucleated cells or bulk mononuclear cells (MNCs), with the leukemic blast percentage varying from 20 to 100% in AML. This inter-specimen variability alters the quantitative expression [19], which likely impacts results of analyses investigating differentially expressed genes (DEGs). We hypothesized that eliminating the dying, non-leukemic and differentiated AML blasts (CD34-) may provide a unique window into the biology arising from known clinical prognostic factors, while potentially improving the prognostic performance of expression biomarkers. Therefore, we systematically examined the transcriptomes of paired bulk MNCs and viable leukemic blasts expressing CD34 (VLBs^CD34+) from diagnostic AML specimens (Fig. 1). We focused on patients with CD34+ leukemia (AML^CD34+), the most common immunophenotype in AML, to facilitate the enrichment of less differentiated VLBs (i.e., CD34+) and improve homogeneity of the examined cells. Furthermore, we included patients across a broad range of ages to examine the potential impact of age on the results, given that most current biomarker studies have been limited to younger patients and many of these prognostic biomarkers are less informative for older patients [3, 20, 21]. Analyses identified DEGs associated with sample source (blood vs. marrow), cell populations (MNCs vs. VLBs^CD34+), clinical characteristics, and outcomes. Pathway analyses showed that the information derived from the transcriptome was dependent on the studied cell population. Adjusting for age and ELN risk eliminated most DEGs associated with prognosis in univariate analyses, allowing to focus verification efforts on a select number of genes. Studies examining these prognostic DEGs in an independent cohort of patients showed a higher rate of verification using RNA from the VLBs^CD34+ than bulk MNCs.

Methods

Patient materials

A review of SWOG Cancer Research Network leukemia repository inventory identified 351 out of 1042 previously untreated AML patients with pretreatment samples potentially containing enough cryopreserved vials for the proposed studies and who received intensive therapy with curative intent. Patients were enrolled onto protocols SWOG-9031, SWOG-9333, S0106 and S0112 and treated as previously described [22,23,24,25]. Specimen handling and cryopreservation were consistent across the trials as previously described [3]. All participants provided written informed consent in compliance with the Declaration of Helsinki, and studies were conducted with the approval of the Fred Hutchinson Cancer Center Institution Review Board.

Thawing, fluorescence-activated cell sorting (FACS), and nucleic acid extraction

Cryopreserved samples were thawed as previously described [3, 19]. A portion of bulk MNCs was lysed, while the remainder underwent FACS to isolate VLBs^CD34+ using forward by side scatter, DAPI staining and fluorescently-labeled antibodies to CD45, CD34, CD38, and CD117 as previously described [3, 19]. DNA and RNA were extracted from the bulk MNCs and VLBs^CD34+, quantified and assessed for quality as previously described [3, 19].

Identification of genomic mutations

Internal tandem duplications in FLT3 (FLT3-ITDs) were identified and censored at an upper limit of 20 as previously described [20, 26]. TruSight™ Myeloid Sequencing Panel (Illumina, San Diego, CA, USA) was used for DNA sequencing. TruSight™ platform provided inadequate coverage for CEBPA and NRAS Exon 3; therefore, in-house targeted MiSeq assays were developed to cover these loci (Table S1A). See Supplemental Methods for additional details regarding alignment, annotation, and quality controls [27,28,29,30,31,32,33]. All mutation data have been provided in Tables S1B and C. Nucleotide changes that were not classified as somatic mutations are provided in Table S1D.

RNA sequencing for transcript biomarkers

RiboErase (Roche, Wilmington, MA, USA) was utilized to deplete ribosomal RNA as per manufacture recommendations. Transcriptome libraries were generated using KAPA Stranded RNA-Seq Library Preparation Kit (KAPA Biosystems/Roche Sequencing Solutions, Inc., Wilmington, MA, USA) [34], and sequenced in batches using either Illumina HiSeq 2500 (HiSeq) or NovaSeq 6000 (NovaSeq) instruments (Illumina, San Diego, CA, USA). Transcripts were mapped, aligned, and quantified using a standard bioinformatic pipeline of software for RNA sequencing (RNAseq) as described in Supplemental Methods [35,36,37,38,39,40,41,42,43,44,45]. Normalized count per million mapped fragment (CPM) and fragments per kilobase of exon per million mapped fragments (FKPM), as well as filtering parameters are provided in Tables S2A-C.

Quantitative RT/PCR of transcript biomarkers

TaqMan™ gene expression assays were purchased from ThermoFischer Scientific (Waltham, MA, USA). The list of the genes, targeted loci, and assays are provided in Table S3. Transcript expression was quantified as described in Supplemental Methods and previous reports [3, 6, 45].

Statistical analyses

From the available cohort of 351 patients we expected to have approximately 200 patients with a CD34+ phenotype. Patients were sorted and RNA was extracted sequentially in random order until 96 RNA samples (the size of one plate and approximately half of the expected 200 patients) from patients with CD34+ phenotype were identified and assigned to discovery cohort. While the DNA and RNA analyses in the discovery cohort were ongoing, the remaining samples were extracted and assayed for quality. All patients with CD34+ phenotype were to be included in the second, validation, cohort. A total of 67 patients with CD34+ phenotype had sufficient material for downstream studies.

Cytogenetic risk was categorized per the ELN guidelines [1]. Complete remission (CR) required the following: > 20% marrow cellularity with maturation of all cell lineages, < 5% blasts, no Auer rods, ANC ≥1500/μL, platelets > 100,000/μL, no peripheral blasts, and no extramedullary disease. The one exception was CR for S0106, which required all the previously mentioned characteristics for CR, but utilized ANC ≥1000/μL rather than ANC ≥1500/μL. Overall survival (OS) was measured from the date of study registration to the date of death by any cause, with patients last known to be alive censored at the date of last contact. Quantitative and categorical factors were compared between groups using Wilcoxon Rank-sum tests and Fisher’s exact tests, respectively. OS was estimated using the Kaplan-Meier method and compared between groups using log-rank tests.

Univariate logistic regression and Cox proportional hazard models were used to evaluate the association between CR and OS, respectively, and the log-2 transformed FPKM values of each gene. Multivariable analyses adjusted for gender, age at study registration, cytogenetic risk, ELN risk, or age and ELN risk. Logistic regression models and the Cox proportional hazard models were also used to evaluate associations between the leukemic stem cell 17-biomarker (LSC17) signature and CR and OS, respectively [14].

Significance of RNA expression change was defined as a combination of FDR and either fold change (FC), odds ratio (OR) or hazard ratio (HR), as appropriate. An FDR < 0.01 and FC > 2 or < 0.5 was considered significant in analyses examining DEGs in paired samples: batch effect, instrument effect, impact of tissue source, and comparisons between MNCs and VLBs^CD34+. Significant DEGs associated with gender, age, cytogenetic and ELN risk were defined using FDR < 0.1 in combination with following FCs: gender = FC > 2 or < 0.5; age = FC > 1.1 or < 0.9 per unit change (unit for age = 10 years); cytogenetics = FC > 2 or < 0.5 for one or more comparisons between cytogenetic risk groups; and ELN risk = FC > 2 or < 0.5 for one or more comparisons between ELN risk groups. Significant DEGs for clinical outcomes were defined as an FDR < 0.1 and OR > 1.5 or < 0.66 (CR) or HR > 1.5 or < 0.66 (OS). In the validation cohort, significance was defined as P-value < 0.05, regardless of clinical effect size.

Biomolecular pathway analyses

Reactome (http://reactome.org) was utilized as a means to analyze and visualize biomolecular pathways associated with identified sets of genes as previously described [46]. Lists of significant DEGs were uploaded into the Reactome Analysis Tools to analyze the pathways associated with DEGs. Significance for pathway association with gene list was defined as FDR < 0.1 [46].

Results

Patient characteristics

Flow cytometry examined diagnostic specimens from 351 AML patients for blast viability and CD34 expression (Fig. 1). Flow studies revealed that samples from 163 patients (46%) expressed CD34 and had adequate amounts of blasts for downstream studies (Table S4A), while samples from 188 patients were excluded due to insufficient VLBs^CD34+ (N = 26) or lack of CD34 expression (N = 162). Analyses compared the mutation profiles between included and excluded patients (Fig. S1, Table S4B). The included AML^CD34+ patients had a lower frequency of FLT3-ITD mutations (19 vs. 42%, P < 0.001), and fewer mutations in NPM1 (6 vs. 62%, P < 0.001), DNMT3A (19 vs. 40%, P < 0.001), RUNX1 (7% vs. 17%, P = 0.002), and TET2 (9 vs. 20%, P = 0.002). In addition, the included AML^CD34+ patients were more likely to be categorized as adverse ELN risk (39 vs. 17%, P < 0.001). Despite these differences, the CR and OS were not significantly different between included AML^CD34+ and excluded patients (Table S4B). These 163 AML^CD34+ patients with adequate material were assigned into discovery (N = 96) and validation (N = 67) cohorts (Fig. 1) and similar analyses compared characteristics of patients in two cohorts were done (Table S4C). These comparison analyses showed no significant difference with respect to gender, blast percentages, commonly detected mutations, ELN risk, or CR. OS was slightly higher in the discovery cohort as compared to the validation cohort (P = 0.05).

Impact of batch effect and sequencing instrument on transcriptome

Batch effect can introduce significant expression changes and lead to erroneous results [47]. To assess for the potential impact of batch effect, libraries were prepared at three different time points using RNA from 4 diagnostic MNC specimens and then sequenced using the same instrument. Thirty-nine DEGs showed significant associations with batch effect (Figs.S2A and S3A; Table S5A), all of which represented non-coding RNAs. We also examined the impact of sequencing instruments on DEGs, given the availability and frequent use of HiSeq 2500 vs. NovaSeq 6000 platforms. Libraries were prepared using RNA from MNC specimens (N = 5) and sequenced on both instruments. Applying the same definition for significance, 255 DEGs were significantly associated with the sequencing instrument (Figs. S2B and S3B; Table S5B). Again, the vast majority of the DEGs were non-coding (N = 246, 96.5%). Overall, 15 DEGs were significant in batch effect and in instrument analyses (overlap).

Impact of tissue source on transcriptome

Peripheral blood (PB) and bone marrow (BM) specimens are frequently included and analyzed together in AML biomarker studies. Therefore, we investigated the potential impact of tissue source (PB vs. BM) on RNAseq results using paired PB and BM samples from 3 AML patients. For each tissue source, bulk MNCs and VLBs^CD34+ were examined, providing 4 different RNA sources: MNCs/PB, MNCs/BM, VLBs^CD34+/PB, and VLBs^CD34+/BM. Libraries were prepared and sequenced on the HiSeq 2500 instrument. Comparison analyses (PB vs. BM) were performed separately for MNCs and VLBs^CD34+, identifying 244 significant DEGs in analyses comparing MNCs/PB versus MNCs/BM (Figs. S2C and S3C; Table S5C). The vast majority of the 244 DEGs represented coding genes, either for immunoglobulin-related proteins (N = 63, 26%) or other known coding proteins (N = 159, 65%). Analyses comparing VLBs^CD34+/PB versus VLBs^CD34+/BM identified 53 significant DEGs between the two cell populations, 83% (44/53) representing coding genes (Figs. S2D and S3D; Table S5D). Most of the immunoglobulin-related DEGs from MNC comparisons (MNCs/PB vs. MNCs/BM) were not significant in the VLBs^CD34+ analyses. Overall, 34 DEGs were significant in both analyses comparing PB versus BM (Table S5E). Most of the overlapping transcripts (29/34, 85%) represented coding genes in a variety of different pathways: apoptosis/cell cycle, cell adhesion, cell signaling, histone maintenance/modification, mitochondria/metabolism, and transcription regulation.

Expression differences between MNCs and VLBs^CD34+

Ninety-six specimens had RNAseq data from paired bulk MNCs and VLBs^CD34+ (Fig. 1). Transcripts associated with batch effect, sequencing instrument, and tissue source were eliminated from MNCs versus VLBs^CD34+ analyses, as well as subsequent studies examining associations with clinical characteristics and outcomes. We identified 767 DEGs that were significant between the MNCs and VLBs^CD34+ (Fig. 2A; Table S5F). These DEGs led to a noticeable shift in the principal component analyses (PCA) plot comparing bulk MNCs and VLBs^CD34+ (Fig. 2B). Most DEGs between the two cell populations represented coding (N = 376, 49%) and immunoglobulin (N = 69, 9%) genes. The remainder included pseudogenes (N = 180, 23%), snoRNA/snRNA (N = 40, 5%), lncRNAs (N = 40, 5%), miRNA (N = 7, 1%) and other noncoding RNA variants (N = 55, 7%). The vast majority of DEGs (758/767, 98.8%) were decreased in the VLBs^CD34+ relative to the MNCs. Analyses showed a significant negative correlation (Rho = − 0.54, P-value < 0.001) between blast percentages and Euclidean distances, indicating that a lower blast percentage resulted in a greater separation between the bulk MNCs and VLBs^CD34+ (Fig. 2C).

Transcript correlation with clinical characteristics

Expression data from the paired MNCs and VLBs^CD34+ (N = 96) were examined for DEGs associated with gender, age, WBC, cytogenetic risk, and ELN risk. Gender was significantly associated with expression of a small number of DEGs (MNCs = 12 and VLBs^CD34+ = 8; overlap = 4; Fig. S4A; Table S6A). Increasing age was significantly associated with 133 and 289 DEGs in the MNCs and VLBs^CD34+, respectively (overlap = 105, Fig. S4B; Table S6B). There were 392 and 455 DEGs (overlap = 270) significantly associated with cytogenetic risk in MNCs and VLBs^CD34+, respectively (Fig. S4C; Table S6C), while 168 and 313 DEGs (overlap = 145) were significantly correlated with ELN risk in MNCs and VLBs^CD34+, respectively (Fig. S4D; Table S6D).

Lists of DEGs significantly associated with age, cytogenetics, and ELN risk were analyzed using Reactome software [46]. Pathways were then ranked from most to least significant by increasing FDR. The lists of age-related DEGs were significantly associated with 2 and 15 pathways using data from the MNCs and VLBs^CD34+, respectively (Table S7A-C). The top age-related pathway identified in MNCs was ranked 6 in VLBs^CD34+, while the top age-related pathway in the VLBs^CD34+ were ranked 237 in the MNCs. For the lists of DEGs significantly associated with cytogenetics, we identified 13 and 4 significant pathways in the MNCs and VLBs^CD34+, respectively. The top cytogenetic-related pathway in the MNCs was ranked 718 in the VLBs^CD34+, while the top cytogenetic-related pathways in the VLBs^CD34+ was ranked 41 in the MNCs (Table S7D-F). Many of the most significant pathways associated with cytogenetics in the MNCs involved lymphocyte function, and their significance, as measured by rank, was substantially diminished by enriching for VLBs^CD34+, while pathways involving transcription regulation seem to be enriched by examining the VLBs^CD34+ (Table 1). With respect to the genes associated with ELN risk, the same 2 pathways were significant in both MNCs and VLBs^CD34+ (Table S7G-I).

Table 1 Comparison of significant pathways enriched by DEGs associated with cytogenetic risk in paired bulk MNCs versus VLBs

Full size table

Transcript expression associated with clinical responses

Univariate analyses identified 3 and 712 DEGs significantly associated with CR in the MNCs and VLBs^CD34+, respectively (Table S8A). Given the known association of age and cytogenetic risk with CR rates, a multivariable model incorporating age, cytogenetics, and gender was developed to adjust for these variables. After accounting for these variables, none of the transcripts remained significantly associated with CR in either the MNC or VLBs^CD34+ data (Table S8B). Similarly, all transcripts lost their significance after adjusting for ELN risk and age (Table S8C). Univariate analyses identified total of 2556 and 2678 DEGs significantly associated with OS in the MNCs and VLBs^CD34+, respectively (overlap = 1771; union = 3463; Fig. 3A and B; Table S8D). Multivariable analyses adjusting for gender, age, and cytogenetic risks identified 101 and 69 transcripts significantly associated with OS in MNCs and VLBs^CD34+, respectively (overlap = 38; union = 132; Fig. 3C and D; Table S8E). Multivariable analyses adjusting for ELN risk and age identified 38 and 20 transcripts significantly associated with OS in the MNCs and VLBs^CD34+, respectively (overlap = 14; union = 44; Fig. 3E and F; Table S8F).

Prognostic significance of the LSC17 in the discovery patients

LSC17 signature has been validated using MNC specimens from pediatric and adult patients with AML [14, 48]. We examined the performance of the LSC17 signature in the paired MNCs and VLBs^CD34+ using P-value < 0.05 to define statistical significance. Univariate analyses showed that the LSC17 model was associated with a reduced CR rate, which was statistically significant in VLBs^CD34+ data (OR = 0.91, P-value = 0.039) and borderline significant in the MNC data (OR = 0.93, P-value = 0.119; Table S8). After adjusting for age and ELN risk, the LSC17 signature was no longer significantly associated with CR in either the MNCs or VLBs^CD34+ (P-values = 0.808 and 0.860, respectively; Table S8). The LSC17 signature was significantly associated with OS in MNCs (HR 1.11, P = 0.00053) and VLBs^CD34+ (HR 1.11, P = 0.00003), remaining prognostically significant for both the MNCs (HR = 1.07, P = 0.023) and VLBs^CD34+ (HR = 1.07, P = 0.00876) after adjusting for age and ELN risk (Fig. S5; Table S9).

Validation of prognostic transcripts in MNCs and VLBs^CD34+

For the validation studies, we focused on DEGs that were significant in both MNCs and VLBs^CD34+ after adjusting for ELN risk and age (genes = 14; Table S8G). As a means to validate these DEGs, we chose to employ real-time quantitative reverse transcription/polymerase chain reaction (Q-RT/PCR) assays, which are currently utilized in clinical practice for many diseases, including AML [49]. Therefore, Q-RT/PCR assays were obtained for the coding transcripts (11 of the 14 DEGs), and we examined the association between their RNA expression and OS in paired MNCs and VLBs^CD34+ from patients in validation cohort (N = 67; Fig. 1). After adjusting for age and ELN risk, only DNMT3B expression was close to meeting statistical significance for OS in the bulk MNCs (P = 0.06, HR = 1.23), while expression for 4 genes (CEP70, DNMT3B, ECE1, and PIK3CB) remained significantly associated with OS in the VLBs^CD34+ (Table 2; Fig. 4). In addition, SEMA4D and TAF8 were borderline significant for adverse OS (P-values ≤0.1; Table 2).

Table 2 Identification and validation of prognostic transcripts

Full size table

Discussion

The impact of pre-analytic variables and non-leukemic cells on expression remains to be precisely defined but clearly impacts both RNA and protein expression profiling [3, 50]. Our analyses identified a relatively large number of DEGs between paired bulk MNCs and VLBs^CD34+ (N = 767), and as expected, the relative amount of DEGs was inversely correlated with blast percentage (Fig. 2C). By enriching for VLBs^CD34+, we were able to mitigate the impact of the transcription signal from non-leukemic and dead/dying cells, which also negated some of the impact that tissue source (PB vs. BM) had on the transcriptome. Most of the significant DEGs between MNCs and VLBs^CD34+ were expressed at lower levels in the VLBs^CD34+, and not surprisingly, many of the DEGs involved immunoregulatory pathways and/or coded for immunoglobulins. The impact of cellular heterogeneity on the transcription profile was further demonstrated when comparing the biological pathways associated with age and cytogenetic risk. For example, the list of DEGs associated with cytogenetic risk (i.e., cytogenetic-related) primarily enriched for pathways involving lymphocyte signaling in the MNCs, while the cytogenetic-related DEGs in VLBs^CD34+ enriched for pathways associated with transcription regulation (Table 1). Overall, the results highlight the impact that non-leukemic cells and/or more differentiated leukemic blasts have on transcriptome profile, as well as the information gained and lost by examining bulk MNCs as compared to VLBs^CD34+. These findings also underscore the importance of examining the appropriate cell populations to answer specific research questions. Given the complexity and differential activity of molecular pathways across hematopoietic cell lineages, studies of mixed populations of cells may impede the ability to dissect out which cell populations are contributing to the biological signal of interest. This phenomenon seemed to be most pronounced when examining potentially biologically meaningful DEGs associated with age and cytogenetics.

Although previous biomarker studies have identified a large number of prognostic transcripts associated with clinical outcomes [4,5,6,7,8,9,10,11,12,13,14,15,16,17,18], relatively few studies have shown that these transcript profiles remain significant after adjusting for other prognostic factors. Thus, our studies were primarily designed to identify DEGs after adjusting for ELN and age – the two most informative prognostic risk factors for AML [1, 3, 20, 51], while simultaneously examining the potential impact of cellular heterogeneity on DEGs. As with other studies, we identified many prognostically significant DEGs prior to adjusting for age and ELN risk. However, most DEGs lost significance after adjusting for age and ELN risk (Fig. 3). We also examined the prognostic significance of the LSC17 signature, which has been shown to be prognostic for pediatric and adult patients with AML [14, 48]. Unlike most prognostic transcript profiles, the LSC17 was first derived by examining leukemic stem cells and then applied to bulk MNCs. In our analyses, we showed that the LSC17 was statistically associated with OS in both cell sources and remained significant after adjusting for age and ELN risk (Fig. S5). However, the clinical effect size in our analyses was relatively small (HR = 1.07).

Previous prognostic studies have primarily examined the transcriptome in bulk MNCs from patients with AML [4,5,6,7,8,9,10,11,12,13,14,15,16,17,18, 48], and studies have not systematically compared the ability to verify prognostic DEGs in paired MNCs and VLBs^CD34+. After adjusting for age and ELN risk, we identified a modest number of coding DEGs (N = 11) with comparable statistical significance and clinical effect sizes in both MNCs and VLBs^CD34+ (Table 2). Focusing on these DEGs, we investigated if examining their expression in VLBs^CD34+ may improve their verification rate. A targeted approach using real-time Q-RT/PCR was selected for verification since this methodology remains the “gold standard” for independent validation of transcript expression [39, 52] and is currently utilized in clinical practice [49]. In the MNCs, expression of the DNMT3B was borderline significant (P = 0.061, HR = 1.23), while expressions for the other genes were not statistically significant (Table 2). The results from VLBs^CD34+ showed a higher rate of prognostic verification. Four DEGs were significantly associated with OS, with all four displaying a HR ≥1.30. Furthermore, the expressions of SEMA4D and TAF8 were borderline significant in VLBs^CD34+ (Table 2). Some DEGs produced markedly different results using RNA from the paired MNCs and VLBs^CD34+. For example, CEP70 was most prognostically significant in VLBs^CD34+ (P = 0.002, HR = 1.68) but failed to meet significance in MNCs (P = 0.282, HR = 1.14). Thus, while some DEGs may be informative in MNCs compared to VLBs^CD34+ or potentially vice versa, our results suggest an increased likelihood of verification and thus, potentially the performance of biomarker assays examining biologically relevant cells. At a minimum, the pathway analyses indicated that biological information elucidated by investigating the transcriptome is different depending upon homogeneity and cell populations.

We focused our analyses on specimens expressing CD34 to facilitate enrichment of a homogenous population of less differentiated VLBs. Given the focus on CD34-expressing leukemia, the results may not be generalizable to other immunophenotypes of AML, and the sample numbers may have limited our ability to identify and verify some prognostic transcripts – especially those with modest clinical effect sizes. Despite these potential limitations, the results show for the first time an increased rate of verification of prognostic biomarkers in enriched leukemic blasts and highlight the challenges of examining heterogenous specimens and the need for additional studies examining the impact of cellular heterogeneity on biomarkers in AML.

Conclusions

This study provides novel insights into biological information gained/lost by examining bulk MNCs versus VLBs^CD34+. In addition, the results show a potential benefit for validating expression biomarkers in purified populations of AML blasts. However, additional studies are warranted in larger numbers of samples to verify the relative benefit of biomarker assessment in VLB^CD34+ and translate findings into clinically compliant assays.

Availability of data and materials

The datasets generated and/or analyzed during the current study are available in the dbGaP repository, dbGaP under accession number phs002805.v1.p1. Investigators can apply to access sequencing data through standard dbGaP request procedures as described by NIH and found at dbgap_request_process.pdf (nih.gov). Additional data generated or analyzed during this study are included in the supplementary information files.

Abbreviations

AML :: Acute Myeloid Leukemia
AML ^CD34+ :: Patients with AML whose blasts express CD34+ ell surface antigen
ANC :: Absolute Neutrophil Count
BM :: Bone Marrow
CD :: Cluster of Differentiation
CD117 :: Antibody for proto-oncogene encoding the receptor tyrosine kinase protein known as tyrosine-protein kinase KIT
CD34 :: Antibody for transmembrane phosphoglycoprotein present on hematopoietic stem cells
CD38 :: Antibody for cyclic ADP ribose hydrolase is a glycoprotein found on the surface of many immune cells
CD45 :: Antibody for receptor linked protein tyrosine phosphatase present in all cells of the hematopoietic lineage except erythrocytes and plasma cells
CEP70 :: Centrosomal Protein 70
CPM :: Count per Million Mapped Fragment
CR :: Complete Remission
DAPI :: 4′,6-Diamidino-2-Phenylindole
DEG :: Differentially Expressed Gene
DNA :: Deoxyribonucleic Acid
DNMT3A :: DNA Methyltransferase 3 Alpha
DNMT3B :: DNA Methyltransferase 3 Beta
ECE1 :: Endothelin Converting Enzyme 1
ELN :: European LeukemiaNet (ELN) risk classification
FACS :: Fluorescence-activated Cell Sorting
FC :: Fold Change
FDR :: False Discovery Rate
FKPM :: Fragments per Kilobase of Exon per Million Mapped Fragments
FLT3 :: Fms Related Receptor Tyrosine Kinase 3
HiSeq :: Illumina HiSeq 2500 Instrument
HR :: Hazard Ratio
ITD :: Internal Tandem Duplication
lncRNAs :: Long Noncoding RNA
LSC17 :: Leukemic Stem Cell 17-biomarker Signature
miRNA :: MicroRNA
MNCs :: Mononuclear Cells
NovaSeq :: Illumina NovaSeq 6000 Instrument
NPM1 :: Nucleophosmin 1
NRAS :: NRAS Proto-Oncogene, GTPase
OR :: Odds Ratio
OS :: Overall Survival
PB :: Peripheral Blood
PCA :: Principal Component Analyses
PIK3CB :: Phosphatidylinositol-4,5-Bisphosphate 3-Kinase Catalytic Subunit Beta
Q-RT/PCR :: Quantitative Reverse Transcription Polymerase Chain Reaction
RNA :: Ribonucleic Acid
RNAseq :: RNA Sequencing
RT/PCR :: Reverse Transcription Polymerase Chain Reaction
RUNX1 :: RUNX Family Transcription Factor 1
SEMA4D :: Semaphorin 4D
snoRNA :: Small Nucleolar RNA
snRNA :: Small Nuclear RNA
TAF8 :: TATA-Box Binding Protein Associated Factor 8
TET2 :: Tet Methylcytosine Dioxygenase 2
VLBs ^CD34+ :: Viable Leukemic Blasts Expressing CD34
WBC :: White Blood Cell Count

References

Dohner H, Estey E, Grimwade D, Amadori S, Appelbaum FR, Buchner T, et al. Diagnosis and management of AML in adults: 2017 ELN recommendations from an international expert panel. Blood. 2017;129(4):424–47.
Article PubMed PubMed Central Google Scholar
Dohner K, Thiede C, Jahn N, Panina E, Gambietz A, Larson RA, et al. Impact of NPM1/FLT3-ITD genotypes defined by the 2017 European LeukemiaNet in patients with acute myeloid leukemia. Blood. 2020;135(5):371–80.
Article PubMed PubMed Central Google Scholar
Pogosova-Agadjanyan EL, Moseley A, Othus M, Appelbaum FR, Chauncey TR, Chen IL, et al. AML risk stratification models utilizing ELN-2017 guidelines and additional prognostic factors: a SWOG report. Biomark Res. 2020;8(29):1–13.
Google Scholar
Bullinger L, Dohner K, Bair E, Frohling S, Schlenk RF, Tibshirani R, et al. Use of gene-expression profiling to identify prognostic subclasses in adult acute myeloid leukemia. N Engl J Med. 2004;350(16):1605–16.
Article CAS PubMed Google Scholar
Valk PJ, Verhaak RG, Beijen MA, Erpelinck CA, Barjesteh van Waalwijk van Doorn-Khosrovani S, Boer JM, et al. Prognostically useful gene-expression profiles in acute myeloid leukemia. N Engl J Med. 2004;350(16):1617–28.
Article CAS PubMed Google Scholar
Haferlach T, Kohlmann A, Schnittger S, Dugas M, Hiddemann W, Kern W, et al. Global approach to the diagnosis of leukemia using gene expression profiling. Blood. 2005;106(4):1189–98.
Article CAS PubMed Google Scholar
Radmacher MD, Marcucci G, Ruppert AS, Mrozek K, Whitman SP, Vardiman JW, et al. Independent confirmation of a prognostic gene-expression signature in adult acute myeloid leukemia with a normal karyotype: a Cancer and Leukemia Group B study. Blood. 2006;108(5):1677–83.
Article CAS PubMed PubMed Central Google Scholar
Gentles AJ, Plevritis SK, Majeti R, Alizadeh AA. Association of a leukemic stem cell gene expression signature with clinical outcomes in acute myeloid leukemia. JAMA. 2010;304(24):2706–15.
Article CAS PubMed PubMed Central Google Scholar
de Jonge HJ, Woolthuis CM, Vos AZ, Mulder A, van den Berg E, Kluin PM, et al. Gene expression profiling in the leukemic stem cell-enriched CD34+ fraction identifies target genes that predict prognosis in normal karyotype AML. Leukemia. 2011;25(12):1825–33.
Article PubMed Google Scholar
Eppert K, Takenaka K, Lechman ER, Waldron L, Nilsson B, van Galen P, et al. Stem cell gene expression programs influence clinical outcome in human leukemia. Nat Med. 2011;17(9):1086–93.
Article CAS PubMed Google Scholar
Rockova V, Abbas S, Wouters BJ, Erpelinck CA, Beverloo HB, Delwel R, et al. Risk stratification of intermediate-risk acute myeloid leukemia: integrative analysis of a multitude of gene mutation and gene expression markers. Blood. 2011;118(4):1069–76.
Article CAS PubMed Google Scholar
Taskesen E, Bullinger L, Corbacioglu A, Sanders MA, Erpelinck CA, Wouters BJ, et al. Prognostic impact, concurrent genetic mutations, and gene expression features of AML with CEBPA mutations in a cohort of 1182 cytogenetically normal AML patients: further evidence for CEBPA double mutant AML as a distinctive disease entity. Blood. 2011;117(8):2469–75.
Article CAS PubMed Google Scholar
Li Z, Herold T, He C, Valk PJ, Chen P, Jurinovic V, et al. Identification of a 24-gene prognostic signature that improves the European LeukemiaNet risk classification of acute myeloid leukemia: an international collaborative study. J Clin Oncol. 2013;31(9):1172–81.
Article CAS PubMed PubMed Central Google Scholar
Ng SW, Mitchell A, Kennedy JA, Chen WC, McLeod J, Ibrahimova N, et al. A 17-gene stemness score for rapid determination of risk in acute leukaemia. Nature. 2016;540(7633):433–7.
Article CAS PubMed Google Scholar
Wang M, Lindberg J, Klevebring D, Nilsson C, Mer AS, Rantalainen M, et al. Validation of risk stratification models in acute myeloid leukemia using sequencing-based molecular profiling. Leukaemia. 2017;31(10):2029–36.
Article CAS Google Scholar
Tyner JW, Tognon CE, Bottomly D, Wilmot B, Kurtz SE, Savage SL, et al. Functional genomic landscape of acute myeloid leukaemia. Nature. 2018;562(7728):526–31.
Article CAS PubMed PubMed Central Google Scholar
Docking TR, Parker JDK, Jadersten M, Duns G, Chang L, Jiang J, et al. A clinical transcriptome approach to patient stratification and therapy selection in acute myeloid leukemia. Nat Commun. 2021;12(1):2474.
Article CAS PubMed PubMed Central Google Scholar
Mer AS, Heath EM, Madani Tonekaboni SA, Dogan-Artun N, Nair SK, Murison A, et al. Biological and therapeutic implications of a unique subtype of NPM1 mutated AML. Nat Commun. 2021;12(1):1054.
Article CAS PubMed PubMed Central Google Scholar
Pogosova-Agadjanyan EL, Moseley A, Othus M, Appelbaum FR, Chauncey TR, Chen IL, et al. Impact of specimen heterogeneity on biomarkers in repository samples from patients with acute myeloid leukemia: a SWOG report. Biopreserv Biobank. 2018;16(1):42–52.
Article CAS PubMed PubMed Central Google Scholar
Ostronoff F, Othus M, Lazenby M, Estey E, Appelbaum FR, Evans A, et al. Prognostic significance of NPM1 mutations in the absence of FLT3-internal tandem duplication in older patients with acute myeloid leukemia: a SWOG and UK National Cancer Research Institute/Medical Research Council report. J Clin Oncol. 2015;33(10):1157–64.
Article PubMed PubMed Central Google Scholar
Eisfeld AK, Kohlschmidt J, Mrozek K, Blachly JS, Walker CJ, Nicolet D, et al. Mutation patterns identify adult patients with de novo acute myeloid leukemia aged 60 years or older who respond favorably to standard chemotherapy: an analysis of Alliance studies. Leukemia. 2018;32(6):1338–48.
Article PubMed PubMed Central Google Scholar
Anderson JE, Kopecky KJ, Willman CL, Head D, O’Donnell MR, Luthardt FW, et al. Outcome after induction chemotherapy for older patients with acute myeloid leukemia is not improved with mitoxantrone and etoposide compared to cytarabine and daunorubicin: a Southwest Oncology Group study. Blood. 2002;100(12):3869–76.
Article CAS PubMed Google Scholar
Petersdorf SH, Rankin C, Head DR, Terebelo HR, Willman CL, Balcerzak SP, et al. Phase II evaluation of an intensified induction therapy with standard daunomycin and cytarabine followed by high dose cytarabine for adults with previously untreated acute myeloid leukemia: a Southwest Oncology Group study (SWOG-9500). Am J Hematol. 2007;82(12):1056–62.
Article CAS PubMed Google Scholar
Godwin JE, Kopecky KJ, Head DR, Willman CL, Leith CP, Hynes HE, et al. A double-blind placebo-controlled trial of granulocyte colony-stimulating factor in elderly patients with previously untreated acute myeloid leukemia: a Southwest Oncology Group study (9031). Blood. 1998;91(10):3607–15.
Article CAS PubMed Google Scholar
List AF, Kopecky KJ, Willman CL, Head DR, Persons DL, Slovak ML, et al. Benefit of cyclosporine modulation of drug resistance in patients with poor-risk acute myeloid leukemia: a Southwest Oncology Group study. Blood. 2001;98(12):3212–20.
Article CAS PubMed Google Scholar
Thiede C, Steudel C, Mohr B, Schaich M, Schakel U, Platzbecker U, et al. Analysis of FLT3-activating mutations in 979 patients with acute myelogenous leukemia: association with FAB subtypes and identification of subgroups with poor prognosis. Blood. 2002;99(12):4326–35.
Article CAS PubMed Google Scholar
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25(14):1754–60.
Article CAS PubMed PubMed Central Google Scholar
Wang K, Li M, Hakonarson H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 2010;38(16):e164.
Article PubMed PubMed Central Google Scholar
Goode DL, Hunter SM, Doyle MA, Ma T, Rowley SM, Choong D, et al. A simple consensus approach improves somatic mutation prediction accuracy. Genome Med. 2013;5(9):90.
Article PubMed PubMed Central Google Scholar
Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016;536(7616):285–91.
Article CAS PubMed PubMed Central Google Scholar
Whiffin N, Minikel E, Walsh R, O'Donnell-Luria AH, Karczewski K, Ing AY, et al. Using high-resolution variant frequencies to empower clinical genome interpretation. Genet Med. 2017;19(10):1151–8.
Article PubMed PubMed Central Google Scholar
Goldstein AM, Stidd KC, Yang XR, Fraser MC, Tucker MA. Pediatric melanoma in melanoma-prone families. Cancer. 2018;124(18):3715–23.
Article CAS PubMed Google Scholar
Van der Auwera G, O’Conner B. Genomics in the Cloud: using Docker, GATK, and WDL in Terra, 1st ed. Sebastopol: O’Reilly Meida; 2020. https://www.oreilly.com/about/contact.html
Ross MG, Russ C, Costello M, Hollinger A, Lennon NJ, Hegarty R, et al. Characterizing and measuring bias in sequence data. Genome Biol. 2013;14(5):R51.
Article PubMed PubMed Central Google Scholar
Schuierer S, Carbone W, Knehr J, Petitjean V, Fernandez A, Sultan M, et al. A comprehensive assessment of RNA-seq protocols for degraded and low-quantity samples. BMC Genomics. 2017;18(1):442.
Article PubMed PubMed Central Google Scholar
Tarazona S, Garcia-Alcalde F, Dopazo J, Ferrer A, Conesa A. Differential expression in RNA-seq: a matter of depth. Genome Res. 2011;21(12):2213–23.
Article CAS PubMed PubMed Central Google Scholar
Liu Y, Zhou J, White KP. RNA-seq differential expression studies: more sequence or more replication? Bioinformatics. 2014;30(3):301–4.
Article CAS PubMed Google Scholar
Sims D, Sudbery I, Ilott NE, Heger A, Ponting CP. Sequencing depth and coverage: key considerations in genomic analyses. Nat Rev Genet. 2014;15(2):121–32.
Article CAS PubMed Google Scholar
Consortium SM-I. A comprehensive assessment of RNA-seq accuracy, reproducibility and information content by the Sequencing Quality Control Consortium. Nat Biotechnol. 2014;32(9):903–14.
Article Google Scholar
Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, et al. STAR: ultrafast universal RNA-seq aligner. Bioinformatics. 2013;29(1):15–21.
Article CAS PubMed Google Scholar
Wang L, Wang S, Li W. RSeQC: quality control of RNA-seq experiments. Bioinformatics. 2012;28(16):2184–5.
Article CAS PubMed Google Scholar
Frankish A, Diekhans M, Ferreira AM, Johnson R, Jungreis I, Loveland J, et al. GENCODE reference annotation for the human and mouse genomes. Nucleic Acids Res. 2019;47(D1):D766–D73.
Article CAS PubMed Google Scholar
Robinson MD, McCarthy DJ, Smyth GK. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics. 2010;26(1):139–40.
Article CAS PubMed Google Scholar
Robinson MD, Oshlack A. A scaling normalization method for differential expression analysis of RNA-seq data. Genome Biol. 2010;11(3):R25.
Article PubMed PubMed Central Google Scholar
Oehler VG, Guthrie KA, Cummings CL, Sabo K, Wood BL, Gooley T, et al. The preferentially expressed antigen in melanoma (PRAME) inhibits myeloid differentiation in normal hematopoietic and leukemic progenitor cells. Blood. 2009;114(15):3299–308.
Article CAS PubMed PubMed Central Google Scholar
Fabregat A, Sidiropoulos K, Viteri G, Forner O, Marin-Garcia P, Arnau V, et al. Reactome pathway analysis: a high-performance in-memory approach. BMC Bioinformatics. 2017;18(1):142.
Article PubMed PubMed Central Google Scholar
Hornung R, Causeur D, Bernau C, Boulesteix AL. Improving cross-study prediction through addon batch effect adjustment or addon normalization. Bioinformatics. 2017;33(3):397–404.
Article CAS PubMed Google Scholar
Duployez N, Marceau-Renaut A, Villenet C, Petit A, Rousseau A, Ng SWK, et al. The stem cell-associated gene expression signature allows risk stratification in pediatric acute myeloid leukemia. Leukemia. 2019;33(2):348–57.
Article CAS PubMed Google Scholar
Schuurhuis GJ, Heuser M, Freeman S, Bene MC, Buccisano F, Cloos J, et al. Minimal/measurable residual disease in AML: consensus document from ELN MRD Working Party. Blood. 2018;131(12):1275–91.
Article CAS PubMed PubMed Central Google Scholar
Horton TM, Hoff FW, van Dijk A, Jenkins GN, Morrison D, Bhatla T, et al. The effects of sample handling on proteomics assessed by reverse phase protein arrays (RPPA): functional proteomic profiling in leukemia. J Proteome. 2021;233:104046.
Article CAS Google Scholar
Keren-Froim N, Heering G, Sharvit G, Zlotnik M, Nagler A, Shimoni A, et al. ELN 2017 classification significantly impacts the risk of early death in acute myeloid leukemia patients receiving intensive induction chemotherapy. Ann Hematol. 2022;101(2):309–16.
Article CAS PubMed Google Scholar
Corchete LA, Rojas EA, Alonso-Lopez D, De Las Rivas J, Gutierrez NC, Burguillo FJ. Systematic comparison and assessment of RNA-seq procedures for gene expression quantitative analysis. Sci Rep. 2020;10(1):19737.
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

The authors wish to gratefully acknowledge the important contributions of Dr. John Godwin and the late Dr. Stephen H. Petersdorf to SWOG and to the study S0106. The authors would like to acknowledge that much of the preliminary optimization for these studies utilized specimens obtained from the Fred Hutchinson Cancer Center/University of Washington Hematopoietic Diseases Repository.

Funding

This work was funded by the following NIH grant awards: R01CA190661, R01CA160872, U10CA180888, U10CA180819, U24CA196175, and P30CA015704.

Author information

Authors and Affiliations

Clinical Research Division, Fred Hutchinson Cancer Center, 1100 Fairview Ave N, D5-112, Seattle, WA, 98109, USA
Era L. Pogosova-Agadjanyan, Frederick R. Appelbaum, Isaac C. Jenkins, Min Fang, Stanley C. Lee, Jasmine Naru, Jerald P. Radich, Jenny L. Smith, Brooke E. Willborg, Soheil Meshinchi & Derek L. Stirewalt
SWOG Statistical Center, Fred Hutchinson Cancer Center, Seattle, WA, USA
Xing Hua, Megan Othus & Anna Moseley
Departments of Oncology and Hematology, University of Washington, Seattle, WA, USA
Frederick R. Appelbaum, Thomas R. Chauncey, Jerald P. Radich & Derek L. Stirewalt
VA Puget Sound Health Care System, Seattle, WA, USA
Thomas R. Chauncey
Duke Cancer Institute, Durham, NC, USA
Harry P. Erba
Bioinformatics Shared Resource, Fred Hutchinson Cancer Center, Seattle, WA, USA
Matthew P. Fitzgibbon & Feinan Wu
Clinical Biostatistics, Fred Hutchinson Cancer Center, Seattle, WA, USA
Isaac C. Jenkins
Department of Pediatrics, University of Washington, Seattle, WA, USA
Jenny L. Smith & Soheil Meshinchi
Department of Laboratory Medicine and Pathology, Mayo Clinic Comprehensive Cancer Center, Rochester, MN, USA
Cheryl L. Willman

Authors

Era L. Pogosova-Agadjanyan
View author publications
You can also search for this author in PubMed Google Scholar
Xing Hua
View author publications
You can also search for this author in PubMed Google Scholar
Megan Othus
View author publications
You can also search for this author in PubMed Google Scholar
Frederick R. Appelbaum
View author publications
You can also search for this author in PubMed Google Scholar
Thomas R. Chauncey
View author publications
You can also search for this author in PubMed Google Scholar
Harry P. Erba
View author publications
You can also search for this author in PubMed Google Scholar
Matthew P. Fitzgibbon
View author publications
You can also search for this author in PubMed Google Scholar
Isaac C. Jenkins
View author publications
You can also search for this author in PubMed Google Scholar
Min Fang
View author publications
You can also search for this author in PubMed Google Scholar
Stanley C. Lee
View author publications
You can also search for this author in PubMed Google Scholar
Anna Moseley
View author publications
You can also search for this author in PubMed Google Scholar
Jasmine Naru
View author publications
You can also search for this author in PubMed Google Scholar
Jerald P. Radich
View author publications
You can also search for this author in PubMed Google Scholar
Jenny L. Smith
View author publications
You can also search for this author in PubMed Google Scholar
Brooke E. Willborg
View author publications
You can also search for this author in PubMed Google Scholar
Cheryl L. Willman
View author publications
You can also search for this author in PubMed Google Scholar
Feinan Wu
View author publications
You can also search for this author in PubMed Google Scholar
Soheil Meshinchi
View author publications
You can also search for this author in PubMed Google Scholar
Derek L. Stirewalt
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Concept: ELPA, XH, MO and DLS; Resources: ELPA, XH, MO, FRA, TRC, HPE, MPF, ICJ, MF, SCL, AM, JN, JPR, JLS, BEW, CLW, FW, SM and DLS; Data curation: ELPA, XH, MO, MPF, ICJ, MF, AM, MF, JN, FW, DLS; Software: XH, MO; Formal analyses: ELPA, XH, MO, MPF, ICS, FW, DLS; Supervision: MO, MPF, SM, DLS; Funding acquisition: MO, SM, DLS; Validation: ELPA, XH, MO, FW, DLS; Investigation: ELPA, JN, BEW, DLS; Visual: ELPA, XH, MO SCL, JLS, FW, DLS; Methodology: ELPA, XH, MO, FW, DLS; Writing original draft: ELPA, XH, MO, FRA, TRC, HPE, MPF, ICJ, MF, SCL, AM, JN, JPR, JLS, BEW, CLW, FW, SM and DLS; Projection administration: ELPA and DLS; Writing, reviewing and editing: ELPA, XH, MO, FRA, TRC, HPE, MPF, ICJ, MF, SCL, AM, JN, JPR, JLS, BEW, CLW, FW, SM and DLS. The authors read and approved the final manuscript.

Corresponding author

Correspondence to Derek L. Stirewalt.

Ethics declarations

Ethics approval and consent to participate

All participants provided written informed consent in compliance with the Declaration of Helsinki, and studies were conducted with the approval of the Fred Hutch Cancer Center Institution Review Board.

Consent for publication

Not Applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Supplemental Methods. Supplemental Figure 1.

Genomic landscape of mutations in various cohorts of patients used for the study. Supplemental Figure 2. Quality Control Assessments of RNAseq Data. Supplemental Figure 3. Principal Component (PC) Analyses of QC Data. Supplemental Figure 4. QQ plots of observed and expected results. Supplemental Figure 5. LSC17 score applied to RNAseq data from bulk MNCs and VLBs^CD34+. Supplemental Table 1. Genetic Sequencing Data. Supplemental Table 2. RNA Sequencing Data. Supplemental Table 3. QRT-PCR Assays. Supplemental Table 4. Characteristics of included and excluded patients. Supplemental Table 5. Comparisons to assess the impact of batch, instrument, and tissue source on transcript profiles. Supplemental Table 6. Expression changes associated with patient characteristics. Supplemental Table 7. Pathways associated with DEGs and patient characteristics. Supplemental Table 8. Expression changes and pathways associated with clinical outcomes. Supplemental Table 9. Prognostic significance of LSC17 in RNA from MNCs and VLBs^CD34+ from the discovery cohort.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Cite this article

Pogosova-Agadjanyan, E.L., Hua, X., Othus, M. et al. Verification of prognostic expression biomarkers is improved by examining enriched leukemic blasts rather than mononuclear cells from acute myeloid leukemia patients. Biomark Res 11, 31 (2023). https://doi.org/10.1186/s40364-023-00461-0

Download citation

Received: 26 October 2022
Accepted: 30 January 2023
Published: 16 March 2023
DOI: https://doi.org/10.1186/s40364-023-00461-0

Verification of prognostic expression biomarkers is improved by examining enriched leukemic blasts rather than mononuclear cells from acute myeloid leukemia patients

Abstract

Background

Methods

Results

Conclusions

Background

Methods

Patient materials

Thawing, fluorescence-activated cell sorting (FACS), and nucleic acid extraction

Identification of genomic mutations

RNA sequencing for transcript biomarkers

Quantitative RT/PCR of transcript biomarkers

Statistical analyses

Biomolecular pathway analyses

Results

Patient characteristics

Impact of batch effect and sequencing instrument on transcriptome

Impact of tissue source on transcriptome

Expression differences between MNCs and VLBsCD34+

Transcript correlation with clinical characteristics

Transcript expression associated with clinical responses

Prognostic significance of the LSC17 in the discovery patients

Validation of prognostic transcripts in MNCs and VLBsCD34+

Discussion

Conclusions

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval and consent to participate

Consent for publication

Competing interests

Additional information

Publisher’s Note

Supplementary Information

Additional file 1: Supplemental Methods. Supplemental Figure 1.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Biomarker Research

Contact us

Expression differences between MNCs and VLBs^CD34+

Validation of prognostic transcripts in MNCs and VLBs^CD34+