Oncoscience
Abstract | PDF | Full Text | Supplementary Materials | Supplementary Table 1 | Supplementary Table 2 | Supplementary Table 3 | Supplementary Table 4 | Supplementary Table 5 | Supplementary Table 6 | Supplementary Table 7 | Supplementary Table 8 | How to Cite | Press Release
https://doi.org/10.18632/oncoscience.590
An immunoinformatics assessment of the cancer testis antigen, DDX53, as a potential early esophageal cancer antigen
Peter Cheng1, Konrad J. Cios1, Mallika Varkhedi1, Vayda R. Barker1, Michelle Yeagley1, Andrea Chobrutskiy2, Boris I. Chobrutskiy3 and George Blanck1,4
1Department of Molecular Medicine, Morsani College of Medicine, University of South Florida, Tampa, Florida (FL) 33612, USA
2Department of Pediatrics, Oregon Health and Science University Hospital, Portland, Oregon (OR) 97239, USA
3Department of Internal Medicine, Oregon Health and Science University Hospital, Portland, Oregon (OR) 97239, USA
4Department of Immunology, H. Lee Moffitt Cancer Center and Research Institute, Tampa, Florida (FL) 33612, USA
Correspondence to: George Blanck, email: [email protected]
Keywords: adaptive immune receptor recombinations; complementarity determining region-3; esophageal cancer survival rates; cancer testis antigens
Received: May 20, 2023
Accepted: October 16, 2023
Published: November 10, 2023
Copyright: © 2023 Cheng et al. This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
ABSTRACT
T-lymphocytes have been implicated in facilitating a pro-inflammatory, pro-tumorigenic microenvironment that worsens prognosis for esophageal carcinoma (ESCA). In this study, we identified tumor resident, T-cell receptor (TCR) complementarity determining region-3 (CDR3) amino acid sequences and employed an algorithm particularly suited to the big data setting to evaluate TCR CDR3-cancer testis antigen (CTA) chemical complementarities. Chemical complementarity of the ESCA TCR CDR3s and the cancer testis antigen DDX53 represented a disease-free survival (DFS) distinction, whereby the upper fiftieth percentile complementarity group correlated with worse DFS. The high TCR CDR3-DDX53 complementarity group also represented a greater proportion of tumor samples lacking DDX53 expression. These data and analyses raise the question of whether the TCR CDR3-DDX53 chemical complementarity assessment detected an ESCA immune response that selected for DDX53-negative cells?
INTRODUCTION
Esophageal cancer is the eighth most common cancer and the sixth leading cause of cancer death worldwide [1]. Esophageal squamous cell carcinoma (ESCC) is the predominant histological type and comprises 90% of cases [2], with most of the remaining cases representing esophageal adenocarcinoma. The five-year survival for esophageal cancer remains unfavorable, at 15–25%, despite advancements in early endoscopic detection and multidisciplinary intervention [1, 2]. The presence of an immune response mediated by tumor-infiltrating lymphocytes, however, has been shown to be strongly associated with longer disease-free survival (DFS) in some esophageal cancer patients [2, 3]. Due to their high in vivo immunogenicity and elevated expression in primary esophageal cancer specimens, cancer-testis antigens (CTAs) in particular are under investigation as potential targets for T-lymphocyte based esophageal cancer immunotherapy [4].
On the other hand, while a targeted, T-cell mediated immune response may improve outcomes in some esophageal cancer patients, in a comparatively larger number of patients, T-cell recruitment and their subsequent activation of pro-tumorigenic immune cell populations has been shown to associate with overall poorer prognosis in ESCC [5]. T-lymphocytes are known to recruit and activate tumor-associated macrophages (TAMs) of the M2 pro-inflammatory phenotype to the tumor microenvironment. M2 TAMs subsequently expand myeloid-derived suppressor cells (MDSCs) through the IL-6 pathway, inducing a pro-tumorigenic microenvironment. In ESCC patients, M2 TAM accumulation and high serum levels of IL-6 are both demonstrably present [6].
To better understand the potential positive and negative impacts of T-cells in ESCA, we recovered TCR recombination reads from the TCGA-ESCA whole exome sequence (WXS) files, obtained the TCR CDR3 AA sequences, and assessed the chemical complementarity of those sequences with CTAs, as those antigens have been characterized by many approaches over decades as candidate cancer antigens. Results indicated that a high level of chemical complementarity between tumor resident, TCR CDR3s and the CTA, DDX53, was associated with a worse outcome, in contrast to several previous assessments whereby immune receptor CDR3-candidate antigen chemical complementarities were associated with better outcomes [7–12]. In this ESCA study, the possibility of an immune response that selected for tumor cells lacking the DDX53 CTA is discussed.
RESULTS
TCR CDR3-CTA chemical complementarities associated with reduced DFS probabilities
As noted in Introduction, recent work has indicated opportunities to stratify cancer cases based on the chemical complementarities of the adaptive IR CDR3 AA sequences and candidate cancer antigens, using algorithms particularly designed for a big data setting [7–11, 13] (Methods). Thus, we recovered TRA and TRB recombination reads from the tumor WXS files representing the TCGA-ESCA dataset (Supplementary Table 1), translated the CDR3s represented by productive recombinations, and specifically applied the algorithm of ref. [13] to determine whether the chemical complementarity between the TCGA-ESCA TCR CDR3s and a set of CTAs [14] also represented DFS distinctions. Results indicated an inverse association between Combo complementarity scores (CSs) (Methods) and DFS. Specifically, the upper 50th percentile, tumor-based, TCR CDR3-DDX53 Combo CSs correlated with a lower DFS (Figure 1A). This inverse survival association was also observed for the Combo CSs for the TCR CDR3s and a DDX53 peptide fragment (AA sequence: MNNSVNLRSITYLVIDEADKMLDMEFEPQIRKILLDVRPDRQTVMT SATWPDTVRQLALS) (Figure 1B).
Reduced DDX53 expression in ESCA specimens representing high TCR CDR3-DDX53 Combo CSs
To determine whether DDX53 expression was reduced in ESCA specimens with higher TCR CDR3-DDX53 Combo CSs, we conducted an odds-ratio analysis for “zero DDX53 expression” in the tumor specimens representing the upper and lower 50th percentile TCR CDR3-DDX53 Combo CS groups. Results indicated that the upper 50th percentile Combo CS group represented a greater proportion of ESCA specimens with no expression of DDX53 (Table 1).
Increased DNA methylation of DDX53 in ESCA specimens representing the upper 50th percentile of the TCR CDR3-DDX53 Combo CS group
To consider a mechanism of reduced DDX53 expression in the ESCA specimens with higher TCR CDR3-DDX53 Combo complementarity, we conducted a methylation analysis of the tumor specimens belonging to the upper and lower 50th percentile Combo CS groups. Results indicated that methylation of the DDX53 gene was comparatively increased in the upper 50th percentile Combo CS group (Table 2), although the statistical analysis represented the standard of a trend rather than significance.
EIF2AK3 and POLG expression levels correlated with the TCR CDR3-DDX53 Combo CSs
To determine whether pro-proliferative molecular markers correlated with higher TCR CDR3-DDX53 Combo CSs, due to the correlation of the higher TCR CDR3-DDX53 Combo CSs and worse DFS, we evaluated the RNA-seq values of a series of pro-proliferative effector genes [15] for the tumor specimens corresponding to the CS calculations. Results indicated that EIF2AK3 and POLG expression levels correlated with the TCR CDR3-DDX53 Combo CSs, consistent with a higher proliferation rate for the tumor cells representing the higher CSs and worse DFS probabilities (Figure 2). However, an evaluation of the entire TCGA-ESCA dataset, for a survival distinction based on the expression levels of these two genes, did not reveal a survival distinction (data not shown), consistent with a mechanistic specificity of the TCR CDR3-DDX53 interaction reflecting a worse DFS probability.
DDX53 Combo CS is a DFS marker independent of several other ESCA clinical features
To determine whether the survival distinction represented by DDX53 Combo CSs was independent of clinical variables known to distinguish ESCA outcomes, we performed bivariate analyses of DDX53 Combo complementarity against several of these variables. (There were too few samples for a combined, multivariate analysis.) Results indicated that, in each bivariate comparison, DDX53 Combo CSs remained significant in representing a worse DFS (Table 3).
DISCUSSION
We observed TCR CDR3-DDX53 chemical complementarity inversely (negatively) associating with ESCA DFS, with the potential TCR-DDX53 interaction representing tumor resident, TCR CDR3 AA sequences. Most importantly, tumor specimens with higher TCR CDR3-DDX53 CSs were less likely to express DDX53, consistent with the possibility that the TCR-DDX53 interaction selected against tumor cells expressing DDX53. Additionally, the tumor specimens with higher TCR CDR3-DDX53 CSs represented greater DDX53 gene methylation, consistent with the lower expression level of DDX53 among the high CS samples and also suggesting a mechanism for the loss of DDX53 expression. Overall, these data are consistent with the possibility that TCR-antigen interaction induces selective pressure that results in antigen loss, a process referred to as immuno-editing [16–18], and, eventually, tumor cell escape from an anti-tumor immune response. These data for ESCA are also consistent with similar data obtained for high CS, low survival in the ovarian cancer setting [19].
At the point when CTAs are no longer present, the sustained involvement of T-cells may induce pro-inflammatory, pro-tumorigenic effects through general features of the immune response. In ESCC, it is likely that this process occurs in large part due to the accumulation of M2 TAMs [6]. Further, poor ESCC prognosis is known to associate with the activation and accumulation of such tumorigenic immune cells [5], lending credence to our inference that a vestigial immune response could be deleterious across ESCA subtypes. Finally, if DDX53 expression is selected against in early ESCA tumors, then an assay for DDX53 may be useful as an early screening tool for ESCA.
Noting several limitations of this study, it would be important to attempt to repeat the results using an immune repertoire, PCR-based approach to obtaining tumor TCR recombination reads, to have a more comprehensive collection of productive CDR3s. And, the tentative conclusions here may be further supported with a prospective clinical trial, allowing for a better management of potentially confounding variables.
METHODS
Recovery of the TCGA-ESCA T-cell receptor (TCR) recombination reads
The recovery of the TRA and TRB recombination reads from the TCGA-ESCA exome (WXS) files was performed as described [20–22]. Briefly, the WXS files were searched with a low stringency match to all TCR V- and J-gene segments. The reads in those output files were then searched for a validated V- and J-gene segment on one read, and only reads with a productive complementarity determining region-3 (CDR3) amino acid (AA) sequence were used in this study, i.e., reads with V-J joins that did not have a stop codon or an out-of-frame junction. The original software pipeline used for the extraction of the recombination reads from the WXS files, for this report, is freely available at https://github.com/bchobrut-USF/blanck_group, which includes a readme file; and a containerized version of the code is freely available at https://hub.docker.com/r/bchobrut/vdj, also with a readme file. In addition, an updated version with some refinements is available at https://github.com/kcios/2021. The entire collection of TCR recombination read output data used in this report is available in the supporting online material (SOM) Supplementary Table 1. The WXS files were accessed via NIH dbGaP project approval number 6300. Finally, basic benchmarking of the recovery of adaptive immune receptor recombination reads from genomics files can be reviewed in the following refs. [23–27].
Chemical complementarity scoring
The chemical complementarity scoring for the CDR3-cancer testis antigen combinations was performed as described [13], and can be conducted with a publicly available, original web tool, https://adaptivematch.com/. The web tool includes instructions, and example input files, used in this report, are available in the SOM, Supplementary Tables 2–4. (However, to use Supplementary Tables 2–4 as input for the web tool, those files will have to be re-saved by the user as csv files.) The web tool has been extensively benchmarked in refs. [28, 29], and further employed in refs [30, 31]. Supplementary Table 2 represents CDR3s, Supplementary Table 3 represents CTAs, and Supplementary Table 4 represents DFS data publicly available at cbioportal.org. Briefly, the scoring process involves an alignment of a CDR3-CTA, a calculation of the chemical attractiveness based on the aligned CDR3-CTA, and a one-AA shift, and a re-calculation. The details of the calculation process are in ref [13], but briefly, two AA directly opposite of each other in the alignment, with opposite electrostatic charges would contribute to a high chemical complementarity score, and if those AAs are shifted by one position, the value of their contribution to the score is reduced. A similar standard is applied to a hydrophobicity version of the AA chemical attractiveness, based on Uversky hydropathy values for each AA [11], whereby higher Uversky hydropathy values represent higher hydrophobicity. The best CS is retained for any given CDR3-CTA assessment. A “Combo CS” represents a CS calculation that integrates both electrostatic and Uversky hydropathy AA values. SOM Supplementary Tables 5 and 6 represent the https://adaptivematch.com/ web tool output for SOM Supplementary Tables 2–4 and represent the results of the chemical complementarity scoring algorithm developed in ref [13]. These results allow spot checking of the high and low scores, for example, for electrostatic attractiveness, by examining the alignment in the output (Supplementary Table 6) that represented the best score for a given electrostatic, CDR3-CTA matchup.
DNA methylation analysis of DDX53
DNA methylation array-based beta values representing the extent of site-specific gene methylation were obtained from datasets available at https://gdc.cancer.gov/. The distinction of the DDX53 methylation values for the upper and lower 50th percentile TCR CDR3-DDX53 Combo CSs was evaluated using a two-tailed Student’s T-test (Supplementary Table 7).
Gene expression and follow up, pro-proliferation biomarkers
The expression levels of various genes were assessed with RSEM values from an RNA-seq based dataset available at cbioportal.org. A distinction of the percentage of tumor samples representing zero DDX53 expression in the upper and lower 50th percentile TCR CDR3-DDX53 Combo CSs was evaluated using an odds-ratio test (Supplementary Table 8). Correlation between expression of pro-proliferation biomarkers and TCR CDR3-DDX53 Combo CSs was evaluated using a Pearson’s correlation as further detailed in Results.
Multivariate analysis
A multivariate analysis was conducted using the IBM Statistical package for the social sciences (SPSS), recently renamed as, Statistical Product and Service Solutions. TCR CDR3-DDX53 Combo CSs were compared against patient race, sex, frequency of alcohol consumption, neoplasm histologic grade, presence of columnar metaplasia, and genome alteration fraction, as further detailed in Results.
Abbreviations
AA: amino acid; CDR3: complementarity determining region-3; CS: complementarity score; CTA: cancer testis antigen; dbGaP: database of Genotypes and Phenotypes; DFS: disease-free survival; ESCA: esophageal cancer (TCGA abbreviation for both ESCC and esophageal adenocarcinoma); ESCC: esophageal squamous cell carcinoma; GDC: genomic data commons; HUGO: human genome organization; IR: (adaptive) immune receptor; KM: Kaplan-Meier; MDSC: myeloid-derived suppressor cells; NIH: National Institutes of Health; PCR: polymerase chain reaction; TAM: tumor-associated macrophage; TCGA: the cancer genome atlas; TCR: T-cell receptor; TRA: HUGO symbol for T-cell receptor alpha gene; TRB: HUGO symbol for T-cell receptor beta gene; WXS: whole exome sequence.
Author contributions
PC: Conceptualization; Formal analysis; Methodology; Visualization; Writing - review and editing. KJC: Formal analysis; Methodology; Software. MV: Conceptualization; Formal analysis; Methodology; Software. VRB: Conceptualization; Formal analysis; Methodology; Software. AC: Resources; Formal analysis; Methodology; Software. BIC: Resources; Formal analysis; Methodology; Software. GB: Conceptualization; Methodology; Project administration; Resources; Supervision; Writing - review and editing.
ACKNOWLEDGMENTS
Authors thank USF research computing; and Ms. Corinne Walters for expert administrative contributions to dataset access applications; and the taxpayers of the State of Florida.
Availability of data and materials
Data and materials are provided in the supporting online material tables.
CONFLICTS OF INTEREST
Authors have no conflicts of interest to declare.
Ethical statement and consent
Not applicable (NA). Access to the raw data (Methods) for this study was via NIH dbGaP project approval number 6300.