T cells orchestrate the adaptive immune response by recognizing antigens presented as
peptide-MHC (pMHC) complexes through the T-cell receptor (TCR). Accurately
predicting, and just as importantly explaining, TCR-pMHC binding is fundamental to
understanding immune recognition and to designing immunotherapies. This project develops a
family of transformer-based models that are both accurate and interpretable.
Motivation
State-of-the-art predictors such as transformer encoder/decoder models achieve strong
performance, but their black-box nature limits mechanistic insight. Our goal is to make
TCR-pMHC models explainable, recovering the structural basis of binding without
sacrificing predictive accuracy.
Interpreting attention
Our flagship post-hoc method quantifies cross-attention interaction to reveal which
residues drive binding, benchmarked against experimentally determined structures:
Building explainability into the model
We also designed ante-hoc explainable model layers, so interpretability is intrinsic to
the architecture rather than added after the fact:
Rational multi-modal prediction
Beyond interpretability, we improved prediction itself with rational multi-modal
transformers that fuse sequence and structural signals. Together with our work on
accelerating the underlying biophysical computations, these efforts form a complete
pipeline from epitope processing to TCR recognition.
Related Publications
This post is written by Jiarui Li, licensed under CC BY-NC 4.0.