Question 1

What makes a model "pan-allele"?

Accepted Answer

Instead of training a separate predictor for each HLA allele, a pan-allele model learns from a sequence representation of the HLA molecule itself (for NetMHCpan, a pseudosequence of the binding-groove residues). This lets it predict binding for any HLA of known sequence, including alleles with little or no experimental data.
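To make the idea concrete, here is a minimal sketch of how an allele can become part of the model input rather than selecting a separate model. It assumes a plain one-hot encoding, and the pseudosequences and allele names are made up for illustration; they are not real NetMHCpan pseudosequences, and this is not NetMHCpan's actual encoding.

```python
# Sketch only: one-hot encoding is an assumption, not NetMHCpan's real input scheme.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

def one_hot(seq):
    """Return a flat one-hot vector with 20 values per residue.

    Raises ValueError if seq contains a character outside AMINO_ACIDS.
    """
    vec = []
    for aa in seq:
        row = [0] * len(AMINO_ACIDS)
        row[AMINO_ACIDS.index(aa)] = 1
        vec.extend(row)
    return vec

def encode_pair(pseudoseq, peptide):
    """Concatenate the allele encoding and the peptide encoding into one input."""
    return one_hot(pseudoseq) + one_hot(peptide)

# Hypothetical toy pseudosequences (shortened, invented for this example).
TOY_PSEUDOSEQS = {
    "allele_A": "YFAMYQENMA",
    "allele_B": "YYAMYRNNVA",
}

peptide = "SIINFEKL"
features = {
    name: encode_pair(pseudoseq, peptide)
    for name, pseudoseq in TOY_PSEUDOSEQS.items()
}
```

The same peptide yields a different input vector for each allele, so a single model can be queried for any allele whose sequence is known, including one it never saw in training.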

Question 2

Why does pan-allele prediction matter for HLA diversity and equity?

Accepted Answer

HLA is extremely polymorphic, and allele frequencies vary by ancestry. Allele-specific models cover only well-studied (often European-skewed) alleles, so pan-allele models are essential for predicting binding to rare and non-European alleles. That lets neoantigen design work across diverse patient populations rather than a narrow subset.

Question 3

Are pan-allele predictions equally accurate for all alleles?

Accepted Answer

No. Accuracy is best when the query allele is similar to well-characterized alleles in the training data and degrades for sequence-distant or poorly represented alleles. Independent evaluations have reported reduced performance on non-European alleles absent from training, so per-allele validation matters.
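One rough way to flag alleles that may be poorly covered is to measure how far the query allele sits from the nearest allele with training data. The sketch below uses Hamming distance between equal-length pseudosequences; the distance measure, the function names, and the toy sequences are all assumptions for illustration, not a method taken from any specific tool. A small distance does not guarantee accuracy, so this complements per-allele validation rather than replacing it.

```python
# Sketch only: Hamming distance is an assumed, simple proxy for allele similarity.

def hamming(a, b):
    """Count positions at which two equal-length sequences differ."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(x != y for x, y in zip(a, b))

def nearest_training_allele(query, training):
    """Return (name, distance) of the training allele closest to the query."""
    name = min(training, key=lambda n: hamming(query, training[n]))
    return name, hamming(query, training[name])

# Hypothetical toy pseudosequences, invented for this example.
TRAINING = {
    "train_1": "YFAMYQENMA",
    "train_2": "YYAMYRNNVA",
}

query = "YFAMYQENVA"
nearest, distance = nearest_training_allele(query, TRAINING)
print(nearest, distance)  # train_1 1
```

A query that differs from every training allele at many positions is a candidate for extra scrutiny before its predictions are trusted.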
