1. Background
Motif Switch Definition
Motif switching, another relevant finding, revealed that there were a significant number of nonamer positions in the proteome where fitness change of one or more amino acids, such as through mutations, changed the incidence of a given nonamer sequence across its overlapping positions, resulting in a sequence rank change, and thus, a motif change (Abd Raman, H., et al., 2020)
The rapid mutation of viruses and the limited characterization of the composition and incidence of the variant population are major obstacles to the development of an effective vaccines. A continuing goal towards this is a greater understanding of proteome sequence diversity of viruses in the context of the immune system (antigenic diversity), the dynamics of mutation, and effective strategies to overcome the diversity for vaccine design.
Analysis of the aligned overlapping k-mer positions of viruses allow us to dissect the sequence diversity, each overlapping k-mer position is quantitatively analyzed for four patterns of sequence diversity motifs: (1) “index”, the most prevalent sequence; (2) “major” variant, the most common variant sequence; (3) “minor” variants, multiple different sequences, each with an incidence less than that of the major variant; and (4) “unique” variants, each observed only once in given position.
Below is the illustration for the sequence motifs:
1.1. Index Switching
The dominance of an index is challenged by its variants, where fitness change of one or more amino acids, such as through mutations, changes the incidence of a given k-mer sequence across its overlapping positions, resulting in a sequence rank change, and thus, a motif change, such as dominance switch between the major variant and the prevailing index. The figure below illustrates this phenomenon.
1.2. Motif Switches
Besides index switch, we also observed switches involving the other motifs, which we demonstrated in our H5N1 paper and thus termed this broader observation as motif switching. We also witnessed that this was more complex than we had imagined as demonstrated by the illustration below.