Comparative Genomics
The analysis of genomic differences between two or more species to understand evolutionary processes, identify conserved regions, and predict gene function.
📜
The statement of the theorem
Let and be two genomic sequences. An alignment is a set of pairs of sequences of equal length , derived from and by introducing gaps. The objective is to find the optimal alignment that maximizes the total score : where is the substitution matrix score (e.g., BLOSUM) for matching characters, and is the gap penalty function, typically defined as . This maximization is solved using dynamic programming (e.g., Needleman-Wunsch or Smith-Waterman algorithms).
Source: Wikipedia