For the following exercises on Probalign, we use an affine gap penalty with g(k)=α+βk=−0.5−0.25k, there temperature T=1 and the similarity function σ(xi,yj):
Compute the Boltzmann-weighted score for the following alignments:
(a) x: --AGCGG (b) x: AGCGG------
||:|| :
y: ACAGGGG y: ----ACAGGGG
S(a)=∑xi∼yj∈aσ(xi,yj)+∑gap penaltieseS(a)T=(∏xi∼yj∈aeσ(xi,yj)T)×e∑gap penaltiesT
For each alignment you only need to calculate ex once.
(a)eσ(A,A)×e3σ(G,G)×eσ(C,G)×eg(2)=e2×e6×e−1×e−0.5+(−0.25×2)=e6(b)eσ(G,A)×eg(4)×eg(6)=e−1×e0.5+(−0.25×4)×e0.5+(−0.25×6)=e−4.5
(a)e6=403.43(b)e−4.5=0.011
Derive the recursion formula for ZIi,j. Allow insertions after deletions and vice versa.
ZIi,j=ZIi,j−1×eβT+ZMi,j−1×eg(1)T+ZDi,j−1×eg(1)T
Compute the partition function Z(T) by dynamic programming for the
sequences x=ACC
and y=AC
. Allow insertions
after deletions and vice versa. In order to simplify the computations,
you can round to two digits after the decimal point. Please be aware
that we used exact numbers for all calculations and rounded in the
end.
Initialization: ZMi,0=ZM0,j=0,ZM0,0=1ZIi,0=0ZD0,j=0
Recursion: ZMi,j=Zi−1,j−1×eσ(xi,yj)TZIi,j=ZIi,j−1×eβT+ZMi,j−1×eg(1)T+ZDi,j−1×eg(1)TZDi,j=ZDi−1,j×eβT+ZMi−1,j×eg(1)T+ZIi−1,j×eg(1)TZi,j=ZMi,j+ZIi,j+ZDi,j
The partition function of the reverse sequences x∗=CCA and y∗=CA is given in the matrix Z∗:
Find a mapping from matrix Z∗k,l to Z′i,j. Which position in matrix Z∗ corresponds to which position in matrix Z′?
Z′i,j is the partition function of the alignment xi...x|x| with yj...y|y|.
Z∗k,l is the partition function of the alignment x|x|...x|x|−k+1 with y|y|...y|y|−l+1.
i=|x|−k+1⇔k=|x|−i+1j=|y|−l+1⇔l=|y|−j+1
Use Z,Z∗ and the mapping from Z∗ to Z′ to compute the probability of the alignment edges (1,1), (2,2), (3,1) and (3,2) between x and y.
P(xi∼yj|x,y)=Zi−1,j−1×eσ(xi,yj)T×Z′i+1,j+1Z(T)ZMi,j=Zi−1,j−1×eσ(xi,yj)T
Mapped positions: Z′2,2⟺Z∗2,1Z′3,3⟺Z∗1,0Z′4,2⟺Z∗0,1Z′4,3⟺Z∗0,0
Alignment edge (1,1): P(x1∼y1|x,y)=Z0,0×eσ(x1,y1)T×Z′2,2Z(T)=Z0,0×eσ(x1,y1)T×Z∗2,1Z(T)=ZM1,1×Z∗2,1Z(T)=7.39×7.4361.90=0.89 Alignment edge (2,2): P(x2∼y2|x,y)=Z1,1×eσ(x2,y2)T×Z′3,3Z(T)=Z1,1×eσ(x2,y2)T×Z∗1,0Z(T)=ZM2,2×Z∗1,0Z(T)=57.90×0.4761.90=0.44 Alignment edge (3,1): P(x3∼y1|x,y)=ZM3,1×Z′4,2Z(T)=ZM3,1×Z∗0,1Z(T)=0.14×0.4761.90=0.001 Alignment edge (3,2): P(x3∼y2|x,y)=ZM3,2×Z′4,3Z(T)=ZM3,1×Z∗0,0Z(T)=30.42×161.90=0.49