I understand how to get C matrix and how to do the backward process. I can’t get the D Matrix. Can you provide a numerical example please?

My confusion comes from the example you have in the Backward Pass. I don’t understand why the max probability from the previous step would be different (hence a different index) among the cells in the next column.

For example, in last column of C the max prob is in the first cell. So we go to the first cell in the last column of D and see that the max prob in the previous column is in row 3. and so on.

But in row 3 of the last column of D we see that the max prob came from row 4 of the previous column. I would expect the index to be the same as the previous example, since the max prob in the previous column is in row 3.

Hope this helps