Explanation for derived gradients for LSTM back-prop

it depends on what you define as ‘x’.