Great question!
dAL is actually shorthand for dL/dAL, the derivative of the loss L with respect to the output activation AL. The loss function L is calculated for a single training example, whereas the cost function J is the average of the losses over all m training examples.
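To make the distinction concrete, here is a minimal sketch in plain Python. It assumes the binary cross-entropy loss with a sigmoid output layer, which is my assumption about the setup in question; the function names and the sample values are made up for illustration.

```python
import math

def loss(a, y):
    """Cross-entropy loss L for ONE example (a = prediction AL, y = label)."""
    return -(y * math.log(a) + (1 - y) * math.log(1 - a))

def d_loss_d_a(a, y):
    """dAL = dL/dAL for ONE example: derivative of the loss above w.r.t. a."""
    return -(y / a - (1 - y) / (1 - a))

def cost(AL, Y):
    """Cost J: average of the per-example losses over all m examples."""
    m = len(AL)
    return sum(loss(a, y) for a, y in zip(AL, Y)) / m

# Hypothetical predictions and labels for m = 3 examples
AL = [0.8, 0.1, 0.6]
Y = [1, 0, 1]

dAL = [d_loss_d_a(a, y) for a, y in zip(AL, Y)]  # one entry per example, no 1/m
J = cost(AL, Y)                                  # a single number, averaged over m
```

Note that dAL has one entry per training example and contains no 1/m factor. The averaging over m only enters through J.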
I advise you to check the video Forward and Backward Propagation in Week 4 again:
Around the 8-minute mark, Andrew shows how dL/dAL fits into the bigger picture:
