Great question!
dAL is actually dL/dAL. The loss function L is calculated for a single training example, whereas the cost function J is the average of the loss over all m training examples.
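To make the distinction concrete, here is a minimal NumPy sketch. It assumes the binary cross-entropy loss with a sigmoid output layer, and the values of AL and Y are made up for illustration. Note that dAL holds one derivative per training example and carries no 1/m factor, while J averages the per-example losses.

```python
import numpy as np

# Hypothetical values: output activations AL and labels Y for m = 3 examples, shape (1, m)
AL = np.array([[0.8, 0.1, 0.6]])
Y = np.array([[1.0, 0.0, 1.0]])
m = Y.shape[1]

# Loss L for each training example (binary cross-entropy, assumed here)
losses = -(Y * np.log(AL) + (1 - Y) * np.log(1 - AL))

# Cost J: the average of the per-example losses
J = np.sum(losses) / m

# dAL = dL/dAL: derivative of each example's loss with respect to its own activation.
# There is no 1/m here because this differentiates L, not J.
dAL = -(Y / AL - (1 - Y) / (1 - AL))

print("losses:", losses)
print("J:", J)
print("dAL:", dAL)
```

In this sketch the averaging over m only appears in J. If the 1/m factor is applied later in backpropagation (for example when computing dW), then dAL itself stays a per-example quantity.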
I advise you to watch the video "Forward and Backward Propagation" in Week 4 again:
Around the 8-minute mark, Andrew shows how dL/dAL fits into the bigger picture: