I recently answered a similar question explaining the shapes of Tensors that pass through the model:
To answer your question directly:
Dense layer projects LSTM output to as many outputs as you need - 17 in this case - 17 probabilities of one category (tag) or the other.