Why don't we delete the output layer, setting the activation function there to be linear, in order to reduce error by delivering z directly to model.compile?
We still need the output layer weights.
The output layer in a multiclass classification network plays a critical role: it transforms the learned features from the hidden layers into the final predictions (class probabilities). Its weights define the mapping from features to class scores, and they are trainable parameters learned alongside the rest of the network. Deleting the layer would delete that mapping entirely.
As for passing the raw logits (denoted by z) directly to model.compile for training: in classification tasks, raw logits must still be transformed into probabilities by a softmax or sigmoid before they can be compared against labels by a loss such as categorical cross-entropy. Without that transformation, the loss is computed on unbounded, poorly scaled outputs, which can produce large gradients and make it difficult for the optimizer to converge.
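Here is a minimal sketch of what the "linear output + raw logits" setup actually looks like in Keras (the layer sizes, 20 input features and 10 classes, are hypothetical). Note that the Dense output layer and its weights remain; only its activation becomes linear, and the softmax transformation is then applied inside the loss via `from_logits=True` rather than removed:

```python
from tensorflow import keras
from tensorflow.keras import layers

# Hypothetical shapes: 20 input features, 10 output classes.
model = keras.Sequential([
    keras.Input(shape=(20,)),
    layers.Dense(64, activation="relu"),
    # The output layer stays: its weights map hidden features to one
    # score (logit) per class. Only the activation is linear now.
    layers.Dense(10, activation="linear"),
])

# from_logits=True tells the loss to apply the softmax internally,
# so the probability transformation still happens, just inside the
# loss computation instead of inside a softmax output layer.
model.compile(
    optimizer="adam",
    loss=keras.losses.SparseCategoricalCrossentropy(from_logits=True),
    metrics=["accuracy"],
)
```

At inference time the model then outputs logits, so you would apply a softmax to them yourself to recover class probabilities.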