Transformer Network Application: Question Answering Lab

solarflarefx · August 1, 2021, 5:24pm

I have a question about the TensorFlow and PyTorch code comparison. In TF2, we had two loss functions, and training minimized the average of the two. Why was the approach in PyTorch different? There we used a metric (F1 score) instead. Were we trying to maximize the metric rather than minimize a loss?

TMosh · August 3, 2021, 4:41am

Sorry, I haven’t looked at the labs yet.

But in general, the F1 score is used as a metric when the data set is highly skewed (there are lots of “False” examples and very few “True” examples).

If you judge the model only by its cost on a skewed data set, there is not much incentive for the system to learn to predict the True cases, because it can reach a very low cost by predicting False for everything.

The F1 score, which is the harmonic mean of precision and recall, does a better job of balancing the predictions for both False and True.
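To make the skewed-data point concrete, here is a minimal sketch in plain Python. It is not taken from the lab: the labels are made up (5 True examples out of 100), and the helper names `confusion_counts`, `accuracy`, and `f1_score` are hypothetical. It compares a model that always predicts False with one that finds most of the True cases at the price of a few false alarms.

```python
def confusion_counts(y_true, y_pred):
    """Return (tp, fp, fn, tn) for binary labels encoded as 0/1."""
    tp = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 1)
    fp = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 1)
    fn = sum(1 for t, p in zip(y_true, y_pred) if t == 1 and p == 0)
    tn = sum(1 for t, p in zip(y_true, y_pred) if t == 0 and p == 0)
    return tp, fp, fn, tn


def accuracy(y_true, y_pred):
    """Fraction of examples predicted correctly."""
    correct = sum(1 for t, p in zip(y_true, y_pred) if t == p)
    return correct / len(y_true)


def f1_score(y_true, y_pred):
    """F1 = 2*TP / (2*TP + FP + FN); defined as 0.0 when there are no positives at all."""
    tp, fp, fn, _ = confusion_counts(y_true, y_pred)
    denom = 2 * tp + fp + fn
    return 2 * tp / denom if denom else 0.0


# Made-up skewed data set: 5 True examples followed by 95 False examples.
y_true = [1] * 5 + [0] * 95

# Hypothetical model A: predicts False for everything.
always_false = [0] * 100

# Hypothetical model B: finds 4 of the 5 True cases, with 5 false alarms.
mixed = [1, 1, 1, 1, 0] + [1] * 5 + [0] * 90

for name, y_pred in [("always False", always_false), ("mixed", mixed)]:
    print(f"{name}: accuracy={accuracy(y_true, y_pred):.2f}, "
          f"F1={f1_score(y_true, y_pred):.2f}")
```

In this toy setup the always-False model scores higher on accuracy (0.95 vs 0.94) but gets an F1 of 0, while the second model gets an F1 of about 0.57, which is the imbalance the F1 score is meant to expose.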
