C3 Week2 Quiz: Autonomous Driving

Elemento · February 21, 2023, 7:11am

Hey @Zihan_ZHU,
One of the easiest ways to gauge the complexity of a dataset is to subject it to human evaluation and take the “human-level error” as a measure of that complexity. However, this measure becomes very hard to obtain for structured (tabular) datasets with numerous features, since you can’t realistically subject such datasets to human evaluation.

For these cases, in my opinion, the overlap between features is another thing that can indicate the complexity of a dataset. If the feature distributions for different classes overlap heavily and differ only slightly, a model will find it difficult to learn these nuances, compared to the case where the feature distributions are completely distinct across classes. For more information, you can refer to this query, which deals with this exact issue. Let us know if this helps.
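
To make the overlap idea a bit more concrete, here is a rough sketch of how one might score it for a single feature and two classes. It uses a histogram-based overlap coefficient (1 means the two class distributions look identical, 0 means they don't overlap at all). The function name `feature_overlap`, the bin count, and the synthetic Gaussian data are all my own assumptions for illustration, not something from the course or the quiz.

```python
import numpy as np

def feature_overlap(x_a, x_b, bins=30):
    """Histogram overlap coefficient in [0, 1] for one feature across two classes.

    Hypothetical helper for illustration; the bin count is an arbitrary choice.
    """
    x_a = np.asarray(x_a, dtype=float)
    x_b = np.asarray(x_b, dtype=float)
    lo = min(x_a.min(), x_b.min())
    hi = max(x_a.max(), x_b.max())
    if lo == hi:
        # Every value is identical, so the classes are indistinguishable here.
        return 1.0
    # Shared bin edges so the two histograms are directly comparable.
    edges = np.linspace(lo, hi, bins + 1)
    p, _ = np.histogram(x_a, bins=edges)
    q, _ = np.histogram(x_b, bins=edges)
    p = p / p.sum()
    q = q / q.sum()
    # Sum of the bin-wise minimum of the two normalized histograms.
    return float(np.minimum(p, q).sum())

# Synthetic example (assumed data): one feature, two classes.
rng = np.random.default_rng(0)
class_0 = rng.normal(0.0, 1.0, 5000)
well_separated = feature_overlap(class_0, rng.normal(6.0, 1.0, 5000))
heavily_overlapping = feature_overlap(class_0, rng.normal(0.3, 1.0, 5000))
print(f"well separated: {well_separated:.3f}")
print(f"heavily overlapping: {heavily_overlapping:.3f}")
```

Keep in mind that this only looks at one feature at a time. Classes can overlap heavily on every individual feature and still be separable when the features are considered jointly, so a high score here is a hint about difficulty, not a proof.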

Cheers,
Elemento
