Source of toy_dataset.csv?

Inntr8 · January 22, 2023, 7:24am

Just wondering, can you comment on what the toy_dataset.csv from the last practice lab C3_W2_Lab01_PCA_Visualization_Examples (#Using PCA in Exploratory Data Analysis) is, or how it was generated?

Thanks!

Wendy · January 31, 2023, 1:41am

Hi @Inntr8,
It looks like your question got overlooked somehow. I don’t know anything about the source of the toy_dataset.csv, but I’ll see if I can find someone who does know.

I suspect the numbers are probably randomly generated, but it is interesting that they cluster so well…

lucas.coutinho · January 31, 2023, 12:18pm

Hello @Inntr8 and @Wendy

The dataset was generated using the function make_classification from scikit-learn library. They indeed cluster well because of the way they were generated.

Thanks,
Lucas

Wendy · January 31, 2023, 6:23pm

Nice! Thanks, @lucas.coutinho!

Inntr8 · February 1, 2023, 5:32am

Thank you very much @lucas.coutinho and @Wendy !!

Topic		Replies	Views
What info are you getting from these clusters, please? Unsupervised Learning, Recommenders, Reinforcement week-2	3	348	August 31, 2023
Need CSV files for recommender system Unsupervised Learning, Recommenders, Reinforcement week-2	3	626	February 22, 2023
C3_W2_RecSysNN_Assignment dataset questions Unsupervised Learning, Recommenders, Reinforcement week-2	9	558	February 27, 2023
New Topic: Dimensionality Reduction using Principal Component Analysis (PCA) Unsupervised Learning, Recommenders, Reinforcement week-2	2	521	December 21, 2022
C3_W2_Assignment 2_Content based filtering Unsupervised Learning, Recommenders, Reinforcement week-2	2	388	October 25, 2023

Source of toy_dataset.csv?

Related topics