Programming Assignment - Week 1 - Parse data from file

Syed_Hamza_Tehseen · March 28, 2023, 5:47pm

Hi, can somebody explain what is wrong?? In the markdown cell, it is written that text can be accessed using row[1] and labels can be accessed using row[0] but when I’m coding it, it’s resulting in an index error

ai_curious · March 28, 2023, 9:18pm

If you’re iterating on reader, how many rows of data from the file are you getting for each iteration?

Syed_Hamza_Tehseen · March 28, 2023, 11:17pm

I don’t know because I don’t understand the documentation of this whole function of csv_reader

Juan_Olano · March 28, 2023, 11:34pm

The error is thrown when it calls remove_stopword(row[1]).

Have you reviewed what param are you passing to remove_stopword? what is the value of row[1]? what is remove_stopword expecting what is the dataset used in remove_stopword and see if row[1] makes sense for that dataset?

That is what I would start doing.

ai_curious · March 29, 2023, 12:24am

csv.reader(csvfile, dialect=‘excel’, **fmtparams )¶

Return a reader object which will iterate over lines in the given csvfile.

Each row read from the csv file is returned as a list of strings.

Syed_Hamza_Tehseen · March 29, 2023, 1:06pm

I still don’t fully understand it. What I have gathered is, that the first word of the csv file is ‘label’ and all the other data is ‘text’, and the csv.reader function basically returns each line in the form of a list. Am I correct up until now? If I am, how do I access each line to separate label and text, and then apply the stopword function on text ?

Syed_Hamza_Tehseen · March 29, 2023, 1:14pm

It has worked now. I was making three mistakes:

using the wrong delimeter
not skipping the first line
appending label after using stopword (I don’t know if it is a mistake or not, can you confirm?)

ai_curious · March 29, 2023, 1:26pm

Yes, the first row is going to be the column headers/labels. Sometimes when reading a dot CSV you want these, but for different purposes than the rest of the data. Subsequent rows are the data, each carried as a list of strings, with one element of the list per CSV column. In this case, one string is the label, and the second string is the text. If you were using the wrong delimiter, you probably got a list of strings with a single element, which is why row[1] threw an exception.

Topic		Replies	Views
How to parse_data_from_file - csv.reader Natural Language Processing in TensorFlow week-1	2	550	February 20, 2023
Csv.reader returning only a single 1-character string from iterator Natural Language Processing in TensorFlow week-1	3	394	September 20, 2023
Parse_data_from_file Natural Language Processing in TensorFlow week-1	9	668	August 27, 2022
Parse data from file Natural Language Processing in TensorFlow week-1	6	575	August 15, 2022
Parse_data_from_file error Natural Language Processing in TensorFlow week-1	1	559	July 29, 2022

Programming Assignment - Week 1 - Parse data from file

Related topics