C2W3 Assignment (MLOPS)

C2W3 assignment - getting the UnicodeDecodeError at Exercise 2

Instantiate ExampleGen with the input CSV dataset

example_gen = tfx.components.CsvExampleGen(input_base=PIPELINE_DIR)

Run the component using the InteractiveContext instance

context.run(example_gen)

Do I have to pass an additional argument to set the Unicode type and resolve this?

Error quoted below

UnicodeDecodeError Traceback (most recent call last)
in
7
8 # Run the component using the InteractiveContext instance
----> 9 context.run(example_gen)
10
11 ### END CODE HERE

UnicodeDecodeError: ‘utf-8’ codec can’t decode byte 0xf1 in position 99: invalid continuation byte

Is PIPELINE_DIR the folder containing csv dataset?

Yes. That’s the folder

That is incorrect. PIPELINE_DIR is the location where the artifacts are dumped. In which directory did you store the input csv for this entire pipeline?