C2W2 Technical Question

Only one type of file is stored on disk.

As far as TFX is concerned, TFRecord is the storage layer. tf.train.Example is not a second file format: it is a protocol buffer message that is serialized to bytes and written as a record inside a TFRecord file. You can read about it here
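To make the storage layer vs. serialization split concrete, here is a minimal sketch of the TFRecord framing in plain Python, with no TensorFlow involved. Each record is a length, a checksum of the length, the payload bytes, and a checksum of the payload. The function names and the payloads are made up for illustration; in a real TFX artifact each payload would be the bytes of a serialized tf.train.Example.

```python
import io
import struct

_MASK_DELTA = 0xA282EAD8  # constant used by TFRecord's CRC masking


def _make_crc32c_table():
    # CRC-32C (Castagnoli), reflected polynomial 0x82F63B78.
    table = []
    for i in range(256):
        crc = i
        for _ in range(8):
            crc = (crc >> 1) ^ 0x82F63B78 if crc & 1 else crc >> 1
        table.append(crc)
    return table


_TABLE = _make_crc32c_table()


def crc32c(data: bytes) -> int:
    crc = 0xFFFFFFFF
    for byte in data:
        crc = _TABLE[(crc ^ byte) & 0xFF] ^ (crc >> 8)
    return crc ^ 0xFFFFFFFF


def masked_crc(data: bytes) -> int:
    # Rotate right by 15 bits, then add the mask delta (all in 32 bits).
    crc = crc32c(data)
    rotated = ((crc >> 15) | (crc << 17)) & 0xFFFFFFFF
    return (rotated + _MASK_DELTA) & 0xFFFFFFFF


def write_record(stream, payload: bytes) -> None:
    # Layout: uint64 length, uint32 CRC of length, payload, uint32 CRC of payload.
    header = struct.pack("<Q", len(payload))
    stream.write(header)
    stream.write(struct.pack("<I", masked_crc(header)))
    stream.write(payload)
    stream.write(struct.pack("<I", masked_crc(payload)))


def read_records(stream):
    # Yields raw payload bytes; the container never interprets them.
    # A sketch only: a truncated stream will surface as struct.error.
    while True:
        header = stream.read(8)
        if not header:
            return
        (length,) = struct.unpack("<Q", header)
        (length_crc,) = struct.unpack("<I", stream.read(4))
        payload = stream.read(length)
        (payload_crc,) = struct.unpack("<I", stream.read(4))
        if length_crc != masked_crc(header) or payload_crc != masked_crc(payload):
            raise ValueError("corrupt record")
        yield payload


# Placeholder payloads standing in for serialized tf.train.Example bytes.
buffer = io.BytesIO()
write_record(buffer, b"first payload")
write_record(buffer, b"second payload")
buffer.seek(0)
records = list(read_records(buffer))
print(records)
```

The reader hands back opaque bytes. Turning them into features is a separate step (parsing them as tf.train.Example), which is why there is only one kind of file on disk.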

To check this, do the following:

  1. Create a copy of the data folder that contains only 1 record (say data2/).
  2. Run tfx.components.CsvExampleGen on it to generate the records.
  3. Look at the generated records using the get_records function for both Split-train (1 record) and Split-eval (0 records).
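If you want a second check that does not depend on the notebook's helper, the sketch below walks an artifact directory and counts the records in every file it finds. It assumes the split files are gzip-compressed TFRecord files, which is what I would expect ExampleGen to write by default. count_records, summarize_splits and the path at the bottom are hypothetical; in the notebook the directory would probably come from something like example_gen.outputs['examples'].get()[0].uri.

```python
import gzip
import os
import struct


def count_records(path):
    """Count records in one gzip-compressed TFRecord file without decoding them."""
    count = 0
    with gzip.open(path, "rb") as stream:
        while True:
            header = stream.read(8)  # uint64 payload length, little-endian
            if len(header) < 8:
                break
            (length,) = struct.unpack("<Q", header)
            # Skip the length CRC (4 bytes), the payload, and the payload CRC (4 bytes).
            skipped = stream.read(4 + length + 4)
            if len(skipped) < 4 + length + 4:
                break  # truncated file; do not count the partial record
            count += 1
    return count


def summarize_splits(artifact_dir):
    """Map every file under artifact_dir (by relative path) to its record count."""
    summary = {}
    for root, _dirs, files in os.walk(artifact_dir):
        for name in sorted(files):
            path = os.path.join(root, name)
            rel = os.path.relpath(path, artifact_dir)
            try:
                summary[rel] = count_records(path)
            except OSError:
                summary[rel] = None  # not a gzip file, so not counted
    return summary


# Hypothetical location; replace with the URI of your own examples artifact.
for rel_path, n in summarize_splits("pipeline/CsvExampleGen/examples/1").items():
    print(rel_path, n)
```

With the 1-record data2/ folder, the expectation from the steps above is one file under Split-train with 1 record, one under Split-eval with 0 records, and nothing else.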

No other TFRecord artifact will be generated.

You might find the steps to edit metadata useful for making certain cells editable.