There’s only 1 type of file stored on disk.
TFRecord
format is the storage layer as far as tfx is concerned. tf.train.Example
is a serialization mechanism that works with the underlying TFRecord format. You can read about it here
To check this, do the following:
- Create a duplicate of the data folder with 1 record (say data2/).
- Generate the record generation logic using
tfx.components.CsvExampleGen
. - Look at the generated records using the
get_records
function for bothSplit-train
(1 record) andSplit-eval
(0 records).
There’ll be no other tfrecord artifact generated.
You might find steps to edit metadata useful to make certain cells editable.