Hi! I’ve loaded two datasets of sentences and labels from txt files with TextLineDataset and vectorized them with .map. Now I need to pass this data to model.fit, and I’ve found that I can’t pass the datasets directly to the model as x and y. I either need to call .as_numpy_iterator() or convert everything into a dataset of the form ((all_first_dataset_elements), (all_second_dataset_elements)), which is not an easy task. In my opinion both options are a bit ugly. I’m sure there must be an easier, more logical way to use built-in datasets. Can you give me an idea of how it’s supposed to be done?
See `tf.data.Dataset.zip`.
I know about it. I can even say that I have already zipped the x, y data elementwise. How should that help?
What do you mean?
You can pass a dataset directly to a model. See `x` in the *Args* section of `fit`.
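For example, here’s a minimal sketch with toy stand-ins for your vectorized datasets (the data, shapes, and sizes are illustrative, not taken from your code):

```python
import tensorflow as tf

# Two element-wise aligned datasets standing in for the vectorized
# sentence and tag datasets from the question.
xs = tf.data.Dataset.from_tensor_slices(tf.random.uniform((8, 4)))
ys = tf.data.Dataset.from_tensor_slices(
    tf.one_hot(tf.random.uniform((8,), maxval=3, dtype=tf.int32), 3))

# Zip them into (x, y) pairs and batch; fit consumes the dataset directly,
# with no as_numpy_iterator() needed.
ds = tf.data.Dataset.zip((xs, ys)).batch(2)

model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(4,)),
    tf.keras.layers.Dense(3, activation="softmax"),
])
model.compile(optimizer="adam", loss="categorical_crossentropy")
model.fit(ds, epochs=1, verbose=0)
```

When the dataset yields `(inputs, targets)` tuples like this, `fit` takes it as `x` alone and you must not pass a separate `y`.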
I don’t know; I get an error like this:
```python
model = keras.Sequential([
    tf.keras.layers.Input(shape=(None, length_with_padding)),
    tf.keras.layers.Dense(9),
    tf.keras.layers.Softmax()
])
model.compile(
    optimizer='adam',
    loss='categorical_crossentropy',
    metrics=['accuracy'])
model.fit(tf.data.Dataset.zip(test_sentences_vec, test_tags_vec), epochs=3)
```
```
ValueError: Exception encountered when calling layer 'sequential_3' (type Sequential).

Input 0 of layer "dense_3" is incompatible with the layer: expected min_ndim=2, found ndim=1. Full shape received: (None,)

Call arguments received by layer 'sequential_3' (type Sequential):
  • inputs=tf.Tensor(shape=(None,), dtype=int64)
  • training=True
  • mask=None
```
```
>>> list(test_sentences_vec.take(10).as_numpy_iterator())
[array([1]),
 array([1]),
 array([1]),
 array([1]),
 array([ 2186, 22658,     1,  3508,  1032,  3173,  1516,     1,  1506,     1]),
 array([    1, 25050,  1516, 27993]),
 array([   1, 1075]),
 array([1]),
 array([1]),
 array([1])]
```
This trace shows the problem. Two things to notice:
- The model requires you to provide a 3D tensor as each input, since `input_shape` has 2 dimensions.
- Padding should be done before training the model.
Here’s an example of a 3D post-padded input with a batch size of 2:

```
array([[[    1],
        [    0],
        [    0],
        [    0],
        [    0],
        [    0],
        [    0],
        [    0],
        [    0],
        [    0]],

       [[ 2186],
        [22658],
        [    1],
        [ 3508],
        [ 1032],
        [ 3173],
        [ 1516],
        [    1],
        [ 1506],
        [    1]]])
```
Since you’re dealing with text data, you should check the following:
- Is it correct to pass a 3D input to the model?
- Since the numbers in the sequence stand for token IDs and don’t mean anything by themselves, are you missing a step in processing the inputs before any Dense layers?
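On the padding point, here’s a minimal sketch using `tf.data.Dataset.padded_batch` (the toy sequences are modeled on the ones printed above; lengths and values are illustrative):

```python
import tensorflow as tf

# Variable-length integer sequences, like the vectorized sentences above.
seqs = tf.data.Dataset.from_generator(
    lambda: iter([[1], [2186, 22658, 1, 3508], [1, 1075]]),
    output_signature=tf.TensorSpec(shape=(None,), dtype=tf.int64),
)

# Pad every sequence to a fixed length of 10 while batching.
fixed = seqs.padded_batch(
    3, padded_shapes=(10,), padding_values=tf.constant(0, tf.int64))
print(next(iter(fixed)).shape)      # (3, 10)

# With the default padded_shapes, each batch is padded only to the
# longest sequence in that batch, not to a global fixed length.
per_batch = seqs.padded_batch(3)
print(next(iter(per_batch)).shape)  # (3, 4)
```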
hmm, ok, thanks. Will try to check padding.
3D input? What do you mean? Where is it?
And can you help me: is there anything in TensorFlow for generating padding individually per batch, like what I asked about here?
As far as the 3D input is concerned, this line from your model is what makes it expect 3D inputs:

```python
tf.keras.layers.Input(shape=(None, length_with_padding)),
```

For a 2D input, the correct line looks like this:

```python
tf.keras.layers.Input(shape=(length_with_padding,)),
```

Remember 2 things when specifying the input shape to the model:
- Pass the shape of each sample, excluding the batch dimension.
- `None` means that you don’t know how many timesteps will be processed during a single forward pass.

Please complete courses 3 and 4 to better understand the need for a 3D input.
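Here’s a minimal sketch built around the corrected 2D input line (the `Embedding` layer and the sizes are my assumptions, added to address the token-ID point above — token IDs usually go through an embedding before any Dense layers):

```python
import tensorflow as tf

length_with_padding = 10  # illustrative value
vocab_size = 30000        # illustrative value

# Each sample is a 1D padded sequence of token IDs; the Embedding layer
# turns the IDs into dense vectors, and Dense(9) then produces 9 scores
# per token.
model = tf.keras.Sequential([
    tf.keras.layers.Input(shape=(length_with_padding,)),
    tf.keras.layers.Embedding(vocab_size, 16),
    tf.keras.layers.Dense(9),
    tf.keras.layers.Softmax(),
])
print(model.output_shape)  # (None, 10, 9)
```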
I’m not a mentor for NLP and so have added a few mentors on the other topic.
Is this not a 2D input? `tf.keras.layers.Input(shape=(None, length_with_padding)),`
Why is the syntax so strange in this case? It really looks more like 2D.
Here’s what `shape=(None, length_with_padding)` means: `None` means that you don’t know how many timesteps the model is going to process, but each timestep has `length_with_padding` features.
Since you have a background in deep learning, I’m assuming you understand how RNN layers work, what the RNN outputs are, and why `None` makes the model more flexible.
I’m just talking about the syntax. Usually 3 dimensions are defined as 3 numbers inside the shape brackets, as I remember.
TensorFlow always skips the batch dimension and requires you to specify the shape of one input sample. If this is news to you, please go back to the other notebooks and review the `input_shape` parameter.
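A quick illustration: `shape=(3,)` describes one sample, and Keras prepends the batch axis itself, so the model’s full input shape becomes `(None, 3)`:

```python
import tensorflow as tf

# The Input shape describes a single sample; the leading None in
# model.input_shape is the batch dimension Keras adds automatically.
inp = tf.keras.layers.Input(shape=(3,))
model = tf.keras.Model(inp, tf.keras.layers.Dense(2)(inp))
print(model.input_shape)  # (None, 3)
```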