I am creating this topic because I would like to discuss the use of LSTM cells inside a GAN.
In the DeepLearning specialization, we learnt about LSTM cells and how to train them to generate nice dinosaur names. This actually works pretty well!
However, that training relies on using the same word as both input and output for the LSTM cell (shifted by one position).
This aspect of the training cannot be reproduced in a GAN, because that's not how GANs are trained: the generator never sees the real data directly, it only gets feedback through the discriminator.
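For reference, the shifted input/target setup from the specialization can be sketched like this (a minimal pure-Python illustration; the function name and vocabulary are mine, not from the course code):

```python
def make_shifted_pair(word, char_to_idx):
    """Build (input, target) index sequences for next-character training.

    The target is the input shifted left by one position, so at each
    time step the LSTM is trained to predict the next character.
    """
    indices = [char_to_idx[c] for c in word]
    x = indices[:-1]  # input: every character except the last
    y = indices[1:]   # target: every character except the first
    return x, y

vocab = sorted(set("tyrannosaurus"))
char_to_idx = {c: i for i, c in enumerate(vocab)}
x, y = make_shifted_pair("tyrannosaurus", char_to_idx)
```

The point is that `y` is fully determined by the real training word, which is exactly the supervision signal a GAN generator does not have access to.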
I have tried to use an LSTM layer in a classic GAN architecture, but I do not get satisfying results (yet).
As a reminder, when working at the character level, each character in a word (or sentence) is one-hot encoded. I'm worried that this one-hot representation makes it harder for the LSTM cell to learn during GAN training.
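For concreteness, here is what that character-level one-hot encoding looks like (a minimal numpy sketch; the lowercase-alphabet vocabulary is just an example):

```python
import numpy as np

def one_hot_encode(word, char_to_idx):
    """Encode a word as a (sequence_length, vocab_size) one-hot matrix."""
    vocab_size = len(char_to_idx)
    encoded = np.zeros((len(word), vocab_size), dtype=np.float32)
    for t, c in enumerate(word):
        encoded[t, char_to_idx[c]] = 1.0
    return encoded

char_to_idx = {c: i for i, c in enumerate("abcdefghijklmnopqrstuvwxyz")}
x = one_hot_encode("rex", char_to_idx)
# x has shape (3, 26), with exactly one 1.0 per row
```

Each row is a sparse, discrete vector, which is part of what makes text generation with GANs awkward: the generator outputs continuous values, but the real data lives on these one-hot corners.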
To clarify, I also checked existing implementations, but none proved relevant or usable.
Do you have any comments or ideas I could follow?
I can provide the following article as an example: An LSTM Based Generative Adversarial Architecture for Robotic Calligraphy Learning System
The LSTM cells serve a similar purpose when used as building blocks in GANs.
Thanks for the link @cvetko.tim
I’ve been able to implement both GAN and WGAN with LSTM cells.
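For context, the architectures are roughly like this minimal PyTorch sketch (dimensions and names are illustrative, not my exact code; the generator emits softmax probabilities rather than hard one-hot samples so gradients can flow back to it):

```python
import torch
import torch.nn as nn

class Generator(nn.Module):
    """Map a noise sequence to a sequence of per-character probabilities."""
    def __init__(self, noise_dim, hidden_dim, vocab_size):
        super().__init__()
        self.lstm = nn.LSTM(noise_dim, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, vocab_size)

    def forward(self, z):
        h, _ = self.lstm(z)                         # (batch, seq_len, hidden_dim)
        return torch.softmax(self.out(h), dim=-1)   # "soft" one-hot outputs

class Discriminator(nn.Module):
    """Score a (soft) one-hot sequence; raw output, so it works for WGAN too."""
    def __init__(self, vocab_size, hidden_dim):
        super().__init__()
        self.lstm = nn.LSTM(vocab_size, hidden_dim, batch_first=True)
        self.out = nn.Linear(hidden_dim, 1)

    def forward(self, x):
        _, (h, _) = self.lstm(x)    # h: (num_layers, batch, hidden_dim)
        return self.out(h[-1])      # (batch, 1) raw score

z = torch.randn(8, 12, 16)          # batch of 8 noise sequences, length 12
g = Generator(noise_dim=16, hidden_dim=32, vocab_size=26)
d = Discriminator(vocab_size=26, hidden_dim=32)
scores = d(g(z))                    # shape (8, 1)
```

The softmax relaxation is one common workaround for the non-differentiability of sampling discrete characters; it is also a suspect when the discriminator learns to tell soft outputs from hard one-hot real data.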
I've used simple architectures for the discriminator and generator.
So far, the plain GAN shows some small signs of learning, while a standard LSTM model (as used for music generation in another specialization) learns much faster.
I am wondering if I should implement a custom layer or something.
No problem. What is it exactly that makes you think of implementing your own layer? Can you tell me what kind of task you are trying to solve?
@cvetko.tim I am trying to make a GAN learn patterns in a dataset I have.
Each element is a string containing recurring patterns, so LSTM cells can learn them fairly well on their own.
However, the GAN is quite unstable, even after switching to WGAN.