Ungraded Lab: KeyError: 'COLAB_TPU_ADDR'

When I execute

tpu_grpc_url = "grpc://" + os.environ["COLAB_TPU_ADDR"]

I get this error message:

KeyError: 'COLAB_TPU_ADDR'

It looks like there is no 'COLAB_TPU_ADDR' environment variable.
The hardware accelerator is set to TPU.
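For context, `os.environ[...]` raises `KeyError` whenever the variable is unset, which is exactly this error. A minimal stdlib-only sketch of a safer probe (the helper name is just for illustration, not part of the lab code):

```python
import os

def tpu_grpc_url():
    """Return the TPU gRPC URL, or None when COLAB_TPU_ADDR is unset."""
    # .get() returns None for a missing key instead of raising KeyError
    addr = os.environ.get("COLAB_TPU_ADDR")
    return None if addr is None else "grpc://" + addr

# On a runtime where the variable is not set, this returns None
# rather than raising KeyError:
print(tpu_grpc_url())
```

This doesn't fix the underlying problem (the variable simply isn't set on newer Colab TPU runtimes), but it shows why the lab code fails at that line.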

Any ideas?

Thanks

Try the Connect option in the upper right-hand corner, below Comment: click on the arrow and select a runtime that has TPU in it. Maybe this fixes your problem.

I have tried this approach, but the error remains.
Please take a look at the traceback:

KeyError                                  Traceback (most recent call last)
<ipython-input-6-e93ad5be9dcf> in <cell line: 1>()
----> 1 tpu_grpc_url = "grpc://" + os.environ["COLAB_TPU_ADDR"]
      2 tpu_cluster_resolver = tf.distribute.cluster_resolver.TPUClusterResolver(tpu_grpc_url)
      3 tf.config.experimental_connect_to_cluster(tpu_cluster_resolver)
      4 tf.tpu.experimental.initialize_tpu_system(tpu_cluster_resolver)
      5 strategy = tf.distribute.experimental.TPUStrategy(tpu_cluster_resolver)

/usr/lib/python3.10/os.py in __getitem__(self, key)
    678         except KeyError:
    679             # raise KeyError with the original key value
--> 680             raise KeyError(key) from None
    681         return self.decodevalue(value)
    682 

KeyError: 'COLAB_TPU_ADDR'

Thanks in advance,
Allan Freitas

Hello,

Which lab is this?

Have you tried resetting the kernel and re-running the lab?

@allansdefreitas, it looks like the code for that lab is a little out of date. You no longer need to use "COLAB_TPU_ADDR" to connect to the TPU.

Also, you need to choose the TPU v2 runtime from the Edit/Notebook settings menu. It is hard to get free TPU time on Colab these days, so you may have to try multiple times and/or at different times of day.

I will put in a request to the staff to update the code to work with the current Colab backend, but in the meantime, you can comment out these two lines of code:

tpu_grpc_url = "grpc://" + os.environ["COLAB_TPU_ADDR"]
tpu_cluster_resolver = tf.distribute.cluster_resolver.TPUClusterResolver(tpu_grpc_url)

and replace them with these lines:

try:
  tpu_cluster_resolver = tf.distribute.cluster_resolver.TPUClusterResolver()  # TPU detection
  print(f'Running on a TPU w/{tpu_cluster_resolver.num_accelerators()["TPU"]} cores')
except ValueError:
  raise BaseException('ERROR: Not connected to a TPU runtime; please make sure you have successfully chosen TPU runtime from the Edit/Notebook settings menu')

If you successfully select the TPU runtime from the Edit/Notebook settings menu, this new code will connect you to the TPU and print a message telling you how many cores your TPU has; otherwise it will raise an error saying it could not connect to a TPU runtime.

To make your code even more up-to-date, you can also change the "strategy = ..." line to remove the ".experimental", like this:

strategy = tf.distribute.TPUStrategy(tpu_cluster_resolver) 

This isn't strictly necessary, but it will avoid a warning message saying you no longer need to use .experimental.
