C5W3 Trigger Word - Wrong segment. Expected: (7286, 8201) got:(7286, 8194)

narcisap · October 31, 2022, 8:13pm

Hello! I exported the files for the trigger word assignment and tried to run the notebook locally, but it fails with the following error:

Does anyone know how to fix it? Please note that I already submitted the assignment and received maximum score for it.

Thanks!

Juan_Olano · October 31, 2022, 8:26pm

Hi @narcisap , welcome to the community!

Based on the output, I would suspect the “is_overlapping” function implementation. I suggest adding some print commands in these 2 functions (is_overlapping and insert_audio_clip) to keep track of the pointer (start and end).

Please try something like this and let me know how it goes.

Juan

narcisap · October 31, 2022, 9:12pm

Hi Juan! Thanks for the suggestion!

I think it might be more like a problem with the versions of python and/or dependences I use locally. All tests pass on Coursera (I’ve just rechecked), but not on my laptop, including this one for the insert_audio_clip method. My plan is to align my setup with the one on Coursera (downgrade to python 3.7, export the dependencies used on Coursera in a requirements.txt file and use it locally).

Juan_Olano · October 31, 2022, 9:50pm

That makes sense. The versions may be impacting your results. Please keep me in the loop as to how it goes after you replicate the coursera stack.

narcisap · November 1, 2022, 8:28pm

Unfortunately, this approach was not successful. I couldn’t replicate the Coursera stack because of multiple errors I got while trying to install the requirements I exported from the notebook.

I also tried another approach: use python 3.7 and install just a subset of the dependencies (tensorflow 2.3.0, pydub 0.24.0, matplotlib 3.2.1, kiwisolver 1.2.0, Pillow - no version provided because of errors while trying to install the version used on the Coursera stack). With this approach, I get again the error Wrong segment. Expected: (7286, 8201) got:(7286, 8194).

Juan_Olano · November 2, 2022, 1:21am

What happens if you run it directly in Coursera? does it work?

If not: Could you send me a private message with your code? I’d like to take a look at it.

narcisap · November 2, 2022, 6:52pm

It works when I run it directly in Coursera and I received the maximum grade for this assignment.
The issue manifests itself only when I run it locally. I’ve just sent you my code in a private message. Looking forward to hearing back from you and thanks for looking into this!

Juan_Olano · November 2, 2022, 7:32pm

Thanks. I’ll check it out and see if I can spot the issue. Now, since it is working fine in Coursera, we know your code is fine. I’ll review the dependencies.

Felipe_Caballero · April 24, 2023, 9:06pm

@Juan_Olano

This also happens to me (insert_audio_clip locally fails but works on Coursera).

Also, my resulting clip is different.

Since it uses backgrounds and activatesand those are loaded from the OS, I guess the file listing (retrieving) is done in a different order and that creates the mismatch in the assert.

I guess the problem could be solved by sorting the results from os.listdir(*) in the load_raw_audio function (td_utils.py).

Juan_Olano · April 25, 2023, 2:23am

Please let me know if your idea works - thanks @Felipe_Caballero !

Felipe_Caballero · April 25, 2023, 9:47pm

@Juan_Olano it does work. I just tested it by running a cell on Coursera with this code:

os.listdir('./raw_data/' + "activates"), os.listdir('./raw_data/' + "backgrounds"), os.listdir('./raw_data/' + "negatives")

which returns:

(['3_act2.wav',
  '2_act3.wav',
  '1_act2.wav',
  '2.wav',
  '3_act3.wav',
  '4_act2.wav',
  '1.wav',
  '1_act3.wav',
  '2_act2.wav',
  '3.wav'],
 ['2.wav', '1.wav'],
 ['4.wav',
  '3_2.wav',
  '1_0.wav',
  '2.wav',
  '2_1.wav',
  '5.wav',
  '5_1.wav',
  '1.wav',
  '3.wav',
  '4_0.wav'])

I then changed on my computer the result from os.listdir to this list and all asserts pass. That means that the order returned by os.listdir differs between my computer and Coursera.

So, the solution is to change the function load_raw_audio (on td_utils.py) to the following (notice that each os.listdir is enclosed by a sorted as sorted(os.listdir(***))):

# Load raw audio files for speech synthesis
def load_raw_audio(path):
    activates = []
    backgrounds = []
    negatives = []
    for filename in sorted(os.listdir(path + "activates")):
        if filename.endswith("wav"):
            activate = AudioSegment.from_wav(path + "activates/" + filename)
            activates.append(activate)
    for filename in sorted(os.listdir(path + "backgrounds")):
        if filename.endswith("wav"):
            background = AudioSegment.from_wav(path + "backgrounds/" + filename)
            backgrounds.append(background)
    for filename in sorted(os.listdir(path + "negatives")):
        if filename.endswith("wav"):
            negative = AudioSegment.from_wav(path + "negatives/" + filename)
            negatives.append(negative)
    return activates, negatives, backgrounds

After doing this change you need to fix the asserts according to what you want to test. For instance in the the function def insert_audio_clip_test(target): (on notebook), you would need to change background[0] to background[1] and activates[0] to activates[7]. Also you should check other tests using an assert and some of the files created by the os.listdir.

It’s a very easy fix, hope you can implement it on production.

Juan_Olano · April 26, 2023, 12:35am

Thanks @Felipe_Caballero for getting back on this! Very clear. I’ll pass this information to the group in charge of this.

Thanks a lot!

Juan

Topic		Replies	Views
Error in c5_w3 - 2nd notebook ('trigger word deetection'exercise #4 Sequence Models	2	250	December 26, 2023
C5W3 - Trigger Word Detection - Code Does not Compile Sequence Models	6	625	April 24, 2023
C5W3A2 Trigger Word Detection is broken Sequence Models	2	521	December 22, 2021
C5W3 Trigger Word Detection (create_training_example_test) Sequence Models	3	504	May 19, 2023
C5 W3 A3 Trigger Word Detection Sequence Models	4	837	August 23, 2021

C5W3 Trigger Word - Wrong segment. Expected: (7286, 8201) got:(7286, 8194)

Related topics