Object tracking model to pixelate people

Hi, I'm currently working on an app that needs to automatically pixelate people in videos so they cannot be recognized. I have tried several things now, but nothing seems to work.

Is it possible to just use an image object detection model and apply it to each frame of the video? What I have tried so far with KerasCV is not really working.
I've also tried mmtracking but wasn't even able to install it: GitHub - open-mmlab/mmtracking: OpenMMLab Video Perception Toolbox. It supports Video Object Detection (VID), Multiple Object Tracking (MOT), Single Object Tracking (SOT), and Video Instance Segmentation (VIS) with a unified framework.

The only thing I have found to work so far is YOLOv8 from Ultralytics, but I can't use it because my app is a commercial application.

I'm new to machine learning and don't know where to look. Maybe someone knows where I can find an easy-to-use pretrained model for video object tracking that I can use in my app.


Hi @Pascal_Winterle.

Maybe you can try another version of YOLO (I didn't check that specifically for YOLOv8) - it should be free of charge for commercial use as well, as you can see here.

Then you can just detect people and blur or pixelate them (e.g. with OpenCV) to reach your goal.

Hope it helps a little bit.

BR
Honza


This is a very ambitious project if you do not have any Machine Learning experience.


Yes, I know. It's for my school project and I have two weeks to do it… I hope I find some solution.


I think this is highly unlikely. Your project is very complex.


Yes, it feels like it. But I have to build the project and document it somehow. I can use YOLO for now, which makes it a bit easier. This is what I've tried with Keras so far:

# Note: overriding `num_classes` re-initializes the prediction head with
# random weights, so keep the preset's 20 Pascal VOC classes and filter
# for the "person" class afterwards.
model = keras_cv.models.YOLOV8Detector.from_preset(
    "yolo_v8_m_pascalvoc",
    bounding_box_format="xywh",
)

inference_resizing = keras_cv.layers.Resizing(
    640, 640, pad_to_aspect_ratio=True, bounding_box_format="xywh"
)

# In the alphabetical Pascal VOC label ordering, "person" is class
# index 14 (verify against your preset's label map).
PERSON_CLASS_ID = 14

# Define the codec and create a VideoWriter object
import cv2

fourcc = cv2.VideoWriter_fourcc(*'mp4v')  # Use 'mp4v' for MP4 format
output_file = 'output_video.mp4'  # Output video file name
output_size = (500, 500)  # Output video size

out = cv2.VideoWriter(output_file, fourcc, 20.0, output_size)

def store_box_dimensions(boxes, classes):
    """Collect (x, y, w, h) of detections classified as "person".

    KerasCV pads its prediction arrays with -1, so stop at the first
    padded entry.
    """
    x_list, y_list, w_list, h_list = [], [], [], []

    for i in range(boxes.shape[1]):
        x, y, w, h = boxes[0][i][:4]

        if x == -1:  # padding sentinel: no more real detections
            break

        if int(classes[0][i]) != PERSON_CLASS_ID:
            continue  # skip the other VOC classes

        x_list.append(x)
        y_list.append(y)
        w_list.append(w)
        h_list.append(h)

    return x_list, y_list, w_list, h_list

cap = cv2.VideoCapture('../../data/test.mp4')

if not cap.isOpened():
    print("Error: Couldn't open the video file.")
    exit()

frame_counter = 0

while True:
    ret, test_img = cap.read()
    if not ret:
        print("End of video reached.")
        break

    frame_counter += 1

    # Process every 25th frame
    if frame_counter % 25 == 0:
        image = inference_resizing([test_img])

        y_pred = model.predict(image)
        x_values, y_values, w_values, h_values = store_box_dimensions(
            y_pred['boxes'], y_pred['classes']
        )

        # The predicted boxes are in the 640x640 coordinate system of
        # `inference_resizing`; with pad_to_aspect_ratio=True the longer
        # side is scaled to 640, so map them back to the original frame.
        scale = max(test_img.shape[:2]) / 640.0

        for i in range(len(x_values)):
            x = int(x_values[i] * scale)
            y = int(y_values[i] * scale)
            w = int(w_values[i] * scale)
            h = int(h_values[i] * scale)
            print(f"Box {i + 1}: x={x}, y={y}, w={w}, h={h}")
            cv2.rectangle(test_img, (x, y), (x + w, y + h), (255, 255, 255), thickness=4)

        resize_img = cv2.resize(test_img, (500, 500))
        out.write(resize_img)

    if cv2.waitKey(10) == ord('q'):
        break

cap.release()
out.release()
cv2.destroyAllWindows()
