Search before asking
I have searched the Yolo Tracking issues and found no similar bug report.
Question
Why do we use an additional ReID model to extract object features if we have already applied an R-CNN/Keypoint R-CNN to detect persons? Wouldn't it be easier to get the feature vectors for each bounding box from the ROI Align layer in the R-CNN/Keypoint R-CNN and pass them to the tracker? For instance, in the torchvision_boxmot example:
1. We run the line
pose_model = torchvision.models.detection.keypointrcnn_resnet50_fpn(pretrained=True)
to detect persons in the scene and extract pose keypoints. However, in this model we can already access the bounding-box features in the ROI Align layer (see the hook sketch after these steps).
2. Subsequently, we execute:
tracker = BotSort(
    reid_weights=Path('osnet_x0_25_msmt17.pt'),  # ReID model to use
    device=device,
    half=False,
)
This loads yet another model, osnet_x0_25_msmt17, just to extract appearance features for the detected objects.
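To make step 1 concrete, here is a minimal sketch of the feature access I mean (the forward hook is my own illustration, not code from the repo):

import torch
import torchvision

pose_model = torchvision.models.detection.keypointrcnn_resnet50_fpn(pretrained=True).eval()

# Capture the pooled per-box feature maps as the ROI Align layer computes them.
pooled = []
hook = pose_model.roi_heads.box_roi_pool.register_forward_hook(
    lambda module, inputs, output: pooled.append(output.detach())
)

with torch.no_grad():
    detections = pose_model([torch.rand(3, 480, 640)])  # dummy image

hook.remove()
# pooled[0] has shape (num_proposals, 256, 7, 7). Caveat: these features
# belong to the RPN proposals, not one-to-one to the final post-NMS
# detections, so matching them to the output boxes takes extra bookkeeping.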
My question is: isn't this a computational waste? Couldn't the features from the R-CNN be used in the tracker instead of running osnet_x0_25_msmt17?
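For context on where that cost is paid, a rough usage sketch (dummy values; my understanding is that BotSort's update() crops each detection from the frame and embeds it with OSNet internally, every frame):

import numpy as np

dets = np.array([[50, 60, 120, 300, 0.92, 0]])   # one detection: x1, y1, x2, y2, conf, cls
frame = np.zeros((480, 640, 3), dtype=np.uint8)  # placeholder BGR frame
# update() associates detections with tracks; the ReID embedding of each
# cropped box is computed inside this call.
tracks = tracker.update(dets, frame)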
This concept is called Joint Detection and Embedding (JDE) and is now becoming popular for real-time trackers.
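Conceptually (hypothetical names, not a real API):

# SDE (separate detection and embedding), as in this repo: two passes per frame
boxes = detector(frame)                 # detection network
embs = reid_model(crops(frame, boxes))  # separate ReID network on the crops

# JDE: one shared backbone with an extra embedding head, a single pass
boxes, embs = jde_model(frame)          # detections and embeddings together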
But there are still many problems: these models are harder to train and deploy, and two separate models for two slightly different tasks generally beat a single model that compromises between detection and re-identification; the features the two tasks need are similar but far from identical.
And since ReID by itself gives only a small advantage, it's often just not worth the time.