Multi-object caption has negative effect on detection results. #330

hotelll · 2024-05-08T11:13:00Z

I am using GroundingDINO to detect object from image. However, I found that an object can be found with caption "ping pong.", but cannot be found with caption "man. ping pong.". The results are as follows:

caption: "ping pong" box_threshold=0.3
caption: "man. ping pong." box_threshold=0.3
caption: "man. ping pong." box_threshold=0.2

I wonder why this happened, and how to solve/ease this issue? Thanks!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi-object caption has negative effect on detection results. #330

Multi-object caption has negative effect on detection results. #330

hotelll commented May 8, 2024 •

edited

Loading

Multi-object caption has negative effect on detection results. #330

Multi-object caption has negative effect on detection results. #330

Comments

hotelll commented May 8, 2024 • edited Loading

hotelll commented May 8, 2024 •

edited

Loading