Does YOLO-World support complex queries for object detection? #562

loucif01 · 2025-01-14T03:54:23Z

Hello YOLO-World team,

I’m working on a project where I need to detect and describe objects in images using complex queries (e.g., "a building with a damaged roof and broken windows" or "a road completely submerged in water"). I’m considering using YOLO-World for this task and would like to confirm if the model supports such complex queries.

Specifically:

Can YOLO-World handle natural language prompts that describe multiple attributes of an object (e.g., "a damaged roof with broken windows")?
Does it support paragraph-level descriptions for object detection (e.g., "a flooded road with submerged vehicles and debris")?
Are there any limitations on the complexity or length of the text prompts?

If YOLO-World does not natively support complex queries, are there any recommended approaches or fine-tuning strategies to achieve this functionality?

Thank you for your time and assistance!

Best regards,

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does YOLO-World support complex queries for object detection? #562

Does YOLO-World support complex queries for object detection? #562

loucif01 commented Jan 14, 2025

Does YOLO-World support complex queries for object detection? #562

Does YOLO-World support complex queries for object detection? #562

Comments

loucif01 commented Jan 14, 2025