Does YOLO-World support complex queries for object detection? #562

Open
loucif01 opened this issue Jan 14, 2025 · 0 comments

Hello YOLO-World team,

I’m working on a project where I need to detect and describe objects in images using complex queries (e.g., "a building with a damaged roof and broken windows" or "a road completely submerged in water"). I’m considering using YOLO-World for this task and would like to confirm if the model supports such complex queries.

Specifically:

  1. Can YOLO-World handle natural language prompts that describe multiple attributes of an object (e.g., "a damaged roof with broken windows")?
  2. Does it support paragraph-level descriptions for object detection (e.g., "a flooded road with submerged vehicles and debris")?
  3. Are there any limitations on the complexity or length of the text prompts?

If YOLO-World does not natively support complex queries, are there any recommended approaches or fine-tuning strategies to achieve this functionality?

Thank you for your time and assistance!

Best regards,
