-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
你好,你们的工作自称是“zero-shot”可是却需要训练,跟 ReCLIP 的 setting 完全不一致啊,这该怎么解释?难道审稿的时候没有审稿人质疑? #6
Comments
First, even without fine-tuning CLIP, our model already outperforms ReCLIP on most tasks. |
我依然对你们的设置不太信服。 1、如果是对标 Reclip,那本文应该强调免训练的结果,而不是额外再训练的结果; 这样随意更改实验设置,容易引起不公平的比较,把后续的游戏规则给玩坏了。我认为这样的定义会给未来实验不公平带了不好的先例,可能不利于visual grounding 的发展的规范化。 =============== English version I'm still not convinced by your setting.
This arbitrary change of the experimental settings easily leads to unfair comparisons and breaks subsequent rules of the game. In my opinion, such a definition will set a bad precedent for future experiment fairness and may not contribute to normalizing visual grounding development. |
你好,你们的工作自称是“zero-shot”可是却需要训练,跟 ReCLIP 的 setting 完全不一致啊,这该怎么解释?难道审稿的时候没有审稿人质疑?论文当中对训练的方式和数据也没有解释清楚,还故意放补充材料。
your work is labeled as "zero-shot," but it requires training, which contradicts the ReCLIP setting. How do you explain this? Didn't the reviewer question it during manuscript review?
The text was updated successfully, but these errors were encountered: