Any idea for Combine origin rwkv and vision rwkv into one structure like Clips or Blips ? #14

structure-charger · 2024-04-17T03:25:15Z

As the title, and beyond the title, is there any way to implementation rwkv llava, minicpm-v, internml-composer-v or qwen-v ?

BlinkDL · 2024-04-17T14:00:28Z

check https://github.com/howard-hou/VisualRWKV

duanduanduanyuchen · 2024-05-13T08:57:33Z

Hi! Thanks for your advice. We didn't have the plan for these models yet. As VRWKV is a visual backbone like ViT, I think it can be applied to these architectures similarly.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Any idea for Combine origin rwkv and vision rwkv into one structure like Clips or Blips ? #14

Any idea for Combine origin rwkv and vision rwkv into one structure like Clips or Blips ? #14

structure-charger commented Apr 17, 2024

BlinkDL commented Apr 17, 2024

duanduanduanyuchen commented May 13, 2024

Any idea for Combine origin rwkv and vision rwkv into one structure like Clips or Blips ? #14

Any idea for Combine origin rwkv and vision rwkv into one structure like Clips or Blips ? #14

Comments

structure-charger commented Apr 17, 2024

BlinkDL commented Apr 17, 2024

duanduanduanyuchen commented May 13, 2024