Batch for image captioning #51

rafstahelin · 2023-09-02T15:57:04Z

Is anyone working on recursive folder batch script for captioning stable diffusion captions? Speed needs to be under 2 seconds for 50 tokens max per image. English phrases

vealocia · 2023-09-04T08:06:28Z

Hi, @rafstahelin!
For your information, we evaluate Qwen-VL on Flickr30K caption dataset (about 1K images) with 8 gpus, a total batch size of 64 (8 images per gpu) and 30 max tokens per image. It costs about 20 seconds for whole process and 1.3 second for a batch of images (8 images).
There are lots of thing you can do to get faster inference, including but not limited to:

Batch inference.
Try our Int-4 quantized model. You can get the weights here.

rafstahelin · 2023-09-04T10:16:38Z

That's great. However none of us use multi GPU's on runpod. Could you point me in the direction of a service to does this? Is there code to use to be able to implement this?

ShadoWxShinigamI · 2023-10-25T12:01:37Z

#136 Should be able to batch a directory with this. Modify as required

FangGet · 2023-11-06T02:39:55Z

import this file: qwen_generation_utils.py

call make_context for context_token and construct a batch;
call model.generate for batch generation;
call decode_token to get final response

trouble-maker007 · 2023-11-07T12:20:15Z

@ShadoWxShinigamI I found your batch code is still chat one image with model.chat, not with batch

matankley · 2024-02-13T11:00:33Z

@rafstahelin Were you able to run a batch successfully ?

rafstahelin · 2024-02-13T11:51:43Z

didnt try

…

On Tue, Feb 13, 2024 at 12:00 PM Matan Kleyman ***@***.***> wrote: @rafstahelin <https://github.com/rafstahelin> Were you able to run a batch successfully ? — Reply to this email directly, view it on GitHub <#51 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/APNXTFWZYYTVDZJSM4P2ZBDYTNBV3AVCNFSM6AAAAAA4IVUPOCVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMYTSNBRGIYTGOJTGE> . You are receiving this because you were mentioned.Message ID: ***@***.***>

-- thanks best regards, raf

ShuaiBai623 closed this as completed Dec 18, 2023

erikreed mentioned this issue Jan 15, 2024

Batch inference #240

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Batch for image captioning #51

Batch for image captioning #51

rafstahelin commented Sep 2, 2023

vealocia commented Sep 4, 2023 •

edited

Loading

rafstahelin commented Sep 4, 2023

ShadoWxShinigamI commented Oct 25, 2023

FangGet commented Nov 6, 2023

trouble-maker007 commented Nov 7, 2023

matankley commented Feb 13, 2024

rafstahelin commented Feb 13, 2024 via email

Batch for image captioning #51

Batch for image captioning #51

Comments

rafstahelin commented Sep 2, 2023

vealocia commented Sep 4, 2023 • edited Loading

rafstahelin commented Sep 4, 2023

ShadoWxShinigamI commented Oct 25, 2023

FangGet commented Nov 6, 2023

trouble-maker007 commented Nov 7, 2023

matankley commented Feb 13, 2024

rafstahelin commented Feb 13, 2024 via email

vealocia commented Sep 4, 2023 •

edited

Loading