
Problem about processor in load_pretrained_model #74

Open
ShuyUSTC opened this issue Aug 13, 2024 · 3 comments
ShuyUSTC commented Aug 13, 2024

Hi Teams,

I'm trying to evaluate VideoLLaMA2 on MVBench. When I run inference_video_mcqa_mvbench.py, the following traceback occurs:

Traceback (most recent call last):
  File "/***/VideoLLaMA2/videollama2/eval/inference_video_mcqa_mvbench.py", line 203, in <module>
    run_inference(args)
  File "/***/VideoLLaMA2/videollama2/eval/inference_video_mcqa_mvbench.py", line 164, in run_inference
    for i, line in enumerate(tqdm(val_loader)):
  File "/***/python3.11/site-packages/tqdm/std.py", line 1178, in __iter__
    for obj in iterable:
  File "/***/lib/python3.11/site-packages/torch/utils/data/dataloader.py", line 630, in __next__
    data = self._next_data()
           ^^^^^^^^^^^^^^^^^
  File "/***/lib/python3.11/site-packages/torch/utils/data/dataloader.py", line 1345, in _next_data
    return self._process_data(data)
           ^^^^^^^^^^^^^^^^^^^^^^^^
  File "/***/lib/python3.11/site-packages/torch/utils/data/dataloader.py", line 1371, in _process_data
    data.reraise()
  File "/***/lib/python3.11/site-packages/torch/_utils.py", line 694, in reraise
    raise exception
AttributeError: Caught AttributeError in DataLoader worker process 0.
Original Traceback (most recent call last):
  File "/***/lib/python3.11/site-packages/torch/utils/data/_utils/worker.py", line 308, in _worker_loop
    data = fetcher.fetch(index)
           ^^^^^^^^^^^^^^^^^^^^
  File "/***/lib/python3.11/site-packages/torch/utils/data/_utils/fetch.py", line 51, in fetch
    data = [self.dataset[idx] for idx in possibly_batched_index]
           ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/***/lib/python3.11/site-packages/torch/utils/data/_utils/fetch.py", line 51, in <listcomp>
    data = [self.dataset[idx] for idx in possibly_batched_index]
            ~~~~~~~~~~~~^^^^^
  File "/***/VideoLLaMA2/videollama2/eval/inference_video_mcqa_mvbench.py", line 50, in __getitem__
    torch_imgs = self.processor(video_path, s=bound[0], e=bound[1])
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/***/VideoLLaMA2/./videollama2/mm_utils.py", line 202, in process_video
    video = processor.preprocess(images, return_tensors='pt')['pixel_values']
            ^^^^^^^^^^^^^^^^^^^^
AttributeError: 'NoneType' object has no attribute 'preprocess'

I find that the processor in

processor = None
if "videollama" in model_type:
    vision_tower = model.get_vision_tower()
    if not vision_tower.is_loaded:
        vision_tower.load_model()
    vision_tower.to(device=device, dtype=torch.float16)
    # NOTE: videollama2 adopts the same processor for processing image and video.
    processor = vision_tower.image_processor
if hasattr(model.config, "max_sequence_length"):
    context_len = model.config.max_sequence_length
else:
    context_len = 2048
return tokenizer, model, processor, context_len

is initialized as None. For model_type=mistral in the config.json of VideoLLaMA2-7B and VideoLLaMA2-7B-16F, the processor remains None, which appears to cause the traceback above. Could you please help me address this problem? Thanks!
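To make the failure mode concrete, here is a minimal, self-contained sketch (with a hypothetical load_processor helper and dummy model objects; not the project's actual code) showing why a model_type that lacks the "videollama" substring leaves processor as None, and how failing fast would surface a clearer error than the deferred AttributeError inside the DataLoader worker:

```python
def load_processor(model, model_type):
    """Hypothetical reduction of the processor-loading branch above."""
    processor = None
    if "videollama" in model_type:
        # Only this branch ever assigns the processor.
        processor = model.get_vision_tower().image_processor
    if processor is None:
        # Raising here gives a direct error instead of the later
        # "'NoneType' object has no attribute 'preprocess'".
        raise ValueError(
            f"No processor loaded for model_type={model_type!r}; "
            "expected 'videollama' to appear in model_type"
        )
    return processor
```

With model_type="mistral" this raises immediately, whereas the original code silently returns processor=None and crashes only when the dataset first calls processor.preprocess.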

@clownrat6 (Member)

This bug is caused by an inconsistency between the checkpoint version and the code version. We have fixed this bug in the checkpoint. Please re-download the checkpoint.

@ShuyUSTC (Author)

Another question:

When loading a pipeline using:

# Use a pipeline as a high-level helper
from transformers import pipeline

pipe = pipeline("visual-question-answering", model="DAMO-NLP-SG/VideoLLaMA2-7B")

Transformers returns the following traceback:

ValueError: The checkpoint you are trying to load has model type `videollama2_mistral` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.

@clownrat6 (Member)

Our model is not integrated into transformers, so pipeline-style inference is not supported for now.
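As a quick sanity check before constructing a pipeline, you can ask your installed transformers whether it recognizes an architecture at all. This is a minimal sketch using the internal CONFIG_MAPPING registry (an implementation detail that may move between versions); a custom model type like videollama2_mistral will not appear there unless the model's own code registers it:

```python
from transformers.models.auto.configuration_auto import CONFIG_MAPPING

def is_supported(model_type: str) -> bool:
    # True only if this transformers install knows the architecture,
    # i.e. pipeline()/AutoConfig can resolve its config class.
    return model_type in CONFIG_MAPPING
```

If is_supported("videollama2_mistral") is False, pipeline() will raise the ValueError shown above, and you should use the inference entry points shipped in the VideoLLaMA2 repository instead.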
