GitHub - wizlee/local-ml-runs: A collections of various machine learning explorations

Overview

Exploring the world of stable difussion and LLMs

History

Exploring stable-diffusion-webui

the code-workspace file in this repo contains some bootscraps that assumes you have
- a conda environment called pytorch. This can be easily modified to any other name.
- that you are using bash (Git Bash or GNU Bash should both work)
The steps in the repo's readme matches what I did closely in creating the conda environment. Briefly as below:

Run the commands below

# Create environment
conda create -n pytorch python=3.10.6
# Activate environment
conda active pytorch
# Start local webserver
./webui-user.bat
# Wait for "Running on local URL:  http://127.0.0.1:7860" and open that URI.

note that webui-user.bat will create a python virtual environment despite dedicated conda environment is created. If you don't prefer this to happen perform what the script does manually.
- or don't use conda and directly use your system python.
Download the stable diffusion models from here for v2.1 and here for v1.4.
Need use additional arg of --xformers and --medvram if GPU VRAM is less than 12GB. After using these arg web server is successfully launched even using GPU of only 2GB VRAM.
HOWEVER, there might still be error when trying to generate image using GPU with less VRAM (i.e. 2GB)
References
- https://github.com/Stability-AI/StableDiffusion
- https://github.com/AUTOMATIC1111/stable-diffusion-webui

Exploring VideoCrafter

After following the official setup guide, tried to run VideoControl model
Got a warning of symlink not enabled and will result in more disk space use. Two solutions
- Run as admin
- Enable Developer mode in windows (went for this)
Got another error when trying to run model, AssertionError: Torch not compiled with CUDA enabled
- Tried install torch with cuda conda install pytorch==1.13.1 torchvision==0.14.1 torchaudio==0.13.1 pytorch-cuda=11.6 -c pytorch -c nvidia
- Found that the solver is too slow and upgraded conda to use libmamba solver
- Update conda version forcefully using conda install conda={latest conda version} because of this exact error
Need to install chardet after getting ModuleNotFoundError: No module named 'chardet' error
- pip install chardet
However, despite the python command suceed in this stage, ran into the issue of insufficient GPU RAM as stated in the readme of the project
- Minimum of 7GB VRAM is needed, my machine had 6GB 😰

Updated CUDA version from 11.6 to 12.1. Pending retrying all the steps again.

Steps

conda create -n lvdm python=3.8.5
conda activate lvdm
pip install -r requirements_xformer.txt
Overwrite pip install of pytorch with pytorch compiled with CUDA
- conda install pytorch torchvision torchaudio pytorch-cuda=11.8 -c pytorch -c nvidia
pip install chardet

Lastly the steps from the readme for running VideoControl

PROMPT="An ostrich walking in the desert, photorealistic, 4k"
VIDEO="input/flamingo.mp4"
OUTDIR="results/"

NAME="video_adapter"
CONFIG_PATH="models/adapter_t2v_depth/model_config.yaml"
BASE_PATH="models/base_t2v/model.ckpt"
ADAPTER_PATH="models/adapter_t2v_depth/adapter.pth"

python scripts/sample_text2video_adapter.py \
    --seed 123 \
    --ckpt_path $BASE_PATH \
    --adapter_ckpt $ADAPTER_PATH \
    --base $CONFIG_PATH \
    --savedir $OUTDIR/$NAME \
    --bs 1 --height 256 --width 256 \
    --frame_stride -1 \
    --unconditional_guidance_scale 15.0 \
    --ddim_steps 50 \
    --ddim_eta 1.0 \
    --prompt "$PROMPT" \
    --video $VIDEO

Conclusions
- Still hitting the VRAM insufficient issue. Have to put off running any models that requires more than 6GB GPU memory locally. ☠️

Exploring Genesis

Initially try to use the experimental conda-pypi to install this project using conda pip install ..
- However faces an error that I am not able to quickly overcome.
Thus, below is the current steps I took to install this project.
- Create env if not exist: conda create -n pytorch python=3.10
  - Activate: conda activate pytorch
- Install pytorch: conda install pytorch torchvision torchaudio pytorch-cuda=12.1 -c pytorch -c nvidia
- Install project from pyproject: pip install .
  - This is considered not a good practise when install packages in conda other than python and pip. (we installed pytorch using conda)
  - The reason is pip will override what conda has already resolved and conda will also not know what pip did.

Look Around

Ran python examples/smoke.py.
- Stopped after letting it generates 11 PNG in examples/video.
- Ran gs animate examples/video/*.png to render a MP4 file using the generated PNGs.
Ran python examples/rendering/demo.py.
- This requires LuisaRenderer which requires building from source.
- There isn't detailed instruction for windows in the official guide thus will need to experiment on our own.
- Setup steps
  - Install VS2022 Community and select to install the desktop c++ workload. Uncheck all except for the following individual components.
    - MSVC build tool
    - Cmake
    - Clang
    - Windows SDK
  - install rust (msvc buildchain): choco install rust-ms
  - Tips:
    - clear cmake cache for fresh build as this causes various issues such as CMAKE_SIZEOF_VOID_P size to be 4 causing only supported in 64-bit error
  - Install cudatoolkit: choco install cuda --version=12.0.1.52833
    - This particular version is chosen based on avaibility in chocolatey, compabitility table shows in cudatoolkit release note and output of nvidia-smi showing current CUDA driver version.
    - Install NVIDIA Texture Tools Exporter (NVTT)
  - Open VS Developer Command Prompt
    - [Optional] add a terminal profile in VS Code for easy starting directory
    - make sure pytorch conda env is activated
      - c:\tools\miniconda3\condabin\activate.bat && conda activate pytorch && cd genesis/ext/LuisaRender
    - cmake -S . -B build -D CMAKE_BUILD_TYPE=Release -D PYTHON_VERSIONS=3.10 -D LUISA_COMPUTE_DOWNLOAD_NVCOMP=ON -D LUISA_COMPUTE_ENABLE_GUI=OFF
    - cmake --build build -j 12
  - More recent steps from this github issue and this docs
    - pip install -e ".[render]" --break-system-packages
      - this is equivalent to pip install "pybind11[global]" --break-system-packages
    - cmake -S . -B build -D CMAKE_BUILD_TYPE=Release -D PYTHON_VERSIONS=3.10 -D LUISA_COMPUTE_DOWNLOAD_NVCOMP=ON
    - cmake --build build -j 12
  - Need to install the rest of the dependencies too
    - conda install -c conda-forge minizip zlib
    - download and install vulkan sdk: https://vulkan.lunarg.com/

Name		Name	Last commit message	Last commit date
Latest commit History 16 Commits
Genesis @ be62afe		Genesis @ be62afe
VideoCrafter @ 11bcd76		VideoCrafter @ 11bcd76
research/using-chatgpt		research/using-chatgpt
stable-diffusion-webui @ 82a973c		stable-diffusion-webui @ 82a973c
.gitmodules		.gitmodules
LICENSE		LICENSE
local-ml-runs.code-workspace		local-ml-runs.code-workspace
readme.md		readme.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Overview

History

Exploring stable-diffusion-webui

Exploring VideoCrafter

Exploring Genesis

Look Around

About

Uh oh!

Languages

License

wizlee/local-ml-runs

Folders and files

Latest commit

History

Repository files navigation

Overview

History

Exploring stable-diffusion-webui

Exploring VideoCrafter

Exploring Genesis

Look Around

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Languages