Name	Name	Last commit message	Last commit date
Latest commit kyteinsky maximize space action Mar 13, 2024 f71b55d · Mar 13, 2024 History 92 Commits
.github	.github	maximize space action	Mar 13, 2024
appinfo	appinfo	v1.2.0	Mar 11, 2024
context_chat_backend	context_chat_backend	Introduce /deleteSourcesByProviderForAllUsers and fixes	Mar 1, 2024
persistent_storage	persistent_storage	ported init and persistent storage changes	Feb 22, 2024
.gitignore	.gitignore	ported init and persistent storage changes	Feb 22, 2024
.nextcloudignore	.nextcloudignore	add changelog and update .nextcloudignore	Dec 21, 2023
.pre-commit-config.yaml	.pre-commit-config.yaml	use pre-commit instead of gh workflow	Feb 26, 2024
Dockerfile	Dockerfile	Added initial cuda11.8 support (#16 )	Mar 11, 2024
Dockerfile.cpu	Dockerfile.cpu	Added initial cuda11.8 support (#16 )	Mar 11, 2024
LICENSE	LICENSE	app ecosystem necessary files	Oct 20, 2023
Makefile	Makefile	v1.2.0	Mar 11, 2024
README.md	README.md	v1.2.0	Mar 11, 2024
changelog.md	changelog.md	v1.2.0	Mar 11, 2024
config.cpu.yaml	config.cpu.yaml	Added initial cuda11.8 support (#16 )	Mar 11, 2024
config.yaml	config.yaml	Added initial cuda11.8 support (#16 )	Mar 11, 2024
example.env	example.env	v1.2.0	Mar 11, 2024
krankerl.toml	krankerl.toml	app ecosystem necessary files	Oct 20, 2023
main.py	main.py	type fixes and other numerous fixes	Feb 26, 2024
pyproject.toml	pyproject.toml	v1.2.0	Mar 11, 2024
requirements.cpu.txt	requirements.cpu.txt	Added initial cuda11.8 support (#16 )	Mar 11, 2024
requirements.dev.txt	requirements.dev.txt	Added initial cuda11.8 support (#16 )	Mar 11, 2024
requirements.txt	requirements.txt	Added initial cuda11.8 support (#16 )	Mar 11, 2024
weaviate-docker-compose.yml	weaviate-docker-compose.yml	refactor at the brink of complete	Oct 19, 2023

Context Chat

Note

This is a beta software. Expect breaking changes.

[!INFO] A fully working manually example with cuda 11.8 is at the end of this readme

Simple Install

Install three mandatory apps for this app to work as desired in your Nextcloud install from the "Apps" page:

AppAPI (>= v2.0.3): https://apps.nextcloud.com/apps/app_api
Context Chat (>= 1.1.0): https://apps.nextcloud.com/apps/context_chat
Assistant: https://apps.nextcloud.com/apps/assistant (The OCS API or the occ commands can also be used to interact with this app but it recommended to do that through a Text Processing OCP API consumer like the Assitant app.)

Install this backend app (Context Chat Backend: https://apps.nextcloud.com/apps/context_chat_backend) from the "External Apps" page
Start using Context Chat from the Assistant UI

Note

See AppAPI's deploy daemon configuration For GPU Support: Ensure docker is installed and the Nextcloud's web server user has access to /var/run/docker.sock, the docker socket. Mount the docker.sock in the Nextcloud container if you happen to use a containerized install of Nextcloud and ensure correct permissions for the web server user to access it. See 4th point in Complex Install (with docker) on how to do this

Complex Install (without docker)

python -m venv .venv
. .venv/bin/activate
Install requirements

3.1 For using CPU:

pip install --no-deps -r requirements.cpu.txt

3.2 Using GPU with CUDA:

pip install --no-deps -r requirements.txt
Install pandoc from your desired package manager (# apt install pandoc for Debian-based systems)
Copy example.env to .env and fill in the variables, when using CUDA then you need to uncomment and adjust the two NVDIA_* related envvars in your .env file
Configure config.cpu.yaml (or config.yaml using gpu and replace config.cpu.yaml with it) for the model name, model type and its parameters (which also includes model file's path and model id as per requirements, see example config)
./main.py
Follow the below steps to register the app in the app ecosystem

Complex Install (with docker)

Build the image (this is a good place to edit the example.env file before building the container)

1.1 CPU:

docker build -t context_chat_backend_dev . -f Dockerfile.cpu

1.2 GPU:

docker build -t context_chat_backend_dev . -f Dockerfile
docker run --add-host=host.docker.internal:host-gateway -p 10034:10034 context_chat_backend_dev
Volumes can be mounted for persistent_storage/model_files and persistent_storage/vector_db_files if you wish with -v $(pwd)/persistent_storage/model_files:/app/model_files and similar for vector_db_files
If your Nextcloud is running inside a docker container, ensure you have mounted the docker socket inside your container and has the correct permissions for the web server user to have access to it or add the web server to the docker group:

for docker compose

volumes:
- /var/run/docker.sock:/tmp/docker.sock:ro

for docker container run command

-v /var/run/docker.sock:/var/run/docker.sock:ro

Follow the below steps to register the app in the app ecosystem

Manual Install (without docker)

python -m venv .venv
. .venv/bin/activate
Install requirements 3.1 CPU:

pip install --no-deps -r requirements.cpu.txt

3.2 GPU:

pip install --no-deps -r requirements.txt
Install pandoc from your desired package manager (# apt install pandoc for Debian-based systems)
Copy example.env to .env and fill in the variables
Configure config.yaml (or config.yaml using gpu and replace config.yaml with it) for the model name, model type and its parameters (which also includes model file's path and model id as per requirements, see example config)
./main.py
Follow the below steps to register the app in the app ecosystem

Manual Install (with docker)

Build the image (this is a good place to edit the example.env file before building the container)

1.1 CPU:

docker build -t context_chat_backend_dev . -f Dockerfile.cpu

1.2 GPU:

docker build -t context_chat_backend_dev . -f Dockerfile
docker run --add-host=host.docker.internal:host-gateway -p10034:10034 context_chat_backend_dev
Volumes can be mounted for model_files and vector_db_files if you wish with -v $(pwd)/model_files:/app/model_files and similar for vector_db_files
If your Nextcloud is running inside a docker container, there are two ways to configure the deploy daemon
Follow the below steps to register the app in the app ecosystem (For a dev setup, mount the context_chat_backend/ folder as a volume and set the uvicorn to reload on change)

Register as an Ex-App

1. Create a manual deploy daemon:

occ app_api:daemon:register --net host manual_install "Manual Install" manual-install http <host> <nextcloud_url>

host will be localhost if nextcloud can access localhost or host.docker.internal if nextcloud is inside a docker container and the backend app is on localhost.

If nextcloud is inside a container, --add-host option would be required by your nextcloud container. See example above, pt. 2

2. Register the app using the deploy daemon (be mindful of the port number and the app's version):

occ app_api:app:register context_chat_backend manual_install --json-info \
"{\"appid\":\"context_chat_backend\",\"name\":\"Context Chat Backend\",\"daemon_config_name\":\"manual_install\",\"version\":\"1.2.0\",\"secret\":\"12345\",\"port\":10034,\"scopes\":[],\"system_app\":0}" \
--force-scopes --wait-finish

The command to unregister is given below (force is used to also remove apps whose container has been removed)

occ app_api:app:unregister context_chat_backend --force

Configure the AppAPI's deploy daemon

Ensure that docker is installed and the default deploy daemon is working in Admin settings -> AppAPI Docker socket proxy is the recommended for the deploy daemon. Installation steps can be found here: https://github.com/cloud-py-api/docker-socket-proxy

An alternative method would be to provide the Nextcloud's web server user access to /var/run/docker.sock, the docker socket and use deployment configuration in the default deploy daemon of AppAPI.

Mount the docker.sock in the Nextcloud container if you happen to use a containerized install of Nextcloud and ensure correct permissions for the web server user to access it.

for docker compose

volumes:
    - /var/run/docker.sock:/var/run/docker.sock:ro

for docker container, use this option with the docker run command

-v /var/run/docker.sock:/var/run/docker.sock:ro

A fully working GPU Example with cuda 11.8

1. Build the image

cd /your/path/to/the/cloned/repository
docker build --no-cache -f Dockerfile -t context_chat_backend_dev:11.8 .

Parameter explanation:

--no-cache

Tells Docker to build the image without using any cache from previous builds.

-f Dockerfile

The -f or --file option specifies the name of the Dockerfile to use for the build. In this case Dockerfile

-t context_chat_backend_dev:11.8

The -t or --tag option allows you to name and optionally tag your image, so you can refer to it later. In this case we name it context_chat_backend_devwith the specified version 11.8

.

This final argument specifies the build context to the Docker daemon. In most cases, it's the path to a directory containing the Dockerfile and any other files needed for the build. Using . means "use the current directory as the build context."

2. Run the image

Hint:
Adjust the example.env to your needs so that it fits your environment
When using `CUDA` then you need to uncomment and adjust the two NVDIA_* related envvars in your example.env file

docker run \
    -v ./config.yaml:/app/config.yaml \
    -v ./context_chat_backend:/app/context_chat_backend \
    -v /var/run/docker.sock:/var/run/docker.sock \
    --env-file example.env \
    -p 10034:10034 \
    -e CUDA_VISIBLE_DEVICES=0 \
    -v ./persistent_storage:/app/persistent_storage \
    --gpus=all \
    context_chat_backend_dev:11.8

Parameter explanation:

-v ./config.yaml:/app/config.yaml

Mounts the config_cuda.yaml which will be used inside the running image

-v ./context_chat_backend:/app/context_chat_backend

Mounts the context_chat_backend into the docker image

-v /var/run/docker.sock:/var/run/docker.sock

Mounts the Docker socket file from the host into the container. This is done to allow the Docker client running inside the container to communicate with the Docker daemon on the host, essentially controlling Docker and GPU from within the container.

-v ./persistent_storage:/app/persistent_storage

Mounts the persistent storage into the docker instance to keep downloaded models stored for the future.

--env-file example.env

Specifies an environment file named example.env to load environment variables from. Please adjust it for your needs.

-p 10034:10034

This publishes a container's port (10034) to the host (10034). Please align it with your environment file

-e CUDA_VISIBLE_DEVICES=0

Used to limit which GPUs are visible to CUDA applications running in the container. In this case, it restricts visibility to only the first GPU.

--gpus all

Grants the container access to all GPUs available on the host. This is crucial for running GPU-accelerated applications inside the container.

context_chat_backend_dev:11.8

Specifies the image to use for creating the container. In this case we have build the image in 1.) with the specified tag

3. Register context_chat_backend

Hint:
Make sure the previous build cuda_backend_dev docker image is running as the next steps will connect to it on the specified port

cd /var/www/<your_nextcloud_instance_webroot> # For example /var/www/nextcloud/
sudo -u www-data php occ app_api:app:unregister context_chat_backend
sudo -u www-data php occ app_api:app:register \
    context_chat_backend \
    manual_install \
    --json-info "{\"appid\":\"context_chat_backend\",\
                  \"name\":\"Context Chat Backend\",\
                  \"daemon_config_name\":\"manual_install\",\
                  \"version\":\"1.2.0\",\
                  \"secret\":\"12345\",\
                  \"port\":10034,\
                  \"scopes\":[],\
                  \"system_app\":0}" \
    --force-scopes \
    --wait-finish

If successfully registered the output will be like this

	ExApp context_chat_backend successfully unregistered.  
	ExApp context_chat_backend deployed successfully.  
	ExApp context_chat_backend successfully registered.

And your docker container should show that the application has been enabled:

	App enabled  
	TRACE: 172.17.0.1:51422 - ASGI [4] Send {'type': 'http.response.start', 'status': 200, 'headers': '<...>'}  
	INFO: 172.17.0.1:51422 - "PUT /enabled?enabled=1 HTTP/1.1" 200 OK  
	TRACE: 172.17.0.1:51422 - ASGI [4] Send {'type': 'http.response.body', 'body': '<12 bytes>'}  
	TRACE: 172.17.0.1:51422 - ASGI [4] Completed  
	TRACE: 172.17.0.1:51422 - HTTP connection lost  
	INFO: 172.17.0.1:51408 - "POST /init HTTP/1.1" 200 OK  
	TRACE: 172.17.0.1:51408 - ASGI [3] Send {'type': 'http.response.start', 'status': 200, 'headers': '<...>'}  
	TRACE: 172.17.0.1:51408 - ASGI [3] Send {'type': 'http.response.body', 'body': '<2 bytes>'}  
	TRACE: 172.17.0.1:51408 - ASGI [3] Completed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Context Chat

Simple Install

Complex Install (without docker)

Complex Install (with docker)

Manual Install (without docker)

Manual Install (with docker)

Register as an Ex-App

Configure the AppAPI's deploy daemon

A fully working GPU Example with cuda 11.8

About

Releases

Packages 1

Contributors 10

Languages

License

nextcloud/context_chat_backend

Folders and files

Latest commit

History

Repository files navigation

Context Chat

Simple Install

Complex Install (without docker)

Complex Install (with docker)

Manual Install (without docker)

Manual Install (with docker)

Register as an Ex-App

Configure the AppAPI's deploy daemon

A fully working GPU Example with cuda 11.8

About

Resources

License

Security policy

Stars

Watchers

Forks

Releases

Packages 1

Contributors 10

Languages