t2v-transformers-models-rs

This repository contains the Proof of Concept (PoC) for a Rust-native sentence embedding API for use within Weaviate's module vectorization framework.

In its current form, it uses huggingface/candle to load HuggingFace models from the Hub into Rust. The models are then used to generate embeddings for input text.

The API layer is provided by the tokio-rs/axum framework, which allows for event loop based concurrency.

The inference workloads themselves are scheduled into a rayon thread-pool using andybarron/tokio-rayon, which is a lightweight wrapper allowing for awaiting of rayon jobs within a tokio async function. Ideally, this would be re-implemented by us internally to avoid the dangerous external dependency. For now, it is used as a proof of concept.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

t2v-transformers-models-rs

Files

README.md

Latest commit

History

README.md

File metadata and controls

t2v-transformers-models-rs