Skip to content

Implementation of a Docker container hosting the pure Rust implementation for the text2vec-transformers module

License

Notifications You must be signed in to change notification settings

weaviate/t2v-transformers-models-rs

Repository files navigation

t2v-transformers-models-rs

This repository contains the Proof of Concept (PoC) for a Rust-native sentence embedding API for use within Weaviate's module vectorization framework.

In its current form, it uses huggingface/candle to load HuggingFace models from the Hub into Rust. The models are then used to generate embeddings for input text.

The API layer is provided by the tokio-rs/axum framework, which allows for event loop based concurrency.

The inference workloads themselves are scheduled into a rayon thread-pool using andybarron/tokio-rayon, which is a lightweight wrapper allowing for awaiting of rayon jobs within a tokio async function. Ideally, this would be re-implemented by us internally to avoid the dangerous external dependency. For now, it is used as a proof of concept.

About

Implementation of a Docker container hosting the pure Rust implementation for the text2vec-transformers module

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published