Skip to content

Latest commit

 

History

History
24 lines (15 loc) · 1.44 KB

README.md

File metadata and controls

24 lines (15 loc) · 1.44 KB

Rust Algorithms

A collection of useful algorithms written in Rust. Currently contains:

  • geo_filters: probabilistic data structures that solve the Distinct Count Problem using geometric filters.
  • bpe: fast, correct, and novel algorithms for the Byte Pair Encoding Algorithm which are particularly useful for chunking of documents.
  • bpe-openai: Fast tokenizers for OpenAI token sets based on the bpe crate.
  • string-offsets: converts string positions between bytes, chars, UTF-16 code units, and line numbers. Useful when sending string indices across language boundaries.

Background

Rust Algorithms is under active development and maintained by GitHub staff AND THE COMMUNITY. See CODEOWNERS for more details.

We will do our best to respond to support, feature requests, and community questions in a timely manner. For more details see support and contribution guidelines.

Requirements

Requires a working installation of Rust through download | through Homebrew

License

This project is licensed under the terms of the MIT open source license. Please refer to MIT for the full terms.