KeraDB CLI is a lightweight, embedded document database designed for simplicity and ease of use. It serves as a powerful tool for developers to manage JSON documents with automatic indexing and persistent storage. Built in Rust, KeraDB offers high performance and memory safety while maintaining an intuitive interface.
Key Features:
Single-file database format: All data is stored in a single .ndb file for simplicity and ease of management.
Fast and memory-safe: Written in Rust, KeraDB combines high performance with memory safety, making it well suited to mission-critical applications.
Document-oriented storage: Store JSON documents with automatic UUIDs for easy organization and retrieval.
Vector search capabilities: Supports HNSW (Hierarchical Navigable Small World) indexing for fast approximate nearest neighbor searches, enabling efficient vector database operations.
Delta compression: Inspired by LEANN-style compression, KeraDB reduces storage requirements while maintaining search accuracy, with savings of up to 97% on some workloads.
Multi-language SDKs: Accessible from multiple programming languages, including Rust, Node.js, Python, Go, and C#, making it versatile for various development environments.
Audience & Benefit:
Ideal for developers building applications that require embedded document storage or vector search capabilities. KeraDB simplifies integration into projects by eliminating the need for external database setups, reducing overhead, and offering efficient performance for both small-scale and large-scale applications. Its robust features make it a reliable choice for embedding in machine learning workflows or any scenario requiring lightweight, high-performance data management. Installable via winget, KeraDB CLI provides developers with a seamless experience from the command line or through its interactive REPL interface.
KeraDB
> A lightweight, embedded NoSQL document database with vector search - written in Rust
KeraDB is a single-file, embedded document database designed for simplicity and ease of use. Think SQLite, but for JSON documents with vector search capabilities!
Metadata Filtering - Filter vector search by document attributes
Unified Storage - Vectors and documents in the same .ndb file
Delta Compression - Up to 97% storage savings with LEANN-style compression
LEANN-Style Delta Compression
KeraDB implements delta compression inspired by LEANN (Low-storage Embedding Approximate Nearest Neighbor), a research project from the Berkeley Sky Computing Lab that achieves up to 97% storage savings while maintaining search accuracy. The approach combines four techniques:
Graph-based selective recomputation - Only compute and store embeddings for nodes on the search path
High-degree preserving pruning - Keep important "hub" nodes while removing redundant connections
Sparse delta encoding - Store only the components that differ significantly (threshold-based); see the sketch after this list
Quantized deltas - Optional 8-bit quantization for even more aggressive compression
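The sparse-delta step is easy to see in code. Below is a minimal sketch, not KeraDB's actual internals - `SparseDelta`, `encode_delta`, and `decode_delta` are illustrative names - showing the mechanism: keep only the (index, delta) pairs where a vector differs from a nearby anchor by more than a threshold, then rebuild the vector by applying those pairs to the anchor.

```rust
/// Illustrative sparse delta: indices and values of the components that
/// differ meaningfully from an anchor vector.
struct SparseDelta {
    indices: Vec<u16>,
    deltas: Vec<f32>,
}

/// Encode `vector` against `anchor`, keeping only deltas above `threshold`.
fn encode_delta(anchor: &[f32], vector: &[f32], threshold: f32) -> SparseDelta {
    let mut indices = Vec::new();
    let mut deltas = Vec::new();
    for (i, (a, v)) in anchor.iter().zip(vector).enumerate() {
        let d = v - a;
        if d.abs() > threshold {
            indices.push(i as u16);
            deltas.push(d);
        }
    }
    SparseDelta { indices, deltas }
}

/// Reconstruct the vector (up to sub-threshold error) from its anchor.
fn decode_delta(anchor: &[f32], delta: &SparseDelta) -> Vec<f32> {
    let mut out = anchor.to_vec();
    for (&i, &d) in delta.indices.iter().zip(&delta.deltas) {
        out[i as usize] += d;
    }
    out
}
```

A config option like `max_density` (shown further below) then guards the worst case: if too many components exceed the threshold, the delta is bigger than it is worth and the vector is stored in full.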
Storage Savings (LEANN Benchmarks)
| Dataset | Traditional | With LEANN | Savings |
|---------|-------------|------------|---------|
| 780K email chunks | 78 MB | 8 MB | 91% |
| 60M text chunks | 201 GB | 6 GB | 97% |
| 38K browser entries | 6 MB | 0.4 MB | 95% |
| 400K chat messages | 64 MB | 2 MB | 97% |
Using Compression in KeraDB
```rust
use keradb::{VectorConfig, CompressionConfig, CompressionMode};

// Delta compression - recommended for most use cases
let config = VectorConfig::new(768)
    .with_delta_compression();

// Quantized delta - most aggressive compression
let config = VectorConfig::new(768)
    .with_quantized_compression();

// Custom configuration
let config = VectorConfig::new(768)
    .with_compression(CompressionConfig {
        mode: CompressionMode::Delta,
        sparsity_threshold: 0.001, // Ignore deltas below this magnitude
        max_density: 0.15,         // Fall back to full storage if >15% of components differ
        anchor_frequency: 8,       // Every 8th vector is stored as a full anchor
        quantization_bits: 8,      // Used in quantized mode
    });
```
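A note on what these knobs trade off (general delta-encoding behavior, not measured KeraDB guarantees): raising `sparsity_threshold` discards more near-zero deltas, shrinking storage at a small cost in reconstruction accuracy; lowering `anchor_frequency` stores full anchors more often, which costs space but keeps every compressed vector close to a full-precision anchor; and `max_density` acts as a safety valve, storing a vector uncompressed when its delta would be nearly as large as the vector itself.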
Quick Start
Installation
Linux and macOS:

```sh
curl -sSf https://raw.githubusercontent.com/KeraDB/keradb/main/scripts/install.sh | sh
```

Or build from source:

```sh
git clone https://github.com/KeraDB/keradb.git
cd keradb
cargo build --release
```
Using the CLI
```sh
# Create and open database
./target/release/keradb shell myapp.ndb

# Inside the shell:
keradb> insert users {"name":"Alice","age":30}
keradb> find users
keradb> count users
keradb> exit
```
Using as a Library
```rust
use keradb::Database;
use serde_json::json;

fn main() -> Result<(), Box<dyn std::error::Error>> {
    let db = Database::create("mydata.ndb")?;

    // Insert a document; KeraDB assigns and returns its ID
    let id = db.insert("users", json!({
        "name": "Alice",
        "age": 30
    }))?;

    // Fetch it back by that ID
    let user = db.find_by_id("users", &id)?;
    println!("Found: {:?}", user.data);

    Ok(())
}
```
Vector Search Example
```rust
use keradb::{Database, VectorConfig, Distance};

fn main() -> Result<(), Box<dyn std::error::Error>> {
    let db = Database::create("vectors.ndb")?;

    // Create a vector collection with compression
    let config = VectorConfig::new(384)
        .with_distance(Distance::Cosine)
        .with_delta_compression();
    db.create_vector_collection("embeddings", config)?;

    // Insert vectors
    let embedding = vec![0.1, 0.2, 0.3, /* ... */];
    db.insert_vector("embeddings", embedding, None)?;

    // Search for the 10 nearest neighbors of a query vector
    let query = vec![0.1, 0.2, 0.3, /* ... */];
    let results = db.vector_search("embeddings", &query, 10)?;

    Ok(())
}
```
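The feature list mentions metadata filtering, but its API is not shown in this README, so the sketch below is hypothetical: the JSON metadata argument to `insert_vector` and the `vector_search_with_filter` method are assumed names for illustration, not documented KeraDB calls.

```rust
use keradb::Database;
use serde_json::json;

// HYPOTHETICAL sketch of metadata filtering - the metadata argument and the
// vector_search_with_filter method are assumptions, not the documented API.
fn filtered_search(db: &Database, query: &[f32]) -> Result<(), Box<dyn std::error::Error>> {
    // Attach attributes at insert time (assumes the third parameter of
    // insert_vector accepts optional JSON metadata).
    db.insert_vector("embeddings", vec![0.1, 0.2, 0.3 /* ... */], Some(json!({"lang": "en"})))?;

    // Restrict the nearest-neighbor search to matching documents.
    let results = db.vector_search_with_filter("embeddings", query, 10, &json!({"lang": "en"}))?;
    println!("{} filtered matches", results.len());
    Ok(())
}
```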
Benchmarks
Run on your own system with `cargo bench`.
Document Operations
| Operation | Throughput |
|-----------|------------|
| Insert (single) | ~10,000 ops/sec |
| Find by ID | ~50,000 ops/sec |
| Update | ~8,000 ops/sec |
Vector Insert Performance
| Dimensions | Time per Insert |
|------------|-----------------|
| 32 | 77-81 µs |
| 128 | 94-97 µs |
| 384 | 135-144 µs |
| 768 | 167-185 µs |
Vector Search Performance (k=10)
| Collection Size | Dimensions | Search Time |
|-----------------|------------|-------------|
| 100 vectors | 128 | 34-36 µs |
| 1,000 vectors | 128 | 36-37 µs |
| 10,000 vectors | 128 | 38 µs |
| 1,000 vectors | 384 | 117-121 µs |
HNSW achieves near-constant search time - 10,000 vectors is only ~10% slower than 100 vectors.
Search by K
| k (results) | Time |
|-------------|------|
| 1 | 35-37 µs |
| 10 | 35-37 µs |
| 50 | 39-40 µs |
| 100 | 64-65 µs |
Distance Metric Comparison
| Metric | Time | Notes |
|--------|------|-------|
| Dot Product | 54-57 µs | Fastest - no normalization |
| Euclidean | 59-60 µs | Standard L2 distance |
| Cosine | 115-117 µs | Requires normalization |
Bulk Insert
| Count | Time | Rate |
|-------|------|------|
| 100 vectors | 7.1 ms | ~14k/sec |
| 500 vectors | 51 ms | ~10k/sec |
| 1,000 vectors | 107 ms | ~9.3k/sec |
| 5,000 vectors | 567 ms | ~8.8k/sec |
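The gradual decline in rate is expected HNSW behavior rather than a KeraDB-specific cost: each insert first searches the existing graph for the neighbors it will link to, and that search grows roughly logarithmically with collection size.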
Compression Performance
| Scenario | Uncompressed | Compressed | Savings |
|----------|--------------|------------|---------|
| 768-dim, 5% sparse diff | 3,072 bytes | ~250 bytes | 91.9% |
| 128-dim, 10% sparse diff | 512 bytes | ~94 bytes | 81.6% |
| Mixed workload | 5,120 bytes | 2,218 bytes | 56.7% |
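As a rough sanity check (assuming an index-plus-value layout like the sketch earlier, which is an assumption about the on-disk format): a 768-dim f32 vector occupies 768 × 4 = 3,072 bytes; a 5% diff keeps about 38 components, and at ~6 bytes each (2-byte index + 4-byte float) that is ~230 bytes, landing near the ~250 bytes shown once per-vector overhead is included.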
Performance Summary
Sub-40µs search on 10K vectors (HNSW delivers O(log n) complexity)
~100µs per vector insert for typical 128-dim embeddings
Dot product is ~2x faster than cosine (no normalization)