Documentation Index
Fetch the complete documentation index at: https://docs.blindference.xyz/llms.txt
Use this file to discover all available pages before exploring further.
Model Backends
Blindference Node uses a pluggable backend registry to support multiple inference providers. You can add custom backends without modifying core code.Built-in Backends
| Backend | Description | Availability | Model IDs |
|---|---|---|---|
vllm | Local GPU inference via vLLM | Requires NVIDIA GPU + vllm package | facebook/opt-125m (tier 0) |
groq | Groq cloud API (OpenAI-compatible) | Requires GROQ_API_KEY env var | groq:llama-3.3-70b-versatile |
gemini | Google Gemini REST API | Requires GOOGLE_API_KEY env var | gemini:gemini-2.5-flash |
mock | Deterministic SHA-256 fallback | Always available | * (universal fallback) |
Backend Architecture
All backends implement theModelBackend abstract base class:
Adding Custom Backends
1. Config Dotted Path (Fastest)
Write a Python module anywhere on yourPYTHONPATH:
~/.blindference/config.json:
2. Entry-Point Plugins (For Published Packages)
Declare in yourpyproject.toml:
importlib.metadata.
3. Fork & Extend
Add your backend class underblindference_node/models/ and register it in backend_loader.py::_register_builtin().
CLI Backend Management
Registration Order & Precedence
- Built-in (vLLM, Groq, Gemini, Mock)
- Entry-point plugins (third-party pip packages)
- Config dotted paths (
custom_backendsinconfig.json)
model_id.
Determinism Requirement
All backends must produce byte-identical output for the same(model_id, prompt) pair. Required for quorum consensus.
Tips:
- Set
temperature=0,top_p=1,top_k=-1 - Use fixed
seed=42(Groq supports this) - Document any backend-specific non-determinism