Forge is Voxell's hosted text-embedding API. This MCP server exposes two tools — embed (turn text into vectors) and list_models — so any MCP-compatible agent can generate embeddings for semantic search, RAG, clustering, and dedup.
Three tiers let you dial quality against cost: turbo (1024d, fast and cheap), pro (2560d), and ultra (4096d — Qwen3-Embedding-8B, ~75+ average task score on MTEB and currently #4 on MTEB English; the models above it are research-only, so it's the top usable model). Matryoshka dimensions let you truncate to fewer dims — re-normalized — for cheaper vectors.
Requires a FORGE_API_KEY (free tier at dash.voxell.ai). Embeddings only — no data is stored.
Server Config
{
"mcpServers": {
"forge": {
"command": "npx",
"args": [
"-y",
"@voxell/forge-mcp"
],
"env": {
"FORGE_API_KEY": "your-forge-api-key"
}
}
}
}