Cortex Search
Search the Web, Get Answers
Cortex Search is a search API platform that combines web scraping, hybrid retrieval (BM25 + vector), neural reranking, and RAG into a single API. Built with a Rust core for sub-50ms latency, a Python sidecar for anti-bot bypass, and OpenAI-compatible endpoints.
- Rust core for sub-50ms latency on retrieval
- OpenAI-compatible API endpoints
- CDP network interception bypasses HTML fragility
- ONNX neural reranker runs locally on CPU
Key Features
Everything you need to transform your search operations
Hybrid Retrieval
BM25 (Tantivy) + vector search (Qdrant) fused with Reciprocal Rank Fusion, then reranked by ONNX DeBERTa-v3 cross-encoder.
Network Interception
CDP-based XHR/GraphQL interception bypasses DOM parsing entirely — 10x faster for JavaScript-heavy sites.
OpenAI Compatible
Drop-in /v1/chat/completions endpoint. Works with any OpenAI SDK — just change the base URL.
Deep Research
Multi-step sub-question generation and synthesis for complex queries that need multiple search rounds.
Rust Performance
All retrieval, fusion, chunking, and API routing in Rust. Python only where unavoidable (browser automation).
MCP Server
Built-in Model Context Protocol server for integrating search capabilities directly into AI agents and LLM workflows.
SDKs & Dashboard
Python and TypeScript SDKs, plus a Next.js dashboard with API playground, usage tracking, and interactive docs.
Powerful Components
Specialized modules designed for specific business needs
Fetch Router
Intelligent URL classification with auto-escalation from HTTP to stealth browsing
Retrieval Engine
Hybrid BM25 + vector retrieval with RRF fusion and neural reranking
LLM Orchestrator
Multi-model routing with citation injection and SSE streaming
Web Crawler
BFS crawler with freshness scheduling and proxy rotation
Python Sidecar
gRPC sidecar for anti-bot bypass and smart content extraction
Impact by Numbers
Measurable results that matter
Retrieval Latency
Rust Crates
Reranker Size
gRPC Services
How It Works
Enterprise-grade architecture designed for scale and reliability
API Layer
Fetch Layer
Retrieval
RAG Pipeline
Infrastructure
Built With the Best
Modern, battle-tested technologies powering Cortex Search
Ready to Transform with Cortex Search?
Schedule a personalized demo and discover how we can help your organization.