Relace: Relace Search
The relace-search model uses 4-12 `view_file` and `grep` tools in parallel to explore a codebase and return relevant files to the user request. In contrast to RAG, relace-search performs agentic multi-step reasoning to produce highly precise results 4x faster than any frontier model. It's designed to serve as a subagent that passes its findings to an "oracle" coding agent, who orchestrates/performs the rest of the coding task. To use relace-search you need to build an appropriate agent harness, and parse the response for relevant information to hand off to the oracle. Read more about it in the [Relace documentation](https://docs.relace.ai/docs/fast-agentic-search/agent).
Pricing per 1M Tokens
| Input (Prompt) | $1.00 |
| Output (Completion) | $3.00 |
| Cache Read | Free |
| Cache Write | Free |
| Image | N/A |
Specifications
| Context Length | 256K |
| Max Output Tokens | 128K |
| Input Modalities | Text |
| Output Modalities | Text |
| Tokenizer | Other |
| Instruct Type | N/A |
| Top Provider Context | 256K |
| Top Provider Max Output | 128K |
| Moderated | No |
Compare this model
See how Relace: Relace Search stacks up against other models.
More from Relace
Last updated: March 23, 2026
First tracked: March 23, 2026