RRM-1 deployment guide
RRM-1’s OpenAI-compatible API means it drops into any tool that speaks that protocol. which is virtually every modern development environment. Point Claude Code, Cursor at https://<RRM-url>/v1 as a custom provider, set the model to rrm-1, and you’ve upgraded your coding assistant with reasoning model. No plugin required, no proprietary SDK.
The model excels at multi-step technical analysis: debugging complex stack traces, reasoning through multi documents, and validating business logic constraints
Deploying the API
Simple API Call
Call our reasoning engine from any application. OpenAI compatible.
Quick Start
Base URL: `https://<RRM-url>/v1`
Model Name: `RRM-1`
No API key required** (for now).
Python
Simple Request
“`python
import requests
response = requests.post(
“https://<RRM-url>/v1/chat/completions”,
json={
“model”: “rrm-1”,
“messages”: [{“role”: “user”, “content”: “What is AGI?”}]
}
)
answer = response.json()[‘choices’][0][‘message’][‘content’]
print(answer)
“`
With OpenAI SDK (Recommended)
“`python
from openai import OpenAI
client = OpenAI(
base_url=”https://<RRM-url>/v1″,
api_key=”dummy” # Not used, but required by SDK
)
response = client.chat.completions.create(
model=”rrm-1″,
messages=[{“role”: “user”, “content”: “Explain quantum computing”}],
max_tokens=512,
temperature=0.7
)
print(response.choices[0].message.content)
“`
Async Example
“`python
import asyncio
import aiohttp
async def ask_foursight(question):
async with aiohttp.ClientSession() as session:
async with session.post(
“https://<RRM-url>/v1/chat/completions”,
json={
“model”: “rrm-1”,
“messages”: [{“role”: “user”, “content”: question}]
}
) as resp:
data = await resp.json()
return data[‘choices’][0][‘message’][‘content’]
Usage
answer = asyncio.run(ask_foursight(“What is the capital of France?”))
print(answer) # Paris
“`
JavaScript / TypeScript
Using Fetch API
“`javascript
const response = await fetch(“https://<RRM-url>/v1/chat/completions”, {
method: “POST”,
headers: {
“Content-Type”: “application/json”
},
body: JSON.stringify({
model: “rrm-1”,
messages: [{ role: “user”, content: “What is AGI?” }],
max_tokens: 256
})
});
const data = await response.json();
const answer = data.choices[0].message.content;
console.log(answer);
“`
Using OpenAI SDK (Node.js)
“`javascript
import OpenAI from “openai”;
const client = new OpenAI({
baseURL: “https://<RRM-url>/v1″,
apiKey: “dummy”
});
const response = await client.chat.completions.create({
model: “rrm-1”,
messages: [{ role: “user”, content: “Explain recursion” }],
max_tokens: 512,
temperature: 0.7
});
console.log(response.choices[0].message.content);
“`
Streaming Response
“`javascript
const response = await fetch(“https://<RRM-url>/v1/chat/completions”, {
method: “POST”,
headers: { “Content-Type”: “application/json” },
body: JSON.stringify({
model: “rrm-1”,
messages: [{ role: “user”, content: “Tell me a story” }],
stream: true
})
});
const reader = response.body.getReader();
const decoder = new TextDecoder();
while (true) {
const { done, value } = await reader.read();
if (done) break;
const chunk = decoder.decode(value);
const lines = chunk.split(“\n”);
for (const line of lines) {
if (line.startsWith(“data: “)) {
const data = line.slice(6);
if (data === “[DONE]”) break;
const parsed = JSON.parse(data);
const content = parsed.choices[0]?.delta?.content || “”;
process.stdout.write(content);
}
}
}
“`
cURL
“`bash
curl -X POST “https://<RRM-url>/v1/chat/completions” \
-H “Content-Type: application/json” \
-d ‘{
“model”: “rrm-1”,
“messages”: [{“role”: “user”, “content”: “What is AGI?”}]
}’
“`
Response Format
“`json
{
“id”: “chatcmpl-xxx”,
“object”: “chat.completion”,
“created”: 1234567890,
“model”: “rrm-1”,
“choices”: [{
“index”: 0,
“message”: {
“role”: “assistant”,
“content”: “The answer is 4”
},
“finish_reason”: “stop”
}],
“usage”: {
“prompt_tokens”: 10,
“completion_tokens”: 5,
“total_tokens”: 15
}
}
Health Check
“`bash
curl https://<RRM-url>/health
“`
Response:
“`json
{
“status”: “ok”,
“engine_ready”: true,
“in_place_ttt”: {“enabled”: true, “hooks”: 6}
}
`
Error Handling
“`python
import requests
try:
response = requests.post(
“https://<RRM-url>/v1/chat/completions”,
json={“model”: “rrm-1”, “messages”: [{“role”: “user”, “content”: “Hi”}]},
timeout=30
)
response.raise_for_status()
answer = response.json()[‘choices’][0][‘message’][‘content’]
except requests.exceptions.Timeout:
print(“Request timed out”)
except requests.exceptions.ConnectionError:
print(“Cannot connect to server”)
except KeyError:
print(“Unexpected response format”)
“`
Try It Live
“`python
Interactive test
import requests
ip = input(“Enter API IP: “)
question = input(“Ask a question: “)
response = requests.post(
f”http://{ip}/v1/chat/completions”,
json={
“model”: “rrm-1”,
“messages”: [{“role”: “user”, “content”: question}]
}
)
print(response.json()[‘choices’][0][‘message’][‘content’])
“`
**Simply change the IP address to your deployed endpoint and start using.**