> ## Documentation Index > Fetch the complete documentation index at: https://docs.matterai.so/llms.txt > Use this file to discover all available pages before exploring further. # Responses > Create a model response using the MatterAI API (OpenAI-compatible) ## Authentication All API requests require authentication using a Bearer token. You can obtain your API key from the [MatterAI Console](https://app.matterai.so). ```bash theme={null} Authorization: Bearer MATTERAI_API_KEY ``` Keep your API key secure and never expose it in client-side code. Get your API key from the MatterAI console. ## Request The model used for the response. Available models: `"axon-2-5-pro"`, `"axon-2-5-mini"`. Text or array of input items to the model, used to generate a response. Accepts a plain string (equivalent to a `"user"` message) or an array of input items. The role of the message. One of `"user"`, `"assistant"`, `"system"`, or `"developer"`. Text content or an array of content blocks. The type of content. Use `"input_text"` for text content, `"input_image"` for image URLs. The text content (for `input_text` blocks). The URL of the image (for `input_image` blocks). A system (or developer) message inserted into the model's context. When used with `previous_response_id`, instructions from a previous response are not carried over to the next response. Equivalent to the `"system"` role in chat completions. An upper bound for the number of tokens that can be generated for a response, including visible output tokens and reasoning tokens. Whether to stream the response as it's generated using server-sent events. Configuration for reasoning capabilities. The level of reasoning effort. Options: `"none"`, `"low"`, `"medium"`, `"high"`. The level of reasoning summary. Options: `"auto"`, `"concise"`, `"detailed"`. Controls randomness in the output. Higher values make output more random, lower values make it more focused and deterministic. Range: 0.0 to 2.0. Controls diversity via nucleus sampling. Range: 0.0 to 1.0. Configuration options for a text response from the model. An object specifying the format that the model must output. The type of response format. Options: `"text"`, `"json_schema"`, `"json_object"`. The name of the response format (for `json_schema`). The JSON Schema for the response format (for `json_schema`). Whether to enable strict schema adherence (for `json_schema`). Constrains the verbosity of the model's response. Options: `"low"`, `"medium"`, `"high"`. Whether to store the generated model response for later retrieval via API. Set of up to 16 key-value pairs that can be attached to the response. Keys are strings with a maximum length of 64 characters. Values are strings with a maximum length of 512 characters. ## Response Unique identifier for this response. The object type, which is always `"response"`. The status of the response generation. One of `"completed"`, `"failed"`, `"in_progress"`, `"cancelled"`, or `"incomplete"`. Unix timestamp (in seconds) of when this response was created. The model used to generate the response. Available models: `"axon-2-5-pro"`, `"axon-2-5-mini"`. An array of content items generated by the model. The unique ID of the output message. The type of the output item. Always `"message"`. The role of the output message. Always `"assistant"`. The status of the message. One of `"in_progress"`, `"completed"`, `"incomplete"`. The content of the output message. The type of content. Values: `"output_text"`, `"refusal"`. The text output from the model (for `output_text` blocks). Annotations of the text output, such as citations. SDK-only convenience property containing the aggregated text output from all `output_text` items in the `output` array. Usage statistics for the response request. Number of tokens in the input. Number of tokens in the generated output. Total number of tokens used (input + output). A detailed breakdown of the output tokens. Number of tokens used for reasoning. A detailed breakdown of the input tokens. Number of tokens retrieved from cache. An error object returned when the model fails to generate a response. The error code. Possible values: `"server_error"`, `"rate_limit_exceeded"`, `"invalid_prompt"`, etc. A human-readable description of the error. ## Example Request ```bash cURL theme={null} curl --location 'https://api2.matterai.so/v1/responses' \ --header 'content-type: application/json' \ --header 'Authorization: Bearer MATTERAI_API_KEY' \ --data '{ "model": "axon-2-5-pro", "input": "Tell me a short story about a curious robot." }' ``` ```javascript JavaScript theme={null} const response = await fetch("https://api2.matterai.so/v1/responses", { method: "POST", headers: { "Content-Type": "application/json", Authorization: "Bearer MATTERAI_API_KEY", }, body: JSON.stringify({ model: "axon-2-5-pro", input: "Tell me a short story about a curious robot.", }), }); const data = await response.json(); console.log(data); ``` ```python Python theme={null} import requests url = "https://api2.matterai.so/v1/responses" headers = { "Content-Type": "application/json", "Authorization": "Bearer MATTERAI_API_KEY" } payload = { "model": "axon-2-5-pro", "input": "Tell me a short story about a curious robot." } response = requests.post(url, json=payload, headers=headers) print(response.json()) ``` ## Example Response ```json theme={null} { "id": "resp_abc123def456", "object": "response", "created_at": 1741476542, "status": "completed", "model": "axon-2-5-pro", "output": [ { "id": "msg_abc123def456", "type": "message", "status": "completed", "role": "assistant", "content": [ { "type": "output_text", "text": "In a gleaming city of tomorrow, a small robot named Bolt was built to sort packages.", "annotations": [] } ] } ], "usage": { "input_tokens": 27, "output_tokens": 94, "total_tokens": 121, "output_tokens_details": { "reasoning_tokens": 0 }, "input_tokens_details": { "cached_tokens": 0 } } } ``` ## Example: Multi-turn Conversation To continue a conversation, pass the `previous_response_id` from the previous response: ```bash cURL theme={null} curl --location 'https://api2.matterai.so/v1/responses' \ --header 'content-type: application/json' \ --header 'Authorization: Bearer MATTERAI_API_KEY' \ --data '{ "model": "axon-2-5-pro", "input": "What happened next?", "previous_response_id": "resp_abc123def456" }' ``` ```javascript JavaScript theme={null} const response = await fetch("https://api2.matterai.so/v1/responses", { method: "POST", headers: { "Content-Type": "application/json", Authorization: "Bearer MATTERAI_API_KEY", }, body: JSON.stringify({ model: "axon-2-5-pro", input: "What happened next?", previous_response_id: "resp_abc123def456", }), }); const data = await response.json(); console.log(data); ``` ```python Python theme={null} import requests url = "https://api2.matterai.so/v1/responses" headers = { "Content-Type": "application/json", "Authorization": "Bearer MATTERAI_API_KEY" } payload = { "model": "axon-2-5-pro", "input": "What happened next?", "previous_response_id": "resp_abc123def456" } response = requests.post(url, json=payload, headers=headers) print(response.json()) ``` ## Example: With Reasoning ```bash cURL theme={null} curl --location 'https://api2.matterai.so/v1/responses' \ --header 'content-type: application/json' \ --header 'Authorization: Bearer MATTERAI_API_KEY' \ --data '{ "model": "axon-2-5-pro", "instructions": "You are a helpful assistant that explains complex topics simply.", "input": "Explain quantum entanglement in one paragraph.", "reasoning": { "effort": "medium" } }' ``` ```javascript JavaScript theme={null} const response = await fetch("https://api2.matterai.so/v1/responses", { method: "POST", headers: { "Content-Type": "application/json", Authorization: "Bearer MATTERAI_API_KEY", }, body: JSON.stringify({ model: "axon-2-5-pro", instructions: "You are a helpful assistant that explains complex topics simply.", input: "Explain quantum entanglement in one paragraph.", reasoning: { effort: "medium", }, }), }); const data = await response.json(); console.log(data); ``` ```python Python theme={null} import requests url = "https://api2.matterai.so/v1/responses" headers = { "Content-Type": "application/json", "Authorization": "Bearer MATTERAI_API_KEY" } payload = { "model": "axon-2-5-pro", "instructions": "You are a helpful assistant that explains complex topics simply.", "input": "Explain quantum entanglement in one paragraph.", "reasoning": { "effort": "medium" } } response = requests.post(url, json=payload, headers=headers) print(response.json()) ``` ## Streaming When `stream` is set to `true`, the API returns a stream of Server-Sent Events (SSE). The streaming events use the OpenAI Responses API format: ```json theme={null} data: {"type":"response.created","response":{"id":"resp_abc123","object":"response","created_at":1741476542,"status":"in_progress","model":"axon-2-5-pro","output":[],"usage":null}} data: {"type":"response.in_progress","response":{"id":"resp_abc123","object":"response","created_at":1741476542,"status":"in_progress","model":"axon-2-5-pro","output":[],"usage":null}} data: {"type":"response.output_item.added","output_index":0,"item":{"id":"msg_abc123","type":"message","status":"in_progress","role":"assistant","content":[]}} data: {"type":"response.content_part.added","item_id":"msg_abc123","output_index":0,"content_index":0,"part":{"type":"output_text","text":"","annotations":[]}} data: {"type":"response.output_text.delta","item_id":"msg_abc123","output_index":0,"content_index":0,"delta":"Hello"} data: {"type":"response.output_text.done","item_id":"msg_abc123","output_index":0,"content_index":0,"text":"Hello world!"} data: {"type":"response.output_item.done","output_index":0,"item":{"id":"msg_abc123","type":"message","status":"completed","role":"assistant","content":[{"type":"output_text","text":"Hello world!","annotations":[]}]}} data: {"type":"response.completed","response":{"id":"resp_abc123","object":"response","created_at":1741476542,"status":"completed","model":"axon-2-5-pro","output":[...],"usage":{"input_tokens":10,"output_tokens":12,"total_tokens":22}}} ``` ## Migrating from Chat Completions The Responses API provides a cleaner interface for text generation. Key differences: | Chat Completions | Responses | | ---------------------------- | ------------------------------------------ | | `POST /v1/chat/completions` | `POST /v1/responses` | | `messages` array | `input` (string or array) | | `system` message role | `instructions` string parameter | | `choices[0].message.content` | `output[].content[].text` or `output_text` | | `max_tokens` | `max_output_tokens` | | `finish_reason` | `status` field on response | ## Error Responses The API returns standard HTTP status codes to indicate success or failure: Invalid request parameters or malformed JSON. Invalid or missing API key. Too many requests. Please slow down. Server error. Please try again later. Example error response: ```json theme={null} { "error": { "message": "Invalid API key provided", "type": "invalid_request_error", "code": "invalid_api_key" } } ```