Text generation
Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
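As a minimal sketch of the header format described above, the helper below builds the Authorization header from a token. The function name build_auth_headers is hypothetical, and the Content-Type value assumes the endpoint accepts a JSON body, which this page does not state.

```python
def build_auth_headers(token: str) -> dict[str, str]:
    """Return HTTP headers that carry the token as a Bearer credential."""
    token = token.strip()
    if not token:
        raise ValueError("auth token must not be empty")
    return {
        # Form documented above: "Bearer <token>"
        "Authorization": f"Bearer {token}",
        # Assumption: the request body is sent as JSON.
        "Content-Type": "application/json",
    }


if __name__ == "__main__":
    print(build_auth_headers("my-secret-token"))
```

Keeping the token out of source code (for example, reading it from an environment variable) is usually the safer choice.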
Body
The identifier of the model to be used for processing the request.
The input text that the model will process to generate a response.
The maximum number of tokens that the model can generate in the response.
Controls the randomness of the model's outputs. Higher values lead to more random outputs, while lower values make the model more deterministic.
Controls nucleus sampling: only the most probable tokens whose cumulative probability reaches top_p are considered for sampling. This provides a way to fine-tune the randomness of predictions.
Specifies stopping conditions for the model's output generation. This can be an array of strings or a single string that the model will look for as a signal to stop generating further tokens.
The number of candidate completions to generate; only the best one is returned. Useful when you want the model to produce several outputs and keep the best according to some criterion.
A seed used to initialize the random number generator for the model, ensuring reproducibility of the output when the same inputs are provided.
Determines whether the model's output should be streamed. If true, the output is generated and sent incrementally, which can be useful for real-time applications.
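The body fields described above could be assembled roughly as follows. This is a sketch under stated assumptions: apart from top_p, the field names used here (model, prompt, max_tokens, temperature, stop, best_of, seed, stream) do not appear on this page and are placeholders, and the range checks are generic sanity checks rather than documented limits.

```python
from typing import Optional, Union


def build_request_body(
    model: str,
    prompt: str,
    max_tokens: Optional[int] = None,
    temperature: Optional[float] = None,
    top_p: Optional[float] = None,
    stop: Optional[Union[str, list[str]]] = None,
    best_of: Optional[int] = None,
    seed: Optional[int] = None,
    stream: bool = False,
) -> dict:
    """Build a request body dict; all key names except top_p are assumed."""
    # Generic sanity checks, not limits documented by the API.
    if max_tokens is not None and max_tokens < 1:
        raise ValueError("max_tokens must be at least 1")
    if temperature is not None and temperature < 0:
        raise ValueError("temperature must be non-negative")
    if top_p is not None and not 0.0 < top_p <= 1.0:
        raise ValueError("top_p is a cumulative probability in (0, 1]")

    body = {"model": model, "prompt": prompt, "stream": stream}
    optional = {
        "max_tokens": max_tokens,
        "temperature": temperature,
        "top_p": top_p,
        "stop": stop,  # a single string or a list of strings
        "best_of": best_of,
        "seed": seed,
    }
    # Send only the optional fields that the caller actually set.
    body.update({k: v for k, v in optional.items() if v is not None})
    return body


if __name__ == "__main__":
    print(build_request_body("example-model", "Hello", top_p=0.9, stop=["\n"]))
```

Omitting unset fields lets the server apply its own defaults instead of receiving explicit nulls.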
Response
A list of choices generated by the model, each containing the text of the completion and associated metadata such as log probabilities.
The identifier of the model that was used to generate the responses in the 'choices' array.
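A response with the two fields described above might be read like this. The choices key is named on this page; the model, text, and logprobs keys, and the sample payload itself, are illustrative assumptions rather than the documented schema.

```python
import json


def extract_completions(raw: str) -> tuple[str, list[str]]:
    """Return the model identifier and the text of each choice."""
    payload = json.loads(raw)
    # "model" and "text" are assumed key names; missing keys fall back to "".
    model = payload.get("model", "")
    texts = [choice.get("text", "") for choice in payload.get("choices", [])]
    return model, texts


# Hypothetical payload, made up for illustration only.
sample = json.dumps(
    {
        "model": "example-model",
        "choices": [{"text": "Hello there", "logprobs": None}],
    }
)

if __name__ == "__main__":
    print(extract_completions(sample))
```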