Optional base: Base endpoint URL.
Optional frequency: Number between -2.0 and 2.0. Positive values penalize tokens that have already been sampled, in proportion to their frequency in the preceding text. This penalty reduces the model's tendency to reproduce identical lines verbatim.
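As a rough illustration of a frequency penalty, the sketch below subtracts penalty × count from each token's logit before sampling. This is a common formulation for this kind of parameter; the source does not say how the Friendli engine computes it. The helper name `applyFrequencyPenalty` and the token-ID-indexed logits array are assumptions made for this example.

```typescript
// Hypothetical helper for illustration; not part of the Friendli API.
// Lowers each token's logit in proportion to how many times that token
// has already appeared in the generated text.
function applyFrequencyPenalty(
  logits: number[],
  generatedTokenIds: number[],
  penalty: number,
): number[] {
  if (penalty < -2.0 || penalty > 2.0) {
    throw new RangeError("penalty must be between -2.0 and 2.0");
  }
  // Count occurrences of each token ID in the text generated so far.
  const counts = new Map<number, number>();
  for (const id of generatedTokenIds) {
    counts.set(id, (counts.get(id) ?? 0) + 1);
  }
  // The array index is treated as the token ID (an assumption of this sketch).
  return logits.map((logit, id) => logit - penalty * (counts.get(id) ?? 0));
}

// Token 2 appeared twice, token 0 once, token 1 never.
const adjusted = applyFrequencyPenalty([1.0, 2.0, 3.0], [2, 2, 0], 0.5);
console.log(adjusted);
```

A negative penalty has the opposite effect under this formulation: it raises the logits of tokens that have already appeared.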
Optional friendli: Friendli team ID to run as.
Optional friendli: Friendli personal access token to run as.
Optional max: The maximum number of tokens to generate. The number of input tokens plus max_tokens should not exceed the model's maximum length (e.g., 2048 for OpenAI GPT-3).
presence_penalty (Optional[float] = None): Number between -2.0 and 2.0. Positive values penalize tokens that have been sampled at least once in the existing text.
Optional model: Model name to use.
Optional model: Additional kwargs to pass to the model.
Optional stop: When one of the stop phrases appears in the generation result, the API stops generating. The stop phrase is included in the generated result. With beam search, every active beam must contain the stop phrase before generation terminates. Each stop phrase is converted into tokens before the result is checked for it.
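The stop rule described above can be sketched at the token level as follows. This is a minimal illustration under stated assumptions, not the service's implementation: tokens are represented as plain numeric IDs, and the helper names `containsStop` and `shouldStop` are hypothetical.

```typescript
// Hypothetical helpers for illustration; not part of the Friendli API.

// Returns true if `tokens` contains `stopTokens` as a contiguous run.
function containsStop(tokens: number[], stopTokens: number[]): boolean {
  if (stopTokens.length === 0) return false;
  for (let start = 0; start + stopTokens.length <= tokens.length; start++) {
    if (stopTokens.every((t, i) => tokens[start + i] === t)) return true;
  }
  return false;
}

// Generation ends only when every active beam contains at least one
// stop phrase. Without beam search there is a single beam.
function shouldStop(beams: number[][], stopPhrases: number[][]): boolean {
  return (
    beams.length > 0 &&
    beams.every((beam) => stopPhrases.some((stop) => containsStop(beam, stop)))
  );
}

const stopPhrases = [[1, 2]]; // one stop phrase, already tokenized
const allBeamsHit = shouldStop([[5, 1, 2], [7, 1, 2, 9]], stopPhrases);
const oneBeamMissing = shouldStop([[5, 1, 2], [7, 8]], stopPhrases);
console.log(allBeamsHit, oneBeamMissing);
```

Matching on token IDs rather than on decoded text mirrors the statement that the phrase is converted into tokens before the check.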
Optional temperature: Sampling temperature. A smaller temperature makes the generation result closer to greedy (argmax) sampling, i.e., top_k = 1. If it is None, 1.0 is used.
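To show why a smaller temperature approaches greedy sampling, the sketch below divides the logits by the temperature before applying a softmax. This is the standard definition of temperature scaling, not code from the Friendli service; the function name `softmaxWithTemperature` is hypothetical, and a missing value falls back to 1.0 as described above.

```typescript
// Hypothetical helper for illustration; not part of the Friendli API.
// Converts logits to probabilities after scaling them by 1 / temperature.
function softmaxWithTemperature(
  logits: number[],
  temperature?: number | null,
): number[] {
  const t = temperature ?? 1.0; // missing temperature means 1.0
  if (!(t > 0)) {
    throw new RangeError("temperature must be positive in this sketch");
  }
  const scaled = logits.map((l) => l / t);
  // Subtract the maximum before exponentiating for numerical stability.
  const max = Math.max(...scaled);
  const exps = scaled.map((s) => Math.exp(s - max));
  const sum = exps.reduce((a, b) => a + b, 0);
  return exps.map((e) => e / sum);
}

const logits = [1.0, 2.0, 3.0];
const neutral = softmaxWithTemperature(logits); // temperature defaults to 1.0
const sharp = softmaxWithTemperature(logits, 0.1); // mass concentrates on the argmax
console.log(neutral, sharp);
```

As the temperature shrinks, almost all probability mass lands on the highest-logit token, which is what picking the argmax does directly.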
Optional topP: Tokens comprising the top top_p probability mass are kept for sampling. Values between 0.0 (exclusive) and 1.0 (inclusive) are allowed. If it is None, 1.0 is used by default.
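A minimal sketch of top-p (nucleus) filtering follows: tokens are taken in order of decreasing probability until their cumulative mass reaches top_p, and the kept probabilities are renormalized. This is the usual textbook procedure and is assumed here rather than taken from the Friendli service; the name `topPFilter` is hypothetical.

```typescript
// Hypothetical helper for illustration; not part of the Friendli API.
// Keeps the smallest set of most-probable tokens whose cumulative
// probability reaches topP, zeroes the rest, and renormalizes.
function topPFilter(probs: number[], topP?: number | null): number[] {
  const p = topP ?? 1.0; // missing top_p means 1.0
  if (!(p > 0.0 && p <= 1.0)) {
    throw new RangeError("topP must be in the range (0.0, 1.0]");
  }
  // Pair each probability with its token ID, then sort high to low.
  const order = probs
    .map((prob, id) => ({ prob, id }))
    .sort((a, b) => b.prob - a.prob);

  const kept = new Array<number>(probs.length).fill(0);
  let cumulative = 0;
  for (const { prob, id } of order) {
    kept[id] = prob;
    cumulative += prob;
    if (cumulative >= p) break; // enough probability mass collected
  }
  const total = kept.reduce((a, b) => a + b, 0);
  return kept.map((k) => k / total);
}

// With topP = 0.7, the two most likely tokens are kept and the third is dropped.
const filtered = topPFilter([0.5, 0.3, 0.2], 0.7);
console.log(filtered);
```

With top_p left at 1.0, every token stays eligible, so the filter leaves the distribution effectively unchanged.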
The FriendliParams interface defines the input parameters for the Friendli class.
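The documented ranges and defaults can be collected into a small pre-flight check, sketched below. The `SamplingParamsSketch` type, its field names, and `checkParams` are assumptions made for this example; they are not taken from the FriendliParams definition, and the Friendli class may validate its inputs differently or not at all.

```typescript
// Hypothetical shape for illustration only; the field names are assumptions
// and do not come from the FriendliParams definition.
interface SamplingParamsSketch {
  frequencyPenalty?: number;
  maxTokens?: number;
  temperature?: number | null;
  topP?: number | null;
}

interface CheckedParamsSketch {
  temperature: number;
  topP: number;
  errors: string[];
}

// Applies the documented defaults and reports violations of the documented ranges.
function checkParams(
  params: SamplingParamsSketch,
  promptTokens: number,
  modelMaxLength: number,
): CheckedParamsSketch {
  const errors: string[] = [];
  const temperature = params.temperature ?? 1.0; // missing value means 1.0
  const topP = params.topP ?? 1.0; // missing value means 1.0

  const fp = params.frequencyPenalty;
  if (fp !== undefined && (fp < -2.0 || fp > 2.0)) {
    errors.push("frequency penalty must be between -2.0 and 2.0");
  }
  if (!(topP > 0.0 && topP <= 1.0)) {
    errors.push("top_p must be in the range (0.0, 1.0]");
  }
  if (
    params.maxTokens !== undefined &&
    promptTokens + params.maxTokens > modelMaxLength
  ) {
    errors.push("input tokens plus max_tokens exceed the model's maximum length");
  }
  return { temperature, topP, errors };
}

const ok = checkParams({}, 10, 2048);
const bad = checkParams({ frequencyPenalty: 3, topP: 0, maxTokens: 2048 }, 10, 2048);
console.log(ok, bad.errors);
```

Collecting the errors in a list, rather than throwing on the first one, lets a caller report every problem in a single pass.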