This is a simplified guide to an AI model called Phi-3-Mini-128k-Instruct maintained by Microsoft. If you like these kinds of analysis, you should join AImodels.fyi or follow us on Twitter.
Model overview
The phi-3-mini-128k-instruct is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets. It is part of the Phi-3 family, which also includes the Phi-3-mini-4k-instruct and Phi-3-mini-128k-instruct models. The Phi-3 models are designed to be efficient and effective, with a focus on reasoning capabilities like code, math, and logic.
Model inputs and outputs
The phi-3-mini-128k-instruct model takes text as input and generates text in response. It is best suited for prompts using a chat format, where the user provides a prompt and the model generates a relevant response.
Inputs
- Prompt: The text prompt to send to the model.
- Max Length: The maximum number of tokens to generate.
- Temperature: Adjusts the randomness of the outputs, with higher values being more random.
- Top K: Samples from the top K most likely tokens when decoding text.
- Top P: Samples from the top P percentage of most likely tokens when decoding text.
- Repetition Penalty: Penalty for repeated words in the generated text.
- System Prompt: The system prompt provided to the model.
- Seed: The seed for the random number generator.
Outputs
- Generated Text: The text generated by the model in response to the input prompt.
Capabilities
The phi-3-mini-128k-instruct model h...
Click here to read the full guide to Phi-3-Mini-128k-Instruct
Top comments (0)