A beginner's guide to the Phi-3-Mini-128k-Instruct model by Microsoft on Replicate

#coding #ai #machinelearning #programming

This is a simplified guide to an AI model called Phi-3-Mini-128k-Instruct maintained by Microsoft. If you like these kinds of analysis, you should join AImodels.fyi or follow us on Twitter.

Model overview

The phi-3-mini-128k-instruct is a 3.8 billion-parameter, lightweight, state-of-the-art open model trained using the Phi-3 datasets. It is part of the Phi-3 family, which also includes the Phi-3-mini-4k-instruct and Phi-3-mini-128k-instruct models. The Phi-3 models are designed to be efficient and effective, with a focus on reasoning capabilities like code, math, and logic.

Model inputs and outputs

The phi-3-mini-128k-instruct model takes text as input and generates text in response. It is best suited for prompts using a chat format, where the user provides a prompt and the model generates a relevant response.

Inputs

Prompt: The text prompt to send to the model.
Max Length: The maximum number of tokens to generate.
Temperature: Adjusts the randomness of the outputs, with higher values being more random.
Top K: Samples from the top K most likely tokens when decoding text.
Top P: Samples from the top P percentage of most likely tokens when decoding text.
Repetition Penalty: Penalty for repeated words in the generated text.
System Prompt: The system prompt provided to the model.
Seed: The seed for the random number generator.