Chat with the fine-tuned Phi-2 model using QLoRA. This version runs on CPU for better compatibility.
This interface allows you to interact with a fine-tuned Phi-2 model. Note that responses may be slower due to CPU-only inference.
Maximum length of generated response
Higher values make output more random
Nucleus sampling parameter
Top-k sampling parameter