An ONNX-optimized version of Phi-3-Medium-4K-Instruct, quantized for efficient, high-performance CPU inference, supporting structured reasoning, code generation, and long-context processing.
Phi-3-Medium-4K-Instruct ONNX-CPU is a high-performance AI model from Microsoft, optimized for efficient execution on CPUs using ONNX Runtime. This version is quantized for int4 precision, making it ideal for low-latency, memory-efficient AI applications that require structured reasoning and long-context comprehension.
MIT
Microsoft
Text Generation
N.A.
Open
Sector Agnostic
12/03/25 06:35:38
0
MIT
© 2026 - Copyright AIKosh. All rights reserved.