A browser-optimized version of Phi-3-Mini-4K-Instruct, designed for fast, efficient inference in web environments using ONNX Runtime Web and WebGPU acceleration.
Phi-3-Mini-4K-Instruct ONNX-Web is a high-performance, web-optimized AI model from Microsoft, designed for efficient execution within web browsers using ONNX Runtime Web. It allows developers to run AI models entirely in-browser, leveraging WebGPU acceleration for low-latency text generation and instruction-following tasks. This model is an optimized version of Phi-3-Mini-4K-Instruct, supporting 4K token context length, and incorporating fine-tuned reasoning, instruction adherence, and lightweight computation.
MIT
Microsoft
Text Generation
N.A.
Open
Sector Agnostic
12/03/25 06:35:35
0
MIT
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.