An ONNX-optimized version of Phi-3.5 Vision-Instruct, quantized to int4 precision for fast and efficient inference on CPUs and GPUs, supporting vision and text-based AI applications.
Phi-3.5-Vision-Instruct ONNX is a high-performance multimodal AI model from Microsoft, optimized for efficient inference with ONNX Runtime. This model is quantized to int4 precision, enabling low-latency, high-speed processing on CPU and GPU platforms, making it ideal for real-time vision and text-based AI applications.
MIT
Microsoft
Multimodal Language Model
N.A.
Open
Sector Agnostic
12/03/25 06:35:33
0
MIT
© 2026 - Copyright AIKosh. All rights reserved. This portal is developed by National e-Governance Division for AIKosh mission.