MLX is a NumPy-like array framework designed for efficient and flexible machine learning on Apple silicon.
Arrays in MLX live in shared memory. Operations on MLX arrays can be performed on any of the supported device types without performing data copies.
uv tool install mlx
uv tool install mlx-lm
mlx_lm.generate --model mlx-community/Qwen3-4B-Instruct-2507-4bit --prompt "hello"
mlx-community/Qwen3-Coder-32B-A3B-4bit
Opencode
mlx_lm.server