Usage
Description
Downloads a quantized model file or safetensors collection from the backend. If the file is already downloaded, the command exits immediately unless--force is specified. A progress bar displays during the download.
Download types
Quantized models
A quantized model is a compressed GGUF file optimized for local inference. To download a quantized model, provide the model name and quantization type asModelName/QuantizationType.
Safetensors collections
Safetensors collection downloads require the
You cannot launch a server or run inference on safetensors files directly from
ai-catalyst backend.You cannot launch a server or run inference on safetensors files directly from
anaconda ai. Use a third-party framework to load and run the model after downloading.transformers. You can download a safetensors collection using the --safetensors flag.
A collection typically includes: