Batch processing

Text generation models

| Model | License | URI | Context (tokens) |
| --- | --- | --- | --- |
| Qwen 2.5 7B Instruct | Apache 2.0 license | `gpt://<folder_ID>/qwen2.5-7b-instruct` | 32,768 |
| Qwen 2.5 72B Instruct | Qwen license | `gpt://<folder_ID>/qwen2.5-72b-instruct` | 16,384 |
| QwQ 32B Instruct | Apache 2.0 license | `gpt://<folder_ID>/qwq-32b` | 32,768 |
| Llama 3.3 70B Instruct* | Llama 3.3 license | `gpt://<folder_ID>/llama3.3-70b-instruct` | 16,384 |
| Llama 3.1 70B Instruct* | Llama 3.1 license | `gpt://<folder_ID>/llama3.1-70b-instruct` | 8,192 |
| DeepSeek-R1 Distill Llama 70B | MIT license. Based on Llama-3.3-70B-Instruct; Llama-3.3-70B-Instruct Terms of Use | `gpt://<folder_ID>/deepseek-r1-distill-llama-70b` | 8,192 |
| Qwen2.5 32B Instruct | Apache 2.0 license | `gpt://<folder_ID>/qwen2.5-32b-instruct` | 32,768 |
| DeepSeek-R1 Distill Qwen 32B | MIT license | `gpt://<folder_ID>/deepseek-r1-distill-qwen-32b` | 32,768 |
| phi-4 | MIT license | `gpt://<folder_ID>/phi-4` | 16,384 |
| Gemma 3 1B it | Gemma Terms of Use | `gpt://<folder_ID>/gemma-3-1b-it` | 32,768 |
| Gemma 3 4B it | Gemma Terms of Use | `gpt://<folder_ID>/gemma-3-4b-it` | 131,072 |
| Gemma 3 12B it | Gemma Terms of Use | `gpt://<folder_ID>/gemma-3-12b-it` | 65,536 |
| Gemma 3 27B it | Gemma Terms of Use | `gpt://<folder_ID>/gemma-3-27b-it` | 30,720 |
| Qwen3 0.6B | Apache 2.0 license | `gpt://<folder_ID>/qwen3-0.6b` | 32,768 |
| Qwen3 1.7B | Apache 2.0 license | `gpt://<folder_ID>/qwen3-1.7b` | 32,768 |
| Qwen3 4B | Apache 2.0 license | `gpt://<folder_ID>/qwen3-4b` | 32,768 |
| Qwen3 8B | Apache 2.0 license | `gpt://<folder_ID>/qwen3-8b` | 32,768 |
| Qwen3 14B | Apache 2.0 license | `gpt://<folder_ID>/qwen3-14b` | 32,768 |
| Qwen3 32B | Apache 2.0 license | `gpt://<folder_ID>/qwen3-32b` | 32,768 |
| Qwen3 30B A3B | Apache 2.0 license | `gpt://<folder_ID>/qwen3-30b-a3b` | 32,768 |
| Qwen3 235B A22B | Apache 2.0 license | `gpt://<folder_ID>/qwen3-235b-a22b` | 32,768 |

* Llama was created by Meta. Meta is designated as an extremist organization and its activities are prohibited in Russia.
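
Every text generation URI listed above follows the same pattern: the `gpt://` scheme, your folder ID, then the model name. As a minimal sketch of that pattern, the helper below assembles a URI from its two parts. The function name `model_uri` and the folder ID `b1gexamplefolder` are made up for illustration and are not part of any SDK or real account.

```python
def model_uri(folder_id: str, model_name: str) -> str:
    """Build a URI of the form gpt://<folder_ID>/<model_name>.

    Hypothetical helper: it only formats the string and does not
    check that the model or the folder actually exists.
    """
    if not folder_id or "/" in folder_id:
        raise ValueError("folder_id must be a non-empty string without '/'")
    if not model_name:
        raise ValueError("model_name must be a non-empty string")
    return f"gpt://{folder_id}/{model_name}"


# "b1gexamplefolder" is a placeholder, not a real folder ID.
uri = model_uri("b1gexamplefolder", "qwen2.5-7b-instruct")
print(uri)  # gpt://b1gexamplefolder/qwen2.5-7b-instruct
```

Substitute your own folder ID and any model name from the table when building the URI for a batch job.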

Multimodal models

| Model | License | URI | Context (tokens) |
| --- | --- | --- | --- |
| Qwen2 VL 7B | Apache 2.0 license | `gpt://<folder_ID>/qwen2-vl-7b-instruct/` | 16,384 |
| Qwen2.5 VL 7B | Apache 2.0 license | `gpt://<folder_ID>/qwen2.5-vl-7b-instruct/` | 16,384 |
| Qwen 2.5 VL 32B Instruct | Apache 2.0 license | `gpt://<folder_ID>/qwen2.5-vl-32b-instruct/` | 4,096 |
| DeepSeek VL2 | DeepSeek license | `gpt://<folder_ID>/deepseek-vl2/` | 4,096 |
| DeepSeek VL2 Tiny | DeepSeek license | `gpt://<folder_ID>/deepseek-vl2-tiny/` | 4,096 |
| Gemma 3 4B it | Gemma Terms of Use | `gpt://<folder_ID>/gemma-3-4b-it/` | 4,096 |
| Gemma 3 12B it | Gemma Terms of Use | `gpt://<folder_ID>/gemma-3-12b-it/` | 4,096 |
| Gemma 3 27B it | Gemma Terms of Use | `gpt://<folder_ID>/gemma-3-27b-it/` | 4,096 |
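
The context sizes listed for the multimodal models differ by a factor of four, so it can help to filter models by the context a job needs before picking one. The sketch below does this for a subset of the rows above. It assumes the context values are token counts; how a prompt and its response share that limit is not stated here, so treat `required_tokens` as a rough budget. The names `MULTIMODAL_CONTEXT` and `models_that_fit` are illustrative only.

```python
# Context sizes copied from the multimodal table (subset of rows).
MULTIMODAL_CONTEXT = {
    "qwen2-vl-7b-instruct": 16_384,
    "qwen2.5-vl-7b-instruct": 16_384,
    "qwen2.5-vl-32b-instruct": 4_096,
    "deepseek-vl2": 4_096,
}


def models_that_fit(required_tokens: int, contexts: dict[str, int]) -> list[str]:
    """Return model names whose listed context is at least required_tokens."""
    if required_tokens <= 0:
        raise ValueError("required_tokens must be positive")
    return sorted(
        name for name, limit in contexts.items() if limit >= required_tokens
    )


# Assumed budget of 8,000 tokens: only the two 16,384-token models qualify.
print(models_that_fit(8_000, MULTIMODAL_CONTEXT))
# ['qwen2-vl-7b-instruct', 'qwen2.5-vl-7b-instruct']
```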

Use cases

Running a model in batch mode