Unitalk
Qwen

Qwen2 1.5B Instruct (Free)

by SiliconCloud
Qwen2-1.5B-Instruct is an instruction-tuned large language model in the Qwen2 series with 1.5 billion parameters. The model is based on the Transformer architecture and employs techniques such as the SwiGLU activation function, attention QKV bias, and grouped-query attention. It performs well on benchmarks covering language understanding, generation, multilingual ability, coding, mathematics, and reasoning, surpassing most open-source models of comparable size. Compared to Qwen1.5-1.8B-Chat, Qwen2-1.5B-Instruct shows significant improvements on tests such as MMLU, HumanEval, GSM8K, C-Eval, and IFEval, despite having slightly fewer parameters.
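As a rough sketch, the model can be called through an OpenAI-compatible chat completions payload; the helper below only builds the request body locally. The system prompt, `max_tokens` value, and endpoint details are illustrative assumptions, not specified by this page.

```python
import json

def build_chat_request(user_prompt: str,
                       model: str = "Qwen/Qwen2-1.5B-Instruct") -> dict:
    # Standard OpenAI-style chat payload; max_tokens is an arbitrary
    # example value, well under the model's 32K context limit.
    return {
        "model": model,
        "messages": [
            {"role": "system", "content": "You are a helpful assistant."},
            {"role": "user", "content": user_prompt},
        ],
        "max_tokens": 512,
    }

payload = build_chat_request("Explain grouped-query attention in one sentence.")
print(json.dumps(payload, indent=2))
```

Sending this payload to a provider hosting the model (such as SiliconCloud) would require that provider's base URL and an API key, which are not listed here.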

Providers Supporting This Model

SiliconCloud
Qwen/Qwen2-1.5B-Instruct
Maximum Context Length: 32K
Maximum Output Length: --
Input Price: --
Output Price: --