MiniCPM3-4B is the third generation of the MiniCPM series. Its overall performance surpasses Phi-3.5-mini-Instruct and GPT-3.5-Turbo-0125, and it is comparable to many recent 7B–9B models.
Compared to MiniCPM 1.0 and MiniCPM 2.0, MiniCPM3-4B has a more powerful and versatile skill set, enabling more general usage. MiniCPM3-4B supports function calling and a code interpreter; please refer to Advanced Features for usage guidelines.
MiniCPM3-4B has a 32k context window. Equipped with LLMxMapReduce, MiniCPM3-4B can in theory handle unlimited context without requiring a huge amount of memory.
Usage
Set your PAT
You can find your PAT in your security settings. Export it as an environment variable, then import and initialize the API client.
Linux/macOS: export CLARIFAI_PAT="your personal access token"
Windows (PowerShell): $env:CLARIFAI_PAT="your personal access token"
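As a quick sanity check before calling the API, you can confirm the variable is visible to your Python process. This is only an illustrative sketch; the Clarifai SDK typically picks up CLARIFAI_PAT from the environment on its own.

```python
import os

# Confirm the personal access token is available to the process.
pat = os.environ.get("CLARIFAI_PAT")
if not pat:
    raise RuntimeError("CLARIFAI_PAT is not set; export it before running the script.")
```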
Running the API with Clarifai's Python SDK
```python
# Please run `pip install -U clarifai` before running this script
from clarifai.client import Model
from clarifai_grpc.grpc.api.status import status_code_pb2

model = Model(url="https://clarifai.com/openbmb/miniCPM/models/MiniCPM3-4B")

prompt = "What's the future of AI?"

# Stream the generated text chunk by chunk
results = model.generate_by_bytes(prompt.encode("utf-8"), "text")

for res in results:
    if res.status.code == status_code_pb2.SUCCESS:
        print(res.outputs[0].data.text.raw, end='', flush=True)
```
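If you prefer a single, non-streamed response, the same Model object also exposes a predict-style call. The sketch below assumes predict_by_bytes takes the same (bytes, input_type) arguments as generate_by_bytes above.

```python
from clarifai.client import Model

model = Model(url="https://clarifai.com/openbmb/miniCPM/models/MiniCPM3-4B")

prompt = "Summarize the key strengths of small language models."

# Non-streaming call: returns one response object instead of a generator.
# Assumes predict_by_bytes mirrors generate_by_bytes' (bytes, input_type) signature.
response = model.predict_by_bytes(prompt.encode("utf-8"), "text")
print(response.outputs[0].data.text.raw)
```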
Evaluation Results
| Benchmark | Qwen2-7B-Instruct | GLM-4-9B-Chat | Gemma2-9B-it | Llama3.1-8B-Instruct | GPT-3.5-Turbo-0125 | Phi-3.5-mini-Instruct (3.8B) | MiniCPM3-4B |
|---|---|---|---|---|---|---|---|
| **English** | | | | | | | |
| MMLU | 70.5 | 72.4 | 72.6 | 69.4 | 69.2 | 68.4 | 67.2 |
| BBH | 64.9 | 76.3 | 65.2 | 67.8 | 70.3 | 68.6 | 70.2 |
| MT-Bench | 8.41 | 8.35 | 7.88 | 8.28 | 8.17 | 8.60 | 8.41 |
| IFEVAL (Prompt Strict-Acc.) | 51.0 | 64.5 | 71.9 | 71.5 | 58.8 | 49.4 | 68.4 |
| **Chinese** | | | | | | | |
| CMMLU | 80.9 | 71.5 | 59.5 | 55.8 | 54.5 | 46.9 | 73.3 |
| CEVAL | 77.2 | 75.6 | 56.7 | 55.2 | 52.8 | 46.1 | 73.6 |
| AlignBench v1.1 | 7.10 | 6.61 | 7.10 | 5.68 | 5.82 | 5.73 | 6.74 |
| FollowBench-zh (SSR) | 63.0 | 56.4 | 57.0 | 50.6 | 64.6 | 58.1 | 66.8 |
| **Math** | | | | | | | |
| MATH | 49.6 | 50.6 | 46.0 | 51.9 | 41.8 | 46.4 | 46.6 |
| GSM8K | 82.3 | 79.6 | 79.7 | 84.5 | 76.4 | 82.7 | 81.1 |
| MathBench | 63.4 | 59.4 | 45.8 | 54.3 | 48.9 | 54.9 | 65.6 |
| **Code** | | | | | | | |
| HumanEval+ | 70.1 | 67.1 | 61.6 | 62.8 | 66.5 | 68.9 | 68.3 |
| MBPP+ | 57.1 | 62.2 | 64.3 | 55.3 | 71.4 | 55.8 | 63.2 |
| LiveCodeBench v3 | 22.2 | 20.2 | 19.2 | 20.4 | 24.0 | 19.6 | 22.6 |
| **Function Call** | | | | | | | |
| BFCL v2 | 71.6 | 70.1 | 19.2 | 73.3 | 75.4 | 48.4 | 76.0 |
| **Overall** | | | | | | | |
| Average | 65.3 | 65.0 | 57.9 | 60.8 | 61.0 | 57.2 | 66.3 |
Statement
As a language model, MiniCPM3-4B generates content by learning from a vast amount of text.
However, it does not possess the ability to comprehend or express personal opinions or value judgments.
Any content generated by MiniCPM3-4B does not represent the viewpoints or positions of the model developers.
Therefore, when using content generated by MiniCPM3-4B, users should take full responsibility for evaluating and verifying it on their own.
LICENSE
This repository is released under the Apache-2.0 License.
The models and weights of MiniCPM3-4B are completely free for academic research. After filling out a questionnaire for registration, they are also available for free commercial use.
Citation
@article{hu2024minicpm,
title={MiniCPM: Unveiling the Potential of Small Language Models with Scalable Training Strategies},
author={Hu, Shengding and Tu, Yuge and Han, Xu and He, Chaoqun and Cui, Ganqu and Long, Xiang and Zheng, Zhi and Fang, Yewei and Huang, Yuxiang and Zhao, Weilin and others},
journal={arXiv preprint arXiv:2404.06395},
year={2024}
}
ID
Model Type ID: Text To Text
Input Type: text
Output Type: text
Description: MiniCPM3-4B is the third generation of the MiniCPM series. Its overall performance surpasses Phi-3.5-mini-Instruct and GPT-3.5-Turbo-0125, and it is comparable to many recent 7B–9B models.