Skip to main content

Gemma

Gemma is a family of lightweight models developed by Google. Rubra offers the 2B variant with function calling, which is most appropriate for single instruction chats.

note
ModelParamsContext LengthGQAToken CountKnowledge Cutoff
Gemma-1.1 2B Instruct2B8192No (MQA)6TUndisclosed
tip

We observe in the benchmarks and in our own testing that Gemma models do not perform well when the number of turns (chat messages) increases.

Gemma-1.1 2B Instruct

ModelFunction CallingMMLUGPQAGSM-8KMATHMT-benchMT-bench Pairwise Comparison
WinLossTieWin RateLoss RateAdjusted Win Rate
Gemma-1.1 2B Instruct-37.8422.996.296.145.823356710.206250.350000.428125
Rubra Enhanced Gemma-1.1 2B Instruct45.00%38.8524.556.142.385.755633710.350000.206250.571875