
decoding parameters (e.g., temperature) for Gemma-2? #64

Open

iseesaw opened this issue Sep 5, 2024 · 4 comments

iseesaw commented Sep 5, 2024

Hello, how should I set the decoding parameters (e.g., temperature) for Gemma-2? My result is around 50.0, far from the reported benchmark of 76.



MaoXinn commented Sep 15, 2024

Hi, I also ran into this problem. I only got the following WR/LC:
54.47204968944099, 59.969975205397596

Here is my evaluation config:

```yaml
Gemma-2-Aligned-simpo:
  completions_kwargs:
    batch_size: 900
    max_new_tokens: 4096
    model_kwargs:
      dtype: bfloat16
    model_name: princeton-nlp/gemma-2-9b-it-SimPO
    stop_token_ids:
      - 1
      - 107
    temperature: 0.5
    top_p: 1.0
  fn_completions: vllm_local_completions
  pretty_name: gemma-2-9b-it-SimPO
  prompt_template: ./eval_config/gemma2_prompt.txt
```

The only difference is that I removed "do_sample: true".

I reviewed your config and your conversation with the AlpacaEval author on GitHub, and now I'm quite confused.
Even after downgrading AlpacaEval 2 to 0.6.2, I still couldn't run it with the configuration you provided. The main problem seems to be beam search. Should I enable beam search? If so, the temperature must be set to 0, but I don't know what the beam size should be.
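
For reference, here is a minimal sketch (not from this thread) of how the decoding settings above map onto vLLM sampling parameters. It assumes an older vLLM release in which SamplingParams still accepts use_beam_search; newer releases removed that flag:

```python
from vllm import SamplingParams

# Sampling, matching the YAML config above.
sampling = SamplingParams(
    temperature=0.5,
    top_p=1.0,
    max_tokens=4096,          # "max_new_tokens" in the YAML
    stop_token_ids=[1, 107],  # Gemma-2's <eos> and <end_of_turn>
)

# Beam search (older vLLM only): temperature must be 0, and best_of
# serves as the beam width. The width of 4 here is a placeholder,
# since the thread never settles on a value.
beam = SamplingParams(
    use_beam_search=True,
    best_of=4,
    temperature=0.0,
    max_tokens=4096,
    stop_token_ids=[1, 107],
)
```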

Thank you~


LotuSrc commented Sep 23, 2024

Maybe you used alpaca_eval_gpt4_turbo_fn as the annotator. With that setting, the result is close to the one you reported.
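
If that is the cause, the sketch below shows where the annotator choice enters the scoring call. It assumes alpaca_eval's Python entry point alpaca_eval.main.evaluate and a hypothetical outputs path; exact signatures vary across versions:

```python
from alpaca_eval import main

# Score the same generations under both annotator configs. AE2's
# headline LC/WR numbers use the length-controlled
# "weighted_alpaca_eval_gpt4_turbo" annotator; the older
# "alpaca_eval_gpt4_turbo_fn" yields different numbers.
for annotator in ("weighted_alpaca_eval_gpt4_turbo", "alpaca_eval_gpt4_turbo_fn"):
    main.evaluate(
        model_outputs="model_outputs.json",  # hypothetical path
        annotators_config=annotator,
    )
```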

@xiamengzhou
Contributor

@MaoXinn It's a bit tricky to interpret what happened from the information you provided. How about we troubleshoot it step by step? You could begin by running the evaluation on the outputs we provided for AlpacaEval and checking whether you get a similar score first.
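
As a concrete form of that first step, a minimal sketch (path hypothetical) of scoring the released outputs before touching the local decoding setup:

```python
from alpaca_eval import main

# If the provided generations reproduce the ~76 benchmark score, the
# gap comes from local decoding, not from the scoring setup. The AE2
# default annotator ("weighted_alpaca_eval_gpt4_turbo") is used.
main.evaluate(
    model_outputs="gemma-2-9b-it-SimPO/model_outputs.json",  # hypothetical path to the provided outputs
)
```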
