dtype: float32 in base model vs. dtype: bfloat16 in the instruction fine-tuned model

#32
by tanliboy

In the base model, the dtype is float32; however, in the instruction fine-tuned model, the dtype is [bfloat16](https://huggingface.co/google/gemma-2-9b-it/blob/main/config.json#L29).

Is this inconsistency intentional or a bug?
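For reference, the difference can be confirmed by reading the `torch_dtype` field that each checkpoint's config.json reports. A minimal sketch using the transformers `AutoConfig` API (note that the Gemma repos are gated, so authenticating with `huggingface-cli login` may be required first):

```python
from transformers import AutoConfig

# Compare the torch_dtype recorded in each checkpoint's config.json.
for repo_id in ("google/gemma-2-9b", "google/gemma-2-9b-it"):
    config = AutoConfig.from_pretrained(repo_id)
    print(repo_id, config.torch_dtype)
```

Regardless of what config.json records, the load-time dtype can be overridden explicitly, e.g. by passing `torch_dtype=torch.bfloat16` to `AutoModelForCausalLM.from_pretrained`, so the config value mainly acts as a default.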
