with load_in_4bit it just generates <pad> tokens
#16 by NePe
I used the example from the model card with the latest version of transformers, but with load_in_4bit=True the model only generates <pad> tokens.
You should pass torch_dtype=torch.bfloat16 when loading the model for it to work.
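For reference, a minimal sketch of loading in 4-bit with a bfloat16 compute dtype; the model ID and prompt here are placeholders, substitute the ones from the model card:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "model-id-from-the-model-card"  # placeholder, not the actual repo name

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    load_in_4bit=True,           # quantize weights to 4-bit via bitsandbytes
    torch_dtype=torch.bfloat16,  # compute dtype; without this, generation can emit only <pad> tokens
    device_map="auto",
)

inputs = tokenizer("Hello, my name is", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```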
Thanks, this fixed my issue!
NePe changed discussion status to closed