Thanks for the checkpoint!
Got a question on batched inference. The inference input is shaped [B, 80, seq_len]. If inputs within the same batch have different effective seq_len values, what do you expect us to pad the shorter inputs with?
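For context, here is a minimal sketch of what I have in mind, assuming zero-padding along the time axis plus keeping the true lengths for masking (the helper name `pad_batch` and the use of zeros are my assumptions, not something from the checkpoint):

```python
import torch

def pad_batch(features):
    # features: list of tensors, each shaped [80, seq_len_i]
    # (hypothetical helper; zero-padding is an assumed choice)
    lengths = torch.tensor([f.shape[-1] for f in features])
    max_len = int(lengths.max())
    batch = torch.zeros(len(features), 80, max_len)
    for i, f in enumerate(features):
        # copy the real frames; trailing frames stay zero
        batch[i, :, : f.shape[-1]] = f
    return batch, lengths  # [B, 80, max_len] and per-item true lengths
```

Is zero-padding like this (with the lengths used to mask attention/pooling) the intended approach, or should we pad with something else?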