How to handle the truncate part when concating multiple sequences in pretraining phrase?

#61

by feiyulv - opened Jul 3, 2023

Hi, when pretraining , we concat multiple sequences into a 8192 batch. How to handle the last sequence when it exceeds 8912 with preivous sequences?

Which startegy do we use?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment