Spaces: Running on Zero
Commit History
Cast int sample id to str (#96)
e299427
unverified
Get generation working for BLT (#86)
b79eb3e
unverified
Some fixes for entropy model predictions (#83)
fc946a1
unverified
Update ppl evals to work with blt model, in addition to entropy model (#82)
083656c
unverified
Reduce per file resources arrow uses (#77)
63913e4
unverified
Let process start before yielding preloaded prefetch buffer, avoid needlessly losing buffer in edge cases (#75)
8f2cf88
unverified
Add approximate state persistence (#73)
ea1fc75
unverified
Correctly reset batch iterator at each arrow create_iter call. (#74)
c727844
unverified
Pass mask in packing_iterator, correctly handle last batch, fix masking (#65)
08b8c7c
unverified
Remove byte tokenizer and add config args to switch between byte/patch packing (#68)
aeb95f1
unverified
Update iterator inheritance, pass file format args, limit iterator (#63)
fc3399e
unverified
Fix multiprocessing dataloader checkpointing and use it in the train script (#50)
8c61ab5
unverified
Test first batch matches (#53)
85c2f28
unverified
Allow ArrowIterator to read from json (#45)
936d943
unverified
This includes fixes that make checkpointing and reloading work correctly. (#35)
7044771
unverified
Initial codes and scripts for training entropy model (#34)
7622d28
unverified
Update file check script to check sizes (#32)
bc42ceb
unverified
Fix realtime entropy patching (#26)
392117b
unverified
Ink committed on