v0.4.0rc3
Pre-release
Pre-release
·
617 commits
to main
since this release
What's new
Added 🎉
- Added
DatasetsFormatformat andLoadStreamingDatasetstep todatasetsintegration. SqliteDictFormatfor datasets.- Added
pre_epoch()andpost_epoch()callback methods to PyTorchTrainCallback.
Changed ⚠️
LoadDatasetstep fromdatasetsintegration is now cacheable, using theDatasetsFormatformat by default.
But this only works with non-streaming datasets. For streaming datasets, you should use theLoadStreamingDatasetstep instead.
Fixed ✅
- Fixed bug where
KeyboardInterruptexceptions were not handled properly by steps and workspaces. WandbTrainCallbacknow will use part of the step's unique ID as the name for the W&B run by default, to make
it easier to indentify which tango step corresponds to each run in W&B.WandbTrainCallbackwill save the entireTrainConfigobject to the W&B config.
Commits
20138ce improve release notes generation script
760b4f2 Add DatasetsFormat, making LoadDataset cacheable, fix bug with KeyboardInterrupt (#114)
e51691f Improvements to W&B callback (#115)
d044f6e Add pre/post epoch callbacks (#113)
ae1ae0b Bump myst-parser from 0.15.2 to 0.16.0 (#111)
c605a1e Merge pull request #90 from allenai/SqliteDictFormat