What's Changed
- 0.5.3 post release setup by @rasbt in #1817
- Add cff file by @rasbt in #1818
- Deprecate Support for Dolly, Nous-Hermes, Redpajama-Incite, Vicuna and H2O Danube Models. by @ParagEkbote in #1821
- Adding OLMo by @aflah02 in #1827
- Adding Qwen2.5 by @ysjprojects in #1834
- Restore SlimPajama preprocessing code by @aflah02 in #1840
- Add QwQ-32B-Preview by @ysjprojects in #1844
- Add Mixtral-8x22B by @ysjprojects in #1845
- add Llama-3.3-70B-Instruct by @ysjprojects in #1859
- add Salamandra by @ysjprojects in #1857
- Qwen2.5: fix block size for Coder series by @ysjprojects in #1856
- fix: add missing "," by @vra in #1855
- fix llama3.3 readme url by @ysjprojects in #1862
- Set torch.load(..., weights_only=False) in litgpt/api.py by @Andrei-Aksionov in #1874
- Add Qwen2.5 math by @ysjprojects in #1863
- Add SmolLM2 by @ysjprojects in #1848
- Add Mistral-Large-Instruct-2411 by @ysjprojects in #1876
- Bump version for 0.5.4 release by @Andrei-Aksionov in #1883
- Temporary remove Thunder to make a release by @Andrei-Aksionov in #1884
New Contributors
- @ParagEkbote made their first contribution in #1821
- @ysjprojects made their first contribution in #1834
- @vra made their first contribution in #1855
Full Changelog: v0.5.3...v0.5.4