view article Article SeeMoE: Implementing a MoE Vision Language Model from Scratch By AviSoori1x โข Jun 23 โข 34
Quantized-FT-Orca-Math Collection Models trained during quantization aware fine-tuning experiments using PyTorch's FSDP. โข 8 items โข Updated Aug 20 โข 7
Teaching Large Language Models to Reason with Reinforcement Learning Paper โข 2403.04642 โข Published Mar 7 โข 46