Qwen2.5 Collection The Qwen 2.5 models are a series of AI models trained on 18 trillion tokens, supporting 29 languages and offering advanced features such as instructio • 33 items • Updated about 12 hours ago • 2
Article Fit More and Train Faster With ZeRO via DeepSpeed and FairScale Jan 19, 2021 • 4
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 1 day ago • 144
Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models Paper • 2409.12139 • Published 1 day ago • 9
Multimodal Models Collection Multimodal language models with fewer refusals • 2 items • Updated 1 day ago • 1
Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution Paper • 2409.12191 • Published 1 day ago • 47
QA-MDT: Quality-aware Masked Diffusion Transformer for Enhanced Music Generation Paper • 2405.15863 • Published May 24 • 3
Minitron 8B Derivative Collection Derived from the Nemo minitron 8B prune. • 2 items • Updated 1 day ago • 1
S.T.E.M. - ShargeGPT Collection Just a collection of some datasets in shareGPT • 11 items • Updated 3 days ago • 2
OSV: One Step is Enough for High-Quality Image to Video Generation Paper • 2409.11367 • Published 3 days ago • 11
Chain of Thought Empowers Transformers to Solve Inherently Serial Problems Paper • 2402.12875 • Published Feb 20 • 7
Merged Models Collection These are models created by merging existing models that are already fine tuned or even merged themselves. • 4 items • Updated 24 days ago • 1
CleverBoi Collection CleverBoi is a curated collection of data that emphasizes logic, inference, science, code, math and empathy, and its fine tuned language models. • 12 items • Updated 1 day ago • 1
Sunfall Collection Experimental new dataset with fine tuned context tailored to mimic Silly Tavern functionality, such as character Scenario details, content tags, etc. • 7 items • Updated 3 days ago • 1
Portfolio Collection My favorite things that I've done, and the models I'd recommend anybody to use. • 4 items • Updated 4 days ago • 1
InstantDrag: Improving Interactivity in Drag-based Image Editing Paper • 2409.08857 • Published 7 days ago • 24
Article Fine-tuning a token classification model for legal data using Argilla and AutoTrain By bikashpatra • 13 days ago • 11
Experimental Collection Anything marked here may not be up to my quality standards, or may be incomplete and lack full sources. Or it's a literal meme-merge. • 13 items • Updated 3 days ago • 1
TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder Paper • 2409.08248 • Published 8 days ago • 12
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B Paper • 2406.07394 • Published Jun 11 • 21
Article In-browser LLM app in pure Python: Gemini Nano + Gradio-Lite By whitphx • Jul 12 • 8
ImageNet 1K Collection ILSVRC's 2012 ImageNet 1K Subset (1.43M images), optimized for 🤗 Datasets / 🥐 Croissant, with pre-cropped and scaled versions • 5 items • Updated 5 days ago • 2
GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering Paper • 2409.06595 • Published 10 days ago • 37