John6666 (John Smith)

upvoted 2 collections about 6 hours ago

Qwen2.5

The Qwen 2.5 models are a series of AI models trained on 18 trillion tokens, supporting 29 languages and offering advanced features such as instructio • 33 items • Updated about 12 hours ago • 2

Diffusion-Papers

Collection

1 item • Updated about 18 hours ago • 1

upvoted a paper about 6 hours ago

OmniGen: Unified Image Generation

Paper • 2409.11340 • Published 3 days ago • 55

upvoted 2 collections about 6 hours ago

Dusk_Rainbow

Collection

10 items • Updated Aug 17 • 1

All my models - in order

Collection

14 items • Updated 32 minutes ago • 2

upvoted a paper about 16 hours ago

Qwen2.5-Coder Technical Report

Paper • 2409.12186 • Published 1 day ago • 74

upvoted a collection about 17 hours ago

Qwen

Collection

Alibaba Cloud-based models • 109 items • Updated about 14 hours ago • 1

upvoted a collection about 19 hours ago

Flow-Judge-v0.1

Collection

Flow-Judge-v0.1 models • 5 items • Updated 3 days ago • 12

upvoted an article about 19 hours ago

Article

Fit More and Train Faster With ZeRO via DeepSpeed and FairScale

Jan 19, 2021

• 4

upvoted a collection about 19 hours ago

FLUX-LoRAs

Collection

7 items • Updated about 17 hours ago • 1

upvoted 2 collections about 20 hours ago

Collection Zero & Demo

Collection

Image Gen - Text -to-Image • 22 items • Updated 12 days ago • 10

Moshi v0.1 Release

Collection

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 1 day ago • 144

upvoted an article about 20 hours ago

Article

MobileNet-V4 (now in timm)

By

•

Jun 17

• 37

upvoted a paper about 21 hours ago

Takin: A Cohort of Superior Quality Zero-shot Speech Generation Models

Paper • 2409.12139 • Published 1 day ago • 9

upvoted 2 collections about 23 hours ago

Multimodal Models

Collection

Multimodal language models with less refusals • 2 items • Updated 1 day ago • 1

Text Models

Collection

Text generation models with less refusals • 2 items • Updated 1 day ago • 1

upvoted 2 papers 1 day ago

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published 1 day ago • 47

QA-MDT: Quality-aware Masked Diffusion Transformer for Enhanced Music Generation

Paper • 2405.15863 • Published May 24 • 3

upvoted a collection 1 day ago

Minitron 8B Derivative

Collection

Derived from the Nemo minitron 8B prune. • 2 items • Updated 1 day ago • 1

upvoted an article 1 day ago

Article

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

2 days ago

• 101

upvoted 3 collections 2 days ago

upvoted a paper 2 days ago

OSV: One Step is Enough for High-Quality Image to Video Generation

Paper • 2409.11367 • Published 3 days ago • 11

upvoted 3 collections 2 days ago

Japanese ASR Models

Collection

Japanese ASR Models • 5 items • Updated 3 days ago • 2

Kotoba-Whisper

Collection

A collection of kotoba-whisper models. • 8 items • Updated 2 days ago • 2

Kurage

Collection

Multipurpose RAG models for many languages • 13 items • Updated 3 days ago • 1

upvoted a paper 3 days ago

Chain of Thought Empowers Transformers to Solve Inherently Serial Problems

Paper • 2402.12875 • Published Feb 20 • 7

upvoted 5 collections 3 days ago

Merged Models

Collection

These are models created by merging existing models that are already fine tuned or even merged themselves. • 4 items • Updated 24 days ago • 1

CleverBoi

Collection

CleverBoi is a curated collection of data that emphasizes logic, inference, science, code, math and empathy, and its fine tuned language models. • 12 items • Updated 1 day ago • 1

Sunfall

Collection

Experimental new dataset with fine tuned context tailored to mimic Silly Tavern functionality, such as character Scenario details, content tags, etc. • 7 items • Updated 3 days ago • 1

Komorebi

Collection

Multi-phase KTO RP series • 1 item • Updated 3 days ago • 1

Portfolio

Collection

My favorite things that I've done, and the models I'd recommend anybody to use. • 4 items • Updated 4 days ago • 1

upvoted a paper 3 days ago

InstantDrag: Improving Interactivity in Drag-based Image Editing

Paper • 2409.08857 • Published 7 days ago • 24

upvoted an article 4 days ago

Article

Introducing Community Tools

4 days ago

• 18

upvoted 2 collections 4 days ago

DanTagGen

Collection

Danbooru Tag-based prompt Generator • 5 items • Updated May 1 • 4

Kohaku XL

Collection

Kohaku series SDXL anime base model • 9 items • Updated 16 days ago • 6

upvoted 3 articles 4 days ago

Article

Fine-tuning Parler TTS on a Specific Language

By

•

4 days ago

• 13

Article

Training Flux Locally on Mac

By

•

8 days ago

• 9

Article

Fine-tuning a token classification model for legal data using Argilla and AutoTrain

By

•

13 days ago

• 11

upvoted 2 collections 4 days ago

Experimental

Collection

Anything marked here may not be upto my quality standards/May be incomplete and not have full sources. Or it's literal meme-merges • 13 items • Updated 3 days ago • 1

Main releases

Collection

7 items • Updated 3 days ago • 1

upvoted a collection 5 days ago

Highlighted work

Collection

My "greatest hits", sort of • 7 items • Updated 4 days ago • 4

upvoted a paper 5 days ago

TextBoost: Towards One-Shot Personalization of Text-to-Image Models via Fine-tuning Text Encoder

Paper • 2409.08248 • Published 8 days ago • 12

upvoted an article 5 days ago

Article

Explaining the SDXL latent space

By

•

May 20

• 29

upvoted a collection 5 days ago

Barcenas Cartas y Compresión Barcenas

Collection

2 items • Updated 7 days ago • 1

upvoted a paper 6 days ago

Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8B

Paper • 2406.07394 • Published Jun 11 • 21

upvoted an article 6 days ago

Article

In-browser LLM app in pure Python: Gemini Nano + Gradio-Lite

By

•

Jul 12

• 8

upvoted 5 collections 6 days ago

UpScale / Enhancers

Collection

6 items • Updated 2 days ago • 6

ImageNet 1K

Collection

ILSVRC's 2012 ImageNet 1K Subset (1.43M images,) Optimized for 🤗 Datasets / 🥐 Croissant, with Pre-Cropped and Scaled Versions • 5 items • Updated 5 days ago • 2

Fireball Llama collections

Collection

Llama collections • 3 items • Updated 12 days ago • 1

small models

Collection

4 items • Updated about 1 month ago • 1

roleplay models

Collection

32 items • Updated Apr 11 • 1

upvoted 2 articles 6 days ago

Article

Accelerate 1.0.0

7 days ago

• 31

Article

"Diffusers Image Fill" guide

By

•

7 days ago

• 20

upvoted 2 papers 7 days ago

GroUSE: A Benchmark to Evaluate Evaluators in Grounded Question Answering

Paper • 2409.06595 • Published 10 days ago • 37

LinFusion: 1 GPU, 1 Minute, 16K Image

Paper • 2409.02097 • Published 17 days ago • 31

upvoted a collection 7 days ago

Working Merge in my Profile

Collection

9 items • Updated 7 days ago • 1

upvoted a paper 7 days ago

Multi-Head Mixture-of-Experts

Paper • 2404.15045 • Published Apr 23 • 58

upvoted a collection 7 days ago

Violet Twilight

Collection

Merge of Crimson Dawn and Azure Dusk • 7 items • Updated 7 days ago • 2

John Smith PRO

AI & ML interests

Organizations

John6666's activity

Fit More and Train Faster With ZeRO via DeepSpeed and FairScale

MobileNet-V4 (now in timm)

Fine-tuning LLMs to 1.58bit: extreme quantization made easy

Introducing Community Tools

Fine-tuning Parler TTS on a Specific Language

Training Flux Locally on Mac

Fine-tuning a token classification model for legal data using Argilla and AutoTrain

Explaining the SDXL latent space

In-browser LLM app in pure Python: Gemini Nano + Gradio-Lite

Accelerate 1.0.0

"Diffusers Image Fill" guide