AMD-OLMo Collection AMD-OLMo is a series of 1 billion parameter language models based on OLMo and trained by AMD on AMD Instinct™ MI250 GPUs. • 4 items • Updated 9 days ago • 16
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 8 items • Updated 6 days ago • 160
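A minimal sketch of how a compact SmolLM2 checkpoint could be loaded for on-device style generation with transformers; the repo id HuggingFaceTB/SmolLM2-135M-Instruct and the use of its chat template are assumptions, so substitute whichever checkpoint you take from the collection.

```python
# Minimal sketch: running a small SmolLM2 checkpoint with transformers.
# The model id below is an assumption about the collection's contents.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "HuggingFaceTB/SmolLM2-135M-Instruct"  # assumed checkpoint name
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float32)

messages = [{"role": "user", "content": "Summarize what GGUF is in one sentence."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
)
outputs = model.generate(inputs, max_new_tokens=64, do_sample=False)
print(tokenizer.decode(outputs[0][inputs.shape[-1]:], skip_special_tokens=True))
```

At the 135M and 360M sizes, a plain float32 CPU run like this is usually enough for experimentation; the 1.7B variant benefits from a GPU or a quantized export.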
MobileLLM Collection Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 8 items • Updated 3 days ago • 89
🚀GGUF Collection Llama.cpp-compatible models that can run on CPUs and GPUs! • 870 items • Updated about 18 hours ago • 34
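A minimal sketch of one way these GGUF files can be used, via the llama-cpp-python bindings around llama.cpp; the file name and quantization below are placeholders rather than a specific item from the collection.

```python
# Minimal sketch: loading a GGUF file with llama-cpp-python (wraps llama.cpp).
# model_path is a placeholder; point it at any GGUF file downloaded from the collection.
from llama_cpp import Llama

llm = Llama(
    model_path="./model-q4_k_m.gguf",  # placeholder path
    n_ctx=4096,        # context window
    n_gpu_layers=0,    # 0 = pure CPU; set to -1 to offload all layers on a GPU build
)

out = llm.create_completion("Q: What does GGUF stand for?\nA:", max_tokens=64)
print(out["choices"][0]["text"])
```

The same files also work directly with the llama.cpp command-line tools (recent builds ship a llama-cli binary), so the Python bindings are optional.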
Article How to build a custom text classifier without days of human labeling By sdiazlor • 24 days ago • 54
Falcon Mamba: The First Competitive Attention-free 7B Language Model Paper • 2410.05355 • Published Oct 7 • 27
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs Paper • 2410.05295 • Published Oct 3 • 12
MagpieLM Collection Aligning LMs with a Fully Open Recipe (data + training configs + logs) • 9 items • Updated Sep 22 • 15
Mini-Omni: Language Models Can Hear, Talk While Thinking in Streaming Paper • 2408.16725 • Published Aug 29 • 52
ChatGPT-Mini Collection A collection of fine-tuned GPT-2 models, each designed to run a ChatGPT-like assistant at home, even on an old computer. • 8 items • Updated Nov 16, 2023 • 4
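A minimal sketch of the kind of CPU-only home deployment this description suggests, using the transformers text-generation pipeline; the model id below is a placeholder, not an actual checkpoint from the collection.

```python
# Minimal sketch: serving a GPT-2-sized chat fine-tune on CPU with the
# transformers pipeline. The model id is a placeholder; swap in a checkpoint
# from the collection.
from transformers import pipeline

generator = pipeline(
    "text-generation",
    model="gpt2",   # placeholder; a ~124M-parameter model runs fine on old CPUs
    device=-1,      # -1 = CPU
)

result = generator(
    "User: Tell me a joke.\nAssistant:", max_new_tokens=50, do_sample=True
)
print(result[0]["generated_text"])
```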
Phi-3 Collection Phi-3 family of small language and multi-modal models. The language models are available in short- and long-context variants. • 27 items • Updated 11 days ago • 489