MoritzLaurer (Moritz Laurer)

upvoted a collection 1 day ago

Moshi v0.1 Release

MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 13 items • Updated 1 day ago • 141

upvoted an article 7 days ago

Article

Accelerate 1.0.0

7 days ago

• 31

upvoted a paper 15 days ago

Political DEBATE: Efficient Zero-shot and Few-shot Classifiers for Political Text

Paper • 2409.02078 • Published 17 days ago • 8

upvoted a paper 24 days ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published 28 days ago • 109

upvoted an article 29 days ago

Article

The 5 Most Under-Rated Tools on Hugging Face

29 days ago

• 74

upvoted a collection 29 days ago

NIM Serverless Inference API

Collection

Models in this collection are available for inference via a serverless API powered by NVIDIA NIM. • 8 items • Updated 30 days ago • 11

upvoted a paper about 1 month ago

Scaling LLM Test-Time Compute Optimally can be More Effective than Scaling Model Parameters

Paper • 2408.03314 • Published Aug 6 • 30

upvoted 4 articles about 1 month ago

Article

Train Custom Models on Hugging Face Spaces with AutoTrain SpaceRunner

By

•

May 9

• 10

Article

Tool Use, Unified

Aug 12

• 49

Article

Serverless Inference with Hugging Face and NVIDIA NIMs

Jul 29

• 26

Article

XetHub is joining Hugging Face!

Aug 8

• 76

upvoted an article about 2 months ago

Article

Finetuning PaliGemma with AutoTrain

By

•

Jul 25

• 7

upvoted an article 2 months ago

Article

TGI Multi-LoRA: Deploy Once, Serve 30 Models

Jul 18

• 42

upvoted a collection 2 months ago

State-of-the-Art NER models - General purpose

Collection

5 items • Updated Feb 27 • 4

upvoted 2 articles 2 months ago

Article

Our Transformers Code Agent beats the GAIA benchmark!

Jul 1

• 44

Article

Experimenting with Automatic PII Detection on the Hub using Presidio

Jul 10

• 23

upvoted a paper 2 months ago

MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation?

Paper • 2407.04842 • Published Jul 5 • 52

upvoted a paper 3 months ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25 • 84

upvoted 2 articles 3 months ago

Article

Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap

Jun 19

• 11

Article

XLSCOUT Unveils ParaEmbed 2.0: a Powerful Embedding Model Tailored for Patents and IP with Expert Support from Hugging Face

Jun 25

• 9

upvoted 3 papers 3 months ago

upvoted 2 collections 3 months ago

SteerLM

Collection

A collection of models and datasets relating to SteerLM and HelpSteer. • 7 items • Updated Jul 17 • 12

Nemotron 4 340B

Collection

Nemotron-4: open models for Synthetic Data Generation (SDG). Includes Base, Instruct, and Reward models. • 4 items • Updated Jul 17 • 156

upvoted 3 articles 3 months ago

Article

Putting RL back in RLHF

Jun 12

• 58

Article

How to generate text: using different decoding methods for language generation with Transformers

Mar 1, 2020

• 90

Article

Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖

By

•

Jun 20

• 26

upvoted a paper 4 months ago

Chameleon: Mixed-Modal Early-Fusion Foundation Models

Paper • 2405.09818 • Published May 16 • 125

upvoted 2 articles 4 months ago

Article

Space secrets security update

May 31

• 50

Article

Benchmarking Text Generation Inference

May 29

• 26

upvoted a paper 4 months ago

Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations

Paper • 2405.18392 • Published May 28 • 12

upvoted 2 articles 4 months ago

Article

SVGDreamer: Text Guided Vector Graphics Generation with Diffusion Model

By

•

Apr 19

• 5

Article

From cloud to developers: Hugging Face and Microsoft Deepen Collaboration

May 21

• 8

upvoted a paper 4 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15 • 86

upvoted a collection 4 months ago

NuNerZero - Zero Shot NER

Collection

The best compact Zero-Shot NER models with MIT license • 4 items • Updated Jul 3 • 15

upvoted an article 4 months ago

Article

Improving Prompt Consistency with Structured Generations

Apr 30

• 52

upvoted a paper 4 months ago

LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report

Paper • 2405.00732 • Published Apr 29 • 118

upvoted a paper 5 months ago

What matters when building vision-language models?

Paper • 2405.02246 • Published May 3 • 98

upvoted 2 articles 5 months ago

Article

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

May 1

• 61

Article

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Apr 15

• 160

upvoted 3 collections 5 months ago

PDF Document / OCR Datasets

Collection

Document datasets with .pdf files that are usable with pixparse libraries and tools. • 2 items • Updated Mar 30 • 46

OpenELM Instruct Models

Collection

4 items • Updated Jun 19 • 113

Idefics2 🐶

Collection

Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. • 11 items • Updated May 6 • 88

upvoted a paper 5 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12 • 59

upvoted an article 5 months ago

Article

Total noob’s intro to Hugging Face Transformers

Mar 22

• 38

upvoted a paper 6 months ago

StarCoder 2 and The Stack v2: The Next Generation

Paper • 2402.19173 • Published Feb 29 • 132

upvoted a paper 7 months ago

A Critical Evaluation of AI Feedback for Aligning Large Language Models

Paper • 2402.12366 • Published Feb 19 • 3

upvoted 3 collections 7 months ago

Reward models on the hub

Collection

UNMAINTAINED: See RewardBench... A place to collect reward models, an often not released artifact of RLHF. • 18 items • Updated Apr 13 • 24

🤗 Spaces Helper

Collection

5 items • Updated Mar 19 • 2

⛔️🔦 Provenance, Watermarking & Deepfake Detection

Collection

Technical tools for more control over non-consensual synthetic content • 14 items • Updated Apr 1 • 38

upvoted a paper 7 months ago

Multilingual E5 Text Embeddings: A Technical Report

Paper • 2402.05672 • Published Feb 8 • 19

upvoted a paper 8 months ago

Self-Rewarding Language Models

Paper • 2401.10020 • Published Jan 18 • 140

upvoted a collection 8 months ago

Universal token classification

Collection

Collection of universal token classification (UTC) models capable in prompt-tuned manner to solve many information extraction tasks. • 11 items • Updated 10 days ago • 12

upvoted 3 papers 9 months ago

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 79

Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

Paper • 2401.00788 • Published Jan 1 • 21

LLM-Assisted Code Cleaning For Training Accurate Code Generators

Paper • 2311.14904 • Published Nov 25, 2023 • 3

upvoted a paper 10 months ago

Magicoder: Source Code Is All You Need

Paper • 2312.02120 • Published Dec 4, 2023 • 79

upvoted a collection 10 months ago

Seamless Communication

Collection

A significant step towards removing language barriers through expressive, fast and high-quality AI translation. • 16 items • Updated Jan 16 • 144

upvoted a collection 11 months ago

Zeroshot Classifiers

Collection

These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. • 11 items • Updated Apr 3 • 103

Moritz Laurer

AI & ML interests

Articles

Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap

Synthetic data: save money, time and carbon with open source

Organizations

MoritzLaurer's activity

Accelerate 1.0.0

The 5 Most Under-Rated Tools on Hugging Face

Train Custom Models on Hugging Face Spaces with AutoTrain SpaceRunner

Tool Use, Unified

Serverless Inference with Hugging Face and NVIDIA NIMs

XetHub is joining Hugging Face!

Finetuning PaliGemma with AutoTrain

TGI Multi-LoRA: Deploy Once, Serve 30 Models

Our Transformers Code Agent beats the GAIA benchmark!

Experimenting with Automatic PII Detection on the Hub using Presidio

Going multimodal: How Prezi is leveraging the Hub and the Expert Support Program to accelerate their ML roadmap

XLSCOUT Unveils ParaEmbed 2.0: a Powerful Embedding Model Tailored for Patents and IP with Expert Support from Hugging Face

Putting RL back in RLHF

How to generate text: using different decoding methods for language generation with Transformers

Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖

Space secrets security update

Benchmarking Text Generation Inference

SVGDreamer: Text Guided Vector Graphics Generation with Diffusion Model

From cloud to developers: Hugging Face and Microsoft Deepen Collaboration

Improving Prompt Consistency with Structured Generations

Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Total noob’s intro to Hugging Face Transformers