-
Self-Rewarding Language Models
Paper • 2401.10020 • Published • 140 -
ReFT: Reasoning with Reinforced Fine-Tuning
Paper • 2401.08967 • Published • 27 -
Tuning Language Models by Proxy
Paper • 2401.08565 • Published • 20 -
TrustLLM: Trustworthiness in Large Language Models
Paper • 2401.05561 • Published • 63
Collections
Discover the best community collections!
Collections including paper arxiv:2402.04615
-
Training Chain-of-Thought via Latent-Variable Inference
Paper • 2312.02179 • Published • 8 -
LLM in a flash: Efficient Large Language Model Inference with Limited Memory
Paper • 2312.11514 • Published • 256 -
TIP: Text-Driven Image Processing with Semantic and Restoration Instructions
Paper • 2312.11595 • Published • 5 -
Quantum Denoising Diffusion Models
Paper • 2401.07049 • Published • 12