charlesniswander (Charles I Niswander II)

upvoted 2 papers about 12 hours ago

Small Language Models are Equation Reasoners

Paper • 2409.12393 • Published Sep 19 • 1

Language Models are Hidden Reasoners: Unlocking Latent Reasoning Capabilities via Self-Rewarding

Paper • 2411.04282 • Published 6 days ago • 20

upvoted a paper 4 days ago

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Paper • 2411.04996 • Published 5 days ago • 43

upvoted a paper 10 days ago

LLaMA-Berry: Pairwise Optimization for O1-like Olympiad-Level Mathematical Reasoning

Paper • 2410.02884 • Published Oct 3 • 48

upvoted a paper 24 days ago

Self-Taught Evaluators

Paper • 2408.02666 • Published Aug 5 • 26

upvoted a paper 28 days ago

SuperCorrect: Supervising and Correcting Language Models with Error-Driven Insights

Paper • 2410.09008 • Published Oct 11 • 16

upvoted a collection about 1 month ago

Reasoning

Collection

151 items • Updated Apr 6 • 27

upvoted 2 papers about 1 month ago

Cognitive Architectures for Language Agents

Paper • 2309.02427 • Published Sep 5, 2023 • 8

Scaling Proprioceptive-Visual Learning with Heterogeneous Pre-trained Transformers

Paper • 2409.20537 • Published Sep 30 • 12

upvoted 3 papers about 2 months ago

upvoted an article about 2 months ago

Article

Welcome FalconMamba: The first strong attention-free 7B model

Aug 12

• 102

upvoted a paper 2 months ago

CoRe: Context-Regularized Text Embedding Learning for Text-to-Image Personalization

Paper • 2408.15914 • Published Aug 28 • 21

upvoted a paper 3 months ago

Scalable Autoregressive Image Generation with Mamba

Paper • 2408.12245 • Published Aug 22 • 23

upvoted a collection 3 months ago

Minitron

Collection

A family of compressed models obtained via pruning and knowledge distillation • 9 items • Updated Oct 3 • 59

upvoted 4 papers 4 months ago

LazyLLM: Dynamic Token Pruning for Efficient Long Context LLM Inference

Paper • 2407.14057 • Published Jul 19 • 44

Human-like Episodic Memory for Infinite Context LLMs

Paper • 2407.09450 • Published Jul 12 • 60

FunAudioLLM: Voice Understanding and Generation Foundation Models for Natural Interaction Between Humans and LLMs

Paper • 2407.04051 • Published Jul 4 • 35

On scalable oversight with weak LLMs judging strong LLMs

Paper • 2407.04622 • Published Jul 5 • 11

Charles I Niswander II

AI & ML interests

Organizations

charlesniswander's activity