Ian J's picture

Ian J

iyanello

·

MIkeLP

AI & ML interests

None yet

Organizations

None yet

iyanello's activity

upvoted a collection 8 days ago

DataGemma Release

A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 8 days ago • 54

upvoted a collection 21 days ago

Knowledge graph

25 items • Updated Feb 11 • 3

upvoted a collection about 1 month ago

VideoLLaMA 2

Optimized VideoLLaMA with improved spatial-temporal modeling and better audio understanding capability • 11 items • Updated 20 days ago • 17

upvoted an article 3 months ago

Article

Extracting Concepts from LLMs: Anthropic’s recent discoveries 📖

By

•

Jun 20

• 26

upvoted 3 collections 4 months ago

C4AI Command R Plus

C4AI Command R+ is an open weights research release of a 104B billion parameter model with highly advanced capabilities. • 4 items • Updated 21 days ago • 38

C4AI Command R

C4AI Command-R is a research release of a 35 billion parameter highly performant generative model. Command-R is a large language model with open weigh • 4 items • Updated 21 days ago • 15

C4AI Aya 23

Aya 23 is an open weights research release of an instruction fine-tuned model with highly advanced multilingual capabilities. • 4 items • Updated Aug 6 • 45

upvoted a paper 4 months ago

Reducing Transformer Key-Value Cache Size with Cross-Layer Attention

Paper • 2405.12981 • Published May 21 • 28

upvoted a paper 5 months ago

Ferret-v2: An Improved Baseline for Referring and Grounding with Large Language Models

Paper • 2404.07973 • Published Apr 11 • 30

upvoted a collection 5 months ago

StarChat2 15B

Model, datasets, and demo for StarChat2 15B. For code to train the models, see: https://github.com/huggingface/alignment-handbook • 10 items • Updated Apr 12 • 13

upvoted a paper 6 months ago

PERL: Parameter Efficient Reinforcement Learning from Human Feedback

Paper • 2403.10704 • Published Mar 15 • 56