Shyam Sudhakaran's picture

Shyam Sudhakaran

shyamsn97

·

AI & ML interests

Reinforcement Learning, Open-Ended Algorithms, Neural Cellular Automata

Organizations

shyamsn97's activity

upvoted a paper 7 days ago

Windows Agent Arena: Evaluating Multi-Modal OS Agents at Scale

Paper • 2409.08264 • Published 8 days ago • 39

upvoted a collection 12 days ago

WebInstruct 🌐 Embeddings 🧱 Models

A collection of SoTA embeddings model fine-tuned on WebInstruct dataset to learn to pair instructions with its responses • 3 items • Updated 15 days ago • 11

upvoted an article 17 days ago

Article

Selective fine-tuning of Language Models with Spectrum

By

•

17 days ago

• 25

upvoted a collection about 1 month ago

💻 Local SmolLMs

SmolLM models in MLC, ONNX and GGUF format for local applications + in-browser demos • 14 items • Updated about 1 month ago • 40

upvoted 2 collections 4 months ago

Mixture-of-preference-reward-modeling

The mixture of preference datasets used for reward modeling. • 2 items • Updated Apr 29 • 2

Standard-format-preference-dataset

We collect the open-source datasets and process them into the standard format. • 14 items • Updated May 8 • 19

upvoted a paper 5 months ago

Data-Efficient Multimodal Fusion on a Single GPU

Paper • 2312.10144 • Published Dec 15, 2023 • 6

upvoted 2 collections 6 months ago

Fine-Tuned

41 items • Updated 11 days ago • 6

Merges

Experimental LLM merging • 1292 items • Updated Jul 21 • 7

upvoted a paper 8 months ago

Transformers are Multi-State RNNs

Paper • 2401.06104 • Published Jan 11 • 34

upvoted a collection 8 months ago

Preference Datasets for DPO

This collection contains a list of curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Jul 30 • 28

upvoted a paper 11 months ago

An Emulator for Fine-Tuning Large Language Models using Small Language Models

Paper • 2310.12962 • Published Oct 19, 2023 • 14

upvoted a collection 11 months ago

🚂 SD-XL Training Suite

All the steps to train your own SD-XL custom model • 7 items • Updated Jun 11 • 18

upvoted a paper about 1 year ago

HyperDreamBooth: HyperNetworks for Fast Personalization of Text-to-Image Models

Paper • 2307.06949 • Published Jul 13, 2023 • 50