Leshem Choshen's picture

Leshem Choshen

borgr

·

https://ktilana.wixsite.com/leshem-choshen

AI & ML interests

Merging models, collaboratively improving pretraining, evaluation, understanding

Organizations

borgr's activity

upvoted a paper 26 days ago

LiveXiv -- A Multi-Modal Live Benchmark Based on Arxiv Papers Content

Paper • 2410.10783 • Published 27 days ago • 25

upvoted a paper about 1 month ago

SELECT: A Large-Scale Benchmark of Data Curation Strategies for Image Classification

Paper • 2410.05057 • Published Oct 7 • 7

upvoted a paper about 2 months ago

Acceptable Use Policies for Foundation Models

Paper • 2409.09041 • Published Aug 29 • 1

upvoted a paper 2 months ago

The Future of Open Human Feedback

Paper • 2408.16961 • Published Aug 15 • 20

upvoted 3 papers 3 months ago

The ShareLM Collection and Plugin: Contributing Human-Model Chats for the Benefit of the Community

Paper • 2408.08291 • Published Aug 15 • 9

Learning from Naturally Occurring Feedback

Paper • 2407.10944 • Published Jul 15 • 4

Benchmark Agreement Testing Done Right: A Guide for LLM Benchmark Evaluation

Paper • 2407.13696 • Published Jul 18 • 5

upvoted a paper 4 months ago

Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP

Paper • 2407.00402 • Published Jun 29 • 22

upvoted a paper 5 months ago

Large Language Model Confidence Estimation via Black-Box Access

Paper • 2406.04370 • Published Jun 1 • 19