Collections including paper arxiv:2311.07463

Gemma: Open Models Based on Gemini Research and Technology

Paper • 2403.08295 • Published Mar 13 • 47
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models

Paper • 2403.13372 • Published Mar 20 • 62
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM

Paper • 2403.07816 • Published Mar 12 • 39
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks

Paper • 2311.07463 • Published Nov 13, 2023 • 13

Visual In-Context Prompting

Paper • 2311.13601 • Published Nov 22, 2023 • 16
Textbooks Are All You Need

Paper • 2306.11644 • Published Jun 20, 2023 • 142
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework

Paper • 2308.08155 • Published Aug 16, 2023 • 3
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models

Paper • 2303.02927 • Published Mar 6, 2023 • 3

Large Language Model (LLM) and NLP related papers.

about 2 hours ago

LoRA+: Efficient Low Rank Adaptation of Large Models

Paper • 2402.12354 • Published Feb 19 • 6
The FinBen: An Holistic Financial Benchmark for Large Language Models

Paper • 2402.12659 • Published Feb 20 • 16
TofuEval: Evaluating Hallucinations of LLMs on Topic-Focused Dialogue Summarization

Paper • 2402.13249 • Published Feb 20 • 10
TrustLLM: Trustworthiness in Large Language Models

Paper • 2401.05561 • Published Jan 10 • 64

about 1 month ago

Holistic Evaluation of Text-To-Image Models

Paper • 2311.04287 • Published Nov 7, 2023 • 11
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks

Paper • 2311.07463 • Published Nov 13, 2023 • 13
Trusted Source Alignment in Large Language Models

Paper • 2311.06697 • Published Nov 12, 2023 • 10
DiLoCo: Distributed Low-Communication Training of Language Models

Paper • 2311.08105 • Published Nov 14, 2023 • 14

Microsoft Research Papers

#PapersToRead from Microsoft Research in the broad space of generative AI, multi-agent systems, responsible AI practices, LLMOps, and language models.

Visual In-Context Prompting

Paper • 2311.13601 • Published Nov 22, 2023 • 16
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation Framework

Paper • 2308.08155 • Published Aug 16, 2023 • 3
LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models

Paper • 2303.02927 • Published Mar 6, 2023 • 3
The Impact of Large Language Models on Scientific Discovery: a Preliminary Study using GPT-4

Paper • 2311.07361 • Published Nov 13, 2023 • 12

Exponentially Faster Language Modelling

Paper • 2311.10770 • Published Nov 15, 2023 • 118
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks

Paper • 2311.07463 • Published Nov 13, 2023 • 13

MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks

Paper • 2311.07463 • Published Nov 13, 2023 • 13
UFOGen: You Forward Once Large Scale Text-to-Image Generation via Diffusion GANs

Paper • 2311.09257 • Published Nov 14, 2023 • 45
nitrosocke/redshift-diffusion

Text-to-Image • Updated May 16, 2023 • 624 • 610
stabilityai/sdxl-turbo

Text-to-Image • Updated Jul 10 • 1.18M • 2.27k

😱 Microsoft Papers with no code/data release

Collection of Microsoft Papers with no code/data release

MEGA: Multilingual Evaluation of Generative AI

Paper • 2303.12528 • Published Mar 22, 2023
MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks

Paper • 2311.07463 • Published Nov 13, 2023 • 13
Kosmos-2.5: A Multimodal Literate Model

Paper • 2309.11419 • Published Sep 20, 2023 • 50
A Unified View of Masked Image Modeling

Paper • 2210.10615 • Published Oct 19, 2022

Challenges and Opportunities of Using Transformer-Based Multi-Task Learning in NLP Through ML Lifecycle: A Survey

Paper • 2308.08234 • Published Aug 16, 2023 • 1
Understanding and Improving Information Transfer in Multi-Task Learning

Paper • 2005.00944 • Published May 2, 2020 • 1
Improving Multi-task Learning via Seeking Task-based Flat Regions

Paper • 2211.13723 • Published Nov 24, 2022 • 1
Improvable Gap Balancing for Multi-Task Learning

Paper • 2307.15429 • Published Jul 28, 2023 • 1

Papers: Evaluation

Skill-Mix: a Flexible and Expandable Family of Evaluations for AI models

Paper • 2310.17567 • Published Oct 26, 2023 • 1
This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models

Paper • 2310.15941 • Published Oct 24, 2023 • 6
Holistic Evaluation of Language Models

Paper • 2211.09110 • Published Nov 16, 2022 • 1
INSTRUCTEVAL: Towards Holistic Evaluation of Instruction-Tuned Large Language Models

Paper • 2306.04757 • Published Jun 7, 2023 • 6
