Long Context Evaluation
#430 by mrfakename - opened
Hi,
Is there an evaluation that tests a model's ability to perform well on longer prompts?
I would like this one, too
Hi! If one of you wants to set up the above dataset as a leaderboard, I can give you a hand. (We won't add it to the Open LLM Leaderboard, however.)
clefourrier changed discussion status to closed