Spaces:

vetrovvlad
/

protobench

Sleeping

App Files Files Community

protobench / about.md

vtrv.vls

Added header and banner

dd9842f 4 months ago

|

history blame contribute delete

626 Bytes

	<br/>

	## We propose a new methodology for the evaluation of fundamental models:

	<br/>
	21 challenging tasks for fundamental models covering issues related to world knowledge, logic, cause-and-effect relationships, AI ethics, and much more.

	We have developed an open instructional benchmark for evaluating large language models for the Russian language. A unified leaderboard on the website includes fixed, verified expert tasks and standardized configurations of prompts and parameters.

	The project has been supported by the AI Alliance, leading industrial players, and academic partners engaged in language model research.