To CoT or not to CoT? Chain-of-thought helps mainly on math and symbolic reasoning Paper • 2409.12183 • Published Sep 18 • 36
AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies Paper • 2402.12370 • Published Feb 19 • 1
Benchmarking Language Model Creativity: A Case Study on Code Generation Paper • 2407.09007 • Published Jul 12 • 3
Cognition Collection Perception and abstraction. Each modality is tokenized and embedded into vectors for model to comprehend. • 103 items • Updated 6 days ago • 3
RATIONALYST: Pre-training Process-Supervision for Improving Reasoning Paper • 2410.01044 • Published Oct 1 • 34