AI4Math

Manage your research digests

Subscribe
June 08, 2026
11 papers
Sent

CrowdMath: A Dataset of Crowdsourced Mathematical Research Discussions

Sherin Muckatira, Jesse Geneson, Slava Gerovitch, Pavel Etingof, Mikhail Gronas, Anna Rumshisky

Source: arxiv Related

Discovering Multiscale Deep Formulas in Complex Systems via Neural-Guided Lambda Calculus

Hanqiao Yu, Shusen Yang, Xuebin Ren, Cong Zhao

Source: arxiv Related

A Comprehensive Anatomy of Human and DeepSeek-R1 LLM Mathematical Reasoning

Yuxiang Chen, Jun Wang

Source: arxiv Related

Lean4Agent: Formal Modeling and Verification for Agent Workflow and Trajectory

Ruida Wang, Jerry Huang, Pengcheng Wang, Xuanqing Liu, Luyang Kong, Tong Zhang

Source: arxiv Must Read

How reliable are LLMs when it comes to playing dice?

Luca Avena, Gianmarco Bet, Bernardo Busoni

Source: arxiv Must Read

ThinkBooster: A Unified Framework for Seamless Test-Time Scaling of LLM Reasoning

Vladislav Smirnov, Chieu Nguyen, Sergey Senichev, Minh Ngoc Ta, Ekaterina Fadeeva, Artem Vazhentsev, Daria Galimzianova, Nikolai Rozanov, Viktor Mazanov, Jingwei Ni, Tianyi Wu, Igor Kiselev, Mrinmaya Sachan, Iryna Gurevych, Preslav Nakov, Timothy Baldwin, Artem Shelmanov

Source: arxiv Must Read

RASFT: Rollout-Adaptive Supervised Fine-Tuning for Reasoning

Yongliang Miao, Fengyuan Liu, Wei Shi, Yanguang Liu, Fei Sun, Na Zou, Mengnan Du

Source: arxiv Must Read

DyCon: Dynamic Reasoning Control via Evolving Difficulty Modeling

Tengyao Tu, Yulin Li, Hui-Ling Zhen, Libo Qin, Zhoujun Wei, Jinghua Piao, Zhuotao Tian, Yong Li, Min Zhang

Source: arxiv Must Read

The Fine-Tuning Trap: Evaluating Negative Transfer and the Role of PEFT in Sub-1B Mathematical Reasoning

Rahul Nair, Chun Tao

Source: arxiv Must Read

Skip a Layer or Loop It? Learning Program-of-Layers in LLMs

Ziyue Li, Yang Li, Tianyi Zhou

Source: arxiv Must Read

Are you sure? A Comprehensive and Comprehensible Survey of Uncertainty Quantification in Symbolic Regression

Julia Reuter, Fabricio Olivetti de Franca

Source: arxiv Must Read