Nothing from Something: Can a Language Model Discover 0?

A new study investigates whether AI language models can independently discover the mathematical concept of 'zero'. The findings reveal that while models cannot generalize this concept out-of-the-box, language pretraining reduces the required training examples by 50%.

Computer Science > Artificial Intelligence

Title:Nothing from Something: Can a Language Model Discover 0?

View PDF HTML (experimental)Abstract:AI systems based on artificial neural networks are being developed with aspirations of pushing the boundary of human mathematical knowledge. A key question for these systems is how much they can reach beyond their training data. Mathematical discovery requires a strong form of out of distribution generalization; the ability to hypothesize genuinely new - and potentially logically more powerful - mathematical structures. It has been hypothesized that language abilities support such generalizations in human cognition. In this work, we use simple arithmetic as a case study for examining how modern AI models could expand their mathematical horizons, evaluating whether these models can independently discover the concept of "zero". We show that We show that (1) language models of a GPT-2 size are unable to perform this generalization at test time regardless of language pretraining, but (2) models can improve substantially after training on tens or hundreds of examples of zero. Additionally, we find that language pretraining reduces the number of required examples by approximately $50%$, showing that language abilities can scaffold mathematical discovery in neural models.

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

arXivLabs is a framework that allows collaborators to develop and share new arXiv features directly on our website.

Both individuals and organizations that work with arXivLabs have embraced and accepted our values of openness, community, excellence, and user data privacy. arXiv is committed to these values and only works with partners that adhere to them.

Have an idea for a project that will add value for arXiv's community? Learn more about arXivLabs.

Source: arXiv cs.AI Recent

Computer Science > Artificial Intelligence

Title:Nothing from Something: Can a Language Model Discover 0?

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

More in this category

SEAGym: An Evaluation Environment for Self-Evolving LLM Agents

Closing the Feedback Loop: From Experience Extraction to Insight Governance in Verbal Reinforcement Learning

MapSatisfyBench: Benchmarking Satisfaction-Aware Map Agents through Behavior-Grounded Implicit Decision Factors

MemTrace: Probing What Final Accuracy Misses in Long-Term Memory

Surrogate Assisted Pedestrian Protection Design via a Foundation Model Orchestrated Workflow

SkillChain-Gym: A Benchmark for Reskilling-Aware Production-Inventory Control under Disruptions

Discover All Categories