Skip to content

EleutherAI/math-lm

main
Switch branches/tags

Name already in use

A tag already exists with the provided branch name. Many Git commands accept both tag and branch names, so creating this branch may cause unexpected behavior. Are you sure you want to create this branch?
Code

Files

Permalink
Failed to load latest commit information.
Type
Name
Latest commit message
Commit time
October 5, 2023 17:52
October 11, 2023 12:34
October 12, 2023 01:17
October 16, 2023 15:12
March 27, 2023 12:57
February 1, 2023 19:16
October 16, 2023 21:41
October 16, 2023 19:14

Llemma: an open language model for mathematics

ArXiv | Models | Data | Code | Blog | Sample Explorer

Repository for Llemma: an open language model for mathematics [Azerbayev et al 2023].

This repository hosts data and training code related to the following artifacts:

Name HF Hub Link
Llemma 7b EleutherAI/llemma_7b
Llemma 34b EleutherAI/llemma_34b
Proof-Pile-2 EleutherAI/ProofPile2
  AlgebraicStack EleutherAI/AlgebraicStack

This repository also contains submodules related to the overlap, fine-tuning, and theorem proving experiments described in the paper. Additional evaluation code is in a fork of the Eleuther LM Evaluation Harness.

Directories

This repository contains the following directories

  • proof_pile_2: scripts for downloading and preprocessing data.
  • gpt-neox: git submodule containing a modified branch of EleutherAI/gpt-neox
  • lm-evaluation-harness: code for all evaluations, except formal2formal theorem proving.
  • llemma_formal2formal: git submodule containing scripts for the formal2formal experiments
  • overlap: git submodule containing the overlap and memorization analysis
  • finetunes: git submodule containing scripts for the fine-tuning experiments

Because this project contains submodules, you should clone this project with the --recurse-submodules flag or, alternatively, run git submodule update --init --recursive from within the project directory after cloning the project. After running git pull, you should also run git submodule update.

Citation

Please cite the following:

@misc{azerbayev2023llemma,
      title={Llemma: An Open Language Model For Mathematics}, 
      author={Zhangir Azerbayev and Hailey Schoelkopf and Keiran Paster and Marco Dos Santos and Stephen McAleer and Albert Q. Jiang and Jia Deng and Stella Biderman and Sean Welleck},
      year={2023},
      eprint={2310.10631},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published