"Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions?"

Official code repository for "Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions?" EDM paper.

The code is provided for reference in reproducing the results described in this paper. Please note that the ITS dataset is proprietary and not included. Contact the authors (wmcnichols at umass dot edu) if you are interested.

Overview

Our codebase is composed of three primary runnable python scripts. The rest of the files support the operations described below.

The file train.py runs the fine tuning process for local models (such as Mistral-7B). For our experiments we used a GPU with VRAM of 48GB so the default settings reflect such a hardware envionrment.

The file inference.py performs inference on the test split for the fine-tuned model and expects a similar hardware envionrment as above.

Lastly promptOAI.py uses the Open AI api to perform inference on fine-tuned and untrained Open-AI models on the test splits.

Citation

If you found this project useful, please consider citing our work.

@misc{mcnichols2024largelanguagemodelsreplicate,
      title={Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions?}, 
      author={Hunter McNichols and Jaewook Lee and Stephen Fancsali and Steve Ritter and Andrew Lan},
      year={2024},
      eprint={2405.06414},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2405.06414}, 
}

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
conf		conf
lm_outputs		lm_outputs
prompts/inference		prompts/inference
results		results
sanity_checks		sanity_checks
saved_models		saved_models
splits		splits
.gitignore		.gitignore
ExperimentLogger.py		ExperimentLogger.py
OpenAIInterface.py		OpenAIInterface.py
README.md		README.md
inference.py		inference.py
promptOAI.py		promptOAI.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

"Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions?"

Overview

Citation

About

Releases

Packages

Languages

umass-ml4ed/its_feedback_edm

Folders and files

Latest commit

History

Repository files navigation

"Can Large Language Models Replicate ITS Feedback on Open-Ended Math Questions?"

Overview

Citation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages