Existing tasks in lm-evaluation-harness

Tasks can be found here

lm-evaluation-harness tasks

Only the tasks used to evaluate Galactica are marked True in the Galactica_task column. Some of the False tasks were actually used to train Galactica not to evaluate it. We should gather these together and add to our dataset for training.

To Do List

Ideas for specific tasks to add to lm-evaluation-harness