Results from evaluation pipeline
1B Fine-tuned on Hendrycks STEM
Fine-tuned on mixed data: Hendrycks STEM + PubMed abstracts + wiki
Grid search: 300M pure data sets
1B models grid search over 1B tokens
1B models grid search over 4B tokens
1B models: SMILES grid search over 1B tokens
1B model: 4B tokens, 100% PubMed