<aside>
📌 The below has now crystallised into instructions in a readme on the chemnlp repo here.
</aside>
Initial thinking
Overview
We need to think more about the git strategy for key dependencies which we want to modify. Two concrete examples of this at the moment are the gpt-neox and lm-evaluation-harness codebases. Some things to consider:
- We want the workflow to be as easy as possible for collaborators to contribute.
- We want to be able to pull upstream changes from either repo.
- We likely want to maintain our own set of changes and we may or may not want to push those changes back to the original repo at some point.
- ChemNLP is likely to want to modify either of
gpt-neoxor lm-evaluation-harness in many of its GH issues.
- These aren't just "normal" dependencies - they are key parts of the pipeline we're likely going to want to develop in conjunction with ChemNLP.
Reference
TODO
- Ensure we address how dependency management works
- Ensure
git remote is set correctly in each workflow ← add to instructions if not
- Draw a Miro diagram?
Option 1: GitHub forks
Option 2: Git submodules