The purpose of this issue is to have a conversation/brainstorm about what should be implemented to get Simplexity into better working shape for general use, with a focus on reproducibility, organization, and ease of use. A few things off the top of my head:
- Seems like everything should go through MLflow, including artifacts, saved checkpoints, and maybe even analysis results?
- @ealt mentioned a tagging system for deciding what should (or should not) be saved
- Currently we have to set `seq_len`, `n_ctx`, vocab size, bos, etc., and it's hard to think through how they all relate. In the standard case, defining one or two of those should set the rest.
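
To make the last point concrete, here is a rough sketch of what "define one or two, derive the rest" could look like. Everything in it is hypothetical: `SequenceConfig` is not an existing Simplexity class, and the relationships are assumptions for discussion (BOS gets the next free token id after the generator's symbols, and the model does next-token prediction on `tokens[:-1]`). If the real conventions differ, only the property bodies would need to change.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class SequenceConfig:
    """Hypothetical helper: the user sets three values, the rest are derived."""

    generator_vocab_size: int  # number of distinct symbols the data generator emits
    seq_len: int               # number of generated symbols per sequence
    use_bos: bool = True       # whether a BOS token is prepended to each sequence

    def __post_init__(self) -> None:
        # Fail early on combinations that cannot produce a usable model input.
        if self.generator_vocab_size < 1:
            raise ValueError("generator_vocab_size must be at least 1")
        if self.n_ctx < 1:
            raise ValueError("seq_len is too short to yield any model input")

    @property
    def bos_token(self) -> Optional[int]:
        # Assumption: BOS takes the next free id after the generator's symbols.
        return self.generator_vocab_size if self.use_bos else None

    @property
    def vocab_size(self) -> int:
        # Model vocabulary = generator symbols plus one slot for BOS, if used.
        return self.generator_vocab_size + (1 if self.use_bos else 0)

    @property
    def n_ctx(self) -> int:
        # Assumption: inputs are tokens[:-1] and targets are tokens[1:], so the
        # context is one shorter than the full (optionally BOS-prefixed) sequence.
        return self.seq_len + (1 if self.use_bos else 0) - 1


cfg = SequenceConfig(generator_vocab_size=3, seq_len=8)
print(cfg.vocab_size, cfg.bos_token, cfg.n_ctx)
```

The upside of something like this is that the derived values can never drift out of sync with each other, and the whole thing can be logged as a single set of params per run.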