Based on the nanochat architecture, this repository contains all the code required to train a large language model from scratch for less than $100 by running the training.sh script. The training ...