jingyaogong/minimind🚀🚀 「大模型」3小时完全从0训练26M的小参数GPT!🌏 Train a 26M-parameter GPT from scratch in just 3 hours! Language: Python Stars: 4300 Forks: 512