← Back

let's make an LLM part 1 'overview'

Sorry, the title was clickbait.

We won't be anywhere NEAR making an LLM for a while. We first need to construct a machine learning framework.

The framework will be written in C++ with no dependencies on PyTorch, TensorFlow, or any other ML library. The only dependencies I will be using are GoogleTest for testing, pybind11 for Python bindings, and the CUDA toolkit for the GPU backend.

All you need is just basic programming proficiency in C++ to follow along.

Preview

The actual code is at github.com/dnexdev/tiramisu.