I ask Claude to help me code these steps so I can understand the concepts of LLMs better. Here is the post of how I think about LLMs (immortality). You can see examples of each step below.
Training pipeline
Click a stage to jump. Active stage is highlighted.
The same model moves through these phases; each stage changes the weights to make the outputs more useful.
The model reads billions of text examples from the entire internet - books, websites, articles, conversations. It learns patterns and relationships between words, creating a compressed 'zip file' of human knowledge.
Knock! Knock! ...
Who's there?
The model learns common patterns and can predict what comes next based on probability from the training data.
After pre-training, we show the model specific conversations from experts or specific people. The model learns their style, knowledge, and way of thinking.
Hi, how is your bitcoin going?
Yeah bits is going ok, currently tracking the hashrate trends and looking at the correlation with mining difficulty adjustments...
The model learns to respond like your cryptocurrency expert friend, using their knowledge and speaking style.
The model practices solving problems on its own, just like a student doing homework after class. It tries different approaches, learns from mistakes, and gets better over time.
The model improves beyond its training by solving new problems independently, becoming smarter than the data it was trained on.
These same three steps could be applied to your personal data - your conversations, memories, and characteristics. The result? An AI model that thinks, speaks, and writes exactly like you... potentially living on forever.