This is all about doing live migration of VMs that have attached local storage. So the storage needs to move alongside the compute — and it has to physically move, block by block, from the old hypervisor’s local disk to the new hypervisor’s local disk. How do you do that without a horrible stop-the-world for your customers’ applications?
I always wondered how this was done, and this post gives the shape of one approach to the problem. Enjoyed.
The Linux feature we need to make this work already exists; it’s called dm-clone. Given an existing, readable storage device, dm-clone gives us a new device, of identical size, where reads of uninitialized blocks will pull from the original. It sounds terribly complicated, but it’s actually one of the simpler kernel lego bricks. Let’s demystify it.
In ToyKV compaction: it finally begins!, I noted that I’d finally started writing a simple compactor for ToyKV, a key/value store I’ve been writing (to learn about Rust and writing databases) based on an LSM-tree structure. The idea is to have a working database storage engine, albeit not a particularly sophisticated one.
A really important piece of an LSM-tree storage engine is compaction. Compaction takes the many files that the engine produces over the course of processing writes and reduces their volume to improve read performance: it drops old versions of writes and reorganises the data to be more efficient for reads.
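To make that concrete, here’s a minimal sketch of the idea (not ToyKV’s actual code): merge several sorted runs so that only the newest value for each key survives. It assumes each run is just a sorted Vec of key/value pairs, and that runs are ordered oldest-first.

use std::collections::BTreeMap;

/// Merge sorted runs into a single sorted run, keeping only the
/// newest value for each key. `runs` is ordered oldest-first, so
/// later runs overwrite earlier ones in the map.
fn merge_runs(runs: Vec<Vec<(String, String)>>) -> Vec<(String, String)> {
    let mut merged = BTreeMap::new();
    for run in runs {
        for (k, v) in run {
            merged.insert(k, v); // a newer value replaces an older one
        }
    }
    // BTreeMap iterates in key order, giving one sorted run.
    merged.into_iter().collect()
}

A real compactor streams from the sstable files rather than loading everything into memory, but the “newest version wins, output stays sorted” shape is the same.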
I’d avoided working on this because getting a first version built meant writing a large chunk of code. As I mentioned in the post above, by breaking down the task I was able to take it on step by step. And, indeed, Simple compaction v1-v7 by mikerhodes #25 is both large (2,500 new lines of code) and proceeds in a step-by-step manner.
Now let’s talk about a few of the bits of code I’m most happy with. Nothing’s perfect, but I tried to lay a good grounding for adding more sophisticated compaction algorithms later.
Simon Willison links to an idea that I immediately fell in love with, and will obviously never use. But that someone has done it makes me somehow happier.
I Saved a PNG Image To A Bird. Benn Jordan provides one of the all-time great YouTube video titles, and it’s justified. He drew an image in an audio spectrogram, played that sound to a talented starling (internet celebrity “The Mouth”), and recorded the starling almost perfectly imitating it back to him.
Benn himself further says:
Hypothetically, if this were an audible file transfer protocol that used a 10:1 data compression ratio, that’s nearly 2 megabytes of information per second. While there are a lot of caveats and limitations there, the fact that you could set up a speaker in your yard and conceivably store any amount of data in songbirds is crazy.
Sometimes you have to write down an intent publicly to make sure you carry it through to the end. And so here is one such statement, to encourage me to complete this vital part of toykv.
I’ve started work on Simple Compaction · Issue #24 · mikerhodes/toykv. I’d been putting it off for a while — it felt big, and tackling it in the short evening blocks of time I usually have seemed a bit overwhelming.
To make it more manageable, I spent the first hour dividing the work into several “versions”, creating an iterative path to the simplest compaction scheme: merge all sstables into a single sorted run. This broke the back of the task: three hours in, I’ve now reached version 2 of 7.
The top-level method is deceptively, but pleasingly, simple:
// Attempt to run a compaction on the stored data. The exact
// compaction carried out is opaque to the caller.
pub fn compact(&mut self) -> Result<(), ToyKVError> {
    // v2 compaction just compacts all existing sstables
    // in L0 into a single L0 table. This is implemented
    // with SimpleCompactionPolicy.
    let policy = SimpleCompactionPolicy::new();
    // Guard ensures only one compaction can happen at a time.
    let _guard = CompactionGuard::acquire(self.state.clone())?;
    // Get a table compaction task from the sstables structure.
    let c_task = {
        let state = self.state.read().unwrap();
        state.sstables.build_compaction_task_v2(policy)?
    };
    // Compaction task is safe to run outside state lock, to
    // allow reads and writes to continue while we compact.
    let c_result = c_task.compact_v2()?;
    // Commit the compacted table while holding write lock.
    {
        let mut state = self.state.write().unwrap();
        state.sstables.try_commit_compaction_v2(c_result)?;
    }
    self.metrics.compacts.fetch_add(1, Ordering::Relaxed);
    Ok(())
}
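One nice property of this structure is that the expensive work, c_task.compact_v2(), runs without holding the state lock at all: the read lock is held just long enough to decide what to compact, and the write lock just long enough to commit the result. The other small piece doing a lot of work is CompactionGuard. I won’t reproduce ToyKV’s real guard here, but as a sketch only (the real type may well differ), this kind of “at most one compaction at a time” guard is typically an RAII wrapper around an atomic flag that is cleared on Drop, so it can’t leak if compaction bails out early with an error:

use std::sync::{
    atomic::{AtomicBool, Ordering},
    Arc,
};

/// Sketch of a "one compaction at a time" guard; illustrative only.
struct CompactionGuardSketch {
    running: Arc<AtomicBool>,
}

impl CompactionGuardSketch {
    fn acquire(running: Arc<AtomicBool>) -> Result<Self, &'static str> {
        // Atomically flip false -> true; fail if a compaction is
        // already in flight.
        running
            .compare_exchange(false, true, Ordering::AcqRel, Ordering::Acquire)
            .map_err(|_| "compaction already running")?;
        Ok(CompactionGuardSketch { running })
    }
}

impl Drop for CompactionGuardSketch {
    fn drop(&mut self) {
        // Clear the flag even if the compaction errored out.
        self.running.store(false, Ordering::Release);
    }
}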
And so here is my stake in the ground, my statement to say, “Behold, for this is something I intend to complete!” 😬
This is a brief tale, told mostly through links, about subtlety. And fsync, though perhaps the two are synonymous.
While I’m writing about this in September, the events actually happened back around March; I intended to write this up back then, but somehow it just never happened.
Earlier this year, I read NULL BITMAP Builds a Database #2: Enter the Memtable. At the end, Justin Jaffray mentions a potential sad path when the database you are coding up (as one does) crashes. Here, we are talking about whether the database can accidentally lie to a reader about whether a write is on-disk (durable):
I do a write, and it goes into the log, and then the database crashes before we fsync. We come back up, and the reader, having not gotten an acknowledgment that their write succeeded, must do a read to see if it did or not. They do a read, and then the write, having made it to the OS’s in-memory buffers, is returned. Now the reader would be justified in believing that the write is durable: they saw it, after all. But now we hard crash, and the whole server goes down, losing the contents of the file buffers. Now the write is lost, even though we served it!
The solution is easy: just fsync the log on startup so that any reads we do are based off of data that has made it to disk.
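In code, that startup step is tiny. Here’s a minimal Rust sketch, assuming the write-ahead log is a single file (the function name and setup are illustrative, not from any particular codebase): open the log and fsync it before answering any reads.

use std::fs::OpenOptions;
use std::io;
use std::path::Path;

/// On startup, fsync the write-ahead log before serving any reads,
/// so nothing we return to a reader can later vanish from the OS's
/// in-memory buffers. Sketch only; assumes a single-file WAL.
fn sync_log_on_startup(wal_path: &Path) -> io::Result<()> {
    let f = OpenOptions::new().read(true).write(true).open(wal_path)?;
    f.sync_all()?; // fsync(2): flush file data and metadata to disk
    Ok(())
}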
If you’re anything like me, Justin’s failure scenario will take you at least three reads to get the order of events straight in your head; it certainly took me that many. But once it clicked, it felt right to me. Since I work on a database, I thought I’d ask the team whether we did that startup fsync. I was pretty sure we did, but it’s part of my job to double-check these things when I come across them.
Herewith, the story and the warning about subtlety.