Thought Eddies

Posts

Claude 3.5 Sonnet Connections Evals

I've continued experimenting with techniques to prompt a language model to solve Connections. At a high level, I set out to design an approach that holds the model to a standard similar to that of a human player, within the restrictions of the game. These standards and guardrails include the following: The...

TIL

FastHTML Loading Spinner

I've enjoyed using FastHTML to deploy small, easily hosted webpages for little apps I've been building. I'm still getting used to it, but it takes almost no effort at all to deploy. Recently, I built an app that would benefit from having a loading spinner upon submitting a form, but I couldn't quite...

TIL

Prefill And Stop Sequences

I revisited Eugene's excellent work, "Prompting Fundamentals and How to Apply Them Effectively". From it, I learned about the ability to prefill Claude's responses. Using this technique, you can quickly get Claude to output JSON without any negotiation and avoid issues with leading code fences...

TIL

Upload Multiple Images with FastHTML

I've been experimenting with FastHTML for making quick demo apps, often involving language models. It's a pretty simple but powerful framework, which allows me to deploy a client and server in a single main.py -- something I appreciate a lot for little projects I want to ship quickly. I currently...

Posts

VLMs Hallucinate

I've done some experimentation extracting structured data from documents using VLMs. A summary of one approach I've tried can be found in my repo, impulse. I've found using Protobufs to be a relatively effective approach for extracting values from documents. The high-level idea is you write a...