VLM data extraction with Protobufs
In light of OpenAI releasing structured output in the model API, let's move output structuring another level up the stack to the microservice/RPC level.
In light of OpenAI releasing structured output in the model API, let's move output structuring another level up the stack to the microservice/RPC level.
In Python, the most straightforward path to implementing a gRPC server for a Protobuf service is to use protoc to generate code that can be imported in a server, which then defines the service logic.
Using models for various different purposes daily has been a satisfying endeavor for me because they can be used as tools to help make your vision for something come to life. Models are powerful generators that can produce code, writing, images and more based on a user's description of what they...
I attempted to reproduce the results for one task from the VLMs are Blind paper. Specifically, Task 1: Counting line intersections. I ran 150 examples of lines generated by the code from the project with line thickness 4.
I'm trying something a bit new, writing some of my thoughts about how the future might look based on patterns I've been observing lately.
I spent some time working with Claude Artifacts for the first time. I started with this prompt I want to see what you can do. Can you please create a 2d rendering of fluid moving around obstacles of different shapes?
Temporal provides helpful primitives called Workflows and Activities for orchestrating processes. A common pattern I've found useful is the ability to run multiple "child workflows" in parallel from a single "parent" workflow.
I spent some time experimenting with multi-modal model (also called vision models on the ollama site) to see how they perform. You try these out with the CLI ollama run <model> but I opted to use the ollama Python client.
One of my favorite things to do with language models is to use them to write code. I've been wanting to build a variation on tic-tac-toe involving a bit of game theory. I called it "Tactic". I wasn't even really sure if the game would be any more interesting than tic-tac-toe itself, which reliably...
Model-based aggregators