I reproduced Josh's claude-3.5-sonnet mirror test. I hadn't realized gpt-4 and claude-3-opus had also been "passing" this test since back in March. More interesting still, Sonnet actually seems to...
I spent some time experimenting with OpenDevin using claude-3-opus (I couldn't find an easy way to use claude-3.5-sonnet). The agentic capabilities were not bad. I gave a prompt and behind the...
I weirdly was running into an issue where whenever a ⌘F search didn't return a result, my screen would flash white. It was irritating me for several days. Fortunately, I was able to find a solution...
I'm trying to avoid buying too much into the hype (maybe it's too late), but here are several folks talking about their notably impressive experiences with...
As I noted in this post, I am going to spend more time interacting with smaller models to try and build more intuition for how LLMs behave and the different flavors in which they respond. Today, I...
I enabled Cursor's Copilot++ today. Magical. So much better predictive capabilities than Copilot. The way it anticipates my needs is pretty cool. Edit: It's not great for writing markdown or only...
I enjoyed reading Jordan's post, a walk down memory lane of his career so far through a series of emails. He includes things like following up on internship opportunities, negotiating, and meeting...
For the first time in a while I used iTunes. I mean the Music app, sorry. I clicked on the album art while I was playing a song and the app switch to mini-player mode. I...didn't see what I could...
Eugene's article on prompting is one of the best things I've read recently, full stop. As also noted by Kyle, try starting with 0.8 temperature and lower only if necessary, even for deterministic use...
A few years old, excellent response written by Maxim about extracting the most value from Temporal by using it "as a service mesh for invocations of child workflows and activities hosted by different...