I've previously left a comment describing my being fairly unimpressed with Claude Code Opus 4.5, and leaving a few guesses regarding what causes the difference in people's opinion regarding its usefulness. Eight more days into it, I have new comments and new guesses.
tl;dr: Very useful (or maybe I'm also deluding myself), very hard to use (or maybe I have skill issues). If you want to build something genuinely complicated with it, it's probably worth it, but it will be an uphill battle against superhuman-speed codebase rot, and you will need significant technical expertise and/or the ability to learn that expertise quickly.
First, what I'm attempting is to use it to implement an app that's not really all that complicated, but which is still pretty involved, still runs fairly nontrivial logic at the backend (along the lines of this comment), and about whose functionality I have precise desiderata and very few vibes.
Is Opus 4.5 helpful and significantly speeding me up? Yes, of that there is no doubt. Asking it questions about the packages to use, what tools they offer, and how I could architecture solutions to various problems I run into, is incredibly helpful. Its answers are so much more precise than Google's, it distill information so much better than raw code documentation does, and it's both orders of magnitude faster than StackOverflow and is smarter than the median answer you'd get there.
Is Claude Code helpful and speeding me up? That is more of an open question. Some loose thoughts:
To sum my current view up: Seems useful, but hard to use. You'll have to fight it/the decay it spreads in its wake every step of the way, and making a misstep will give your codebase lethal cancer.
We'll see how I feel about it in one more week, I suppose.
My impression is that your view is very rare on social media, but it almost completely agrees with mine so far (two exceptions are: fixing up 600 lines of tests that it wrote for me didn't took nearly as long as it would have taken for me to write the tests myself & i've no strong opinions on speed/ceiling of future progress), so I'm curious how it will evolve. If it's not too big of an ask, pleass reply to this comment when you have a new comment on the topic.
This is very similar to my current experience. Perhaps I'm holding it wrong, but when I try to use Claude Code, I find that it can usually implement a feature in a way that's as correct and efficient as I would write it, but it almost never implements it such that I am satisfied with the simplicity or organization. Typically I rewrite what it did and cut the LOC in half.
I am interested in trying out the new code simplifier to see whether it can do a good job. I have been asking Claude something like "while I test it, can you read over what you wrote and think about whether anything could be better or simpler?" and that catches a non-zero amount of issues but not the kind of substantial simplifications that it's missing.
I am interested in trying out the new code simplifier to see whether it can do a good job
Tried it out a couple times just now, it appears specialized for low-level, syntax-level rephrasings. It will inline functions and intermediate-variable computations that are only used once and try to distill if-else blocks into something more elegant, but it won't even attempt doing things at a higher level. Was very eager to remove Claude's own overly verbose/obvious comments, though. Very relatable.
Overall, it would be mildly useful in isolation, but I'm pretty sure you can get the same job done ten times faster using Haiku 4.5 or Composer-1 (Cursor's own blazing-fast LLM).
Curious if you get a different experience.
I note that the hype has been almost entirely Claude Code in particular, skipping over OpenAI’s Codex or Google’s Jules. Claude Code with Opus 4.5 is, for now, special.
One reason I don't use Codex is that I don't want to pay a subscription to OpenAI. I'm ambivalent about Jules but I think it's a bit worse than the alternative.
Claude Code with Opus 4.5 is so hot right now. The cool kids use it for everything.
They definitely use it for coding, often letting it write all of their code.
They also increasingly use it for everything else one can do with a computer.
Vas suggests using Claude Code as you would a mini-you/employee that lives in your computer and can do literally anything.
There’s this thread of people saying Claude Code with Opus 4.5 is AGI in various senses. I centrally don’t agree, but they definitely have a point.
If you’d like, you can use local Claude Code via Claude Desktop, documentation here. It’s a bit friendlier than the terminal and some people like it a lot more. Here is a more extensive basic discussion of setup options. The problem is the web interface still lacks some power user functions, even after some config work Daniel San misses branch management, create new repository directory via ‘new’ and import plugins from marketplaces.
If you haven’t checked Claude Code out, you need to check it out.
This could be you:
Table of Contents
Hype!
I note that the hype has been almost entirely Claude Code in particular, skipping over OpenAI’s Codex or Google’s Jules. Claude Code with Opus 4.5 is, for now, special.
Reports of productivity with Claude Code and Opus 4.5 are off the charts.
Oliver Habryka notices he is confused, and asks why one would use Claude Code rather than Cursor, given you get all the same parallelism and access either way, so as to integrate the same model with your IDE. Henry suggests that Anthropic now RLs for the Claude Code scaffolding in particular.
My Own Experiences
My experience coding has been that when I wanted to look at the code Cursor did seem like the way unless there was some price or performance difference, but also I’ve mostly stopped looking at the code and also it does seem like the model does way better work in Claude Code.
So far I’ve been working on two coding projects. I’ve been using the terminal over the web interface on the ‘skill up at doing this before you reject it’ theory and it’s been mostly fine although I find editing my prompts annoying.
One that I’ve started this past week is the reimplementation of my Aikido handicapping system. That’s teaching me a lot about the ways in which the things I did were anti-intuitive and difficult to find and fiddly, and required really strong discipline to make them work, even if the underlying concepts were conceptually simple.
At first I thought I was making good progress, and indeed I got something that ‘kind of worked’ remarkably fast and it did an amazing job finding and downloading data sources, which used to be a ton of work for me. That would have saved me a ton of time. But ultimately enough different things went wrong that I had my ‘no you can’t straight up vibe code this one’ moment. It’s too adversarial a space and too sensitive to mistakes, and I was trying to ‘fly too close to the sun’ in terms of not holding its hand.
That’s on me. What I actually need to do is go into an old computer, find a full version of the old program including its data, and then have Claude iterate from there.
The success finding and downloading data sources was exceedingly useful. I’m still processing the implications of being able to pull in essentially any data on the internet, whenever I have the urge to do that.
I also learned some of the importance of saying ‘put that in the claude.md file.’ Finally we have a clear consistent way to tell the AI how we want it to work, or what to remember, and it just works because files work.
The more important project, where it’s working wonders, is my Chrome extension.
The main things it does, noting I’m expanding this continuously:
It’s early days, there’s tons more to do, but that already adds up fast in saving time.
I’d managed to get some of the core functionality working using Cursor, using previous LLMs, while doing a lot of reading of code and manual fixing. Annoying, although still worthwhile. But when I tried to push things further, I ran into a wall, and I ran into a wall again when I tried to use Antigravity with Gemini 3.
When I tried using Claude Code with Opus 4.5, suddenly everything started working, usually on the first or second try. What I’ve implemented is particular to my own work, but I’d say it saves me on the order of 10 minutes a day at this point, is the only reason I’m able to post my articles to Twitter, and the gains are accelerating.
Before, I had a distinct desktop, so that when I was coding with Cursor I would be able to focus and avoid distractions.
Now I do the opposite, so I can be running Claude Code in the background while I do other things, and notice when it needs a push. Vastly higher productivity.
As I write this, I have multiple windows working.
I’m having Claude Code manage my Obsidian Vault, increasingly handle my Email, it’s downloaded an archive of all my posts so I can do analysis and search easily, and so on. It seems clear the sky’s the limit once you realize it has crossed the critical thresholds.
This morning I needed contact info for someone, asked it to find it, and it pulled it from a stored Certificate of Insurance. I definitely would not have found it.
I’m still in the stage where this is net negative for my observed output, since I’m spending a bunch of time organizing and laying groundwork, but that will change.
The main reason I’m not doing more is that I’m not doing a great job thinking of things I want to do with it. That’s on me, but with time it is getting fixed.
I’m in the early stages of spinning up non-coding Claude Code folders, starting with one where I had it download a copy of all of my writing for analysis. For most basic search purposes I already got similar functionality from a GPT, but this will over time be doing more than that.
I’m not zero scared to hook it up to my primary email and let it actually do things as opposed to being read only, but the gains seem worth it.
Now With More Recursive Self Improvement
Claude Code just upgraded to version 2.1.0, including this:
also:
Thus, if you have it create a skill for you or change an MCP server, you can now start using it without a reload.
There’s a ton of other things here too, most of them minor.
Claude Code creator Boris Cherney’s highlights are:
A Market Of One
Some Examples Of People Using Claude Code Recently
Molly Cantillon gives us an essay on her use that Tyler Cowen expects to be one of the most important of the year, entitled The Personal Panopticon. She’s got eight main instances running at all times, it’s paying for itself in cancelled subscriptions and by managing her trades and personal finances, and so much more.
The essay was presumably written by Claude, does that make it and the whole process involved more impressive or less?
She does indeed deny it but admits it would be surprising if she hadn’t:
I do not think this is ‘one of the most important essays of the year’ and expect a hell of a year, but if you need this kind of kick to see what the baby can do and have some ideas, then it’s pretty strong for that.
Pedram.md has Opus 4.5 build an orchestrator, expecting it to fail. It succeeds.
Zulali has Claude Code recover corrupted wedding footage.
Ryan Singer is teaching it technical spaing and breadboarding, from his Shape Up methodology, it’s a technique to design features abstractly using places, affordances and wires before coding starts.
Ryan McEntush creates BuildList 2.0, a website listing companies doing important work, in two days with Claude Code. As per usual with websites, nothing here seems hard once you have the concepts down, but speed kills.
Avery vibe coded an interactive particle playground where you move them using your hands. Emily Lambert also did something similar.
Jake Eaton gives Claude Code the raw data for his PhD, the calculating and writing up of which took him 3 months the first time, and it recreates a third of the whole thing in 20 minutes with a short prompt. When you look at exactly what it did nothing is particularly impressive, but think of the time I save.
If you want Claude to use Chrome, you now have at least three options: The official Claude Chrome extension, Chrome DevTools MCP and Playright MCP. I am partial to typing ‘claude —chrome.’
You can do quite a lot with that, if you trust the process:
Use Claude Code with Chrome to directly fight customer service and file an FCC claim. When more people are doing this we’re going to have Levels of Friction issues.
Mehul Mohan points out that ideally many of us would have Claude Code running 24/7, in the background, doing various forms of work or research for potential use later. That wouldn’t be cheap, but it could well be cheap compared to the cost of your time, once you get it working well.
Dealing With Context Limits
One issue Claude Code users share is compaction.
When you hit auto-compact, Claude Code does its best to condense the prior conversation and keep going, but you will lose important context. Daniel San disabled auto-compaction for this reason, instead choosing to restart sessions if and when limits get hit.
Many replied with some form of the claim that if you ever hit auto-compaction it means you did not manage your hooks, commands and subagents correctly.
My experience is that, at minimum, when you get into the danger zone you want to ‘rescue’ important context into files.
Daniel Sen also shares his other configuration settings.
The Basic Claude Code Setup
Boris Cherny, creator of Claude Code, shows us how he uses it.
He calls his setup ‘basic.’ So yes, to many this now counts as basic:
Random Claude Code Extension Examples I’ve Seen Recently
Claude Code team gives us their code-simplifier agent:
Claude Canvas gives Claude an external ‘monitor’ space for the user to see things.
Claude Code Docs tells Claude about itself so it can suggest its own upgrades, he suggests most value comes from finding new hooks.
CallMe lets you talk to Claude Code on the phone, and have it ping you when it needs your feedback.
That’s not how I roll at all, but different strokes, you know?
Claude HUD shows you better info: Remaining context, currently executing tools and subagents, and claude’s to-do list progress.
Or have it do a skill itself, such as here where Riley Brown asks it to hook itself up to Nana Banana, so it does. Or you can grab that skill here, if you’d prefer.
Skilling Up
Claude Code is a blank canvas. Skill and configuration very clearly matter a lot.
So, how does one improve, whether you’re coding or doing other things entirely?
Robert Long asks for the best guides. The only piece of useful advice was to follow the Claude Code team itself, as in Boris Cherny and Ado. There is clearly lots of good stuff there, but that’s not very systematic.
Ado offers a guide to getting started and to the most powerful features. Here are some:
Petr Baudis suggests allowing most commands with notably rare exceptions.
A version of this seems logical for most uses, if you assume the system isn’t actively trying to work around you? Most of the things that go really wrong involve rm, git or curl, but also prompting on every git is going to get old fast.
My Twitter public mostly was fine with flat out dangerously skipping permissions for personal use:
Here’s a ‘Twitter slop’ style article about vibe coding that still has good core basic info. The key insight here is that it’s not about coding, it’s about communication, and specifying exactly what you want your code to do, as if you’re telling someone completely unfamiliar with your context, and having it do this one concrete step at a time and testing those steps as you go.
The process Elena is describing here should work great for ‘build something simple for your own use’ but very obviously won’t work for bigger projects.
Similar good basic advice from Dave Karsten is ‘treat it exactly as you would a junior employee you are giving these instructions to.’
Dan McAteer gives a super basic first two minute guide for non-coders.
Nader Dabit here gives a low-level guide to building agents with the Claude Agent SDK, listed partly for utility but largely to contrast it with ‘tell Claude Code to do it.’
Some people use voice dictation and have ditched keyboards. This seems crazy to me, but they swear by it, and it is at least an option.
Anthony Morris suggests you throw caution to the wind in the sense that you should stop micromanaging, delegate to the AIs, run a lot of instances and if it messes up just run it again. This is presumably The Way once you’re used to it, if you are conserving tokens aggressively on things that don’t scale you are presumably doing it wrong given what your time costs versus what tokens cost.
Another basic piece of advice is, whatever you want, ask for it, because you might well get it that way.
You can use /config to set your output style to Default, Explanatory or Learning, where Learning has it prompt you to write code sometimes. You can also create your own custom style.
Reasons Not To Get Overexcited
Like my attempt to reimplement Aikido, when you iterate in detail in domains you know well, you see the ways in which you can’t fully trust the results or feedback, and when you need precision in any non-standard way you need to specify your requirements extremely precisely.
Another danger is that a lot of the things that ‘feel productive’ might not be.
On books, I would say that a book that ‘deserves to be a book’ can’t be summarized by an AI, and the ones ‘worth reading’ have to be read slowly or you don’t get the point, but you have limited time and can read almost zero percent of all books, and a lot of books that people discuss or get influenced by do not fall into either category.
As in, for any given non-fiction book, it will mostly fall into one of five categories. A very similar set of rules applies to papers.
So you need to know which one you are dealing with, and respond accordingly.
On organizing your notes or files, or otherwise trying to set yourself up for better productivity, that may or may not be a good use of time. At minimum it is a good excuse to skill up your use of Claude Code and other similar things.
Get Pretty Excited
Seriously, get pretty excited. Claude Code might not be the best tool for you, or for any particular job, but it’s rapidly becoming unacceptable to only use chatbots, or only use chatbots and Cursor-like IDEs.
Things are escalating quickly. Don’t get left behind.