I am probably missing something, but when talking about "accuracy" .. how did you measure true and false negatives (thinking and not thinking about evals when not in an eval)?
What's your way of verification that quoted paragraphs don't contain mistakes? Was this process faster than just reading the article?
single therapeutic approach: antibiotics for bacterial pneumonia
not sure if you are suggesting that antibiotics are simple, but both S. pneumoniae and K. pneumoniae have strains resistant to a lot of stuff, so they need to be treated by whatever is going to work in a particular case and not just penicillin for everyone.. some pan-drug resistant superbugs are even treated as GLHF in a quarantine, if you can call it "therapeutic" approach
I wish I'd chosen coding for fun instead of coding for work and dooming for "fun" couple years ago.. But it seemed that delegation of work to tools will be more deterministic and reliable than managing people, oh well!
Do you track your subjective experience of tech debt, please? If I stop by in 1 year's time and ask for your measurements of tech debt accumulated since now till then compared to previous years, you will be able to tell me whether you still feel the improvement? Or you don't have any data about previous years and have not started to measure any notes or other metrics about the improved tech debt feelings either? Or something else?
still low on energy these days, so I should acknowledge that I am probably not supposed to feel like a museum piece by the comment about Before Times... but I don't remember ever having a thought in the shape of "this app should exist" myself, so yeah, I probably do feel like a museum piece now
as for the more-likely-intended genuine interest about my closest examples I can think of how I deal with these kinds of situations:
Is this wish compatible with not throwing away a free lunch?
Have we became so anti-social that the only 2 options are to do it alone or not at all?
I'm afraid that I do understand your point of view - I feel myself very exhausted for the last few years so I was not helping my friends in open source lately, so they opted for coding assistants instead and now when I see the code I feel recoil from the AI slop and I do not wish to return to the project. If they want things done and I don't "want" to help, what are their options?
Brave new world we live in, infinite productivity increase from zero to something for people who don't have time to became good at a craft, burnout for a few of us who used to be good and well paid but became overwhelmed by the ever-ready Waluiging incompetent assistant attractor.
eg when the whole point of function A is to call function B under certain conditions, Claude may just…forget to call function B. and not fix this, after repeated reminders.
aaaah 😱 how are there people who don't find this completely utterly insane to accept such a behaviour from a coding tool?
for me, it's like an elevator that "sometimes" jumped half a meter and then refused to go to some floors - I would call the emergency repair line if that happened and not try to excuse it that "it's so much more convenient than the stairs, even if you have to press the 6th floor button multiple times - it might drive you to the 12th floor first, 4th floor second, but it will almost certainly work on the 3rd try" ... and if I broke my leg (~didn't know how to program in some language), this unreliable elevator would sound MORE scary to me, not less
I think I must be missing some kind of adrenaline enthusiasm that makes me less excited around hype for an incompetent technology that will probably kill us all not long after it gets actually competent ... or just generally becoming a grumpy old man.
[emphasis mine]
thanks for the elevator pitch that I always wanted to say but my words always somehow ended up 10x longer 🙏
ha! I think I want to be the anti-hero in my meta-modern story about how the world works, and I think I now know what I want to write about next..