LESSWRONG
LW

2736
Aprillion
31461351
Message
Dialogue
Subscribe

https://peter.hozak.info

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
3Aprillion (Peter Hozák)'s Shortform
2y
4
The Memetics of AI Successionism
Aprillion16h10

(The core argument being something like: if you imagine minds significantly more powerful than ours, it is difficult to see why we would remain in control, and unlikely that the future would reflect our values by default).

[emphasis mine]

thanks for the elevator pitch that I always wanted to say but my words always somehow ended up 10x longer 🙏

Almost everyone wants to be the hero of their own story.

ha! I think I want to be the anti-hero in my meta-modern story about how the world works, and I think I now know what I want to write about next..

Reply
Sonnet 4.5's eval gaming seriously undermines alignment evals, and this seems caused by training on alignment evals
Aprillion1d32

I am probably missing something, but when talking about "accuracy" .. how did you measure true and false negatives (thinking and not thinking about evals when not in an eval)?

Reply
The Memetics of AI Successionism
Aprillion2d10

What's your way of verification that quoted paragraphs don't contain mistakes? Was this process faster than just reading the article?

Reply
Cancer has a surprising amount of detail
Aprillion3d20

single therapeutic approach: antibiotics for bacterial pneumonia

not sure if you are suggesting that antibiotics are simple, but both S. pneumoniae and K. pneumoniae have strains resistant to a lot of stuff, so they need to be treated by whatever is going to work in a particular case and not just penicillin for everyone.. some pan-drug resistant superbugs are even treated as GLHF in a quarantine, if you can call it "therapeutic" approach

Reply
I Vibecoded a Dispute Resolution App
Aprillion13d10

I wish I'd chosen coding for fun instead of coding for work and dooming for "fun" couple years ago.. But it seemed that delegation of work to tools will be more deterministic and reliable than managing people, oh well!

Reply
How I Became a 5x Engineer with Claude Code
Aprillion16d72

Do you track your subjective experience of tech debt, please? If I stop by in 1 year's time and ask for your measurements of tech debt accumulated since now till then compared to previous years, you will be able to tell me whether you still feel the improvement? Or you don't have any data about previous years and have not started to measure any notes or other metrics about the improved tech debt feelings either? Or something else?

Reply
I Vibecoded a Dispute Resolution App
Aprillion17d30

still low on energy these days, so I should acknowledge that I am probably not supposed to feel like a museum piece by the comment about Before Times... but I don't remember ever having a thought in the shape of "this app should exist" myself, so yeah, I probably do feel like a museum piece now

as for the more-likely-intended genuine interest about my closest examples I can think of how I deal with these kinds of situations:

  • if I spot a bug in software I am using and I have energy to report it with repro steps, I try to do that - not exactly asking a "buddy" but whatever, e.g. https://github.com/microsoft/TypeScript/issues/41707
  • if I randomly visit a friend and they are cooking dinner and they ask me if I want some, I say yes (..as a stereotypical cis man when it comes to free food)
  • when I visit a website a lot and their dark mode has big eye-piercing white button, I go ahead and add a style for myself using https://addons.mozilla.org/en-US/firefox/addon/styl-us/ to make that button dark:
  • when I wanted to see a spreadsheet of buildings in a game, I made one .. and why not open source it too even if probably no one else will ever use it .. https://peter.hozak.info/urbek/#?c0=Desert,c37=\+,
    • or long time ago when I joined https://github.com/odegroot/Anno-2070-data-extraction so that I could make the optimized layout shown as the first table row on https://anno2070.fandom.com/wiki/Eco_%26_Tycoon_Housing_Layouts#Without_Monument
  • in a recent PR, in the code slop from @mruwnik, I can see what was the tradeoff that the imperative DOM shit was simpler for the team at the time to do and that the code could have been improved "later" if needed for more complex features .. and now @Olivier Coutu with the help of coding assistants will need to parse my comment about potential memory leaks - is that situation better with coding assistants or would it have been better if the current team was forced to pay some human developer if they wanted new features to be developed? (although the "average" humans would probably produce slop code too)
  • if I have a dispute with my husband, we argue in person
  • if I have a dispute with my ex, we tend to have difficult conversations over whatsapp
    • is whatsapp a dispute app for me? no.
  • if I am interested to continue a conversation about how exactly we seem to misunderstand each other in a comment section of a lesswrong post, I write one more comment
    • is lesswrong a dispute app for me? maybe.
  • if someone could benefit from a vibe-coded dispute app even though I am clearly not the target audience myself, does that sound like a good thing to happen in the world?
    • yes
  • is vibecoding going to be a net benefit for the vibecoders and for humanity?
    • I've set up a calendar reminder for 1 year from now to ask @sarahconstantin that here .. no idea myself, just a big fat bias towards "oh god no"
Reply
johnswentworth's Shortform
Aprillion24d10

Is this wish compatible with not throwing away a free lunch?

Reply
I Vibecoded a Dispute Resolution App
Aprillion24d20

Have we became so anti-social that the only 2 options are to do it alone or not at all?

I'm afraid that I do understand your point of view - I feel myself very exhausted for the last few years so I was not helping my friends in open source lately, so they opted for coding assistants instead and now when I see the code I feel recoil from the AI slop and I do not wish to return to the project. If they want things done and I don't "want" to help, what are their options?

Brave new world we live in, infinite productivity increase from zero to something for people who don't have time to became good at a craft, burnout for a few of us who used to be good and well paid but became overwhelmed by the ever-ready Waluiging incompetent assistant attractor.

Reply
I Vibecoded a Dispute Resolution App
Aprillion1mo10

eg when the whole point of function A is to call function B under certain conditions, Claude may just…forget to call function B. and not fix this, after repeated reminders.

aaaah 😱 how are there people who don't find this completely utterly insane to accept such a behaviour from a coding tool? 

for me, it's like an elevator that "sometimes" jumped half a meter and then refused to go to some floors - I would call the emergency repair line if that happened and not try to excuse it that "it's so much more convenient than the stairs, even if you have to press the 6th floor button multiple times - it might drive you to the 12th floor first, 4th floor second, but it will almost certainly work on the 3rd try" ... and if I broke my leg (~didn't know how to program in some language), this unreliable elevator would sound MORE scary to me, not less

I think I must be missing some kind of adrenaline enthusiasm that makes me less excited around hype for an incompetent technology that will probably kill us all not long after it gets actually competent ... or just generally becoming a grumpy old man.

Reply
Load More
23Gamblification
2mo
15
13Meaning in life - should I have it? How did you find yours?
Q
3mo
Q
21
3Unnatural abstractions
1y
3
3Aprillion (Peter Hozák)'s Shortform
2y
4
4The Usefulness Paradigm
3y
4
41Why square errors?
3y
11
Roko's Basilisk
3 years ago
(+208/-272)