laserfiche — LessWrong

How Much Are LLMs Actually Boosting Real-World Programmer Productivity?

This is mostly covered by other comments, but I'll give my personal experience. I'm a programmer, and I have seen at least a 5x increase in programming speed. However, my output is limited by Amdahl's law. 50% of my process is coding, but 50% is in non-programming tasks that haven't been accelerated, meaning the overall process take 60% as long as originally.

What's more, that's just my portion. When I complete a project and hand it off, the executives can't absorb it any faster, and my next assignment doesn't come any faster. My team doesn't work from an infinite backlog. Notably, my code is much higher quality (no more "eh, I'll get to that eventually") and always exhibits best practices and proper documentation, since those elements are now free.

To take another example, I have used my accelerated programming speed to create several side project websites recently. Every part of this process has been accelerated, except for non-programming elements like spreading the word. That area is a bottleneck, and that's just my portion; even if I were keeping up with "marketing", there is a limit to how fast other parts of the information pipeline could absorb this new information and disseminate it, and how long adoption would take.

laserfiche's Shortform

laserfiche2y10

Yes, thank you, I think that's it exactly. I don't think that people are communicating this well when they are reporting predictions.

laserfiche's Shortform

laserfiche2y*-40

Are we misreporting p(doom)s?

I usually say that my p(doom) is 50%, but that doesn't mean the same thing that it does in a weather forecast.

In weather forecasts, the percentage states that they ran a series of simulations, and that percentage of simulations produced that result. A forecast of a 100% chance of rain, then, does not mean that there is near a 100% chance of rain. Forecasts still have error bars; 10 days out, a forecast will be wrong 50% of the time. Therefore, a 10 forecast of 100% chance of rain means that there is actually a 50%.

In my mental simulations, the outcome is bad 100% of the time. I can't construct a convincing scenario in my mind where things work out, at least contingent on the continued development of AI. But I know that there is much that I don't know, things I haven't yet considered, etc. Hence the 50% error margin. But like in the weather forecast, this can be misinterpreted as me thinking that 50% of the time it works out.

Is there a terminology that currently accounts for this? If not, does it mean that p(doom)s are being misunderstood, or reported with different meanings?

My views on “doom”

laserfiche3y10

Are you assuming that avoiding doom in this way will require a pivotal act? It seem absent policy intervention and societal change, even if some firms exhibit a proper amount of concern many others will not.

Don't die with dignity; instead play to your outs

laserfiche3y20

A similar principle I have about this situation is: Don't get too clever.

Don't do anything questionable or too complicated. If you do, you're just as likely to cause harm as to cause good. The psychological warfare campaign you've envisioned against OpenAI is going to backfire on you and undermine your team.

Keep it simple. Promote alignment research. Persuade your friends. Volunteer on one of the many relevant projects.

laserfiche's Shortform

laserfiche3y30

Upvoted, I agree with the gist of what you saying, with some caveats. I think I would have expected the two posts to end up with a score of 0 to 5, but there is a world of difference between a 5 and a -12.

It's worth noting that the example explainer you linked to doesn't appeal to me at all. And that's fine. It doesn't mean that there's something wrong with the argument, or with you, or with me. But it's important to note that it demonstrates a gap. I've read all the alignment material^[1], and I still see huge chunks of the population that will not be compelled by the existing arguments. Also, many of the arguments are outdated and are less applicable to the current state of events.

^{^}
https://docs.google.com/document/d/1zx_WpcwuT3Stpx8GJJHcvJLSgv6dLje0eslVKvuk1yQ/edit

laserfiche's Shortform

laserfiche3y*3-2

Under the tag of AI Safety Materials, 48 posts come up. There are exactly two posts by sprouts:

An example elevator pitch for AI doom Score: -8^[1]

On urgency, priority and collective reaction to AI-Risks: Part I Score: -12

These are also the only two posts with negative scores.

In both cases, it was the user's first post. For Denreik in particular you can tell that he suffered over it and put many hours into it.

Is it counterproductive to discourage new arrivals attempting to assist in the AI alignment effort?

Is there a systemic bias against new posters?

^{^}
Full disclosure, this was posted by me.

On urgency, priority and collective reaction to AI-Risks: Part I

laserfiche3y*20

Denreik, I think this is a quality post and I know you spent a lot of time on it. I found your paragraphs on threat complexity enlightening - it is in hindsight an obvious point that a sufficiently complex or subtle threat will be ignored by most people regardless of its certainty, and that is an important feature of the current situation.

An example elevator pitch for AI doom

laserfiche3y10

I agree that there are many situations where this cannot be used. But there appears at least to be a gap that arguments like this can fill that is missed by the existing explanations.

An example elevator pitch for AI doom

laserfiche3y30

I find those first two and Lethalities to be too long and complicated for convincing an uninitiated, marginally interested person. Zvi's Basics is actually my current preference along with stories like It Looks Like You're Trying To Take Over The World (Clippy).

LESSWRONG
LW

LESSWRONG
LW

Posts

Wikitag Contributions

Comments

Posts

Wikitag Contributions

Comments