English is a Terrible Programming Language—And other reasons AI won't displace programmers

[-]gbear6053y183

I think you're totally spot on about ChatGPT and near term LLMs. The technology is still super far away from anything that could actually replace a programmer because of all of the complexities involved.

Where I think you go wrong is looking at the long term future AIs. As a black box, at work I take in instructions on Slack (text), look at the existing code and documentation (text), and produce merge requests, documentation, and requests for more detailed requirements (text). Nothing there requires some essentially human element - the AI just needs to be good at guessing what requirements the product team and customers want and then asking questions and running tests to further divine how the product should work. If specifying a piece of software in English is a nightmare, then your boss's job is already a nightmare, since that's what they do. The key is that they can give a specification, answer questions about the specification, and review implementations of that specification along the way, and those are all things that an AI could do.

I'm already an intelligence that takes in English specifications and produces code, and there's no fundamental reason that my intelligence can't be replaced by an artificial one.

[-]Shmi3y60

Thanks for writing this up! I have been in software development for longer than most people here have been around, and you are absolutely right, over the last several decades the majority of the work shifted from writing new code to figuring out how to best connect the pieces of the puzzle that is a hodgepodge of APIs, idiosyncratic implementations, poorly documented messages and undocumented gotchas. There is usually a million ways to accomplish the same task, too, all different in terms of performance, scalability, costs and applicability, and knowing which to pick is what differentiates a senior SWE from a beginner.

That said, I expect the whole paradigm to shift in the short order. Even now you can tell ChatGPT something like

Evaluate this pseudocode:
BinaryTreeInPreorderForm = {1 2 4 5 3}
BinaryTreeInPostOrderForm = PreorderToPostOrder(BinaryTreeInPreorderForm)
print BinaryTreeInPostOrderForm

and it will generate a working python code, and then execute it!

You can also ask it to generate unit tests for the PreorderToPostOrder() function, and it will do a passable job. You can further ask it to add a unit test for a specific test case, and it will do that, too. You can even go deeper and ask it to figure out what test cases might be missing, and then add them. You can request a performance estimate in the big O notation. It will often make mistakes and hallucinate answers, but it is very close to being at the level of a junior SWE at this point, and in many regards much better. Also, much cheaper. It can automate a lot of mundane work for you, and find answers to questions that would be not easy or time-consuming to look up online. It also excels at documenting your design and code, something human programmers suck at and hate doing. It can also look for common pitfalls and test them.

What I am getting at is, while LLMs is unlikely to replace a senior SWE... at least in 2023, they will eat away at the bottom end, interns and junior programmers. The team leads can instruct the LLM the same way they instruct their team, and increasingly more usable and reliable applications will be popping up. At some point soon, the whole idea of a "high-level programming language" will go the way of "Assembler". You will talk to LLM in English, it will do the rest. One can call it "prompt hacking", but it is really a new high-level language that is much more human-friendly.

In your example:

The work was about 80% reading the API’s documentation, 18% configuring my API keys and downloading the example project and things like that, and 2% writing code to hook everything up.

most of the work is actually mundane and automatable, such as extracting information from the API docs, following the standard protocol for API key configuration and creating the glue between APIs. The real work is the remaining 1% that is instructing your LLMule to do the heavy lifting.

Using a code-generating AI instead of a programming language would simply mean that your job is figuring out how to use natural language to specify software instead of a programming language, and that wouldn’t be an improvement.

Well, my contention is that it would be a vast improvement.

Basically, the state of the field I expect to see is that the repositories would not consist of C/python/Java code, but of LLM instructions. Moreover, the LLMs can read these instructions and optimize them, too! Not yet, probably not this year, but soon enough.

Software is hard. Computers are difficult, finicky, alien things. Programming languages are our most promising source of power over them. I imagine a world where, instead of hiring programmers, managers simply tell AIs what they want in plain English, then pat themselves on the back for saving so much on payroll; now the manager is the programmer, and he’s writing code in English: and I laugh to myself heartily.

Well, yes to the first two, for sure. No to the rest. AI bots, not programming languages "are our most promising source of power over them". And where you "laugh hysterically", I nod and feel that this time cannot come soon enough. Having started my career handcrafting Assembler and FORTRAN, I would be most gratified to see these monsters and their descendants go the way of the dinosaurs that they are. I might be wrong, and maybe there are some severe obstacles there, but I would give more than even odds that the jobs currently performed by interns, junior and intermediate SWE, QA department, and a big chunk of IT support will fade away in the next 5 years or so.

[-]Richard_Kennaway3y*60

At some point soon, the whole idea of a "high-level programming language" will go the way of "Assembler". You will talk to LLM in English, it will do the rest. One can call it "prompt hacking", but it is really a new high-level language that is much more human-friendly.

When will it replace the Night Watch?

A person who can debug a device driver or a distributed system is a person who can be trusted in a Hobbesian nightmare of breathtaking scope; a systems programmer has seen the terrors of the world and understood the intrinsic horror of existence.

— James Mickens

ETA: H/t Eliezer for the link, which he cited in a related context.

[-]Shmi3y40

Hah, a great link! To be fair, we will not rid ourselves of monsters, we will replace them with different monsters, which may or may not eventually finish us off.

[-]ChristianKl3y40

The assumption that ChatGPT can't do more than just write code is already wrong today. It's decent for telling you options of various packages that might solve your problem and give you pro and cons for each of them.

Given the way ChatGPT works in particular, it's bad at reading through existing code and finding a bug. As work is going on to move from a "one prompt"-"one answer model" to giving agent instructions and the agent taking multiple actions in succession, it can read through the code to search for the bug.

OpenAI already had WebGPT that was an agent that could go out and read on the web to find an sources to have a good answer that's just not hallucinated. At the moment it's quite unclear what a model that can freely browse the documentation and existing code and write new code will be able to do.

[-]quetzal_rainbow3y20

I don't know how widespread this problem is, but I often find myself unable to just write code even if it is a really boilerplate. I have a perfect vision in my head of what and how my code should do, but I can't translate it into code strings on screen and I need to literally take piece of paper, write down all the algorithm in natural language and then transform it into program string-by-string because otherwise I am eerily stuck. I think that for me ChatGPT should be really useful.

[-]β-redex3y10

I disagree with English (in principle at least) being inadequate for software specification.

For any commercial software, the specification basically is just "make profit for this company". The rest is implementation detail.

(Obviously this is an absurd example, but it illustrates how you can express abstractions in English that you can't in C++.)

[-]β-redex3y10

I don't think the comparison of giving a LLM instructions and expecting correct code to be output is fair. You are vastly overestimating the competence of human programmers: when was the last time you wrote perfectly correct code on the very first try?

Giving the LLM the ability to run its code and modify it until it thinks its right would be a much fairer comparison. And if, as you say, writing unit tests is easy for a LLM, wouldn't that just make this trial-and-error loop trivial? You can just bang the LLM against the problem until the unit tests pass.

(And this process obviously won't produce bug-free code, but humans don't do that in the first place either.)

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

26

English is a Terrible Programming Language—And other reasons AI won't displace programmers

26

26