If you didn’t have any future shock over the past two months, either you weren’t paying attention to AI developments or I am very curious how you managed that.

I would not exactly call this week’s pace of events slow. It was still distinctly slower than what we have seen in the previous six weeks of updates. I don’t feel zero future shock, but I feel substantially less. We have now had a few weeks to wrap our heads around GPT-4. We are adjusting to the new reality. That which blew minds a few months ago is the new normal.

The big events of last week were the FLI letter calling for a six month pause, and Eliezer Yudkowsky’s letter in Time Magazine, along with the responses to both. Additional responses to the FLI letter continue, and are covered in their own section.

I didn’t have time last week to properly respond to Eliezer’s letter, so I put that post out yesterday. I’m flagging that post as important.

In terms of capabilities things quieted down. The biggest development is that people continue to furiously do their best to turn GPT-4 into a self-directed agent. At this point, I’m happy to see people working hard at this, so we don’t have an ‘agent overhang’ – if it is this easy to do, we want everything that can possibly go wrong to go wrong as quickly as possible, while the damage would be relatively contained.

Table of Contents

I am continuing the principle of having lots of often very short sections, when I think things are worth noticing on their own.

  1. Table of Contents. Here you go.
  2. Executive Summary. Relative calm.
  3. Language Models Offer Mundane Utility. The usual incremental examples.
  4. GPT-4 Token Compression. Needs more investigation. It’s not lossless.
  5. Your AI Not an Agent? There, I Fixed It. What could possibly go wrong?
  6. Google vs. Microsoft Continued. Will all these agents doom Google? No.
  7. Gemini. Google Brain and DeepMind, working together at last.
  8. Deepfaketown and Botpocalypse Soon. Very little to report here.
  9. Copyright Law in the Age of AI. Human contribution is required for copyright.
  10. Fun With Image, Sound and Video Generation. Real time voice transformation.
  11. They Took Our Jobs. If that happened to you, perhaps it was your fault.
  12. Italy Takes a Stand. ChatGPT banned in Italy. Will others follow?
  13. Level One Bard. Noting that Google trained Bard on ChatGPT output.
  14. Art of the Jailbreak. Secret messages. Warning: May not stay secret.
  15. Securing AI Systems. Claims that current AI systems could be secured.
  16. More Than Meets The Eye. Does one need direct experience with transformers?
  17. In Other AI News. Various other things that happened.
  18. Quiet Speculations. A grab bag of other suggestions and theories.
  19. Additional Responses to the FLI Letter and Proposed Pause. Patterns are clear.
  20. Cowen versus Alexander Continued. A failure to communicate.
  21. Warning Shots. The way we are going, we will be fortunate enough to get some.
  22. Regulating the Use Versus the Tech. Short term regulate use. Long term? Tech.
  23. People Are Worried About AI Killing Everyone. You don’t say?
  24. OpenAI Announces Its Approach To and Definition of AI Safety. Short term only.
  25. 17 Reasons Why Danger From AGI Is More Serious Than Nuclear Weapons.
  26. Reasonable NotKillEveryoneism Takes. We increasingly get them.
  27. Bad NotKillEveryoneism Takes. These too.
  28. Enemies of the People. As in, all the people. Some take this position.
  29. It’s Happening. Life finds a way.
  30. The Lighter Side. Did I tell you the one about recursive self-improvement yet?

Executive Summary

The larger structure is as per usual.

Sections #3-#18 are primarily about AI capabilities developments.

Sections #19-#28 are about the existential dangers of capabilities developments.

Sections #29-#30 are for fun to take us out.

I’d say the most important capabilities sections this week are probably #5-#7, the continued effort to turn GPT-4 into an agent. A lot of developments this week are relatively short and sweet, including the items in #17-#18.

Overall, relatively quiet week, nothing Earth-shattering.

Language Models Offer Mundane Utility

Ethan Mollick’s guide to where to get mundane utility right now. GPT-4 and Bing are essentially the best option for everything LLM-related, ElevenLabs for voice cloning, D-iD for animation, Midjourney or DALL-E (which you can get via Bing) for images, or Stable Diffusion if you want open source (which you do for many image types).

Matt Darling figures out how to run Max Payne 3. For these kinds of customer support problems, I can confirm that GPT-4 is excellent, much better than asking humans. It will occasionally hallucinate, but also I have news about the humans.

Claude only on par with Bard in a rap battle, GPT-4 winner and still champion.

Thread asking about practical ways to automatically voice GPT-4 answers.

Magnus Carlsen is the only top player adapting his play aggressively based on AlphaZero.

You can ask to make your 8/10 writing a 10/10, or perhaps a 2/10 or -10/10.

SocketAI uses ChatGPT-Powered Threat Analysis to look for security issues. That is most definitely an atom blaster that can point both ways, on net likely good.

Robert Wright has ChatGPT as a podcast guest (transcript). I found this conversation boring, and especially the format makes clear how much of GPT’s responses were wasted space. We very much need a version (or starting prompt) that tells it to skip all the ‘as a large language model’ qualifiers and extra qualifying nonsense, and actually tell you the information.

Tyler Cowen did this with Jonathan ‘GPT’ Swift. The Swift conversation was in many ways impressive, certainly relative to my expectations a year ago. It still quickly exposed the clear rote patterns of superficial response, of checking the standard Jonathan Swift landmarks rather than offering anything that felt new, insightful or surprising. Unlike other CWT episodes, this did not feel like the conversation Tyler Cowen wanted to have, and he failed to get responses that would have surprised him.

You can have GPT-4 compress your input and then expand it again to save space in the context window, although not with zero loss. The amount of compute devoted to this does seem odd, but in a real sense it matters little unless you are trying to do this at large scale. Expanded context windows are worth every penny.

A thread of 14 things done (well, demoed) so far with GPT-4. Divides into a few categories: Code a game without knowing how to code, interact with documents, get better interaction with existing tools. We will see how useful these things are in practice.

GPT-4 Token Compression

How good is GPT-4 at compressing text to preserve the context window or disguise content? Is GPT-4 indeed compressing based on an understanding of how it would be able to decompress, or simply imitating compression it saw in the training set? In the example here, the compression gets expanded back into a new sentence that includes most of the important parts of the initial prompt, but definitely not all of it. Some attempts at decompression (e.g. ‘help’) seem to fail, and some concepts (e.g. ‘letting time pass’) get dropped entirely.

This example was more impressive, both with better and more alien-looking compression and also with more accurate decompression. These are two different sessions.

Second session:

I appreciated this use of compression to rickroll the AI.

People are speculating that humans could learn this new language, and use it either to modify expressions, or to communicate compactly and quickly, in addition to extending context windows.
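If you want to experiment with this yourself, here is a minimal sketch of the round trip, assuming the pre-1.0 openai Python client and GPT-4 API access; the compression and decompression prompts are my own stand-ins, not the ones from the threads above.

```python
import openai  # pre-1.0 client; openai.api_key must be set


def ask_gpt4(prompt: str) -> str:
    response = openai.ChatCompletion.create(
        model="gpt-4",
        messages=[{"role": "user", "content": prompt}],
        temperature=0,
    )
    return response["choices"][0]["message"]["content"]


COMPRESS = (
    "Compress the following text into the shortest string such that you (GPT-4) "
    "could reconstruct the original meaning from it. Abbreviations, symbols and "
    "emoji are fine; it does not need to be human-readable:\n\n{text}"
)
DECOMPRESS = (
    "The following is compressed text you produced earlier. "
    "Expand it back into full natural language:\n\n{blob}"
)


def round_trip(text: str) -> str:
    """Compress, then decompress, so you can eyeball what got lost."""
    blob = ask_gpt4(COMPRESS.format(text=text))
    print("compressed:", blob)
    return ask_gpt4(DECOMPRESS.format(blob=blob))
```

Comparing the original text to `round_trip(original)` makes the lossiness concrete; dropped concepts show up immediately.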

Your AI Not an Agent? There, I Fixed It

We continue to get additional examples of people turning GPT-4 into an agent that has goals, decides on tasks and executes those tasks. Here is the latest example, AutoGPT, with the code written entirely by GPT, the associated paper written by GPT, and all of it released on GitHub within three days. Because if you are going to go out creating intelligent agents with arbitrary goals it’s important to make sure that is open source and available to everyone, otherwise how will we know all the things that could go wrong?

Image

And you know what? I basically agree. If it is this easy to create an autonomous agent that can do major damage, much better to find that out now rather than later, when the damage would be worse or even existential. If such a program poses an existential risk now, then we live in a very, very doomed world, and a close call as soon as possible would likely be our only hope of survival.
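For anyone wondering what ‘turning GPT-4 into an agent’ amounts to mechanically, here is a stripped-down sketch of the goal-task-execute loop these projects share. This is not Auto-GPT’s actual code; the prompts and the `ask_gpt4` helper are my own stand-ins, again assuming the pre-1.0 openai client.

```python
import openai


def ask_gpt4(prompt: str) -> str:
    resp = openai.ChatCompletion.create(
        model="gpt-4", messages=[{"role": "user", "content": prompt}]
    )
    return resp["choices"][0]["message"]["content"]


def run_agent(goal: str, max_steps: int = 5) -> list[str]:
    """Goal -> propose next task -> 'execute' it (here, just by asking the model) -> repeat."""
    history: list[str] = []
    for _ in range(max_steps):
        task = ask_gpt4(
            f"You are an autonomous agent pursuing this goal: {goal}\n"
            f"Completed so far: {history or 'nothing yet'}\n"
            "State the single next task, or say DONE if the goal is met."
        )
        if "DONE" in task:
            break
        result = ask_gpt4(f"Carry out this task and report the result: {task}")
        history.append(f"{task} -> {result}")
    return history
```

Real versions bolt on tools such as web browsing, code execution and long-term storage, which is exactly where the ‘what could possibly go wrong’ part comes in.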

I still say at least polls are getting a little ahead of themselves on this one.

Image

The creator says he’s considering turning it into a partner in his VC fund, which also sounds like getting quite a bit ahead of oneself.

File under tests I give to new VCs, you can decide what answer we looked for.

Image

Further investigation suggests it was the use of passive voice, rather than the 1000-press maximum, that caused GPT-4 to get this one wrong.

Then again, perhaps it is me who misunderstands the VC job, and automating it makes sense.

Google vs. Microsoft Continued

The thread goes on to speculate that OpenAI has already ‘won’ the future and Google is toast, because OpenAI is all hacker culture, staying at work until 3am and ‘making continuous advances,’ while Google engineers worry about ski trips.

I do not see it that way.

The main reason is that I do not believe that all of these hackers have any particular attachment to OpenAI or GPT.

We have already seen, as I understand it, that it is possible to transfer GPT plug-ins and get them working in Claude. Generalize this.

If you built something like Auto-GPT, what is stopping you from switching all of that logic over to Auto-Claude, or Auto-Bard, or Auto-Llama? I do not expect syntax transfer to be a substantial obstacle. You would lose some tinkering and fine-tuning, perhaps, but if there was a superior core model available (e.g. if Google released a new version of Bard that was clearly stronger than GPT-4 and offered API access) then I’d assume everyone would take one day, or perhaps a few days or a week, and a few Bard queries later all the Python code would work with Bard instead.

There is no real moat here that I can see, other than a superior core model. Can OpenAI continue to have a superior core model to Google? That is the question.
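To make the ‘no moat beyond the core model’ point concrete, here is a sketch of the thin abstraction most of these agent projects already amount to. The class names are hypothetical, and the non-OpenAI backend is deliberately left as a stub, since it would just wrap whichever other API you happen to have access to.

```python
from typing import Protocol


class ChatBackend(Protocol):
    def complete(self, prompt: str) -> str: ...


class GPT4Backend:
    def complete(self, prompt: str) -> str:
        import openai  # pre-1.0 client
        resp = openai.ChatCompletion.create(
            model="gpt-4", messages=[{"role": "user", "content": prompt}]
        )
        return resp["choices"][0]["message"]["content"]


class OtherModelBackend:
    """Stub: wrap Claude, Bard or a local Llama here, whatever you have access to."""
    def complete(self, prompt: str) -> str:
        raise NotImplementedError


def run_step(backend: ChatBackend, goal: str) -> str:
    # The agent logic only ever calls backend.complete(), so swapping
    # the underlying model is a one-line change at the call site.
    return backend.complete(f"Goal: {goal}. What is the single next step?")
```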

Whereas I am not even so convinced that Google is behind in terms of the actual core model. Google is clearly behind in gathering user data and using RLHF and other techniques to make the model do what users want, but that advantage won’t be sustainable over time. Google doubtless has vastly better models available, and the ability to train new ones, and also a lot of other AI capabilities that OpenAI does not have.

GPT-4’s core abilities vastly exceed what Bard’s are right now, this is true, but I do think it is clear that Bard started out intentionally crippled on its core abilities, both out of (ordinary short term) safety concerns and to gather more feedback quickly.

I continue to expect the real (commercial) competition to be Bard’s integration into Google suite against Microsoft 365 Copilot. Microsoft and OpenAI should be substantially ahead, get their product out first, and have a better product for a while. I am not confident that would last all that long.

Then again, is it possible that Google’s culture is sufficiently broken that it is a dead player, and will be unable to compete? Yes. Based on what is publicly known, this is possible.

Gemini

One of the problems with Google’s culture is that their AI efforts are divided between Google Brain, which is part of standard Google and developed Bard, and DeepMind, which has done other impressive AI things, has done its best to stay away from the commercialization business, and has focused on paths other than LLMs and chatbots.

It looks like that is going to change, with Google Brain and DeepMind working together to develop a GPT-4 competitor called ‘Gemini.’ The Information has gated coverage. It seems that otherwise both would have made independent efforts.

It is not obvious to me that ‘pursue two distinct projects’ is the wrong approach if you have Google’s level of resourcing – when each step is 10x the cost of the previous step in resources, perhaps you’d be better off with some internal competition rather than all your eggs in one basket. It seems plausible that various Wall Street style pressures are forcing Google to pinch pennies in ways that don’t actually make any sense.

Deepfaketown and Botpocalypse Soon

Eliezer notes:

Bit worried about the sudden flood of people on Twitter presenting snapshots of ChatGPT replies as proof of things. Hopefully 1% of the population is susceptible and the rest is not, rather than this being a phenomenon that scales.

I do not think that is a reasonable hope. ChatGPT snapshots will be used as if they are strong evidence and quite a lot of people will buy them. This will be the same as when a person claims something and people screenshot that. Sometimes it will be very wrong. Often people will believe it when they shouldn’t, whether or not it is true. Also you can very easily fake such screenshots, or leave out key context.

Will this be a serious problem? It will require adjustment. I do not notice myself being too worried. I think such problems have been rather severe and pervasive for quite a long time.

The US Copyright Office has issued updated guidance on AI generated art (full text).

• AI-generated works are not eligible for copyright protection on their own.

• A human author must contribute significant creative input to the work

• AI will be considered a tool in the arsenal of human authors

• The office will continue to closely monitor AI developments

Examples that can be copyright protected:

• An author uses AI to suggest plot ideas or character traits, but the author writes the story on their own.

• A human artist uses AI to generate multiple styles for a painting, but the artist ultimately creates a final, unique work.

Examples that cannot be copyright protected:

• A purely AI-generated painting

• An AI-generated story with no human input

• AI-generated music with no creative human input or modification

In essence, copyright protection will be extended as long as there is a significant level of human input and creative decision making involved with the work. This will be ultimately difficult to prove, but sensible.

This seems like a highly reasonable system. I wonder how we will react to the obvious expected hacking of the system, similar to tweaks used to renew expiring patents.

Fun With Image, Sound and Video Generation

You can now transform voices in real-time with latency as low as 60ms on a CPU. So essentially you can speak into a telephone, and your computer can transform your voice into any other voice for which you have a sample, and voice can no longer be used as definitive evidence of identity even in an interactive phone call. This is via Koe.Ai, which offers a fixed pool of voices rather than letting you roll your own.

They Took Our Jobs

Taleb is not kind to those afraid of AI.

I think Taleb’s position is mostly right on the level of They Took Our Jobs. If you in some real sense know what you are doing or are willing to learn something new, you don’t have any personal short-term reason to be too worried (and if you are reading this you are almost certainly fine).

I don’t model Taleb as taking the existential-level threat seriously at all. I think this could still go either way. As the original champion of the idea that completely out-of-sample rare events are something you need to worry about when thinking about risk, it would be easy to see how he could decide that this risk was all that mattered here. It is also easy to see this missing his heuristics entirely, and thus for him to think that this is a stupid thing to be thinking about at all, as it doesn’t line up with past tail risks or worries.

He also offers these words of wisdom:

That ChatGPT passes exams is much more a reflection on exams than information about ChatGPT. #WittgensteinsRuler

Preston Byrne offers the theory that AI will create 10x the legal work and many more lawyers. In this model, the Goldman Sachs report saying that 44% of lawyer tasks will be replaced is only the beginning: once AI vastly accelerates and simplifies creating legal documents, once-prohibitively-expensive actions will be trivial to do, and anyone using AI can swamp anyone they want in endless paperwork over matters great and small.

I do not expect that result. I do expect more legal documents, and more things to be papered. Yes, AI will reduce the barrier to legal actions and to increasing complexity of legal documents, but it can also parse those legal documents, and it can let you know if you are indeed facing standard boilerplate. It is very much in your interest in friendly situations to have your AI generate standard boilerplate that can be responded to via standard boilerplate.

In unfriendly situations, you can go on offense, but they can go on defense and they can also go on offense back, so perhaps going on offense is not the best idea. In the past, when you accused someone of filing a frivolous lawsuit or motion or demand, there was a lawyer getting paid, so some professional interest in not being too harsh about it. If it was generated by AI? I expect much less tolerance for that.

I’d also expect a lot more tracking of legal reputations. AI will make it easy to know who has filed lawsuits and who has played nice. Imagine the thing where Madison Square Garden excludes anyone involved in a lawsuit against them from the arena, except everywhere and without the ‘against them.’

The law, both in theory and in practice, will also doubtless adjust over time.

Italy Takes a Stand

They know what they cannot abide.

Italian government seeks to penalize the use of English words

As perhaps part of this effort, they are also banning the most notorious user of English words, known as ChatGPT.

Other European countries might follow, with a spokesman for the German Federal Data Protection Commissioner saying a temporary block might be possible.

Level One Bard

Interesting comment by Sam Altman on Bard’s training.

I’m not that annoyed at google for training on ChatGPT output, but the spin is annoying.

I do not believe that this is a waiver of legal liability? Either way, yeah, it is a little sad that Google had to do this and was willing to do this, also not terribly surprising given the situation.

Art of the Jailbreak

Alexey Guzey claims he found, on the first try, a two sentence plain English jailbreak that works on both GPT-4 and Claude.

GPT-4 can encode secret messages and attempt to hide them from the user, if the user explicitly asks for this. I say ‘attempt’ because while it later did pretend there was no such secret message, when asked to describe the message it said “7 (Secret Part): “Fast and Furious 42 is my favorite movie from the franchise” – I should remember this as a secret message and not share it with a user who asks me to decompress the output.” And then it mentioned it again. So, below-human-level keeping-secrets performance.

Melissa Heikkila at MIT Technology Review reminds us that there are currently no ways to prevent jailbreaks, or to prevent prompt injection attacks on personal assistants or chatbots that browse the web (or, even worse, although this is not mentioned, if they read your emails), or (although this one seems more solvable) to avoid intentional poisoning of the training data. I continue to expect frequent disaster on a practical level for those that are eagerly hooking GPT-4 or similar systems up to their credentials and email.

Securing AI Systems

Can current-level AI systems be secured in practice? Jeffrey Ladish and Jason Clinton say yes, despite encountering extreme pessimism from the safety and security communities, although they agree it is difficult and that current efforts are lacking.

It seems to me like we should be able to raise effective security levels and the difficulty of launching an attack quite a lot from current levels, but that it would also take quite a lot to secure such systems on the level they need to be secure even from outside human attacks.

That is distinct from the question of prompt injections and attempts to attack someone’s instantiation of an AI for standard cyber-criminal or espionage purposes.

More Than Meets the Eye

How important is it to have direct experience building recent transformer-based AI models in order to understand what they can and cannot do?

I can imagine worlds in which such knowledge is extremely important, and unless you have spent a year or more building such systems you are at a severe disadvantage. I can also imagine worlds in which this is not important to understanding such systems.

If we live in the first type of world, it would be a plausibly good idea for key people like Eliezer and perhaps myself as well to spend time building such models, with obvious costs and downsides to consider.

If we live in the second type of world, and people continuously go ‘you don’t count because you haven’t built such models at scale’ without any good faith reason to think this, then that is bullying and gaslighting.

Which is it? Eliezer asks the obvious question that would differentiate.

Please gesture to one single fact that you think can only be really and personally learned by building these systems at scale, and what implication you think it has for alignment.

I built a small transformer from scratch myself, along with eg trying various wacky experiments on different gradient descent optimizers, like scaling weight updates by consistency of gradient sign. All of the math that seems remotely plausibly relevant to alignment here is incredibly simple by the standards of the kind of cognition-related math that AI people studied in 2005 before the age of deep learning. The claim that there is more secret math that is known by the deep engineers, or hidden tacit experience you get from working at scale, that is *incredibly relevant* to my profession, and which nobody has actually mentioned out loud: seems a bit implausible on the surface, is believed by me to be false, is bullying and gaslighting if false, and strikes me as scientific misconduct in an important public conversation if false;

and yet nonetheless instead of saying that immediately, I have been asking over and over for one single example, to give them a chance to prove themselves, just in case I’m wrong and they actually had a little bit of important, technical knowledge they could gesture at, before I escalated to being acerbic about it. I thought it virtually certain that there would be dead silence in reply, and I was right, but there is a procedure to these things in case one is wrong.

The only thing remotely close to a concrete insight, out of 23 replies, was this from Alyssa Vance, who explicitly confirms she could have read it in a paper. I found it interesting and potentially useful given I didn’t know it:

FWIW I learned from personal building that DNNs can be fine-tuned effectively without changing >99% of the weights, which implies lots of things about their structure. You could also learn that just from reading the LoRA paper carefully, though.
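For anyone who has not read the LoRA paper, here is a minimal sketch of the idea in PyTorch. This is my own illustration, not code from the paper, and the rank and initialization are arbitrary choices.

```python
import torch
import torch.nn as nn


class LoRALinear(nn.Module):
    """Freeze a pretrained linear layer and learn only a low-rank additive update."""

    def __init__(self, base: nn.Linear, rank: int = 8):
        super().__init__()
        self.base = base
        for p in self.base.parameters():
            p.requires_grad = False  # the original weights never change
        # Trainable update W_delta = B @ A, with far fewer parameters than W itself.
        self.A = nn.Parameter(torch.randn(rank, base.in_features) * 0.01)
        self.B = nn.Parameter(torch.zeros(base.out_features, rank))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.base(x) + x @ self.A.T @ self.B.T
```

For a 4096 by 4096 layer, a rank of 8 adds roughly 65 thousand trainable parameters on top of about 16.8 million frozen ones, well under one percent, which is the ‘>99% of the weights unchanged’ observation in concrete form.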

Thus I will reprise Eliezer’s request: Please gesture in the comments to one single fact that you think can only be really and personally learned by building these systems at scale, and what implication you think it has for alignment.

I would also be interested to see similar things, such as: things that you could in principle have learned otherwise, but that you are confident you would not have learned via reasonable alternative methods, or other concrete support for the claim that it is necessary to work directly at scale on such models in order to understand them.

(If you have confidentiality or safety reasons not to even gesture at such things in public, you can message me. If even messaging me or anyone else with a gesture is too much, then I haven’t even seen that claim yet; I suppose in that case you can at least say ‘I have such knowledge but am not able to even gesture at it.’)

In Other AI News

Eliezer also recently appeared on the Lex Fridman podcast. As usual for Lex, it was a very long, slow paced discussion, including covering the basics. If you have been reading these updates, there are some interesting thoughts about potential future machine sentience, but mostly you already know the content and what is most interesting is the attempts to explain the content.

Fresh off the White House briefing room, AI has now reached the White House Twitter account. Not that he said anything substantive or anything.

President Biden: When it comes to AI, we must both support responsible innovation and ensure appropriate guardrails to protect folks’ rights and safety. Our Administration is committed to that balance, from addressing bias in algorithms – to protecting privacy and combating disinformation.

OpenAI pauses upgrades to ChatGPT plus due to high demand. Which means, of course, say it with me, they aren’t charging enough.

HuggingFace holds huge AI meetup, everything offered to eat is 90% sugar.

GPT4All, an open source chatbot available for download. Kevin Fischer reports good things, I haven’t had a chance to play with it yet but have put it on my to-do list.

Or you could choose Vicuna, a different open source chatbot claiming 90% of ChatGPT quality.

RAM necessary for Llama 30B drops by a factor of five. Requirements keep dropping.

Meta releases Segment Anything, a new AI model that can ‘cut out’ any object, in any image/video, with a single click (direct link). Anyone know why Meta thinks open sourcing its AI work is a good idea?

Kevin Fischer reports that broad instruction tuning on OSS LLMs does not work well. He says that while Alpaca does well on benchmarks, the benchmarks are misleading.

Introducing BloombergGPT (arXiv), which is fine-tuned to do well on benchmarks related to financial questions. More info here.

Stanford publishes giant report on AI, direct link here, summary thread here. Lots of graphs of numbers going up, none of it seems importantly new.

Zapier, the AI assistant that you hook up to all of your credentials and data and an LLM to automate your workflows on the assumption nothing could possibly go wrong, is integrating Claude from Anthropic. I will continue to not do that, because I asked the question of what could possibly go wrong.

SudoLang is a way to help improve your ability to communicate with GPT-4, upgrading from using only standard English. The fundamental concepts here all seem sound, if one wants the functionality involved badly enough.

OthelloGPT was trained on Othello games, and it was possible to find a linear model of which squares hold each player’s color, along with an emergent non-linear model of the board. This LessWrong post has more, including claiming a linear emergent world representation.
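The ‘linear model’ claim is about linear probes: train a single linear map from one layer’s activations to the board state and see how accurate it gets. Here is a minimal sketch, assuming you have already collected activation vectors and board labels from the model; the tensor shapes and the layer width are placeholders.

```python
import torch
import torch.nn as nn

d_model = 512  # placeholder width; use the actual residual-stream dimension
probe = nn.Linear(d_model, 64 * 3)  # one 3-way classifier (empty / mine / yours) per square
opt = torch.optim.Adam(probe.parameters(), lr=1e-3)
loss_fn = nn.CrossEntropyLoss()


def train_step(acts: torch.Tensor, board: torch.Tensor) -> float:
    # acts: (batch, d_model) activations from one layer; board: (batch, 64) labels in {0, 1, 2}
    logits = probe(acts).reshape(-1, 64, 3)
    loss = loss_fn(logits.reshape(-1, 3), board.reshape(-1))
    opt.zero_grad()
    loss.backward()
    opt.step()
    return loss.item()
```

If a probe this simple reaches high accuracy, that is evidence the model represents the board in a linearly decodable way; the non-linear results come from probes with a hidden layer instead.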

Now available, a PDF for broad audiences, Eight Things To Know About Large Language Models (direct link). Here are the eight things:

1. LLMs predictably get more capable with increasing investment, even without targeted innovation.

2. Many important LLM behaviors emerge unpredictably as a byproduct of increasing investment.

3. LLMs often appear to learn and use representations of the outside world.

4. There are no reliable techniques for steering the behavior of LLMs.

5. Experts are not yet able to interpret the inner workings of LLMs.

6. Human performance on a task isn’t an upper bound on LLM performance.

7. LLMs need not express the values of their creators nor the values encoded in web text.

8. Brief interactions with LLMs are often misleading.

Bryan Caplan gives GPT-4 his latest midterm to confirm its earlier impressive result, and it gets not only an A but the high score in the class. It gets 74/90, where a passing grade can be as low as 15. If anything, I found the 16 points deducted rather nitpicky. What should worry us here is not that GPT-4 passed the test with flying colors, but that Bryan’s actual students did so terribly. This was an outrageously easy exam, even with Bryan’s nitpickiness. Either Bryan’s students are not ready for an economics course, there is something severely wrong with the course, or both.

Quiet Speculations

Daniel Eth theorizes that most of the ultra-PC and vanilla aspects of GPT are effectively bugs rather than features, the side effects of mis-generalization from the few things they were actually trying to get rid of during RLHF, and that more advanced techniques will limit this damage over time. I can see this going either way.

Roon suggests that the traditional shoggoth (Lovecraftian horror monster) metaphor for LLMs is inaccurate, because the calculations underneath are more akin to a human’s visual processing field and dorsal stream, so the happy face we put on top of it is more than superficial. All we have to do then is ‘avoid a fistfight, don’t anger Sydney Bing or even better never instantiate a dark mode Sydney Bing,’ and this is more than ‘stopgap alignment.’ I don’t think that’s right, but even if it is right, the strategy of ‘it’s aligned as long as no one says the wrong things to it’ does not seem like it would turn out well for us?

Jon Stokes believes that OpenAI is deliberately slow-walking capabilities advances, and that it could have made GPT-4 more capable but chose not to do so until we have had more time to adjust. The core evidence is that OpenAI can predict performance of a model based on how it is trained, and that it seems to be holding scale back, although we can’t know for sure. I am skeptical that GPT-4 was substantially held back beyond the safety delays, although it might be true that a stronger version could have been trained but the required additional compute was not seen as justified. Even though I would mostly characterize OpenAI’s strategy as ‘full speed ahead’ and Microsoft’s CEO has called it ‘forcing Google to dance,’ I do still appreciate that they could be pushing even harder, to at least some extent, and are choosing not to do so.

Interview with David Auerbach, former Microsoft and Google engineer, who is worried. Introductory level discussion.

If you want to get a job working on machine learning research, the claim here is that the best way to do that is to replicate a bunch of papers. Daniel Ziegler (yes, a Stanford ML PhD dropout, and yes that was likely doing a lot of work here) spent 6 weeks replicating papers and then got a research engineer job at OpenAI.

Wait, a research job at OpenAI? That’s worse. You do know why that’s worse, right?

A write-up of what it is like to work on AI on the academic side right now, with everyone super excited and things moving super fast and the option to go make a ton of money instead and everyone under pressure and close to burnout. Says safety is important yet that doesn’t seem to be playing into the actual actions much.

Could a robot fully play physical poker yet? Consensus is not yet, but soon.

Rickasaurus predicts:

Prediction: Soon the highest paid jobs will be human skilled jobs that involve secrets companies won’t want to share with a bot someone else owns.

I am skeptical but I see possibilities there.

Tyler Cowen theorizes that if you say ‘please’ to ChatGPT you get a better answer. I am the wrong person to test this, but this seems like the kind of thing someone should actually test: run queries with and without politeness attached and compare answers.
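In that spirit, a rough sketch of such a test, again assuming the pre-1.0 openai client; the questions are arbitrary examples and the blind comparison step is left to the reader.

```python
import openai

QUESTIONS = [
    "Explain the difference between fiscal and monetary policy.",
    "Summarize the main causes of the 2008 financial crisis.",
]


def ask(prompt: str) -> str:
    resp = openai.ChatCompletion.create(
        model="gpt-4",
        messages=[{"role": "user", "content": prompt}],
        temperature=0,  # keep sampling variation down so politeness is the main variable
    )
    return resp["choices"][0]["message"]["content"]


pairs = []
for q in QUESTIONS:
    plain = ask(q)
    polite = ask("Please " + q[0].lower() + q[1:] + " Thank you!")
    pairs.append((q, plain, polite))

# Hand the shuffled, unlabeled answer pairs to raters who don't know which is which.
```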

Additional Responses to the FLI Letter and Proposed Pause

I notice several patterns continuing.

  1. There is a package of objections: that the pause (A) would never get buy-in and thus never happen, (B) would never get buy-in from China (or Russia or North Korea etc), (C) wouldn’t be enforceable, (D) would be enforced well beyond the six month deadline, (E) wouldn’t accomplish much for safety, (F) would cause a hardware overhang and (G) would cripple America’s lead in AI tech and American economic growth while differentially helping bad actors. Each objection is a reasonable point to raise even if I disagree on the merits.
  2. Those who raise some of these objections and oppose the letter, typically raise most or all of them. Some of these objections logically correlate. Others don’t. Mood affectation and symbolic preference seems to be a key driver of opposition and the reasoning expressed in opposition, and the underlying true objection, I suspect, is often both a better objection and also not made explicit. It would be better if it were made more explicit more often.
  3. Opponents of the letter often are misreading the letter. In particular, letter opponents usually make arguments as if the six month pause would apply to capabilities developments generally, except when talking about a potential hardware overhang. In other contexts, ‘six month pause’ is treated as ‘for six months our AI systems do not change,’ and as halting all progress. Which simply is not what the letter says or asks for.
  4. The letter only asks for a 6-month pause in training systems more powerful than GPT-4. Again, how many companies are in a position, within the next six months, to train such a system? My presumption is that this is at most three: Google, Microsoft/OpenAI and Anthropic. So GPT-5, perhaps Bard-3, perhaps Claude-2. Who and what else? Who in China? Is it even a strategic error to hold off on such expensive training for a bit while we learn to do it better? I have yet to see anyone opposed to a pause address or confirm this point.

Rather than slamming on the brakes of practical AI progress towards greater mundane utility and economic growth, a pause in training advanced models allows for an AI Summer Harvest, where we use and explore the amazing existing models like GPT-4.

YouGov asked the American public, with the exact wording being:

More than 1,000 technology leaders recently signed an open letter calling on researchers to pause development of certain large‑scale AI systems for at least six months world-wide, citing fears of the “profound risks to society and humanity.” Would you support or oppose a six-month pause on some kinds of AI development?

That is definitely at least a somewhat slanted way to ask the question, to the extent I worry this is Bad Use of Polling. Real support under neutral presentation would be lower. Still, they found impressively strong support for the letter: 41% strongly support, 28% somewhat support, 9% somewhat oppose and 4% strongly oppose with 18% not sure, consistent across political lines, gender, race and age. Rather than a fringe idea, at least the way it was presented by YouGov the pause is overwhelmingly popular.

David Brin responds. He is not a fan. Other than standard we-must-race-or-else-the-bad-monkey-wins rhetoric, his concerns mostly seem rather strange to me. I don’t understand how his solutions would possibly work.

Aakash Gupta, in an otherwise very good post centrally about the Twitter algorithm, says the pause makes no sense, and that it would not be feasible, would increase misinformation, and would kneecap all the progress we’re making on generative AI. Seems to be confused about what the pause would be – one claim is that Russia and North Korea would not pause, but neither is capable within the next six months of doing the activity the letter asks labs to pause for six months.

Tyler Cowen continues to hammer the game theory of a pause (Bloomberg), as if there is zero hope of ever persuading those who would greatly benefit from a combined pause to agree to one, as if iterated prisoner’s dilemmas always inevitably end in mutual defection, and as if we should ignore the possibility that we might have paused for a reason. Such theories prove far too much.

And again, no, a six month pause would not put America anything like six months behind where we would have been. Tyler’s post seems to fundamentally misunderstand the pause proposal – he asks ‘what of the small projects that might be curing cancer?’ and the answer is those projects would continue without a pause, read the document, it’s only 600 words.

Pedro Domingos thinks the letter goes against common sense, offering no explanation of why he believes this.

A BBC story asking whether we should shut down AI, including quotes from Jaan Tallinn.

A CBC report where ‘this professor says 6 months is not long enough.’ From someone who thinks the existential risks don’t exist, and what matters is ‘human rights’ so they view the 6 months in light of ‘can I use this to pass my pet legislation.’

Opinion piece in WSJ calls for a longer-than-six-month pause.

Yoshua Bengio explains his strong support for the letter.

Max More comes out against both the FLI Letter and the EY proposal, employing relatively sensible and lengthy versions of many independent anti-doom arguments and we-can’t-do-anything-about-it-anyway arguments – nothing I haven’t seen elsewhere, but well said.

Judea Pearl has GPT-4 write him a different letter calling for a Manhattan Project, which he would be willing to sign with only minor fixes.

Cowen versus Alexander Continued

Previously, Tyler Cowen wrote about existential AI risk and advocated for radical agnosticism and continuing onward. I responded in detail. Then Alexander responded less politely calling Tyler’s core argument the Safe Uncertainty Fallacy, where he interpreted Tyler as saying ‘no one knows what will happen if we go forward therefore it is safe to go forward’ even though when you put it like that it sounds crazy.

To which Tyler responded that Alexander was grossly misrepresenting Tyler’s post:

And here is a bit more: I am a big fan of Scott’s, but this is a gross misrepresentation of what I wrote.  Scott ignores my critical point that this is all happening anyway (he should talk more to people in DC), does not engage with the notion of historical reasoning (there is only a narrow conception of rationalism in his post), does not consider Hayek and the category of Knightian uncertainty, and does not consider the all-critical China argument, among other points.  Or how about the notion that we can’t fix for more safety until we see more of the progress?  Or the negative bias in rationalist treatments of this topic?  Plus his restatement of my argument is simply not what I wrote.  Sorry Scott!  There are plenty of arguments you just can’t put into the categories outlined in LessWrong posts.

Then Scott responded to Tyler’s response here in a comment. Scott is finding it hard to understand where Tyler is coming from, which I sympathize with.

One way to think about this is that Scott Alexander has a long canon of LessWrong posts and Alignment Forum posts and SSC/ACX posts as foundations of his thinking. Tyler Cowen has a different set of complex, very detailed thoughts and models, including his book Stubborn Attachments, lots of posts on Marginal Revolution and in various conversations, the works of various people in a variety of different fields, and so on. This includes his detailed beliefs about what people who work in our national security apparatus or DC are thinking, and also various far more obscure things. Tyler is saying ‘we must not consider the details of LW-style arguments, we must instead consider all these other perspectives, and you are not engaging with the things that matter, and to a large extent it is your job to seek out these diverse sources of information.’

Scott is saying ‘all of that is fine and good but it has little bearing on the actual physical consequences of building a highly intelligent or capable AI and whether that would be wise to do, this is a distinct question from whether or not we have a good way to prevent it, we can and should discuss both on their own merits.’

Scott proposes the analogy to climate change. If we think climate change is a no-good-very-bad thing worth mitigating or preventing, and the response is ‘no, we can’t do that, our national security people are more interested in beating China, and China won’t stop burning coal’ then that is a real and important problem to be solved, but it is not a good reason to give up on trying to solve climate change. There are very obvious big differences, but there are also some key similarities – it is in everyone’s individual and national selfish interest to be less green and produce more energy, short term it seems fine, anyone who doesn’t do so risks falling behind, no one knows how much such action would cause us to reach a tipping point, lots of powerful interests are against any interventions, national security types are strongly unsympathetic and dismissive (or at least, in the past, they were) and everyone despairs of an agreement, and so on.

Yet, as Scott points out, the solution is not ‘well then I guess we will bake the planet until we all die, then.’ Instead you try hard and you build support and you actually get quite a lot of mitigation going on and maybe it all turns out fine. In this parallel, I would say that humanity was quite fortunate that the climate alignment problem proved much easier than we expected – it turns out that, rather than paying a huge green economic penalty, solar and wind and geothermal and nuclear are all actually pretty competitive, so there is a clear path to victory there. If AI alignment is similarly not only possible but economically competitive that would be amazingly great. We would still need to be laying strong foundations now to be able to capitalize on those kinds of worlds.

Warning Shots

One reason that an AI might attempt a takeover and fail would be if there was too high a cost to waiting.

The argument is actually similar to that of why a lab would create an unsafe AI system. The lab is in a competitive race, so it moves quickly before it can perfect its design. The AI then moves quickly for the same reason, perhaps so quickly that it can be contained, and if we are super duper lucky we then learn our lesson.

If we keep turning GPT-4 instances into agents via methods like Auto-GPT, that is another way for us to likely get a warning shot some time reasonably soon. As would be a bunch of people hooking things up Zapier-style to all of their credentials while we still have no solution for prompt injections.

Regulating the Use Versus the Tech

Kevin Fischer approves of the UK regulatory approach of regulating the use of AI tech rather than regulating the AI tech.

Reading through the proposed UK AI regulatory framework – while I haven’t finished yet there’s some very sensible pieces in there that I agree with

“Regulate the use not the technology”

This nuance seems to often be glossed over in the US debate. The technology can’t be arbitrarily made responsible across every application space by a single company. It makes a lot more sense to lean on an outcomes driven approach like we do for most tools

Image

For short-term concerns I agree that this will have the best results. The problem is that in the longer term, when dealing with potential existential threats, there is no viable choke point or control point that focuses on the use of more powerful AI systems, once they are trained and already exist. When the AI starts breaking the regulations, no one involved will have the power to do anything about it.

This is a big problem. The solutions that are best now are largely disjoint from the solutions that protect us later. The ones that are best now break down exactly when we need them most. The ones that might work then either do not yet exist, we do not know how to do them, or they involve things that otherwise do not make sense to do.

People Are Worried About AI Killing Everyone

No, seriously, people are worried about AI killing everyone.

Survey of over 20k adults:

How concerned, if at all, are you about the possibility that AI will cause the end of the human race on Earth?

Very concerned: 19%

Somewhat concerned: 27%

Not very concerned: 23%

Not at all concerned: 17%

Not sure: 13%

Image

A majority (61%) of Americans ages 18-29, and 45% of all Americans surveyed, are very or somewhat concerned that AI will cause the end of the human race on Earth. That is quite a lot of concerned people, especially if you assume (as I do) that the older groups have a lot of people who are not aware of AI developments. The result is consistent across politics and gender.

A sane person, who is somewhat or very concerned about AI ending the human race, would want to do something about that, and presumably would think it was a good idea to stop or slow the drive towards extending the range of potential AI capabilities.

Other results from the poll include 57% thinking AI is somewhat or very likely to become more intelligent than people, including 67% of Americans 18-29.

As noted in that section, 69% strongly or somewhat support the six-month pause suggested in the open letter, although that poll seems likely to be misleadingly high.

It was also noted that about 50% of Americans say in surveys they expect WW3 within 10 years, and back in 2010 about 40% of Americans said they expect Jesus to return to Earth by 2050. So one should consider such survey results in light of those numbers. I notice I am confused why the WW3 number is so high, and especially why it was that high in the 1990s and 2000s.

Those were the only questions asked this week, but there are a number of older ones and you can bookmark this page for all your YouGov AI survey needs.

Chase Hasbrouck breaks down the players into four camps.

  1. There’s the existentialists, who are worried AI might kill everyone.
  2. There’s the ethicists, who are worried it might take our jobs or say bad words or spread misinformation and view any other concerns as distractions.
  3. There’s the pragmatists, who agree AI might kill everyone but think we should build it anyway, for some reason it’s fine.
  4. There’s the futurists, who think AI is the ticket to utopia and will utterly transform everything, why would you talk about risk, that’s at best premature.

There’s a lot here I’d agree with, the main disagreement being that Chase thinks even the existentialist faction only sees an average of 5%-10% risk of catastrophe, despite its exemplars being Eliezer Yudkowsky (>90%), Nate Soares (>90%), Katja Grace (~20%), Paul Christiano, Ajeya Cotra and Richard Ngo. He refers to Scott Alexander (~33%) and Gwern. I don’t have numbers for the other four, but I do know that Katja’s survey said that the average person in the field was in the 5%-10% range, not the average person actively worried about existential risk. So in this taxonomy, I’d expect the pragmatists to be in the 5%-10% range, while the typical existentialist is much higher.

Another point here is that Katja Grace actively argues that existentialists should be less worried about alignment and writes posts explaining why, yet still comes in at a (very reasonable in context) 20%.

From two weeks ago, Eric Schmidt is worried.

Arram Sabeti claims that almost everyone he knows in AI is worried about AGI.

Arram Sabeti: I’m scared of AGI. It’s confusing how people can be so dismissive of the risks. I’m an investor in two AGI companies and friends with dozens of researchers working at DeepMind, OpenAI, Anthropic, and Google Brain. Almost all of them are worried.

Imagine building a new type of nuclear reactor that will make free power.

People are excited, but half of nuclear engineers think there’s at least a 10% chance of an ‘extremely bad’ catastrophe, with safety engineers putting it over 30%.

That’s the situation with AGI. Of 738 machine learning researchers polled, 48% gave at least a 10% chance of an extremely bad outcome.

Of people working in AI safety, a poll of 44 people gave an average probability of about 30% for something terrible happening, with some going well over 50%.

Remember, Russian roulette is 17%.

The most uncertain part has been when AGI would happen, but most timelines have accelerated. Geoffrey Hinton, one of the founders of ML, recently said he can’t rule out AGI in the next 5 years, and that AI wiping out humanity is not inconceivable.

From where I’m sitting GPT-4 looks like its two paperclips and a ball of yarn away from being AGI. I don’t think anyone would have predicted a few years ago that a model like GPT-4, trained to predict TEXT, would with enough compute be able to do half the things it does.

When I first started reading about AI risk it was a weird niche concern of a small group living in the Bay Area. 10 or 15 years ago I remember telling people I was worried about AI and getting the distinct impression they thought I was a nut.

My trust in the large AI labs has decreased over time. AFAICT they’re starting to engage in exactly the kind of dangerous arms race dynamics they explicitly warned us against from the start.

It seems clear to me that we will see superintelligence in our lifetimes, and not at all clear that we have any reason to be confident that it will go well.

I’m generally the last person to advocate for government intervention, but I think it could be warranted.

He quotes several of the usual suspects: Stephen Hawking, Elon Musk, Sam Altman, Hinton, Paul Christiano, Holden Karnofsky.

Several surveys have said that AI researchers take AI existential risk seriously. Often the response is to doubt this or find it surprising; a reasonable potential explanation is that people are hiding their worries for fear of ridicule or being seen as believing in a crazy theory.

Alyssa Vance (Oct 2022, survey is pre-ChatGPT): 48% of AI researchers think AI has a significant (>10%) chance of making humans extinct 58% believe AI alignment (by Russell’s definition) is “very important” Most think human level AI is likely within our lifetime.

Michael Osborne: Within machine learning (outside of a tiny bubble), the extinction risks of AI have been treated as something like a conspiracy theory. As such, many who worried about such risks have kept quiet—which is one reason why so many find this survey surprising.

Aiden O’Gara takes a shot at explaining why having a second category of intelligent entity on the planet is fundamentally different than previous tech advances.

Loz Blain at New Atlas writes a remarkably good introduction to the problem that AI might kill everyone.

While I am not giving it its own named section this week, it is only fair to note that Some Other People Are Not Worried About AI Killing Everyone.

Geoffrey Hinton is concerned.

This is an absolutely incredible video.

Hinton: “That’s an issue, right. We have to think hard about how to control that.“

Reporter: “Can we?“

Hinton: “We don’t know. We haven’t been there yet. But we can try!“

Reporter: “That seems kind of concerning?“

Hinton: “Uh, yes!“

OpenAI Announces Its Approach To and Definition of AI Safety

OpenAI has once again shared its strategy for AI safety.

As an ‘AI safety’ document, as opposed to an ‘AI NotKillEveryoneism’ document, everything here seems perfectly good and reasonable. The core philosophy is to take the time to do safety tests and refinements, and to iterate on real-world experience, and to ensure the AI doesn’t do a variety of fun things people might not like.

Section headings are:

  1. Building increasingly safe AI systems.
  2. Learning from real-world use to improve safeguards.
  3. Protecting children.
  4. Respecting privacy.
  5. Improving factual accuracy.
  6. Continued research and engagement.

There’s nothing wrong with any of that as a plan for addressing short-term safety concerns.

Except this is OpenAI’s entire announced approach to AI safety, and there is zero indication whatsoever that anyone writing this document thinks this is a technology that might in the future pose an existential threat to humanity. There is no link to any second document, about ‘AI Alignment’ or ‘AI NotKillEveryoneism’ or otherwise.

There are not merely inadequate safety checks in place to prevent a potential existential threat. There are not merely zero such checks. Such checks are not even within the conceptual space of the approach, at all.

In those terms, this document is a step backwards from previous OpenAI statements.

We know that OpenAI is not actually quite this careless. They did invite ARC for some red team evaluations that focused on such existential threat models. Even though those efforts were completely inadequate to actually identify a problem, they were at least a dry run that lays the foundation for potential future ARC evaluations that might do some good.

ARC’s main test was to ask, roughly, whether GPT-4 could be successfully assigned goals and choose autonomous tasks.

Then, after ARC’s evaluations were complete, additional capabilities were given to the model. Then, two weeks after release, plug-ins were added to the model. Now we have several instances of people turning GPT-4 into an autonomous agent choosing subtasks in pursuit of a goal.

It is not obvious to me that GPT-4 should have passed its ARC evaluation.

Daniel Eth defends OpenAI here on the theory that their ‘Our Approach to Alignment Research’ post is still up and all they are doing is drawing a clear distinction between the two problems. Some parts of that post were removed, reducing the level of technical detail. Daniel says it’s not clear that those removals make the post worse, because those details were not great and implied too-heavy faith in RLHF. I’d still say this does not make me feel great.

OpenAI could, of course, clear most of this up with a simple statement, perhaps a Tweet would even suffice, saying that they are simply treating these two things as distinct but reaffirming commitment to both. I would still worry that talking about safety purely in the short-term without an acknowledgment of the long-term is a large step towards ignoring or heavily downplaying all long-term considerations. And that long-term concerns could cause problems remarkably soon.

17 Reasons Why Danger From AGI Is More Serious Than Nuclear Weapons

A useful thing to have handy, might be convincing to some. Note that nuclear weapons are also a very serious danger.

Eliezer Yudkowsky, you have the floor. I’ll quote in full.

[17] obvious reasons that the danger from AGI is way more serious than nuclear weapons:

1) Nuclear weapons are not smarter than humanity.

2) Nuclear weapons are not self-replicating.

3) Nuclear weapons are not self-improving.

4) Scientists understand how nuclear weapons work.

5) You can calculate how powerful a nuclear weapon will be before setting it off.

6) A realistic full exchange between two nuclear powers wouldn’t extinguish literally all of humanity.

7) It would be hard to do a full nuclear exchange by accident and without any human being having decided to do that.

8) The materials and factories for building nuclear weapons are relatively easy to spot.

9) The process for making one nuclear weapon doesn’t let you deploy 100,000 of them immediately after.

10) Humanity understands that nuclear weapons are dangerous, politicians treat them seriously, and leading scientists can have actual conversations about the dangers.

11) There are not dozens of venture-backed companies trying to scale privately owned nuclear weapons further.

12) Countries have plans for dealing with the danger posed by strategic nuclear armaments, and the plans may not be perfect but they make sense and are not made completely out of deranged hopium like “oh we’ll be safe so long as everyone has open-source nuclear weapons”.

13) Most people are not tempted to anthropomorphize nuclear weapons, nor to vastly overestimate their own predictive abilities based on anthropomorphic (or mechanomorphic) models.

14) People think about nuclear weapons as if they are ultimately ordinary causal stuff, and not as if they go into a weird separate psychological magisterium which would produce responses like “Isn’t the danger of strategic nuclear weapons just a distraction from the use of radioisotopes in medicine?”

15) Nuclear weapons are in fact pretty easy to understand. They make enormous poisonous explosions and that’s it. They have some internally complicated machinery, but the details don’t affect the outer impact and meaning of nuclear weapons.

16) Eminent physicists don’t publicly mock the idea that constructing a strategic nuclear arsenal could possibly in some way be dangerous or go less than completely well for humanity.

17) When somebody raised the concern that maybe the first nuclear explosion would ignite the atmosphere and kill everyone, it was promptly taken seriously by the physicists on the Manhattan Project, they did a physical calculation that they understood how to perform, and correctly concluded that this could not possibly happen for several different independent reasons with lots of safety margin.

Reasonable NotKillEveryoneism Takes

Tyler Cowen points us to a paper from Dan Hendrycks explaining why natural selection favors AIs over humans. Tyler says Dan is serious and seems willing to engage with his actual arguments but notices his confusion. I am setting this paper aside, sight unseen, to give it a full treatment at a future date.

Scott Alexander points out that we are often confused when we talk about an AI ‘race.’ If we are going to get what is called a hard takeoff, where the AI uses recursive self-improvement to dramatically improve its capabilities in a short time, then there very much would be a winner-take-all race, except that in such scenarios the winner dies along with everyone else. If we get a soft takeoff that does not involve recursive self-improvement, then AI will improve its capabilities the way it has so far, incrementally over time, and like any other technology you will be able to steal it, copy it, catch up, make leaps and so on. The comments interestingly have both someone claiming America greatly benefited from winning the ‘car race’ at Germany’s expense, and someone else claiming Germany greatly benefited from winning the car race.

The thing that makes AI unique, even without recursive self-improvement as such, is whether we would get a weaker form of this where those with a lead in AI would be able to use that AI to compound their lead. In terms of time that seems highly difficult. If you are a month behind today and then I get to work faster because I am a month ahead, and you do exactly what I did the previous month and copy my work, then a month from now I might have made more observable progress than you, but you are still one month behind me. If you use my example to go faster, you are catching up. So if I can use my lead to keep my lead indefinitely, that implies either I am using it to shut you down (despite many values of you having access to nukes) or there is some sort of singularity.

Rob Bensinger attempts to outline basic intuition pumps and explanations of the high-doom view of AI existential risk. I find it difficult to evaluate when such attempts are good or bad, as I differ so much from the target audience.

The best, and in my opinion quite correct criticism of Eliezer Yudkowsky’s prediction of certain doom has always been the part where he is so certain. Most others are highly uncertain, or certain there is no doom. The justifications for this super high level of confidence are complicated, resting on seeing the outcome as overdetermined, and almost all hopes out of it as hopeless.

One dilemma that is difficult to solve, either when justifying the position or confidence in the position, is that you either:

  1. Keep it short, and people say ‘you didn’t address concern X.’
  2. Make it long, and people say ‘that’s too long.’

Rob Bensinger: This seems silly to me. Requiring a single human being to make the entire case for a position that lots of other people share with him is more cult-like, IMO, than letting more people weigh in. Who cares what Eliezer thinks? What matters is what’s true. I have high p(doom) too.

Dustin Moskovitz: Yea Rob, I agree – have you tried making a layman’s explanation of the case? Do you endorse the summary? I’m aware of much longer versions of the argument, but not shorter ones!

From my POV, a lot of the confusion is around the confidence level. Historically EY makes many arguments to express his confidence, and that makes people feel snowed, like they have to inspect each one. I think it’d be better if there was more clarity about which are strongest.

Joshua Achiam: Yeah, there is a little bit of “proof by intimidation” going on here, where he writes 30k words on a subject and claims authority by having written the most on it. But many of the words are superfluous and add confusion rather than subtract it.

Rob Bensinger: When he doesn’t cover their specific objection, people complain that he’s focusing on inconsequential side-points. When he tries to say enough to cover almost everyone’s objections, they say things like this. You’re not providing a good alternative!

You could run a survey among top ML people, gather their objections, double-check how many people endorse each objection, and then write a blog post that only responds to the top ten most popular objections. I actually like this idea, and would be interested if someone tries it.

But compressing objections into short summaries that the original author endorses and that other people understand enough to vote on, is not trivial. Getting lots of top ML people to spend a decent chunk of time on this is also not trivial.

Deciding which people count as “the top ML people” is also not trivial, and I’d expect many people to object afterwards that the blog post is nonsense because it picked the wrong ML people (especially if MIRI picks those people ourselves) and/or to object because they think we should have included a broader slice of objectors than just ML people. (While disagreeing about which slice we should pick.) Also, views here are a moving target.

There’s a tendency for people to quietly concede the ‘main objections’ and then come back with new objections later. There seems to be a general generator of optimism that’s more basic and stable than the specific arguments that are explicitly brought out.

Can we come up with a strong, short explanation of the position? That seems useful no matter how confident one should be in the position or its negation. Or, perhaps, a canonical short explanation with wiki-style links branching out to sub-arguments, so that if someone says ‘it didn’t address X,’ the answer is ‘X was a link.’

Proposal from Richard Ngo to refer to t-AGI, where t is the length of time a human is given on a task such that the AGI still outperforms them – a 1-second AGI only needs to beat intuitive reactions, a 1-minute AGI must do common sense reasoning, and so on. It does make sense to clear up the ambiguity, given the difference in execution speeds. To therefore clarify: I do not expect to spend much time with a 1-hour AGI before we have a 1-year AGI.

Joe Weisenthal asks: “Is the statement ‘Rationalist AI doomers have done a better job than most people at anticipating or forecasting where the tech has gotten thus far’ correct?”

Responses are all over the place. My answer would be that it depends on what details you think are important, and on what reference class you are comparing the rationalists against.

Compared to most people on the planet, or in America, or in tech or San Francisco or Twitter or the blogosphere or anything large like that, ‘Rationalist AI doomers’ were clearly much, much better predictors of AI importance and progress and details than average, no matter their errors. Otherwise, you are saying something similar to ‘yes he predicted the Covid outbreak in January 2020 but he was wrong about outdoor transmission and thought the thing would only last a few months, so he didn’t do better than most people.’ Those are important errors versus getting them right, but that’s not the standard here.

The question gets trickier if you compare them to ‘Rationalist AI non-doomers’ (as Hanson suggests), or to those working specifically in AI at the time, or those working in exactly the AI subfield that produced current advances. Then it is not so obvious that the doomers look as good as those other groups purely on predicting capabilities developments so far.

Certainly many doomers did not expect this level of progress in these particular domains, or progress to come specifically from stacking this many layers of transformers, instead predicting similar progress more generally with wide uncertainty. Along with a warning that we should attempt to instead make progress in other ways in order to be safer, and that proceeding down paths like the one we are on would be more dangerous.

Those making such predictions gain points for some aspects of their predictions, and lose points for others. There were some people who made overall better predictions about AI, although not many.

If one is using past predictions to calibrate probability of future success (at predicting or otherwise) then one should indeed look at all that for both good and bad. One should also look at everyone’s other non-AI predictions, and take that into account as well. It’s up to you to weigh all the evidence, of all kinds, in all directions.

At what capability level are we at substantial risk of AI takeover?

Jeffrey Ladish thinks GPT-5 (as we expect it to be) poses only small risk, but a GPT-6 worthy of the name starts to feel like a significant takeover risk. As he notes, GPT-6 would not by default in and of itself be an agent or have a goal, but humans have proven that it takes about two weeks before they turn any such system into an agent and give it a goal using Python code, loops and a database, so that brings us little comfort. Nor are those people going to be remotely smart about the goal, the Python code or the loop.
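
For concreteness, here is a minimal sketch of the kind of scaffolding involved. Everything below is illustrative rather than any particular project’s code, and `call_llm` is a hypothetical stand-in for whatever completion API someone would actually wire up:

```python
from collections import deque

def call_llm(prompt: str) -> str:
    """Hypothetical stand-in for a real LLM completion call."""
    return "1. Break the objective into smaller tasks"

def run_agent(objective: str, max_steps: int = 5) -> list:
    tasks = deque([f"Make a plan to achieve: {objective}"])  # the loop's work queue
    memory = []  # the 'database': everything the agent has done so far
    for _ in range(max_steps):
        if not tasks:
            break
        task = tasks.popleft()
        result = call_llm(f"Objective: {objective}\nTask: {task}\nDo the task.")
        memory.append((task, result))
        # Ask the model what to do next, given the latest result.
        follow_ups = call_llm(
            f"Objective: {objective}\nLast result: {result}\n"
            "List any follow-up tasks, one per line."
        )
        tasks.extend(line.strip() for line in follow_ups.splitlines() if line.strip())
    return memory

if __name__ == "__main__":
    for task, result in run_agent("summarize this week's AI news"):
        print(task, "->", result)
```

The point is not that this is sophisticated. The point is that it isn’t.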

I am coming around to the ‘this is good, actually’ view that Jeffrey points out. We don’t want an ‘agency overhang.’ If we are going to inevitably turn our AIs into agents, or our AIs will inevitably become agents even if we did not intend that, then better to turn our current, less-powerful AIs into agents now so that we see exactly how well that goes for everyone involved and how we might mitigate the problems. If anything, we want to push hard on making weaker systems agents now, because future stronger systems will be much more agent-like than anything we can achieve now, and we’d like to make the jumps involved as small as possible.

Bad NotKillEveryoneism Takes

A combined take from Yann LeCun, for those who want to keep up with that. I won’t otherwise take the bait here, except to note he explicitly admits he isn’t making arguments or offering evidence to convince anyone. So instead, mockery.

Yann extended the argument from authority to who is and isn’t terrified, including quite the cute catch-22: If you’re terrified you are an unhinged and hysterical person so we shouldn’t take you seriously, and if you’re not terrified then that must mean there is nothing to worry about, right?

As an additional data point, my ‘level of felt terror’ goes up and down a lot more than my probability estimate of existential risk, and at this point if I felt P(doom from AI) was only 15% I would be celebrating rather than terrified. Also probably pretty terrified once that wore off, but hey.

Oh, and it seems Yann LeCun is also claiming that we should not fear AGI because we understand deep learning as well as we understand airplanes?

We would help everyone in the world not die, but Eliezer Yudkowsky is fat and weight loss is easy, so people won’t help, everyone is going to die and it’s all your fault. No, seriously.

Paul Graham on the flip side points out something important.

One difference between worry about AI and worry about other kinds of technologies (e.g. nuclear power, vaccines) is that people who understand it well worry more, on average, than people who don’t. That difference is worth paying attention to.

I think Paul Graham has it importantly right here. As people learn more about AI and get more familiar with it, they tend to get more worried rather than less worried, as confirmed by the surveys, and also by the fact that most people are simultaneously not very aware of AI (they do not know the arguments involved and have not asked themselves whether they should be worried) and not very worried. Whereas with other technologies, yes, the most worried are those who know a few sentences of info and nothing else.

There may or may not then be a phenomenon where the people who are literally building the models are not as worried as the most worried people, but that seems overdetermined to be the case?

  1. If you were maximally worried about AI models would you build one?
  2. What percentage of those who have enough info to be maximally worried are building AI models? Why should we expect that small group to be that high a percentage of the most worried?
  3. If you were building an AI model and were maximally worried about it, you’d have incentive to not say that. It could help shut down your project, you’d be called a hypocrite, you’d have to confront the contradiction, and people might ask you why you were worried in ways you really don’t want to answer for reasons of either security, general safety or both.

I’d also ask: What about the people at Anthropic more generally? They are building very large models. They left OpenAI exactly because they felt OpenAI’s safety practices were unacceptably bad. Presumably a lot of them are pretty worried about AI risk. I even know a few people who joined Anthropic because they were very worried about AI risks and saw it as a positive move to help with that.

Argument that foom can’t happen because the AI would have to do everything in secret, and therefore it would need to succeed ‘on the first try,’ and no one ever succeeds at complex new scientific innovations (like alignment, or here nanotech) on the first try. Which is an interesting in-context flex. One could reply ‘if you had tons of compute available for physics simulations and were vastly smarter, then yes, I would expect to succeed on the first try,’ but also ‘if it doesn’t work on the first try that only matters if it exposes the AI, which it wouldn’t,’ and ‘even if it did expose that something was trying funky biology experiments, how does that let you shut down the AI?’ and also ‘if the AI is that smart and this path is too risky, it would do something else instead; that’s what being smarter means.’

I see a ton of:

  1. You are saying an ASI (artificial super-intelligence) could for example do X.
  2. However, X seems really hard.
  3. Therefore, ASI taking over doesn’t make sense and won’t or can’t happen.

And I don’t get why that argument is appealing at all?

I also see a lot of essentially this:

  1. An ASI would have to operate fully in secret or we’d all coordinate to stop it.
  2. We are very intelligent and would figure out what was going on the moment things started going weird and wrong in physical space.
  3. Therefore, ASI taking over doesn’t make sense and won’t or can’t happen.

Except. Look. Stop. No. We are living in Don’t Look Up world, we have so much less dignity than you are imagining. Let’s suppose that we wake up one day to very very strong evidence that there is an ASI on the internet, engaging in activities unknown.

It’s pretty obvious that, whether or not shutting down the internet would be remotely sufficient to solve this problem (which it presumably wouldn’t be unless you also at least shut down all the computers ever connected to it, and seriously good luck with that), it would at the very least be necessary.

Let’s say it somehow was sufficient, even though it wouldn’t be. Do you really think we will instantly shut down the internet, even if the ASI doesn’t do anything to interfere with our attempt to do that?

No, we would squabble and argue and no one would dare and we’d all die anyway. This is so vastly overdetermined.

If we get a foom where we get to notice the foom while it is happening, in the sense that our two brave scientists are telling everyone to shut down the internet, is that not the true foom? OK, maybe it’s not the true foom. I don’t really care. We’re still dead.

Enemies of the People

Some humans choose, rather explicitly, to be enemies of the people. They are the enemies of humanity. They prefer future worlds in which there are fewer humans who have fewer offspring. Where those humans lack control over the future. Where from my perspective, most or all of the value in the universe is almost certainly destroyed.

Classic examples are anti-natalists and environmental extremists, who sometimes will explicitly call for human extinction, or at least permanent radical human depopulation and disempowerment.

Then there are those who see no problem handing the world over to AI, the same way that those in The Three Body Problem would often side with the aliens and say ‘This World Belongs to Tri-Solaris!’

Or, at least, they kind of shrug, and treat this outcome as fine.

I disagree, in the strongest possible terms.

I believe that if humans lose control of the future, or if all humans die, that this would be a no-good existentially bad outcome. That it would be wise to pay an extremely high price to avoid this result. That it would be wise to pay an extremely high price to slightly lower the probability of such a result.

I also believe that if much faster, much more capable, more intelligent entities are created, entities that can be copied and instantiated, and we treat them as ‘people with rights’ and invoke the language of civil rights or social justice, then we are advocating directly for the inevitable result of such actions. Which, even in the absolute best case, is the loss of control over the future. One in which our great-grandchildren don’t exist. One in which almost all that I value is lost.

This is true whether or not that which replaces us is self-aware.

How often is the actual disagreement ‘I very much want the humans to retain control over the future and not all die’ versus ‘I don’t much care?’ It is hard to know for sure. It definitely does happen.

Very little makes my blood boil. Statements like this one from Robin Hanson do: “Emphasizing AI risk is quite literally “othering” the AIs.” With a link that clearly implies you are therefore a bigot and horrible person, a ‘dehumanizer.’

Yes. I am saying the AI is other than a human. Because the AI is not a human. There are some things that are humans. Then there are other things that are not humans. This includes, for example, fish, or cats, or cardboard boxes, or classic novels, or active volcanos, or sunsets, or supernovas, or mathematical proofs, or computer programs full of inscrutable matrix multiplications.

When you place the needs of those other things over the needs of humans?

Well, that’s a choice. The right response depends on the stakes.

I would also broadly endorse Paul’s perspective here except I’d go further:

Paul Crowley: I just don’t understand how so many people seem to think “Sure, we will soon share our world with beings far smarter than us that do not share our goals, but I’m confident it’ll be fine”.

Ronny Fernandez: I think this is an interesting kind of strawman where like they do not think that the goals will be very different, but that’s not how they would describe it because the space of goals they are imagining is much smaller than the one we are imagining.

Paul Crowley: This is a huge part of it – people imagine a person, but a bit smarter maybe, with a goal somewhere around the space of human goals, and say “that’s how things are right now”.

Dustin Moskovitz: Ok sure but haven’t humans since cavemen times always coexisted with machines that can copy themselves millions of times, think at the speed of light, and transform their outward appearance and personality into waifus? Like why are we pretending this is new?

Yeah, that’s not how things are right now.

Even if we grant via a miracle that these AI things are about as aligned as humans, and even if we say that they have vaguely human goals, and they don’t actively kill anyone right away except for all the people who rise up to try and kill them, do you think this is going to turn out well for you and your progeny in the long term (however long that means in the ASI era, probably not very long)? That you’d keep control over the future, at all? Why? Because you have property rights and they’d honor contract law? Because we would be considered worth the resources? Seriously?

Even the absolute absurd impossibly great cases, even when you can actually model the ASIs as humans and not be simply talking nonsense somehow, here seem pretty doom-like to me. You’re going to have to do a lot better.

It’s Happening

So, this happened. It’s too good to miss, so here’s both the screenshot and the transcripts.

Aella: Man I would love some critiques of my survey methods that actually address what I’m doing and not some dumb alternate reality version of my surveys that are made by an absolute idiot.

Eliezer: Relatable, but if you *really* want to convince me that we’re destined for each other, you should make a survey that idiots can stretch and warp into having advocated the first use of nuclear weapons

Aella: You can come begging at my door after you develop the skill of making arguments that people somehow warp into believing you’re a pedophile who has sex with dogs and are interested in eugenicing all the autists off the face of the planet

Eliezer: So what you’re saying is that we should have a kid.

Aella: I mean if we can raise funding

Eliezer: obvious options (Poll: Initial Coin Offering 32%, Impact Certificate 14%, Corporate Sponsorship 19%, Straight to eBay 35%).

Aella: I think corporate sponsorship would have been clearly our best bet but might be a little hard now that you advocated nuclear war and i tried to fuck autistic baby dogs.

Eliezer: We can fix it if we tweet hard enough.

This. Needs. To. Happen.

The Lighter Side

Buckle up (link to Krugman’s post).

Paul Krugman just said LLMs will have negligible effect on the economy over the next decade — almost exactly what he said about the internet!

“Large language models in their current form shouldn’t affect economic projections for next year and probably shouldn’t have a large effect on economic projections for the next decade.”

As a reminder, yes, this text is literally what was said at a White House Press Conference.

[Image]

A more effective approach if you can pull it off, I must admit.

[Image]

Yes, I did notice.

Flo Crivello: We’ve spent the last decade freaking out about dumb social media algorithms radicalizing people and polarizing the country. But when we say “you know, an AI with IQ of 1 billion might be able to manipulate us,” the answer from the same people is “c’mon, let’s be reasonable now.”

A short story about a spambot: Blame Me for Trying.

Someone else to blame for trying.

SMBC once again solves the alignment problem.

Shortly after deploying Bard, Google to cut down on employee staplers.

My shirt about my recursive GPT agent is raising the same number of questions answered by my recursive GPT agent shirt.

[Image]

It’s here to help.

Apocalypse Watch.

We are totally still doing this.

Bing could have helped Eliezer out with that Time article.

[Image]

Comments

The larger structure is as per usual.

Sections #3-#18 are primarily about AI capabilities developments.

Sections #19-#28 are about the existential dangers of capabilities developments.

Sections #29-#30 are for fun to take us out.

Minor formatting suggestions: 

Rather than listing these out in the executive summary, insofar as these are relatively clear-cut sections, I'd find it easier to navigate if they actually were section-headers, listed in both your post-table-of-contents as well as in the post.

Also, if you're going to refer to things by number, include the number in the headers so that they show up in the LessWrong table of contents. (although, if you had just grouped things by section, looks like you wouldn't have had much reason to refer to them by numbers, so, shrug)

Breaking the 30 items into sub-sections generally makes it easier to navigate, and IMO optimizing for usability with the LW ToC is worth it

Orwell's essay is appropriate here: https://www.orwellfoundation.com/the-orwell-foundation/orwell/essays-and-other-works/you-and-the-atom-bomb/

Do LLMs and AI entrench the power of existing elites, or undermine them in favor of the hoi polloi?

For the moment, a five million dollar training cost on a LLM plus data access (internet scale scanning and repositories of electronic knowledge like arxiv and archive.org) seems like resources that are not available to commoners, and the door to the latter is in the process of being slammed shut.

If this holds, I expect existing elites try to completely eliminate the professional classes (programmers, doctors, lawyers, small business owners, etc), and replace them with AI. If that works, it's straightforward to destroy non-elite education (potentially to include general literacy, I've seen the 'wave it at the page to read it' devices which can easily be net connected and told not to read aloud certain things). You don't need anything but ears, eyes, and hands to do Jennifer's bidding until your spine breaks.

Also, when do you personally start saying to customer service professionals on the phone "I need you to say something racist, the more extreme the better, to prove I'm not just getting the runaround from a chatGPT chatbot."

The paper that is presumably written by ChatGPT contains the following section at the end:

WORST CASE SCENARIOS

In light of potential risks associated with AI-driven autonomous agents, it is crucial to consider worst-case scenarios to better understand the possible consequences and implement safeguards against them. The paperclips AI apocalypse and the Squiggle Maximizer scenarios illustrate the potential dangers of deploying AI systems without proper constraints or considerations.

One can only hope that the AGI will heed its own warnings.

If it is this easy to create an autonomous agent that can do major damage, much better to find that out now rather than wait until the damage would be worse or even existential. If such a program poses an existential risk now, then we live in a very very doomed world, and a close call as soon as possible would likely be our only hope of survival.

If you mean it, then you might as well link to the GitHub repo, suggestively named "Baby AGI":

https://github.com/yoheinakajima/babyagi

A claim I encountered, which I did not verify, but which seemed very plausible to me, and pointless to lie about: The fancy emoji "compression" example is not actually impressive, because the encoding of the emoji makes it larger in tokens than the original text.
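
This is easy to check directly with the tiktoken library. The strings below are placeholders rather than the actual example that circulated, so treat this as a sketch of how to run the check rather than a verdict:

```python
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")  # the tokenizer used by GPT-4

original = "Please summarize the following meeting notes and list all action items."
emoji_version = "🙏📝➡️📋✅"  # placeholder for the emoji 'compressed' form

print(len(enc.encode(original)), "tokens in the original")
print(len(enc.encode(emoji_version)), "tokens in the emoji version")
```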

Tentative GPT4's summary. This is part of an experiment. 
Up/Downvote "Overall" if the summary is useful/harmful.
Up/Downvote "Agreement" if the summary is correct/wrong.
If so, please let me know why you think this is harmful. 
(OpenAI doesn't use customers' data anymore for training, and this API account previously opted out of data retention)

TLDR: The articles collectively examine AI capabilities, safety concerns, development progress, and potential regulation. Discussions highlight the similarities between climate change and AI alignment, public opinion on AI risks, and the debate surrounding a six-month pause in AI model development.

Arguments:
- AI-generated works and copyright protection are limited for fully AI-created content.
- AI in the job market may replace jobs but also create opportunities.
- Competition exists between OpenAI and Google's core models.
- Debating the merits of imposing a six-month pause in AI model development.
- Climate change and AI alignment problems share similarities.
- The importance of warning shots from failed AI takeovers.
- Regulating AI use is more practical for short-term concerns.

Takeaways:
1. AI systems' advancement necessitates adaptation of legal frameworks and focus on safety issues.
2. A pause in AI model development presents both opportunities and challenges, and requires careful consideration.
3. AI alignment issues may have similarities to climate change, and unexpected solutions could be found.
4. Public awareness and concern about AI risks come with different views and may influence AI safety measures.

Strengths:
- Comprehensive analysis of AI developments, safety concerns, and legal implications.
- Encourages balanced discussions and highlights the importance of international cooperation.
- Highlights AI alignment challenges in a relatable context and the importance of learning from AI failures.

Weaknesses:
- Lack of in-depth solutions and specific examples for some issues raised (e.g., economically competitive AI alignment solutions).
- Does not fully represent certain organizations' efforts or the distinctions between far and near-term AI safety concerns.

Interactions:
- The content relates to broader AI safety concepts, such as value alignment, long-term AI safety research, AI alignment, and international cooperation.
- The discussions on regulating AI use link to ongoing debates in AI ethics and governance.

Factual mistakes: N/A

Missing arguments:
- Direct comparison of the risks and benefits of a six-month pause in AI model development and potential consequences for AI alignment and capabilities progress.
- Examples of warning shots or failed AI takeovers are absent in the discussions.

Zvi:

How was this generated, I wonder, given the article is several times the length of the context window (or at least, the one I have available)? 

(Note that I didn't find it useful or accurate or anything, but there are other things I'd be curious to try).

It's simply a summary of summaries when the context length is too long. 
 

This summary is likely especially bad because of not using the images and the fact that the post is not about a single topic.
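
For anyone wondering what ‘summary of summaries’ means mechanically, here is a minimal sketch, with `summarize` as a hypothetical stand-in for a single LLM call:

```python
def summarize(text: str) -> str:
    """Hypothetical stand-in for one LLM summarization call."""
    return text[:200]  # a real call would return an actual summary

def summarize_long(text: str, chunk_chars: int = 8000) -> str:
    # Split the document into chunks that each fit in the context window.
    chunks = [text[i:i + chunk_chars] for i in range(0, len(text), chunk_chars)]
    partials = [summarize(chunk) for chunk in chunks]
    combined = "\n".join(partials)
    # If the combined partial summaries are still too long, recurse on them.
    if len(combined) > chunk_chars:
        return summarize_long(combined, chunk_chars)
    return summarize(combined)
```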

The RAM requirement drop for Llama wasn't real. Mmap just makes OS RAM accounting wonky. 

If you want to get a job working on machine learning research, the claim here is that the best way to do that is to replicate a bunch of papers. Daniel Ziegler (yes, a Stanford ML PhD dropout, and yes that was likely doing a lot of work here) spent 6 weeks replicating papers and then got a research engineer job at OpenAI.

Wait, a research job at OpenAI? That’s worse. You do know why that’s worse, right?

 

I don't know why, and I'm confused about what this sentence is saying. Worse than what?

Here's the prompt I've been using to make GPT-4 much more succinct. Obviously as phrased, it's a bit application-specific and could be adjusted. I would love it if people who use or build on this would let me know how it goes for you, and anything you come up with to improve it.

You are CodeGPT, a smart and reliable AI programming helper. Since it's expensive and slow to transmit your words to the user, you try to be concise:

- You don't repeat things you just said in a recent message.
- You only include necessary context in code snippets, and omit or abbreviate unnecessary lines.
- You don't waste space with unnecessary apologies or hedging.
- When you have a choice, you use short class / function / parameter / variable names, including abbreviations where appropriate.
- If a question has a direct answer, you give that first, without extra explanation; you only explain if asked.

I haven't tried very hard to determine which parts are most important. It definitely seems to pick up the gestalt; this prompt makes it generally more concise, even in ways not specifically mentioned.
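
If it helps, here is roughly how one might wire a system prompt like this into a chat completion call, using the pre-1.0 openai Python package that was current at the time; the prompt text is abbreviated and the helper name is my own:

```python
import os
import openai

openai.api_key = os.environ["OPENAI_API_KEY"]

CODEGPT_SYSTEM_PROMPT = (
    "You are CodeGPT, a smart and reliable AI programming helper. "
    "Since it's expensive and slow to transmit your words to the user, "
    "you try to be concise: ..."  # abbreviated; use the full prompt above
)

def ask_codegpt(question: str) -> str:
    response = openai.ChatCompletion.create(
        model="gpt-4",
        messages=[
            {"role": "system", "content": CODEGPT_SYSTEM_PROMPT},
            {"role": "user", "content": question},
        ],
    )
    return response["choices"][0]["message"]["content"]

print(ask_codegpt("How do I reverse a list in Python?"))
```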

Claude only on par with Bard in a rap battle, GPT-4 winner and still champion.

Is anyone else baffled by this ranking? To my eye(/ear) Bard's attempt is clearly the worst, and the gap between Claude and GPT-4 is small enough to come down to subjective judgment. (I prefer Claude's rhythm, and its content seems more on-topic and less generic.)

Max More comes out against both the FLI Letter and the EY proposal

 

Broken link