# Supervise Process, not Outcomes
We can think about machine learning systems on a spectrum from process-based to outcome-based:

* Process-based systems are built on human-understandable task decompositions, with direct supervision of reasoning steps.
* Outcome-based systems are built on end-to-end optimization, with supervision of final results.

This post explains why Ought is devoted to process-based systems. The argument is:

1. In the short term, process-based ML systems have better differential capabilities: they help us apply ML to tasks where we don’t have access to outcomes. These tasks include long-range forecasting, policy decisions, and theoretical research.
2. In the long term, process-based ML systems help avoid catastrophic outcomes from systems gaming outcome measures and are thus more aligned.
3. Both process- and outcome-based evaluation are attractors to varying degrees: once an architecture is entrenched, it’s hard to move away from it. This lock-in applies much more to outcome-based systems.
4. Whether the most powerful ML systems will primarily be process-based or outcome-based is up in the air.
5. So it’s crucial to push toward process-based training now.

There are almost no new ideas here. We’re reframing the well-known outer alignment difficulties for traditional deep learning architectures and contrasting them with compositional approaches. To the extent that there are new ideas, credit primarily goes to Paul Christiano and Jon Uesato.

We only describe our background worldview here. In a follow-up post, we explain why we’re building Elicit, the AI research assistant.

## The spectrum

### Supervising outcomes

Supervision of outcomes is what most people think about when they think about machine learning. Local components are optimized based on an overall feedback signal:

* SGD optimizes weights in a neural net to reduce its training loss
* Neural architecture search optimizes architectures and hyperparameters to have low validation loss
* Policy gradient optimizes p