LESSWRONG
LW

Dr_Manhattan
3605Ω2798370
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No wikitag contributions to display.
An alternative way to browse LessWrong 2.0
Dr_Manhattan10mo20

Any updates on the API? (thinking of) Playing around with interesting ways to index LW, figure there should be something better than scraping

Reply
AGI will drastically increase economies of scale
Dr_Manhattan1y20

So how does one invest in China, as a country?

Reply
Douglas Hofstadter changes his mind on Deep Learning & AI risk (June 2023)?
Dr_Manhattan2y257

He was only a de facto mysterian: thought mind is so complicated that it may as well be mysterious (but ofc he believed it's ultimately just physics). This position is updateable, and he clearly updated.

Reply
Bing Chat is blatantly, aggressively misaligned
Dr_Manhattan3y*31

A net saying "I'm thinking about ways to kill you" does not necessarily imply anything whatsoever about the net actually planning to kill you

 

Since these nets are optimized for consistency (as it makes textual output more likely), wouldn't outputting text that is consistent with this "thought" be likely? E.g. convincing the user to kill themselves, maybe giving them a reason (by searching the web)? 

Reply
How it feels to have your mind hacked by an AI
Dr_Manhattan3y30

I've been wishing for someone to write AI-singularity parallel of Bardbury's Martian Chronicles (which are pretty much independent sample/ simulations of how living on Mars could go)

Reply
Replacing Karma with Good Heart Tokens (Worth $1!)
Dr_Manhattan3y40
Reply
Mental nonsense: my anti-insomnia trick
Dr_Manhattan3y40

Sharing a personal weird trick why not. I like falling asleep to light TV (via iPad). I watch short shows that a) I like and don't think are boring b) I have seen before. Usually 10 minutes into a 20 min show I'm ready (Futurama is my favorite for this + my meme game is much improved by this)

Reply
I left Russia on March 8
Dr_Manhattan3y40

Was thinking about you! Glad you made it out. Feel free to DM if I can be of assistance

Reply
(briefly) RaDVaC and SMTM, two things we should be doing
Dr_Manhattan4y380

MIRI is bottlenecked more on ideas worth pursuing and people who can pursue them, than on funding

Ideas come from (new) people, and you mentioned seed planting which should contribute to having such people in 4-6 years, seems like still a worthy thing to do for AGI if anything is worth doing for any cause at all (given your short timelines). If you agree what's the bottleneck for that effort?

Reply
Visible Thoughts Project and Bounty Announcement
Dr_Manhattan4y50

Related work: 
Show Your Work: Scratchpads for Intermediate Computation with Language Models
https://arxiv.org/abs/2112.00114

(from very surface-level perusal) Prompting the model resulted in 
1) Model outputting intermediate thinking "steps"

2) Capability gain

Reply
Load More
32What are we predicting for Neuralink event?
Q
6y
Q
15
11LW Dev question: FB-style tagging?
Q
6y
Q
1
20Using rationality to debug Machine Learning
7y
3
0Bayesian statistics as epistemic advantage
8y
2
10MILA gets a grant for AI safety research
8y
3
17Learning from Human Preferences - from OpenAI (including Christiano, Amodei & Legg)
8y
12
12NIPS 2015
10y
4
7Velocity of behavioral evolution
11y
10
20What Peter Thiel thinks about AI risk
11y
6
6Cognitive distortions of founders
11y
1
Load More