yagudin - LessWrong

Large Language Models will be Great for Censorship

There is already a lot of automatic censoring happening. I am unsure how much LLMs add on top of existing and fairly successful techniques from spam filtering. And just using LLMs is probably prohibitive at the scale of social media (definitely for tech companies, maybe not for governments), but perhaps you can get an edge for some use-case with them.

Private notes on LW?

yagudin1y32

But I (and I think others on LW team although for slightly different reasons) have been thinking about building a feature directly into LW to facilitate it.

Maybe consider making it super easy (one click easy) to export LW posts to google docs?

Lightcone Infrastructure/LessWrong is looking for funding

yagudin1y115

ACX is probably a better reference class: https://astralcodexten.substack.com/p/2023-subscription-drive-free-unlocked. In Jan, ACX had 78.2k readers, of which 6.0k subscribers for a 7.7% subscription rate.

Updates and Reflections on Optimal Exercise after Nearly a Decade

yagudin1y40

Consider https://thepeteplan.wordpress.com/beginner-training/.

Luck based medicine: my resentful story of becoming a medical miracle

yagudin2y71

I think it might be good to normalize "just try stuff until they fix your condition" as one of the treatment strategies. I guess it's a bit ironic that Dr. Spray-n-pray's indifference toward which pill worked and why seems so epistemically careless, while actually maybe being a correct way to orient towards success when you optimize for luck and have little reliable information.

A Few Terrifying Facts About The Russo-Ukrainian War

yagudin2y51

Russian military doctrine allows the usage of nuclear weapons to defend Russian territory.

This is ~false. See: https://forum.effectivealtruism.org/posts/TkLk2xoeE9Hrx5Ziw/nuclear-attack-risk-implications-for-personal-decision?commentId=ukEznwTnD78wFdZip#ukEznwTnD78wFdZip

The Track Record of Futurists Seems ... Fine

yagudin2y10

See:

Book Launch: The Engines of Cognition

yagudin2y250

`Trust`
`Rule Thinkers In, Not Out`	`Scott Alexander`
`Gears vs Behavior`	`John S. Wentworth`
`Book Review: The Secret Of Our Success`	`Scott Alexander`
`Reason isn't magic`	`Ben Hoffman`
`"Other people are wrong" vs "I am right"`	`Buck Shlegeris`
`In My Culture`	`Duncan Sabien`
`Chris Olah's views on AGI safety`	`Evan Hubinger`
`Understanding "Deep Double Descent"`	`Evan Hubinger`
`How to Ignore Your Emotions (while also thinking you're awesome at emotions)`	`Hazard`
`Paper-Reading for Gears`	`John S. Wentworth`
`Book summary: Unlocking the Emotional Brain`	`Kaj Sotala`
`Noticing Frame Differences`	`Raymond Arnold`
`Propagating Facts into Aesthetics`	`Raymond Arnold`
`Do you fear the rock or the hard place?`	`Ruben Bloom`
`Mental Mountains`	`Scott Alexander`
`Steelmanning Divination`	`Vaniver`
`Modularity`
`Book Review: Design Principles of Biological Circuits`	`John S. Wentworth`
`Reframing Superintelligence: Comprehensive AI Services as General Intelligence`	`Rohin M. Shah`
`Building up to an Internal Family Systems model`	`Kaj Sotala`
`Being the (Pareto) Best in the World`	`John S. Wentworth`
`The Schelling Choice is "Rabbit", not "Stag"`	`Raymond Arnold`
`Literature Review: Distributed Teams`	`Elizabeth Van Nostrand`
`Gears-Level Models are Capital Investments`	`John S. Wentworth`
`Evolution of Modularity`	`John S. Wentworth`
`You Have About Five Words`	`Raymond Arnold`
`Coherent decisions imply consistent utilities`	`Eliezer Yudkowsky`
`Alignment Research Field Guide`	`Abram Demski`
`Forum participation as a research strategy`	`Wei Dai`
`The Credit Assignment Problem`	`Abram Demski`
`Selection vs Control`	`Abram Demski`
`Incentives`
`Asymmetric Justice`	`Zvi Mowshowitz`
`The Copenhagen Interpretation of Ethics`	`Jai Dhyani`
`Unconscious Economics`	`Jacob Lagerros`
`Power Buys You Distance From The Crime`	`Elizabeth Van Nostrand`
`Seeking Power is Often Convergently Instrumental in MDPs`	`Alexander Turner & Logan Smith`
`Yes Requires the Possibility of No`	`Scott Garrabrant`
`Mistakes with Conservation of Expected Evidence`	`Abram Demski`
`Heads I Win,Tails?—Never Heard of Her; Or, Selective Reporting and the Tragedy of the Green Rationalists`	`Zack M. Davis`
`Excerpts from a larger discussion about simulacra`	`Ben Hoffman`
`Moloch Hasn’t Won`	`Zvi Mowshowitz`
`Integrity and accountability are core parts of rationality`	`Oliver Habryka`
`The Real Rules Have No Exceptions`	`Said Achmiz`
`Simple Rules of Law`	`Zvi Mowshowitz`
`The Amish, and Strategic Norms around Technology`	`Raymond Arnold`
`Risks from Learned Optimization: Introduction`	`Evan Hubinger, Chris van Merwijk, Vladimir Mikulik, Joar Skalse, & Scott Garrabrant`
`Gradient hacking`	`Evan Hubinger`
`Failure`
`The Parable of Predict-O-Matic`	`Abram Demski`
`Blackmail`	`Zvi Mowshowitz`
`Bioinfohazards`	`Megan Crawford, Finan Adamson, & Jeffrey Ladish`
`What failure looks like`	`Paul Christiano`
`Seeking Power is Often Convergently Instrumental in MDPs`	`Alexander Turner & Logan Smith`
`AI Safety “Success Stories”`	`Wei Dai`
`Reframing Impact`	`Alexander Turner`
`The strategy-stealing assumption`	`Paul Christiano`
`Is Rationalist Self-Improvement Real?`	`Jacob Falkovich`
`The Curse Of The Counterfactual`	`P.J. Eby`
`human psycholinguists: a critical appraisal`	`Nostalgebraist`
`Why wasn't science invented in China?`	`Ruben Bloom`
`Make more land`	`Jeff Kaufman`
`Rest Days vs Zombie Days`	`Lauren Lee`

Here is a google sheet.

Challenges with Breaking into MIRI-Style Research

yagudin3yΩ460

I want to mention that Tsvi Benson-Tilsen is a mentor at this summer's PIBBSS. So some readers might consider applying (the deadline is Jan 23rd).

I myself was mentored by Abram Demski once through the FHI SRF, which AFAIK was matching fellows with a large pull of researchers based on mutual interests.

The Best Software For Every Need

yagudin3y40

I am looking for text-to-speech tools for various contexts. As of now, I am using

@Voice Aloud Reader (TTS Reader) and a custom script to extract articles from webpages for Android (supports .epub and .pdf as well);
Capti Voice on my desktop for everything.

LESSWRONG
LW

Posts

Wiki Contributions

Comments