yagudin

Wiki Contributions

Comments

yagudin1-3

There is already a lot of automatic censoring happening. I am unsure how much LLMs add on top of existing and fairly successful techniques from spam filtering. And just using LLMs is probably prohibitive at the scale of social media (definitely for tech companies, maybe not for governments), but perhaps you can get an edge for some use-case with them.

yagudin32

But I (and I think others on LW team although for slightly different reasons) have been thinking about building a feature directly into LW to facilitate it. 

 

Maybe consider making it super easy (one click easy) to export LW posts to google docs? 

yagudin115

ACX is probably a better reference class: https://astralcodexten.substack.com/p/2023-subscription-drive-free-unlocked. In Jan, ACX had 78.2k readers, of which 6.0k subscribers for a 7.7% subscription rate. 

I think it might be good to normalize "just try stuff until they fix your condition" as one of the treatment strategies. I guess it's a bit ironic that Dr. Spray-n-pray's indifference toward which pill worked and why seems so epistemically careless, while actually maybe being a correct way to orient towards success when you optimize for luck and have little reliable information.

  1. Russian military doctrine allows the usage of nuclear weapons to defend Russian territory.

 

This is ~false. See: https://forum.effectivealtruism.org/posts/TkLk2xoeE9Hrx5Ziw/nuclear-attack-risk-implications-for-personal-decision?commentId=ukEznwTnD78wFdZip#ukEznwTnD78wFdZip

yagudin250
Trust
Rule Thinkers In, Not OutScott Alexander 
Gears vs BehaviorJohn S. Wentworth 
Book Review: The Secret Of Our SuccessScott Alexander 
Reason isn't magicBen Hoffman 
"Other people are wrong" vs "I am right"Buck Shlegeris 
In My CultureDuncan Sabien 
Chris Olah's views on AGI safetyEvan Hubinger 
Understanding "Deep Double Descent"Evan Hubinger 
How to Ignore Your Emotions (while also thinking you're awesome at emotions)Hazard 
Paper-Reading for GearsJohn S. Wentworth 
Book summary: Unlocking the Emotional BrainKaj Sotala 
Noticing Frame DifferencesRaymond Arnold 
Propagating Facts into AestheticsRaymond Arnold 
Do you fear the rock or the hard place?Ruben Bloom 
Mental MountainsScott Alexander 
Steelmanning DivinationVaniver 
Modularity
Book Review: Design Principles of Biological CircuitsJohn S. Wentworth 
Reframing Superintelligence: Comprehensive AI Services as General IntelligenceRohin M. Shah 
Building up to an Internal Family Systems modelKaj Sotala 
Being the (Pareto) Best in the WorldJohn S. Wentworth 
The Schelling Choice is "Rabbit", not "Stag"Raymond Arnold 
Literature Review: Distributed TeamsElizabeth Van Nostrand 
Gears-Level Models are Capital InvestmentsJohn S. Wentworth 
Evolution of ModularityJohn S. Wentworth 
You Have About Five WordsRaymond Arnold 
Coherent decisions imply consistent utilitiesEliezer Yudkowsky 
Alignment Research Field GuideAbram Demski 
Forum participation as a research strategyWei Dai 
The Credit Assignment ProblemAbram Demski 
Selection vs ControlAbram Demski 
Incentives
Asymmetric JusticeZvi Mowshowitz 
The Copenhagen Interpretation of EthicsJai Dhyani 
Unconscious EconomicsJacob Lagerros 
Power Buys You Distance From The CrimeElizabeth Van Nostrand 
Seeking Power is Often Convergently Instrumental in MDPsAlexander Turner & Logan Smith 
Yes Requires the Possibility of NoScott Garrabrant 
Mistakes with Conservation of Expected EvidenceAbram Demski 
Heads I Win,Tails?—Never Heard of Her; Or, Selective Reporting and the Tragedy of the Green RationalistsZack M. Davis 
Excerpts from a larger discussion about simulacraBen Hoffman 
Moloch Hasn’t WonZvi Mowshowitz 
Integrity and accountability are core parts of rationalityOliver Habryka 
The Real Rules Have No ExceptionsSaid Achmiz 
Simple Rules of LawZvi Mowshowitz 
The Amish, and Strategic Norms around TechnologyRaymond Arnold 
Risks from Learned Optimization: IntroductionEvan Hubinger, Chris van Merwijk, Vladimir Mikulik, Joar Skalse, & Scott Garrabrant 
Gradient hackingEvan Hubinger 
Failure
The Parable of Predict-O-MaticAbram Demski 
BlackmailZvi Mowshowitz 
BioinfohazardsMegan Crawford, Finan Adamson, & Jeffrey Ladish 
What failure looks likePaul Christiano 
Seeking Power is Often Convergently Instrumental in MDPsAlexander Turner & Logan Smith 
AI Safety “Success Stories”Wei Dai 
Reframing ImpactAlexander Turner 
The strategy-stealing assumptionPaul Christiano 
Is Rationalist Self-Improvement Real?Jacob Falkovich 
The Curse Of The CounterfactualP.J. Eby 
human psycholinguists: a critical appraisalNostalgebraist 
Why wasn't science invented in China?Ruben Bloom 
Make more landJeff Kaufman 
Rest Days vs Zombie DaysLauren Lee 

 

Here is a google sheet.

yagudinΩ460

I want to mention that Tsvi Benson-Tilsen is a mentor at this summer's PIBBSS. So some readers might consider applying (the deadline is Jan 23rd).

I myself was mentored by Abram Demski once through the FHI SRF, which AFAIK was matching fellows with a large pull of researchers based on mutual interests.

I am looking for text-to-speech tools for various contexts.  As of now, I am using

Load More