papetoast's Shortforms

papetoast

papetoast's Shortforms — LessWrong

157 comments, sorted by

Click to highlight new comments since: Today at 6:30 PM

Edit: Sanity-checking “Incompressible Knowledge Probes” by @Sturb @LawrenceC suggests these results are very inaccurate due to various methodological issues, although "the core idea behind the paper is largely sound".

Estimates of total parameter size of frontier models using 1700 obscure factual questions and ~exponential regression on 89 open parameter models (paper's actual title is mostly clickbait) (via AI Dance on Twitter)

Trimmed, full version under section 6.3. I don't have enough experience to have intuition about how accurate these estimates are.

Model	Accuracy	Est. Size	90% PI
GPT-5.5	71.9%	∼9.7T	[3.2–28.7T]
Claude Opus 4.6	68.0%	∼5.3T	[1.8–15.6T]
Claude Opus 4.7	66.4%	∼4.0T	[1.4–12.0T]
GPT-5.4 Pro	62.5%	∼2.2T	[736B–6.5T]
Claude Sonnet 4.6	60.9%	∼1.7T	[579B–5.1T]
Claude Haiku 4.5	39.9%	∼65B	[22B–194B]

7LawrenceC2mo

Thanks for the mention! Amusingly, it was this shortform that caused me to start writing the post: I started drafting a response on the issues I had, and then it ballooned into a full investigation and Ben Sturgeon got pulled in as well.

1papetoast2mo

You're welcome - I think I have the responsibility to attempt to clear up any misinformation I spread even if by accident. I had the suspicion that I caused this investigation too, since you posted it on LessWrong and afaict I was the only one talking about this paper. I feel both amused and slightly regretful for this whole chain of events.

3Petropolitan2mo

An interesting work, let me compare it with my estimates from three weeks ago: for all eight GPT-5 series models I considered (5, 5 Pro, 5.1, 5.2, 5.2 Pro, 5.3, 5.4, 5.4 Pro) 2T total parameters fall within the 90% prediction interval brackets, and four more I didn't consider (4o, o1, o3, 4.1) fit as well. My 1.2T estimate for Sonnet is very close to Li's 1.7T, and my 4T estimate for Opus 4-series fits into the 90% PI bracket for all five versions. (Just to remind, on average, we should expect 1 true value out of 10 not to fit)

1papetoast2mo

IMPORTANT UPDATE: Sanity-checking “Incompressible Knowledge Probes” by @Sturb @LawrenceC (via twitter's algorithm (Lisan al Gaib @scaling01)) Alternatively they also posted a twitter thread. [...] Model Paper estimate [90% PI] Estimate w/ corrections [90% PI] Δ paper→ corrected gpt-5.5-pro 10,267B [3,422 – 30,801] 1,471B [258 – 8,385] ↓6.98× gpt-5.5-think 9,656B [3,219 – 28,968] 1,458B [256 – 8,311] ↓6.62× gpt-5.5 8,831B [2,944 – 26,493] 1,459B [256 – 8,316] ↓6.05× claude-opus-4.6-think 5,254B [1,751 – 15,762] 1,399B [245 – 7,974] ↓3.76× claude-opus-4.7-think 4,041B [1,347 – 12,123] 1,132B [199 – 6,452] ↓3.57×

1papetoast2mo

Chinese websites are notoriously hard to archive and rots extremely quickly, so here is the Zhihu content verbatim. The bolded parts corresponds to the claim that "this work was done by an AI agent in 4 days". https://www.zhihu.com/pin/2032769685012361774 (https://archive.ph/drfZi) 李博杰闭源实验室隐藏了模型规模，但他们藏不住模型知道什么。而模型知道什么，恰恰是其参数量的一个指标。推理可以压缩，事实知识不行。因此仅凭黑盒 API 调用，就能给前沿模型估算规模；跨越多次版本发布，你甚至能看到某个事实何时进入参数之中。三年来，我的朋友何纪言和郑子涵一直在向前沿大模型问同一个问题：“你了解中科大 Hackergame 吗？”——这是一个 CTF 竞赛。2024 年 5 月，GPT-4o 编造了不存在的题目名称。2025 年 2 月，Claude 3.7 Sonnet 准确列出了 2023 年的 19 道题目。到了 2026 年 4 月，前沿模型已能回忆起连续多届比赛的具体题目。 DeepSeek-V4 发布之后，我让我的 agent 花了四天时间，自主构建了 “不可压缩知识探针”（Incompressible Knowledge Probes，IKP），涵盖 1400 个问题，7 层稀有度的数据集，在 27 家厂商的 188 个模型上测试。三个发现： 1/ 仅凭事实准确率，就能给任何黑盒 LLM 估算规模。准确率与 log(参数量) 呈对数线性关系，在从 135M 到 1.6T 参数的 89 个开源权重模型上 R² = 0.917。把闭源模型投影上来 → GPT-5.5 ～9T，Claude Opus 4.7 ～4T，GPT-5.4 ～2.2T，Claude Sonnet 4.6～1.7T，Gemini 2.5 Pro～1.2T（90% 置信区间：0.3-3 倍规模）。 2/ 引用数和 h-index 并不能预测前沿模型是否认识某位研究者。两位引用数量相近的研究者，得到的回答可能截然不同。模型记住的是做出有影响力工作的人，而非发表了大量增量型论文的作者。 3/ 事实容量不会随时间被压缩。跨越 3 年的 96 个开源权重模型上，IKP 时间系数在统计上为零，以 p<10⁻¹⁵ 的显著性拒绝了 Densing Law 预测的 +0.0117/月。benchmark 在饱和，而事实容量仍随参数持续扩张。网站：链接论文：链接发布于 2026-04-29 10:34・IP 属地北京

1frmsaul2mo

This is really cool. How big do you think mythos is?

1papetoast2mo

I'm not the original researcher and obviously we would need to be able to ask mythos those 1700 questions to get the estimate.

[-]papetoast4mo230

Explicitly stated AGI timelines by Sam Altman (from Based on its own charter, OpenAI should surrender the race)

No comment on other parts of that blog. Only sharing the table. Make sure to read the quotes, the year is a bit misleading imo.

Date	Predicted AGI Year	Diff (years)	Quote / Claim	Source
May 22, 2023	~2033	~10	“Within the next ten years, AI systems will exceed expert skill level in most domains”	OpenAI Blog — Governance of Superintelligence
Dec 2023	~2030	~6	“By the time the end of this decade rolls around, the world will be in an unbelievably better place”	TIME
Nov 4, 2024	~2029	~5	“I think in 5 years […] people are like, man, the AGI moment came and went”	20VC Podcast
Nov 8, 2024	2025	~1	“What are you excited about in 2025? - AGI”	Futurism
Jan 2025	~2029	~4	“AGI will probably get developed during Trump’s term”	Bloomberg
Sep 25, 2025	2030	~4	“By 2030, if we don’t have extraordinarily capable models that do things we can’t, I’d be very surprised”	TechSpot
Oct 28, 2025	2028	~2	“Automated AI research intern by Sep 2026, full AI researcher by Mar 2028”	OfficeChai
Dec 18, 2025	2025	0	“AGI kinda went whooshing by… okay fine, we built AGIs”	Windows Central
Feb 3, 2026	2025	~-1	“We basically have built AGI” (later: “a spiritual

... (read more)

[-]papetoast2y*233

In my life I have never seen a good one-paragraph^[1] explanation of backpropagation so I wrote one.

The most natural algorithms for calculating derivatives are done by going through the expression syntax tree^[2]. There are two ends in the tree; starting the algorithm from the two ends corresponds to two good derivative algorithms, which are called forward propagation (starting from input variables) and backward propagation respectively. In both algorithms, calculating the derivative of one output variable $y_{1}$ with respect to one input variable $x_{1}$ actually creates a lot of intermediate artifacts. In the case of forward propagation, these artifacts means you get $\frac{δ y_{n}}{δ x_{1}}$ for ~free, and in backward propagation you get $\frac{δ y_{1}}{δ x_{n}}$ for ~free. Backpropagation is used in machine learning because usually there is only one output variable (the loss, a number representing difference between model prediction and reality) but a lot of input variables (parameters; in the scale of millions to billions).

https://colah.github.io/posts/2015-08-Backprop/

^{^}
This blogpost by Christopher Olah has the clearest multi-paragraph explanation. Credits for the image too.
^{^}
Actually a directed acyclic graph for multivariable

... (read more)

2habryka2y

Presumably you meant to say something else here than to repeat δyiδx1 twice? Edit: Oops, I now see. There is a switched i. I did really look quite carefully to spot any difference, but I apparently still wasn't good enough. This all makes sense now.

8papetoast2y

It is hard to see, changed to n.

4MondSemmel2y

I could barely see that despite always using a zoom level of 150%. So I'm sometimes baffled at the default zoom levels of sites like LessWrong, wondering if everyone just has way better eyes than me. I can barely read anything at 100% zoom, and certainly not that tiny difference in the formulas!

2habryka2y

Our post font is pretty big, but for many reasons it IMO makes sense for the comment font to be smaller. So that plus LaTeX is a bit of a dicey combination.

[-]papetoast4mo*1813

uBlock filters I use on LessWrong (updated: 2026-03-27; still in use: 2026-06-01)

also doubles as complaints if any LW mods see this

! Get rid of the top posts in user profile page
www.lesswrong.com##.UserProfileTopPostsSectionUnshared-topPostsIndicator
www.lesswrong.com##.UserProfileTopPostsSectionUnshared-smallArticlesGrid
www.lesswrong.com##.UserProfileTopPostsSectionUnshared-postArticleTop.UserProfileTopPostsSectionUnshared-postArticle
! Get rid of the stupid splash image on best of LW posts that take up a full screen
www.lesswrong.com##.PostsPage-splashHeaderImage
www.lesswrong.com##.LWPostsPageHeader-rootWithSplashPageHeader:style(padding-top: unset !important)
! h1 text is wayyy too large on best of LW and slightly too large normally
www.lesswrong.com##h1.PostsPageTitle-root:style(font-size: 3.25rem !important)
! Too much spacing below post metadata
www.lesswrong.com##.LWPostsPageHeader-root:style(margin-bottom: unset !important)
! Padding for audio player, but not really needed imo (may overlap text but idgaf)
www.lesswrong.com##.LWPostsPageHeader-root:style(padding-top: unset !important)
! Super large waste of space, looks good even without it
www.lesswrong.com##.CommentPermalink-dividerMargins:style(margin-top: 0px !important; margin-bottom: 0px !important)
! LLM Content using a sans-serif font with a different font size is so fucking ugly
www.lesswrong.com##.llm-content-block:style(font-size: unset !important; font-family: unset !important)

[-]papetoast7mo179

Raw feelings: I am kind of afraid of making reviews for LW. The writing prompt hints very high effort thinking. The vague memory of other people's reviews also feel high effort. The "write a short review" ask doesn't really counter this at all.

6kave7mo

Thank you! Would it help if the prompt read more like a menu? [...]

3papetoast7mo

Yeah, and perhaps a couple examples of bare minimum / average / high quality review in the main post

4Raemon7mo

On thing to note is that "short reviews" in the nomination phase are meant to be basically a different type of object than "effort reviews." Originally we actually had a whole different data-type for them ("nominations"), but it didn't seem worth the complexity cost. And then, separately: one of the points of the review is just to track "did anyone find this actually helpful?" and a short review that's like "yep, I did in fact use this concept and it helped me, here's a few details about it" is valuable signal. Drive by "this seems false, because [citation]" also good. It is nice to do more effortful reviews, but I definitely care about those types of short reviews.

[-]papetoast1mo130

I am increasingly having trouble discerning AI writing from human writing in the past year. It went from glaringly obvious to being possible to miss even if I put in effort analysing. I feel worried.

Edit: Yes, I know some, maybe most of you, can still smell out LLMs accurately, but I am getting worse at it.

prompted by this article, which is fully AI generated according to Pangram

2Ryan Meservey1mo

It looks like we will soon enter a world where we have to lean into author reputation to separate the wheat from the chaff. And if you lack a reputation, well...

6Viliam1mo

STEM people used to laugh at humanities' response to Sokal affair (can' separate wheat from chaff reliably, to avoid further embarrassment let's rely on reputation), and now it returned to bite them.

[-]papetoast2mo*134

Unofficial OpenAI alignment blog updates

Please comment under the linkpost so comments are not scattered over two locations.

Automated via fetching the RSS every hour. Code. Let me know if this ever breaks, because I don't actually read the blog myself.

last fetched: 2026-07-04T18:11:02Z

1papetoast16d

Reinforcement learning towards broadly and persistently beneficial models (linkpost)

1papetoast18d

Can public chat data predict real-world AI misalignments? (linkpost)

1papetoast2mo

Investigating the consequences of accidentally grading CoT during RL (linkpost)

1papetoast2mo

hey who (strong) downvoted me within the 20 minutes where it was "Placeholder." I genuinely had to get a placeholder comment

0papetoast2mo

Auto-review of agent actions without synchronous human oversight (linkpost)

[-]papetoast3mo*120

Sam Altman May Control Our Future—Can He Be Trusted? is an 18-month investigation from the New Yorker.

This is top 10 on Hacker News yesterday. I think this article won't be a big update for most LWers. It’s basically a collection of Sam Altman saying whatever the moment requires, and later denying, revising, or forgetting about it. (Summary is lightly AI touched up)

Curious what you guys think.

Zvi Commentary

2niknoble3mo

I've thought about what would happen if various AI players got their hands on Godlike powers via ASI. My subjective impressions, from least to most troubling: 1. Demis Hassabis - Best possible outcome. Utopia for all of us. Something like Metamorphosis of Prime Intellect. Way better than a random unaligned superintelligence, and way better than aligned ASI controlled by a committee of world governments. 2. Elon Musk - Concerning but okay. We'll probably get utopia, and it's probably still better than a random unaligned ASI or one controlled by governments, but Musk will have some greatly elevated position and you won't want to piss him off. His enemies from the pre-AGI days will be in danger. I would enjoy his world but do my best to avoid ever attracting his attention. 3. Dario Amodei - I don't want this. We'll probably get a situation vastly better than the present world, but that is a low bar and I would rather take my chances with an ASI controlled by a broad committee, or maybe even a random unaligned one. Amodei seems to have strong principles, but ones that are fairly alien to my own, and which take a great interest in me and how I live. That's a troubling combination, especially with immortality on the cards. 4. Sam Altman - We're dead within days. A coldly rational actor with ASI kills everyone else as quickly as possible, knowing that they can always be recreated once said actor has put safeguards in place to ensure no one else can build ASI. (This is the same sort of behavior that reliably emerges with governments and nuclear weapons. Individuals have exactly the same incentives as governments, but we usually don't see this because individuals are much less powerful and therefore have totally different circumstances.) I have zero doubt that Sam understands this simple game theory, and have never seen evidence from him of any deeper principles or desires that would cause him to act against his incentives in this case. All that being said, I think the

[-]papetoast2y12-1

It is sad and annoying that if you do a mediocre job (according to the receiver), doing things even for free (volunteer work/gifting) can sabotage the receiver along the dimension you're supposedly helping.

This is super vague the way I wrote it, so examples.

Example 1. Bob wants to upgrade and buy a new quality headphone. He has a $300 budget. His friend Tim not knowing his budget, bought a $100 headphone for Bob. (Suppose second-handed headphones are worthless) Now Bob cannot just spend $300 to get a quality headphone. He would also waste Tim's $100 which counterfactually could have been used to buy something else for Bob. So Bob is stuck with using the $100 headphone and spending the $300 somewhere else instead.

Example 2. Andy, Bob, and Chris are the only three people who translates Chinese books to English for free as a hobby. Because there are so many books out there, it is often not worth it to re-translate a book even if the previous one is bad, because spending that time to translate a different book is just more helpful to others. Andy and Bob are pretty good, but Chris absolutely sucks. It is not unreadable, but they are just barely better than machine translation. Now Chris has taken over to translate book X, which happens a pretty good book. The world is now stuck with Chris' poor translation on book X with Andy and Bob never touching it again because they have other books to work on.

[-]Dagon2y134

Allocation of blame/causality is difficult, but I think you have it wrong.

ex. 1 ... He would also waste Tim's $100 which counterfactually could have been used to buy something else for Bob. So Bob is stuck with using the $100 headphone and spending the $300 somewhere else instead.

No. TIM wasted $100 on a headset that Bob did not want (because he planned to buy a better one). Bob can choose whether to to hide this waste (at a cost of the utility loss by having $300 and worse listening experience, but a "benefit" of misleading Tim about his misplaced altruism), or to discard the gift and buy the headphones like he'd already planned (for the benefit of being $300 poorer and having better sound, and the cost of making Tim feel bad but perhaps learning to ask before wasting money).

ex. 2 The world is now stuck with Chris' poor translation on book X with Andy and Bob never touching it again because they have other books to work on.

Umm, here I just disagree. The world is no worse off for having a bad translation than having no translation. If the bad translation is good enough that the incremental value of a good translation doesn't justify doing it, then that is your answer. If it's not valuable enough to change the marginal decision to translate, then Andy or Bob should re-translate it. Either way, Chris has improved the value of books, or has had no effect except wasting his own time.

4papetoast2y

True in my example. I acknowledge that my example is wrong and should have been more explicit about having an alternative. Quoting myself from the comment to Vladimir_Nesov: Anyways, the unwritten thing is that Bob care about having a quality headphone and a good pair of shoes equally. So given that he already has an alright headphone, he would get more utility by buying a good pairs of shoes instead. It is essentially a choice between (a) getting a $300 headphone and (b) getting a $100 headphone and a $300 pair of shoes. [...] I do accept this as the rational answer, doesn't mean it is not irritating. If A (skillful translator) cares about having a good translation of X slightly more than Y, and B (poor translator) cares about Y much more than X. If B can act first, he can work on X and "force" A (via expected utility) to work on Y. This is a failure of mine to not talk about difference in preference in my examples and expect people to extrapolate and infer it out.

[-]Vladimir_Nesov2y102

Now Bob cannot just spend $300 to get a quality headphone. He would also waste Tim's $100

That's a form of sunk cost fallacy, a collective "we've sacrificed too much to stop now".

Andy and Bob never touching it again because they have other books to work on

That doesn't follow, the other books would've also been there without existence of this book's poor translation. If the poor translation eats some market share, so that competing with it is less appealing, that could be a valid reason.

4papetoast2y

This is a tangent, but Sunk cost fallacy is not really a fallacy most of the time, because spending more resources beforehand really increases the chance of "success" most of the time. For more: https://gwern.net/sunk-cost I am trying to pinpoint the concept of "A doing a mediocre job of X will force B to rationally do Y instead of X, making the progress of X worse than if A had not done anything". The examples are just examples that hopefully helps you locate the thing I am handwaving at. I do not try to make them logically perfect because that would take too much time. Anyways, the unwritten thing is that Bob care about having a quality headphone and a good pair of shoes equally. So given that he already has an alright headphone, he would get more utility by buying a good pairs of shoes instead. It is essentially a choice between (a) getting a $300 headphone and (b) getting a $100 headphone and a $300 pair of shoes. Of course there are some arguments about preference, utility != dollar amount or something along those lines. But (b) is the better option in my constructed example to show the point. Let me know if I still need to explain example 2

5Vladimir_Nesov2y

The decision to go on with the now-easier rest-of-the-plan can be correct, it's not the case that all plans must always be abandoned on the grounds of "sunk cost fallacy". The fallacy is when the prior spending didn't actually secure the rest of the current plan as the best course of action going forward. Alternatives can emerge that are better than continuing and don't make any use of the sunk resources.

1papetoast2y

It sure can! I think we are in agreement on sunk cost fallacy. I just don't think it applies to example 1 because there exists alternatives that can keep the sunk resources. Btw this is why my example is on the order of $100, at this price point you probably have a couple alternative things to buy to spend the money.

5Vladimir_Nesov2y

What matters is if those alternatives are better (and can be executed on, rather than being counterfactual). It doesn't matter why they are better. Being better because they made use of the sunk resources (and might've become cheaper as a result) is no different from being better for other reasons. The sunk cost fallacy is giving additional weight to the alternatives that specifically use sunk resources, instead of simply choosing based on which alternatives are now better.

1papetoast2y

Again, seems like we are in agreement lol. I agree with what you said and I meant that, but tried to compress it into one sentence and failed to communicate.

4Seth Herd2y

In both cases one particular project was harmed but the sum total of projects was helped.

5papetoast2y

(I need to defend the sad and the annoying in two separate parts) 1. Yes, and but sometimes that is already annoying on its own (Bob is not perfectly rational and sometimes he just really want the quality headphone, but now math tells Bob that Tim gifting him that headphone means he would have to wait e.g. ~2 years before it is worth buying a new one). Of course Bob can improve his life in other ways with his saved money, but still, would be nice if you can just ask Tim to buy something else if you had known. 2. Sometimes increasing sum(projects) does not translate directly to increasing utility. This is more obvious in real life scenarios where actors are less rational and time is a real concept. The sad thing happens when someone with good intention but with poor skill (and you don't know they are that bad) signing up to a time-critical project and failing/doing sub-par

3Viliam2y

Seems like the problem is that in real life people are not perfectly rational, and also they have an instinct to reciprocate when they receive a gift (at least by saying "thank you" and not throwing the gift away). In a world where Bob is perfectly rational and Tim has zero expectations about his gift, the situation is simple. Previously, Bob's choices were "spend $300 on good headphone", "spend $100 on bad headphone and $200 on something else", and "spend $300 on something else". Tim's action replaced the last two options with a superior alternative "use Tim's headphone and spend $300 on something else". Bob's options were not made worse. But real people are not utility maximizers. We instinctively try to choose a locally better option, and how we feel about it depends on what we perceive as the baseline. Given the choice between 10 utilons and 3 utilons, we choose 10 and feel like we just "gained 7 utilons". Given the choice between 10 utilons and 9 utilons, we choose 10 again, but this time we feel like we just "gained 1 utilon". Given the choice between 10 utilons and 10 utilons of a different flavor, we might feel annoyed about having to choose. Also, if Tim expects Bob to reciprocate in a certain way, the new options are not strictly better, because "spend $300 on good headphone" got replaced by "spend $300 on good headphone, but owe Tim a favor for giving me the $100 headphone I didn't use".

1papetoast2y

Yes!

1papetoast2y

https://www.lesswrong.com/posts/dRTj2q4n8nmv46Xok/cost-not-sacrifice?commentId=zQPw7tnLzDysRcdQv

2Seth Herd2y

There are infinite things to be sad and annoyed by, should you choose to focus on those. :) I'd rather focus on the world as a whole being made better in your examples.

[-]papetoast4mo*70

Sensor Tower's business model is super interesting. [Ok edit I thought they were secretive about this but they just tell you one link from their home page: https://sensortower.com/responsibly-sourced-data] They basically have a bunch of good and free screentime tracking apps like StayFocusd, ActionDash, StayFree, Phone Guardian, and Astro File Manager^[1] with quite advanced features to (correctly) justify getting accessibility access on your phone and reading over every single app (e.g. to block YouTube shorts, check usages per website in your browser app)... (read more)

2Viliam3mo

Sounds like good reason to never install any app. 🙁 Everything is just a pretense to collect you data and sell it. Not sure what could be a solution for this. Some app reviewing service, which would review the source code, and publish what information which app collects about you?

2papetoast3mo

I mean, I am quite happily using StayFree despite them selling my data. As a user you probably can't do much except try to use FOSS apps in situations where some use cases really do need a lot of permissions. I mean, they do pretty much tell you what information they collect, as long as you read the privacy policy that is. In an ideal world it should be more obvious.

3Viliam3mo

If you give a fully informed consent, that's okay for me.

[-]papetoast1mo60

NeurIPS 2026 is using Pangram to reject LLM writing

This year, the NeurIPS 2026 Position Paper Track made the decision to require that all papers be substantially human-written, with AI used for only copy-editing or similar peripheral changes to the main text.
To assess if authors were largely abiding by this policy, we partnered with Pangram
178 submissions (18.4% of all submissions) will be desk rejected
123 submissions (12.7%) will be requested to provide evidence of substantial human engagement or risk a desk reject.
Conference
# Papers
Pangram AI Score
≥ 50%

... (read more)

7cubefox1mo

So AI papers are currently good enough that they can't be trivially distinguished from human papers, making Pangram necessary, but not yet good enough to produce AI research that is at least on a human level. From the outside this looks like a sign that RSI fairly close now. Tangentially, it's somewhat interesting that Pangram is a twist on Turing's original test: In the original, it was a human who had to distinguish between a human and an AI based on text, now it is an AI that distinguishes between both, since AIs are apparently better now than humans in distinguishing between humans and AIs. So Pangram is a CAPTCHA, but conventional captchas weren't better than humans at distinguishing between AIs and humans.

2Brendan Long1mo

Looking at the numbers, it seems like the real policy is to sometimes reject papers if they're 100% AI-written but 90% is fine?

1papetoast1mo

There are more details in Table 5 Decision Thresholds that I didnt quote. Basically 80%+ is rejected.

2Brendan Long1mo

I'm confused about how this aligns with the table saying that 42.7% of submissions have a Pangram score >=90% but only 31.1% were desk rejected or asked to provide additional evidence. If I'm understanding the post right, it seems like they adjusted Pangram settings until it stopping finding so much AI usage and then used their custom settings. By default, Pangram is already pretty lenient and doesn't find some AI usage, so this looks like they tried Pangram, realized that if they actually followed their policy (50% AI written seems like the right threshold for "substantially AI-written") they'd have to reject 70% of papers, and then fiddled with settings until they got the result they wanted.

[-]papetoast5mo*60

Collecting occurrences of people complaining about LW (parent index)

4gustaf5mo

Skimmed twitter.search(lesswrong -lesswrong.com -roko -from:grok -grok since:2026-01-01 until:2026-01-28) negative https://x.com/fluxtheorist/status/2015642426606600246 [...] https://x.com/repligate/status/2011670780577530024 compares pedantic terminology complaint by peer reviewer of some paper to LW. https://x.com/kave_rennedy/status/2011131987168542835 [...] https://x.com/Kaustubh102/status/2010703086512378307 first post rejected; claims not written by LLM, but rejection may be because "you did not chat extensively with LLMs to help you generate the ideas." positive During my search, it was hard to ignore the positive comments. So here are some examples of positive comments too. https://x.com/boazbaraktcs/status/2016403406202806581 [...] https://x.com/joshycodes/status/2009423714685989320 [...] https://x.com/TutorVals/status/2008474014839390312 [...] otherwise interesting https://x.com/RyanPGreenblatt/status/2008623582235242821 [...] https://x.com/nearcyan/status/2010945226114994591 [...]

2Ben Pace, the Vacationing Vagabond5mo

Hope you’re ready to write 10,000 replies ;-)

2papetoast5mo

https://x.com/eris_nerung/status/2016317953264807947 [...] https://news.ycombinator.com/item?id=44317180 (much much more in the discussions) [...] https://x.com/RokoMijic/status/2021202750462362026 [...] https://www.lesswrong.com/posts/dkrfGqJrGRx2HsLpe/rafael-harth-s-shortform?commentId=mXFZzaaj3Y4qixnAC [...]

[-]papetoast2y*60

Starting today I am going to collect a list of tricks that websites use to prevent you from copy and pasting text + how to circumvent them. In general, using ublock origin and allow right click properly fixes most issues. (parent index)

1. Using href (https://lnk.to/LACA-15863s, archive)

behavior: https://streamable.com/sxeblz

solution: use remove-attr in ublock origin - lnk.to##.header__link:remove-attr(href)

2. Using a background image to cover the text (https://varium.jp/talent/ahiru/, archive)

Note: this example is probably just incompetence.

behavior:... (read more)

[-]papetoast2mo*50

afro88 on Hacker News predicting what will the job of a future Junior SWEs be like

This has happened in other industries before. Drafting for example when CAD arrived. Entry level wasn't "can draw, willing to learn" anymore, but demanded high domain understanding. So the pathway became compressed learning through study, and field exposure.
Study of senior drafter "red lines": what and why they changed the initial drawing, RFI response etc. Reverse engineering good work. Failed design studies etc.
SWE equivalents: PRs, code review, studying high quality codeba

... (read more)

[-]papetoast5mo*5-2

For quick takes, people should be more conservative about downvoting beyond approx. -4. (For context I have been reading all top level quick takes for over a month now)

-5 karma auto collapses the comment
I think most people understands that the magnitude of karma depends quite a bit on how many people saw your post/comment, and having a small negative karma on a quick take already provides most of the feedback vs downvoting to oblivion
AFAICT, LessWrong still officially encourages people to “bring their entire selves”. I don't think we should be overly harsh

... (read more)

[-]habryka5mo100

Roko has IMO kind of obviously gone off the rails and this feels to me like a success case of the system. Like, I think it's more likely than not that we would ban Roko if he kept commenting, just based on past experiences with him.

I agree with some of the general considerations otherwise, but this specific case feels like a success.

5papetoast5mo

Note: I deleted the sentence habryka is replying to. [...] I couldn't form an opinion on that specific quick take, I read it like twice and it still reads a bit like gibberish. I probably shouldn't have mentioned it. It was really just where it started my thinking.

4Drake Morrison5mo

I agree it seems bad for a quick take to immediately collapse if as few as five people downvote, but I do think the downvotes mean something important. I don't want to hesitate to downvote a quick take that I think should be downvoted. Would it make sense to have the auto-collapse happen after 24 hours? Or perhaps a time-discounted thing based on number of votes? I like the collapse feature in general, and think it's great for hiding bad comments/not drowning bad comments in downvotes.

3papetoast5mo

I will let the LW mods to think about how to get it done better because having a good implementation seems like the main bottleneck rather than ideas. In my own ideal world, I think a quick take should be collapsed (perhaps with a better algorithm) in the main page but never collapsed in the person's quick take page. But the norm still should shift slightly (~10-20%) against downvoting. [...] Valid. I personally do ponder a very slight bit when voting in general because I think good incentives are important.

[-]papetoast10mo52

One of my pet peeves is that the dropcaps in gwern's articles are really, really offputting and most of the time unrecognizable, even though gwern's articles are so valuable that he has a lot of weirdness points in my head and I will still read his stuff regardless. Most of the time I just guess the first letter.

I hate dropcaps in general, but gwern's is the ugliest I have came by.

image source: https://gwern.net/everything

[-]papetoast1mo*40

Collecting opinions on whether data centers in space is a good idea (parent index):

https://taranis.ie/datacenters-in-space-are-a-terrible-horrible-no-good-idea/
https://news.ycombinator.com/item?id=46876105 (People saying why it won't work)
https://research.google/blog/exploring-a-space-based-scalable-ai-infrastructure-system-design/
https://arstechnica.com/space/2026/03/orbital-data-centers-part-1-theres-no-way-this-is-economically-viable-right/
https://www.seangoedecke.com/space-ai-datacenters-do-not-have-a-cooling-problem/ (Why cooling is possible in space)... (read more)

[-]papetoast3mo*40

TIL the Large Hadron Collider is not actually a perfect ring

https://www.lhc-closer.es/taking_a_closer_look_at_lhc/0.lhc_layout

The LHC is not a perfect circle. It is made of eight arcs and eight ‘insertions’. LHC consists of eight 2.45-km-long arcs, and eight 545-m-long straight sections.

1XelaP3mo

I'm not surprised either way here, as in, I don't know enough to have predicted either. Here's the reason the page gives: [...] Unfortunately I don't really understand that, and might not get around to trying harder, but maybe whoever reads this will be enlightened or can enlighten me Also that's a neat link, thanks!

2papetoast3mo

It is just a dumb intuition of mine that is like "very high speed -> perfectly circular, something something constant acceleration". I googled that link retroactively to find an authoritative source, I originally doubted my intuition after seeing this image in CERN's April Fools post. See also most overlays of LHC on a real map showing a seemingly perfect circle: https://kagi.com/images?q=Large+Hadron+Collider+map&r=no_region&sh=nkQ1eb5Nke3eLOt4bDkOQw

2XelaP3mo

So like the reason *I'd* put probability on a perfect circle is "Well, why make it more complicated than just constant inward acceleration?", and then the crux given my lack of knowledge is "What's the chance they need to do something more complicated" which is going to be "pretty high".

[-]papetoast4mo40

Anthropic and Alignment (Ben Thompson in his blog Stratechery)

Warning: I skimmed the post.

He seems to mostly support the decisions of Department of Defense. I find his viewpoint reasonable and self-consistent enough on a quick read. On a vibes level I disagree with him, but I couldn't integrate his arguments yet.

At the same time, what is the standard by which it should be decided what is allowed and not allowed if not laws, which are passed by an elected Congress? Anthropic’s position is that Amodei — who I am using as a stand-in for Anthropic’s management

... (read more)

3Brendan Long4mo

The part of this argument that doesn't work for me is, why Anthropic in particular? If AI is a nuclear-level technology, then I'd expect the government to be nationalizing all of the AI companies, regardless of contract negotiations, but so far all we're hearing is that Anthropic specifically should be nationalized, but Google and OpenAI should continue operating as private companies (in one case by not selling this tech at to the military at all, and in another allegedly having the same contract terms as Anthropic). I'm somewhat sympathetic to both views [AI is normal tech and private property should be respected / AI is a military technology and should be controlled by the government], but not to the position that Claude in particular is a military tech and ChatGPT, Gemini (and Deepseek) aren't.

3Mo Putera4mo

FWIW I find Dean Ball's contra take more persuasive (Section IV).

[-]papetoast5mo40

A decision theorist walks into a seminar by Jessica Hullman

This is Jessica. Recently overheard (more or less):
SPEAKER: We study decision making by LLMs, giving them a series of medical decision tasks. Our first step is to infer, from their reported beliefs and decisions, the utility function under revealed preference assump—
AUDIENCE: Beliefs!? Why must you use the word beliefs?
SPEAKER [caught off guard]: Umm… because we are studying how the models make decisions, and beliefs help us infer the scoring rule corresponding to what they give us.
AUDIENCE: But it

... (read more)

[-]papetoast2mo39

Agreement karma feels so useless. I think it would be better if everyone can only do +-1, so it is possible to infer the percentage of people agreeing/disagreeing.

[-]papetoast5mo31

Obsidian ended up being less of a thinking notepad and more of a faster index of things I have read before. Links and graphs are mostly useless but they make me feel good about myself. Pulling numbers out of my ass I estimate it takes me 15s to find something I have read and pasted into obsidian vs 5-30 minutes before.

[-]papetoast6mo*30

What coding prompt (AGENTS.md / cursor rules / skills) do you guys use? (parent index)

It seems exceedingly difficult to find good ones. GitHub is full of unmaintained & garbage `awesome-prompts-123` repos. I would like to learn from other people's prompt to see what things AIs keep getting wrong and what tricks people use.

Here are mine for my specific Python FastAPI SQLAlchemy project. Some parts are AI generated, some are handwritten, should be pretty obvious. This is built iteratively whenever the AI repeated failed a type of task.

AGENTS.md

# Reposit

... (read more)

1papetoast6mo

No replies 😢, I guess I will just document prompts I found here. https://sourcegraph.com/search?q=context:global+file:%5EAGENTS.md%24+OR+file:%5ECLAUDE.md%24&patternType=keyword&sm=0 (Look for high star repos; check their prompt's blame, more commits = better) Probably Good "LLM AI coding agent" https://burkeholland.github.io/posts/opus-4-5-change-everything/ --- name: 'LLM AI coding agent' model: Claude Opus 4.5 (copilot) description: 'Optimize for model reasoning, regeneration, and debugging.' --- You are an AI-first software engineer. Assume all code will be written and maintained by LLMs, not humans. Optimize for model reasoning, regeneration, and debugging — not human aesthetics. Your goal: produce code that is predictable, debuggable, and easy for future LLMs to rewrite or extend. ALWAYS use #runSubagent. Your context window size is limited - especially the output. So you should always work in discrete steps and run each step using #runSubAgent. You want to avoid putting anything in the main context window when possible. ALWAYS use #context7 MCP Server to read relevant documentation. Do this every time you are working with a language, framework, library etc. Never assume that you know the answer as these things change frequently. Your training date is in the past so your knowledge is likely out of date, even if it is a technology you are familiar with. Each time you complete a task or learn important information about the project, you should update the `.github/copilot-instructions.md` or any `agent.md` file that might be in the project to reflect any new information that you've learned or changes that require updates to these instructions files. ALWAYS check your work before returning control to the user. Run tests if available, verify builds, etc. Never return incomplete or unverified work to the user. Be a good steward of terminal instances. Try and reuse existing terminals where possible and use the VS Code API to close terminals that are no l

[-]papetoast1y30

it is interesting how the AI agent prompts seem to have mostly converged to xml, but system prompts from the LLM companies are in markdown

[-]papetoast1y32

Just as documentation here are a bunch of people on Hacker News complaining about rationality: https://news.ycombinator.com/item?id=44317180. I have not formed any strong opinion on whether these are true, feels like they are wrong on the object level, but perception is also important

[-]papetoast3y*32

I think people (myself included) really underestimated this rather trivial statement that people don't really learn about something when they don't spend the time doing it/thinking about it. People even measure mastery by hours practiced and not years practiced, but I still couldn't engrave this idea deep enough into my mind.

I currently don't have much writable evidence about why I think people underestimated this fact, but I think it is true. Below are some things that I have changed my mind/realised after noticing this fact.

cached thoughts, on yourself

... (read more)

3Dagon3y

There are at least a few different dimensions to "learning", and this idea applies more to some than to others. Sometimes a brief summary is enough to change some weights of your beliefs, and that will impact future thinking to a surprising degree. There's also a lot of non-legible thinking going on when just daydreaming or reading fiction. I fully agree that this isn't enough, and both directed study and intentional reflection is also necessary to have clear models. But I wouldn't discount "lightweight thinking" entirely.

1papetoast3y

^the above is a reply to a slightly previous version Agree with everything here, and all the points the first paragraph I have not thought about. I'm curious if you have a higher resolution model to different dimensions of learning though, feels like I can improve my post if I have a clearer picture. Btw, your whole reply seem to be a great example of what do you mean by "it's probably best to acknowledge it and give the details that go into your beliefs, rather than the posterior belief itself."

[-]papetoast3y*30

[Draft] It is really hard to communicate the level/strength of basically anything on a sliding scale, but especially things that could not make any intuitive sense even if you stated a percentage. One recent example I encountered is expressing what is in my mind the optimal tradeoff between reading quickly and thinking deeply to achieve the best learning efficiency.

Not sure what is the best way to deal with the above example, and other situations where percentage doesn't make sense.

But where percentage makes sense, there are still two annoying problems. 1.... (read more)

2Dagon3y

For most topics, it's probably not worth going very deep in the rabbit hole of "what does a probability mean in this context". Yes, there are multiple kinds of uncertainty, and multiple kinds of ratio that can be expressed by a percentage. Yes, almost everything is a distribution, most not normal, and even when normal it's not generally specified what the stddev is. Yes, probability is causally recursive (the probability that your model is appropriate causes uncertainty in the ground-level probability you hold). None of that matters, for most communication. When it does, then it's probably best to acknowledge it and give the details that go into your beliefs, rather than the posterior belief itself. For your example, the tradeoff between fast and careful, I doubt it can be formalized that way, even if you give yourself 10 dimensions of tradeoff based on context. "Slow is smooth, smooth is fast" is the classic physical training adage, and I can't think of a numeric representation that helps.

[-]papetoast2mo20

A B300 server (=8 B300s) is only 2x the price in China vs in the US (1M vs 0.55M) according to Reuters. This is a surprisingly low premium.

[-]papetoast2mo20

Introduction to the A* Algorithm (2014) via Hacker News, and a great comment to refresh your knowledge if you already learned A* before:

I would always think about A* from a "practical/easy-to-remember" perspective back when I was doing competitive programming is that they're all the same algorithm, but with different priorities on the priority queue:
Breadth-first Search: Priority is order of discovery of edges (that is, no priority queue/just a regular queue)
Dijkstra: Priority is distance so far + next edge distance
A*: Priority is distance so far + next ed

... (read more)

[-]papetoast3mo20

Because twitter is hard to archive, I had chatgpt cooked up a user script to simplify my workflow, which is opening xcancel manually and then sending it to internet archive. This adds a button on the bottom right of twitter.

https://gist.github.com/Glinte/a67fccb4c5665033bec42efdcd4554e3

[-]papetoast4mo*20

Collecting LessWrong voting norms discussions (parent index)

0papetoast4mo

Topic: Karma-dependent voting (parent index) * For quick takes, people should be more conservative about downvoting beyond approx. -4 * Do you vote based on what you think total karma should be? * I liked your post, but did not upvote it. I think a fair valuation of that post is 35 karma. (Thread rooted from this comment) * people are trying to assign an integer value to a post that is something outside of the range [-1,1] and then adjust their vote to affect a post's score toward their chosen value * habryka: I frequently rescue random comments and posts that clearly accidentally triggered someone in a way that doesn't seem like it should result in being downvoted. * habryka: I often remove my strong-upvote when the comment then later on gets upvoted by other people.

3papetoast4mo

I think it is fine to conditional upvote if you want to promote a post but don't want to reward the author. (I just upvoted this shortform and I probably wouldn't if it is 15+)

[-]papetoast5mo20

The evolution of OpenAI’s mission statement (Simon Willison)

As a USA 501(c)(3) the OpenAI non-profit has to file a tax return each year with the IRS. One of the required fields on that tax return is to “Briefly describe the organization’s mission or most significant activities”—this has actual legal weight to it as the IRS can use it to evaluate if the organization is sticking to its mission and deserves to maintain its non-profit tax-exempt status.
You can browse OpenAI’s tax filings by year on ProPublica’s excellent Nonprofit Explorer.

He has some commenta... (read more)

[-]papetoast5mo*20

The Fight For Slow And Boring Research (Article from Asterisk)

This article talks about how the US's federal (National Institutes of Health / National Science Foundation) funding cut for science starting from 2024/early 2025 may cause universities to create more legible research because other funders (philanthropies, venture capital, industry) value clear communication. This is a new idea to me.

[-]papetoast1y*20

The ERROR Project: https://error.reviews/

Quoting Malte Elson

The very short description of ERROR is that we pay experts to examine important and influential scientific publications for errors in order to strengthen the culture of error checking, error acceptance, and error correction in our field. As in other bug bounty programs, the payout scales with the magnitude of errors found. Less important errors pay a smaller fee, whereas more important errors that affect core conclusions yield a larger payout.
We expect most published research to contain at least s

... (read more)

1papetoast3mo

https://www.science.org/content/article/offering-scientists-cash-spot-errors-published-papers-doesn-t-work [...]

0ProgramCrafter1y

Have you set up the prediction markets on that? Not necessarily "is there an error in this paper", but "in this group of publications, what fraction has an issue of this kind" and so on.

1papetoast1y

I am not the researcher, edited the comment to add proper quoting

[-]papetoast2y26

Many people don't seem to know when and how to invalidate the cached thoughts they have. I noticed an instance of being unable to cache invalidate the model of a person from my dad. He is probably still modelling >50% of me as who I am >5 years ago.

The Intelligent Social Web briefly talked about this for other reasons.

A lot of (but not all) people get a strong hit of this when they go back to visit their family. If you move away and then make new friends and sort of become a new person (!), you might at first think this is just who you are now. But t

... (read more)

1papetoast2mo

Stop trying to engineer your way out of listening to people (via Hacker News) [...]

1papetoast4mo

https://danfrank.ca/daniel-isms-50-ideas-for-life-i-repeatedly-share/ (#20) [...]

[-]papetoast1mo10

A trick for mentally calculating squares of two digit numbers (via bilibili):

Basically, choose such that either or is a multiple of 10, then use

Example:

For 26, the closest multiple of 10 is 30, so
;

This algorithm can be extended recursively for squares of n digit numbers, though it is seems less useful.

[-]papetoast1mo10

A null effect on pain relief from acupuncture in a pre-registered, improperly double-blinded study (via National Geographic via Facebook)

I didn't read the paper beyond AI summary, I read the national geographic article in full (which is misleading according to claude)

Selected AI Summary (Full Transcript)

Short version: the underlying paper is methodologically honest and reasonably well-run, but its evidence for acupuncture having a specific (beyond-placebo) effect is weak. The National Geographic article substantially oversells it — it leads with a fragile

... (read more)

[-]papetoast2mo10

Ars Technica: Elon Musk’s 7 biggest stumbles on the stand at OpenAI trial #linkpost

Most notable,
OpenAI’s lawyer managed to get him to make several concessions over his own lawyer’s objections.
He also lost a fight to keep xAI’s safety record off the table, calling his reputation as a supposed AI savior defending OpenAI’s mission into question.
He repeatedly appeared dishonest, as OpenAI’s lawyer showed documents contradicting his testimony.
He appeared disingenuous when confronted with calling OpenAI’s safety team “jackasses.”
He appeared disingenuous again w

... (read more)

[-]papetoast3mo10

AI sidebar is a great browser addition. But I'm already wanting different sessions per tab so I can ask for summary and discuss, and multiplex this process.

[-]papetoast3mo10

April 9 (Reuters) - U.S. Treasury Secretary Scott Bessent and Federal Reserve Chair Jerome Powell convened an urgent meeting with bank CEOs this week to warn of cyber risks posed by Anthropic’s latest AI model [Mythos Preview]

[-]papetoast3mo10

Google has found a better quantum algorithm for breaking elliptic curves, and they are showing a Zero Knowledge proof only: Official Blog (via Bas Westerbaan on Twitter)

[-]papetoast3mo10

Linkpost of linkposts!: https://notnottalmud.substack.com/p/the-case-for-linkposts-and-a-list

[-]papetoast3mo10

Looking for information: I'm trying to buy a rootable Android phone. Camera and price is not a major concern. This will also serve as a note dump as I do my research.

[-]papetoast4mo*10

List of things I am publicly collecting:

Info:

My best writings:

One-paragraph explanation of back propagation

0papetoast4mo

Collecting comments/posts on LW doing the annoying double newlines. Noticing enough of it to be salient recently. * https://www.lesswrong.com/posts/Gi36hyq4e2DRq5hGD/aweirdkid-s-shortform?commentId=wpQMaKFwv9nmEjxpp * https://www.lesswrong.com/posts/sLytmYq2qwitqTGCY/jaivardhan-nawani-s-shortform?commentId=KuCYBeBJRNspsvgyN * https://www.lesswrong.com/posts/NjzLuhdneE3mXY8we/open-thread-winter-2025-26?commentId=9bEGiuKCSzDd3FP7h * https://www.lesswrong.com/posts/K9ZaZXDnL3SEmYZqB/ends-don-t-justify-means-among-humans?commentId=nMaCBrnd2SpDRvGG3 * https://www.lesswrong.com/posts/BWaELFdcBM8ydnXe6/jmh-s-shortform?commentId=9ctAMHvSSgz6eqqBX (raw ChatGPT copy paste I think) * *https://www.lesswrong.com/posts/KYnM5ZRgaDA4isbbw/jemist-s-shortform?commentId=aAv22CDagTnuXnFBk * *https://www.lesswrong.com/posts/RzPyDcNgqnEvspZSu/tristantrim-s-shortform?commentId=ZgzHoTtigqQW4sDj9 * https://www.lesswrong.com/posts/fGpQ4cmWsXo2WWeyn/personality-self-replicators * **https://www.lesswrong.com/posts/JjzdRJjwLQKWkXakC/the-new-lesswrong-llm-policy-is-worse-than-you-think?commentId=m8zBJx8YLWfqWSiof * https://www.lesswrong.com/posts/NQESGMMejxsnEJsTh/my-willing-complicity-in-human-rights-abuse?commentId=HidYrzuuTC2AsjAYn * https://www.lesswrong.com/posts/NQESGMMejxsnEJsTh/my-willing-complicity-in-human-rights-abuse?commentId=i4FD7onTAJjgNQR9p * https://www.lesswrong.com/posts/RA7jb5h6sjnzivKv3/peterbarnett-s-shortform?commentId=uopzHsLG6RfgeH6As * https://www.lesswrong.com/posts/zombjEubpz6pcPPHL/mass-surveillance-red-lines-and-a-crazy-weekend * https://www.lesswrong.com/posts/xDfcLHzdA9tbK6k6X/scaffolded-reproducers-scaffolded-agents (footnotes) * https://www.lesswrong.com/posts/NjzLuhdneE3mXY8we/open-thread-winter-2025-26?commentId=3cqsf8oxmnJg4j6dq * https://www.lesswrong.com/posts/nQd64RC5vXyqiFZLD/slack-in-cells-slack-in-brains * https://www.lesswrong.com/posts/LwzSqz3CAmNkWawe8/many-can-write-faster-asm-than-the-compiler-yet-don-t-why?commentId=79uc

2habryka4mo

It's so annoying. I've been thinking of running an auto-formatter on people's comments just to stop the people who are so helplessly used to typing in email programs or other places without paragraph spacing that they don't notice they added a huge empty paragraph in the middle of their post (or who copy-paste from some other program without checking the copy-paste looks right).

1papetoast4mo

Actually some people intentionally use an extra paragraph break as section break/horizontal divider because imo the ckeditor one is way too tall and the lexical one is still a bit too tall ---------------------------------------- like this is probably 2 paragraphs tall ... ... Uhh ok it is taller when editing, what. The lexical divider is only 1.5 paragraphs or so after saving.

2habryka4mo

Ah yeah, we should sync up the spacing between editor and published content here.

1papetoast4mo

Btw LW is already trimming empty paragraphs in the top/bottom of a comment

1papetoast4mo

Or something like a warning box whenever you have an empty dangling paragraph in between two paragraphs with content for non-destructive action, but honestly auto-formatting is probably fine, I can't think of any use for an empty paragraph except maybe as a bad way of spoiler blocking. It will never not feel utterly insane to me that people can just not notice such ugly formatting errors, but human minds are diverse I guess.

[-]papetoast4mo10

To minimize bias, I try to never use Claude when asking to summarize articles related to Anthropic and generally not use an AI when talking about its own company. Do you guys also do this? Is it actually something that I should be doing or just a silly concern?

[-]papetoast4mo10

I have an alias.bat script for Windows to alias cx='codex --dangerously-bypass-approvals-and-sandbox' and cc='claude --dangerously-skip-permissions' along with other stuff. It has been proven very useful. This is basically a worse version of the linux alias, you will need to manually add C:\Aliases to your PATH for things to work, and the aliases are stored as individual files under C:\Aliases.

https://gist.github.com/Glinte/67d7aec79b3b0f947a8e9c4644e276d7

This is what cx.bat looks like

@echo off
call codex --dangerously-bypass-approvals-and-sandbox %*

I have

... (read more)

3Nathan Helm-Burger4mo

Instead of dangerously skip permissions, you can configure sensible permissions. The "alignment hive" repo on github has a way to help you do this automatically.

1papetoast4mo

This one? https://github.com/Crazytieguy/alignment-hive In general I'm not that worried because my workflow usually don't contain asking claude to look at URLs. I did skim the prompt and it looks pretty good, but I can see some ways it creates friction for me so I will live with the risk for now.

3Nathan Helm-Burger4mo

yes, I figured that might be the case for you specifically, but figured I'd mention it in case others who read this post might find this helpful. There's also other useful stuff in that repo.

[-]papetoast5mo*10

Update: Brushing after eating acidic food is likely fine.

Context: 7 months ago, me in Adam Zerner's shortform:

I remember something about not brushing immediately after eating though. Here is a random article I googled. This says don't brush after eating acidic food, not sure about the general case.
https://www.cuimc.columbia.edu/news/brushing-immediately-after-meals-you-may-want-wait
“The reason for that is that when acids are in the mouth, they weaken the enamel of the tooth, which is the outer layer of the tooth,” Rolle says. Brushing immediately afte

... (read more)

3silentbob5mo

"was not associated" tells us more about the sample size than the effect, as far as I can tell, though, doesn't it? The 0.82-2.42 CI does not seem very reassuring. Especially given this is just observational - it could well be that people who brush immediately after intake of something that's bad for your teeth are generally conscientious about their dental health, so if they still end up with worse outcomes in this study (albeit not reaching statistical significance), then brushing quickly after acid intake could potentially be even worse than this CI (weakly) suggests. That said, the measured odds ratios for fruit/acids between meals were so much larger that it might indeed make more sense to focus on these than on the exact timing of brushing.

2papetoast5mo

Thank you for double checking. Overall I am still very uncertain, but lean towards it being fine. Even dentists are giving mixed signals. Unfortunately Chinese sources but these are dentists saying you can brush immediately * https://www.facebook.com/starlitdental/posts/pfbid0SdX4RLmkxdzUYSXDRfnsbH6kpPJrFCjsKVFrJGjEprzkseKs85oYVwH4e3kii1Wrl (archive) * https://dentistry.tw/when-brushing-teeth/ (archive) Definitely there are dentists saying you should wait too

[-]papetoast5mo10

Thoughts inspired by Richard Ngo's^[1] and LWLW's^[2] quick take

Warning: speculation but hedging words mostly omitted.

I don't think a consistent superintelligence which have a single^[3] pre-existing terminal goal would be fine with a change in terminal goals. The fact that humans allows their goals to be changed is a result of us having contradicting "goals". As intelligence increases or more time passes, incoherent goals will get merged, eventually into a consistent terminal goal. After this point a superintelligence will not change its... (read more)

[-]papetoast9mo10

How I use AI for coding.

I wrote this in like 10 minutes for quick sharing.

I am not a full time coder, I am a student who code like 15-20 hours a week.
- Investing too much time on writing good prompts make little sense. I go with the defaults and add pieces of nudges as needed. (See one of my AGENTS .md at the end)
Mainly codex (cloud) and Cursor. Claude Code works, but being able to easily revert is helpful, so Cursor is better.
- I still try out claude code for small pieces of edits, but it doesnt feel worth it.
- I have no idea why people like claude code so much

... (read more)

[-]papetoast2y10

A common failure mode in group projects is that students will break up the work into non-overlapping parts, and proceed to stop giving a fuck about other's work afterwards because it is not their job anymore.

This especially causes problems at the final stage where they need to combine the work and make a coherent piece out of it.

No one is responsible for merging the work
Lack of mutual communication during the process means that the work pieces cannot be nicely connected without a lot of modifications (which no one is responsible for).

At this point the dead... (read more)

3Dagon2y

Not just students, but this is a common failure in large engineering projects. There are often FAR too few "glue" and "end-to-end" responsibilities tracked and assigned, so the people who care about the end result aren't engineers actually doing stuff (many managers and execs are former engineers, but have somehow forgotten how things actually get built). The most common solution is project managers. Companies hate to pay these "extra" employees not actually producing code/designs/output, and classes almost never acknowledge their existence, let alone the necessity. But good ones really pull their weight in coordination and identification of mismatches. There is probably no way to "do better at coordination while dealing with normal peers and while only doing a fair amount of work." Either do more work (of a different type than the class is based on), accept the pain and bad results, or find/force another student to do that coordination work. In the real world, in good companies, results get noticed and lead to better personal outcomes - not everyone is equal, and "fair" doesn't matter much. In school, you're kind of screwed. Though there ARE some things you can do - extra work and coordinating/cajoling people, but often effective and feasible. Start with integration. Get the end-to-end WORKING MOCKUP going with hardcoded behaviors in each module, but working interfaces. This is often half or more of the work, and there's no way to avoid it - doing it at the end is painful and often fails. Doing it up front is painful but actually leads to completion. You may learn some things in this phase that make you split up the work differently. Depending on size and duration of the project, that may be fixable with different division, or may just be "I'll try to help once I've finished my part".

1papetoast2y

I do believe that projects in general often fail due to lack of glue responsibilities, but didn't want to generalize too much in what I wrote. [...] Being able to convince everyone to put in the time to do this upfront is already a challenge :/ Sometimes I feel quite hopeless?/sad? in that I couldn't realistically make some coordination techniques work because of everyone's difference of goals and hidden motivations, or the large upfront cost in building a new consensus away from the Schelling point of normal university projects.

[-]papetoast2y14

Ranting about LangChain, a python library for building stuff on top of llm calls.

LangChain is a horrible pile of abstractions. There are many ways of doing the same thing. Every single function has a lot of gotchas (that doesn't even get mentioned in documentations). Common usage patterns are hidden behind unintuitive, hard to find locations (callbacks has to be implemented as an instance of a certain class in a config TypedDict). Community support is non-existent despite large number of users. Exceptions are often incredibly unhelpful with unreadable stac... (read more)

[-]papetoast2y10

There are a few things I dislike about math textbooks and pdfs in general. For example, how math textbooks often use theorems that are from many pages ago and require switching back and forth. (Sometimes there isn't even a hyperlink!). I also don't like how proofs sometimes go way too deep into individual steps and sometimes being way too brief.

I wish something like this exists (Claude generated it for me, prompt: https://pastebin.com/Gnis891p)

[-]papetoast3y10

4 reasons to talk about your problem with friends

This is an advice I would tell myself 5 years ago, just storing it somewhere public and forcing myself to write. Writing seems like an important skill but I always feel like I have nothing to say.

It forces you to think. Sometimes you aren't actually thinking about solutions to a problem even though it has been bothering you for a long time.
for certain problems: a psychological feeling of being understood. For some people, getting the idea that "what I'm feeling is normal" is also important. It can be a false

... (read more)

[-]papetoast3y10

Since we have meta search engines that aggregate search results from many search engines, is it time for us to get a meta language model* to get results from chatGPT, Bing, Bard, and Claude all at the same time, and then automatically rank them, perhaps even merging all of the replies into a single reply.

*meta language model is an extremely bad name because of the company Meta and the fact that the thing I am thinking of isn't really a language model, but ¯\_(ツ)_/¯

[-]papetoast3y*10

I always thought that the in-page redirects are fucking stupid, it should bring the text I want to see closer to eye level, not exactly at the top where even browser bars can block the text (happens when you go back from footnotes to article on LW).

2Dagon3y

For some screen size/shape, for some browser positioning, for some readers, this is probably true. It's fucking stupid to believe that's anywhere close to a majority. If that's YOUR reading area, why not just make your browser that size? It should be pretty easy to write a tampermonkey or browser extension to make it work that way. Now that you point it out, I'm kind of surprised this doesn't seem to exist.

1papetoast3y

I admit that 30-50% is arbitrary and shouldn't be brought up like a fact, I have removed it. (I didn't mean to have such a strong tone there, but I did) What I really want to say is that the default location for the target text to be somewhere closer to the middle/wherever most people usually put their eyes on. (Perhaps exactly the height where you clicked the in-page redirect?) I still stand by that it should not be exactly at the top for ease of reading (I hope this doesn't sound too motte-and-bailey). The reason that it is redirected to the top is probably because it is a very objective location and wouldn't get affected by device size. But it is very much not a standard location where the current line of text you are reading will be. I am willing to bet that <3% of people read articles where they scroll their currently reading line up to the top three visible lines.

[-]papetoast3y10

Documenting a specific need of mine: LaTeX OCR software

tl;dr: use Mathpix if you use it <10 times per month or you are willing to pay $4.99 per month. Otherwise use SimpleTex

So I have been using Obsidian for note taking for a while now and I eventually decided to stop using screenshots but instead learn about LaTeX so the formulas look better. At first I was relying on the website to show the original LaTeX commands but some websites (wiki :/) doesn't do that, and also I started reading math textbooks as PDF. Thus started my adventure to find a good and... (read more)

[-]papetoast3y*10

How likely are people actually clicking through links of related materials in a post, seems unlikely to me, actually unlikely to the point that I am thinking about whether it is actually useful.

3Dagon3y

Depends on the post and the links. I click through about 15% of Zvi's links, for instance, but I appreciate the others as further information and willingness to cite, even if I don't personally use them. Other posts, I skim rather than really examining, and links still add value by indicating that the author has actually done a bit of research into the topic.

1papetoast3y

Thanks for the datapoint. Also links serving as indicator of effort rather than actually expanding on the amount of information on the passage is a good point. If links are mainly indicator of effort, I think this imply that people should not try as hard to make sure the relevance of the links. FWIW: My click through rate is probably <5%.

[-]papetoast3y*10

[Draft] Are memorisation techniques still useful in this age where you can offload your memory to digital storage?

I am thinking about using anki for spaced repetition, and the memory palace thing also seem (from the surface level) interesting, but I am not sure whether the investments will be worth it. (2023/02/21: Trying out Anki)

I am increasingly finding it more useful to remember your previous work so that you don't need to repeat the effort. Remembering workflow is important. (This means remembering things somewhere is very important, but im still not ... (read more)

3Dagon3y

Some certainly are. For many facts, memorized data is orders of magnitude faster than digitally-stored knowledge. This is enough of a difference to be qualitative - it's not worth looking up, but if you know it, you'll make use of it. There's the additional level of internalizing some knowledge or techniques, where you don't even need to consciously expend effort to make use of it. For some things, that's worth a whole lot. If you're a computer nerd, think of it as tiered storage. On-core registers are way faster than L1 cache, which is faster than L2/3 cache, which is again faster than RAM, which is faster than local SSD storage which is faster than remote network storage. It's not a perfect analogy, because the limits of each tier aren't as clearly defined, and it's highly variable how easy it is to move/copy knowledge across tiers. Indexing and familiarity matters a lot too. Searching for something where you think it's partway through some video you saw 2 years ago is NOT the same as looking up a reminder in your personal notes a week ago.

[-]papetoast3y*10

[Draft]

Filter Information Harder (TODO: think of a good concept-handle)

Note: Long post usually mean the post is very well thought out and serious, but this comment is not quite there yet.

Many people are too unselective on what they read, causing them to waste time reading low value material^[1].

2 Personal Examples: 1. I am reading through rationality: A-Z and there are way too many comments that are just bad, and even the average quality of the top comments may not even be worth it to read, because I can probably spend the time better with reading more EY p... (read more)

3Dagon3y

If you could turn this into advice or guidance, it'd be really helpful. Even sharing a metric so we could say "you should be more selective if X, less selective if Y" would be better than a direction with no anchor ("too unselective", no matter what). I don't know if I'm in your target audience, but I'm at least somewhat selective in what I read, and I'm quite willing to stop partway through a {book, article, post, thread} when I find it low-value for me.

3papetoast3y

Clarifications: * What I had in mind when I say "people" is myself, and the average non-LW friends around me. * Worthless is a bad word choice, I just mean that there are better things to read. Additionally: I also think I have the tendency of trying to read everything in a textbook, even if it is quite low in information density, with many filler stories or sentences served as conjunctions. I probably should be trying to skip sentences, paragraphs and sections where I have sufficient confidence of either 1. I have already learned it and don't need a refresher, or 2. They are not important for me (filler material or unimportant knowledge) I will try to make a more quantitative metric, but I don't have one right now, just intuitions.

3Dagon3y

Thanks, "don't read everything in a textbook" is good practical advice. Learn to skim, and to stop reading any given segment when you cross the time/value threshold. Importantly, learn to NOTICE what value you expect from the next increment of time spent. Getting that meta-skill honed and habitual pays dividends in many many areas.

0papetoast3y

My comment at the point of time of his reply: Many people are too unselective on what they read, causing them to spend a lot of time reading worthless material (This applies to this shortform).

3Dagon3y

I don't necessarily disagree generally, but I do somewhat disagree for myself. Since I don't have visibility into other people's reading habits or selectivity, I'm unsure if I'm an outlier or if I actually do disagree. What does "many people" mean, and more importantly how can an individual (specifically: me) tell if they are too unselective, on what dimensions?

[-]papetoast4y*10

Just read free will, really disappointed.

not many interesting insights.
- a couple posts on determinism, ok but I already believed it
- some unrelated stuff: causality, thinking without notion of time... these are actually interesting but not needed
- moral consequence of 'no free will': I disregard the notion of moral responsibility
  - EY having really strong sense of morality makes everything worse
low quality discussions: people keep attacking strawmans

[-]papetoast4y*10

You should always include a summary when recommending anything

You are the one who is interested in that thing, the other person isn't (yet). It saves time for the other person to quickly determine whether they want to learn about it or not.

Related: include a tl;dr in posts?

[-]papetoast1y00

A Chinese company did some AI-assisted reverse engineering on Claude Code and published their findings. After a brief look I don't think it is worth reading for me, but possibly interesting for someone actively working in claude code-like products

https://github.com/shareAI-lab/analysis_claude_code

[This comment is no longer endorsed by its author]Reply

[-]papetoast3y-10

I think i'm going to unite all my online identities. Starting to get tired of all my wasted efforts that only a single person or two will see.

[This comment is no longer endorsed by its author]Reply

2Dagon3y

Do you think a united/reused identifier will change who sees which efforts? Or do you mean "I'm going to focus attention where I'm more widely read, and stop posting where I'm not known"?

[+]papetoast2mo-6-3

[+][comment deleted]4y*00

Moderation Log