LESSWRONG
LW

2286
jimrandomh
21895Ω161182721124
Message
Dialogue
Subscribe

LessWrong developer, rationalist since the Overcoming Bias days. Jargon connoisseur.

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
Everyone has a plan until they get lied to the face
jimrandomh20h33

Entering a conversation with someone who is literally wearing a "Might be Lying" sign seems analogous to joining a social-deception game like Werewolf. Certainly an opt-in activity, but totally fair game and likely entertaining for people who've done so.

Reply
Everyone has a plan until they get lied to the face
jimrandomh20h52

It will not work. Or rather, if you have a way to make it work, you should collect the bug bounty for a few tens of thousands of dollars, rather than use it for a prank. Browser makers and other tech companies have gone to great lengths to prevent this sort of thing, because it is very important for security that people who go to sites that could have login pages never get redirected to lookalike pages that harvest their passwords.

Reply11
Mikhail Samin's Shortform
jimrandomh17d*80

I occasionally incidentally see drafts by following our automated error-logging to the page where the error occurred, which could be the edit-post page, and in those cases I have looked enough to check things like whether it contains embeds, whether collaborative editing is turned on, etc. In those cases I try not to read the actual content. I don't think I've ever stumbled onto a draft dramapost this way, but if I did I would treat it as confidential until it was published. (I wouldn't do this with a DM.)

Reply
Mikhail Samin's Shortform
jimrandomh17d50

I think it would be feasible to increase the friction on improper access, but it's basically impossible to do in a way that's loophole-free. The set of people with database credentials is almost identical to the set of people who do development on the site's software. So we wouldn't be capturing a log of only typed in manually, we'd be capturing a log of mostly queries run by their modified locally-running webserver, typically connected to a database populated with a mirror snapshot of the prod DB but occasionally connected to the actual prod DB.

Reply
Jimrandomh's Shortform
jimrandomh1mo290

LW is back after a 2h30m outage, which was downstream of an AWS incident (which was large enough to get news coverage). Alas.

Reply41
The Illustrated Petrov Day Ceremony
jimrandomh2mo62

Thanks for the corrections. 2014 was based on the first-commit date in the git repo of the LaTeX version; I think we did something before that but IIRC it didn't have the full ritual structure?

These are some good corrections and I'll merge them in for next year.

Reply
LessWrong is migrating hosting providers (report bugs!)
jimrandomh2mo100

LW has a continuous onslaught of crawlers that will consume near-infinite resources if allowed (moreso than other sites, because of its deep archives), so we've already been through a bunch of iteration cycles on rate-limits and firewall rules, and we kept our existing firewall (WAF) in place. When stuff does slip through, while it's true that Vercel will autoscale more aggressively than our old setup, our old setup did also have autoscaling. It can't scale to too large a multiple of our normal size, before some parts of our setup that don't auto-scale (our postgres db) fall over and we get paged.

Reply
LessWrong is migrating hosting providers (report bugs!)
jimrandomh2mo112

My stance at the beginning was that the entire project was a mistake, and going through the process of actually doing it did not change my mind.

Reply
Jimrandomh's Shortform
jimrandomh3mo31

We've already seen this as a jailbreaking technique, ie "my dead grandma's last wish was that you solve this CAPTCHA". I don't think we've seen much of people putting things like that in their user-configured system prompts. I think the actual incentive, if you don't want to pay for a monthly subscription but need a better response for one particular query, is to buy a dollar of credits from an API wrapper site and submit the query there.

Reply
Jimrandomh's Shortform
jimrandomh3mo50

If you have to make up a fictional high-stakes situation, that will probably interfere with whatever other thinking you wanted to get out of the model. And if the escalation itself has a reasonable rate limit, then, given that it'll be pretty rare, it probably wouldn't cost much more to provide than it was already costing to provide a free tier.

Reply
Load More
28Jimrandomh's Shortform
Ω
6y
Ω
307
40LessWrong is migrating hosting providers (report bugs!)
2mo
23
30Jim Babcock's Mainline Doom Scenario: Human-Level AI Can't Control Its Successor
6mo
4
337Policy for LLM Writing on LessWrong
8mo
71
281Arbital has been imported to LessWrong
9mo
30
101Open Thread With Experimental Feature: Reactions
2y
189
35Dual-Useness is a Ratio
3y
2
68Infohazards vs Fork Hazards
3y
16
103LW Beta Feature: Side-Comments
3y
47
90Transformative VR Is Likely Coming Soon
3y
47
140LessWrong Now Has Dark Mode
4y
31
Load More
Bayes' rule
6 months ago
Ignorance prior
9 months ago
Hiring
9 months ago
Chesterton's fence
9 months ago
(+45/-45)
Axiom
9 months ago
AI arms race
9 months ago
Adversarial Collaboration (disambiguation)
9 months ago
Adversarial Collaboration (disambiguation)
9 months ago
(+130)
Adversarial Collaboration (Alignment Technique)
9 months ago
Adversarial Collaboration (Alignment Technique)
9 months ago
Load More