LESSWRONG
catherio
Comments

evhub's Shortform
catherio10mo1810

COI: I work at Anthropic

I confirmed internally (which felt personally important for me to do) that our partnership with Palantir is still subject to the same terms outlined in the June post "Expanding Access to Claude for Government":

For example, we have crafted a set of contractual exceptions to our general Usage Policy that are carefully calibrated to enable beneficial uses by carefully selected government agencies. These allow Claude to be used for legally authorized foreign intelligence analysis, such as combating human trafficking, identifying covert influence or sabotage campaigns, and providing warning in advance of potential military activities, opening a window for diplomacy to prevent or deter them. All other restrictions in our general Usage Policy, including those concerning disinformation campaigns, the design or use of weapons, censorship, and malicious cyber operations, remain.

The contractual exceptions are explained here (very short, easy to read): https://support.anthropic.com/en/articles/9528712-exceptions-to-our-usage-policy

The core of that page is as follows, emphasis added by me:

For example, with carefully selected government entities, we may allow foreign intelligence analysis in accordance with applicable law. All other use restrictions in our Usage Policy, including those prohibiting use for disinformation campaigns, the design or use of weapons, censorship, domestic surveillance, and malicious cyber operations, remain.

This is all public (in Anthropic's up-to-date support.anthropic.com portal). Additionally, it was announced in June, when Anthropic first laid out its intentions and approach around government.

My understanding of Anthropic strategy
catherio3y42

A "Core Views on AI Safety" post is now available at https://www.anthropic.com/index/core-views-on-ai-safety

(Linkpost for that is here: https://www.lesswrong.com/posts/xhKr5KtvdJRssMeJ3/anthropic-s-core-views-on-ai-safety.)

Appendix: How to run a successful Hamming circle
catherio3y30

I’ve run Hamming circles within CFAR contexts a few times, and once outside. Tips from outside:

Timing can be tricky here! If you do 4x 20m with breaks, and you’re doing this in an evening, then by the time you get to the last person, people might be tired.

Especially so if you started with the Hamming Questions worksheet exercise (linked as a prereq at the top of the post).

I think next time I would drop to 15 each, and keep the worksheet.

Transformer Circuits
catherio4yΩ14250

Thanks for the writeup! The first paper covers the first half of the video series, more or less. I've been working on a second paper which will focus primarily on the induction bump phenomenon (and other things described in the second half of the video series), so much more to come there!

Coordination Skills I Wish I Had For the Pandemic
catherio4y90

I appreciate the concept of "Numerical-Emotional Literacy". In fact, this is what I personally think/feel the "rationalist project" should be. To the extent I am a "rationalist", what I mean by that is precisely that knowing what I value, and pursuing numerical-emotional literacy around it, is important to me.

COVID-19: home stretch and fourth wave Q&A
catherio5y90

To make in-line adjustments, grab a copy of the spreadsheet (https://www.microcovid.org/spreadsheet) and do anything you like to it!

COVID-19: home stretch and fourth wave Q&A
catherio5y70

Also, if you live alone and don't have any set agreements with anyone else, then the "budgeting" lens is sort of just a useful tool to guide thinking. Absent pod agreements, as an individual decisionmaker, you should just spend uCoV when it's worth the tradeoff, and not when it's not.

COVID-19: home stretch and fourth wave Q&A
catherio5y30

You could think about it as an "annualized" risk more than an "annual" risk: more like "192 points per week, in a typical week, on average", which kind of amortizes out, and less like "you have 10k and once you spend it you're done".
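A minimal sketch of that arithmetic, assuming the 10k figure is a yearly budget spread evenly over 52 weeks. The function names here are hypothetical and are not taken from the microCOVID spreadsheet or calculator.

```python
WEEKS_PER_YEAR = 52  # assumption: the annual budget is spread evenly over 52 weeks


def weekly_allowance(annual_budget: float) -> float:
    """Average points available per week for a given annual budget."""
    return annual_budget / WEEKS_PER_YEAR


def annualized(weekly_spend: list[float]) -> float:
    """Project a yearly total from the average of the observed weeks."""
    return sum(weekly_spend) / len(weekly_spend) * WEEKS_PER_YEAR


# A 10k annual budget works out to roughly 192 points in a typical week.
typical_week = weekly_allowance(10_000)

# Uneven weeks (hypothetical numbers) still average out to about the same yearly rate.
projected_year = annualized([100, 300, 192, 177])
```

The point of the second function is that a heavy week is not "over budget" by itself; what matters is the running average.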

microCOVID.org: A tool to estimate COVID risk from common activities
catherio5y20

There is now a Wired article about this tool and the process of creating it: https://www.wired.com/story/group-house-covid-risk-points/

I think the reporter did a great job of capturing what an "SF group house" is like and how to live a kind of "high IQ / high EQ" rationalist-inspired life, so this might be a thing one could send to friends/family about "how we do things".

Dario Amodei leaves OpenAI
catherio5y150

It's not just Dario, it's a larger subset of OpenAI splitting off: "He and a handful of OpenAI colleagues are planning a new project, which they tell us will probably focus less on product development and more on research. We support their move and we’re grateful for the time we’ve spent working together."

Two non-obvious lessons from microcovid.org (196 karma, 4y, 4 comments)
microCOVID.org: A tool to estimate COVID risk from common activities (176 karma, 5y, 36 comments)
Prioritizing COVID-19 interventions & individual donations (11 karma, 5y, 0 comments)
Unsolved research problems vs. real-world threat models (18 karma, Ω, 6y, 2 comments)