JonahS

Comments

Requesting Questions For A 2017 LessWrong Survey

I would like to see an (optional) personality test section - email me at jsinick@gmail.com if you're interested in the possibility, as I have some detailed thoughts.

How does personality vary across US cities?

Yes, this is something that I've wondered about quite a bit specifically in connection with the variation in conscientiousness and agreeableness by religion. I plan on partially addressing this issue by discussing some objective behavioral proxies to the personality traits in later posts.

How does personality vary across US cities?

What I had in mind was that the apparent low average conscientiousness in the Bay Area might have been one of the cultural factors that drew rationalists who are involved in the in-person community to the location. But of course the interpretation that you raise is also a possibility.

How does personality vary across US cities?

Glad you liked it :-).

So I'd be interested to hear a little more info on methodology - what programming language(s) you used, how you generated the graphs, etc.

I used R for this analysis. Some resources that you might find relevant:

And depending on how far back most of this data was collected, plausibly most of the Berkeley respondents were high school or college students (UC Berkeley alone has over 35,000 students), since for a while that was the main demographic of Facebook users, and probably for a while longer it was the main demographic of Facebook users willing to take personality tests.

Douglas_Knight is correct – the average age of users is quite low, at ~26 years old both for the high conscientiousness cities and the low conscientiousness cities.

CFAR's new mission statement (on our website)

Physics is established, so one can defer to existing authorities and get right answers about physics. Starting a well-run laundromat is also established, so ditto. Physics and laundromat-running both have well-established feedback loops that have validated their basic processes in ways third parties can see are valid.

Depending on which parts of physics one has in mind, this seems possibly almost exactly backwards (!!). Quoting from Vladimir_M's post Some Heuristics for Evaluating the Soundness of the Academic Mainstream in Unfamiliar Fields:

If a research area has reached a dead end and further progress is impossible except perhaps if some extraordinary path-breaking genius shows the way, or in an area that has never even had a viable and sound approach to begin with, it’s unrealistic to expect that members of the academic establishment will openly admit this situation and decide it’s time for a career change. What will likely happen instead is that they’ll continue producing output that will have all the superficial trappings of science and sound scholarship, but will in fact be increasingly pointless and detached from reality.

Arguably, some areas of theoretical physics have reached this state, if we are to trust the critics like Lee Smolin. I am not a physicist, and I cannot judge directly if Smolin and the other similar critics are right, but some powerful evidence for this came several years ago in the form of the Bogdanoff affair, which demonstrated that highly credentialed physicists in some areas can find it difficult, perhaps even impossible, to distinguish sound work from a well-contrived nonsensical imitation.

The reference to Smolin is presumably to The Trouble With Physics: The Rise of String Theory, the Fall of a Science, and What Comes Next . Penrose's recent book Fashion, Faith, and Fantasy in the New Physics of the Universe also seems relevant.

CFAR’s new focus, and AI Safety

A few nitpicks on choice of "Brier-boosting" as a description of CFAR's approach:

Predictive power is maximized when Brier score is minimized

Brier score is the sum of squared differences between the probabilities assigned to events and indicator variables that are 1 or 0 according to whether each event did or did not occur. Good calibration therefore corresponds to minimizing Brier score rather than maximizing it, and "Brier-boosting" suggests maximization.

What's referred to as "quadratic score" is essentially the same as the negative of Brier score, and so maximizing quadratic score corresponds to maximizing predictive power.
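As a quick illustration (a minimal Python sketch; the function names are mine, and nothing here depends on the choice of language), the per-event Brier increment and its quadratic-score counterpart:

```python
def brier_increment(p, outcome):
    """Squared difference between the forecast probability and the 0/1 outcome."""
    return (p - outcome) ** 2

def quadratic_increment(p, outcome):
    """Negative of the Brier increment: higher is better, so it is maximized."""
    return -brier_increment(p, outcome)

# A sharper forecast of an event that occurs receives a lower (better) Brier score.
assert brier_increment(0.9, 1) < brier_increment(0.5, 1)
```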

Brier score fails to capture our intuitions about assignment of small probabilities

A more substantive point is that even though the Brier score is minimized by being well-calibrated, the way in which it varies with the probability assigned to an event does not correspond to our intuitions about how good a probabilistic prediction is. For example, suppose four observers A, B, C and D assigned probabilities 0.5, 0.4, 0.01 and 0.000001 (respectively) to an event E occurring, and the event turns out to occur. Intuitively, B's prediction is only slightly worse than A's, whereas D's prediction is much worse than C's. But the difference between the increases in B's and A's Brier scores is 0.36 - 0.25 = 0.11, which is much larger than the corresponding difference for D and C, which is approximately 0.02.
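The arithmetic can be checked directly; a minimal Python sketch using the four probabilities from the example:

```python
probabilities = {"A": 0.5, "B": 0.4, "C": 0.01, "D": 0.000001}

# The event occurs, so each observer's Brier score increases by (p - 1)^2.
increments = {obs: (p - 1) ** 2 for obs, p in probabilities.items()}

diff_BA = increments["B"] - increments["A"]  # 0.36 - 0.25 = 0.11
diff_DC = increments["D"] - increments["C"]  # roughly 0.02
```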

Brier score is not constant across mathematically equivalent formulations of the same prediction

Suppose that a basketball player is to attempt three free throws. Observer A predicts that the player makes each one with probability p. Observer B accepts observer A's estimate, notes that it implies a probability of p^3 that the player makes all three free throws, and makes that single prediction instead.

Then if the player makes all three free throws, observer A's Brier score increases by

3*(1 - p)^2

while observer B's Brier score increases by

(1 - p^3)^2

But these two expressions are not equal in general, e.g. for p = 0.9 the first is 0.03 and the second is 0.073441. So changes to Brier score depend on the formulation of a prediction as opposed to the prediction itself.
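Plugging in p = 0.9, as in the example (a minimal Python sketch):

```python
p = 0.9

# Observer A scores the three free throws as three separate events.
brier_A = 3 * (1 - p) ** 2   # 3 * 0.01 = 0.03

# Observer B scores the single conjunctive event "all three are made".
brier_B = (1 - p ** 3) ** 2  # 0.271^2 = 0.073441

# The same prediction, differently formulated, yields different increments.
assert brier_A != brier_B
```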

======

The logarithmic scoring rule handles small probabilities well and is invariant under changes to the representation of a prediction, and so is preferable. I first learned of this from Eliezer's essay A Technical Explanation of a Technical Explanation.
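The invariance is easy to verify for the free-throw example above, since log(p^3) = 3 log(p); a minimal Python sketch (treating the log score as a loss, i.e. the negative log of the probability assigned to what actually happened):

```python
import math

p = 0.9

# Scoring each of the three made free throws separately:
per_throw = 3 * (-math.log(p))

# Scoring the single conjunctive prediction "all three are made":
conjunction = -math.log(p ** 3)

# Unlike the Brier score, the two formulations agree (up to floating point).
assert math.isclose(per_throw, conjunction)
```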

Minimizing logarithmic score is equivalent to maximizing the likelihood function for logistic regression / binary classification. Unfortunately, the phrase "likelihood boosting" has one more syllable than "Brier boosting" and doesn't have the same alliterative ring to it, so I don't have an actionable alternative suggestion :P.

On the importance of Less Wrong, or another single conversational locus

Brian Tomasik's article Why I Prefer Public Conversations is relevant to the following:

I suspect that most of the value generation from having a single shared conversational locus is not captured by the individual generating the value (I suspect there is much distributed value from having "a conversation" with better structural integrity / more coherence, but that the value created thereby is pretty distributed). Insofar as there are "externalized benefits" to be had by blogging/commenting/reading from a common platform, it may make sense to regard oneself as exercising civic virtue by doing so, and to deliberately do so as one of the uses of one's "make the world better" effort. (At least if we can build up toward in fact having a single locus.)

A Review of Signal Data Science

Wait, your category (ii) is surely exactly what we care about here.

Yes, I see how my last message was ambiguous.

What I had in mind in bringing up category (ii) is that we've had some students who came in with a priori worse near-term employment prospects than the usual range of bootcamp attendees, who are now in better positions than they had been and who got what they were looking to get from the program, while not yet having $100k+ paying jobs. And most students who would have gotten $100k+ paying jobs even if they hadn't attended appear to have benefited from attending the program.

The nature of the value that we have to add is very much specific to the student.

A Review of Signal Data Science

Hello! I'm a cofounder of Signal Data Science.

Because our students have come into the program from very heterogeneous backgrounds (ranging from high school dropout to math PhD with years of experience as a software engineer), summary statistics along the lines that you're looking for are less informative than might seem to be the case prima facie. In particular, we don't yet have a meaningfully large sample of students who don't fall into one of the categories of (i) people who would have gotten high-paying jobs anyway and (ii) people who one wouldn't expect to have gotten high-paying jobs by now, based on their backgrounds.

If you're interested in the possibility of attending the program, we encourage you to fill out our short application form. If it seems like it might be a good fit for you, we'd be happy to provide detailed answers to any questions that you might have about job placement.

An update on Signal Data Science (an intensive data science training program)

Yes, that was supposed to be June 24th! We have a third one from July 5th – August 24th. There are still spaces in the program if you're interested in attending.
