Which personality traits are real? Stress-testing the lexical hypothesis

[-]Unnamed3y90

You may want to look into the crisis in personality psychology that was sparked by Walter Mischel's (1968) book "Personality and Assessment". There were a lot of studies, and arguments between researchers, about questions like these.

Mischel's challenge: there are often low-seeming correlations between broad personality measures and specific behaviors.

Part of the response was that the correlations are much larger if you aggregate across many behaviors. For example, instead of the correlation between an abstract rating of conscientiousness and how much a person engages in a single specific conscientiousness-related behavior, look at the correlation between an abstract rating of conscientiousness and the average of how much a person engages in 50 specific conscientiousness-related behaviors. This suggests that there is some sort of broad trend that the person carries, even if any one behavior depends on a mix of things beyond that broad trend.
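The aggregation argument can be made concrete with a small sketch. It assumes a deliberately simple toy model that is not taken from the studies themselves: every behavior correlates with the trait at the same value `r`, and the behaviors are otherwise independent of each other. The function name and the value `r = 0.3` are illustrative assumptions.

```python
import math


def aggregate_correlation(r: float, k: int) -> float:
    """Correlation between a trait and the mean of k behaviors.

    Assumed toy model (not from the original studies):
        behavior_i = r * trait + sqrt(1 - r**2) * noise_i
    where trait and every noise_i are independent with unit variance.
    """
    if not 0.0 < abs(r) <= 1.0:
        raise ValueError("r must be nonzero and within [-1, 1]")
    if k < 1:
        raise ValueError("k must be at least 1")
    # Cov(trait, mean of behaviors) = r
    # Var(mean of behaviors)        = r**2 + (1 - r**2) / k
    return r / math.sqrt(r**2 + (1.0 - r**2) / k)


if __name__ == "__main__":
    # Illustrative single-behavior correlation of 0.3 (an assumption).
    for k in (1, 10, 50):
        print(k, round(aggregate_correlation(0.3, k), 3))
```

Under these assumptions, a single-behavior correlation of 0.3 rises to roughly 0.91 once 50 behaviors are averaged, and it approaches 1 as the number of behaviors grows. Real behaviors share variance for reasons other than the trait, so actual gains would typically be smaller.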

Mischel argued for also looking for narrower patterns that are more stable for a person rather than just these broad traits, e.g. a person might pretty consistently be talkative with their friends, even if they don't consistently engage in some other extraversion-related behaviors.

[-]tailcalled3y20

Sounds neat, I will have to take a look.

One thing to add is, one way you can interpret my "correlation with lexical notion" is as saying "what happens when we average infinitely many behaviors?". Since all the traits had a high "correlation with lexical notion", it seems I got the same result as the personality researchers.

[-]tailcalled1y40

Review for 2023 Review

On the object level, this is a study about personality, and it majorly changed the way I view some personality traits:

I now see conservatism/progressivism as one of the main axes of personality.
It further cemented my perception that "well-being", or "extraversion minus neuroticism", is the strongest of the traditional personality dimensions, and that this may raise questions about what personality even means (for instance, surely well-being is not simply a biological trait).
I'm now much more skeptical about how "real" many personality traits are, including traits like "compassion" that were previously quite central to my models of personality.

I think my study on the EQ-SQ model follows in the footsteps of this, rethinking much of what I thought I knew about differential psychology.

However, I actually view the fundamental contribution of the post quite differently from this. Really, I'm trying to articulate and test theories of personality, as well as perform exploratory analyses, and I hope that I will inspire others to do so, as well as that I will become better at doing so over time. If this interests you, I would suggest you join Rationalist Psychometrics, a small discord server for this general topic.

In terms of methodology, this study is heavily focused on factor analysis. At the time of writing the post, I thought factor analysis was awesome and underrated. I still think it's great for testing the sorts of theories discussed in the post, and since such theories take up a lot of space in certain groups' discussion of differential psychology, I still think factor analysis is quite underrated.

But factor analysis is not everything. My current special interest is Linear Diffusion of Sparse Lognormals, which promises to do much better than factor analysis ... if I can get it to work. As such, while the post (and psychometrics in general) focuses quite heavily on factor analysis, I cannot wholeheartedly endorse that aspect of the post.

[-]Daniel V3y20

It's very interesting to see the intuitive approach here and there is a lot to like about how you identified something you didn't like in some personality tests (though there are some concrete ones out there), probed content domains for item generation, and settled upon correlations to assess hanging-togetherness.

But you need to incorporate your knowledge from reading about scale development and factor analysis. Obviously you've read in that space. You know you want to test item-total correlations (trait impact), multi-dimensionality (factor model loss), and criterion validity (correlation with lexical notion). Are you trying to ease us in with a primer (with different vocabulary!) or reinvent the wheel?

Let's start with the easy-goingness scale:

(+) In the evening I tend to relax and watch some videos/TV
(+) I don’t feel the need to arrange any elaborate events to go to in my free time
(+) I think it is best to take it easy about exams and interviews, rather than worrying a bunch about doing it right
(+) I think you’ve got to have low expectations of others, as otherwise they will let you down
(-) I get angry about politics
(-) I have a stressful job
(-) I don’t feel like I should have breaks at work unless I’ve “earned” them by finishing something productive
(-) I spent a lot of effort on parenting

The breadth of it is either a strength or a weakness. It'd be nice to have a construct definition or at least some gesturing at what easy-goingness actually is to gauge the face-validity of these items. Concrete items necessarily will have some domain-dependence, resulting in deficiency (e.g., someone who likes to relax and read a book will score low on item 1) or contamination (e.g., having low expectations of others might also be trait pessimism), but item 8 is really specific. It hampers the ability of this scale to capture easy-goingness among non-parents. The breadth would be good if it captured variations on easy-goingness, but instead it'd be bad if it just captures different things that don't really relate to each other. That's especially problematic because then the inference from low inter-correlations might not be that the construct is bad, but that the items just don't tap into it. You can see where I'm going with this because...

This suggests to me that Easy-Goingness is not very “real”. While it might make sense to describe a person as doing something Easy-Going, for instance when they are watching TV, it is kind of arbitrary to talk about people as being more or less Easy-Going, because it depends a lot on context/what you mean.

...indeed, the items are mainly just capturing different things, not reflecting on easy-goingness in any way. From a scale-assessment standpoint, it's great to see the results confirm my unease about the items based on simply reading them.

The fact that this is weak means that even the most Easy-Going people cannot necessarily be expected to be particularly Easy-Going in all contexts.

This statement presumes your measure reflects a higher-order easy-goingness and that context-specific easy-goingnesses are also being adequately measured.

With conservatism, on the other hand, you can see there is some context-specificity (e.g., dress vs. general social views vs. issue-based ideology), but the measure is facially better. And it hangs together better. Alternately, you might explore those contours and say you've come up with a multi-dimensional conservatism scale, just like you have a multi-dimensional creativity scale.

the “Correlation with lexical notion” was consistently close to 1, showing that the concrete and the abstract descriptors were getting at the same thing.

There's an implicit "when the concrete descriptors actually had face validity" hidden here; low correlation with the lexical notion could indicate a problem with the lexical scale or a problem with the concrete scale, or both.

Overall, I am very impressed that you presented a scary chart to start, promised you'd explain it, and successfully did so. The general takeaway from it is that the lexical hypothesis could be pretty sound and a few of these might be multidimensional in nature (or it could be that some items are good and some are bad). For the low trait impact scales, it's a question of whether the items are good and the construct isn't "real," or whether the items are just a bad measurement approach.

[-]tailcalled3y30

Thank you for your in-depth response!

But you need to incorporate your knowledge from reading about scale development and factor analysis. Obviously you've read in that space. You know you want to test item-total correlations (trait impact), multi-dimensionality (factor model loss), and criterion validity (correlation with lexical notion). Are you trying to ease us in with a primer (with different vocabulary!) or reinvent the wheel?

Good question. In retrospect, I should probably have put more effort into using standard terms. That said:

Test item-total correlations: Strictly speaking "factor loadings" would be a better term, since I did not compute it based on a correlation with a test score, but instead with a CFA-style factor model.
Multidimensionality: Maybe. Obviously it's multidimensionality that I am trying to test, but literally my score for the tests is a least-squares loss for a CFA-style factor model.
Criterion validity: Maybe. Arguably convergent/concurrent validity would be even more standard terms. But I think "Correlation with lexical notion" is more specific.
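To illustrate what "factor loadings" and a "least-squares loss for a CFA-style factor model" can look like in practice, here is a minimal sketch of a one-factor fit. It is not the code used for the post: the function name is hypothetical, the optimizer is plain coordinate descent chosen for brevity, and the example loadings are made up.

```python
def fit_one_factor(R, n_iter=500):
    """Fit one loading per item so that loadings[i] * loadings[j]
    approximates the correlation R[i][j] for every pair i != j.

    R is a square correlation matrix given as a list of lists.
    Returns (loadings, loss), where loss is the sum of squared
    residuals over the item pairs (each pair counted once).
    """
    n = len(R)
    loadings = [0.5] * n  # arbitrary positive starting point
    for _ in range(n_iter):
        for i in range(n):
            # Closed-form least-squares update for loading i,
            # holding all the other loadings fixed.
            num = sum(R[i][j] * loadings[j] for j in range(n) if j != i)
            den = sum(loadings[j] ** 2 for j in range(n) if j != i)
            if den > 0:
                loadings[i] = num / den
    loss = sum(
        (R[i][j] - loadings[i] * loadings[j]) ** 2
        for i in range(n)
        for j in range(i + 1, n)
    )
    return loadings, loss


if __name__ == "__main__":
    # Made-up loadings used to build a perfectly one-dimensional matrix.
    true_loadings = [0.8, 0.6, 0.5, 0.3]
    R = [
        [1.0 if i == j else a * b for j, b in enumerate(true_loadings)]
        for i, a in enumerate(true_loadings)
    ]
    loadings, loss = fit_one_factor(R)
    print([round(x, 3) for x in loadings], loss)
```

In this sketch the fitted loadings play the role of "trait impact", and the remaining loss indicates how badly a single dimension accounts for the item correlations. A loss near zero means one factor is enough; a large loss hints at multidimensionality.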

The breadth of it is either a strength or a weakness. It'd be nice to have a construct definition or at least some gesturing at what easy-goingness actually is to gauge the face-validity of these items.

The items are each meant to assess something from the stories I collected from people who empirically scored high or low on easy-goingness scales. So their validity criterion is not meant to be in assessing easy-goingness generally, but in assessing the thing from those stories. Here are the stories corresponding to each item:

(+) In the evening I tend to relax and watch some videos/TV

When I finish work for the day I often go straight home and jump into my pyjamas. I like to relax and watch some tv and films to unwind after a long day - usually with a glass of wine. Certain days when I come home my partner would like to travel for a couple hours to go dog walking and enjoying time outside. No matter what kind of day I have at work I am always keen to do anything my partner/family/friends would like to do as is in my nature.

(+) I don’t feel the need to arrange any elaborate events to go to in my free time

I think I don't need to always go out in evenings to feel socially connected. Rather I would sit and enjoy the quiet at home. Moreover I don't get easily flustered if people have different opinions compared to me. I don't easily get offended and can take things in a right spirit. So, I am easy to approach.

(+) I think it is best to take it easy about exams and interviews, rather than worrying a bunch about doing it right

I think you have to be going in life otherwise everything will get to you. For example when I did my exams at uni and school, you have to be easy going to cope with the stress and fear that comes with them. This can be applied to anything though, if you are not easy going the little things will get to you and you will have no chance being able to cope with the big issues in life.

(+) I think you’ve got to have low expectations of others, as otherwise they will let you down

I am easy going in that I do not have high expectations of others because I have learnt that people let you down and if there was no expectation in the first place you cannot be surprised or disappointed, on the other hand if you expect nothing you can be quite pleasantly surprised. I always try to see both sides of any argument or situation and consider that everyone has the right to an opinion that does not have to match my own.

(-) I get angry about politics

I was in a team dinner party and in a discussion about politics which I joined in with other colleagues. There was a lot of talk about dealing with education, the economy and how to restore the leadership of the labour party back then and the Iraq war, all of which I was onboard with. Then came questions about what to do with flooding immigrants and how to control them. Given my uncles both were illegal immigrants back then but managed to claw citizenship after 10 years, I was uneasy joining the discussion, and there was a lot of talk on what races were the culprits. I said only legal immigration should be allowed but did not join further, focusing on my drink instead, knowing the discussion was a race hate discussion and I was indirectly being attacked. Next 10 mins I made up an excuse to leave and left the party but faked goodbyes but was angry that I had to work with scumbag colleagues.

(-) I have a stressful job
(-) I spent a lot of effort on parenting

My life is not at all leisurely. I have two small children and a stressful job. If I were to be easy going about everything things wouldn’t get done and our lives would feel chaotic. There needs to be a balance between being easy going and highly strung. I don’t like to forget things that need doing or let people down.

(-) I don’t feel like I should have breaks at work unless I’ve “earned” them by finishing something productive

A simple example is that when I arrive at work, my boss often asks at once if I want a coffee, as he often wants one at the beginning of the day. I prefer to do some work before having a coffee, as to me it signifies a moment of relaxation and to the puritan work ethic part of me, it doesn't make sense to have a break until I have "earned" it.

There's definitely a lot from these stories that I fail to capture. Often the participants mention multiple things and I only ask about one of them. I could easily imagine the items could be made better.

item 8 is really specific. It hampers the ability of this scale to capture easy-goingness among non-parents.

Maybe.

Really in the general population, most people are parents, so I don't think it is much more specific than the other items. But my respondents skew quite young, so it is probably a problem for my sample. Might be interesting to add an interaction model to this later though.

This statement presumes your measure reflects a higher-order easy-goingness and that context-specific easy-goingnesses are also being adequately measured.
With conservatism, on the other hand, you can see there is some context-specificity (e.g., dress vs. general social views vs. issue-based ideology), but the measure is facially better. And it hangs together better. Alternately, you might explore those contours and say you've come up with a multi-dimensional conservatism scale, just like you have a multi-dimensional creativity scale.

🤷 I constructed the conservatism and easy-goingness items in the same way, so I think there is something inherent to conservatism that makes it cohere more than easy-goingness.

There's an implicit "when the concrete descriptors actually had face validity" hidden here; low correlation with the lexical notion could indicate a problem with the lexical scale or a problem with the concrete scale, or both.

I think of it as an empirical test of the concrete descriptors' validity. That is, the abstract descriptors have face validity, and if these are highly correlated with the concrete descriptors, then at least we know the concrete descriptors are not measuring anything other than the traits they are intended to measure.


The Big Five personality factors were originally derived by asking people to rate themselves on a large number of personality adjectives, and using statistics to find the biggest clusters of related descriptors. Other tests have been developed through other methods, many of which don’t primarily focus on abstract adjectives, though for reasons I won’t get into right now, I think they have a lot of dependence on the lexical hypothesis.


Of course, this is a subtle, complex question which depends on what exactly one means by “real”. I define the notion of “realness” I focus on later in the post, but other notions may be relevant for other purposes.


Because it is inherently difficult to measure behavior, I still had to rely on self-report surveys.


These are internal reliability, i.e. the sort of person who says “I like a leisurely lifestyle” is also more likely to say “I have a slow pace to my life”; test-retest reliability, i.e. the sort of person who says “I like a leisurely lifestyle” today will also tend to do so tomorrow, in a month, in a year, or in a decade; inter-rater reliability, i.e. if a person says “I like a leisurely lifestyle” then their friends and family will also tend to say “They like a leisurely lifestyle”; criterion validity, i.e. the sort of person who says “I like a leisurely lifestyle” scores higher on some objective criterion of leisurely lifestyle such as amount of vacation days; and maybe also heritability, i.e. if one twin in a pair says “I like a leisurely lifestyle” then the other twin likely says so too.


The narrower the trait you are considering, the stronger the associated correlations will be. To see this, consider the absurd example where you are only considering a specific behavior, say watching TV. Any trait has a correlation of 1 with itself, so watching TV would have a Trait Impact of 1. It is only by abstracting over multiple different behaviors that Trait Impact can be nontrivial.


Since the different questions don’t correlate perfectly internally, e.g. “I enjoy cooking food for other people” and “I like to dance with people at parties” only correlate at 0.24, we can’t exactly expect abstract “sociability” to correlate perfectly with either. So I adjust for the reduction in correlation that would be expected from imperfect internal correlations.
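One standard way to make this kind of adjustment is the classical correction for attenuation, sketched below. This is an assumption about the general idea only: the footnote does not give the exact formula used in the post, which may instead come out of the factor model itself. The function name and the numbers in the demo are hypothetical.

```python
import math


def disattenuate(r_observed, reliability_x, reliability_y=1.0):
    """Classical (Spearman) correction for attenuation.

    Divides an observed correlation by the square root of the product
    of the two measures' reliabilities, estimating what the correlation
    would be if both were measured without noise.
    """
    for rel in (reliability_x, reliability_y):
        if not 0.0 < rel <= 1.0:
            raise ValueError("reliabilities must be in (0, 1]")
    return r_observed / math.sqrt(reliability_x * reliability_y)


if __name__ == "__main__":
    # Hypothetical numbers: an observed correlation of 0.45 between a
    # concrete scale with reliability 0.6 and an abstract rating that is
    # treated as perfectly reliable.
    print(round(disattenuate(0.45, 0.6), 3))
```

A corrected value can exceed 1 when the reliability estimate is too low, which is one reason to treat such adjusted correlations as rough estimates.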


Not necessarily for all purposes. Just because a trait is weak by my measures does not mean it cannot be relevant by other measures. Talk with personality researchers and read their papers if you want to find out what criteria they care about.

