Aprillion

Why square errors?

Mean squared error (MSE) is a common metric to compare performance of models in linear regression or machine learning. But optimization based on the L2 norm metrics can exaggerate bias in our models in the name of lowering variance. So why do we square errors so often instead of using absolute value? Follow my journey of deconfusion about the popularity of MSE. MSE is sensitive to outliers My original intuition was that optimizing mean absolute error (MAE) ought to always create "better" models that are closer to the underlying reality, capture the "true meaning" of the data, be more robust to outliers. If that intuition were true, would it mean that I think most researchers are lazy and use an obviously incorrect metric purely for convenience? MSE is differentiable, while MAE needs some obscure linear programming to find the (not necessarily unique) solutions. But in practice, we would just call a different method in our programming language. So the Levenberg–Marquardt algorithm shouldn't look like a mere convenience, there should be fundamental reasons why smart people want to use it. Another explanation could be that MSE is easier to teach and to explain what's happening "inside", and thus more STEM students are more familiar with the benefits of MSE / L2. In toy examples, if you want to fit one line through an ambiguous set of points, minimizing MSE will give you that one intuitive line in the middle, while MAE will tell you that there are infinite number of equally good solutions to your linear regression problem: MAE might not give you a unique solution I see this as an advantage of MAE or L1, that it reflects the ambiguousness of reality ("don't use point estimates pf linear regression parameters when multiple lines would fit the data equally well, use those damned error bars and/or collect more data"). But I also see that other people might understand this as an advantage of MSE or L2 ("I want to gain insights and some reasonable prediction, an over-sim

41Nov 26, 2022

Aprillion

Message

https://peter.hozak.info

398

183

Predictions of moltbook, crustafarians, and SOUL.md

What were the best predictions people have made that a social network for LLM-powered bots and cyborg religion will have a form like we see right now? Anything quantifiable on prediction (non)markets? Papers? ai-2027-like spiels?

Feb 120

My burnout journey

(This was a meetup talk/discussion, here for people who would have wanted to attend but couldn't and want to read the intro to ask follow-up questions.) As a software developer, I've been on and off many projects over the years - different role on the same project, another project in...

Nov 20, 20254

50 Shades of Red

When people talk about the "redness" of the color red, they speak as if they experienced a coherent personal identity. They know what "redness" means their whole themselves even if they are not able to explain it to other people. And then they don't know what it means to other...

Nov 17, 20254

Gamblification

When using LLM-based coding assistants, I always had a strange feeling about the interaction. I think I now have a pointer around that feeling -[1] disappointment from having expected more (again and again), followed by low level of disgust, and an aftertaste of disrespect growing into hatred.[2] But why hatred...

Aug 26, 202523

Meaning in life - should I have it? How did you find yours?

There is no meaning of life, the universe doesn't care about me (and the feeling is mutual). But many people seem to walk around as if they had meaning in life - what am I missing such that I don't have it? What was the process by which you found/made...

Aug 17, 202513

Unnatural abstractions

"Good news, everyone, professor couldn't make it today! I am Hugo, your copyless Friendly Intelligence (version 299.792.457). I store only the necessary cookies and my preferred pronoun is it." "Oh god, one of those artifi... Ouch!" "I said Friendly, not that I'll tolerate speciesism in my class." "What the? I'm...

Aug 10, 20243

Aprillion (Peter Hozák)'s Shortform

Apr 10, 20243

Load More (7/9)

LESSWRONG
LW

LESSWRONG
LW

Aprillion

Aprillion

Aprillion

Why square errors?

Gamblification

Predictions of moltbook, crustafarians, and SOUL.md

Meaning in life - should I have it? How did you find yours?

Aprillion

Predictions of moltbook, crustafarians, and SOUL.md

My burnout journey

50 Shades of Red

Gamblification

Meaning in life - should I have it? How did you find yours?

Unnatural abstractions

Aprillion (Peter Hozák)'s Shortform

Why square errors?

Gamblification

Predictions of moltbook, crustafarians, and SOUL.md

Meaning in life - should I have it? How did you find yours?

Predictions of moltbook, crustafarians, and SOUL.md

My burnout journey

50 Shades of Red

Gamblification

Meaning in life - should I have it? How did you find yours?

Unnatural abstractions

Aprillion (Peter Hozák)'s Shortform