How I repeatedly failed to use Tobit modelling on censored data

[-]BrassLion2y200

I have the luxury of reading this years after it was posted (going through the D&D.Sci archives and this was linked there), so you may actually have an answer to this question: did the model work? That is, did your client use it and save/ make money?

[-]abstractapplic2y92

Also, strong-upvoted for asking "so, with X years of hindsight, how did this pan out?" on an old post. More people should do that.

[-]abstractapplic2y70

Before circumstances let me answer that question, the client got bought out by a bigger company, which was (and is) a lot more cagey about both hiring contractors and sharing internal details with outsiders; last I heard, the client's absorbed remnants are still sometimes using my modelling approach, but I have no idea how much they're using it, how much they're relying on it, or to what extent it's benefiting them.

[-]BrassLion2y10

Thanks for the answer. Sad that you never get an answer, although this sort of thing (organizational/personnel changes at the client makes them drop your work / never give feedback) is not uncommon in tech in my experience.

[-]gwern4y*130

Perhaps I'm missing background here, but it seems like much of this is caused by the demand for a bespoke max-likelihood solution for exactly gamma Tobit censoring.

I'm not entirely surprised if it's not off-the-shelf, but then why not use some modeling framework? Both JAGS and Stan support censored/truncated data and also gamma distributions (I know because I've used all 4), so I would expect you could write down the censored model easily and turn the crank. Somewhat more exotically, the DL frameworks like Jax are underappreciated in how broad their autodiff support is and how they can implement stuff like SEMs no sweat. If you really need a MLE, you can either use flat priors & pretend that's the MLE, or save compute & use Stan's optimization routines to get the mode/MLE fast. (I am not familiar with PyMC or the MCMC frameworks in Python-land, so I have no idea if they could do this, but censoring/gamma aren't too exotic so I expect they can, and if not, I assume you can call out to JAGS/Stan without much difficulty.)

Since you don't mention any crazy realtime requirements, other odd requirements that JAGS/Stan couldn't model, and only needing occasional updates to finetune the model (less than every few seconds, sounds like), this sounds like it would've been adequate and way easier.

(Typo: 'Cornelis'.)

[-]abstractapplic4y40

I did things this way because my applied stats knowledge is almost entirely self-taught, with all the random gaps in knowledge that implies. Thank you for letting me know about Stan and related techs: while it's hard to tell whether they would have been a better match for my modelling context (which had some relevant weirdnesses I can't share because confidentiality), they definitely would have made a better backup plan than "halt a key part of your organization for a few days every time you need a new model". I'll be sure to look into MCMC next time I deal with a comparable problem.

(Typo: 'Cornelis'.)

Fixed, thank you again.

[-]TLW4y40

Someone called Cornellis Simon Meijer, apparently. It transpires that Meijer was tired of trigonometric functions, gamma functions, step functions and Bessel functions being different things, and invented a function which had all of them as special cases.

Is there a name for this sort of prose? It vaguely reminds me of Things I Won't Work With in a weird way.

*****

I'd be concerned that you're working in an adversarial environment, and predicting adversarial environments by modelling past behavior is, uh, suboptimal at best. (See also, every attempt to beat the stock market ever.)

[-]gwern2y21

It vaguely reminds me of Things I Won't Work With in a weird way.

'War stories' or 'cooler talk'?

[-]espoire1mo10

Thanks a bunch for linking that Things I Won't Work With listing. I've learned more about chemistry in the last hour than I do in most years.

^{^}

Yes, working on this kind of problem did inspire two of my data science challenges.

^{^}

If you’ve used Ordinary Least-Squares, you’ve used MLE: minimizing the product of Gaussian probabilities for an assumed-constant value of sigma reduces to minimizing the sum of squared errors, as

^{^}

"You're telling me you use Wolfram Alpha for your differentiations?" No, dear Reader, I'm telling you I use Wolfram Alpha to check my differentiations. Sometimes I check them in advance of doing the math myself, but if I use the results for anything important I always go through the steps by hand.

^{^}

Under idealized Gaussian conditions, σ = ~1.25 * MAE. (Thank you, Taleb.)

LESSWRONG
is fundraising!
LW

LESSWRONG
is fundraising!
LW

51

How I repeatedly failed to use Tobit modelling on censored data

51

51

In which our protagonist identifies the nature of the problem

In which our protagonist selects an angle of attack

In which our protagonist encounters an unexpected major issue

Interlude: three alternatives

In which our protagonist produces a valid proof of concept

In which our protagonist encounters an unexpected Meijer issue

Interlude: two more alternatives

In which our protagonist contemplates gamma error

In which our protagonist concludes the project