Martin Vlach

If you get an email from aisafetyresearch@gmail.com, that is most likely me. I also read that inbox weekly, so you can pass a message into my mind that way.
Other ~personal contacts: https://linktr.ee/uhuge 

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by Newest
Project Vend: Can Claude run a small shop?
Martin Vlach · 1mo · 10

It's provided the current time together with the other ~20k system-prompt tokens, so that has a substantially more diluted influence on the behaviours, no?

So You Think You've Awoken ChatGPT
Martin Vlach · 2mo · 10

Folks like this guy hit it at hyperspeed: https://www.facebook.com/reel/1130046385837121/?mibextid=rS40aB7S9Ucbxw6v

I still remember a university teacher explaining how early TV transmissions were said to very often include/display ghosts of dead people, especially dead relatives.

As the tech matures from an art into routine engineering, these phenomena, or hallucinations, evaporate.

Energy-Based Transformers are Scalable Learners and Thinkers
Martin Vlach · 2mo · 40

You seem to report one OOM less than this figure: https://alexiglad.github.io/blog/2025/ebt/#:~:text=a%20log%20function).-,Figure%208,-%3A%20Scaling%20for

Open Thread - Summer 2025
Martin Vlach · 2mo · 10

The link to the Induction section on https://www.lesswrong.com/lw/dhg/an_intuitive_explanation_of_solomonoff_induction/#induction seems broken on mobile Chrome, @habryka.

Open Thread - Summer 2025
Martin Vlach · 2mo · 20

I've heard that hypothesis in a review of that Anthropic blog post, likely by AI Explained, or maybe by bycloud.

They called it "Chekhov's gun".

Open Thread - Summer 2025
Martin Vlach · 2mo · 10

What's your view on sceptical claims about RL on transformer LMs, like https://arxiv.org/abs/2504.13837v2, or the claim that instructing for CoT yields better results than <thinking> training?

Open Source Search (Summary)
Martin Vlach · 2mo · 10

Not the content I'd expect to be labeled AI Capabilities, although I see how that labeling could be justified.

By the way, if I write an article about LMs generating SVG, is that plaintext, and if I put up an SVG illustration, is that an image, not plaintext?

Martin Vlach's Shortform
Martin Vlach · 2mo · 10

Trivial, but do token-based LMs follow an instruction like "only output the tokens '1', '2', '3'" in cases where, without that instruction, they would output 123 as a single token?
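
A minimal sketch of what this hinges on, assuming the tiktoken library and its cl100k_base encoding (a stand-in; other models' tokenizers may split numbers differently): compare how "123" tokenizes on its own versus digit by digit.

```python
# Minimal tokenization check -- assumes tiktoken and the cl100k_base encoding;
# other models' tokenizers may split "123" differently.
import tiktoken

enc = tiktoken.get_encoding("cl100k_base")

for text in ["123", "1 2 3"]:
    ids = enc.encode(text)                   # token ids for the string
    pieces = [enc.decode([i]) for i in ids]  # each id decoded back to text
    print(f"{text!r} -> {len(ids)} token(s): {pieces}")
```

If "123" comes back as a single token, the instruction asks the model to prefer a multi-token spelling over the continuation its tokenizer makes most natural, which is exactly the tension the question points at.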

Martin Vlach's Shortform
Martin Vlach · 3mo · 10

I'd update my take from a very pessimistic, gloomy one to an (additionally) excited one: more intelligent models building a clear view of the person they interact with is a sign of emerging empathy, which is a hopeful property for alignment/respect.

Vincent Li's Shortform
Martin Vlach · 3mo · 20

False Trichotomy?

Your model assumes that one cannot be all three; however, some roles demand it, and in reality people do navigate all three traits. My top example would be empathic project managers.

Reply
Zombies · 1y · (+52/-50)

4 · Draft: A concise theory of agentic consciousness · 3mo · 2
0 · Thou shalt not command an alighned AI · 4mo · 4
8 · G.D. as Capitalist Evolution, and the claim for humanity's (temporary) upper hand · 4mo · 3
6 · Would it be useful to collect the contexts, where various LLMs think the same? (Question) · 2y · 1
1 · Martin Vlach's Shortform · 3y · 35