x

LESSWRONG
LW

Zhu Xiaohu — LessWrong

Zhu Xiaohu

Zhu Xiaohu

Message

2

4

7y

Zhu Xiaohu

2

7y

;

Zhu Xiaohu has not written any posts yet.

Replying toMy Assessment of the Chinese AI Safety Community

My Assessment of the Chinese AI Safety Community

https://www.lesswrong.com/posts/3eAstSeZthYh2o6sh/ai-safety-through-operational-physics-why-resource?utm_campaign=post_share&utm_source=link

2

0

Replying toTowards understanding-based safety evaluations

Towards understanding-based safety evaluations

I have a pretty fundamental concern with these sorts of techniques as a mechanism for eventually assessing alignment

that would lead to safety or alignment goodharting problem.

1

0

Replying toMy Assessment of the Chinese AI Safety Community

My Assessment of the Chinese AI Safety Community

Hi. Thanks for mentioning us.

Unlike main labs or companies in China, we are doing fundamental research work on the ontological crisis problem with model theory from mathematical logic trying to set a new base for analyzing and preventing the crisis.

Due to our lacking of funding and restricted intellectual resources, the process is slower, but we will share our work when ready.

2

10

0

Replying toDecision theory and zero-sum game theory, NP and PSPACE

Decision theory and zero-sum game theory, NP and PSPACE

Mention a recent interesting work here: On the Complexity of Computing Markov Perfect Equilibrium in General-Sum Stochastic Games gave a related analysis on the comuting of Markov PE for RL agents.

1

0