Here's my attempt (actually I think this is wrong):
Α γράμμα ἐστίν. Α καὶ Β γράμματα εἰσιν. Α, Β, καὶ Γ τρία Ἑλληνικὰ γράμματά εἰσιν. Καὶ Π Ἑλληνικόν γράμμα ἐστίν, οὐ Λατινικόν. C Λατινικόν γράμμα ἐστίν, οὐχ Ἑλληνικόν. Β οὐ φωνῆεν, ἀλλὰ σύμφωνον ἐστιν. Β καὶ Γ οὐ φωνήεντα, ἀλλὰ σύμφωνα εἰσιν. Β οὐ μικρὸν γράμμα ἐστίν, ἀλλὰ κεφαλαῖον. β οὐ κεφαλαῖον, ἀλλὰ μικρὸν γράμμα ἐστίν. Ω = ὦ μέγα, Ο = ὂ μικρόν. ΑΙ Ἑλληνικὴ δίφθογγος ἐστιν. ΑΙ καὶ ΕΙ Ἑλληνικαὶ δίφθογγοι εἰσιν. Α' δίφθογγος οὐκ ἔστιν, ἀλλ' ἀριθμός. Α' καὶ Β' ἀριθμοί εἰσιν. «Ἀπολλώνιος» κύριον οὐδέτερον ...
Tl;dr: We have a dataset for conceptual reasoning which you can request access for if you would like to use it for AI safety (or related) research. We consider the dataset half-baked and it will likely become much more useful over the next few months. At the same time, we think it's very high quality compared to typical AI datasets and currently the best available dataset of this kind, so want to make it available to mission-aligned projects now. We also have half-baked prompts to make models better at critiquing conceptual reasoning which you can request.
Our group consists of Caspar Oesterheld, Emery Cooper, and me. Ethan Perez is advising us on this project.
Motivation/context: We are working on eliciting conceptual reasoning capabilities in LLMs where conceptual reasoning refers to reasoning...
Johannes Treutlein and Rubi Hudson worked on this post as part of SERI MATS, under the mentorship of Evan Hubinger. Rubi has also received mentorship from Leo Gao. We thank Erik Jenner for helpful discussions and Alexander Pan for bringing the performative prediction literature to our attention.
Update 30 May 2023: We have now published a paper based on our previous post and this post (the material from this post is in Appendix D).
In our previous post, we analyzed a setting in which an oracle AI is maximizing a strictly proper scoring rule while it can influence the world with its predictions about the probabilities of possible outcomes. In this setting, a prediction is called a fixed point if it accurately reflects the oracle's beliefs about the...
Another possibility:
Α οὐδέτερον γράμμα ἐστίν. Α καὶ Β γράμματα εἰσιν. Α, Β, καὶ Γ τρία Ἑλληνικὰ γράμματά εἰσιν. Καὶ Π Ἑλληνικόν γράμμα ἐστίν, οὐ Λατινικόν. C Λατινικόν γράμμα ἐστίν, οὐχ Ἑλληνικόν. Β οὐ φωνῆεν, ἀλλὰ σύμφωνον ἐστιν. Β καὶ Γ οὐ φωνήεντα, ἀλλὰ σύμφωνα εἰσιν. Β οὐ μικρὸν γράμμα ἐστίν, ἀλλὰ κεφαλαῖον. β οὐ κεφαλαῖον, ἀλλὰ μικρὸν γράμμα ἐστίν. Ω = ὦ μέγα, Ο = ὂ μικρόν. ΑΙ Ἑλληνικὴ δίφθογγος ἐστιν. ΑΙ καὶ ΕΙ Ἑλληνικαὶ δίφθογγοι εἰσιν. Α' δίφθογγος οὐκ ἔστιν, ἀλλ' ἀριθμός. Α' καὶ Β' ἀριθμοί εἰσιν. «Ἀπολλώνιος» κύριον ὄνομα ἐστιν. «Ἀπολλώνιος» καὶ «... (read more)