AI Construct Lexis

Mapping AI constructs, valid measurement instruments, and related behaviors and tasks

Team. Olawale (Wale) Salaudeen (MIT, Schmidt Sciences), Florian Dorner (MPI-IS, ETH Zürich), Tom Sühr (MPI-IS), Sang Truong (Stanford), Haoran Zhang (MIT), Marzyeh Ghassemi (MIT), Sanmi Koyejo (Stanford), Angelina Wang (Cornell Tech)


Funding from the Schmidt Sciences AI Institute Fellow in Residence Program

Incoming
Project under development. More details coming soon.
Try the demo: Implied Nomological Network from NeurIPS 2024 Papers

AI Construct Lexis is a collaborative effort to map and define AI constructs, valid measurement instruments, and related behaviors and tasks—building a shared framework for understanding and evaluating AI systems. We ultimately aim to establish a nomological network for AI [1, 2, 3]: a connected system linking theoretical constructs, the ways we measure them, and the behaviors they predict in the real world (see Conjointly’s overview). We take inspiration from the Cognitive Atlas [4] in cognitive science and aim to represent the current state of thought in AI.

[1] Salaudeen, O., et al. (2025). Measurement to Meaning: A Validity-Centered Framework for AI Evaluation. arXiv:2505.10573. https://arxiv.org/abs/2505.10573

[2] MacCorquodale, K., & Meehl, P. E. (1948). On a distinction between hypothetical constructs and intervening variables. Psychological Review, 55(2), 95–107. https://doi.org/10.1037/h0056029

[3] Cronbach, L. J., & Meehl, P. E. (1955). Construct validity in psychological tests. Psychological Bulletin, 52(4), 281–302. https://doi.org/10.1037/h0040957

[4] Poldrack, R. A., et al. (2011). The cognitive atlas: toward a knowledge foundation for cognitive neuroscience. Frontiers in Neuroinformatics, 5, 17. https://doi.org/10.3389/fninf.2011.00017

Constructs

Defined theoretical concepts that characterize AI systems, including capabilities (e.g., reasoning and software engineering ability) and risks (e.g., safety and bias). Each construct anchors how we interpret behavior and design measurements.

Measurement Instruments

Operational definitions and measurements that translate constructs into observable quantities (e.g., benchmarks and user surveys). Each measurement is evaluated for its validity—whether it captures the intended construct and supports sound inferences about model behavior.

Behaviors and Tasks

Observed behavioral patterns that reveal how constructs manifest, interact, and generalize across models, tasks, and environments (e.g., goal misalignment during tool use and the emergence of social bias in generated outputs).
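
To make the three entry types above concrete, here is a minimal sketch of how a nomological network could be represented as a small typed graph. This is an illustration only, not the project's actual data model: the class names, the relation labels ("measures", "predicts"), and the example entries are all assumptions introduced for this sketch.

```python
from dataclasses import dataclass, field

# Hypothetical node kinds mirroring the three sections above.
KINDS = {"construct", "instrument", "behavior"}


@dataclass(frozen=True)
class Node:
    name: str
    kind: str  # one of KINDS

    def __post_init__(self):
        if self.kind not in KINDS:
            raise ValueError(f"unknown kind: {self.kind}")


@dataclass
class NomologicalNetwork:
    # Each edge is a (source, relation, target) triple.
    edges: set = field(default_factory=set)

    def link(self, source: Node, relation: str, target: Node) -> None:
        self.edges.add((source, relation, target))

    def instruments_for(self, construct: Node) -> list:
        # Instruments with a "measures" edge pointing at the construct.
        return sorted(s.name for s, r, t in self.edges
                      if r == "measures" and t == construct)

    def behaviors_predicted_by(self, construct: Node) -> list:
        # Behaviors the construct has a "predicts" edge to.
        return sorted(t.name for s, r, t in self.edges
                      if r == "predicts" and s == construct)


# Illustrative entries only; none of these come from the project itself.
reasoning = Node("reasoning", "construct")
bias = Node("bias", "construct")
benchmark = Node("example reasoning benchmark", "instrument")
solving = Node("multi-step problem solving", "behavior")

net = NomologicalNetwork()
net.link(benchmark, "measures", reasoning)
net.link(reasoning, "predicts", solving)

print(net.instruments_for(reasoning))
print(net.behaviors_predicted_by(reasoning))
```

Keeping edges as plain triples makes it easy to add further relation types later (for example, links between two constructs) without changing the node classes.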

Contact & Updates

For questions or collaboration, email aiconstructlexis@gmail.com.

Subscribe to our Substack for project updates and insights: @aiconstructlexis