AI Construct Lexis

Mapping AI constructs, valid measurement instruments, and related behaviors and tasks

AI Construct Lexis is a collaborative effort to map and define AI constructs, valid measurement instruments, and related behaviors and tasks—building a shared framework for understanding and evaluating AI systems. We ultimately aim to establish a nomological network for AI [1, 2, 3]: a connected system linking theoretical constructs, the ways we measure them, and the behaviors they predict in the real world (see Conjointly’s overview). We take inspiration from the Cognitive Atlas [4] in cognitive science and aim to represent the current state of thought in AI.

[1] Salaudeen, O., et al. (2025). Measurement to Meaning: A Validity-Centered Framework for AI Evaluation. arXiv arXiv:2505.10573. https://arxiv.org/abs/2505.10573

[2] MacCorquodale, K., & Meehl, P. E. (1948). On a distinction between hypothetical constructs and intervening variables. Psychological Review, 55(2), 95–107. https://doi.org/10.1037/h0056029

[3] Cronbach, L. J., & Meehl, P. E. (1955). Construct validity in psychological tests. Psychological Bulletin, 52(4), 281–302. https://doi.org/10.1037/h0040957

[4] Poldrack, R. A., et al. (2011). The cognitive atlas: toward a knowledge foundation for cognitive neuroscience. Frontiers in Neuroinformatics, 5, 17. https://doi.org/10.3389/fninf.2011.00017

Constructs

Defined theoretical concepts that characterize AI systems, including capabilities (e.g., reasoning and software engineering ability) and risks (e.g., safety and bias). Each construct anchors how we interpret behavior and design measurements.

Measurement Instruments

Operational definitions and measurements that translate constructs into observable quantities (e.g., benchmarks and user surveys). Each measurement is evaluated for its validity—whether it captures the intended construct and supports sound inferences about model behavior.

Behaviors and Tasks

Observed behavioral patterns that reveal how constructs manifest, interact, and generalize across models, tasks, and environments (e.g., goal misalignment during tool use and the emergence of social bias in generated outputs).

AI Construct Lexis

Constructs

Measurement Instruments

Behaviors and Tasks

Contact & Updates