Skip to content

An awesome & curated list for Artificial General Intelligence, an emerging inter-discipline field that combines artificial intelligence and computational cognitive sciences.

License

Notifications You must be signed in to change notification settings

SHI-Yu-Zhe/awesome-agi-cocosci

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Roadmap of studying Abduction

Awesome Artificial General Intelligence and Computational Cognitive Sciences Awesome

An awesome & curated list for Artificial General Intelligence, an emerging inter-discipline field that combines artificial intelligence and computational cognitive sciences as majority, alone with probability and statistics, formal logic, cognitive and developmental psychology, computational philosophy, cognitive neuroscience, and computational sociology. We are promoting high-level machine intelligence by getting inspirations from the way that human learns and thinks, while obtaining a deeper understanding of human cognition simultaneously. We believe that this kind of reciprocative research is a potential way towards our big picture: building human-level intelligent systems with capabilities such as abstracting, explaining, learning, planning, and making decisions. And such intelligence may generally help people improve scientific research, engineering, and the arts, which are the hallmarks of human intelligence.

Awesome AGI & CoCoSci is an all-in-one collection, consisting of recources from basic courses and tutorials, to papers and books around diverse topics in mutiple perspectives. Both junior and senior researchers, whether learning, working on, or working around AGI and CoCoSci, meet their interest here.

Contributing

Contributions are greatly welcomed! Please refer to Contribution Guidelines before taking any action.

Contents

Papers

Abduction

Explanation

*Back to Top

Scientific Discovery

*Back to Top

Rationalization

  • Imagination and the generation of new ideas - Cognitive Development, 2015. [All Versions]. A piece of evidence for rationalization in childhood.

  • Coalescing the Vapors of Human Experience into a Viable and Meaningful Comprehension - CogSci'16, 2016. [All Versions]. Constrainted thinking as rationalization.

  • How We Know What Not To Think - Trends in Cognitive Sciences, 2019. [All Versions]. A comprehensive review on rationalization.

  • Rationalization is rational - Behavioral and Brain Sciences, 2020. [All Versions]. [Preprint]. Rationalization occurs when a person has performed an action and then concocts the beliefs and desires that would have made it rational. Then, people often adjust their own beliefs and desires to match the concocted ones. While many studies demonstrate rationalization, and a few theories describe its underlying cognitive mechanisms, we have little understanding of its function. Why is the mind designed to construct post hoc rationalizations of its behavior, and then to adopt them? This may accomplish an important task: transferring information between the different kinds of processes and representations that influence our behavior. Human decision making does not rely on a single process; it is influenced by reason, habit, instinct, norms, and so on. Several of these influences are not organized according to rational choice (i.e., computing and maximizing expected value). Rationalization extracts implicit information – true beliefs and useful desires – from the influence of these non-rational systems on behavior.

  • Rationalizing constraints on the capacity for cognitive control - Trends in Cognitive Sciences, 2021. [All Versions]. Humans are remarkably limited in: (i) how many control-dependent tasks they can execute simultaneously, and (ii) how intensely they can focus on a single task. These limitations are universal assumptions of most theories of cognition. Yet, a rationale for why humans are subject to these constraints remains elusive. This feature review draws on recent insights from psychology, neuroscience, and machine learning, to suggest that constraints on cognitive control may result from a rational adaptation to fundamental, computational dilemmas in neural architectures. The reviewed literature implies that limitations in multitasking may result from a trade-off between learning efficacy and processing efficiency and that limitations in the intensity of commitment to a single task may reflect a trade-off between cognitive stability and flexibility.

  • Why Imaginary Worlds? The psychological foundations and cultural evolution of fictions with imaginary worlds - Behavioral and Brain Sciences, 2021. [All Versions]. A review of rationalization as imaginary worlds in fictions. The perspective proposes that imaginary worlds co-opt our preferences for exploration, which have evolved in humans and nonhuman animals alike, to propel individuals toward new environments and new sources of reward.

*Back to Top

Applications in AI

  • Functional genomic hypothesis generation and experimentation by a robot scientist - Nature, 2004. [All Versions]. This paper describes a physically implemented robotic system that applies techniques from artificial intelligence to carry out cycles of scientific experimentation. The system automatically originates hypotheses to explain observations, devises experiments to test these hypotheses, physically runs the experiments using a laboratory robot, interprets the results to falsify hypotheses inconsistent with the data, and then repeats the cycle. The system is applied to the determination of gene function using deletion mutants of yeast (Saccharomyces cerevisiae) and auxotrophic growth experiments. The authors built and tested a detailed logical model (involving genes, proteins and metabolites) of the aromatic amino acid synthesis pathway.

  • Interpretation as abduction - Artificial Intelligence, 1993. [All Versions]. Abduction is inference to the best explanation. The authors have developed an approach to abductive inference, called “weighted abduction”, that has resulted in a significant simplification of how the problem of interpreting texts is conceptualized. The interpretation of a text is the minimal explanation of why the text would be true. More precisely, to interpret a text, one must prove the logical form of the text from what is already mutually known, allowing for coercions, merging redundancies where possible, and making assumptions where necessary. It is shown how such “local pragmatics” problems as reference resolution, the interpretation of compound nominals, the resolution of syntactic ambiguity and metonymy, and schema recognition can be solved in this manner. Moreover, this approach of “interpretation as abduction” can be combined with the older view of “parsing as deduction” to produce an elegant and thorough integration of syntax, semantics, and pragmatics, one that spans the range of linguistic phenomena from phonology to discourse structure.

  • Probabilistic Horn abduction and Bayesian networks - Artificial Intelligence, 1993. [All Versions]. This paper presents a simple framework for Horn-clause abduction, with probabilities associated with hypotheses. The framework incorporates assumptions about the rule base and independence assumptions amongst hypotheses. It is shown how any probabilistic knowledge representable in a discrete Bayesian belief network can be represented in this framework. The main contribution is in finding a relationship between logical and probabilistic notions of evidential reasoning. This provides a useful representation language in its own right, providing a compromise between heuristic and epistemic adequacy.

  • Abductive Inference in Bayesian Networks: A Review - Advances in Bayesian Networks, 2004. [All Versions]. The goal of this paper is to serve as a survey for the problem of abductive inference (or belief revision) in Bayesian networks. Thus, the problem is introduced in its two variants: total abduction (or MPE) and partial abduction (or MAP) . Also, the problem is formulated in its general case, that is, looking for the K best explanations. Then, a (non exhaustive) review of exact and approximate algorithms for dealing with both abductive inference problems is carried out. Finally, the authors collect the main complexity results appeared in the literature for both problems (MPE and MAP).

  • Abductive Logic Programming - Journal of Logic Computation, 1992. [All Versions]. This paper is a survey and critical overview of recent work on the extension of logic programming to perform abductive reasoning (abductive logic programming). The authors outline the general framework of abduction and its applications to knowledge assimilation and default reasoning; and they introduce an argumentation-theoretic approach to the use of abduction as an interpretation for negation as failure.

  • ACLP: Abductive Constraint Logic Programming - The Journal of Logic Programming, 1999. [All Versions]. This paper presents the framework of Abductive Constraint Logic Programming (ACLP), which integrates Abductive Logic Programming (ALP) and Constraint Logic Programming (CLP). In ACLP, the task of abduction is supported and enhanced by its non-trivial integration with constraint solving. This integration of constraint solving into abductive reasoning facilitates a general form of constructive abduction and enables the application of abduction to computationally demanding problems. The paper studies the formal declarative and operational semantics of the ACLP framework together with its application to various problems.

  • Abduction in Logic Programming - Computational Logic, 2002. [All Versions]. [Preprint]. Abduction in Logic Programming started in the late 80s, early 90s, in an attempt to extend logic programming into a framework suitable for a variety of problems in Artificial Intelligence and other areas of Computer Science. This paper aims to chart out the main developments of the field over the last ten years and to take a critical view of these developments from several perspectives: logical, epistemological, computational and suitability to application. The paper attempts to expose some of the challenges and prospects for the further development of the field.

  • Bayesian Abductive Logic Programs: A Probabilistic Logic for Abductive Reasoning - IJCAI'11, 2011. [All Versions]. [Preprint]. This work introduces Bayesian Abductive Logic Programs (BALP), a probabilistic logic that adapts Bayesian Logic Programs (BLPs) for abductive reasoning. Like BLPs, BALPs also combine first-order logic and Bayes nets. However, unlike BLPs, which use deduction to construct Bayes nets, BALPs employ logical abduction. As a result, BALPs are more suited for problems like plan/activity recognition that require abductive reasoning.

  • Abductive Plan Recognition by Extending Bayesian Logic Programs - ECML'11, 2011. [All Versions].

  • An Approach to Abductive Reasoning in Equational Logic - IJCAI'13, 2013. [All Versions].

  • Abduction-Based Explanations for Machine Learning Models - AAAI'19, 2019. [All Versions].

  • Probabilistic Sufficient Explanations - IJCAI'21, 2021. [All Versions].

  • Machine Translation Using Abductive Inference - COLING, 1990. [All Versions]. An application of abduction in language translating.

  • Automated Biodesign Engineering by Abductive Meta-Interpretive Learning - AAAI Spring Symposium Series 2021 on Artificial Intelligence for Synthetic Biology, 2021. [All Versions]. This work proposes an automated biodesign engineering framework empowered by Abductive Meta-Interpretive Learning (MetaAbd), a novel machine learning approach that combines symbolic and sub-symbolic machine learning, to further enhance the design-build-test-learn cycle by enabling the learning machine to 1) exploit domain knowledge and learn human-interpretable models that are expressed by formal languages such as first-order logic; 2) simultaneously optimise the structure and parameters of the models to make accurate numerical predictions; 3) reduce the cost of experiments and effort on data annotation by actively generating hypotheses and examples.

  • Human Comprehensible Active Learning of Genome-Scale Metabolic Networks - AAAI Spring Symposium Series 2023 on Computational Scientific Discovery, 2023. [All Versions]. [Extended Abstract]. [Slides]. This work introduces a novel machine learning framework ILP-iML1515 based on Inductive Logic Programming (ILP) that performs abductive logical reasoning and actively learns from training examples. The ILP-iML1515 framework 1) allows high-throughput simulations and 2) actively selects experiments that reduce the experimental cost of learning gene functions in comparison to randomly selected experiments.

*Back to Top

Bayesian Modeling

Bayesian Induction

  • Bayesian Epistemology - Plato Stanford. A computational philosophy account on the nature of uncertainty modeling in Bayesian Epistemology.

  • Probabilistic machine learning and artificial intelligence - Nature, 2015. [All Versions]. Probabilistic modelling provides a framework for understanding what learning is, and has therefore emerged as one of the principal theoretical and practical approaches for designing machines that learn from data acquired through experience. The probabilistic framework, which describes how to represent and manipulate uncertainty about models and predictions, has a central role in scientific data analysis, machine learning, robotics, cognitive science and artificial intelligence. This Review provides an introduction to this framework, and discusses some of the state-of-the-art advances in the field, namely, probabilistic programming, Bayesian optimization, data compression and automatic model discovery.

  • Generalization, similarity, and Bayesian inference - Behavioral and Brain Sciences, 2001. [All Versions]. [Preprint]. Shepard has argued that a universal law should govern generalization across different domains of perception and cognition, as well as across organisms from different species or even different planets. Starting with some basic assumptions about natural kinds, he derived an exponential decay function as the form of the universal generalization gradient, which accords strikingly well with a wide range of empirical data. However, his original formulation applied only to the ideal case of generalization from a single encountered stimulus to a single novel stimulus, and for stimuli that can be represented as points in a continuous metric psychological space. The authors recast Shepard's theory in a more general Bayesian framework and show how this naturally extends his approach to the more realistic situation of generalizing from multiple consequential stimuli with arbitrary representational structure. This framework also subsumes a version of Tversky's set-theoretic model of similarity, which is conventionally thought of as the primary alternative to Shepard's continuous metric space model of similarity and generalization.

  • Bayesian modeling of human concept learning - NeurIPS'98, 1998. [All Versions]. [Preprint]. This work considers the problem of learning concepts from small numbers of positive examples, a feat which humans perform routinely but which computers are rarely capable of. Bridging machine learning and cognitive science perspectives, this work presents both theoretical analysis and an empirical study with human subjects for the simple task oflearning concepts corresponding to axis-aligned rectangles in a multidimensional feature space. Existing learning models, when applied to this task, cannot explain how subjects generalize from only a few examples of the concept. The author proposes a principled Bayesian model based on the assumption that the examples are a random sample from the concept to be learned. The model gives precise fits to human behavior on this simple task and provides qualitati ve insights into more complex, realistic cases of concept learning.

  • Rules and Similarity in Concept Learning - NeurIPS'99, 1999. [All Versions]. [Preprint]. This paper argues that two apparently distinct modes of generalizing concepts - abstracting rules and computing similarity to exemplars - should both be seen as special cases of a more general Bayesian learning framework. Bayes explains the specific workings of these two modes - which rules are abstracted, how similarity is measured - as well as why generalization should appear rule- or similarity-based in different situations. This analysis also suggests why the rules/similarity distinction, even if not computationally fundamental, may still be useful at the algorithmic level as part of a principled approximation to fully Bayesian learning.

  • Theory-based Bayesian models of inductive learning and reasoning - Trends in Cognitive Sciences, 2006. [All Versions]. [Preprint]. Inductive inference allows humans to make powerful generalizations from sparse data when learning about word meanings, unobserved properties, causal relationships, and many other aspects of the world. Traditional accounts of induction emphasize either the power of statistical learning, or the importance of strong constraints from structured domain knowledge, intuitive theories or schemas. This paper argues that both components are necessary to explain the nature, use and acquisition of human knowledge, and the authors introduce a theory-based Bayesian framework for modeling inductive learning and reasoning as statistical inferences over structured knowledge representations.

  • Word learning as Bayesian inference - Psychological Review, 2007. [All Versions]. [Preprint]. The authors present a Bayesian framework for understanding how adults and children learn the meanings of words. The theory explains how learners can generalize meaningfully from just one or a few positive examples of a novel word's referents, by making rational inductive inferences that integrate prior knowledge about plausible word meanings with the statistical structure of the observed examples. The theory addresses shortcomings of the two best known approaches to modeling word learning, based on deductive hypothesis elimination and associative learning. Three experiments with adults and children test the Bayesian account's predictions in the context of learning words for object categories at multiple levels of a taxonomic hierarchy. Results provide strong support for the Bayesian account over competing accounts, in terms of both quantitative model fits and the ability to explain important qualitative phenomena. Several extensions of the basic theory are discussed, illustrating the broader potential for Bayesian models of word learning.

  • How to Grow a Mind: Statistics, Structure, and Abstraction - Science, 2011. [All Versions]. [Preprint]. This review describes recent approaches to reverse-engineering human learning and cognitive development and, in parallel, engineering more humanlike machine learning systems. Computational models that perform probabilistic inference over hierarchies of flexibly structured representations can address some of the deepest questions about the nature and origins of human thought: How does abstract knowledge guide learning and reasoning from sparse data? What forms does our knowledge take, across different domains and tasks? And how is that abstract knowledge itself acquired?

  • Human-level concept learning through probabilistic program induction - Science, 2015. [All Versions]. [Preprint]. [Supplementary Material]. People learning new concepts can often generalize successfully from just a single example, yet machine learning algorithms typically require tens or hundreds of examples to perform with similar accuracy. People can also use learned concepts in richer ways than conventional algorithms—for action, imagination, and explanation. This work presents a computational model that captures these human learning abilities for a large class of simple visual concepts: handwritten characters from the world’s alphabets. The model represents concepts as simple programs that best explain observed examples under a Bayesian criterion. On a challenging one-shot classification task, the model achieves human-level performance while outperforming recent deep learning approaches.

  • Building Machines That Learn and Think Like People - Behavioral and Brain Sciences, 2017. [All Versions]. [Preprint]. Recent progress in artificial intelligence has renewed interest in building systems that learn and think like people. Many advances have come from using deep neural networks trained end-to-end in tasks such as object recognition, video games, and board games, achieving performance that equals or even beats that of humans in some respects. Despite their biological inspiration and performance achievements, these systems differ from human intelligence in crucial ways. The authors review progress in cognitive science suggesting that truly human-like learning and thinking machines will have to reach beyond current engineering trends in both what they learn and how they learn it. Specifically, the authors argue that these machines should (1) build causal models of the world that support explanation and understanding, rather than merely solving pattern recognition problems; (2) ground learning in intuitive theories of physics and psychology to support and enrich the knowledge that is learned; and (3) harness compositionality and learning-to-learn to rapidly acquire and generalize knowledge to new tasks and situations. The authors suggest concrete challenges and promising routes toward these goals that can combine the strengths of recent neural network advances with more structured cognitive models.

  • Building machines that learn and think with people - Nature Human Behavior, 2024. [All Versions]. [Preprint]. This perspective shows how the science of collaborative cognition can be put to work to engineer systems that really can be called ‘thought partners’, systems built to meet humans' expectations and complement humans' limitations. The authors lay out several modes of collaborative thought in which humans and artificial intelligence thought partners can engage, and they propose desiderata for human-compatible thought partnerships. Drawing on motifs from computational cognitive science, this work motivates an alternative scaling path for the design of thought partners and ecosystems around their use through a Bayesian lens, whereby the constructed partners actively build and reason over models of the human and world.

  • The rational basis of representativeness - CogSci'01, 2001. [All Versions].

  • Testing a Bayesian Measure of Representativeness Using a Large Image Database - NeurIPS'11, 2011. [All Versions].

  • Constructing a hypothesis space from the Web for large-scale Bayesian word learning - CogSci'12, 2012. [All Versions].

  • Modeling rules and similarity in colexification - CogSci'21, 2021. [All Versions]. Rule- and similarity-based generalization in colexification.

  • Human-level few-shot concept induction through minimax entropy learning - Science Advances, 2024. [All Versions]. This paper introduces a computational model designed to emulate human inductive reasoning on abstract reasoning tasks, such as those in IQ tests, using a minimax entropy approach. This method combines identifying the most effective constraints on data via minimum entropy with determining the best combination of them via maximum entropy.

*Back to Top

Generative Model

  • Generative Modeling Explained - Statistical Machine Learning Tutorials, 2022. This tutorial on generative modeling is in part of Statistical Machine Learning Tutorial by Ying Nian Wu at UCLA Statistics. The tutorial goes over the key equations and algorithms for learning recent generative models, including energy-based models, diffusion/score-based models, autoregressive/flow-based models, VAEs, and GANs, and explains the connections between these models.

  • Bayesian Data Analysis - Chapman and Hall/CRC, 1995. [All Versions]. Don Rubin's introductory book on Bayesian models.

  • Filters, random fields and maximum entropy (FRAME): Towards a unified theory for texture modeling - International Journal of Computer Vision, 1998. [All Versions]. [Preprint]. This article presents a statistical theory for texture modeling. This theory combines filtering theory and Markov random field modeling through the maximum entropy principle, and interprets and clarifies many previous concepts and methods for texture analysis and synthesis from a unified point of view. The theory characterizes the ensemble of images I with the same texture appearance by a probability distribution f(I) on a random field, and the objective of texture modeling is to make inference about f(I), given a set of observed texture examples.

  • Object Perception as Bayesian Inference - Annual Review of Psychology, 2004. [All Versions]. [Preprint]. We perceive the shapes and material properties of objects quickly and reliably despite the complexity and objective ambiguities of natural images. Typical images are highly complex because they consist of many objects embedded in background clutter. Moreover, the image features of an object are extremely variable and ambiguous owing to the effects of projection, occlusion, background clutter, and illumination. The very success of everyday vision implies neural mechanisms, yet to be understood, that discount irrelevant information and organize ambiguous or noisy local image features into objects and surfaces. Recent work in Bayesian theories of visual perception has shown how complexity may be managed and ambiguity resolved through the task-dependent, probabilistic integration of prior object knowledge with image features.

  • A tale of three probabilistic families: Discriminative, descriptive, and generative models - Quarterly of Applied Mathematics, 2018. [All Versions]. [Preprint]. The pattern theory of Grenander is a mathematical framework where patterns are represented by probability models on random variables of algebraic structures. In this paper, the authors review three families of probability models, namely, the discriminative models, the descriptive models, and the generative models. A discriminative model is in the form of a classifier. It specifies the conditional probability of the class label given the input signal. A descriptive model specifies the probability distribution of the signal, based on an energy function defined on the signal. A generative model assumes that the signal is generated by some latent variables via a transformation. The authors shall review these models within a common framework and explore their connections, and shall also review the recent developments that take advantage of the high approximation capacities of deep neural networks.

  • From information scaling of natural images to regimes of statistical models - Quarterly of Applied Mathematics, 2008. [All Versions]. [Preprint]. One fundamental property of natural image data that distinguishes vision from other sensory tasks such as speech recognition is that scale plays a profound role in image formation and interpretation. Specifically, visual objects can appear at a wide range of scales in the images due to the change of viewing distance as well as camera resolution. The same objects appearing at different scales produce different image data with different statistical properties. In particular, this work shows that the entropy rate of the image data changes over scale. Moreover, the inferential uncertainty changes over scale too. The authors call these changes information scaling. They then examine both empirically and theoretically two prominent and yet largely isolated classes of image models, namely, wavelet sparse coding models and Markov random field models. The results indicate that the two classes of models are appropriate for two different entropy regimes: sparse coding targets low entropy regimes, whereas Markov random fields are appropriate for high entropy regimes. Because information scaling connects different entropy regimes, both sparse coding and Markov random fields are necessary for representing natural image data, and information scaling triggers transitions between these two regimes.

  • A Theory of Generative ConvNet - ICML'16, 2016. [All Versions].

  • Cooperative Training of Descriptor and Generator Networks - IEEE Transactions on Pattern Analysis and Machine Intelligence, 2018. [All Versions].

  • Learning Latent Space Energy-Based Prior Model - NeurIPS'20, 2020. [All Versions]. [Project]. [Code]. A milestone paper on Latent Energy-Based Model.

  • Learning Energy-Based Models by Diffusion Recovery Likelihood - ICLR'21, 2021. [All Versions]. [Code].

  • Score-Based Generative Modeling through Stochastic Differential Equations - ICLR'21, 2021. [All Versions].

  • Latent Space Factorisation and Manipulation via Matrix Subspace Projection - ICML'20, 2020. [All Versions].

  • Minimax entropy principle and its application to texture modeling - Neural Computing, 1997. [All Versions]. [Preprint]. This article proposes a general theory and methodology, called the minimax entropy principle, for building statistical models for images (or signals) in a variety of applications. This principle consists of two parts. The first is the maximum entropy principle for feature binding (or fusion): for a given set of observed feature statistics, a distribution can be built to bind these feature statistics together by maximizing the entropy over all distributions that reproduce them. The second part is the minimum entropy principle for feature selection: among all plausible sets of feature statistics, we choose the set whose maximum entropy distribution has the minimum entropy. Computational and inferential issues in both parts are addressed; in particular, a feature pursuit procedure is proposed for approximately selecting the optimal set of features. The minimax entropy principle is then corrected by considering the sample variation in the observed feature statistics, and an information criterion for feature pursuit is derived. The minimax entropy principle is applied to texture modeling, where a novel Markov random field (MRF) model, called FRAME (filter, random field, and minimax entropy), is derived, and encouraging results are obtained in experiments on a variety of texture images.

  • Parameter Expansion for Data Augmentation - Journal of the American Statistical Association, 1999. [All Versions]. [Preprint]. Viewing the observed data of a statistical model as incomplete and augmenting its missing parts are useful for clarifying concepts and central to the invention of two well-known statistical algorithms: expectation-maximization (EM) and data augmentation. Recently, the authors demonstrated that expanding the parameter space along with augmenting the missing data is useful for accelerating iterative computation in an EM algorithm. The main purpose of this article is to rigorously define a parameter expanded data augmentation (PX-DA) algorithm and to study its theoretical properties. The PX-DA is a special way of using auxiliary variables to accelerate Gibbs sampling algorithms and is closely related to reparameterization techniques.

  • Image segmentation by data-driven markov chain monte carlo - IEEE Transactions on Pattern Analysis and Machine Intelligence, 2002. [All Versions]. Classic method for image segmentation via generative modeling.

  • Efficient Learning of Sparse Representations with an Energy-Based Model - NeurIPS'06, 2006. [All Versions].

  • A Tutorial on Energy-Based Learning - Predicting Structured Data, MIT Press, 2006. [All Versiosn]. Yann LeCun's tutorial on energy-based learning.

  • Unsupervised Representaton Learning with Deep Convolutional Generative Adversarial Networks - ICLR'16, 2016. [All Versions].

  • Analysis of Langevin Monte Carlo via Convex Optimization - Journal of Machine Learning Research, 2019. [All Versions].

  • A generative vision model that trains with high data efficiency and breaks text-based CAPTCHAs - Science, 2017. [All Versions].

  • Where do hypotheses come from? - Cognitive Psychology, 2017. [All Versions]. A Bayesian account for modeling basic rules as the hypothesis space.

*Back to Top

Nonparametric Model

*Back to Top

Bayesian Optimization

*Back to Top

Concepts

Theory of Concepts

*Back to Top

Human Concept Representation

*Back to Top

AI Concept Representation

*Back to Top

Complexity & Information Theory

Theory

  • A Mathematical Theory of Communication - The Bell System Technical Journal, 1948. [All Versions]. Shannon's original paper on Information Theory.

  • An introduction to Kolmogorov complexity and its applications - Springer, 2008. [All Versions]. The introductory book for Algorithmic Information Theory, especially the Kolmogorov complexity theory.

  • Complexity and the representation of patterned sequences of symbols - Psychological Review, 1972. [All Versions]. Herbert Simon's review on subjective complexity.

  • Visual Pattern Discrimination - IRE Transactions on Information Theory, 1962. [All Versions].

  • Algorithmic Information Theory - IBM Journal of Research and Development, 1977. [All Versions]. Chaitin's original paper on Algorithmic Information Theory.

  • From Algorithmic to Subjective Randomness - NeurIPS'03, 2003. [All Versions].

  • On the Complexity of Bayesian Generalization - ICML'23, 2023. [All Versions]. [Project]. [Models]. This work examine concept generalization at a large scale in the natural visual spectrum. Established computational modes (i.e., rule-based or similarity-based) are primarily studied isolated, focusing on confined and abstract problem spaces. This work studies these two modes when the problem space scales up and when the complexity of concepts becomes diverse. At the representational level, the authors investigate how the complexity varies when a visual concept is mapped to the representation space. Prior literature has shown that two types of complexities build an inverted-U relation. Leveraging Representativeness of Attribute (RoA), the authors computationally confirm: Models use attributes with high RoA to describe visual concepts, and the description length falls in an inverted-U relation with the increment in visual complexity. At the computational level, the authors examine how the complexity of representation affects the shift between the rule- and similarity-based generalization. The authors hypothesize that category-conditioned visual modeling estimates the co-occurrence frequency between visual and categorical attributes, thus potentially serving as the prior for the natural visual world. Experimental results show that representations with relatively high subjective complexity outperform those with relatively low subjective complexity in rule-based generalization, while the trend is the opposite in similarity-based generalization.

*Back to Top

Dimensionality Reduction

*Back to Top

Visual Complexity

*Back to Top

Communications

Non-Verbal Communication

*Back to Top

Pragmatics

*Back to Top

Language Compositionality

*Back to Top

Coordination

  • In situ bidirectional human-robot value alignment - Science Robotics, 2022. [All Versions]. [Preprint]. This paper proposes an explainable artificial intelligence (XAI) system in which a group of robots predicts users’ values by taking in situ feedback into consideration while communicating their decision processes to users through explanations. To learn from human feedback, the XAI system integrates a cooperative communication model for inferring human values associated with multiple desirable goals. To be interpretable to humans, it simulates human mental dynamics and predicts optimal explanations using graphical models.

  • From Explicit Communication to Tacit Cooperation: A Novel Paradigm for Cooperative MARL - AAMAS'24, 2024. [All Versions]. Drawing inspiration from human team cooperative learning, this paper proposes a novel paradigm that facilitates a gradual shift from explicit communication to tacit cooperation.

*Back to Top

Domain Specific Language

Design Theory

  • Domain-Specific Language - Wikipedia. Wikipedia encyclopedia entry on Domain Specific Languages.

  • Domain Engineering - Wikipedia. Wikipedia encyclopedia entry on Domain Engineering.

  • Domain-Specific Languages - Pearson Education, 2010. [All Versions]. [Domain-Specific Languages Guide]. When carefully selected and used, Domain-Specific Languages (DSLs) may simplify complex code, promote effective communication with customers, improve productivity, and unclog development bottlenecks. In Domain-Specific Languages, noted software development expert Martin Fowler first provides the information software professionals need to decide if and when to utilize DSLs. Then, where DSLs prove suitable, Fowler presents effective techniques for building them, and guides software engineers in choosing the right approaches for their applications.

  • Comparison of multi-paradigm programming languages - Wikipedia. Programming languages may support multiple programming paradigms. This Wikipedia encyclopedia entry lists a concise reference for the programming paradigms.

  • Epigrams on programming - ACM SIGPLAN Notices, 1982. [All Versions].

  • The complete guide to (external) Domain Specific Languages. An introduction to Domain Specific Languages (DSL) based on 19 DSL cases.

  • When and How to Develop Domain-Specific Languages - ACM Computing Surveys, 2005. [All Versions]. [Preprint]. Domain-specific languages (DSLs) are languages tailored to a specific application domain. They offer substantial gains in expressiveness and ease of use compared with general-purpose programming languages in their domain of application. DSL development is hard, requiring both domain knowledge and language development expertise. Few people have both. Not surprisingly, the decision to develop a DSL is often postponed indefinitely, if considered at all, and most DSLs never get beyond the application library stage. Although many articles have been written on the development of particular DSLs, there is very limited literature on DSL development methodologies and many questions remain regarding when and how to develop a DSL. To aid the DSL developer, this survey paper identifies patterns in the decision, analysis, design, and implementation phases of DSL development. These patterns improve and extend earlier work on DSL design patterns.

  • Design Guidelines for Domain Specific Languages - OOPSLA Workshop on Domain-Specific Modeling (DSM' 09), 2009. [All Versions]. Designing a new domain specific language is as any other complex task sometimes error-prone and usually time consuming, especially if the language shall be of high-quality and comfortably usable. Existing tool support focuses on the simplification of technical aspects but lacks support for an enforcement of principles for a good language design. In this paper we investigate guidelines that are useful for designing domain specific languages, largely based on our experience in developing languages as well as relying on existing guidelines on general purpose (GPLs) and modeling languages. This work defined Guidelines to support a DSL developer to achieve better quality of the language design and a better acceptance among its users.

  • Domain-specific languages: an annotated bibliography - ACM SIGPLAN Notices, 2000. [All Versions]. A survey on the topic of domain-specific languages as used for the construction and maintenance of software systems. The survey lists a selection of 75 key publications in the area, and provides a summary for each of the papers. Moreover, the survey discusses terminology, risks and benefits, example domain-specific languages, design methodologies, and implementation techniques.

  • Usability Evaluation of Domain-Specific Languages - ICQICT'12, 2012. [All Versions]. [Preprint]. The purpose of this proposal is to contribute to the systematic activity of Software Language Engineering by focusing on the issue of the Usability evaluation of DSLs. Usability evaluation is often skipped, relaxed, or at least omitted from papers reporting development of DSLs. The authors argue that a systematic approach based on User Interface experimental validation techniques should be used to assess the impact of new DSLs. For that purpose, the authors propose to merge common Usability evaluation processes with the DSL development process.

*Back to Top

Design Practises

  • No Grammar to Rule Them All: A Survey of JSON-style DSLs for Visualization - IEEE Transactions on Visualization and Computer Graphics, 2022. [All Versions]. There has been substantial growth in the use of JSON-based grammars, as well as other standard data serialization languages, to create visualizations. Each of these grammars serves a purpose: some focus on particular computational tasks (such as animation), some are concerned with certain chart types (such as maps), and some target specific data domains (such as ML). Despite the prominence of this interface form, there has been little detailed analysis of the characteristics of these languages. This study surveys and analyzes the design and implementation of 57 JSON-style DSLs for visualization. The authors analyze these languages supported by a collected corpus of examples for each DSL (consisting of 4395 instances) across a variety of axes organized into concerns related to domain, conceptual model, language relationships, affordances, and general practicalities. The authors identify tensions throughout these areas, such as between formal and colloquial specifications, among types of users, and within the composition of languages. Through this work, the authors seek to support language implementers by elucidating the choices, opportunities, and tradeoffs in visualization DSL design.

  • Quantifying usability of domain-specific languages: An empirical study on software maintenance - Journal of Systems and Software, 2015. [All Versions]. A DSL aims to support software development by offering abstractions to a particular domain. It is expected that DSLs improve the maintainability of artifacts otherwise produced with general-purpose languages. However, the maintainability of the DSL artifacts and, hence, their adoption in mainstream development, is largely dependent on the usability of the language itself. Unfortunately, it is often hard to identify their usability strengths and weaknesses early, as there is no guidance on how to objectively reveal them. Usability is a multi-faceted quality characteristic, which is challenging to quantify beforehand by DSL stakeholders. There is even less support on how to quantitatively evaluate the usability of DSLs used in maintenance tasks. In this context, this paper reports a study to compare the usability of textual DSLs under the perspective of software maintenance. A usability measurement framework was developed based on the cognitive dimensions of notations. The framework was evaluated both qualitatively and quantitatively using two DSLs in the context of two evolving object-oriented systems. The results suggested that the proposed metrics were useful: (1) to early identify DSL usability limitations, (2) to reveal specific DSL features favoring maintenance tasks, and (3) to successfully analyze eight critical DSL usability dimensions.

  • How Domain Experts Use an Embedded DSL - OOPSLA'23, 2023. [All Versions]. Programming tools are increasingly integral to research and analysis in myriad domains, including specialized areas with no formal relation to computer science. Embedded domain-specific languages (eDSLs) have the potential to serve these programmers while placing relatively light implementation burdens on language designers. However, barriers to eDSL use reduce their practical value and adoption. This work aims to deepen the understanding of how programmers use eDSLs and identify user needs to inform future eDSL designs. The authors performed a contextual inquiry (9 participants) with domain experts using Mimi, an eDSL for climate change economics modeling. A thematic analysis identified five key themes, including: the interaction between the eDSL and the host language has significant and sometimes unexpected impacts on eDSL user experience, and users preferentially engage with domain-specific communities and code templates rather than host language resources.

  • Constraint Representation Towards Precise Data-Driven Storytelling - VIS-Gen4DS'24, 2024. [All Versions]. A position paper on DSL for data-driven storytelling. Data-driven storytelling serves as a crucial bridge for communicating ideas in a persuasive way. However, the manual creation of data stories is a multifaceted, labor-intensive, and case-specific effort, limiting their broader application. As a result, automating the creation of data stories has emerged as a significant research thrust. Despite advances in Artificial Intelligence, the systematic generation of data stories remains challenging due to their hybrid nature: they must frame a perspective based on a seed idea in a top-down manner, similar to traditional storytelling, while coherently grounding insights of given evidence in a bottom-up fashion, akin to data analysis. These dual requirements necessitate precise constraints on the permissible space of a data story. This viewpoint proposes integrating constraints into the data story generation process. Defined upon the hierarchies of interpretation and articulation, constraints shape both narrations and illustrations to align with seed ideas and contextualized evidence. The authors identify the taxonomy and required functionalities of these constraints. Although constraints can be heterogeneous and latent, this position paper explores the potential to represent them in a computation-friendly fashion via Domain-Specific Languages. The authors believe that leveraging constraints will facilitate both artistic and scientific aspects of data story generation.

*Back to Top

Design Automation

  • AutoDSL: Automated domain-specific language design for structural representation of procedures with constraints - ACL'24, 2024. [All Versions]. [Preprint]. [Project]. The original paper on the automated design of DSLs, referred to as AutoDSL. Accurate representation of procedures in restricted scenarios, such as non-standardized scientific experiments, requires precise depiction of constraints. Unfortunately, Domain-Specific Language (DSL), as an effective tool to express constraints structurally, often requires case-by-case hand-crafting, necessitating customized, labor-intensive efforts. To overcome this challenge, this paper introduces the AutoDSL framework to automate DSL-based constraint design across various domains. Utilizing domain specified experimental protocol corpora, AutoDSL optimizes syntactic constraints and abstracts semantic constraints. Quantitative and qualitative analyses of the DSLs designed by AutoDSL across five distinct domains highlight its potential as an auxiliary module for language models, aiming to improve procedural planning and execution.

*Back to Top

Imperative DSL Applications

*Back to Top

Declarative DSL Applications

  • The BioPAX community standard for pathway data sharing - Nature Biotechnology, 2010. [All Versions]. [Preprint]. Biological Pathway Exchange (BioPAX) is a standard language to represent biological pathways at the molecular and cellular level and to facilitate the exchange of pathway data. BioPAX can represent metabolic and signaling pathways, molecular and genetic interactions and gene regulation networks.

  • Learning the language of viral evolution and escape - Science, 2021. [All Versions]. Natural language processing with two components: grammar (or syntax) and meaning (or semantics) for predicting which viral mutations may lead to viral escape.

  • A high-level programming language for generative protein design - 2022. [All Versions]. A high-level programming language based on modular building blocks that allows a designer to easily compose a set of desired properties. Along with the programming language, there is an energy-based generative model, built on atomic resolution structure prediction with a language model, that realizes all-atom structure designs that have the programmed properties.

  • OpenLaw - OpenLaw.io. It is now possible to model all or parts of legal agreements using code (smart contracts), decreasing the cost and friction of creating, securing, and generating binding legal agreements. Lawyers lack basic tools to build these dynamic, “smart” contracts in a way that is enforceable and understandable to a legal professional. OpenLaw is a technology stack to help power next generation "smart" legal agreements, with a domain-specific markup language, a integration framework, and a series of general applications.

  • Scenic: a language for scenario specification and data generation - Machine Learning, 2022. [All Versions]. This paper proposes a domain-specific language, Scenic, for describing scenarios that are distributions over scenes and the behaviors of their agents over time. Scenic combines concise, readable syntax for spatiotemporal relationships with the ability to declaratively impose hard and soft constraints over the scenario.

  • Domain Specific Language for Smart Contract Development - ICBC'20, 2020. [All Versions]. [Preprint]. This research addresses the understanding hardness raised from the conceptual discrepancy between contractual clauses and corresponding code of the Solidity programming language, by the design and study of a domain-specific smart contract language based on higher level of abstraction that can be automatically transformed to an implementation.

  • iContractML 2.0: A domain-specific language for modeling and deploying smart contracts onto multiple blockchain platforms - Information and Software Technology, 2022. [All Versions]. Smart contracts play a vital role in many fields. Despite being called smart, the development of smart contracts is a tedious task beyond defining a set of contractual rules. In addition to business knowledge, coding a smart contract requires strong technical knowledge in a multiplex of new and rapidly changing domain-specific languages and blockchain platforms. The goal of this paper is to assist developers in building smart contracts independently from the language or the target blockchain platform. In which, this paper presents the second-generation smart contract language iContractML 2.0. iContractML 2.0 is an extensible framework that empowers developers to model and generate functional smart contract code that can be deployed onto multiple blockchain platforms.

  • PClean: Bayesian Data Cleaning at Scale with Domain-Specific Probabilistic Programming - ICML'21, 2021. [All Versions]. This work presents PClean, a probabilistic programming language (PPL) for leveraging dataset-specific knowledge to automate Bayesian cleaning, automating Bayesian approaches given the diversity of real-world error patterns and the hardness of inference.

  • A Language for Counterfactual Generative Models - ICML'21, 2021. [All Versions]. [Project]. This paper presents Omega, a probabilistic programming language with support for counterfactual inference. This feature is accomplished by introducing a new operator to probabilistic programming akin to Pearl’s do.

  • Product Line Engineering Using Domain-Specific Languages - ISPLC'11, 2011. [All Versions]. [Preprint]. This paper investigates the application of domain-specific languages in product line engineering (PLE). It starts by analyzing the limits of expressivity of feature models. Feature models correspond to context-free grammars without recursion, which prevents the expression of multiple instances and references. The authors then show how domain-specific languages (DSLs) can serve as a middle ground between feature modeling and programming. They can be used in cases where feature models are too limited, while keeping the separation between problem space and solution space provided by feature models. This work then categorizes useful combinations between configuration with feature model and construction with DSLs and provide an integration of DSLs into the conceptual framework of PLE. Finally the authors show how use of a consistent, unified formalism for models, code, and configuration can yield important benefits for managing variability and trace ability.

  • A Domain-Specific Language for Product-Process-Resource Modeling - ETFA'21, 2021. [All Versions]. This paper presents the design of the PPR-DSL to effectively and efficiently represent Product-Process-Resource (PPR) aspects and evaluate constraints defined for modeling PPR views in the Formalized Process Description standard (VDI 3682).

  • The Scene Language: Representing Scenes with Programs, Words, and Embeddings - 2024. [All Versions]. [Project]. This paper introduces the Scene Language, a visual scene representation that concisely and precisely describes the structure, semantics, and identity of visual scenes. It represents a scene with three key components: a program that specifies the hierarchical and relational structure of entities in the scene, words in natural language that summarize the semantic class of each entity, and embeddings that capture the visual identity of each entity. This representation can be inferred from pre-trained language models via a training-free inference technique, given text or image inputs.

*Back to Top

Logic DSL Applications

  • Situation Calculus - Wikipedia. Wikipedia on Situation Calculus, a logic formalism designed for representing and reasoning about dynamical domains.

  • What is Answer Set Programming? - Springer, 2008. [All Versions]. [Tutorial on AAAI]. Answer set programming (ASP) is a form of declarative programming oriented towards difficult search problems. As an outgrowth of research on the use of nonmonotonic reasoning in knowledge representation, it is particularly useful in knowledge-intensive applications. ASP programs consist of rules that look like Prolog rules, but the computational mechanisms used in ASP are different: they are based on the ideas that have led to the creation of fast satisfiability solvers for propositional logic.

  • Answer Set Programming - ICLPNR'99, 1999. [All Versions]. [Preprint]. The original paper on Answer Set Programming (ASP), a form of declarative programming oriented towards difficult search problems, on the use of nonmonotonic reasoning in knowledge representation. In ASP solutions to a problem are represented by answer sets (known also as stable models), and not by answer substitutions produced in response to a query, as in conventional logic programming.

  • Action Languages, Answer Sets, and Planning - The Logic Programming Paradigms, 1999. [All Versions]. [Preprint]. This is a discussion of some of the achievements and challenges related to representing actions and the design of planners from the perspective of logic programming. The authors talk about recent work on action languages and translating them into logic programming, on representing possible histories of an action domain by answer sets, on efficient implementations of the answer set semantics and their use for generating plans, and on causal logic and its relation to planning algorithms. Recent progress in these areas may lead to the creation of planners which are based on the ideas of logic programming and combine the use of expressive action description languages with efficient computational procedures.

  • Qualitative Simulation - Artificial Intelligence, 1986. [All Versions]. [Preprint]. This paper presents a precise definition of qualitative structure and behavior descriptions as abstractions of differential equations and continuously differentiable functions. The authors present a new algorithm for qualitative simulation that generalizes the best features of existing algorithms, and allows direct comparisons among alternate approaches. Starting with a set of constraints abstracted from a differential equation, this work proves that the QSIM algorithm is guaranteed to produce a qualitative behavior corresponding to any solution to the original equation. The paper also shows that any qualitative simulation algorithm will sometimes produce spurious qualitative behaviors: ones which do not correspond to any mechanism satisfying the given constraints. These observations suggest specific types of care that must be taken in designing applications of qualitative causal reasoning systems, and in constructing and validating a knowledge base of mechanism descriptions.

  • Qualitative Reasoning: Modeling and Simulation with Incomplete Knowledge - MIT Press, 1994. [All Versions]. This book presents, within a conceptually unified theoretical framework, a body of methods that have been developed over the past fifteen years for building and simulating qualitative models of physical systems - bathtubs, tea kettles, automobiles, the physiology of the body, chemical processing plants, control systems, electrical systems - where knowledge of that system is incomplete. The primary tool for this work is the author's QSIM algorithm, which is discussed in detail. Qualitative models are better able than traditional models to express states of incomplete knowledge about continuous mechanisms. Qualitative simulation guarantees to find all possible behaviors consistent with the knowledge in the model. This expressive power and coverage is important in problem solving for diagnosis, design, monitoring, explanation, and other applications of artificial intelligence.

  • Qualitative and quantitative simulation: bridging the gap - Artificial Intelligence, 1997. [All Versions]. Shortcomings of qualitative simulation and of quantitative simulation motivate combining them to do simulations exhibiting strengths of both. The resulting class of techniques is called semiquantitative simulation. One approach to semi-quantitative simulation is to use numeric intervals to represent incomplete quantitative information. This research demonstrates semi-quantitative simulation using intervals in an implemented semi-quantitative simulator called Q3. Q3 progressively refines a qualitative simulation, providing increasingly specific quantitative predictions which can converge to a numerical simulation in the limit while retaining important correctness guarantees from qualitative and interval simulation techniques.

  • A Logic Programming Language for Computational Nucleic Acid Devices - ACS Synthetic Biology, 2018. [All Versions]. This paper presents a logic programming language that allows a broad range of computational nucleic acid systems to be designed and analyzed. The language extends standard logic programming with a novel equational theory to express nucleic acid molecular motifs. It automatically identifies matching motifs present in the full system, in order to apply a specified transformation expressed as a logical rule.

*Back to Top

DSL Program Synthesis

  • pix2code: Generating Code from a Graphical User Interface Screenshot - ACM SIGCHI Symposium on Engineering Interactive Computing Systems, 2018. [All Versions]. [Code]. [Website]. This paper shows that deep learning methods can be leveraged to train a model end-to-end to automatically reverse engineer user interfaces and generate code from a single input image with over 77% of accuracy for three different platforms (i.e. iOS, Android and web-based technologies).

  • Learning to Infer Graphics Programs from Hand-Drawn Images - NeurIPS'18, 2018. [All Versions]. The method learns a model that uses program synthesis techniques to recover a graphics program from drawing primitives. These programs have constructs like variable bindings, iterative loops, or simple kinds of conditionals. With a graphics program in hand, we can correct errors made by the deep network and extrapolate drawings.

  • babble: Learning Better Abstractions with E-Graphs and Anti-unification - POPL'23, 2023. [All Versions]. This paper proposes library learning modulo theory (LLMT), a new library learning algorithm that additionally takes as input an equational theory for a given problem domain. LLMT uses e-graphs and equality saturation to compactly represent the space of programs equivalent modulo the theory, and uses a novel e-graph anti-unification technique to find common patterns in the corpus more directly and efficiently.

  • Top-Down Synthesis for Library Learning - POPL'23, 2023. [All Versions]. This paper introduces corpus-guided top-down synthesis as a mechanism for synthesizing library functions that capture common functionality from a corpus of programs in a domain specific language (DSL). The algorithm builds abstractions directly from initial DSL primitives, using syntactic pattern matching of intermediate abstractions to intelligently prune the search space and guide the algorithm towards abstractions that maximally capture shared structures in the corpus.

  • DreamCoder: growing generalizable, interpretable knowledge with wake–sleep Bayesian program learning - Philosophical Transactions of the Royal Society A, 2023. [All Versions]. [Preprint]. This paper presents DreamCoder, a system that learns to solve problems by writing programs. It builds expertise by creating domain-specific programming languages for expressing domain concepts, together with neural networks to guide the search for programs within these languages. A ‘wake–sleep’ learning algorithm alternately extends the language with new symbolic abstractions and trains the neural network on imagined and replayed problems. DreamCoder solves both classic inductive programming tasks and creative tasks such as drawing pictures and building scenes.

  • Grammar Prompting for Domain-Specific Language Generation with Large Language Models - NeurIPS'23, 2023. [All Versions]. Grammar prompting is a simple approach to enable LLMs to use external knowledge and domain-specific constraints expressed through a grammar in Backus--Naur Form (BNF) during in-context learning. Grammar prompting augments each demonstration example with a specialized grammar that is minimally sufficient for generating the particular output example, where the specialized grammar is a subset of the full DSL grammar. For inference, the LLM first predicts a BNF grammar given a test input, and then generates the output according to the rules of the grammar. Experiments demonstrate that grammar prompting can enable LLMs to perform competitively on a diverse set of DSL generation tasks, including semantic parsing (SMCalFlow, Overnight, GeoQuery), PDDL planning, and SMILES-based molecule generation.

  • Errors are Useful Prompts: Instruction Guided Task Programming with Verifier-Assisted Iterative Prompting - 2023. [All Versions]. [Project]. [Website]. This paper proposes CLAIRIFY, an approach that combines automatic iterative prompting with program verification to ensure programs written in data-scarce domain-specific language are syntactically valid and incorporate environment constraints.

  • PhotoScout: Synthesis-Powered Multi-Modal Image Search - ACM SIGCHI'24, 2024. [All Versions]. This paper explores a new multi-modal image search approach that allows users to conveniently specify and perform semantic image search tasks. With the tool, PhotoScout, the user interactively provides natural language descriptions, positive and negative examples, and object tags to specify their search tasks. Under the hood, PhotoScout is powered by a program synthesis engine that generates visual queries in a domain-specific language and executes the synthesized program to retrieve the desired images.

*Back to Top

Cognitive Foundations

  • The Child as Hacker - Trends in Cognitive Sciences, 2020. [All Versions]. The scope of human learning and development poses a radical challenge for cognitive science. The authors propose that developmental theories can address this challenge by adopting perspectives from computer science. Many of our best models treat learning as analogous to computer programming because symbolic programs provide the most compelling account of sophisticated mental representations. The authors specifically propose that children’s learning is analogous to a particular style of programming called hacking, making code better along many dimensions through an open-ended set of goals and activities. By contrast to existing theories, which depend primarily on local search and simple metrics, this view highlights the many features of good mental representations and the multiple complementary processes children use to create them.

  • Communicating Natural Programs to Humans and Machines - NeurIPS'22, 2022. [All Versions]. While humans readily generate and interpret instructions in a general language, computer systems are shackled to a narrow domain-specific language that they can precisely execute. This makes building intelligent systems that can generalize to novel situations such as ARC difficult. Human-generated instructions are referred as “natural programs”. While they resemble computer programs, they are distinct in two ways: First, they contain a wide range of primitives; Second, they frequently leverage communicative strategies beyond directly executable codes.

  • Symbolic metaprogram search improves learning efficiency and explains rule learning in humans - Nature Communications, 2024. [All Versions]. Symbolic models based on program learning successfully explain rule-learning in many domains, but performance degrades quickly as program complexity increases. It remains unclear how to scale symbolic rule-learning methods to model human performance in challenging domains. This work shows that symbolic search over the space of metaprograms—programs that revise programs—dramatically improves learning efficiency. On a behavioral benchmark of 100 algorithmically rich rules, this approach fits human learning more accurately than alternative models while also using orders of magnitude less search. The computation required to match median human performance is consistent with conservative estimates of human thinking time. The results suggest that metaprogram-like representations may help human learners to efficiently acquire rules.

*Back to Top

Problem Solving

Human-Level Problem Solving

*Back to Top

Planning

*Back to Top

Intrinsic Motivation

*Back to Top

Reinforcement Learning

*Back to Top

Inverse Reinforcement Learning

*Back to Top

System 1 & System 2

Dual-Coding Theory

*Back to Top

Neural-Symbolic AI

*Back to Top

Explainability

Trustworthy AI

*Back to Top

Strong Machine Learning

*Back to Top

Explainable Deep Learning

*Back to Top

Embodied Intelligence

*Back to Top

Evolutionary Intelligence

*Back to Top

Methodologies for Experiments

Quantitative Analysis

*Back to Top

Scaling Up Behavioral Studies

*Back to Top

Decision Making

*Back to Top

Question Answering

*Back to Top

Human-Machine Comparison

*Back to Top

Association Test

*Back to Top

Virtual Reality

*Back to Top

Meta-Level Considerations

Meta Learning

*Back to Top

Marr's Levels of Analysis

*Back to Top

Gestalt

*Back to Top

The Aha! Moment

*Back to Top

Rationality

*Back to Top

Cognitive Architecture

*Back to Top

Science Logology

Philosophy of Science

*Back to Top

Science of Science

*Back to Top

Literature Mining

*Back to Top

Scientific Writing

*Back to Top

Science Education

  • Cognitive Science and Science Education - American Psychologist, 1986. [All Versions]. Susan Carey's review on cognitive-science-based methodologies for science education research.

  • PersLEARN: Research Training through the Lens of Perspective Cultivation - ACL'23, 2023. [All Versions]. Scientific research is inherently shaped by its authors’ perspectives, influenced by various factors such as their personality, community, or society. Junior researchers often face challenges in identifying the perspectives reflected in the existing literature and struggle to develop their own viewpoints. To address the problem, this paper introduces PersLEARN, a tool designed to facilitate the cultivation of scientific perspectives, starting from a basic seed idea and progressing to a well-articulated framework.

*Back to Top

Democratization of Science

*Back to Top

Laboratory Automation

  • Reconfigurable system for automated optimization of diverse chemical reactions - Science, 2018. [All Versions]. [Preprint]. This paper describes a plug-and-play, continuous-flow chemical synthesis system that mitigates this challenge with an integrated combination of hardware, software, and analytics. The system software controls the user-selected reagents and unit operations (reactors and separators), processes reaction analytics (high-performance liquid chromatography, mass spectrometry, vibrational spectroscopy), and conducts automated optimizations.

  • Organic synthesis in a modular robotic system driven by a chemical programming language - Science, 2019. [All Versions]. [Preprint]. [Perspective: Democratizing synthesis by automation]. This paper develops an autonomous compiler and robotic laboratory platform to synthesize organic compounds on the basis of standardized methods descriptions. The platform comprises conventional equipment such as round-bottom flasks, separatory funnels, and a rotary evaporator to maximize its compatibility with extant literature. The authors showcase the system with short syntheses of three common pharmaceuticals that proceeded comparably to manual synthesis.

  • A universal system for digitization and automatic execution of the chemical synthesis literature - Science, 2020. [All Versions]. [Preprint]. [XDL Documentation]. [XDL Schema Database]. This paper reports a software platform that uses natural language processing to translate the organic chemistry literature directly into editable code, which in turn can be compiled to drive automated synthesis of the compound in the laboratory.

  • Digitization and validation of a chemical synthesis literature database in the ChemPU - Science, 2022. [All Versions]. [Preprint]. This paper presents an automatically executable chemical reaction database of 100 molecules representative of the range of reactions found in contemporary organic synthesis. The chemical reaction codes or χDLs for the reactions have been stored in a database for version control, validation, collaboration, and data mining. Of these syntheses, more than 50 entries from the database have been downloaded and robotically run in seven modular chemputers with yields and purities comparable to those achieved by an expert chemist.

  • Chemputation and the Standardization of Chemical Informatics - Journal of the American Chemical Society (Au), 2021. [All Versions]. This paper describes a standard hardware (the chemical processing programming architecture --- the ChemPU) to encompass all chemical synthesis, an approach which unifies all chemistry automation strategies, from solid-phase peptide synthesis, to HTE flow chemistry platforms, while at the same time establishing a publication standard so that researchers can exchange chemical code (χDL) to ensure reproducibility and interoperability.

  • Convergence of multiple synthetic paradigms in a universally programmable chemical synthesis machine - Nature Chemistry, 2020. [All Versions]. [Preprint]. This paper shows how the Chemputer synthesis robot can be programmed to perform many different reactions, including solid-phase peptide synthesis, iterative cross-coupling and accessing reactive, unstable diazirines in a single, unified system with high yields and purity.

  • An autonomous portable platform for universal chemical synthesis - Nature Chemistry, 2022. [All Versions]. [Preprint]. This paper presents a portable suitcase-sized chemical synthesis platform containing all the modules required for synthesis and purification. The system uses a chemical programming language coupled to a digital reactor generator to produce reactors and executable protocols based on text-based literature syntheses. Simultaneously, the platform generates a reaction pressure fingerprint, used to monitor processes within the reactors and remotely perform a protocol quality control.

  • An integrated self-optimizing programmable chemical synthesis and reaction engine - Nature Communications, 2024. [All Versions]. This paper presents a dynamically programmable system capable of making, optimizing, and discovering new molecules which utilizes seven sensors that continuously monitor the reaction. By developing a dynamic programming language, the work demonstrates the 10-fold scale-up of a highly exothermic oxidation reaction, end point detection, as well as detecting critical hardware failures.

  • A mobile robotic chemist - Nature, 2020. [All Versions]. [Preprint]. This work uses a mobile robot to search for improved photocatalysts for hydrogen production from water. The robot operated autonomously over eight days, performing 688 experiments within a ten-variable experimental space, driven by a batched Bayesian search algorithm. This autonomous search identified photocatalyst mixtures that were six times more active than the initial formulations, selecting beneficial components and deselecting negative ones.

  • An autonomous laboratory for the accelerated synthesis of novel materials - Nature, 2023. [All Versions]. This paper introduces the A-Lab, an autonomous laboratory for the solid-state synthesis of inorganic powders. This platform uses computations, historical data from the literature, machine learning (ML) and active learning to plan and interpret the outcomes of experiments performed using robotics. Over 17 days of continuous operation, the A-Lab realized 41 novel compounds from a set of 58 targets including a variety of oxides and phosphates that were identified using large-scale ab initio phase-stability data from the Materials Project and Google DeepMind.

  • The Internet of Things comes to the lab - Nature, 2017. [All Versions]. The emergence of connected instruments and equipment promises to untether researchers from the laboratory --- letting them fine-tune experiments and analyse data remotely.

  • A dynamic knowledge graph approach to distributed self-driving laboratories - Nature Communications, 2024. [All Versions]. This work employs ontologies to capture data and material flows in design-make-test-analyse cycles, utilising autonomous agents as executable knowledge components to carry out the experimentation workflow. Data provenance is recorded to ensure its findability, accessibility, interoperability, and reusability. The architecture is built upon the World Avatar project, which seeks to create an all-encompassing digital twin based on a dynamic knowledge graph.

  • Automation isn't automatic - Chemical Science, 2021. [All Versions]. This perspective provides an overview of the current state of automation of synthetic chemistry at the benchtop scale with a particular emphasis on core considerations and the ensuing challenges of deploying a system. The authors aim to reframe automation as decidedly not automatic but rather an iterative process that involves a series of careful decisions (both human and computational) and constant adjustment.

  • Balancing act: when to flex and when to stay fixed - Trends in Chemistry, 2023. [All Versions]. This perspective article provides essential insights into the decision-making process for choosing automation platforms, highlighting the suitability of fixed automation for standardized tasks and the strategic use of flexible automation in dynamic research settings.

  • What is a minimal working example for a self-driving laboratory? - Matter, 2022. [All Versions]. This paper proposes SDL-Demo: a low-cost “Hello, World!” for self-driving laboratories that combines “Hello, World!” tasks from electronics, physics-based simulations, and optimization. SDL-Demo is modular and extensible, making it an ideal candidate for low-cost teaching and prototyping of self-driving laboratory concepts.

  • Robotic search for optimal cell culture in regenerative medicine - eLife, 2022. [All Versions]. This paper develops a robotic AI system with a batch Bayesian optimization algorithm that autonomously induces the differentiation of induced pluripotent stem cell-derived retinal pigment epithelial (iPSC-RPE) cells. From 200 million possible parameter combinations, the system performed cell culture in 143 different conditions in 111 days, resulting in 88% better iPSC-RPE production than that obtained by the pre-optimized culture in terms of the pigmentation scores.

  • Cell Culture: Implementing robotics and artificial intelligence - eLife, 2022. [All Versions].

*Back to Top

AI Assisted Research

*Back to Top

Theory of Mind

  • The naïve utility calculus: Computational principles underlying commonsense psychology - Trends in Cognitive Sciences, 2016. [All Versions]. [Preprint]. This review article proposes that human social cognition is structured around a basic understanding of ourselves and others as intuitive utility maximizers: from a young age, humans implicitly assume that agents choose goals and actions to maximize the rewards they expect to obtain relative to the costs they expect to incur. This ‘naïve utility calculus’ allows both children and adults observe the behavior of others and infer their beliefs and desires, their longer-term knowledge and preferences, and even their character: who is knowledgeable or competent, who is praiseworthy or blameworthy, who is friendly, indifferent, or an enemy.

  • Planning with theory of mind - Trends in Cognitive Sciences, 2022. [All Versions]. [Preprint]. A perspective on understanding Theory of Mind through planning that consists of abstract structured causal representations and supports efficient search and selection from innumerable possible actions. Planning requires that Theory of Mind consists of abstract structured causal representations and supports efficient search and selection from innumerable possible actions. Theory of Mind contrasts with less cognitively demanding alternatives: statistical predictive models of other people’s actions, or model-free reinforcement of actions by their effects on other people. Theory of Mind is likely used to plan novel interventions and predict their effects, for example, in pedagogy, emotion regulation, and impression management.

  • Action Understanding as Inverse Planning - Cognition, 2009. [All Versions]. [Appendix]. The original paper on Inverse Planning, a computational implementation of Theory of Mind. Humans are adept at inferring the mental states underlying other agents’ actions, such as goals, beliefs, desires, emotions and other thoughts. This paper proposes a computational framework based on Bayesian inverse planning for modeling human action understanding. The framework represents an intuitive theory of intentional agents’ behavior based on the principle of rationality: the expectation that agents will plan approximately rationally to achieve their goals, given their beliefs about the world. The mental states that caused an agent's behavior are inferred by inverting this model of rational planning using Bayesian inference, integrating the likelihood of the observed actions with the prior over mental states.

  • Bayesian Theory of Mind: Modeling Joint Belief-Desire Attribution - CogSci'11, 2011. [All Versions]. [Preprint]. This paper presents a computational framework for understanding Theory of Mind (ToM): the human capacity for reasoning about agents’ mental states such as beliefs and desires. The proposed Bayesian model of ToM (or BToM) expresses the predictive model of belief- and desire-dependent action at the heart of ToM as a partially observable Markov decision process (POMDP), and reconstructs an agent’s joint belief state and reward function using Bayesian inference, conditioned on observations of the agent’s behavior in some environmental context.

  • The Signature of All Things: Children Infer Knowledge States from Static Images - CogSci'20, 2020. [All Versions].

  • Bayesian Brains without Probabilities - Trends in Cognitive Sciences, 2016. [All Versions]. A perspective on human probabilistic modeling without explicit probabilistic computation.

  • Rational quantitative attribution of beliefs, desires and percepts in human mentalizing - Nature Human Behavior, 2017. [All Versions]. [Preprint]. This paper presents a model of core mentalizing computations: inferring jointly an actor’s beliefs, desires and percepts from how they move in the local spatial environment. The proposed Bayesian theory of mind (BToM) model is based on probabilistically inverting artificial-intelligence approaches to rational planning and state estimation, which extend classical expected-utility agent models to sequential actions in complex, partially observable domains.

  • Machine theory of mind - ICML'18, 2018. [All Versions]. Theory of mind (ToM) broadly refers to humans’ ability to represent the mental states of others, including their desires, beliefs, and intentions. This work proposes a Theory of Mind neural network --- a ToMnet --- which uses meta-learning to build such models of the agents it encounters. The ToMnet learns a strong prior model for agents’ future behaviour, and, using only a small number of behavioural observations, can bootstrap to richer predictions about agents’ characteristics and mental states.

  • Theory of mind as inverse reinforcement learning - Current Opinion in Behavioral Sciences, 2019. [All Versions]. This paper reviews the idea that Theory of Mind --- humans' ability to reason about other people's mental states --- can be formalized as inverse reinforcement learning. Under this framework, expectations about how mental states produce behavior are captured in a reinforcement learning (RL) model. Predicting other people’s actions is achieved by simulating a RL model with the hypothesized beliefs and desires, while mental-state inference is achieved by inverting this model. Although many advances in inverse reinforcement learning (IRL) did not have human Theory of Mind in mind, this paper focuses on what they reveal when conceptualized as cognitive theories.

  • Computational Models of Emotion Inference in Theory of Mind: A Review and Roadmap - Topics in Cognitive Science, 2019. [All Versions]. This paper proposes an intuitive theory framework to studying affective cognition—how humans reason about emotions—and derive a taxonomy of inferences within affective cognition. Using this taxonomy, the authors review formal computational modeling work on such inferences, including causal reasoning about how others react to events, reasoning about unseen causes of emotions, reasoning with multiple cues, as well as reasoning from emotions to other mental states. This framework proposes unifying these various types of reasoning as Bayesian inference within a common “intuitive Theory of Emotion.”

  • The Naïve Utility Calculus as a unified, quantitative framework for action understanding - Cognitive Psychology, 2021. [All Versions]. [Project]. This paper presents a formal theory of the Naïve Utility Calculus as a probabilistic generative model, which highlights the role of cost and reward tradeoffs in a Bayesian framework for action-understanding. The model predicts with quantitative accuracy how people infer agents’ subjective costs and rewards based on their observable actions. By distinguishing between desires, goals, and intentions, the model extends to complex action scenarios unfolding over space and time in scenes with multiple objects and multiple action episodes.

  • AGENT: A Benchmark for Core Psychological Reasoning - ICML'21, 2021. [All Versions]. Inspired by cognitive development studies on intuitive psychology, this paper presents a benchmark consisting of a large dataset of procedurally generated 3D animations, AGENT (Action, Goal, Efficiency, coNstraint, uTility), structured around four scenarios (goal preferences, action efficiency, unobserved constraints, and cost-reward trade-offs) that probe key concepts of core intuitive psychology. The results suggest that to pass the designed tests of core intuitive psychology at human levels, a model must acquire or have built-in representations of how agents plan, combining utility computations and core knowledge of objects and physics.

  • Experimental Games and Social Decision Making - Annual Review of Psychology, 2021. [All Versions]. Experimental games model situations in which the future outcomes of individuals and groups depend on their own choices and on those of other (groups of) individuals. Games are a powerful tool to identify the neural and psychological mechanisms underlying interpersonal and group cooperation and coordination. This review article discusses recent developments in how experimental games are used and adapted, with an increased focus on repeated interactions, partner control through sanctioning, and partner (de)selection for future interactions.

  • Theory of Minds: Understanding Behavior in Groups through Inverse Planning - AAAI'19, 2019. [All Versions]. Towards the goal of building machine-learning algorithms with human-like social intelligence, this paper develops a generative model of multiagent action understanding based on a novel representation for these latent relationships called Composable Team Hierarchies (CTH). This representation is grounded in the formalism of stochastic games and multi-agent reinforcement learning. This work uses CTH as a target for Bayesian inference yielding a new algorithm for understanding behavior in groups that can both infer hidden relationships as well as predict future actions for multiple agents interacting together.

  • Leveraging Facial Expressions and Contextual Information to Investigate Opaque Representations of Emotion - Emotion, 2019. [All Versions].

  • Waiting and weighting: Information sampling is a balance between efficiency and error-reduction - Cognition, 2013. [All Versions].

  • Natural scene statistics account for the representation of scene categories in human visual cortex - Neuron, 2013. [All Versions].

  • Using human brain activity to guide machine learning - Scientific Report, 2018. [All Versions].

  • Unit of visual working memory: A Boolean map provides a better account than an object does - Journal of Experimental Psychology, 2020. [All Versions].

  • The logic of universalization guides moral judgment - Proceedings of the National Academy of Sciences, 2020. [All Versions].

  • Learning Triadic Belief Dynamics in Nonverbal Communication from Videos - CVPR'21, 2021. [All Versions]. [Preprint]. This paper incorporates different nonverbal communication cues (e.g., gaze, human poses, and gestures) to represent, model, learn, and infer agents' mental states from pure visual inputs. Crucially, such a mental representation takes the agent's belief into account so that it represents what the true world state is and infers the beliefs in each agent's mental state, which may differ from the true world states. By aggregating different beliefs and true world states, the model essentially forms "five minds" during the interactions between two agents. This "five minds" model differs from prior works that infer beliefs in an infinite recursion; instead, agents' beliefs are converged into a "common mind". Based on this representation, this work further devises a hierarchical energy-based model that jointly tracks and predicts all five minds. From this new perspective, a social event is interpreted by a series of nonverbal communication and belief dynamics, which transcends the classic keyframe video summary.

  • Ten-month-old infants infer the value of goals from the costs of actions - Science, 2017. [All Versions]. A piece of evidence for children's capability on ToM.

  • Origins of the concepts cause, cost, and goal in prereaching infants - Proceedings of the National Academy of Sciences, 2019. [All Versions].

  • Baby Intuitions Benchmark (BIB): Discerning the goals, preferences, and actions of others - NeurIPS'21, 2021. [All Versions].

  • Intentonomy: a Dataset and Study towards Human Intent Understanding - CVPR'21, 2021. [All Versions]. A large-scale database on human intentionally-posted images on social media.

  • Adventures in Flatland: Perceiving Social Interactions Under Physical Dynamics - CogSci'20, 2020. [All Versions].

  • PHASE: PHysically-grounded Abstract Social Events for Machine Social Perception - AAAI'21, 2021. [All Versions]. [Project].

  • Watch-And-Help: A Challenge for Social Perception and Human-AI Collaboration - ICLR'21, 2021. [All Versions].

  • Evaluating and Modeling Social Intelligence: A Comparative Study of Human and AI Capabilities - CogSci'24, 2024. [All Versions]. This work eveloped a comprehensive theoretical framework for social dynamics and introduced two evaluation tasks: Inverse Reasoning (IR) and Inverse Inverse Planning (IIP). The approach also encompassed a computational model based on recursive Bayesian inference, adept at elucidating diverse human behavioral patterns. Extensive experiments and detailed analyses revealed that humans surpassed the latest GPT models in overall performance, zero-shot learning, one-shot generalization, and adaptability to multi-modalities.

*Back to Top

Analogy

*Back to Top

Causality

*Back to Top

Commonsense

Intuitive Physics

*Back to Top

AI Commonsense Reasoning

*Back to Top

Commonsense Knowledgebase

*Back to Top

Inductive Logic & Program Synthesis

*Back to Top

Knowledge Representation

*Back to Top

Cognitive Development

*Back to Top

Learning in the Open World

*Back to Top

Learning with Cognitive Plausibility

*Back to Top

Academic Tools

Courses

*Back to Top

Programming

  • Probabilistic Models of Cognition - MIT. The probabilistic approach to cognitive science, which models learning and reasoning as inference in complex probabilistic models.

*Back to Top

Paper Writing

  • LaTex Configuration - LaTex. LaTex template for configuration file with elegant reference style (gray-colored reference, page backward reference).

  • BibTex Template - BibTex. BibTex template for including abbreviations of journals and conferences in AI, Mathematics, and Cognitive Sciences.

  • bioRender - bioRender. Create professional science figures in minutes by browsing thousands of pre-made icons and templates from more than 30 fields of life sciences.

  • How to construct a Nature summary paragraph - Nature. Nature official guidelines for composing abstracts.

  • How to write a superb literature review - Nature, 2020. Nature speaks to old hands and first timers about the work they did to make their reviews sing.

  • Scientific Papers - Nature. Nature guidance on writing scientific papers.

  • The Machine Learning Reproducibility Checklist - McGill University. Guidelines for introducing a machine learning algorithm with guarantee of reproducibility.

*Back to Top

Paper Reading

*Back to Top

Literature Management

*Back to Top

Knowledge Management

*Back to Top

Institute & Researcher

MIT

*Back to Top

Stanford

*Back to Top

Princeton

*Back to Top

Harvard

*Back to Top

UCLA

*Back to Top

UC Berkeley

*Back to Top

BNU

*Back to Top

PKU

*Back to Top

UCSD

*Back to Top

NYU

*Back to Top

JHU

*Back to Top

SIT

*Back to Top

People & Book

John Hopcroft

Theoretical computer scientist.

*Back to Top

Ulf Grenander

Applied mathematician, the founder of General Pattern Theory.

*Back to Top

David Marr

Computational Cognitive Neuroscientist, the establisher of the Levels of Analysis.

*Back to Top

Michael Tomasello

Cognitive scientist, set up the foundations of studying human communications.

*Back to Top

Judea Pearl

Applied mathematician, proposed causal intervention on siamese bayesian networks.

*Back to Top

Susan Carey

Developmental psychologist, proposed object as a core knowledge of human intelligence.

*Back to Top

Daniel Kahneman

Computational cognitive scientist and Economist, set up the foundations for Decision Theory.

*Back to Top

Karl Popper

Scientific philosophor, the founder of scientific verification theories.

*Back to Top

About

The initiator of this repo has been struggling to taxonomize related topics, since there are so many perspectives to follow, such as task-oriented, technique-oriented, and metaphysics-oriented. Finally he decided to focus on the perspective of The Sciences of Intelligence---each topic describes a phenomenon of intelligence, or an intelligent behavior---they show the objectives of reverse-engineering human intelligence for computational methods. These topics are never restricted to specific technical methods or tasks, but are trying to organize the nature of intelligence---from both the software perspective and the hardware perspective.

Obviously, this reading list is far from covering the every aspect of AGI and CoCoSci. Since the list is a by-product of the literature reviews when the initiator is working on Abduction and Bayesian modeling, other topics are also collected with biases, more or less. Abduction may be the way humans explain the world with the known, and discover the unknown, requiring much more investigations into its computational basis, cognitive underpinnings, and applications to AI. Please feel free to reach out!

*Back to Top