Jump to content

Neuro-symbolic AI

From Wikipedia, the free encyclopedia

Neuro-symbolic AI is a type of artificial intelligence that integrates neural and symbolic AI architectures to address the weaknesses of each, providing a robust AI capable of reasoning, learning, and cognitive modeling. As argued by Leslie Valiant[1] and others,[2][3] the effective construction of rich computational cognitive models demands the combination of symbolic reasoning and efficient machine learning. Gary Marcus argued, "We cannot construct rich cognitive models in an adequate, automated way without the triumvirate of hybrid architecture, rich prior knowledge, and sophisticated techniques for reasoning."[4] Further, "To build a robust, knowledge-driven approach to AI we must have the machinery of symbol manipulation in our toolkit. Too much useful knowledge is abstract to proceed without tools that represent and manipulate abstraction, and to date, the only known machinery that can manipulate such abstract knowledge reliably is the apparatus of symbol manipulation."[5]

Henry Kautz,[6] Francesca Rossi,[7] and Bart Selman[8] also argued for a synthesis. Their arguments attempt to address the two kinds of thinking, as discussed in Daniel Kahneman's book Thinking Fast and Slow. It describes cognition as encompassing two components: System 1 is fast, reflexive, intuitive, and unconscious. System 2 is slower, step-by-step, and explicit. System 1 is used for pattern recognition. System 2 handles planning, deduction, and deliberative thinking. In this view, deep learning best handles the first kind of cognition while symbolic reasoning best handles the second kind. Both are needed for a robust, reliable AI that can learn, reason, and interact with humans to accept advice and answer questions. Such dual-process models with explicit references to the two contrasting systems have been worked on since the 1990s, both in AI and in Cognitive Science, by multiple researchers.[9]

Approaches

[edit]

Approaches for integration are diverse.[10] Henry Kautz's taxonomy of neuro-symbolic architectures[11] follows, along with some examples:

  • Symbolic Neural symbolic is the current approach of many neural models in natural language processing, where words or subword tokens are the ultimate input and output of large language models. Examples include BERT, RoBERTa, and GPT-3.
  • Symbolic[Neural] is exemplified by AlphaGo, where symbolic techniques are used to invoke neural techniques. In this case, the symbolic approach is Monte Carlo tree search and the neural techniques learn how to evaluate game positions.
  • Neural | Symbolic uses a neural architecture to interpret perceptual data as symbols and relationships that are reasoned about symbolically. Neural-Concept Learner[12] is an example.
  • Neural: Symbolic → Neural relies on symbolic reasoning to generate or label training data that is subsequently learned by a deep learning model, e.g., to train a neural model for symbolic computation by using a Macsyma-like symbolic mathematics system to create or label examples.
  • Neural_{Symbolic} uses a neural net that is generated from symbolic rules. An example is the Neural Theorem Prover,[13] which constructs a neural network from an AND-OR proof tree generated from knowledge base rules and terms. Logic Tensor Networks[14] also fall into this category.
  • Neural[Symbolic] allows a neural model to directly call a symbolic reasoning engine, e.g., to perform an action or evaluate a state. An example would be ChatGPT using a plugin to query Wolfram Alpha.

These categories are not exhaustive, as they do not consider multi-agent systems. In 2005, Bader and Hitzler presented a more fine-grained categorization that considered, e.g., whether the use of symbols included logic and if it did, whether the logic was propositional or first-order logic.[15] The 2005 categorization and Kautz's taxonomy above are compared and contrasted in a 2021 article.[11] Recently, Sepp Hochreiter argued that Graph Neural Networks "...are the predominant models of neural-symbolic computing"[16] since "[t]hey describe the properties of molecules, simulate social networks, or predict future states in physical and engineering applications with particle-particle interactions."[17]

Artificial general intelligence

[edit]

Gary Marcus argues that "...hybrid architectures that combine learning and symbol manipulation are necessary for robust intelligence, but not sufficient",[18] and that there are

...four cognitive prerequisites for building robust artificial intelligence:

  • hybrid architectures that combine large-scale learning with the representational and computational powers of symbol manipulation,
  • large-scale knowledge bases—likely leveraging innate frameworks—that incorporate symbolic knowledge along with other forms of knowledge,
  • reasoning mechanisms capable of leveraging those knowledge bases in tractable ways, and
  • rich cognitive models that work together with those mechanisms and knowledge bases.[19]

This echoes earlier calls for hybrid models as early as the 1990s.[20][21]

History

[edit]

Garcez and Lamb described research in this area as ongoing at least since the 1990s.[22][23] At that time, the terms symbolic and sub-symbolic AI were popular.

A series of workshops on neuro-symbolic AI has been held annually since 2005 Neuro-Symbolic Artificial Intelligence.[24] In the early 1990s, an initial set of workshops on this topic were organized.[20]

Research

[edit]

Key research questions remain,[25] such as:

  • What is the best way to integrate neural and symbolic architectures?
  • How should symbolic structures be represented within neural networks and extracted from them?
  • How should common-sense knowledge be learned and reasoned about?
  • How can abstract knowledge that is hard to encode logically be handled?

Implementations

[edit]

Implementations of neuro-symbolic approaches include:

  • AllegroGraph: an integrated Knowledge Graph based platform for neuro-symbolic application development.[26][27][28]
  • Scallop: a language based on Datalog that supports differentiable logical and relational reasoning. Scallop can be integrated in Python and with a PyTorch learning module.[29]
  • Logic Tensor Networks: encode logical formulas as neural networks and simultaneously learn term encodings, term weights, and formula weights.
  • DeepProbLog: combines neural networks with the probabilistic reasoning of ProbLog.
  • SymbolicAI: a compositional differentiable programming library.
  • Explainable Neural Networks (XNNs): combine neural networks with symbolic hypergraphs and trained using a mixture of backpropagation and symbolic learning called induction.[30]

Citations

[edit]
  1. ^ Valiant 2008.
  2. ^ Garcez et al. 2015.
  3. ^ D'Avila Garcez, Artur S.; Lamb, Luis C.; Gabbay, Dov M. (2009). Neural-symbolic cognitive reasoning. Cognitive technologies. Springer. ISBN 978-3-540-73245-7.
  4. ^ Marcus 2020, p. 44.
  5. ^ Marcus & Davis 2019, p. 17.
  6. ^ Kautz 2020.
  7. ^ Rossi 2022.
  8. ^ Selman 2022.
  9. ^ Sun 1995.
  10. ^ "Disentangling visual attributes with neuro-vector-symbolic architectures, in-memory computing, and device noise". IBM Research. 2021-02-09. Retrieved 2024-10-20.
  11. ^ a b Sarker, Md Kamruzzaman; Zhou, Lu; Eberhart, Aaron; Hitzler, Pascal (2021). "Neuro-symbolic artificial intelligence: Current trends". AI Communications. 34 (3): 197–209. doi:10.3233/AIC-210084. S2CID 239199144.
  12. ^ Mao et al. 2019.
  13. ^ Rocktäschel, Tim; Riedel, Sebastian (2016). "Learning Knowledge Base Inference with Neural Theorem Provers". Proceedings of the 5th Workshop on Automated Knowledge Base Construction. San Diego, CA: Association for Computational Linguistics. pp. 45–50. doi:10.18653/v1/W16-1309. Retrieved 2022-08-06.
  14. ^ Serafini, Luciano; Garcez, Artur d'Avila (2016). "Logic Tensor Networks: Deep Learning and Logical Reasoning from Data and Knowledge". arXiv:1606.04422 [cs.AI].
  15. ^ Bader & Hitzler 2005.
  16. ^ L.C. Lamb, A.S. d'Avila Garcez, M.Gori, M.O.R. Prates, P.H.C. Avelar, M.Y. Vardi (2020). "Graph Neural Networks Meet Neural-Symbolic Computing: A Survey and Perspective." CoRR abs/2003.00330 (2020)
  17. ^ Hochreiter, Sepp (April 2022). "Toward a broad AI". Communications of the ACM. 65 (4): 56–57. doi:10.1145/3512715. ISSN 0001-0782.
  18. ^ Marcus 2020, p. 50.
  19. ^ Marcus 2020, p. 48.
  20. ^ a b Sun & Bookman 1994.
  21. ^ Honavar 1995.
  22. ^ Garcez & Lamb 2020, p. 2.
  23. ^ Garcez et al. 2002.
  24. ^ "Neuro-Symbolic Artificial Intelligence". people.cs.ksu.edu. Retrieved 2023-09-11.
  25. ^ Sun 2001.
  26. ^ Harper, Jelani (2023-12-29). "AllegroGraph 8.0 Incorporates Neuro-Symbolic AI, a Pathway to AGI". The New Stack. Retrieved 2024-06-13.
  27. ^ "Neuro-Symbolic AI and Large Language Models Introduction | AllegroGraph 8.1.1". franz.com. Retrieved 2024-06-13.
  28. ^ "Franz Inc. Introduces AllegroGraph Cloud: A Managed Service for Neuro-Symbolic AI Knowledge Graphs". Datanami. Retrieved 2024-06-13.
  29. ^ Li, Ziyang; Huang, Jiani; Naik, Mayur (2023). "Scallop: A Language for Neurosymbolic Programming". arXiv:2304.04812 [cs.PL].
  30. ^ "Model Induction Method for Explainable AI". USPTO. 2021-05-06.

References

[edit]

See also

[edit]
[edit]