Loading AI tools
Overview of and topical guide to natural-language processing From Wikipedia, the free encyclopedia
The following outline is provided as an overview of and topical guide to natural-language processing:
natural-language processing – computer activity in which computers are entailed to analyze, understand, alter, or generate natural language. This includes the automation of any or all linguistic forms, activities, or methods of communication, such as conversation, correspondence, reading, written composition, dictation, publishing, translation, lip reading, and so on. Natural-language processing is also the name of the branch of computer science, artificial intelligence, and linguistics concerned with enabling computers to engage in communication using natural language(s) in all forms, including but not limited to speech, print, writing, and signing.
Natural-language processing can be described as all of the following:
The following technologies make natural-language processing possible:
Natural-language processing contributes to, and makes use of (the theories, tools, and methodologies from), the following fields:
Natural-language generation – task of converting information from computer databases into readable human language.
History of natural-language processing
Software | Year | Creator | Description | Reference |
---|---|---|---|---|
Georgetown experiment | 1954 | Georgetown University and IBM | involved fully automatic translation of more than sixty Russian sentences into English. | |
STUDENT | 1964 | Daniel Bobrow | could solve high school algebra word problems.[10] | |
ELIZA | 1964 | Joseph Weizenbaum | a simulation of a Rogerian psychotherapist, rephrasing her (referred to as her not it) response with a few grammar rules.[11] | |
SHRDLU | 1970 | Terry Winograd | a natural-language system working in restricted "blocks worlds" with restricted vocabularies, worked extremely well | |
PARRY | 1972 | Kenneth Colby | A chatterbot | |
KL-ONE | 1974 | Sondheimer et al. | a knowledge representation system in the tradition of semantic networks and frames; it is a frame language. | |
MARGIE | 1975 | Roger Schank | ||
TaleSpin (software) | 1976 | Meehan | ||
QUALM | Lehnert | |||
LIFER/LADDER | 1978 | Hendrix | a natural-language interface to a database of information about US Navy ships. | |
SAM (software) | 1978 | Cullingford | ||
PAM (software) | 1978 | Robert Wilensky | ||
Politics (software) | 1979 | Carbonell | ||
Plot Units (software) | 1981 | Lehnert | ||
Jabberwacky | 1982 | Rollo Carpenter | chatterbot with stated aim to "simulate natural human chat in an interesting, entertaining and humorous manner". | |
MUMBLE (software) | 1982 | McDonald | ||
Racter | 1983 | William Chamberlain and Thomas Etter | chatterbot that generated English language prose at random. | |
MOPTRANS | 1984 | Lytinen | ||
KODIAK (software) | 1986 | Wilensky | ||
Absity (software) | 1987 | Hirst | ||
AeroText | 1999 | Lockheed Martin | Originally developed for the U.S. intelligence community (Department of Defense) for information extraction & relational link analysis | |
Watson | 2006 | IBM | A question answering system that won the Jeopardy! contest, defeating the best human players in February 2011. | |
MeTA | 2014 | Sean Massung, Chase Geigle, Cheng{X}iang Zhai | MeTA is a modern C++ data sciences toolkit featuringL text tokenization, including deep semantic features like parse trees; inverted and forward indexes with compression and various caching strategies; a collection of ranking functions for searching the indexes; topic models; classification algorithms; graph algorithms; language models; CRF implementation (POS-tagging, shallow parsing); wrappers for liblinear and libsvm (including libsvm dataset parsers); UTF8 support for analysis on various languages; multithreaded algorithms | |
Tay | 2016 | Microsoft | An artificial intelligence chatterbot that caused controversy on Twitter by releasing inflammatory tweets and was taken offline shortly after. |
The following natural-language processing toolkits are notable collections of natural-language processing software. They are suites of libraries, frameworks, and applications for symbolic, statistical natural-language and speech processing.
Name | Language | License | Creators |
---|---|---|---|
Apertium | C++, Java | GPL | (various) |
ChatScript | C++ | GPL | Bruce Wilcox |
Deeplearning4j | Java, Scala | Apache 2.0 | Adam Gibson, Skymind |
DELPH-IN | LISP, C++ | LGPL, MIT, ... | Deep Linguistic Processing with HPSG Initiative |
Distinguo | C++ | Commercial | Ultralingua Inc. |
DKPro Core | Java | Apache 2.0 / Varying for individual modules | Technische Universität Darmstadt / Online community |
General Architecture for Text Engineering (GATE) | Java | LGPL | GATE open source community |
Gensim | Python | LGPL | Radim Řehůřek |
LinguaStream | Java | Free for research | University of Caen, France |
Mallet | Java | Common Public License | University of Massachusetts Amherst |
Modular Audio Recognition Framework | Java | BSD | The MARF Research and Development Group, Concordia University |
MontyLingua | Python, Java | Free for research | MIT |
Natural Language Toolkit (NLTK) | Python | Apache 2.0 | |
Apache OpenNLP | Java | Apache License 2.0 | Online community |
spaCy | Python, Cython | MIT | Matthew Honnibal, Explosion AI |
UIMA | Java / C++ | Apache 2.0 | Apache |
Chatterbot – a text-based conversation agent that can interact with human users through some medium, such as an instant message service. Some chatterbots are designed for specific purposes, while others converse with human users on a wide range of topics.
Seamless Wikipedia browsing. On steroids.
Every time you click a link to Wikipedia, Wiktionary or Wikiquote in your browser's search results, it will show the modern Wikiwand interface.
Wikiwand extension is a five stars, simple, with minimum permission required to keep your browsing private, safe and transparent.