All Topics  
Neural network

 

   Email Print
   Bookmark   Link






 

Neural network



 
 
Traditionally, the term neural network had been used to refer to a network or circuit of biological neurons
Neuron

Neurons are responsive cell in the nervous system that process and transmit information by electrochemical Signal . They are the core components of the brain, the vertebrate spinal cord, the invertebrate ventral nerve cord, and the peripheral nerves....
. The modern usage of the term often refers to artificial neural network
Artificial neural network

An artificial neural network , often just called a "neural network" , is a mathematical model or computational model based on biological neural networks....
s, which are composed of artificial neuron
Artificial neuron

An artificial neuron is a mathematical function conceived as a crude model, or abstraction of biological neurons. Artificial neurons are the constitutive units in an artificial neural network....
s or nodes. Thus the term has two distinct usages:
  1. Biological neural network
    Biological neural network

    In neuroscience, a neural network describes a population of physically interconnected neurons or a group of disparate neurons whose inputs or signalling targets define a recognizable circuit....
    s are made up of real biological neurons that are connected or functionally related in the peripheral nervous system
    Peripheral nervous system

    The peripheral nervous system resides or extends outside the central nervous system , which consists of the brain and spinal cord. The main function of the PNS is to connect the CNS to the limbs and organs....
     or the central nervous system
    Central nervous system

    The central nervous system is the part of the nervous system that functions to coordinate the activity of all parts of the bodies of multicellular organisms....
    . In the field of neuroscience
    Neuroscience

    Neuroscience is a field devoted to the scientific study of the nervous system. The Society for Neuroscience was founded in 1969, but the study of the brain started a long time ago....
    , they are often identified as groups of neurons that perform a specific physiological function in laboratory analysis.
  2. Artificial neural network
    Artificial neural network

    An artificial neural network , often just called a "neural network" , is a mathematical model or computational model based on biological neural networks....
    s are made up of interconnecting artificial neurons (programming constructs that mimic the properties of biological neurons).






    Discussion
    Ask a question about 'Neural network'
    Start a new discussion about 'Neural network'
    Answer questions from other users
    Full Discussion Forum



    Encyclopedia


    Traditionally, the term neural network had been used to refer to a network or circuit of biological neurons
    Neuron

    Neurons are responsive cell in the nervous system that process and transmit information by electrochemical Signal . They are the core components of the brain, the vertebrate spinal cord, the invertebrate ventral nerve cord, and the peripheral nerves....
    . The modern usage of the term often refers to artificial neural network
    Artificial neural network

    An artificial neural network , often just called a "neural network" , is a mathematical model or computational model based on biological neural networks....
    s, which are composed of artificial neuron
    Artificial neuron

    An artificial neuron is a mathematical function conceived as a crude model, or abstraction of biological neurons. Artificial neurons are the constitutive units in an artificial neural network....
    s or nodes. Thus the term has two distinct usages:
    1. Biological neural network
      Biological neural network

      In neuroscience, a neural network describes a population of physically interconnected neurons or a group of disparate neurons whose inputs or signalling targets define a recognizable circuit....
      s are made up of real biological neurons that are connected or functionally related in the peripheral nervous system
      Peripheral nervous system

      The peripheral nervous system resides or extends outside the central nervous system , which consists of the brain and spinal cord. The main function of the PNS is to connect the CNS to the limbs and organs....
       or the central nervous system
      Central nervous system

      The central nervous system is the part of the nervous system that functions to coordinate the activity of all parts of the bodies of multicellular organisms....
      . In the field of neuroscience
      Neuroscience

      Neuroscience is a field devoted to the scientific study of the nervous system. The Society for Neuroscience was founded in 1969, but the study of the brain started a long time ago....
      , they are often identified as groups of neurons that perform a specific physiological function in laboratory analysis.
    2. Artificial neural network
      Artificial neural network

      An artificial neural network , often just called a "neural network" , is a mathematical model or computational model based on biological neural networks....
      s are made up of interconnecting artificial neurons (programming constructs that mimic the properties of biological neurons). Artificial neural networks may either be used to gain an understanding of biological neural networks, or for solving artificial intelligence problems without necessarily creating a model of a real biological system. The real, biological nervous system is highly complex and includes some features that may seem superfluous based on an understanding of artificial networks.


    This article focuses on the relationship between the two concepts; for detailed coverage of the two different concepts refer to the separate articles: Biological neural network
    Biological neural network

    In neuroscience, a neural network describes a population of physically interconnected neurons or a group of disparate neurons whose inputs or signalling targets define a recognizable circuit....
     and Artificial neural network
    Artificial neural network

    An artificial neural network , often just called a "neural network" , is a mathematical model or computational model based on biological neural networks....
    .

    Overview

    In general a biological neural network is composed of a group or groups of chemically connected or functionally associated neurons. A single neuron may be connected to many other neurons and the total number of neurons and connections in a network may be extensive. Connections, called synapses, are usually formed from axons to dendrites, though dendrodendritic microcircuits and other connections are possible. Apart from the electrical signaling, there are other forms of signaling that arise from neurotransmitter
    Neurotransmitter

    Neurotransmitters are chemistry which relay, amplify and modulate signals between a neuron and another cell . Neurotransmitters are packaged into vesicles that cluster beneath the membrane on the presynaptic side of a synapse, and are released into the synaptic cleft, where they bind to receptors in the membrane on the postsynaptic side of...
     diffusion, which have an effect on electrical signaling. As such, neural networks are extremely complex.

    Artificial intelligence
    Artificial intelligence

    Artificial intelligence is the intelligence of machines and the branch of computer science which aims to create it. Major AI textbooks define the field as "the study and design of intelligent agents,"...
     and cognitive modeling try to simulate some properties of neural networks. While similar in their techniques, the former has the aim of solving particular tasks, while the latter aims to build mathematical models of biological neural systems.

    In the artificial intelligence
    Artificial intelligence

    Artificial intelligence is the intelligence of machines and the branch of computer science which aims to create it. Major AI textbooks define the field as "the study and design of intelligent agents,"...
     field, artificial neural networks have been applied successfully to speech recognition
    Speech recognition

    Speech recognition converts spoken words to machine-readable input . The term "voice recognition" is sometimes incorrectly used to refer to speech recognition, when actually referring to speaker recognition, which attempts to identify the person speaking, as opposed to what is being said....
    , image analysis
    Image analysis

    Image analysis is the extraction of meaningful information from s; mainly from digital images by means of digital image processing techniques. Image analysis tasks can be as simple as reading barcoded tags or as sophisticated as facial recognition system....
     and adaptive control
    Control

    Control is used in a variety of contexts to express "mastery" or "proficiency": e.g. "Music students attending a master class are expected to have full control of basic skills such as rhythm and pitch" and more generally an ability to purposefully direct change....
    , in order to construct software agents (in computer and video games) or autonomous robot
    Autonomous robot

    Autonomous robots are robots which can perform desired tasks in unstructured environments without continuous human guidance. Many kinds of robots have some degree of autonomy....
    s. Most of the currently employed artificial neural networks for artificial intelligence are based on statistical estimation, optimization
    Optimization (mathematics)

    In mathematics, the simplest case of optimization, or mathematical programming, refers to the study of problems in which one seeks to maxima and minima or maxima and minima a Function of a real variable by systematically choosing the values of Real number or integer variables from within an allowed set....
     and control theory
    Control theory

    Control theory is an interdisciplinary branch of engineering and mathematics, that deals with the behavior of dynamical systems. The desired output of a system is called the reference....
    .

    The cognitive modelling field involves the physical or mathematical modeling of the behaviour of neural systems; ranging from the individual neural level (e.g. modelling the spike response curves of neurons to a stimulus), through the neural cluster level (e.g. modelling the release and effects of dopamine in the basal ganglia) to the complete organism (e.g. behavioural modelling of the organism's response to stimuli).

    History of the neural network analogy

    The concept of neural networks started in the late-1800s as an effort to describe how the human mind performed. These ideas started being applied to computational models with Turing's B-type machines and the perceptron
    Perceptron

    The perceptron is a type of artificial neural network invented in 1957 at the Cornell Aeronautical Laboratory by Frank Rosenblatt. It can be seen as the simplest kind of feedforward neural network: a linear classifier....
    .

    In early 1950s Friedrich Hayek
    Friedrich Hayek

    Friedrich August von Hayek Order of the Companions of Honour was an Austrian economist and philosopher known throughout the world for his defense of classical liberalism and free market capitalism against socialism and collectivism thought....
     was one of the first to posit the idea of spontaneous order
    Spontaneous order

    Spontaneous order is the spontaneous emergence of order out of seeming chaos; the emergence of various kinds of social order from a combination of self-interested individuals who are not intentionally trying to create order....
      in the brain arising out of decentralized networks of simple units (neurons). In the late 1940s, Donald Hebb made one of the first hypotheses for a mechanism of neural plasticity (i.e. learning), Hebbian learning. Hebbian learning is considered to be a 'typical' unsupervised learning rule and it (and variants of it) was an early model for long term potentiation.

    The Perceptron
    Perceptron

    The perceptron is a type of artificial neural network invented in 1957 at the Cornell Aeronautical Laboratory by Frank Rosenblatt. It can be seen as the simplest kind of feedforward neural network: a linear classifier....
     is essentially a linear classifier for classifying data specified by parameters and an output function . Its parameters are adapted with an ad-hoc rule similar to stochastic steepest gradient descent. Because the inner product is a linear operator in the input space, the Perceptron can only perfectly classify a set of data for which different classes are linearly separable
    Linearly separable

    In geometry, when two sets of points in a two-dimensional graph can be completely separated by a single line, they are said to be linearly separable....
     in the input space, while it often fails completely for non-separable data. While the development of the algorithm initially generated some enthusiasm, partly because of its apparent relation to biological mechanisms, the later discovery of this inadequacy caused such models to be abandoned until the introduction of non-linear models into the field.

    The Cognitron (1975) was an early multilayered neural network with a training algorithm. The actual structure of the network and the methods used to set the interconnection weights change from one neural strategy to another, each with its advantages and disadvantages. Networks can propagate information in one direction only, or they can bounce back and forth until self-activation at a node occurs and the network settles on a final state. The ability for bi-directional flow of inputs between neurons/nodes was produced with the Hopfield's network
    Hopfield net

    A Hopfield net is a form of Recurrent neural network Artificial_neural_network invented by John Hopfield. Hopfield nets serve as associative memory systems with Binary numeral system threshold units....
     (1982), and specialization of these node layers for specific purposes was introduced through the first hybrid network
    Hybrid neural network

    The term hybrid neural network can have two meanings:#biological neural networks interacting with artificial neural network, and#Artificial neural networks with a Computationalism part ....
    .

    The parallel distributed processing
    Connectionism

    Connectionism is a set of approaches in the fields of artificial intelligence, cognitive psychology, cognitive science, neuroscience and philosophy of mind, that models mind or behavior phenomena as the emergence of interconnected networks of simple units....
     of the mid-1980s became popular under the name connectionism
    Connectionism

    Connectionism is a set of approaches in the fields of artificial intelligence, cognitive psychology, cognitive science, neuroscience and philosophy of mind, that models mind or behavior phenomena as the emergence of interconnected networks of simple units....
    .

    The rediscovery of the backpropagation
    Backpropagation

    Backpropagation, or propagation of error, is a common method of teaching artificial neural networks how to perform a given task. It was first described by Paul Werbos in 1974, but it wasn't until 1986, through the work of David E....
     algorithm was probably the main reason behind the repopularisation of neural networks after the publication of "Learning Internal Representations by Error Propagation" in 1986 (Though backpropagation itself dates from 1974). The original network utilised multiple layers of weight-sum units of the type , where was a sigmoid function
    Sigmoid function

    Many natural processes and complex system learning curve display a history dependent progression from small beginnings that accelerates and approaches a climax over time....
     or logistic function
    Logistic function

    A logistic function or logistic curve is the most common sigmoid curve. It modelsthe S-curve of growth of some set P, where P might...
     such as used in logistic regression
    Logistic regression

    In statistics, logistic regression is a model used for prediction of the probability of occurrence of an event by fitting data to a logistic curve....
    . Training was done by a form of stochastic steepest gradient descent. The employment of the chain rule of differentiation in deriving the appropriate parameter updates results in an algorithm that seems to 'backpropagate errors', hence the nomenclature. However it is essentially a form of gradient descent. Determining the optimal parameters in a model of this type is not trivial, and steepest gradient descent methods cannot be relied upon to give the solution without a good starting point. In recent times, networks with the same architecture as the backpropagation network are referred to as Multi-Layer Perceptrons
    Multilayer perceptron

    A multilayer perceptron is a feedforward artificial neural network model that maps sets of input data onto a set of appropriate output. It is a modification of the standard linear perceptron in that it uses three or more layers of neurons with nonlinear activation function, and is more powerful than the perceptron in that it can distinguish...
    . This name does not impose any limitations on the type of algorithm used for learning.

    The backpropagation network generated much enthusiasm at the time and there was much controversy about whether such learning could be implemented in the brain or not, partly because a mechanism for reverse signalling was not obvious at the time, but most importantly because there was no plausible source for the 'teaching' or 'target' signal.

    The brain, neural networks and computers

    Neural networks, as used in artificial intelligence, have traditionally been viewed as simplified models of neural processing in the brain, even though the relation between this model and brain biological architecture is debated.

    A subject of current research in theoretical neuroscience is the question surrounding the degree of complexity and the properties that individual neural elements should have to reproduce something resembling animal intelligence.

    Historically, computers evolved from the von Neumann architecture
    Von Neumann architecture

    The von Neumann architecture is a design model for a stored-program digital computer that uses a central processing unit and a single separate computer storage structure to hold both instructions and data ....
    , which is based on sequential processing and execution of explicit instructions. On the other hand, the origins of neural networks are based on efforts to model information processing in biological systems, which may rely largely on parallel processing as well as implicit instructions based on recognition of patterns of 'sensory' input from external sources. In other words, at its very heart a neural network is a complex statistical processor (as opposed to being tasked to sequentially process and execute).

    Neural networks and artificial intelligence

    An artificial neural network (ANN), also called a simulated neural network (SNN) or commonly just neural network (NN) is an interconnected group of artificial neuron
    Artificial neuron

    An artificial neuron is a mathematical function conceived as a crude model, or abstraction of biological neurons. Artificial neurons are the constitutive units in an artificial neural network....
    s that uses a mathematical or computational model
    Mathematical model

    A mathematical model uses mathematics language to describe a system. Mathematical models are used not only in the natural sciences and engineering disciplines but also in the social sciences ; physicists, engineers, computer sciences, and economists use mathematical models most extensively....
     for information processing
    Information processing

    Information processing is the change of information in any manner detectable by an observation. As such, it is a Process which describes everything which happens in the universe, from the falling of a rock to the printing of a text file from a digital computer system....
     based on a connectionistic
    Connectionism

    Connectionism is a set of approaches in the fields of artificial intelligence, cognitive psychology, cognitive science, neuroscience and philosophy of mind, that models mind or behavior phenomena as the emergence of interconnected networks of simple units....
     approach to computation
    Computation

    Computation is a general term for any type of information processing. This includes phenomena ranging from human thinking to calculations with a more narrow meaning....
    . In most cases an ANN is an adaptive system
    Adaptive system

    An adaptive system is a set of interacting or interdependent entities, real or abstract, forming an integrated whole that together are able to respond to environmental changes or changes in the interacting parts....
     that changes its structure based on external or internal information that flows through the network.

    In more practical terms neural networks are non-linear statistical data modeling
    Data modeling

    Data modeling in software engineering is the process of creating a data model by applying formal data model descriptions using data modeling techniques....
     or decision making
    Decision making

    Decision making can be regarded as an outcome of mental processes leading to the selection of a course of action among several alternatives. Every decision making process produces a final choice....
     tools. They can be used to model complex relationships between inputs and outputs or to find patterns
    Pattern recognition

    Pattern recognition is a sub-topic of machine learning. It is "the act of taking in raw data and taking an action based on the Category of the data"....
     in data.

    Background

    An artificial neural network
    Artificial neural network

    An artificial neural network , often just called a "neural network" , is a mathematical model or computational model based on biological neural networks....
     involves a network of simple processing elements (artificial neurons) which can exhibit complex global behaviour, determined by the connections between the processing elements and element parameters. Artificial neurons were first proposed in 1943 by Warren McCulloch, a neurophysiologist, and Walter Pitts, an MIT logician. One classical type of artificial neural network is the Hopfield net
    Hopfield net

    A Hopfield net is a form of Recurrent neural network Artificial_neural_network invented by John Hopfield. Hopfield nets serve as associative memory systems with Binary numeral system threshold units....
    .

    In a neural network model simple nodes, which can be called variously "neurons", "neurodes", "Processing Elements" (PE) or "units", are connected together to form a network of nodes — hence the term "neural network". While a neural network does not have to be adaptive per se, its practical use comes with algorithms designed to alter the strength (weights) of the connections in the network to produce a desired signal flow.

    In modern software implementations
    Neural network software

    Neural network software is used to Simulation, research, Software development and apply artificial neural networks, biological neural networks and in some cases a wider array of adaptive systems....
     of artificial neural networks the approach inspired by biology has more or less been abandoned for a more practical approach based on statistics and signal processing. In some of these systems neural networks, or parts of neural networks (such as artificial neuron
    Artificial neuron

    An artificial neuron is a mathematical function conceived as a crude model, or abstraction of biological neurons. Artificial neurons are the constitutive units in an artificial neural network....
    s) are used as components in larger systems that combine both adaptive and non-adaptive elements.

    The concept of a neural network appears to have first been proposed by Alan Turing
    Alan Turing

    Alan Mathison Turing, Order of the British Empire, Fellow of the Royal Society was a British mathematician, logician and Cryptanalysis....
     in his 1948 paper "Intelligent Machinery".

    Applications

    The utility of artificial neural network models lies in the fact that they can be used to infer a function from observations and also to use it. This is particularly useful in applications where the complexity of the data or task makes the design of such a function by hand impractical.

    Real life applications The tasks to which artificial neural networks are applied tend to fall within the following broad categories:
    • Function approximation
      Function approximation

      The need for function approximations arises in many branches of applied mathematics, and computer science in particular. In general, a function approximation problem asks us to select a function among a well-defined class that closely matches a target function in a task-specific way....
      , or regression analysis
      Regression analysis

      In statistics, regression analysis is a collective name for techniques for the modeling and analysis of numerical data consisting of values of a dependent variable and of one or more independent variables ....
      , including time series prediction and modelling.
    • Classification
      Statistical classification

      Statistical classification is a procedure in which individual items are placed into groups based on quantitative information on one or more characteristics inherent in the items and based on a training set of previously labeled items....
      , including pattern
      Pattern recognition

      Pattern recognition is a sub-topic of machine learning. It is "the act of taking in raw data and taking an action based on the Category of the data"....
       and sequence recognition, novelty detection and sequential decision making.
    • Data processing
      Data processing

      Computer data processing is any computering Process that converts datas into information or knowledge. The processing is usually assumed to be automated and running on a computer....
      , including filtering, clustering, blind signal separation
      Blind signal separation

      Blind signal separation, also known as blind source separation, is the separation of a set of signal processings from a set of mixed signals, without the aid of information about the source signals or the mixing process....
       and compression.


    Application areas include system identification and control (vehicle control, process control), game-playing and decision making (backgammon, chess, racing), pattern recognition (radar systems, face identification, object recognition, etc.), sequence recognition (gesture, speech, handwritten text recognition), medical diagnosis, financial applications, data mining
    Data mining

    Data mining is the process of extracting hidden patterns from data. As more data is gathered, with the amount of data doubling every three years, data mining is becoming an increasingly important tool to transform this data into information....
     (or knowledge discovery in databases, "KDD"), visualization and e-mail spam
    E-mail spam

    E-mail spam, also known as junk e-mail, is a subset of spam that involves nearly identical messages sent to numerous recipients by e-mail....
     filtering.

    Neural network software

    Main article: Neural network software
    Neural network software

    Neural network software is used to Simulation, research, Software development and apply artificial neural networks, biological neural networks and in some cases a wider array of adaptive systems....


    Neural network software is used to simulate
    Simulation

    Simulation is the imitation of some real thing, state of affairs, or process. The act of simulating something generally entails representing certain key characteristics or behaviors of a selected physical or abstract system....
    , research
    Research

    Research is defined as human activity based on intellectual application in the investigation of matter. The primary purpose for applied research is discovery , interpretation , and the development of methods and systems for the advancement of human knowledge on a wide variety of scientific matters of our world and the universe....
    , develop
    Software development

    Software development is the set of activities that results in software products. Software development may include research, new development, modification, reuse, re-engineering, maintenance, or any other activities that result in software products....
     and apply artificial neural network
    Artificial neural network

    An artificial neural network , often just called a "neural network" , is a mathematical model or computational model based on biological neural networks....
    s, biological neural network
    Biological neural network

    In neuroscience, a neural network describes a population of physically interconnected neurons or a group of disparate neurons whose inputs or signalling targets define a recognizable circuit....
    s and in some cases a wider array of adaptive system
    Adaptive system

    An adaptive system is a set of interacting or interdependent entities, real or abstract, forming an integrated whole that together are able to respond to environmental changes or changes in the interacting parts....
    s.

    Learning paradigms
    There are three major learning paradigms, each corresponding to a particular abstract learning task. These are supervised learning
    Supervised learning

    Supervised learning is a machine learning technique for learning a function from training data. The training set consist of pairs of input objects , and desired outputs....
    , unsupervised learning
    Unsupervised learning

    In machine learning, unsupervised learning is a class of problems in which one seeks to determine how the data are organized. It is distinguished from supervised learning in that the learner is given only unlabeled examples....
     and reinforcement learning
    Reinforcement learning

    Inspired by related psychological theory, in computer science, reinforcement learning is a sub-area of machine learning concerned with how an agent ought to take actions in an environment so as to maximize some notion of long-term reward....
    . Usually any given type of network architecture can be employed in any of those tasks.

    Supervised learning In supervised learning
    Supervised learning

    Supervised learning is a machine learning technique for learning a function from training data. The training set consist of pairs of input objects , and desired outputs....
    , we are given a set of example pairs and the aim is to find a function in the allowed class of functions that matches the examples. In other words, we wish to infer how the mapping implied by the data and the cost function is related to the mismatch between our mapping and the data.

    Unsupervised learning In unsupervised learning
    Unsupervised learning

    In machine learning, unsupervised learning is a class of problems in which one seeks to determine how the data are organized. It is distinguished from supervised learning in that the learner is given only unlabeled examples....
     we are given some data , and a cost function which is to be minimized which can be any function of and the network's output, . The cost function is determined by the task formulation. Most applications fall within the domain of estimation problems such as statistical modeling, compression
    Data compression

    In computer science and information theory, data compression or source coding is the process of encoding information using fewer bits than an code representation would use through use of specific encoding schemes....
    , filtering, blind source separation and clustering
    Data clustering

    Clustering is the assignment of objects into groups so that objects from the same cluster are more similar to each other than objects from different clusters....
    .

    Reinforcement learning In reinforcement learning
    Reinforcement learning

    Inspired by related psychological theory, in computer science, reinforcement learning is a sub-area of machine learning concerned with how an agent ought to take actions in an environment so as to maximize some notion of long-term reward....
    , data is usually not given, but generated by an agent's interactions with the environment. At each point in time , the agent performs an action and the environment generates an observation and an instantaneous cost , according to some (usually unknown) dynamics. The aim is to discover a policy for selecting actions that minimises some measure of a long-term cost, i.e. the expected cumulative cost. The environment's dynamics and the long-term cost for each policy are usually unknown, but can be estimated. ANNs are frequently used in reinforcement learning as part of the overall algorithm. Tasks that fall within the paradigm of reinforcement learning are control
    Control

    Control is used in a variety of contexts to express "mastery" or "proficiency": e.g. "Music students attending a master class are expected to have full control of basic skills such as rhythm and pitch" and more generally an ability to purposefully direct change....
     problems, game
    Game

    A game is a structured wiktionary:activity, usually undertaken for enjoyment and sometimes used as an educational tool. Games are distinct from Manual labour, which is usually carried out for wiktionary:remuneration, and from art, which is more concerned with the expression of ideas....
    s and other sequential decision making tasks.

    Learning algorithms
    There are many algorithms for training neural networks; most of them can be viewed as a straightforward application of optimization
    Optimization (mathematics)

    In mathematics, the simplest case of optimization, or mathematical programming, refers to the study of problems in which one seeks to maxima and minima or maxima and minima a Function of a real variable by systematically choosing the values of Real number or integer variables from within an allowed set....
     theory and statistical estimation.

    Evolutionary computation
    Evolutionary computation

    In computer science evolutionary computation is a subfield of artificial intelligence that involves combinatorial optimization problems.Evolutionary computation uses iterative progress, such as growth or development in a population....
     methods, simulated annealing
    Simulated annealing

    Simulated annealing is a generic probabilistic algorithm metaheuristic for the global optimization problem of applied mathematics, namely locating a good approximation to the global optimum of a given function in a large search space....
    , expectation maximization and non-parametric methods are among other commonly used methods for training neural networks. See also machine learning
    Machine learning

    Machine learning is the subfield of artificial intelligence that is concerned with the design and development of algorithms that allow computers to improve their performance over time based on data, such as from sensor data or databases....
    .

    Recent developments in this field also saw the use of particle swarm optimization
    Particle swarm optimization

    Particle swarm optimization is a swarm intelligence based algorithm to find a solution to an optimization problem in a search space, or model and predict social behavior in the presence of objectives....
     and other swarm intelligence
    Swarm intelligence

    Swarm intelligence is a type of artificial intelligence based on the collective behavior of decentralization, Self organization systems. The expression was introduced by Gerardo Beni and Jing Wang in 1989, in the context of Cellular automaton systems....
     techniques used in the training of neural networks.

    Neural networks and neuroscience

    Theoretical and computational neuroscience
    Computational neuroscience

    Computational neuroscience is an interdisciplinary science that links the diverse fields of neuroscience, cognitive science, electrical engineering, computer science, physics and mathematics....
     is the field concerned with the theoretical analysis and computational modeling of biological neural systems. Since neural systems are intimately related to cognitive processes and behaviour, the field is closely related to cognitive and behavioural modeling.

    The aim of the field is to create models of biological neural systems in order to understand how biological systems work. To gain this understanding, neuroscientists strive to make a link between observed biological processes (data), biologically plausible mechanisms for neural processing and learning (biological neural network
    Biological neural network

    In neuroscience, a neural network describes a population of physically interconnected neurons or a group of disparate neurons whose inputs or signalling targets define a recognizable circuit....
     models) and theory (statistical learning theory and information theory
    Information theory

    Information theory is a branch of applied mathematics and electrical engineering involving the quantification of information. Historically, information theory was developed by Claude E....
    ).

    Types of models

    Many models are used in the field, each defined at a different level of abstraction and trying to model different aspects of neural systems. They range from models of the short-term behaviour of individual neurons, through models of how the dynamics of neural circuitry arise from interactions between individual neurons, to models of how behaviour can arise from abstract neural modules that represent complete subsystems. These include models of the long-term and short-term plasticity of neural systems and its relation to learning and memory, from the individual neuron to the system level.

    Current research

    While initially research had been concerned mostly with the electrical characteristics of neurons, a particularly important part of the investigation in recent years has been the exploration of the role of neuromodulators such as dopamine
    Dopamine

    Dopamine is a neurotransmitter occurring in a wide variety of animals, including both vertebrates and invertebrates. In the human brain, this phenethylamine functions as a neurotransmitter, activating the five types of dopamine receptors ? D1, D2, D3, D4 and D5, and their variants....
    , acetylcholine
    Acetylcholine

    The chemical compound acetylcholine is a neurotransmitter in both the peripheral nervous system and central nervous system in many organisms including homo sapiens....
    , and serotonin
    Serotonin

    Serotonin is a monoamine neurotransmitter synthesized in serotonergic neurons in the central nervous system and enterochromaffin cells in the gastrointestinal tract of animals including humans....
     on behaviour and learning.

    Biophysical
    Biophysics

    Biophysics is an interdisciplinary science that employs and develops theories and methods of the physical sciences for the investigation of biology systems....
     models, such as BCM theory
    BCM theory

    BCM theory, BCM synaptic modification, or the BCM rule, named for Elie Bienenstock, Leon Cooper, and Paul Munro, is a physical theory of learning in the visual cortex developed in 1981....
    , have been important in understanding mechanisms for synaptic plasticity
    Synaptic plasticity

    In neuroscience, synaptic plasticity is the ability of the connection, or synapse, between two neurons to change in Synapse#Synaptic strength. There are several underlying mechanisms that cooperate to achieve synaptic plasticity, including changes in the quantity of neurotransmitters released into a synapse and changes in how effectively cell...
    , and have had applications in both computer science and neuroscience. Research is ongoing in understanding the computational algorithms used in the brain, with some recent biological evidence for radial basis networks and neural backpropagation
    Neural backpropagation

    Neural backpropagation is the phenomenon in which the action potential of a neuron creates a voltage spike both at the end of the axon and back through to the dendritic arbor or dendrites, from which much of the original input current originated....
     as mechanisms for processing data.

    Criticism

    A common criticism of neural networks, particularly in robotics, is that they require a large diversity of training for real-world operation. Dean Pomerleau, in his research presented in the paper "Knowledge-based Training of Artificial Neural Networks for Autonomous Robot Driving," uses a neural network to train a robotic vehicle to drive on multiple types of roads (single lane, multi-lane, dirt, etc.). A large amount of his research is devoted to (1) extrapolating multiple training scenarios from a single training experience, and (2) preserving past training diversity so that the system does not become overtrained (if, for example, it is presented with a series of right turns – it should not learn to always turn right). These issues are common in neural networks that must decide from amongst a wide variety of responses.

    A. K. Dewdney, a former Scientific American
    Scientific American

    Scientific American is a popular science science magazine, published since August 28, 1845, making it one of the oldest continuously published magazines in the United States....
     columnist, wrote in 1997, "Although neural nets do solve a few toy problems, their powers of computation are so limited that I am surprised anyone takes them seriously as a general problem-solving tool." (Dewdney, p.82)

    Arguments against Dewdney's position are that neural nets have been successfully used to solve many complex and diverse tasks, ranging from autonomously flying aircraft to detecting credit card fraud.

    Technology writer Roger Bridgman commented on Dewdney's statements about neural nets:
    Neural networks, for instance, are in the dock not only because they have been hyped to high heaven, (what hasn't?) but also because you could create a successful net without understanding how it worked: the bunch of numbers that captures its behaviour would in all probability be "an opaque, unreadable table...valueless as a scientific resource".

    In spite of his emphatic declaration that science is not technology, Dewdney seems here to pillory neural nets as bad science when most of those devising them are just trying to be good engineers. An unreadable table that a useful machine could read would still be well worth having.


    Some other criticisms came from believers of hybrid models (combining neural networks and symbolic approaches). They advocate the intermix of these two approaches and believe that hybrid models can better capture the mechanisms of the human mind.

    See also


    Further reading

    • Alspector, "Neuromorphic learning networks". October 17, 1989.
    • , p. 80*
    • See chapter 3.**
    • See chapter 5.
                  • Jeffrey T. Spooner, Manfredi Maggiore, Raul Ord onez, and Kevin M. Passino, Stable Adaptive Control and Estimation for Nonlinear Systems: Neural and Fuzzy Approximator Techniques, John Wiley and Sons, NY, 2002.


    External links

    • - Robot control and neural networks
    • - A neural network alternative.
    • - includes Java applet for online experimenting with prediction of a function
    • - Google Tech Talks
    • - Artificial Perceptual Neural Network used for machine learning to play Chess
      Chess

      Chess is a recreational and competitive game played between two Player . Sometimes called Western chess or international chess to distinguish it from History of chess and other chess variants, the current form of the game emerged in Southern Europe during the second half of the 15th century after evolving from similar, much older...