Information about Node (neural Networks)

An artificial neuron, also called semi-linear unit, Nv neuron, binary neuron or McCulloch-Pitts neuron, is an abstraction of biological neurons and the basic unit in an artificial neural network. The Artificial Neuron receives one or more inputs (representing the one or more dendrites) and sums them to produce an output (synapse). Usually the sums of each node are weighted, and the sum is passed through a non-linear function known as an activation or transfer function. The canonical form of transfer functions is the sigmoid, but they may also take the form of other non-linear functions, piecewise linear functions, or step functions. Generally, transfer functions are monotonically increasing.

Basic structure

For a given artificial neuron, let there be m inputs with signals x0 through xm and weights w0 through wm.

The output of neuron k is:



Where (Phi) is the transfer function.



The output propagates to the next layer (through a weighted synapse) or finally exits the system as part or all of the output.

History

The original artificial neuron is the Threshold Logic Unit first proposed by Warren McCulloch and Walter Pitts in 1943. As a transfer function, it employs a threshold or Heaviside step function taking on the values 1 or 0 only. The inputs and outputs are both binary. Edges have no weights, but are either excitatory or inhibitory. Excitation is summed to calculate the activation value, with each input contributing a 1 value (connections may be replicated to effectively multiply this by any integer). Inhibition is absolute rather than relative, meaning that a single active inhibitory input will force the output of a node to 0, regardless of the values of the excitatory inputs.

Types of transfer functions

The transfer function of a neuron is chosen to have a number of properties which either enhance or simplify the network containing the neuron. Crucially, for instance, any multi-layer perceptron using a linear transfer function has an equivalent single-layer network; a non-linear function is therefore necessary to gain the advantages of a multi-layer network.

Below, u refers in all cases to the weighted sum of all the inputs to the neuron, i.e. for n inputs,



where w is a vector of synaptic weights and x is a vector of inputs.

Step function

The output y of this transfer function is binary, depending on whether the input meets a specified threshold, θ. The "signal" is sent, i.e. the output is set to one, if the activation meets the threshold.



See: Heaviside step function

Linear combination

The output unit y is a linearly weighted sum of its inputs plus a bias term, similar to θ above, which is independent of the inputs.



Networks based on this formulation are known as perceptrons. Typically the above transfer function in its pure form would only be useful in a regression setting. For a binary classification setting, the sign of the output denotes the class predicted; in this case it is more sensible (and more convenient in the context of the learning algorithm) to consider positive outputs to be 1 and negative outputs to be 0, thus reducing the transfer function to that of the step function above, where .

See: Perceptron

Sigmoid

A fairly simple non-linear function, the sigmoid also has an easily calculated derivative, which is used when calculating the weight updates in the network. It thus makes the network more easily manipulable mathematically, and was attractive to early computer scientists who needed to minimise the computational load of their simulations.

See: Sigmoid function

Pseudocode Algorithm

The following is a simple pseudocode implementation of a single TLU which takes boolean inputs (true or false), and returns a single boolean output when activated. An object oriented model is used. No method of training is defined, since several exist. If a purely functional model were used, the class TLU below would be replaced with a function TLU with input parameters threshold, weights, and inputs that returned a boolean value.

class TLU defined as: data member threshold : number data member weights : list of numbers of size X function member fire( inputs : list of booleans of size X ) : boolean defined as: variable T : number T 0 for each i in 1 to X : if inputs(i) is true : T T + weights(i) end if end for each if T > threshold : return true else: return false end if end function end class

Spreadsheet Example

generated with :de:Wikipedia:Helferlein/VBA-Macro for EXCEL tableconversion V1.7<\hiddentext>>
InputInitialOutputFinal
ThresholdLearning RateSensor valuesDesired outputWeightsCalculatedSumNetworkErrorCorrectionWeights
THLRX1X2Zw1w2C1C2SNERW1W2
X1 x w1X2 x w2C1+C2IF(S>TH,1,0)Z-NLR x ER+w1R+w2
0.50.20000.10.30000000.10.3
0.50.20110.10.300.30.3010.20.30.5
0.50.21010.30.50.300.3010.20.50.7
0.50.21110.50.70.50.71.21000.50.7
0.50.20000.50.70000000.50.7
0.50.20110.50.700.70.71000.50.7
0.50.21010.50.70.500.5010.20.70.9
0.50.21110.70.90.70.91.61000.70.9
0.50.20000.70.90000000.70.9
0.50.20110.70.900.90.91000.70.9
0.50.21010.70.90.700.71000.70.9
0.50.21110.70.90.70.91.61000.70.9


Note: Initial weight equals final weight of previous iteration.

Criticism

The Artificial neuron is criticized by Izhikevich for not being biologically realistic. Although this argument is technically correct, it is largely academic, as artificial neurons are not intended to perfectly model biological neurons, but rather to perform complex non-linear computations.

See also

Bibliography

  • McCulloch, W. and Pitts, W. (1943). A logical calculus of the ideas immanent in nervous activity. Bulletin of Mathematical Biophysics, 7:115 - 133.
Neurons (also known as neurones and nerve cells) are electrically excitable cells in the nervous system that process and transmit information. In vertebrate animals, neurons are the core components of the brain, spinal cord and peripheral nerves.
..... Click the link for more information.
An artificial neural network (ANN), often just called a "neural network" (NN), is a mathematical model or computational model based on biological neural networks. It consists of an interconnected group of artificial neurons and processes information using a connectionist approach
..... Click the link for more information.


Dendrites (from Greek dendron, “tree”) are the branched projections of a neuron that act to conduct the electrical stimulation received from other neural cells to the cell body, or
..... Click the link for more information.
synapse. Synapses allow nerve cells to communicate with one another through axons and dendrites, converting electrical impulses into chemical signals.]]

Chemical synapses
..... Click the link for more information.
nonlinear system is a system which is not linear i.e. a system which does not satisfy the superposition principle. Less technically, a nonlinear system is any problem where the variable(s) to be solved for cannot be written as a linear sum of independent components.
..... Click the link for more information.
sigmoid function is a mathematical function that produces a sigmoid curve — a curve having an "S" shape. Often, sigmoid function refers to the special case of the logistic function shown at right and defined by the formula


..... Click the link for more information.
piecewise-defined function f(x) of a real variable x is a function whose definition is given differently on disjoint subsets of its domain.

A common example is the absolute value function, given by

..... Click the link for more information.
monotonic function (or monotone function) is a function which preserves the given order. This concept first arose in calculus, and was later generalized to the more abstract setting of order theory.
..... Click the link for more information.
Warren Sturgis McCulloch (November 16, 1899 – September 24, 1969) was an American neurophysiologist and cybernetician.

Warren Sturgis McCulloch was born in Orange, New Jersey and studied at Yale (philosophy and psychology, A.B.
..... Click the link for more information.
Walter Pitts (23 April 1923 – 14 May 1969) was a logician who worked in the field of cognitive psychology. He proposed landmark theoretical formulations of neural activity and emergent processes that influenced diverse fields like cognitive sciences and psychology,
..... Click the link for more information.
19th century - 20th century - 21st century
1910s  1920s  1930s  - 1940s -  1950s  1960s  1970s
1940 1941 1942 - 1943 - 1944 1945 1946

Year 1943 (MCMXLIII
..... Click the link for more information.
Heaviside step function, H, also called unit step function, is a discontinuous function whose value is zero for negative argument and one for positive argument. It seldom matters what value is used for H(0), since is mostly used as a distribution.
..... Click the link for more information.
Heaviside step function, H, also called unit step function, is a discontinuous function whose value is zero for negative argument and one for positive argument. It seldom matters what value is used for H(0), since is mostly used as a distribution.
..... Click the link for more information.
In mathematics, linear combinations are a concept central to linear algebra and related fields of mathematics. Most of this article deals with linear combinations in the context of a vector space over a field, with some generalisations given at the end of the article.
..... Click the link for more information.
''For alternate meanings see Perceptron (disambiguation).
The perceptron is a type of artificial neural network invented in 1957 at the Cornell Aeronautical Laboratory by Frank Rosenblatt.
..... Click the link for more information.
Generally, regression is related to moving backwards, and the opposite of progression. Specifically, it may refer to:
  • Regression (psychology), a defensive reaction to some unaccepted impulses

..... Click the link for more information.
''For alternate meanings see Perceptron (disambiguation).
The perceptron is a type of artificial neural network invented in 1957 at the Cornell Aeronautical Laboratory by Frank Rosenblatt.
..... Click the link for more information.
Sigmoid means resembling the lower-case Greek letter sigma (ς). Specific uses include:
  • Sigmoid function, a mathematical function
  • Sigmoid colon, part of the large intestine or colon.

..... Click the link for more information.
sigmoid function is a mathematical function that produces a sigmoid curve — a curve having an "S" shape. Often, sigmoid function refers to the special case of the logistic function shown at right and defined by the formula


..... Click the link for more information.
sigmoid function is a mathematical function that produces a sigmoid curve — a curve having an "S" shape. Often, sigmoid function refers to the special case of the logistic function shown at right and defined by the formula


..... Click the link for more information.
Pseudocode (derived from pseudo and code) is a compact and informal high-level description of a computer programming algorithm that uses the structural conventions of programming languages, but omits detailed subroutines, variable declarations or language-specific syntax.
..... Click the link for more information.
Boolean may refer to:
  • Boolean datatype, a certain datatype in computer science
  • Boolean algebra (logic), a logical calculus of truth values or set membership

..... Click the link for more information.
Boolean may refer to:
  • Boolean datatype, a certain datatype in computer science
  • Boolean algebra (logic), a logical calculus of truth values or set membership

..... Click the link for more information.
Object-oriented programming (OOP) is a programming paradigm that uses "objects" and their interactions to design applications and computer programs. It is based on several techniques, including inheritance, modularity, polymorphism, and encapsulation.
..... Click the link for more information.
Boolean may refer to:
  • Boolean datatype, a certain datatype in computer science
  • Boolean algebra (logic), a logical calculus of truth values or set membership

..... Click the link for more information.
Traditionally, the term neural network had been used to refer to a network or circuitry of biological neurons. The modern usage of the term often refers to artificial neural networks, which are composed of artificial neurons or nodes.
..... Click the link for more information.
''For alternate meanings see Perceptron (disambiguation).
The perceptron is a type of artificial neural network invented in 1957 at the Cornell Aeronautical Laboratory by Frank Rosenblatt.
..... Click the link for more information.
ADALINE (Adaptive Linear Neuron or later Adaptive Linear Element) is a single layer neural network. It was developed by Professor Bernard Widrow and his graduate student Ted Hoff at Stanford University in 1960. It is based on the McCulloch-Pitts neuron.
..... Click the link for more information.
Connectionism is an approach in the fields of artificial intelligence, cognitive psychology/cognitive science, neuroscience and philosophy of mind. Connectionism models mental or behavioral phenomena as the emergent processes of interconnected networks of simple units.
..... Click the link for more information.


This article is copied from an article on Wikipedia.org - the free encyclopedia created and edited by online user community. The text was not checked or edited by anyone on our staff. Although the vast majority of the wikipedia encyclopedia articles provide accurate and timely information please do not assume the accuracy of any particular article. This article is distributed under the terms of GNU Free Documentation License.
Herod_Archelaus


page counter