Information about Probability Density Function
In mathematics, a probability density function (pdf) is a function that represents a probability distribution in terms of integrals.
Formally, a probability distribution has density f if f is a non-negative Lebesgue-integrable function such that the probability of the interval [a, b] is given by
$$\int_a^b f(x)\,dx$$
for any two numbers a and b. This implies that the total integral of f must be 1. Conversely, any non-negative Lebesgue-integrable function with total integral 1 is the probability density of a suitably defined probability distribution.
Intuitively, if a probability distribution has density f(x), then the infinitesimal interval [x, x + dx] has probability f(x) dx.
Informally, a probability density function can be seen as a "smoothed out" version of a histogram: if one empirically samples enough values of a continuous random variable, producing a histogram depicting relative frequencies of output ranges, then this histogram will resemble the random variable's probability density, assuming that the output ranges are sufficiently narrow.
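The histogram picture can be checked numerically. The sketch below is only an illustration under assumed choices: it draws samples from a standard normal distribution, uses 40 bins on [-4, 4], and relies on a hypothetical helper named `histogram_density`. It compares each bar height with the density at the bin midpoint.

```python
import math
import random

def standard_normal_pdf(x):
    """Density of the standard normal distribution."""
    return math.exp(-x * x / 2.0) / math.sqrt(2.0 * math.pi)

def histogram_density(samples, lo, hi, bins):
    """Relative frequency per unit length in each of `bins` equal-width bins."""
    width = (hi - lo) / bins
    counts = [0] * bins
    for s in samples:
        if lo <= s < hi:
            # min() guards against a rounding error at the right edge.
            counts[min(bins - 1, int((s - lo) / width))] += 1
    # Dividing by (n * width) turns raw counts into a density estimate.
    return [c / (len(samples) * width) for c in counts]

rng = random.Random(0)  # fixed seed so the run is repeatable
samples = [rng.gauss(0.0, 1.0) for _ in range(200_000)]

bins = 40
width = 8.0 / bins
heights = histogram_density(samples, -4.0, 4.0, bins)

# Compare each bar with the density evaluated at the bin midpoint.
max_gap = max(
    abs(h - standard_normal_pdf(-4.0 + (i + 0.5) * width))
    for i, h in enumerate(heights)
)
print(f"largest gap between histogram and density: {max_gap:.4f}")
```

With narrower bins the bars follow the curve more closely, but more samples are then needed to keep the random fluctuation of each bar small.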
Simplified explanation
A probability density function is any function f(x) that describes the probability density in terms of the input variable x in the manner described below.
- f(x) is greater than or equal to zero for all values of x
- The total area under the graph is 1:
$$\int_{-\infty}^{\infty} f(x)\,dx = 1$$

The actual probability can then be calculated by integrating the function f(x) over the interval of interest for the input variable x.
For example, the probability of the variable X being within the interval [4.3, 7.8] would be
$$\Pr(4.3 \le X \le 7.8) = \int_{4.3}^{7.8} f(x)\,dx$$
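As a rough numerical illustration of this interval probability, the sketch below assumes a concrete density, since the text does not specify one: a normal distribution with mean 6 and standard deviation 2. The helper `integrate` is a hypothetical midpoint-rule integrator, not a library function.

```python
import math

def integrate(f, a, b, n=10_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

def normal_pdf(x, mu=6.0, sigma=2.0):
    """Assumed example density: normal with mean 6 and standard deviation 2."""
    z = (x - mu) / sigma
    return math.exp(-z * z / 2.0) / (sigma * math.sqrt(2.0 * math.pi))

# Probability that X falls in [4.3, 7.8] under the assumed density.
p = integrate(normal_pdf, 4.3, 7.8)
print(f"P(4.3 <= X <= 7.8) is approximately {p:.4f}")
```

Any other density could be passed to `integrate` in the same way; only the integration limits encode the interval.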
Further details
For example, the continuous uniform distribution on the interval [0,1] has probability density f(x) = 1 for 0 ≤ x ≤ 1 and f(x) = 0 elsewhere. The standard normal distribution has probability density
$$f(x) = \frac{1}{\sqrt{2\pi}}\, e^{-x^2/2}$$
If a random variable X is given and its distribution admits a probability density function f(x), then the expected value of X (if it exists) can be calculated as
$$\operatorname{E}(X) = \int_{-\infty}^{\infty} x\, f(x)\,dx$$
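The expected value integral can be approximated numerically for the two densities just mentioned. This is a minimal sketch: `integrate` and `expected_value` are hypothetical helpers, and truncating the normal density to [-10, 10] is an assumption that discards only a negligible tail.

```python
import math

def integrate(f, a, b, n=10_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

def uniform_pdf(x):
    """Density of the continuous uniform distribution on [0, 1]."""
    return 1.0 if 0.0 <= x <= 1.0 else 0.0

def standard_normal_pdf(x):
    """Density of the standard normal distribution."""
    return math.exp(-x * x / 2.0) / math.sqrt(2.0 * math.pi)

def expected_value(pdf, a, b):
    """Approximate E(X) as the integral of x * f(x) over [a, b]."""
    return integrate(lambda x: x * pdf(x), a, b)

mean_uniform = expected_value(uniform_pdf, 0.0, 1.0)
mean_normal = expected_value(standard_normal_pdf, -10.0, 10.0)
print(mean_uniform, mean_normal)
```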
Not every probability distribution has a density function: the distributions of discrete random variables do not; nor does the Cantor distribution, even though it has no discrete component, i.e., does not assign positive probability to any individual point.
A distribution has a density function if and only if its cumulative distribution function F(x) is absolutely continuous. In this case, F is almost everywhere differentiable, and its derivative can be used as probability density:
$$\frac{d}{dx}F(x) = f(x)$$
If a probability distribution admits a density, then the probability of every one-point set {a} is zero.
It is a common mistake to think of f(x) as the probability of {x}, but this is incorrect; in fact, f(x) will often be bigger than 1. Consider, for example, a random variable that is uniformly distributed between 0 and ½: its density equals 2 everywhere on that interval. Loosely, one may think of f(x) dx as the probability that a random variable whose probability density function is f lies in the interval from x to x + dx, where dx is an infinitely small increment.
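The uniform case can be made concrete with a few lines of code. The sketch below uses hypothetical helper names and shows that the density value exceeds 1 while the total probability is still 1, and that f(x) dx with a small finite dx approximates the probability of a short interval.

```python
def uniform_pdf(x, a=0.0, b=0.5):
    """Density of the uniform distribution on [a, b]: 1 / (b - a) inside, 0 outside."""
    return 1.0 / (b - a) if a <= x <= b else 0.0

def integrate(f, a, b, n=10_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

height = uniform_pdf(0.25)                 # a density value, not a probability
total = integrate(uniform_pdf, 0.0, 0.5)   # total probability over the support
small = uniform_pdf(0.25) * 1e-3           # approximate probability of [0.25, 0.251]
print(height, total, small)
```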
Two probability densities f and g represent the same probability distribution precisely if they differ only on a set of Lebesgue measure zero.
In the field of statistical physics, an informal reformulation of the relation above between the derivative of the cumulative distribution function and the probability density function is generally used as the definition of the probability density function. This alternative definition is the following:
If dt is an infinitely small number, the probability that X is included within the interval (t, t + dt) is equal to f(t) dt, or:
$$\Pr(t < X < t + dt) = f(t)\,dt$$
Link between discrete and continuous distributions
The definition of a probability density function at the start of this page makes it possible to describe the variable associated with a continuous distribution using a set of binary discrete variables associated with the intervals [a, b] (for example, a variable being worth 1 if X is in [a, b], and 0 if not).
It is also possible to represent certain discrete random variables using a probability density, via the Dirac delta function. For example, let us consider a binary discrete random variable taking −1 or 1 for values, with probability ½ each.
The probability density associated with this variable is:
$$f(t) = \frac{1}{2}\left(\delta(t+1) + \delta(t-1)\right)$$
More generally, if a discrete variable can take n different values among real numbers, then the associated probability density function is:
$$f(t) = \sum_{i=1}^{n} p_i\, \delta(t - x_i)$$
where $x_1, \ldots, x_n$ are the discrete values accessible to the variable and $p_1, \ldots, p_n$ are the probabilities associated with these values.
This expression allows for determining statistical characteristics of such a discrete variable (such as its mean, its variance and its kurtosis), starting from the formulas given for a continuous distribution.
In physics, this description is also useful for characterizing mathematically the initial configuration of a Brownian motion.
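Integrating a function against a sum of Dirac delta terms reduces to a finite weighted sum, which is how the continuous formulas recover the usual discrete statistics. The sketch below illustrates this for the binary variable taking −1 or 1 with probability ½ each; the helper name `delta_expectation` is hypothetical.

```python
def delta_expectation(values, probs, g):
    """E(g(X)) for the density sum_i p_i * delta(t - x_i).

    Integrating g against each delta term picks out g(x_i), so the
    integral collapses to a finite weighted sum.
    """
    return sum(p * g(x) for x, p in zip(values, probs))

values = [-1.0, 1.0]
probs = [0.5, 0.5]

mean = delta_expectation(values, probs, lambda t: t)
variance = delta_expectation(values, probs, lambda t: (t - mean) ** 2)
fourth = delta_expectation(values, probs, lambda t: (t - mean) ** 4)
kurtosis = fourth / variance ** 2  # non-excess kurtosis

print(mean, variance, kurtosis)  # prints: 0.0 1.0 1.0
```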
Probability function associated to multiple variables
For continuous random variables $X_1, \ldots, X_n$, it is also possible to define a probability density function associated with the set as a whole, often called the joint probability density function. This density function is defined as a function of the n variables such that, for any domain D in the n-dimensional space of the values of the variables $X_1, \ldots, X_n$, the probability that a realisation of the set of variables falls inside the domain D is
$$\Pr\left((X_1, \ldots, X_n) \in D\right) = \int_D f_{X_1,\ldots,X_n}(x_1, \ldots, x_n)\,dx_1 \cdots dx_n$$
For i = 1, 2, …, n, let $f_{X_i}(x_i)$ be the probability density function associated with variable $X_i$ alone. This probability density can be deduced from the joint probability density of the random variables $X_1, \ldots, X_n$ by integrating over all values of the n − 1 other variables:
$$f_{X_i}(x_i) = \int f_{X_1,\ldots,X_n}(x_1, \ldots, x_n)\,dx_1 \cdots dx_{i-1}\,dx_{i+1} \cdots dx_n$$
Independence
Continuous random variables $X_1, \ldots, X_n$ are all independent from each other if and only if
$$f_{X_1,\ldots,X_n}(x_1, \ldots, x_n) = f_{X_1}(x_1) \cdots f_{X_n}(x_n)$$
Corollary
If the joint probability density function of a vector of n random variables can be factored into a product of n functions of one variable
$$f_{X_1,\ldots,X_n}(x_1, \ldots, x_n) = f_1(x_1) \cdots f_n(x_n),$$
then the n variables in the set are all independent from each other, and the marginal probability density function of each of them is given by
$$f_{X_i}(x_i) = \frac{f_i(x_i)}{\int f_i(x)\,dx}$$
Example
This elementary example illustrates the above definition of multidimensional probability density functions in the simple case of a function of a set of two variables. Let us call $\vec{R}$ a 2-dimensional random vector of coordinates (X, Y): the probability to obtain $\vec{R}$ in the quarter plane of positive x and y is
$$\Pr(X > 0, Y > 0) = \int_0^\infty \int_0^\infty f_{X,Y}(x, y)\,dx\,dy$$
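The quarter-plane probability can be approximated with a two-dimensional midpoint rule. The sketch below is an illustration under assumptions not taken from the text: the joint density is that of two independent standard normal coordinates, and the unbounded quarter plane is truncated to the square [0, 8] × [0, 8]. By symmetry the result should be close to ¼.

```python
import math

def joint_pdf(x, y):
    """Assumed example: two independent standard normal coordinates."""
    return math.exp(-(x * x + y * y) / 2.0) / (2.0 * math.pi)

def integrate_2d(f, x_lo, x_hi, y_lo, y_hi, n=400):
    """Midpoint rule on an n-by-n grid over a rectangle."""
    hx = (x_hi - x_lo) / n
    hy = (y_hi - y_lo) / n
    total = 0.0
    for i in range(n):
        x = x_lo + (i + 0.5) * hx
        for j in range(n):
            y = y_lo + (j + 0.5) * hy
            total += f(x, y)
    return total * hx * hy

# The quarter plane is unbounded; cutting it off at 8 drops a negligible tail here.
p_quarter = integrate_2d(joint_pdf, 0.0, 8.0, 0.0, 8.0)
print(f"P(X > 0, Y > 0) is approximately {p_quarter:.4f}")
```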
Sums of independent random variables
The probability density function of the sum of two independent random variables U and V, each of which has a probability density function, is the convolution of their separate density functions:
$$f_{U+V}(x) = \int_{-\infty}^{\infty} f_U(y)\, f_V(x - y)\,dy$$
Dependent variables
If the probability density function of an independent random variable x is given as f(x), it is possible (but often not necessary; see below) to calculate the probability density function of some variable y which depends on x. This is also called a "change of variable" and is in practice used to generate a random variable of arbitrary shape "f" using a known (for instance uniform) random number generator. If the dependence is y = g(x) and the function g is monotonic, then the resulting density function is
$$\left|\frac{1}{g'(g^{-1}(y))}\right| \cdot f(g^{-1}(y))$$
Here $g^{-1}$ denotes the inverse function and g' denotes the derivative.
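The monotonic change-of-variable rule can be tried on a small case. The sketch below assumes X uniform on (0, 1) and g(x) = −ln(x), a choice made here purely for illustration; the rule then yields the density exp(−y) for y > 0, and pushing uniform draws through g produces samples with mean close to 1. All function names are hypothetical.

```python
import math
import random

def transformed_pdf(y, f_x, g_inverse, g_prime):
    """Density of Y = g(X) for monotonic g: f_x(g^-1(y)) / |g'(g^-1(y))|."""
    x = g_inverse(y)
    return f_x(x) / abs(g_prime(x))

def f_x(x):
    """Assumed input density: uniform on (0, 1)."""
    return 1.0 if 0.0 < x < 1.0 else 0.0

def g(x):
    """Assumed dependence y = g(x) = -ln(x), decreasing on (0, 1]."""
    return -math.log(x)

def g_inverse(y):
    return math.exp(-y)

def g_prime(x):
    return -1.0 / x

# The formula gives exp(-y) for y > 0, the exponential density with rate 1.
density_at_2 = transformed_pdf(2.0, f_x, g_inverse, g_prime)

# Sampling view: push uniform draws through g to obtain exponential draws.
rng = random.Random(1)  # fixed seed so the run is repeatable
draws = [g(1.0 - rng.random()) for _ in range(100_000)]  # 1 - u lies in (0, 1]
sample_mean = sum(draws) / len(draws)
print(density_at_2, sample_mean)
```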
For functions which are not monotonic, the probability density function for y is
$$\sum_{k=1}^{n(y)} \left|\frac{1}{g'(g_k^{-1}(y))}\right| \cdot f(g_k^{-1}(y))$$
where n(y) is the number of solutions in x for the equation g(x) = y, and $g_k^{-1}(y)$ are these solutions.
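It is tempting to think that in order to find the expected value E(g(X)) one must first find the probability density of g(X). However, rather than computing
$$\operatorname{E}(g(X)) = \int_{-\infty}^{\infty} y\, f_{g(X)}(y)\,dy,$$
one may find instead
$$\operatorname{E}(g(X)) = \int_{-\infty}^{\infty} g(x)\, f_X(x)\,dx.$$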
The values of the two integrals are the same in all cases in which both X and g(X) actually have probability density functions. It is not necessary that g be a one-to-one function. In some cases the latter integral is computed much more easily than the former.
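The agreement of the two integrals can be checked on a simple case. The sketch below assumes X uniform on [0, 1] and g(x) = x², so that Y = g(X) has density 1/(2√y) on (0, 1]; both choices and all function names are illustrative assumptions. Each route should give a value close to 1/3.

```python
import math

def integrate(f, a, b, n=100_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

def f_x(x):
    """Assumed density of X: uniform on [0, 1]."""
    return 1.0 if 0.0 <= x <= 1.0 else 0.0

def g(x):
    """Assumed transformation."""
    return x * x

def f_y(y):
    """Density of Y = X**2 on (0, 1], from the change-of-variable rule."""
    return 1.0 / (2.0 * math.sqrt(y))

# Route 1: integrate y against the density of Y = g(X).
via_density_of_y = integrate(lambda y: y * f_y(y), 0.0, 1.0)
# Route 2: integrate g(x) against the density of X, with no need for f_y.
via_density_of_x = integrate(lambda x: g(x) * f_x(x), 0.0, 1.0)
print(via_density_of_y, via_density_of_x)
```

The second route never requires deriving the density of Y, which is the practical advantage the text describes.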
Multiple variables
The above formulas can be generalized to variables (which we will again call y) depending on more than one other variable. $f(x_0, x_1, \ldots, x_{m-1})$ shall denote the probability density function of the variables y depends on, and the dependence shall be $y = g(x_0, x_1, \ldots, x_{m-1})$. Then, the resulting density function is
$$\int_{y = g(x_0, x_1, \ldots, x_{m-1})} \frac{f(x_0, x_1, \ldots, x_{m-1})}{\sqrt{\sum_{j=0}^{m-1} \left(\frac{\partial g}{\partial x_j}(x_0, x_1, \ldots, x_{m-1})\right)^2}}\,dV$$
where the integral is over the entire (m−1)-dimensional solution of the subscripted equation and the symbolic dV must be replaced by a parametrization of this solution for a particular calculation; the variables $x_0, x_1, \ldots, x_{m-1}$ are then functions of this parametrization.
Finding moments and variance
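In particular, the nth moment $\operatorname{E}(X^n)$ of the probability distribution of a random variable X is given by
$$\operatorname{E}(X^n) = \int_{-\infty}^{\infty} x^n f(x)\,dx$$
and the variance is
$$\operatorname{Var}(X) = \operatorname{E}\left((X - \operatorname{E}(X))^2\right) = \int_{-\infty}^{\infty} (x - \operatorname{E}(X))^2 f(x)\,dx$$
or, expanding, gives:
$$\operatorname{Var}(X) = \operatorname{E}(X^2) - (\operatorname{E}(X))^2.$$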
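Both forms of the variance can be compared numerically. The sketch below assumes the continuous uniform density on [0, 1], for which the variance is 1/12, and uses hypothetical helpers `integrate` and `moment`.

```python
def integrate(f, a, b, n=10_000):
    """Approximate the integral of f over [a, b] with the midpoint rule."""
    h = (b - a) / n
    return h * sum(f(a + (i + 0.5) * h) for i in range(n))

def uniform_pdf(x):
    """Assumed example density: uniform on [0, 1]."""
    return 1.0 if 0.0 <= x <= 1.0 else 0.0

def moment(pdf, n, a, b):
    """Approximate the nth moment E(X**n) for a density supported on [a, b]."""
    return integrate(lambda x: x ** n * pdf(x), a, b)

mean = moment(uniform_pdf, 1, 0.0, 1.0)
second = moment(uniform_pdf, 2, 0.0, 1.0)

# Variance from the definition and from the expanded form.
var_direct = integrate(lambda x: (x - mean) ** 2 * uniform_pdf(x), 0.0, 1.0)
var_expanded = second - mean ** 2
print(var_direct, var_expanded)
```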
See also
- likelihood function
- probability distribution
- probability mass function
- exponential family
- density estimation
- conditional probability density function
- probability vector
- secondary measure
This article is copied from an article on Wikipedia.org - the free encyclopedia created and edited by online user community. The text was not checked or edited by anyone on our staff. Although the vast majority of the wikipedia encyclopedia articles provide accurate and timely information please do not assume the accuracy of any particular article. This article is distributed under the terms of GNU Free Documentation License.