Prior density is denoted by in this article
Introduction
Non-Informative Priors are the priors which we assume when we do not have any belief about the parameter let say . This leads noninformative priors to not favor any value of , which gives equal weights to every value that belongs to . for example let us we have three hypothesis , so the prior which attach weight of to each of the hypothesis is noninformative prior.
<!--more-->Note : most of the noninformative priors are improper.
An Example
Now let us assume a simple example let us assume our parameter space is a finite set containing n elements such as
Now the obvious weight given to each when we have not any prior beliefs is that gives us prior is proportional to a constant because is a constant let us say =c hence we can say
Now let us assume a transformation , that is . If is the density of then we can write density of as
Thus if we choose prior for as constant , then we have to assume prior for as proportional to to arrive at the same answer in both cases either we take or . Thus we cannot maintain consistency and assume both prior proportional to constant . This leads to the search of such noninformative priors which are invariant under transformations.
Noninformative Priors for Location Parameter
A Parameter is said to be location parameter if the density can be written as a function of
Let X is a random variable with location parameter then density can be written as . Just assume instead of observing X we observed Y = X+c and let us take then can see that the density of Y is given by . Now have same parameter and sample space which gives us the idea that they must have same noninformative prior
Let and are noninformative priors for respectively. So according to our argument both will have same noninformative priors , let us assume a subset of real line A
Now we have assumed so
which leads us to
It holds for any set A of real line , and any c on real line so it lead us to
Now if we take we get ,and we know it is true for all c , it leads us to the conclusion that the prior in the case of location parameter is constant functions , for simplicity most of the statistician assume it equal to 1 ,
Noninformative Priors for Scale Parameter
A Parameter is said to be location parameter if the density can be written as a where
For example in normal distribution we , is a scale parameter .
To get noninformative prior for Scale Parameter of a random variable X , instead of observing X we observe for any , let us define , so then the density of is given by .
Now similar to previous part here and have same sample and parameter space , so both will have same noninformative priors. Let and are noninformative priors for respectively. So according to our argument both will have same noninformative priors
Here A is a subset of Positive real line, i.e , now putting
so
Now taking , we get
Now this equation is true for any value so , for convenience taking , it gives us noninformative prior
Note : It is an improper prior ,
Flaw and introduction of relatively location invariant prior
Now we know noninformative prior for both Scale and Location parameter, but there is flaw . The prior we get for location and scale parameter in previous part are improper priors . If two random variables have identical form , then they have same non informative priors . but the problem here is due to improper priors , noninformative priors are not unique. lets say we have an improper prior g then if we multiply g by any constant k then the resultant gk will give same bayesian decisions as g.
Now in previous parts we have assumed two priors and , but we do not need that , we can get by just multiplying by a constant and vice-versa.
Now equation can be written as
Where is some positive function ,
It holds for all A , so , and taking give us , putting this value back will give us
Now there is a lot of prior other than , which satisfy equation (** ) , so any prior of this form will be know as relatively location invariant