title: Bayes Theorem author: Keith A. Lewis institute: KALX, LLC classoption: fleqn fleqn: true abstract: Conditional probability ...
\newcommand{\given}{\mid}
Frequentists interpret probability as the number of times an outcome belongs to an event divided by the number of times it is sampled. This assumes it is possible to repeatedly sample identical situations. The law of large numbers says the frequency converges to the probablity of the event. This is fine when applied to flipping coins, rolling dice, or running other physical experiments where conditions don't change, but it limits the use of probability.
Bayesians interpret probability as a subjective degree of belief based on available information. As more information becomes available it can be used to update individual degrees of belief. They are delayed satisfaction frequentists who believe subjective degrees of belief will converge for all individuals given sufficient common information over repeated trials. The guilty secret of Bayesians is that they have nothing to say about choosing a "prior distribution" or what "loss function" should be used.
A probability space is a set
The conditional probability of
Bayesians like to belive that as more infomation becomes available subjective probabilites will converge to the same value. They haven't come up with a general theory ensuring this, but that seems to frequently occur.
A coin is fair if heads and tails occur with equal probability when flipped1. Suppose a coin might be fair or might have two heads, but we cannot examine the coin directly. The only information we will be given is the outcomes of a series of flips. If we ever see tails we know the coin is not double-headed but if every flip we see is heads then that provides evidence the coin is double-headed.
Suppose the first flip of the coin is heads,
Step one in probability theory is to specify the sample space of possible outcomes and the probability of events (subsets of the sample space). If the sample space is finite it is sufficient to specify the probability of each outcome.
In our case
the sample space has two elements
If you had trouble believing
If we want to model flipping the coin
This is an example of Bayesian reasoning showing subjective probabilities converge given sufficient information. It is important to note that this depends on the model. A different model could allow for the possibility the coin might also be two-tailed. Every model makes assumptions about what information is available so all probabilities are conditional.
If you had trouble believing
Probability theory is an extension of logic. Instead of statements that are either false or true it assigns a probability between 0 and 1 to events. From its sordid beginnings in of games of chance it achieved mathematical respectability when Kolmogorov axiomatized it in 1933. (cite?) Reputable mathematicians could now slot Probability Theory into the existing Measure Theory they were comfortable with as positive measures having mass 1.
Mathematics is following your nose and thinking rigourously, which immediately leads to difficulties most people, rightfully so, think are Much Ado About Nothing. For example, probability 1 does not correspond to true and must be replaced with the more subtle notion of almost surely.
Richard Threlkeld Cox put conditional probabilty on firmer philosophical
foundations by axiomatizing the notion of likelihood.
Staying true to the earliest foundations, he considered statements instead
of propositions. He denoted the likelihood of statement
If the shaman tells you he can make it rain tomorrow if you give him a basket of grain and you come back from a long day in the field to find him scurrying out of your hut and your wife with her hair mussed and won't look you in the eye, how likely is it that he will deliver on his promise given this information? If history is any guide, he will likely tell you he now needs two baskets of grain.
Cox assumed likelihood is a real number.
Real numbers are totally ordered so this is a big assumption.
He was also vague on exactly what statements constitute information.
His notation for the likelihood of
statement
Cox wanted to show
This was really an aside to his life's work. He was an experimental physicist who, among other results, demontrated a parity violation for double scattering of β rays from radium that could not be explained by existing theory. Eventually theory caught up and proved him correct.
Keynes A Treatise on Probability - probability is not a total order.
Is our expectation of rain, when we start out for a walk, always more likely than not, or less likely than not, or as likely as not? I am prepared to argue that on some occasions none of these alternatives hold, and that it will be an arbitrary matter to decide for or against the umbrella. If the barometer is high, but the clouds are black, it is not always rational that one should prevail over the other in our minds, or even that we should balance them, though it will be rational to allow caprice to determine us and to waste no time on the debate.
maxplus algebras
[ { "id": "http://zotero.org/users/6482136/items/H8KQMTYV", "type": "article-journal", "abstract": "By basing Bayesian probability theory on five axioms, we can give a trivial proof of Cox’s Theorem on the product rule and sum rule for conditional plausibility without assuming continuity or differentiablity of plausibility. Instead, we extend the notion of plausibility to apply to unknowns, giving them plausible values. Thus, we combine the best aspects of two approaches to Bayesian probability theory, namely the Cox-Jaynes theory and the de Finetti theory.", "container-title": "Bayesian Analysis", "DOI": "10.1214/09-BA422", "ISSN": "1936-0975", "issue": "3", "journalAbbreviation": "Bayesian Anal.", "language": "en", "source": "DOI.org (Crossref)", "title": "New axioms for rigorous Bayesian probability", "URL": "https://projecteuclid.org/journals/bayesian-analysis/volume-4/issue-3/New-axioms-for-rigorous-Bayesian-probability/10.1214/09-BA422.full", "volume": "4", "author": [ { "family": "Dupré", "given": "Maurice J." }, { "family": "Tipler", "given": "Frank J." } ], "issued": { "date-parts": [ [ "2009", 9, 1 ] ] } } ]
Footnotes
-
Persi Diaconis ↩