Select your font size 
 
about us products & services consulting & support news & events contact us
Discussion of the canonical formula for Bayes Inference.

Deriving Bayes' Theorem - SK

print this article 
 

You are now in a position to discuss the canonical formula for Bayes inference. The derivation of Bayes formula follows naturally from the definition of conditional probability using the probability format:

P(A | B) = P(A & B) / P(B)

Using some algebra, this equation can be rewritten as:

P(A & B) = P(A | B) P(B)

The same right-hand value can also be computed using A as the conditioning variable:

P(A & B) = P(B | A) P(A)

Given this equivalence, you can write:

P(A | B) P(B) = P(B | A) P(A)

Simplifying, you arrive at Bayes theorem:

P(A | B) = P(B | A) P(A) / P(B)

Notice that this formula for computing a conditional probability is similiar to the original formula with the exception that the joint probability P(A & B) that used to appear in the numerator has been replaced with the equivalent expression P(B | A) P(A).

Computing the full posterior

Bayesian inference is often put forth as a prescriptive framework for hypothesis testing. Using this framework, it is standard to replace P(A | B) with P(H | E) where H stands for hypothesis and E stands for evidence. Bayes inference rule then looks like this:

P(H | E) = P(E | H) P(H) / P(E)

In words, the formula says that the posterior probability of a hypothesis given the evidence P(H | E) is equal to the likelihood of the evidence given the hypothesis P(E | H) multiplied by the prior probability of the hypothesis P(H). You can ignore P(E) as only serving a normalization role (in other words, ensuring the sum of all the cell probabilities is 1.0). You can thus mentally simplify the equation to:

P(H | E) = P(E | H) P(H)

The prior distribution P(H) in this equation can be represented in PHP as an indexed array of probability values (as shown):

var $priors = array();

The $priors array is expected to contain a list of numbers denoting the prior probability of each hypothesis. In the context of medical diagnosis, the $priors array might contain the prevalence rates of each hypothesized disease in the population. Alternatively, the array might contain a medical specialist's best guess as to the prior probability of each disease under consideration given everything they know about each disease and current conditions.

The exact nature of the full posterior probability computation is made clearer by seeing that the posterior and likelihood terms appear in a PHP implementation as two-dimensional arrays (the closest you can currently get to a matrix datatype in PHP).

Listing 3. The posterior and likelihood terms appear in a PHP implementation as 2D arrays
<?php

// $m denotes the number of hypothesis
// $n denotes the number of evidence patterns

$m = 3;
$n = 4;

$priors      = getPriorDistribution();
$likelihoods = getlikelihoodDistribution();
$evidence    = getEvidenceDistribution();

for($e=0; $e < $n; $e++) {
  for ($h=0; $h < $m; $h++) {
    $posterior[$e][$h] = $priors[$h]
       * $likelihoods[$h][$e] / $evidence[$e];
  }
}

?>

For now, ignore the issue of how the $prior, $likelihood, and $evidence distribution values are computed from raw data. You can posit magical get functions to obtain these values. The previous code shows how the values of the posterior probability matrix are computed by looping over the evidence items and the hypothesis alternatives.

The order of the index elements $e and $h in the posterior matrix might be puzzling until you realize that in PHP the evidence key should appear first in the posterior matrix because it is a lookup key. If you access the posterior matrix using an evidence key $e, it will return an array containing the probability of each hypothesis under consideration (such as, +cancer, -cancer) given the particular evidence key you have supplied (like +test). The code above computes the full posterior distribution over all evidence keys. To compute a row of the full posterior distribution for a particular evidence key, you would use this formula:

Figure 2. Formula to compute posterior distribution
Formula to compute posterior distribution



Page:   1  2  3  4  5  6  7  8  9  10  11 Next Page: Medical diagnosis wizard

The content shown in this page was first published by IBM developerWorks and is reprinted with permission from Paul Meagher (www.datavore.com)


Most Recent Website and Regional Updates

 High Scalability - Large Systems Optimization
Transparen Corporation lends its expertise to clients experiencing rapid and sudden growth in traffic or server utilization, bottlenecks, systems instability, downtime during peak traffic, or which would like to plan to avoid such issues.

 
 Throughput (or Bandwidth) vs. Latency
This document uses the example of Bill Gates purchasing Google to explain the difference between bandwidth (or throughput) and latency.

 
 Avoidance of Magic - Informal Survey Results
Joe the IT Director phones up high-traffic websites to ask them if they used magic.

 
 Don't go take a walk in Saskatoon!
There's been a rash of pedestrians hit by autos in Saskatoon this year.  Rapid economic and population growth has resulted in an overloaded infrastructure.  Attempts to upgrade this infrastructure have closed down...read more

 
 Self checkout arrives at the grocery store!
Last night I went to my preferred grocery store for my shopping. On the way in I noticed a new thing.  At first I thought they'd replaced all the checkouts, then I saw it was just a few checkouts were brand new.  I...read more

 
 Canadian Fireball Remains Found
"University of Calgary researchers reported last Friday that they have found the remains of the meteor that streaked through Canadian skies in November The remains of the 10-ton meteor were densely strewn over a...read more

 
 Scientists Find Canadian Meteorite
Planetary scientist Alan Hildebrand and graduate student Ellen Milley from the University of Calgary discovered fragments of the ten ton meteorite that fell near the Alberta-Saskatchewan border on the evening of...read more

 
 UPDATED: Liberal Video Attacks Conservative Deficit, Canada Perilously Close to Budget Deficit, No Stimulus, No End in Sight
Update: " The Conservatives pledged during the election campaign never to run a deficit, but Flaherty said the sudden decline in the global and domestic economies has brought Ottawa perilously close to a budget...read more

 
 Meteorite Hunter Offers $10,000 Reward for Canadian Meteorite
A bright meteor streaked across the skies of Saskatchewan and Alberta, Canada on November 20, 2008 at approximately 5:26 PM MST, prompting telephone calls to police stations, NORAD, and news stations from North...read more

 
 Massive Fireball Falls Over Alberta and Saskatchewan, Canada
A bright meteor streaked across the skies of Saskatchewan and Alberta, Canada on November 20, 2008 at approximately 5:26 PM MST, prompting telephone calls to police stations, NORAD, and news...read more

 
 03/12/2008: Crime in Canada
The story of a long-standing family feud and what it says about the future of violent crime in Canada.

 
 02/12/2008: The Constitution and a Coalition Government
What does the Constitution say about coalitions? Today on the podcast, we'll conjure up the ghost of Mackenzie King to find the answers.

 
 01/12/2008: The Big Three & the Future of the Auto Industry
One way or another, the Big Three automakers will have a huge impact on Windsor's future. But the future of those companies is being decided by forces well beyond this city's borders.

 
 28/11/2008: Greenpeace and the DRC
For more than a decade, the Democratic Republic of Congo has seen one humanitarian disaster after another. But there's an environmental catastrophe as well. And Greenpeace thinks it deserves our attention too.

 
 27/11/2008: The Agony of Stephen Harper
Stephen Harper got into politics to make government smaller. Now he's facing an economic crisis that seems to cry out for big time public intervention.

 

Google
 
Web transparen.com

Contact Information

Related Information

 
  Saskatoon
Regina
Prince Albert
Moose Jaw
Yorkton
Swift Current
North Battleford
Estevan
Weyburn
Corman Park
 
 
E C M | © 2003-2007 Transparen Corp.      

Standardized Services: Data Recovery Service / Creative Services / Premium Web Hosting Services / System Administration Tech Support Services
Recent Projects: Full-Service Mortgage and Financing Company / System to manage flights from Vancouver to Tofino / Photo exchange verification service
Our Vancouver BC Server Proudly Hosts: automated parking and revenue control systems, leafside lane at southlands, cost effective alternative power sources, Higher Grade Learning Centres, pacific forage bag supply, sunburst medical, neosonic design, roger mahler photography - passionate, intriguing, desirable, the connection between east and west, affordable flights to victoria and tofino, low interest mortgage brokers in vancouver, richmond, surrey, toronto, Toronto Calgary and Vancouver IT staffing and talent search
Saskatoon, Regina, Prince Albert, Moose Jaw, Yorkton, Swift Current, North Battleford, Estevan, Weyburn, Corman Park