as its name, bayes optimal decision is the best decision over all decisions, however it is impossible to working out solutions because it requires to summing out all hypothese. you can't obtain ALL hypothese.
in <<Machine Learning>>, its definition is:
in <<Pattern Recognition>>:
in <<Foundations of Statistical Natural Language Processing>>:
"Suppose that we did not actually see the sequence of coin tosses but just heard the results shouted out over the fence. Now it may be the case, as we have assumed so far, that the results reported truly reflect the results of tossing a single, possibly weighted coin. This is the theory 'u' which is a family of models, with a parameter representing the weighting of the coin. But an alternative theory is that at each step someone is tossing two fair coins, and calling out "tails" if both of them come down tails, and heads otherwise."
this explanation is very vivid. hypothese or parameters is a theory(model) in problem space. following is some metaphors:
ideal categorizations about texts ------- methods(hypothese; theory; modal): SVM, KNN, Bayes... ------- a specified category
mathematical reasoning ------- programming languages ------- codes
thinkings ------- natural languages ------- text or speech
probability distribution about pattern recognition decision ------- kinds of distribution & parameters for them ------- a decision
M.A.P (P(x|a)P(a)) ------- M.L (P(x|a)) ------- result (x)
speech ------- HMM ------- meaning underlying natural language
a concept in world ------- operations of features ------- a pattern