The difference between CRF and MRF:
Just a follow-up on Eren's answer:
Let X denote a multi-dimensional input (i.e., the features), and let Y denote a multi-dimensional output (i.e., the structured label). Typically, we have the features x, and are interested in predicting the distribution of the label P(Y|X=x), or the best possible label argmax_y P(Y=y|X=x).
A CRF is essentially a structured extension of logistic regression, and models the CONDITIONAL probabilities P(Y|X). It does not model anything else.
A MRF models the JOINT probabilities of both Y and X simultaneously. In other words, it models P(Y,X