Introduction to Machine Learning 课后习题 Solutions to Exercise by Ethem Alpaydn 部分翻译
第一章 绪论
-
Imagine you have two possibilities: You can fax a document, that is,send the image, or you can use an optical character reader (OCR) and send the text file. Discuss the advantage and disadvantages of the two approaches in a comparative manner. When would one be preferable over the other?
The text file typically is shorter than the image file but a faxed document can also contain diagrams, pictures, etc. After using an OCR, we lose properties such as font, size, etc (unless we also recognize and transmit such information) or the personal touch if it is handwritten text. OCR may not be perfect, and for ambigious cases, OCR should identify those image blocks and transmit them as they are. A fax machine is cheaper and easier to find than a computer with scanner and OCR software.OCR is good if we have high volume, good quality documents; for documents of few pages with