CDS540 Text Detection from Images Using OpenCV


CDS540 Assignment 1 - Text Detection from Images
Task objective
The objective of this assignment is to use Google Colab, Jupyter Notebook, or any other suitable tool to detect text from images using OpenCV. This will help you understand the concepts of fundamental Computer Vision algorithms and techniques, evaluate the efficiency and effectiveness of computer vision systems, and apply and implement these tools and techniques. Task description:
1.Setup Environment:
oUse Google Colab, Jupyter Notebook, or any other platform you are comfortable with.
oEnsure that OpenCV and other necessary libraries are installed.
2.Load an Image:
oLoad a sample image containing text. You can use images from various sources like scanned documents, street signs, or any image with clear text.
3.Pre-process the Image:
oConvert the image to grayscale.
oApply necessary image processing techniques like thresholding, blurring, or edge detection to enhance text visibility.
4.Text Detection:
oUse OpenCV's text detection methods such as the EAST text detector, or any other OpenCV-compatible methods.
oDraw bounding boxes around detected text areas.
5.Extract and Display Text:
oUse OCR (Optical Character Recognition) tools like Tesseract to extract text from the detected regions.
oDisplay the extracted text and the annotated image with bounding boxes.
6.Efficiency and Effectiveness Evaluation:
oMeasure the performance of your text detection system.
oDiscuss the efficiency (speed) and effectiveness (accuracy) of your approach.
oSuggest potential improvements.
Rubric
Criteria    Points
Understand the concepts of fundamental Computer Vision algorithms and techniques    10%
- Demonstrates a clear understanding of image pre-processing techniques    
- Explains the chosen text detection method and its working    
Evaluate the efficiency and effectiveness of computer vision systems    10%
- Measures and reports the speed of the text detection system    
- Evaluates the accuracy of detected text    
- Provides a discussion on the system's strengths and weaknesses    
Apply and implement the computer vision tools and techniques    20%
- Successfully loads and processes the image    
- Implements text detection and draws bounding boxes correctly    
- Uses OCR to extract and display text from images    
- Code is well-organized, commented, and follows best practices    
Total Points    40%

Deliverables
1.A Jupyter Notebook (or equivalent) with your code and in-line explanations.
2.Images used for testing.
3.A brief report (1000-2000 words) evaluating the performance of your text detection system.

Submission Details: 
1.You must ensure that all your project files used for this task and the report sit in a directory called “Assignment 1 – Your Name”. 
2.All files are required to be uploaded and a link to the “Assignment 1” directory submitted to Moodle. 
3.Please make sure that unit Instructor and TA have access to the folder. 
4.A link to the demo video of your app running must be submitted. 
5.It would be great if you could submit your GitHub link. 
6.This is an individual assignment, and you should submit it by 8 pm, Friday, Week 7.
Reference 
How to Record the Screen on Your Windows PC or Mac?
How to upload videos on YouTube
Sample video

 

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值