Elec4622 Laboratory Project 2 T2 2024C/C++

Java Python Elec4622 Laboratory Project 2, T2 2024

July 5, 2024

1 Introduction

This is the second of three mini-projects to be demonstrated and assessed within the regular scheduled laboratory sessions. This project is due in Week 8. The project is worth a nominal 10 marks, out of the 30 marks available for the laboratory component of the course. However, some suggested activities can attract substantial bonus marks.

This project is concerned with implementing a Laplacian pyramid and using it to process images in thepyramid domain. Here is a brief summary of the relevant theory and concepts.

1.1 Creating the Gaussian Pyramid

Starting with the image zn] = zo [n] a stack of D reduced resolution images zi [n],...,zp [n] may be created by low-pass filtering and downsampling, as follows:

where h [n] is the PSF of a suitable low-pass filter. The most natural implementation strategy is the outputdriven inner-product approach (moving average window) where h[k] are the weights, 2n corresponds to the placement of the "window" and k runs over the support of this weighting window, h.

To minimise aliasing, we would like to have

We could use Gaussian filters, as in Project 1, but in this project we will use Windowed sincs. You will can use any smooth window function, although the Hanning window is recommended for simplicity. The only parameter of interest is the region of support of the separable windowed sinc h [n] = hi [n1]·hi [n2].

1.2 Creating the Laplacian Pyramid

At each level 0< d < D of the Gaussian pyramid, a Laplacian detail image can be formed according to

where g [n] is a low-pass interpolation filter and the second term on the right hand side of the above equation represents interpolation of the reduced resolution image a+1 [n] up to the resolution of za[n]. Note thar each sample d+1 [k] of the reduced resolution image acts as an excitation source for the interpolation filter, where the location of the excitation source (in the interpolated domain) is 2k. The above equation can be written equivalently as

The most natural implementation strategy for this operation is the input-driven one. Start by copying the contents of a to the memory buffer that will hold ya; then walk through each location k within a+1 scaling the elements of g by the value of d+1 and subtracting the resulting values from the appropriate locations within the memory buffer that holds yaAs above, the PSF g [n] may be a Gaussian or another suitable low-pass filter, whose bandwidth should ideally be confined to the box  For this project, we will use windowed sincs. It is your responsibility to choose suitable stretching factors for the sinc, and also to make sure that your upsampling and downsampling operations all have a DC gain of 1.

1.3 Inverting the Laplacian transform

Starting with yp=p and the detail images yo through yp-1, it is a simple matter to recover n = o [nl through a succession of interpolation and addition operations. That is,

1.4 Stitching in the Laplacian domain

In this project, we will explore the use of the Laplacian transform. for image stitching. To make things simple, we will create two source images 4[n] and B [n], to be stitched right down the middle. That is, direct image domain stitching should produce

where Ni is the image width and both images have identical dimensions.

To perform. the stitching in the Laplacian pyramid domain, you should first convert zA and xB into their Laplacian sub-images, 0,, and ,,, perform. the stitching separately in each level of the Laplacian pyramid, and then synthesise the stitched Laplacian sub-images y,,y back to a final stitched image x.

2 How to go about the tasks

For demonstration purposes, you should create a separate project for each task, within a single workspace. You can do this by using the "File→ New→ Project" option in Visual C++, replacing the "Create new solution" option with "Add to solution".

Be sure to arrange for all your executable programs to be placed in a single location (e.g., c: \elec4642\bin), which is referenced from the path environment variable. Also, please place all the images you are working with into a single directory (e.g., c: \elec4642\data). That way, it will be much easier to demonstrate your work in a time effective manner-it will also save you personally a lot of time. In any event, you might not be marked for your work unless it is organized in this way.

You are expected to produce hand-drawn sketches illustrating key features of your design for each task. This is not really any additional work, since it would normally be part of the process of coming up with a design and getting it working. As explained already in laboratories, being able to draw what a diagram of what your program is doing is an essential first step to implementing a solution and a key element in the debugging process. Do not discard these sketches and diagrams, since lab demonstrators will expect to see them when marking your work, and you are also required to upload them to Moodle as part of the electronic submission process.

To verify your work, it may be helpful to collect sequences of command-line statements together into a script. that you can run quickly. You can do this with a Windows batch file (i.e., any file with the ".bat" suffix), which can then be executed like any other program from the command-line. This will greatly facilitate the marking for demonstrators, who might insist upon it.

3 Tasks

Task 1: (4 marks) Write a program that accepts an input BMP image z n] and converts it to a Gaussian pyramid with D levels. Your program should accept D as a command-line argument, and it should write out all D +1 sub-images o,.., p as a single BMP image, with zo at the top, followed by 1 (each row padded with N1/2 zeroes on the right) and so on until zp appears at the bottom of the output image. The resulting output image should look like an upside down pyramid, with all sub-images lining up with the left hand border.

Your program should also accept an integer parameter H that represents the half-length of the windowed sinc h [n] that is separabl Elec4622 Laboratory Project 2, T2 2024C/C++ y extended to h[n] = hini]-h[n2] such that hi [n] = 0 for > H. That is, your 2D windowed sinc filters or interpolation kernels should all have region of support -H, H2. Your program should be able to operate successfully with value of H that are as small as O and as large as 20. when converting the results back to 8-bit unsigned integers and writing out your BMP image.

For simplicity, you are recommended to use floating point arithmetic internally within your program. This means that your sub-images r[n] would be stored as floats, but you will need to take careMake sure your program works for colour images.

Task 2: (4 marks) Modify the program from Task 1 in such a way that the output image holds the detail images yo,,VD-1, so that positive and negative values are readily visible as offsets relativeLaplacian pyramid sub-images y0,,yD. In this case, for display purposes only, add 128 to all the to the mid-level grey signal.

As in Task 1 you will need to be careful when converting floats to 8-bit integers for display, and you will also have to pay close attention to clipping the values to the limits of 0 to 255-why is this much more important for this task than for Task 1?

Make sure your application works for colour images, producing colour Laplacian sub-images.

Task 3: (2 marks) Write a program that accepts the Laplacian output image produced by Task 2 and reconstructs the original input image, writing it out as a BMP image.

You may find it convenient to supply the height of the original image on the command-line so that you don't have to figure this out from the height of the pyramid image.

You should verify that your program works by comparing the original input image with the one that you reconstruct using the Media Interface "image_arith" module - e.g.,

mi_pipe2 -inl orig.bmp -in2 recon.bmp :: image_arith -add 1-1

To explore discrepancies more readily, try doing some brightening of the output image as follows:

mi_pipe2 -in1 orig.bmp -in2 recon.bmp :: image_arith -add 1 -1 :: cc_interp -br 10

Make sure you can explain any discrepancies!

Task 4 (optional): (up to 1 bonus mark) Combine tasks 1, 2 and 3 into a single program that reads an input image, creates the Laplacian domain representation internally, then reconstructs the image and writes it out, doing all internal processing in floating point (or, if you like, fixed-point) arithmetic. How does your output image compare with the original input image in this case? To verify that you are doing this exercise correctly, the lab demonstrator may ask you to multiply the samples of one of the Laplacian detail images by a fixed factor prior to reconstruction.

Task 5 (optional): (up to 1 bonus mark) Write a program that accepts two BMP input images, zA and B, stitching them together in the image domain and writing the stitched image out as another BMP file.

Your program can assume that the images have identical dimensions and it is sufficient to place a vertical stitching boundary at a fixed location in the middle of the two images.

Task 6 (optional): (up to 2 bonus marks) Modify your program from task 4 so that it accepts two input images of the same size, stitches them in the Laplacian Pyramid domain, and then reconstructs and writes the output BMP image. To test the behaviour of your program, prepare ligher and darker versions of a single input image as z4 and zB and stitch them together. You can prepare such images by using the Media Interface "cc_interp" module with the "-br" argument, as illustrated above.

Task 7 (optional): (up to 1 bonus mark) Prepare an original image and a blurred version of the original image and stitch them together using the programs from Task 5 and Task 6, comparing the results visually.

To blur an image, you can use the Gaussian filtering program that you implemented for Task-1 of Project-1, but this task does not impose specific constraints on how you obtain the blurred version of an image.

4 Assessment

You should not rely upon implementing this project within the scheduled laboratory sessions. Instead, you must be prepared to demonstrate and explain your work in the Week 8 laboratory. You should makel sure each task can be run simply from the command-line. Spend some effort organizing your patterns andl test images ahead of time, perhaps using separate sub-directories. I suggest creating small batch files (files ending in ".bat" that contain commands you would normally type on the command line) that display the test image and output using "mi_viewer". Doing this will save you and the demonstrator a lot of time, both before and during the lab, since it is easy to compare different tasks using such batch files without having to remember all the conditions you used.

Note carefully: Lab demonstrators will expect to see the hand-drawn sketches you have produced as a critical part of the design process, showing how you have organized your image data, how you have computed dimensions, extended boundaries, and other such things.

4.1 Team work, plagiarism and copying

You may feel free to re-use code from the previous laboratory sessions, so long as you understand it. You may also discuss the project with other students in the class, but your programs should otherwise be your own original work — this is not a group project!

You are required to submit your code via an Assignment item on the course's Moodle page, by the end of the same day as your scheduled laboratory session, following the instructions provided there. Your code may be cross-checked for plagiarism, so make sure that you do not copy any other student's actual implementation, or base your solution on one that you obtain from another student.

4.2 Managing the limited resource of demonstrator time

During your labs, demonstrators will have a major responsibility of marking your project. This is time consuming, and so you cannot expect to be marked only in the last hour of a lab session. To maximize your opportunity to be marked you should come prepared to the lab session in Week 8 with many elements of your project completed or at least partially working, so that you can ask a demonstrator to mark or look over your solution as early as possible within the lab session         

评论
添加红包

请填写红包祝福语或标题

红包个数最小为10个

红包金额最低5元

当前余额3.43前往充值 >
需支付:10.00
成就一亿技术人!
领取后你会自动成为博主和红包主的粉丝 规则
hope_wisdom
发出的红包
实付
使用余额支付
点击重新获取
扫码支付
钱包余额 0

抵扣说明:

1.余额是钱包充值的虚拟货币,按照1:1的比例进行支付金额的抵扣。
2.余额无法直接购买下载,可以购买VIP、付费专栏及课程。

余额充值