CS 180 Project 2

Introduction

This project covers several frequency-based techniques for image manipulation and processing. These methods include:

  • Enhancing image sharpness by amplifying high-frequency components
  • Detecting edges through the use of finite difference kernels
  • Generating hybrid images by combining high-frequency elements from one image with low-frequency elements from another
  • Blending multiple images across various frequency levels using Gaussian and Laplacian stacks

These approaches showcase the range of interesting image processing effects that frequency analysis makes possible.

Finite Difference Operator

Method

Two finite difference kernels were implemented as NumPy arrays to compute partial derivatives:

  • dx = np.array([[1, -1]]) for horizontal changes
  • dy = np.array([[1], [-1]]) for vertical changes

These kernels were applied to the original image using scipy.signal.convolve2d with the parameter mode='same', resulting in two images representing partial derivatives in the x and y directions.

To create a single edge image, the gradient magnitude was calculated at each pixel from the two partial-derivative images (call them im_dx and im_dy):

np.sqrt(im_dx ** 2 + im_dy ** 2)

This computes the L2 norm of the gradient vector formed by the corresponding pixel values of the two partial-derivative images; thresholding the magnitudes then yields the binarized edge image shown below.
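A minimal sketch of this pipeline, assuming a grayscale float image im scaled to [0, 1] (the default threshold is illustrative, not the exact value used for the results here):

import numpy as np
from scipy.signal import convolve2d

def finite_difference_edges(im, threshold=0.1):
    """Partial derivatives, gradient magnitude, and a binarized edge map of `im`."""
    dx = np.array([[1, -1]])                     # horizontal finite difference
    dy = np.array([[1], [-1]])                   # vertical finite difference
    im_dx = convolve2d(im, dx, mode='same')      # d/dx image
    im_dy = convolve2d(im, dy, mode='same')      # d/dy image
    grad_mag = np.sqrt(im_dx ** 2 + im_dy ** 2)  # L2 norm of the gradient
    edges = (grad_mag > threshold).astype(float) # binarize to suppress noise
    return im_dx, im_dy, grad_mag, edges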

Outputs

cameraman_dx

Cameraman (dx)

cameraman_dy

Cameraman (dy)

cameraman_grad_bin

Cameraman (Gradient, binarized)

Derivative of Gaussian Filter

Method

I took a different approach to edge detection by pre-processing the Gaussian kernel itself:

  1. First, I convolved the 2D Gaussian kernel with the dx and dy finite difference kernels. This produced two new kernels: gaussian_dx and gaussian_dy.
  2. These new kernels effectively represent the partial derivatives of the Gaussian kernel with respect to x and y.
  3. Next, I applied these modified kernels to the original image through convolution. This step generated two partial derivative images.
  4. Finally, I combined these partial derivative images into a single edge image by calculating the magnitude of the gradient at each pixel.

This technique folds the Gaussian smoothing and differentiation steps into a single convolution per direction, potentially offering a more efficient edge detection process. I also experimented with different values for the kernel size and sigma; the differences are visualized in the outputs below.
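A sketch of this derivative-of-Gaussian approach; the gaussian_2d helper and the default threshold are my own illustrative choices, not necessarily the exact construction used for the results:

import numpy as np
from scipy.signal import convolve2d

def gaussian_2d(ksize, sigma):
    """Normalized 2D Gaussian kernel, built as an outer product of 1D Gaussians."""
    ax = np.arange(ksize) - (ksize - 1) / 2.0
    g = np.exp(-(ax ** 2) / (2 * sigma ** 2))
    g /= g.sum()
    return np.outer(g, g)

def dog_edges(im, ksize=12, sigma=2, threshold=0.1):
    """Derivative-of-Gaussian edges: differentiate the kernel, then convolve once per axis."""
    G = gaussian_2d(ksize, sigma)
    gaussian_dx = convolve2d(G, np.array([[1, -1]]))    # d/dx of the Gaussian kernel
    gaussian_dy = convolve2d(G, np.array([[1], [-1]]))  # d/dy of the Gaussian kernel
    im_dx = convolve2d(im, gaussian_dx, mode='same')
    im_dy = convolve2d(im, gaussian_dy, mode='same')
    grad_mag = np.sqrt(im_dx ** 2 + im_dy ** 2)
    return (grad_mag > threshold).astype(float)         # binarized edge image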

Outputs

cameraman_dx

Cameraman (dx_gaussian)

cameraman_dy

Cameraman (dy_gaussian)

cameraman_grad_bin

Cameraman (ksize = 6, sigma = 1)

cameraman_grad_bin

Cameraman (ksize = 12, sigma = 2)

As expected, the finite difference operator and the derivative of Gaussian filter produced quite similar results. The key difference was that the Gaussian smoothing suppressed noise artifacts and improved the edge detection. For concrete differences, observe the "sky" in both pictures: without smoothing, there are some spurious white spots. Also, the edge along the back of the cameraman is captured in greater detail with smoothing.

Image Sharpening

Method

Here's how I sharpened an image:

  1. Blur the original image by convolving it with a Gaussian kernel.
  2. Extract high-frequency components: edges = img - img_blur
  3. Enhance the image: sharpened = img + alpha * edges

Here, alpha is a constant that controls the sharpening intensity; I experimented with its value until the result looked aesthetically pleasing.
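A sketch of this unsharp masking procedure; scipy.ndimage.gaussian_filter stands in for the explicit Gaussian-kernel convolution, and the image is assumed to be an H x W x C float array in [0, 1]:

import numpy as np
from scipy.ndimage import gaussian_filter

def sharpen(img, sigma=2, alpha=1.0):
    """Unsharp masking: boost the high frequencies of a color image in [0, 1]."""
    img_blur = gaussian_filter(img, sigma=(sigma, sigma, 0))  # blur spatial axes only
    edges = img - img_blur                                    # high-frequency detail
    sharpened = img + alpha * edges                           # add scaled detail back
    return np.clip(sharpened, 0, 1)

Equivalently, the three steps can be folded into a single unsharp mask filter, (1 + alpha) * e - alpha * G, where e is the unit impulse and G is the Gaussian kernel.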

Sharpening

For the first set of pictures, I set ksize=10 and sigma=2 for the blur and used alpha=1. This was done step-by-step; applying the equivalent single unsharp mask filter produced similar results.

taj_sharpen

Sharpening Taj step-by-step

Custom Images

I found this segment particularly cool and picked two distinct pictures. The first is of me on a sailboat, and the second is a picture of Cambridge, UK, that I took this year.

sail_sharpen

Sharpening myself on a sailboat step-by-step

cambridge_sharpen

Sharpening a picture of Cambridge, UK step-by-step

Resharpening

taj_resharpen

Sharpening blurred Taj with unsharp mask filter

sail_resharpen

Sharpening blurred sailing picture with unsharp mask filter

cambridge_resharpen

Sharpening blurred picture of Cambridge, UK with unsharp mask filter

When working with the original image, the effects of sharpening were prominent. As the extracted detail images show, emphasis is added to the individual bricks of the Taj Mahal as well as the lined bushes near it. If we blur the original image first, we lose information, so we cannot expect to reconstruct details we never had access to. Still, the filter does a good job of highlighting the prominent edges.

Hybrid

Method

The approach here was to blur im1 with sigma=sigma1 to obtain a low-pass filtered version of it. Then, I constructed a high-pass version of im2 (its Laplacian) by subtracting a copy blurred with sigma=sigma2 from the original. Summing the low-pass and high-pass images gives the hybrid.
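A minimal sketch of this construction, assuming both images are already aligned, grayscale, and scaled to [0, 1], with scipy.ndimage.gaussian_filter standing in for an explicit Gaussian-kernel convolution:

import numpy as np
from scipy.ndimage import gaussian_filter

def hybrid(im1, im2, sigma1, sigma2):
    """Hybrid image: low frequencies of im1 plus high frequencies of im2."""
    low = gaussian_filter(im1, sigma=sigma1)           # low-pass im1
    high = im2 - gaussian_filter(im2, sigma=sigma2)    # high-pass (Laplacian) of im2
    return np.clip(low + high, 0, 1)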

Outputs

Cats are great. Both Nutmeg and a random cat from the internet formed a pretty nice hybrid image with my LinkedIn picture.

nutmeg_yash

Me x Nutmeg

hybrid_cat_yash

Me x Random Cat

I also wanted to experiment with pictures taken moments apart to see if I could communicate motion. The swing up by Big C provided a perfect opportunity. After a bit of alignment and tuning of sigma1 and sigma2, the results were:

yash_swing1

Swing Image 1

yash_swing2

Swing Image 2

hybrid_yash_swing

Hybrid Swing

Failure

When combining the flowers below, I had higher hopes. I thought the edges would be captured well in the high-pass filter, and the intricacy could provide the perception of two different flowers.

pink_flower

Flower Image 1

orange_flower

Flower Image 2

hybrid_flower

Hybrid Flower

Fourier Analysis

I chose to visualize the Fourier transforms of the swing pictures below:
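A log-magnitude visualization of the centered 2D FFT along these lines produces spectra like the ones shown (the hybrid spectrum uses its own normalization, as noted in its caption):

import numpy as np

def log_magnitude_spectrum(im_gray):
    """Log-magnitude of the centered 2D FFT of a grayscale image, for display."""
    return np.log(np.abs(np.fft.fftshift(np.fft.fft2(im_gray))) + 1e-8)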

im1_fft

Swing1 FFT (Original + High Filter)

im2_fft

Swing2 FFT (Original + Low Filter)

hybrid_fft

Hybrid Fourier (creatively normalized)

Art?

As a quick bonus, I attempted to combine these two pictures from within and outside a water tower. It didn't work as expected, but with some normalization and blurring, it felt Picasso-esque.

hole_in

Hole (Inside)

hole_out

Hole (Outside)

hybrid_hole_norm

Hybrid Hole (normalized creatively)

Gaussian and Laplacian Stacks

Method

Here, I built up the Gaussian and Laplacian stacks with the following procedure:

  1. Initialized the gaussian list with the input img.
  2. Looped through the depth range, performing the following steps:
    • Blurred the previous gaussian image to get the current gaussian image.
    • Calculated the laplace image as the difference between the previous and current gaussian images.
    • Normalized the laplace image and stored it in the norm_laplace list.
  3. Appended the final gaussian image to the laplace list.
  4. Returned the gaussian, laplace, and norm_laplace lists.

I did not include the bottom-most blurred Gaussian layer in the normalized Laplacian stack when displaying it, though that layer is needed to reconstruct the original image from the Laplacian stack.
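A sketch of this procedure, using scipy.ndimage.gaussian_filter in place of an explicit Gaussian-kernel convolution and assuming H x W x C float images; the name generate_stacks matches the function referenced in the blending section:

import numpy as np
from scipy.ndimage import gaussian_filter

def generate_stacks(img, depth=5, sigma=4):
    """Gaussian and Laplacian stacks of a color float image (no downsampling between levels)."""
    gaussian = [img]
    laplace, norm_laplace = [], []
    for _ in range(depth):
        # Blur the previous Gaussian level (spatial axes only) to get the current one.
        gaussian.append(gaussian_filter(gaussian[-1], sigma=(sigma, sigma, 0)))
        lap = gaussian[-2] - gaussian[-1]     # band-pass (Laplacian) layer
        laplace.append(lap)
        norm_laplace.append((lap - lap.min()) / (lap.max() - lap.min() + 1e-8))
    laplace.append(gaussian[-1])              # final low-pass layer closes the stack
    return gaussian, laplace, norm_laplace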

Outputs

cameraman_dx

Laplace Stack of Apple

cameraman_dx

Laplace Stack of Orange

Multiscale Blending

Method

I defined my blend function such that the inputs were two image filenames, a 2D mask, and several optional parameters to control the size, blurring, and the output format. Rather than deal with alignment or cropping, I worked with all square images and performed resizing before blending.

Big picture, my function did the following:

  1. Load and process the two input images to resize them to a common size and convert them to the appropriate data format.
  2. Generate Gaussian, Laplace, and normalized Laplace image stacks for both input images using the generate_stacks function.
  3. Resize the 2D mask to the same size as the input images and generate a Gaussian pyramid of the mask.
  4. Blend the Laplace image stacks of the two input images using the Gaussian mask pyramid, creating a combined Laplace stack.
  5. Collapse the combined Laplace stack into a single output image by summing the Laplace images and clipping the result to the valid pixel value range.
  6. Optionally, return the output image and the normalized combined Laplace stack.

The key idea behind this function is to decompose the two input images into Gaussian and Laplacian stacks, and then use a mask, blurred at each level, to selectively combine the Laplacian components of the two images. This allows for a more seamless and natural-looking blend, especially around the edges and boundaries defined by the mask.
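A simplified sketch of the blending core, reusing the generate_stacks sketch above and skipping the loading and resizing steps; the mask is assumed to be a 2D float array in [0, 1] that selects the first image where it equals 1:

import numpy as np
from scipy.ndimage import gaussian_filter

def blend(im1, im2, mask, depth=5, sigma=4):
    """Multiresolution blend of two same-size H x W x C float images in [0, 1]."""
    _, lap1, _ = generate_stacks(im1, depth=depth, sigma=sigma)
    _, lap2, _ = generate_stacks(im2, depth=depth, sigma=sigma)

    # Gaussian stack of the mask: progressively blurrier masks soften the seam
    # more at the coarser (lower-frequency) levels.
    mask_stack = [mask]
    for _ in range(depth):
        mask_stack.append(gaussian_filter(mask_stack[-1], sigma=sigma))

    # Blend each Laplacian level with its mask level, then collapse by summing.
    out = np.zeros_like(im1)
    for l1, l2, m in zip(lap1, lap2, mask_stack):
        out += m[..., None] * l1 + (1 - m[..., None]) * l2
    return np.clip(out, 0, 1)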

Outputs

I had the most fun here as I tried out a variety of combinations and masks.

Food Combos

First up, I have a pineapple bun (courtesy of Sheng Kee), and I have pizza. I present pineapple bun pizza. While I could have reused the same half mask from earlier, I found that a custom semi-circle mask worked better.

pineapple_bun_pizza

Original images

bun_mask

Custom Bun Mask

high_res_pineapple_bun_pizza

High-resolution pineapple bun pizza

Then, I tried combining different flavors of ice cream with that filter. I let it run for longer (15-20 minutes) with higher resolutions instead of downsampling, and as one would expect, that made the blend sharper.

choc_icecream

Chocolate Ice Cream

blue_icecream

Blue Ice Cream

choc_blue_icecream

Chocolate and Blue Ice Cream Combo

For fun, I tried a pineapple bun ice cream combo. Here, I realized that leaving the kernel size the same meant that the blurring was not as effective with a significantly higher resolution, making the seam more apparent.

choc_bun

Chocolate Pineapple Bun Combo

Landscapes

Finally, I experimented with more complicated masks after many struggles with Photoshop. I decided on a photo of me at the top of Mission Peak and tried to blend it into various backgrounds: Tahoe, a Spiderverse-inspired wallpaper of NYC, pictures from a Switzerland trip a decade ago, and one of London. The original images are below:

jump_and_mask

Jump and Mask

locations

Original Backgrounds: Switzerland, London, Tahoe, NYC

As one would guess, when the lighting conditions are similar across the blends, it is more seamless.

jump_switzerland

Jump Switzerland Blend

jump_london

Jump London Blend

jump_tahoe

Jump Tahoe Blend

jump_nyc

Jump NYC Blend