What could be causing the increase in file size after running PCA on the image?

Question

I am currently working on developing an image classification model to identify different species of deer in the United States. As part of this process, I am utilizing Principal Component Analysis (PCA) to reduce the memory size of the images and optimize the run time of the model.

However, I have encountered a puzzling issue where all the new PCA-compressed images generated by my Deer_PCA function are larger in file size compared to the original images. For instance, the original image was 128 KB, but the compressed version after running it with n_components = 150 now stands at 293 KB. Can anyone shed some light on why this unexpected outcome is happening?

Below is the image that was processed using the function; make sure to place the image in an empty folder before executing the code:

Here is the resulting compressed image obtained after applying the Deer_PCA function:

Displayed below is the code implementation:

# Required packages

import os
import sys

import cv2
from sklearn.decomposition import PCA

# Function to perform PCA on images within a specific folder and save them in another folder

def Deer_PCA(inpath, outpath,n_comp):
    for image_path in os.listdir(inpath):

        # Read input file
        input_path = os.path.join(inpath, image_path)
        print(input_path)
        
        # cv2.imread returns the channels in BGR order, which is also what cv2.imwrite expects
        w_deer = cv2.imread(input_path)

        # Split channels
        blue_2,green_2,red_2 = cv2.split(w_deer)

        # Scale channels
        w_blue = blue_2/255
        w_green = green_2/255
        w_red = red_2/255

        # Perform PCA on each channel
        pca_b2 = PCA(n_components=n_comp)
        pca_b2.fit(w_blue)            
        trans_pca_b2 = pca_b2.transform(w_blue)

        pca_g2 = PCA(n_components=n_comp)
        pca_g2.fit(w_green)
        trans_pca_g2 = pca_g2.transform(w_green)

        pca_r2 = PCA(n_components=n_comp)
        pca_r2.fit(w_red)
        trans_pca_r2 = pca_r2.transform(w_red)

        # Merge channels post-PCA
        b_arr2 = pca_b2.inverse_transform(trans_pca_b2)
        g_arr2 = pca_g2.inverse_transform(trans_pca_g2)
        r_arr2 = pca_r2.inverse_transform(trans_pca_r2)

        img_reduced2 = (cv2.merge((b_arr2, g_arr2, r_arr2)))
        
        print("Merge Successful")

        # Save output
        fullpath = os.path.join(outpath, 'PCA_'+image_path)
        cv2.imwrite(fullpath, img_reduced2*255)
        
        print("Successfully saved\n")
        

# Check image sizes 

original_image_path = '/Users/matthew_macwan/Downloads/CIS/I_Class_Deer/mule_deer_doe/mule deer doe_1.jpeg'

PCA_compressed_image_path = '/Users/matthew_macwan/Downloads/CIS/I_Class_Deer/mule_deer_doe/PCA_mule deer doe_1.jpeg'

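# os.path.getsize reports the file size on disk in bytes;
# sys.getsizeof would only measure the path string object
print('Original Image:',os.path.getsize(original_image_path))

print('PCA Image:',os.path.getsize(PCA_compressed_image_path))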

python image machine-learning image-processing pca

Answer 1

There seems to be a misconception here. Running PCA on a single image channel treats each row of the array as one observation (scikit-learn's PCA expects samples in rows and features in columns). The transform therefore reduces each channel to 150 columns, which does decrease the amount of data and may discard some of the information in the image.

However, you then reconstruct the image from the PCA with inverse_transform, which gives an array of the same size as the original, and save that array as a JPEG file. There are no fewer data points to store. The image may contain less information overall, but PCA does not discard information in the way that JPEG does, so the JPEG algorithm has no reason to encode the result in fewer bytes.

If the output JPEG file is larger than the input, the PCA reconstruction may have introduced detail that is harder for JPEG to encode, or the file may have been written with a higher quality setting than the original. Adjusting the JPEG quality setting is the most direct way to reduce the file size.

To actually use PCA for compression, you would have to store the PCA basis vectors together with the image projected onto those vectors, rather than the reconstructed image. Even then, this is unlikely to be an effective way to compress a single image.
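A rough count makes the trade-off concrete. The sketch below tallies how many numbers the PCA representation of one channel needs (projected rows, basis vectors, and the mean that PCA subtracts) and compares it with the raw pixel count. The 400 x 600 image size and the function name pca_value_count are hypothetical; only n_components = 150 comes from the question. The PCA values are floats, assumed here to be stored as 4-byte float32, whereas raw pixels are 1-byte uint8.

```python
def pca_value_count(height, width, n_components):
    """Numbers needed to store one channel as PCA projection + basis + mean."""
    projected = height * n_components   # one row of weights per image row
    basis = n_components * width        # the principal components
    mean = width                        # per-column mean subtracted by PCA
    return projected + basis + mean


height, width, k = 400, 600, 150        # hypothetical image size, k from the question

raw_values = height * width
pca_values = pca_value_count(height, width, k)

raw_bytes = raw_values * 1              # uint8 pixels
pca_bytes = pca_values * 4              # float32 values (assumption)

print(raw_values, pca_values)           # 240000 150600
print(raw_bytes, pca_bytes)             # 240000 602400
```

Under these assumptions the PCA representation holds fewer numbers but takes more bytes than the uncompressed channel, before JPEG is even considered.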

A PCA-based approach that fits the technique better is to take a large collection of images, flatten each one into a vector, stack those vectors as the rows of a matrix, and apply PCA to the whole dataset. Each image can then be represented as a linear combination of the basis vectors, so only one weight per basis vector needs to be stored per image. This illustrates how PCA works, but it is not guaranteed to compress well. For storage, stick to established image compression methods such as JPEG and JPEG 2000.
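The dataset-level approach can be sketched as follows. The random array stands in for a folder of equally sized grayscale images, and the sizes (20 images of 16 x 16 pixels, 5 components) are arbitrary values chosen for illustration, not recommendations.

```python
import numpy as np
from sklearn.decomposition import PCA

rng = np.random.default_rng(0)
n_images, height, width, n_comp = 20, 16, 16, 5

# Hypothetical stand-in for a collection of equally sized grayscale images.
images = rng.random((n_images, height, width))

# One row per image, one column per pixel.
X = images.reshape(n_images, height * width)

pca = PCA(n_components=n_comp)
weights = pca.fit_transform(X)                      # one weight per component per image
reconstructed = pca.inverse_transform(weights).reshape(images.shape)

print(weights.shape)          # per-image weights
print(pca.components_.shape)  # shared basis vectors
print(reconstructed.shape)    # approximations of the original images
```

Here the basis vectors in pca.components_ are shared by the whole collection, so each additional image costs only n_comp weights. The same weights could also serve as a compact feature vector for a classifier, which is a different use from shrinking files on disk.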

You also wrote that your goal is "to reduce the memory size of the images and optimize the run time of the model."

Note that the file size has no direct effect on the amount of work the model does. Once the image is decoded from the file into memory, it has a fixed number of pixels, and that is what the model must process. How many bytes the file occupied on disk no longer matters at that point. To make the model faster, reduce the number of pixels by subsampling the images. Make sure the features needed for recognition survive the resampling: reducing the pixel count too aggressively can leave the model unable to tell the species apart.
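As a small illustration of the difference between file size and in-memory size, the sketch below builds a decoded image array and subsamples it by a factor of two in each direction. The 480 x 640 size is hypothetical. Plain striding is used only to keep the example dependency-free; a resize that low-pass filters first, such as cv2.resize with interpolation=cv2.INTER_AREA, usually preserves features better.

```python
import numpy as np

# A decoded 8-bit colour image occupies height * width * 3 bytes in memory,
# regardless of how small its JPEG file was on disk.
image = np.zeros((480, 640, 3), dtype=np.uint8)     # hypothetical image size
print(image.shape, image.nbytes)                    # (480, 640, 3) 921600

# Keep every second row and column: a quarter of the pixels remain.
small = image[::2, ::2]
print(small.shape, small.nbytes)                    # (240, 320, 3) 230400
```

The model's workload scales with the pixel counts printed here, not with the size of the file the image was loaded from.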
