您现在的位置：程式師世界 >> 編程語言 > >> 更多編程語言 >> Python

降維算法實戰項目（2）—使用PCA對圖像降維（Python代碼+數據集）

編輯：Python

在這部分練習中，我們將學習人臉圖像上運行PCA，看看如何在實踐中使用它來減少維度。

老規矩，先放出數據集：

鏈接：https://pan.baidu.com/s/1R0oiqoWHV2iR8sc3YHkMoA
提取碼：6666

導入需要用到的包

from numpy import *
from scipy.io import loadmat
import matplotlib.pyplot as plt

導入數據

faces_data = loadmat('data/ex7faces.mat')
print(faces_data)
X=faces_data['X']
print(X.shape)

結果為：

{
'__header__': b'MATLAB 5.0 MAT-file, Platform: PCWIN64, Created on: Mon Nov 14 23:46:35 2011', '__version__': '1.0', '__globals__': [], 'X': array([[ -37.86631387, -45.86631387, -53.86631387, ..., -110.86631387,
-111.86631387, -99.86631387],
[ 8.13368613, -0.86631387, -8.86631387, ..., -34.86631387,
-8.86631387, 0.13368613],
[ -32.86631387, -34.86631387, -36.86631387, ..., -110.86631387,
-111.86631387, -111.86631387],
...,
[ -46.86631387, -24.86631387, -8.86631387, ..., 90.13368613,
80.13368613, 59.13368613],
[ 19.13368613, 16.13368613, 14.13368613, ..., -38.86631387,
-41.86631387, -46.86631387],
[-108.86631387, -106.86631387, -102.86631387, ..., 17.13368613,
17.13368613, 18.13368613]])}

(5000, 1024)

說明我們的數據集有5000個樣本，每個樣本有1024個特征。

可視化

我們可視化一下前100張人臉圖像：

def plot_100_image(X):
fig,ax=plt.subplots(nrows=10,ncols=10,figsize=(10,10))
for c in range(10):
for r in range(10):
ax[c,r].imshow(X[10*c+r].reshape(32,32).T,cmap='Greys_r')
ax[c,r].set_xticks([])
ax[c,r].set_yticks([])
plt.show()
plot_100_image(X)

結果如下圖所示：

接下來我們應用PCA算法的步驟與之前在二維數據集上的步驟一致：
1.去均值化

2.計算協方差矩陣

3.計算特征值和特征向量

我們不再細致講解，有需要的可以看我之前的博客：

https://blog.csdn.net/wzk4869/article/details/126074158?spm=1001.2014.3001.5502

直接放出對應的代碼：

def reduce_mean(X):
X_reduce_mean=X-X.mean(axis=0)
return X_reduce_mean
X_reduce_mean=reduce_mean(X)
def sigma_matrix(X_reduce_mean):
sigma=(X_reduce_mean.T @ X_reduce_mean)/X_reduce_mean.shape[0]
return sigma
sigma=sigma_matrix(X_reduce_mean)
def usv(sigma):
u,s,v=linalg.svd(sigma)
return u,s,v
u,s,v=usv(sigma)
print(u)
def project_data(X_reduce_mean, u, k):
u_reduced = u[:,:k]
z=dot(X_reduce_mean, u_reduced)
return z
z = project_data(X_reduce_mean, u, 100)

我們接下來還原數據，這裡選擇只保留100個特征：

def recover_data(z, u, k):
u_reduced = u[:,:k]
X_recover=dot(z, u_reduced.T)
return X_recover
X_recover=recover_data(z,u,100)

我們看一下最後降維後的圖像：

plot_100_image(X_recover)

我們對比兩張圖片，可以很明顯的看出，第二張圖片保留的特征較少，已經導致臉部有些模糊。

最後唠叨一句

如果不設置 cmap='Greys_r' 會很陰間：

最開始的100張人臉：

降維後的人臉：

上一篇文章： Example of using NumPy in python
下一篇文章： Python中的set_xticks和set_xticklabels用法（附代碼解釋）

Python

Linux系統下海康工業相機MVS二次開發-Python

文章目錄Linux系統下海康工業相機MVS二次開發-Pyth

pythonreturn實現匯率轉換器教程示例

目錄A.課程內容B.知識點C.用到的基本指令D.函數返回值E

Appium + Python Mobile Real play - vous apprend à localiser XPath pour automatiser les tests

Question daujourdhui - moi.：Le

Sublime Text3 code prompt for configuring Python

1. open Tools -- Compiling sy

Pandas（三）—— 變形、連接

Python模塊 —— PandasPandas（三）——

Python play data analysis: count excel and draw with Matplotlib

Python Play data analysis ： St

没有相关文章

熱門圖文

C++ 的 const和const_cast 基礎備忘：派生類直接訪問基類的私有成員談JSP與XML的交互在SWT中使用OLE操作Excel（一）：使Excel嵌入到SWT窗口中想用python編寫一個普查管理軟件，想咨詢一下思路和涉及的軟件。 HDOJ 4604 Deque Lua中的模塊與包問題一百零三：數列有序！

欄目導航