
How to implement exact string search with GPGPU OpenCL?


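　　1. Acceleration methods

　　(1) Keep small amounts of constant data, such as the pattern length and the text length, in each thread's (work-item's) private memory.

　　(2) Store the pattern in the GPU's local memory so that threads can read it quickly.

　　(3) Keep the text to be searched in global memory, and let as many threads as possible access global memory at once, which lowers the average memory access time per thread.

　　(4) The threads of each work-group process one segment of the text, and multiple work-groups process a large text in parallel.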
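　　To make item (4) concrete, here is a minimal CPU-side sketch in plain C of how start positions could be split across work-groups and work-items. It is an illustration, not part of the original kernel: the names chunk_size, count_visits, num_groups and local_size are assumptions, and the chunk plays the role that maxSearchLength plays in the kernel in section 3. Here positions stands for the number of valid start positions, textLength - patternLength + 1.

```c
#include <assert.h>
#include <stddef.h>

/* Ceiling division: smallest chunk so that num_groups chunks cover `positions`. */
static unsigned chunk_size(unsigned positions, unsigned num_groups)
{
    assert(num_groups > 0);
    return (positions + num_groups - 1) / num_groups;
}

/*
 * Emulate on the CPU how work-items would walk the text.
 * Work-group g owns [g*chunk, min((g+1)*chunk, positions)), and work-item
 * `lid` inside it visits begin+lid, begin+lid+local_size, and so on.
 * visits[p] is incremented once for every visit to start position p.
 */
static void count_visits(unsigned positions, unsigned num_groups,
                         unsigned local_size, unsigned *visits)
{
    unsigned chunk = chunk_size(positions, num_groups);
    for (unsigned g = 0; g < num_groups; ++g) {
        unsigned begin = g * chunk;
        unsigned end = begin + chunk;
        if (begin >= positions)
            continue;               /* this group has nothing to search */
        if (end > positions)
            end = positions;        /* clamp the last group's range */
        for (unsigned lid = 0; lid < local_size; ++lid)
            for (unsigned p = begin + lid; p < end; p += local_size)
                visits[p]++;
    }
}
```

　　If every entry of visits ends up equal to 1, each start position is examined by exactly one work-item, which is the property the partitioning is meant to guarantee.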

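　　2. Synchronization

　　(1) Within a work-group, call barrier() with CLK_LOCAL_MEM_FENCE and/or CLK_GLOBAL_MEM_FENCE.

　　(2) Across work-groups, use atomic operations on a __global int so that every thread writes its result to the correct position in global memory. The operations a device supports can be found by querying its extensions. As the figure below shows, the kernel can use atomic operations and printf:

　　[Figure: list of extensions reported by the device]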
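　　One way to act on such a query from host code is sketched below in plain C. It assumes the space-separated extension string has already been fetched with clGetDeviceInfo and CL_DEVICE_EXTENSIONS; has_extension is a hypothetical helper name, not an OpenCL function. Names one might look for include cl_khr_global_int32_base_atomics and, on AMD platforms, cl_amd_printf, but which names a given device reports depends on the device and driver.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/*
 * Return 1 if `name` appears as a whole space-separated token in
 * `extensions`, otherwise 0. A bare strstr() is not enough, because one
 * extension name can be a prefix of another.
 */
static int has_extension(const char *extensions, const char *name)
{
    assert(extensions != NULL && name != NULL);
    size_t n = strlen(name);
    if (n == 0)
        return 0;
    const char *p = extensions;
    while ((p = strstr(p, name)) != NULL) {
        int starts_token = (p == extensions) || (p[-1] == ' ');
        int ends_token = (p[n] == '\0') || (p[n] == ' ');
        if (starts_token && ends_token)
            return 1;
        p += n;                     /* keep scanning after this partial hit */
    }
    return 0;
}
```

　　A host program could call this once per required feature and fall back to a non-atomic design, or skip the debug printf, when the check fails.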

　　3. Code example: exact pattern search in a large text

　　3.1 Kernel (string_search_kernel.cl):

    // Returns 1 if the first `length` bytes of text and pattern are equal, otherwise 0.
    int compare(__global const uchar* text, __local const uchar* pattern, uint length){
        for(uint l=0; l<length; ++l){
            if (text[l] != pattern[l])
                return 0;
        }
        return 1;
    }

    __kernel void
    StringSearch (
        __global uchar* text,            // Input text
        const uint textLength,           // Length of the text
        __global const uchar* pattern,   // Pattern string
        const uint patternLength,        // Pattern length
        const uint maxSearchLength,      // Maximum search positions for each work-group
        __global int* resultCount,       // Result count (global); must be zero before launch
        __global int* resultBuffer,      // Match positions; needs room for every match
        __local uchar* localPattern)     // Local buffer for the search pattern
    {
        int localIdx = get_local_id(0);
        int localSize = get_local_size(0);
        int groupIdx = get_group_id(0);

        // One past the last valid start position; assumes patternLength <= textLength.
        uint lastSearchIdx = textLength - patternLength + 1;

        // Range of start positions owned by this work-group.
        uint beginSearchIdx = groupIdx * maxSearchLength;
        uint endSearchIdx = beginSearchIdx + maxSearchLength;
        if(beginSearchIdx > lastSearchIdx)
            return;
        if(endSearchIdx > lastSearchIdx)
            endSearchIdx = lastSearchIdx;

        // Cooperatively copy the pattern into local memory.
        for(int idx = localIdx; idx < patternLength; idx+=localSize)
            localPattern[idx] = pattern[idx];
        barrier(CLK_LOCAL_MEM_FENCE);

        // Each work-item tests every localSize-th position in the group's range.
        for(uint stringPos=beginSearchIdx+localIdx; stringPos<endSearchIdx; stringPos+=localSize)
            if (compare(text+stringPos, localPattern, patternLength) == 1){
                // atomic_inc returns the old value, which serves as a unique slot index.
                int count = atomic_inc(resultCount);
                resultBuffer[count] = stringPos;
                //printf("%d ",stringPos);
            }
        barrier(CLK_LOCAL_MEM_FENCE);
    }
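　　Because atomic_inc hands out slots in whatever order work-items reach it, the match positions in resultBuffer come back in no particular order. A simple way to check the kernel's output is to compare it with a serial search on the host after sorting. The following is a minimal sketch in plain C under that assumption; reference_search, results_match and cmp_int are hypothetical helper names, and the code that reads the buffers back from the device is left out.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Comparison callback for qsort(): ascending order of int values. */
static int cmp_int(const void *a, const void *b)
{
    int x = *(const int *)a, y = *(const int *)b;
    return (x > y) - (x < y);
}

/*
 * Serial reference search. Writes each start offset where `pattern` occurs
 * in `text` to `out` (at most `cap` entries) and returns the total number
 * of matches, mirroring resultBuffer and *resultCount.
 */
static int reference_search(const unsigned char *text, size_t text_len,
                            const unsigned char *pattern, size_t pat_len,
                            int *out, size_t cap)
{
    assert(text != NULL && pattern != NULL && out != NULL);
    int count = 0;
    if (pat_len == 0 || pat_len > text_len)
        return 0;
    for (size_t pos = 0; pos + pat_len <= text_len; ++pos) {
        if (memcmp(text + pos, pattern, pat_len) == 0) {
            if ((size_t)count < cap)
                out[count] = (int)pos;
            ++count;
        }
    }
    return count;
}

/*
 * Compare device output with the reference. The device buffer is sorted in
 * place first, since its order depends on work-item scheduling.
 */
static int results_match(int *gpu, int gpu_count, const int *ref, int ref_count)
{
    if (gpu_count != ref_count)
        return 0;
    qsort(gpu, (size_t)gpu_count, sizeof gpu[0], cmp_int);
    return memcmp(gpu, ref, (size_t)gpu_count * sizeof gpu[0]) == 0;
}
```

　　The reference runs in time proportional to text length times pattern length, so it is meant for validation on modest inputs rather than as a competitor to the kernel.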
