剛寫完搜索結果采集的程序,想寫一個功能介紹和使用說明的,然後把我這個小程序放到網上供有興趣的人使用,好歹花了這麼一段時間做的東西,如果對別人有用,也算是個小成績。由於剛寫完程序,總想踹口氣,於是這個工作就放幾天了。然後就想看看已經用過的技術,找找這方面的書,能夠深入提高一下。
先後到亞馬遜和當當搜了一大堆,反復比較了一下,只發現一本數比較對路,就是《HTTP Programming Recipes for C# Bots》,翻譯成中文就是《用c#做http自動機器人的配方》,說白了就是怎麼用C#做http的程序機器人。這本書的介紹如下:The Hypertext Transfer Protocol (HTTP) allows information to be exchanged between a web server and a web browser. C# allows you to program HTTP directly. HTTP programming allows you to create programs that access the web much like a human user would. These programs, which are called bots, can collect information or automate common web programming tasks. This book presents a collection of very reusable recipes for C# bot programming. This book covers many topics related to C# HTTP programming. Both secure and insecure HTTP communications are covered, as well as HTTP authentication. Learn to interact with HTTP forms and support both HTTP POST and HTTP GET requests. Collect data from a wide array of HTML constructs, such as tables, and lists. Learn about advanced topics that complicate the life of a bot, such as AJAX and . Also learn about the ethical use of bots, and when bots should not be used. This book also introduces the Heaton Research Spider. The Heaton Research Spider is an open source spider work. Using the Heaton Research Spider you can create spiders that will crawl a web site, much like a real spider crawls the web. The Heaton Research Spider is available in both Java and Microsoft Dot Net form.
翻譯出來就是:超文本傳輸協議能夠讓信息在web服務器和web浏覽器之間交互信息。 c#能夠讓你直接用http編程。http的編程能夠實現用程序訪問web就想人一樣。這種程序叫做程序機器人,能夠收集信息和自動提交任務。這本書提供了一堆非常有用的c#程序機器人程序。這本書涵蓋了很多想關於c#http編程的主題。包括http通信的安全認證、在廣泛的html結構下采集信息,比如表格和清單。學習高級的程序機器人復雜的生命周期,例如ajax和 。也學習程序機器人在什麼情況下被使用,以及不能被使用。這本書同時也介紹了西頓探索蜘蛛。西頓探索蜘蛛是一個開放源代碼的蜘蛛框架。用西頓探索蜘蛛能夠創建一個網站的爬行蜘蛛,它非常像一個真實的網頁爬行蜘蛛。西頓爬行蜘蛛支持java和.net.
哈,這本書真不錯,可惜是2007年的,有點早,當時的技術現在大部分估計不能用了,另外還有一個不足,就是一本書要價418元,有點小貴。於是,我深情地看了一樣書的封皮,默默地離開了。