1·The basic design of this crawler is to load the first link to check onto a queue.
这个爬虫的基本设计是加载第一个链接并将其放入一个队列。
2·E-mail harvesting can be one of the easiest crawling activities, as you'll see in the final crawler example in this article.
E - mail收集可能是最容易的一种爬行行为,在本文中最后一个爬虫例子中我们会看到这一点。
3·The behavior policies define which pages the crawler will bring down to the indexer, how often to go back to a Web site to check it again, and something called a politeness policy.
这种行为策略定义了爬虫会将哪些页面带入索引程序、以什么样的频率回到Web站点上再次对它进行检查,以及一种礼貌原则。
4·Click on the Edit button in the query_statistic line to move to the crawler TAB.
单击query_statistic行的Ededit按钮,移向爬虫选项卡。
5·Define the crawler name (UNIX file system crawler 1, for example), as shown in Figure 7, and then click on the Next button.
定义爬虫名称(例如,UNIX file system crawler 1),如图7所示,然后单击Next按钮。