import scrapy
class CnBlogSpider(scrapy.Spider):
name = "cnblogs"
start_urls = [
'http://www.cnblogs.com/pick/#p%s' %p for p in range(1, 11)
]
def parse(self, response):
for article in response.xpath('//div[@class="post_item"]'):
yield {
'title': article.xpath('div[@class="post_item_body"]/h3/a/text()').extract_first().strip(),
}
想爬1道10页的,结果一直停留在第一页,还很多重复,不知道哪里错了,有人知道吗