scrapy提取不在标签内文字

response.xpath(u’//span[./text()=”出版社:”]/following::text()[1]’)

如果text() 中有空格, 感谢 @董成良提醒, 你可能还需要这么写response.xpath(u’//span[contains(./text(), “出版社:”)]/following::text()[1]’)

或者全匹配:response.xpath(u’//span[.//text()[normalize-space(.)=”出版社:”]]/following::text()[1]’)