【python】xsspider零碎知识点

1.提取url信息 urlparse()

from urlparse import urlparse

url = "http://scrapy-chs.readthedocs.io/zh_CN/1.0/topics/items.html"
urlparse(url)
#ParseResult(scheme='http', netloc='scrapy-chs.readthedocs.io', path='/zh_CN/1.0/topics/items.html', params='', query='', fragment='')
原文地址:https://www.cnblogs.com/dplearning/p/6678970.html