pycharm爬虫程序为何运行结果为exit code 0?
|
代码如下:frombs4importBeautifulSoupfromurllibimportrequestimporttimeheaders={'User-Agent':'Mozilla/5.0(WindowsNT6.1;Win64;x64)AppleWebKit/537.36(KHTML,likeGecko)Chrome/66.0.3359.181Safari/537.36'}defjudgment_sex(class_name):ifclass_name==['member_ico1']:return'女'else:return'男'defget_links(url):req=request.Request(url,headers=headers)response=request.urlopen(req)web_data=response.read()soup=BeautifulSoup(web_data,'lxml')links=soup.select("#page_list>ul>li>a")forlinkinlinks:href=link.get("href")get_info(href)defget_info(url):req=request.Request(url,headers=headers)response=request.urlopen(req)web_data=response.read()soup=BeautifulSoup(web_data,"lxml")tittles=soup.select('div.pho_info>h4')addresses=soup.select('span.pr5')prices=soup.select('#pricePart>div.day_l>span')imgs=soup.select('#floatRightBox>div.js_box.clearfix>div.member_pic>a>img')names=soup.select('#floatRightBox>div.js_box.clearfix>div.w_240>h6>a')sexs=soup.select('#floatRightBox>div.js_box.clearfix>div.member_pic>div')fortittle,address,price,img,name,sexinzip(tittles,addresses,prices,imgs,names,sexs):data={'tittle':tittle.get_text().strip(),'address':address.get_text().strip(),'img':img.get("src"),'name':name.get_text(),'sex':judgment_sex(sex.get("class"))}print(data)if__name__=='_main_':urls=['http://bj.xiaozhu.com/search-duanzufang-p{}-0/'.format(number)fornumberinrange(1,14)]forsingle_urlinurls:get_links(single_url)time.sleep(2)如果不是程序的问题,是不是因为运行结果已经被保存在某个不知道的角落?还请大佬指点。 |
免责声明:本内容仅代表回答者见解不代表本站观点,请谨慎对待。
版权声明:作者保留权利,不代表本站立场。
|
|
|
|
|
|
|
|
你代码缩进是不是有问题啊,为什么if __name__ == '__main__': 会在get_info里面 |
|
|
|
|
|
|
|