利用python,可以實(shí)現(xiàn)填充網(wǎng)頁(yè)表單,從而自動(dòng)登錄WEB門(mén)戶。
(注意:以下內(nèi)容只針對(duì)python3)
環(huán)境準(zhǔn)備:
(1)安裝python
(2)安裝splinter,下載源碼 python setup install
#coding=utf-8import timefrom splinter import Browser def login_mail(url): browser = Browser() #login 163 email websize browser.visit(url) #wait web element loading #fill in account and password browser.find_by_id('username').fill('你的用戶名稱') browser.find_by_id('password').fill('你的密碼') #click the button of login browser.find_by_id('loginBtn').click() time.sleep(5) #close the window of brower browser.quit() if __name__ == '__main__': mail_addr ='http://reg.163.com/' login_mail(mail_addr)
Tips:
(1)如果需要修改web的html屬性,可以使用:js
browser.execute_script('document.getElementById("Html屬性ID").value = "在此提供默認(rèn)值"')
(2)browser = Browser()
不指定的情況下,瀏覽器驅(qū)動(dòng)是火狐(Firefox),可以指定其他:browser = Browser(‘chrome'),需要下載對(duì)應(yīng)的驅(qū)動(dòng)程序
1.python3瀏覽頁(yè)面
#coding=utf-8import urllib.requestimport time#在請(qǐng)求加上頭信息,偽裝成瀏覽器訪問(wèn)headers = {'User-Agent':'Mozilla/5.0 (Windows NT 6.1; WOW64; rv:23.0) Gecko/20100101 Firefox/23.0'}chaper_url='http://XXX' vist_num=1while vist_num<1000: if vist_num%50==0: time.sleep(5) print("This is the 【 "+str(vist_num)+" 】次嘗試") req = urllib.request.Request(url=chaper_url, headers=headers) urllib.request.urlopen(req).read() #.decode('utf-8') vist_num+=1
2.python 多線程
#coding=utf-8import threading #導(dǎo)入threading包from time import sleepimport time def fun1(): print ("Task 1 executed." ) time.sleep(3) print ("Task 1 end." ) def fun2(): print ("Task 2 executed." ) time.sleep(5) print ("Task 2 end." ) threads = [] t1 = threading.Thread(target=fun1) threads.append(t1)t2 = threading.Thread(target=fun2)threads.append(t2) for t in threads: # t.setDaemon(True) t.start()
3.利用python下載百度圖片
#coding=utf-8import urllib.requestimport re def getHtml(url): page = urllib.request.urlopen(url) html = page.read() return html def getImg(html): reg = r'src="(.+?/.jpg)"' imgre = re.compile(reg) html=html.decode('utf-8') imglist = re.findall(imgre,html) x = 0 for imgurl in imglist: urllib.request.urlretrieve(imgurl,'%s.jpg' % x) x+=1 print(str(x))html = getHtml("http://image.baidu.com/channel?c=%E6%91%84%E5%BD%B1&t=%E5%85%A8%E9%83%A8&s=0") print(getImg(html))
效果:
官網(wǎng):鏈接地址
官方示例程序:鏈接地址
以上就是本文的全部?jī)?nèi)容,希望對(duì)大家的學(xué)習(xí)有所幫助,也希望大家多多支持武林站長(zhǎng)站。
新聞熱點(diǎn)
疑難解答
圖片精選