python實(shí)現(xiàn)自動(dòng)登錄

2020-02-15 22:59:24

字體：大中小

供稿：網(wǎng)友

利用python，可以實(shí)現(xiàn)填充網(wǎng)頁(yè)表單，從而自動(dòng)登錄WEB門(mén)戶。

（注意：以下內(nèi)容只針對(duì)python3）

環(huán)境準(zhǔn)備：

（1）安裝python
（2）安裝splinter，下載源碼 python setup install

#coding=utf-8import timefrom splinter import Browser def login_mail(url):  browser = Browser()  #login 163 email websize  browser.visit(url)  #wait web element loading  #fill in account and password  browser.find_by_id('username').fill('你的用戶名稱')  browser.find_by_id('password').fill('你的密碼')  #click the button of login  browser.find_by_id('loginBtn').click()  time.sleep(5)  #close the window of brower  browser.quit() if __name__ == '__main__':  mail_addr ='http://reg.163.com/'  login_mail(mail_addr)

Tips：

（1）如果需要修改web的html屬性，可以使用：js

browser.execute_script('document.getElementById("Html屬性ID").value = "在此提供默認(rèn)值"')

（2）browser = Browser()

不指定的情況下，瀏覽器驅(qū)動(dòng)是火狐（Firefox），可以指定其他：browser = Browser(‘chrome')，需要下載對(duì)應(yīng)的驅(qū)動(dòng)程序

1.python3瀏覽頁(yè)面

#coding=utf-8import urllib.requestimport time#在請(qǐng)求加上頭信息，偽裝成瀏覽器訪問(wèn)headers = {'User-Agent':'Mozilla/5.0 (Windows NT 6.1; WOW64; rv:23.0) Gecko/20100101 Firefox/23.0'}chaper_url='http://XXX' vist_num=1while vist_num<1000: if vist_num%50==0:  time.sleep(5) print("This is the 【 "+str(vist_num)+" 】次嘗試") req = urllib.request.Request(url=chaper_url, headers=headers)  urllib.request.urlopen(req).read() #.decode('utf-8') vist_num+=1

2.python 多線程

#coding=utf-8import threading #導(dǎo)入threading包from time import sleepimport time def fun1():   print ("Task 1 executed." )  time.sleep(3)  print ("Task 1 end." ) def fun2():  print ("Task 2 executed." )  time.sleep(5)  print ("Task 2 end." )  threads = [] t1 = threading.Thread(target=fun1) threads.append(t1)t2 = threading.Thread(target=fun2)threads.append(t2) for t in threads:  # t.setDaemon(True)   t.start()

3.利用python下載百度圖片

#coding=utf-8import urllib.requestimport re def getHtml(url):  page = urllib.request.urlopen(url)  html = page.read()  return html def getImg(html):  reg = r'src="(.+?/.jpg)"'  imgre = re.compile(reg)  html=html.decode('utf-8')  imglist = re.findall(imgre,html)  x = 0  for imgurl in imglist:    urllib.request.urlretrieve(imgurl,'%s.jpg' % x)    x+=1    print(str(x))html = getHtml("http://image.baidu.com/channel?c=%E6%91%84%E5%BD%B1&t=%E5%85%A8%E9%83%A8&s=0") print(getImg(html))

效果：