大家好,寒假到了,無聊寫寫爬蟲
如題,我卡關了,不論我用get還是find都抓不到調皮的href,只會print標題而已
因為我只想抓第一個,所以我這樣寫,求各路大神幫忙(困擾我好幾天了都睡不好覺)
https://i.imgur.com/k18flRu.jpg
https://i.imgur.com/0Hvg6u1.jpg
https://i.imgur.com/qYB59v1.jpg
from selenium import webdriver import time from bs4 import BeautifulSoup from se
lenium.webdriver.common.keys import Keys browser=webdriver.Chrome() browser.impl
icitly_wait(1) browser.get('https://www.youtube.com') time.sleep(5) URL="" for d
ata in open('test.txt','r',encoding='UTF-8'): 胬? data=data.strip() 胬? br
owser.get('https://www.youtube.com/results?search_query='+data+"+OP") 胬? sou
p=BeautifulSoup(browser.page_source) 胬 time.sleep(2) 胬? for i in soup.f
ind('a','yt-simple-endpoint style-scope ytd-video-renderer'): # 找尋第一個 <div>
區塊且 class="img_div_width" 胬胬胬? print (i) 胬胬胬? print ('-'
*50) 胬胬胬? a = i.get('href') 胬胬胬? print (a) 胬胬胬? #U
RL="https://www.youtube.com"+a 胬胬胬? #print (URL) 胬胬胬? print
(-'*100)