Steps for scraping a table with BeautifulSoup and Gadget Selector:
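from bs4 import BeautifulSoup
import requests

# Address of the page to scrape; replace the placeholder with a real URL.
url = 'https://...'
response = requests.get(url)
soup = BeautifulSoup(response.text, 'html.parser')

# find() returns the first <table> on the page, or None if there is none.
table = soup.find('table')
rows = table.find_all('tr')
for row in rows:
    # Header rows built only from <th> cells give an empty list here.
    cols = row.find_all('td')
    for col in cols:
        print(col.text)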
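Here, url is the address of the page to scrape and can be changed to suit your case. soup.find('table') finds the first table on the page, rows = table.find_all('tr') gets all the rows in that table, and cols = row.find_all('td') gets all the data cells in a row. Finally, print(col.text) prints the text of each cell.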
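
If a selector tool gives you a CSS selector for the table you want, the same steps can be wrapped in a small function that returns the cells as a list of rows instead of printing them. The following is only a sketch under a few assumptions: the function name table_to_rows, the selector table#prices, and the sample HTML are all made up for illustration, and the HTML is passed in as a string so the example runs without a network request.

```python
from bs4 import BeautifulSoup


def table_to_rows(html, selector="table"):
    """Return the first table matching `selector` as a list of rows of cell text.

    `table_to_rows` is a hypothetical helper, not part of BeautifulSoup.
    """
    soup = BeautifulSoup(html, "html.parser")
    # select_one() takes a CSS selector and returns the first match or None.
    table = soup.select_one(selector)
    if table is None:
        return []
    rows = []
    for tr in table.find_all("tr"):
        # Collect both header (<th>) and data (<td>) cells, trimmed of whitespace.
        cells = [cell.get_text(strip=True) for cell in tr.find_all(["th", "td"])]
        if cells:
            rows.append(cells)
    return rows


# Made-up sample page standing in for response.text.
sample_html = """
<table id="prices">
  <tr><th>Item</th><th>Price</th></tr>
  <tr><td>Apple</td><td>1.20</td></tr>
  <tr><td>Pear</td><td> 0.80 </td></tr>
</table>
"""

for row in table_to_rows(sample_html, "table#prices"):
    print(row)
```

With a real page you would pass response.text in place of sample_html. Returning an empty list when no table matches avoids the AttributeError that table.find_all('tr') raises when soup.find('table') returns None.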