Python Selenium get attribute 'href' error

Question

I am trying to get the href from a link. Here is my code:

url ='http://money.finance.sina.com.cn/bond/notice/sz149412.html'
link = driver.find_element_by_xpath("//div[@class='blk01'])//ul//li[3]//a[contains(text(),'发行信息']").get_attribute('href')
print(link)

error

 invalid selector: Unable to locate an element with the xpath expression 
SyntaxError: Failed to execute 'evaluate' on 'Document': The string '//div[@class='blk01'])//ul/li[3]//a[contains(text(),'发行信息']' is not a valid XPath expression.

It seems the XPath is not valid, but I cannot figure out what is wrong with it. Any help will be appreciated!

Thanks

Can you show us the error output?
– George Imerlishvili, Commented Mar 29, 2021 at 9:47
Please find my updated question.
– Joyce, Commented Mar 29, 2021 at 9:48

Arundeep Chohan · Accepted Answer · 2021-03-29 10:00:39Z

2

Even this shorter XPath would work:

link = driver.find_element_by_xpath("//a[contains(text(),'发行信息')]")
print(link.get_attribute("href"))

answered Mar 29, 2021 at 10:00


George Imerlishvili · Accepted Answer · 2021-03-29 09:56:12Z

1

Try this instead:

link = driver.find_element_by_xpath('//div[@class="blk01"]//ul//li[3]//a[contains(text(), "发行信息")]')
print(link.get_attribute("href"))

answered Mar 29, 2021 at 9:56


2 Comments

Joyce Over a year ago

Hi, thank you for your help! It worked. May I also ask: does get_attribute() not use the same syntax as .text()? Is that why I cannot use .text() here?

George Imerlishvili Over a year ago

Use element.text, not text(). It is a property, so it takes no parentheses.
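
To illustrate the difference in the comment above, here is a minimal sketch. FakeLink is a hypothetical stand-in for a Selenium WebElement (it is not part of Selenium) so that the snippet runs without a browser. It only mirrors the two members discussed here: text is read as a property, while attributes such as href go through the get_attribute() method.

```python
class FakeLink:
    """Hypothetical stand-in for a Selenium WebElement (illustration only)."""

    def __init__(self, text, attributes):
        self.text = text                # like WebElement.text: a property, no parentheses
        self._attributes = attributes

    def get_attribute(self, name):
        # like WebElement.get_attribute: a method that takes the attribute name
        return self._attributes.get(name)


def describe_link(element):
    """Return (visible text, href) for a link-like element."""
    return element.text, element.get_attribute("href")


link = FakeLink(
    "发行信息",
    {"href": "http://money.finance.sina.com.cn/bond/issue/sz149412.html"},
)
print(describe_link(link))
```

With a real driver, the same describe_link() call should work on the element returned by find_element_by_xpath, since it relies only on .text and .get_attribute().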

Adil kasbaoui · Accepted Answer · 2021-03-29 10:21:21Z

0

# Importing necessary modules
from seleniumwire import webdriver
from webdriver_manager.chrome import ChromeDriverManager
import time

# WebDriver Chrome
driver = webdriver.Chrome(ChromeDriverManager().install())

# Target URL
url = 'http://money.finance.sina.com.cn/bond/notice/sz149412.html'
driver.get(url)
time.sleep(5)
link = driver.find_element_by_xpath('//*[@class="blue" and contains(text(),"发行信息")]').get_attribute('href')
print(link)

answered Mar 29, 2021 at 10:21


3 Comments

Joyce Over a year ago

Thank you so much! It worked. May I ask why my code does not work? ('//a[contains(text(),"发行信息"]')

Adil kasbaoui Over a year ago

You're welcome. Your XPath was invalid because its brackets were unbalanced: the ) that should close contains( is missing before the ].

Adil kasbaoui Over a year ago

@Joyce please consider accepting one of the solutions, to close this question.

vitaliis · Accepted Answer · 2021-03-29 16:04:50Z

0

//div[@class='blk01'])//ul//li[3]//a[contains(text(),'发行信息']

does not look like a stable XPath, and its brackets are unbalanced: there is a stray ) after [@class='blk01'], and the ) that should close contains( is missing before the final ]. This is the main problem.

Try this first:

find_element_by_xpath('//div[@class="blk01"]//ul//li[3]//a[contains(text(),"发行信息")]')

If it works, try just:

find_element_by_xpath('//a[contains(text(),"发行信息")]')

The goal is to make xpath as short as possible.
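
The "invalid selector" error in the question is consistent with unbalanced brackets in the expression. As a rough way to spot that kind of mistake before handing a string to Selenium, here is a minimal sketch of a bracket checker. find_bracket_error is a hypothetical helper, not a Selenium or XPath API, and it only checks that ( ) and [ ] pair up outside quoted strings. It is not a full XPath parser.

```python
def find_bracket_error(xpath):
    """Return (index, message) for the first bracket problem, or None if balanced."""
    closers = {")": "(", "]": "["}
    stack = []      # (bracket, index) for every bracket that is still open
    quote = None    # the quote character we are currently inside, if any
    for i, ch in enumerate(xpath):
        if quote:
            if ch == quote:     # closing quote ends the string literal
                quote = None
            continue            # brackets inside quotes are ignored
        if ch in "'\"":
            quote = ch
        elif ch in "([":
            stack.append((ch, i))
        elif ch in closers:
            if not stack or stack[-1][0] != closers[ch]:
                return i, f"unexpected {ch!r}"
            stack.pop()
    if quote:
        return len(xpath), "unterminated quote"
    if stack:
        ch, i = stack[-1]
        return i, f"unclosed {ch!r}"
    return None


broken = "//div[@class='blk01'])//ul//li[3]//a[contains(text(),'发行信息']"
fixed = "//div[@class='blk01']//ul//li[3]//a[contains(text(),'发行信息')]"

print(find_bracket_error(broken))  # (21, "unexpected ')'")
print(find_bracket_error(fixed))   # None
```

The first report points at the stray ) right after [@class='blk01']. Once that is removed, the same check flags the ] that arrives while contains( is still open.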

answered Mar 29, 2021 at 16:04


chitown88 · Accepted Answer · 2021-04-02 08:58:42Z

0

Is there any particular reason to use Selenium here? The link is present in the HTML source, so it would be more efficient to use requests and BeautifulSoup.

import requests
from bs4 import BeautifulSoup

url = 'http://money.finance.sina.com.cn/bond/notice/sz149412.html'
response = requests.get(url)

soup = BeautifulSoup(response.text, 'html.parser')


a_tag = soup.select_one('a:contains("发行信息")')
# a_tag = soup.select_one('a:-soup-contains("发行信息")')  # <- depending on your bs4 version, the line above may warn or fail because ':contains' is deprecated

link = a_tag['href']

Output:

print(link)
http://money.finance.sina.com.cn/bond/issue/sz149412.html
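
If BeautifulSoup is not available, the same idea (find the a tag whose text contains 发行信息 and read its href) can be sketched with the standard library's html.parser. This is a swapped-in technique, not the answer's method. LinkFinder is a hypothetical helper, and SAMPLE_HTML is made-up markup loosely modelled on the XPath in the question, not the real page source.

```python
from html.parser import HTMLParser


class LinkFinder(HTMLParser):
    """Collect the href of every <a> whose text contains a given substring."""

    def __init__(self, needle):
        super().__init__()
        self.needle = needle
        self.matches = []
        self._in_a = False   # True while we are inside an <a> element
        self._href = None    # href of the <a> currently open, if any
        self._text = []      # text fragments seen inside the current <a>

    def handle_starttag(self, tag, attrs):
        if tag == "a":
            self._in_a = True
            self._href = dict(attrs).get("href")
            self._text = []

    def handle_data(self, data):
        if self._in_a:
            self._text.append(data)

    def handle_endtag(self, tag):
        if tag == "a" and self._in_a:
            if self._href and self.needle in "".join(self._text):
                self.matches.append(self._href)
            self._in_a = False


# Assumed sample markup; the first link and the overall layout are invented.
SAMPLE_HTML = """
<div class="blk01"><ul>
  <li><a href="/example/other.html">other link</a></li>
  <li><a class="blue" href="http://money.finance.sina.com.cn/bond/issue/sz149412.html">发行信息</a></li>
</ul></div>
"""

finder = LinkFinder("发行信息")
finder.feed(SAMPLE_HTML)
print(finder.matches)
```

For the real page, you would feed the downloaded HTML text to finder.feed() instead of SAMPLE_HTML.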

answered Apr 2, 2021 at 8:58

