Webscraping in R "Error in open.connection(x, "rb") : HTTP error 403."

Question

I want to scrape the next page: 'https://www.idealista.com/alquiler-viviendas/girona-provincia/' with rvest package and it gives me the following error:'Error in open.connection(x, "rb") : HTTP error 403.'

library(rvest)
library(curl)
library(xm12)

url= 'https://www.idealista.com/alquiler-viviendas/girona-provincia/'
webidealista=read_html(url)

webidealista=read_html(url)

Error in open.connection(x, "rb") : HTTP error 403.

Can someone help me fix it? I'll be very grateful.
enter image description here

Please do not post an image of code/data/errors: it cannot be copied or searched (SEO), it breaks screen-readers, and it may not fit well on some mobile devices. Please add data using dput and show the expected output for the same. Please read the info about How to ask good question & Reproducible example — rj-nirbhay
– rj-nirbhay, Commented Jun 3, 2020 at 19:19

Emmanuel Hamel · Accepted Answer · 2021-12-15 14:22:47Z

0

I was able to get the html content of the page with the following code :

library(RSelenium)
shell('docker run -d -p 4445:4444 selenium/standalone-firefox')
remDr <- remoteDriver(remoteServerAddr = "localhost", port = 4445L, browserName = "firefox")
remDr$open()
remDr$navigate("https://www.idealista.com/alquiler-viviendas/girona-provincia/")

# Close the pop-up ...
web_Obj_Accept <- remDr$findElement("xpath", "//*[@id='didomi-notice-agree-button']/span")
web_Obj_Accept$clickElement()

# Get content ...
html_Content <- remDr$getPageSource()[[1]]

answered Dec 15, 2021 at 14:22

Emmanuel Hamel

2,3251 gold badge10 silver badges26 bronze badges

Sign up to request clarification or add additional context in comments.

Collectives™ on Stack Overflow

Webscraping in R "Error in open.connection(x, "rb") : HTTP error 403."

1 Answer 1

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related