Skip to main content
Filter by
Sorted by
Tagged with
0 votes
0 answers
60 views

I've been trying to programmatically extract the original, debundled JavaScript source files behind a web app that uses Webpack bundles and source maps. While Chrome DevTools clearly shows the ...
NoOne's user avatar
  • 19
0 votes
0 answers
72 views

I'm working on a Python bot that monitors a Shopify webshop, adds a product to the cart, and tries to continue to checkout. I'm using requests.Session() and BeautifulSoup to handle the stock check, ...
Denzel's user avatar
  • 13
0 votes
1 answer
176 views

I'm scraping a large list of URLs (1.2 million) using Selenium + BeautifulSoup with Python's multiprocessing.Pool. I want to scale it up to scrape faster, ideally without hitting system resource ...
SolidOpt's user avatar
  • 113
0 votes
0 answers
76 views

I have a scraper running on Puppeteer and Node.js. When I change the headless mode from false to true, the bot gets detected. I would like it to work with false since the deployment will be on Railway....
Juliaano's user avatar
-1 votes
2 answers
197 views

How to use python to download a pdf file from a "Download PDF" link on a web page and save it to a local folder? If I move cursor to the link and right click "inspect" I got ...
ylin321's user avatar
0 votes
0 answers
52 views

The proxy I'm trying to connect to is ScraperAPI's proxy and this is the way they provided in documentation to connect to seleniumwire, however it doesn't work and it still uses my IP. API_KEY = '...
Christian's user avatar
0 votes
1 answer
54 views

I’m automating downloads of StoryWeaver books with Selenium+Python. After logging in, I land on the level-page and can see the story cards in the UI—but my script can’t click any of them to navigate ...
Mohammad Malik's user avatar
1 vote
1 answer
62 views

I’m trying to automate downloading StoryWeaver PDFs by: 1. Navigating to the React homepage at https://storyweaver.org.in/en/ 2. Clicking Log in (which opens a React modal) 3. Filling out my email/...
Mohammad Malik's user avatar
1 vote
0 answers
95 views

I know this question has been asked many times, but all I found is using Python programming which is I am totally not familiar. I also follows this article to add Request headers, and this to ...
Kunto Fullstack's user avatar
0 votes
1 answer
57 views

I try to scrape https://www.anytimemailbox.com/s/new-york-42-broadway. I checked https://stackoverflow.com/a/61343018/21294350 and used driver.execute_script("window.scrollTo(0, document.body....
An5Drama's user avatar
  • 774
2 votes
1 answer
59 views

SSLCertVerificationError Traceback (most recent call last) File /Library/Frameworks/Python.framework/Versions/3.12/lib/python3.12/urllib/request.py:1344, in AbstractHTTPHandler....
Deepanshu Kumar's user avatar
-4 votes
1 answer
831 views

I am developing a chatbot for my university that will use a wiki with curriculum information for courses and other relevant data. One of the challenges is optimizing the use of Ollama to process the ...
user27403331's user avatar
0 votes
0 answers
108 views

Apologies in advance as my code so far is a mess - I'm very lost. I have an Excel doc with a list of vocabulary terms. I'm trying to create something that will go to the Merriam Webster site, search ...
Saber's user avatar
  • 33
-6 votes
1 answer
90 views

I've tried to retrieve in https://www.tankathon.com/players/jase-richardson the icons "+" and "-" using importxml in google spreadsheets the problem is I've tried BF1 = https://...
Ma Poub's user avatar
  • 11
-1 votes
2 answers
94 views

Trying to build a scraper that extracts nutritional information from each product page on Sainsbury (for eg, scraping energy values out of https://www.sainsburys.co.uk/gol-ui/product/sainsburys-...
Siddharth Gianchandani's user avatar
0 votes
1 answer
36 views

I'm trying to scrape some data from bvc.com.co (the Colombian Stock Exchange webpage). But always, when loading the third stock, the screen comes blank and the target expected_condition can not be ...
Ivan Castro's user avatar
1 vote
0 answers
73 views

In this web scraper, it scrapes the business data from Google Maps and saves the data to the Excel file, but in these Excel sheets latitude and longitude are always different from the actual it shown ...
mihir soni's user avatar
0 votes
0 answers
23 views

I am trying to mirror levels found on the https://megamaker.webmeka.io/ site using winhttrack. I am excluding all pages other than the index pages, the level page and the mmlv level downloads. ...
Neil McLean's user avatar
0 votes
1 answer
60 views

Long story short I am waiting for a company to provide product data but it is taking them months to get back to me. I've decided to try and scrape the data from their site myself to get things moving ...
Nemo's user avatar
  • 21
-1 votes
2 answers
118 views

I'm trying to scrape sectional times for horse races from RacingTV (e.g., https://www.racingtv.com/results/2025-05-11/leopardstown/1310) using Python and Selenium, and I need the output to be ...
user30640245's user avatar
0 votes
0 answers
54 views

I'm working with apify for web scraping and I recently updated from apify-cli 0.21.6 to 0.21.7 I use the python SDK and for debugging I use pdb.set_trace() or breakpoint() When I updated apify-cli the ...
Cristobal Sarome's user avatar
1 vote
0 answers
48 views

I'm using Python Selenium to find an element, click it, and have the new page stay open in the same window instead of opening in a new tab. I'm trying to be careful with how I word this because I it's ...
JimmyG's user avatar
  • 657
-4 votes
1 answer
63 views

I had a small webcrawler that was written using scrapy and since I didn't want to run it against real site during development I used a local mirror. Mirror was served with python -m http.server 8000 ...
Anton's user avatar
  • 132
2 votes
2 answers
92 views

I tried to search stock in google finance as below. from selenium import webdriver from selenium.webdriver.common.by import By from selenium.webdriver.common.keys import Keys from selenium.webdriver....
TRUE's user avatar
  • 43
0 votes
0 answers
42 views

I've been trying to use html unit to authenticate to a page using okta. The html looks like this: I can use getElementById to get the okta-login-container div, but I can't get anything inside of it. ...
carlos palma's user avatar