
I need to fill form values on a target page and then click a button via Python. I've looked at Selenium and Windmill, but these are testing frameworks and I'm not testing. I'm trying to log into a third-party website programmatically, then download and parse a file we need to insert into our database. The problem with the testing frameworks is that they launch instances of browsers; I just want a script I can schedule to run daily to retrieve the page I want. Is there any way to do this?

5 Answers


You are looking for Mechanize.

Form submitting sample:

import re
from mechanize import Browser

br = Browser()
br.open("http://www.example.com/")
br.select_form(name="order")
# Browser passes through unknown attributes (including methods)
# to the selected HTMLForm (from ClientForm).
br["cheeses"] = ["mozzarella", "caerphilly"]  # (the method here is __setitem__)
response = br.submit()  # submit current form

5 Comments

I'm stuck using Python 2.6 though, so sadly Mechanize isn't an option either. (GopherError dropped in 2.6, looks like).
The Mechanize documentation is usually a bit terse, but it works really well!
I think you should persist and try debugging the gopher problem. In Python 2.6, gopher support was removed IIRC, so fixing your problem is probably a matter of commenting out the import gopherlib line and the few spots where gopher is actually used.
@Habaabiai: Mechanize advertises working on 2.6; you could ask a question about your problem with it. Also, you can try urllib2 (which will force you to write more code to submit a form).
It seems Mechanize does not support Python 3 (yet...?); I guess that means it is not maintained anymore (it is the first FAQ at wwwsearch.sourceforge.net/mechanize/faq.html).
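As the last comment suggests, a form can be submitted with the standard library alone, without Mechanize. Here is a minimal sketch written for Python 3 (where urllib2 became urllib.request); the URL, field names, and credentials are placeholders, not the real form:

```python
# Minimal form POST using only the standard library (Python 3).
# LOGIN_URL and the field names are placeholders for the real form.
from urllib.parse import urlencode
from urllib.request import Request

LOGIN_URL = "http://www.example.com/login"

# Encode the form fields exactly as a browser would for a POST body.
data = urlencode({"username": "xxx", "password": "pass"}).encode("ascii")

# Attaching a data payload makes urlopen issue a POST instead of a GET.
req = Request(LOGIN_URL, data=data,
              headers={"User-Agent": "Mozilla/5.0"})

print(req.get_method())  # POST, because a body is attached
```

Passing `req` to urllib.request.urlopen would then perform the actual login request.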

Have a look at this example, which uses Mechanize; it should give you the basic idea:

#!/usr/bin/python
import re
from mechanize import Browser

br = Browser()

# Ignore robots.txt
br.set_handle_robots(False)
# Google demands a user-agent that isn't a robot
br.addheaders = [('User-agent', 'Firefox')]

# Retrieve the Google home page, saving the response
br.open("http://google.com")

# Select the search box and search for 'foo'
br.select_form('f')
br.form['q'] = 'foo'

# Get the search results
br.submit()

# Find the link to foofighters.com; why did we run a search?
resp = None
for link in br.links():
    siteMatch = re.compile(r'www\.foofighters\.com').search(link.url)
    if siteMatch:
        resp = br.follow_link(link)
        break

# Print the site
content = resp.get_data()
print content
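If Mechanize is unavailable, the link-scanning loop above can be reproduced with the standard library's HTML parser. A small Python 3 sketch (the HTML snippet is made up for illustration):

```python
# Extract all href values from an HTML string using only the standard library.
from html.parser import HTMLParser

class LinkCollector(HTMLParser):
    def __init__(self):
        super().__init__()
        self.links = []

    def handle_starttag(self, tag, attrs):
        # attrs is a list of (name, value) pairs for the tag's attributes.
        if tag == "a":
            for name, value in attrs:
                if name == "href":
                    self.links.append(value)

html = '<a href="http://www.foofighters.com/">Foo Fighters</a> <a href="/other">other</a>'
parser = LinkCollector()
parser.feed(html)
print(parser.links)  # ['http://www.foofighters.com/', '/other']
```

You would feed it the page body returned by whatever HTTP client you use, then match the collected URLs against your target pattern.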


You can use the standard urllib library to do this, like so:

import urllib

# urlencode the form fields; passing them as the data argument makes
# urlretrieve issue a POST instead of a GET.
data = urllib.urlencode({"username": "xxx", "password": "pass"})
urllib.urlretrieve("http://www.google.com/", "somefile.html", None, data)
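In Python 3 the same call is spelled slightly differently: urlretrieve moved to urllib.request, urlencode moved to urllib.parse, and the POST body must be bytes. A sketch with placeholder credentials, with the actual network call left commented out:

```python
from urllib.parse import urlencode

# Encode the form fields the same way the Python 2 one-liner does.
data = urlencode({"username": "xxx", "password": "pass"})
print(data)  # username=xxx&password=pass

# The Python 3 fetch would then be (note the body must be bytes here):
# from urllib.request import urlretrieve
# urlretrieve("http://www.example.com/login", "somefile.html", None, data.encode())
```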


The Mechanize example as suggested seems to work. In input fields where you must enter text, use something like:

br["kw"] = "rowling"  # (the method here is __setitem__)

If some content is generated after you submit the form, as in a search engine, you get it via:

print response.read()  # response is what br.submit() returned


For checkboxes, use 1 and 0 as true and false respectively:

br["checkboxname"] = 1   # checked = true
br["checkboxname2"] = 0  # checked = false
