2

Can I gather intelligent data , HTML scraping using python? I have no knowledge of it , so I would like to get some idea.

1

4 Answers 4

3

Look at the module scrapy:

http://scrapy.org/

Sign up to request clarification or add additional context in comments.

Comments

3

You certainly can - I developed this library in Python for my web scraping work.

A good parsing library is lxml.

If you are new to Python you may want to work through this ebook first.

3 Comments

The Links Are Dead .
@TilakMadichetti No they are not.
oops sorry ! there was a problem with my network that time . My bad :)
1

Try using urllib2 and Beautiful Soup.

urllib2 is useful for requesting URLs programmatically. It's part of the standard library: http://docs.python.org/library/urllib2

Beautiful Soup is good for mining HTML/XML and can be found here: http://pypi.python.org/pypi/BeautifulSoup

Comments

0

You may also use htql library at: http://htql.net.

Comments

Your Answer

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.