How to get text from a span in Python using Scrapy?

Question

Here is the HTML code:

<div class="rendering rendering_person rendering_short rendering_person_short">
  <h3 class="title">
    <a rel="Person" href="https://moh-it.pure.elsevier.com/en/persons/massimo-eraldo-abate" class="link person"><span>Massimo Eraldo Abate</span></a>
  </h3>
  <ul class="relations email">
    <li class="email"><a href="[email protected]" class="link"><span>[email protected]</span></a></li>
  </ul>
  <p class="type"><span class="family">Person: </span>Academic</p>
</div>

How can I extract "Massimo Eraldo Abate" from the code above?

Please help me.

Tomáš Linhart · Accepted Answer · 2017-08-29 06:58:36Z

7

You can extract the name using

response.xpath('//h3[@class="title"]/a/span/text()').extract_first()

Also, take a look at Scrapinghub's blog post for an introduction to XPath.


1 Comment

scriptso Over a year ago

XPath and regex skills are a must for getting what you want. In-depth XPath knowledge will save you a lot of hassle. Look up XPath axes such as preceding-sibling, following-sibling, ancestor, and child. I find they help a lot with pagination in particular.

rafalf · Accepted Answer · 2017-08-29 06:59:50Z

0

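Please take a look at the Scrapy docs; there are lots of ways of extracting text:

>>> from scrapy.selector import Selector
>>> from scrapy.http import HtmlResponse
>>> body = '<html><body><span>good</span></body></html>'
>>> Selector(text=body).xpath('//span/text()').extract()
['good']

>>> # A str body needs an explicit encoding; a bytes body would not.
>>> response = HtmlResponse(url='http://example.com', body=body, encoding='utf-8')
>>> Selector(response=response).xpath('//span/text()').extract()
['good']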
