Java regular expression and xml tags

Question

I want to match text of this form: <tag>alphabetic characters and space</tag>

I propose this one:

<.*>([A-Za-z]+)</.*>

Is this correct?

It is almost correct in the narrow sense that, once you add the space to the character group, it will match the exact string in your question. Whether it is correct in the more general, and perhaps more useful, sense depends entirely on where you're going with this.
– NPE, Commented Dec 6, 2012 at 13:33
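
To make the comment above concrete, here is a minimal sketch comparing the pattern from the question with a variant that has a space added to the character group. The class and method names (TagTextRegex, extract) are made up for illustration, and the sketch assumes the whole input must match, so it uses Matcher.matches().

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class TagTextRegex {
    // Pattern exactly as posted in the question: letters only, no space.
    static final Pattern ORIGINAL = Pattern.compile("<.*>([A-Za-z]+)</.*>");
    // Same pattern with a space added to the character class.
    static final Pattern WITH_SPACE = Pattern.compile("<.*>([A-Za-z ]+)</.*>");

    // Returns the captured text if the whole input matches, otherwise null.
    static String extract(Pattern pattern, String input) {
        Matcher m = pattern.matcher(input);
        return m.matches() ? m.group(1) : null;
    }

    public static void main(String[] args) {
        String input = "<tag>alphabetic characters and space</tag>";
        System.out.println(extract(ORIGINAL, input));   // no match, because of the spaces
        System.out.println(extract(WITH_SPACE, input)); // the text between the tags
    }
}
```

This only covers the exact string from the question; it says nothing about attributes, nesting, or mismatched tag names.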

Zutty · Accepted Answer · 2012-12-06 13:47:20Z

8

Please, for the sake of whatever poor developer will have to deal with your code after you, please do not try to parse XML with regular expressions.

Use a SAX or DOM parser instead. There are plenty of good guides on the web if you search on Google, but here is a quick example using the standard javax.xml package...

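// Parse the input into a DOM tree (xmlFile may be a java.io.File or an InputStream).
Document doc = DocumentBuilderFactory.newInstance().newDocumentBuilder().parse(xmlFile);
Node node = doc.getElementsByTagName("tag").item(0);
// getNodeValue() returns null for element nodes; getTextContent() returns the text inside the tag.
String value = node.getTextContent();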
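
For anyone who wants to run this, here is a self-contained sketch of the same DOM approach that parses an in-memory string instead of a file. The class and method names (TagTextDom, firstTagText) are hypothetical. It reads the text with getTextContent(), since getNodeValue() is defined to return null for element nodes.

```java
import java.io.IOException;
import java.io.StringReader;
import javax.xml.parsers.DocumentBuilder;
import javax.xml.parsers.DocumentBuilderFactory;
import javax.xml.parsers.ParserConfigurationException;
import org.w3c.dom.Document;
import org.w3c.dom.Node;
import org.xml.sax.InputSource;
import org.xml.sax.SAXException;

class TagTextDom {
    // Returns the text content of the first element named tagName, or null if there is none.
    static String firstTagText(String xml, String tagName) {
        try {
            DocumentBuilder builder = DocumentBuilderFactory.newInstance().newDocumentBuilder();
            Document doc = builder.parse(new InputSource(new StringReader(xml)));
            Node node = doc.getElementsByTagName(tagName).item(0);
            return node == null ? null : node.getTextContent();
        } catch (ParserConfigurationException | SAXException | IOException e) {
            // Wrapped so callers of this sketch need not handle checked exceptions.
            throw new IllegalStateException("Could not parse XML", e);
        }
    }

    public static void main(String[] args) {
        System.out.println(firstTagText("<tag>alphabetic characters and space</tag>", "tag"));
    }
}
```

Checking the returned text against [A-Za-z ]+ afterwards is then a plain string match, with no tag handling left in the regex.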

edited Dec 6, 2012 at 13:47

answered Dec 6, 2012 at 13:33

Zutty


1 Comment

user1852036 Over a year ago

I use SAX. This code is not used to parse XML documents; it's for Pentaho Kettle.

Juvanis · Accepted Answer · 2012-12-06 13:34:56Z

2

What if the input is: <tag> something <inner-tag> some other thing </inner-tag> </tag> ?

I'd suggest using an XML parser library, e.g. Apache Digester.
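
To see what that nested input does, here is a small sketch that runs it through the question's pattern with a space added to the character group. The class name NestedTagRegex is made up, and the sketch assumes a full-input match via Matcher.matches(). The pattern still matches, but because .* is greedy and can swallow other tags, the group ends up capturing only the single space between the two closing tags rather than anything useful.

```java
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class NestedTagRegex {
    // The question's pattern with a space added to the character class.
    static final Pattern PATTERN = Pattern.compile("<.*>([A-Za-z ]+)</.*>");

    // Returns the captured group if the whole input matches, otherwise null.
    static String extract(String input) {
        Matcher m = PATTERN.matcher(input);
        return m.matches() ? m.group(1) : null;
    }

    public static void main(String[] args) {
        String nested = "<tag> something <inner-tag> some other thing </inner-tag> </tag>";
        // Brackets make the captured whitespace visible when printed.
        System.out.println("[" + extract(nested) + "]");
    }
}
```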

answered Dec 6, 2012 at 13:34

Juvanis


og Grand · Accepted Answer · 2012-12-06 13:36:15Z

-1

You should add the ? character to make the quantifiers lazy, so that each .* matches as little as possible:

    <.*?>[A-Za-z ]*</.*?>
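
A quick way to see the difference is to run the greedy and lazy forms over an input with two elements. This is only a sketch: the class name GreedyVsLazy, the helper findAll, and the sample input are invented for illustration. With find(), the greedy form treats the whole input as one match, while the lazy form finds each element separately.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

class GreedyVsLazy {
    // Greedy counterpart of the pattern above: .* grabs as much as it can.
    static final Pattern GREEDY = Pattern.compile("<.*>[A-Za-z ]*</.*>");
    // Lazy form from this answer: .*? grabs as little as it can.
    static final Pattern LAZY = Pattern.compile("<.*?>[A-Za-z ]*</.*?>");

    // Collects every non-overlapping match of the pattern in the input.
    static List<String> findAll(Pattern pattern, String input) {
        List<String> matches = new ArrayList<>();
        Matcher m = pattern.matcher(input);
        while (m.find()) {
            matches.add(m.group());
        }
        return matches;
    }

    public static void main(String[] args) {
        String input = "<a>foo</a><b>bar</b>";
        System.out.println(findAll(GREEDY, input)); // one match spanning both elements
        System.out.println(findAll(LAZY, input));   // two matches, one per element
    }
}
```

Lazy quantifiers still do not check that the closing tag name equals the opening one, so this narrows the match but does not make the pattern XML-aware.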

answered Dec 6, 2012 at 13:36

og Grand
