SMRTR Programming | Oct 5, 2025 | Hacker News

You can't parse XML with regex. Let's do it anyways

SMRTR summary

Regular expressions can extract specific data from XML and HTML for web scraping using techniques like anchoring and non-greedy matching, though they cannot fully parse documents.
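As a rough illustration of the two techniques the summary names, here is a minimal Python sketch. It is not taken from the original article: the sample markup, the tag names, and the helper `extract_titles` are all hypothetical. It anchors the pattern on the literal tags around the wanted value and uses a non-greedy `.*?` so each match stops at the nearest closing tag.

```python
import re

# Hypothetical feed fragment, invented for this illustration.
SAMPLE = """
<item><title>First post</title><link>https://example.com/1</link></item>
<item><title>Second post</title><link>https://example.com/2</link></item>
"""

# Anchoring: the literal <title> and </title> tags pin down where a value
# starts and ends. Non-greedy: `.*?` takes as little text as possible, so
# each match ends at the first closing tag that follows.
TITLE_RE = re.compile(r"<title>(.*?)</title>", re.DOTALL)

# Greedy variant for comparison: `.*` runs on to the last closing tag in
# the input, merging everything in between into one match.
GREEDY_RE = re.compile(r"<title>(.*)</title>", re.DOTALL)


def extract_titles(text):
    """Return the text of every <title> element found in `text`."""
    return TITLE_RE.findall(text)


if __name__ == "__main__":
    print(extract_titles(SAMPLE))
    print(len(GREEDY_RE.findall(SAMPLE)))
```

This kind of pattern only suits extraction from markup whose shape is known in advance. It does not handle nested elements, attributes on the tag, CDATA sections, or comments, which is the sense in which regular expressions cannot fully parse such documents.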

SMRTR provides this summary for quick context. The original article belongs to Hacker News.

