Jonathan Hedley

Jonathan Hedley

12 Photos and videos

Tweets

Jonathan Hedley @jhy

Apr 20

I've just released #jsoup 1.22.2. This release makes editing the DOM during traversal more predictable, refreshes the default HTML tag definitions with newer elements and better text boundaries, and improves reliability in parsing and HTTP transport.

Jonathan Hedley

Jonathan Hedley @jhy

Apr 20

Also fixes a number of edge cases in cleaning, stream parsing, XML doctype handling, and Android packaging. jsoup.org/news/release-1.22.…

jsoup release 1.22.2 (2026-Apr-20)

jsoup 1.22.2 makes DOM editing during traversal more predictable, refreshes HTML tag defaults, and fixes edge cases in parsing, transport, cleaning, and XML handling.

jsoup.org

Jonathan Hedley

Jonathan Hedley @jhy

Jan 1

I'm happy to announce the release of #jsoup 1.22.1! This adds support for the re2j regular expression engine for regex-based CSS selectors, a configurable maximum parser depth, and brings a bunch of bug fixes and improvements.

226

Jonathan Hedley

Jonathan Hedley @jhy

Jan 1

Details at jsoup.org/news/release-1.22.… This release marks 15 years of jsoup development.

129

Jonathan Hedley

Jonathan Hedley @jhy

25 Aug 2025

Happy to announce that #jsoup 1.21.2 is out! Adds custom SSLContext support in HTTP/2 connections, brings DOM/fragment parsing perf gains, and fixes some edge cases in parsing, traversal, cloning, and concurrent reads. jsoup.org/news/release-1.21.…

jsoup release 1.21.2 (2025-Aug-25)

jsoup 1.21.2 adds custom SSLContext support, improves attribute handling, boosts DOM performance, and fixes edge case parsing bugs.

jsoup.org

165

Jonathan Hedley

Jonathan Hedley @jhy

23 Jun 2025

Happy to announce that #jsoup v1.21.1 is out now! Lots of improvements, particularly the ability to directly select nodes (like text, data) with the CSS selectors.

586

Jonathan Hedley

Jonathan Hedley @jhy

23 Jun 2025

github.com/jhy/jsoup/release…

Release jsoup 1.21.1 · jhy/jsoup

jsoup 1.21.1 is out now, featuring powerful new node selection capabilities that let you target specific DOM nodes like comments and text nodes using CSS selectors, dynamic tag customization throug...

github.com

100

Jonathan Hedley

Jonathan Hedley @jhy

29 Apr 2025

Very happy to announce that I've just released jsoup 1.20.1! Lots of improvements and bug fixes -- improved HTML parse rules to align with modern browsers, improved XML namespace handling, and a redesigned HTML pretty-printer for better consistency and customizability.

178

Jonathan Hedley

Jonathan Hedley @jhy

29 Apr 2025

This release also delivers performance optimizations, new API enhancements such as flexible tag definitions via TagSet, concise CSS selectors, and parser thread-safety improvements. Big thanks to everyone who helped out. jsoup.org/news/release-1.20.…

jsoup release 1.20.1 (2025-Apr-29)

jsoup 1.20.1 brings tighter HTML parsing, improved XML support, new API methods, performance gains, and robust bug fixes.

jsoup.org

Jonathan Hedley

Jonathan Hedley @jhy

4 Mar 2025

Good news everybody! I just released #jsoup v1.19.1. It adds http/2 request support, and has a bunch of other improvements and bug fixes.

111

Jonathan Hedley

Jonathan Hedley @jhy

4 Mar 2025

See the full list of changes: jsoup.org/news/release-1.19.…

jsoup release 1.19.1 (2025-Mar-04)

jsoup 1.19.1 introduces HTTP/2 support, performance optimizations, and new APIs for cleaner, more efficient HTML parsing and manipulation.

jsoup.org

Jonathan Hedley

Jonathan Hedley @jhy

8 Jan 2025

The next version of #jsoup will (finally!) support making http/2 requests, if you're running on Java 11 . It still works down to Java 8 if you need that.

134

Jonathan Hedley

Jonathan Hedley @jhy

8 Jan 2025

It's a drop-in update with no changes required for existing Jsoup.connect() code, other than setting a system property (jsoup.useHttpClient) to enable.

Jonathan Hedley

Jonathan Hedley @jhy

8 Jan 2025

The implementation uses Java's multi-release JAR feature to make requests via the HttpClient impl if it's available, or will fallback to the current HttpURLConnection. This also gives a path to http/3 support when that PEP lands in Java. github.com/jhy/jsoup/pull/22…

Support http/2 requests via HttpClient by jhy · Pull Request #2257 · jhy/jsoup

This adds support for executing HTTP requests using the Java 11 HttpClient on systems that support it, enabling http/2 requests. On Java 8, and on Android, requests will still go via the existing ...

github.com

Jonathan Hedley

Jonathan Hedley @jhy

27 Nov 2024

Happy to announce the release of jsoup 1.18.2! Faster parsing with less GCs, and a bunch of bug fixes.

Jonathan Hedley

Jonathan Hedley @jhy

27 Nov 2024

Details (and graphs) at github.com/jhy/jsoup/release…

Release jsoup 1.18.2 · jhy/jsoup

Improvements Optimized the throughput and memory use throughout the input read and parse flows, with heap allocations and GC down between -6% and -89%, and throughput improved up to 143% for sma...

github.com

Jonathan Hedley

Jonathan Hedley @jhy

9 Aug 2024

I've been working on improving parse throughput and reducing memory allocations in jsoup (Java HTML parser) by recycling char[] and byte[] buffers between invocations—avoiding unnecessary heap allocations and garbage collection. Details: github.com/jhy/jsoup/pull/21…

128

Jonathan Hedley

Jonathan Hedley @jhy

9 Aug 2024

As a result, heap allocations (bytes/op) are down by -6% to -89%, and throughput has improved by -2% to 143% (with the biggest gains for smaller inputs). These improvements will be in the next release of jsoup, 1.18.2 (coming soon!)

Jonathan Hedley

Jonathan Hedley @jhy

10 Jul 2024

I've just released jsoup 1.18.1! Lots of improvements - a StreamParser that acts like a hybrid DOM SAX parser; URL download progress callbacks, and lots of other improvements. jsoup.org/news/release-1.18.…

jsoup release 1.18.1 (2024-Jul-10)

jsoup 1.18.1 is out now, with a new streaming parser that provides a hybrid DOM SAX event-driven parsing interface, request progress tracking, and many other improvements.

jsoup.org

180

Jonathan Hedley

Jonathan Hedley @jhy

5 Jan 2024

I've been working on a new feature for jsoup that I think is pretty cool: the new StreamParser lets you parse a document lazily with selectNext(query). Elements are parsed from the backing input stream on demand, and will include all their children. github.com/jhy/jsoup/pull/20…

Progressive parsing with StreamParser by jhy · Pull Request #2096 · jhy/jsoup

A StreamParser provides a progressive parse of its input. As each Element is completed, it is emitted via a Stream or Iterator interface. Elements returned will be complete with all their children,...

github.com

436

Jonathan Hedley

Jonathan Hedley @jhy

5 Jan 2024

If you're interested in this, please take a look at the implementation, and try it out by installing a snapshot. It would be great to incorporate any initial feedback / bug-fixes prior to releasing it in the next version of #jsoup.

115