Happy to announce that #jsoup v1.21.1 is out now! Lots of improvements, particularly the ability to directly select nodes (like text, data) with the CSS selectors.
Happy to announce that #jsoup v1.21.1 is out now! Lots of improvements, particularly the ability to directly select nodes (like text, data) with the CSS selectors.
Very happy to announce that I've just released #jsoup 1.20.1!
Lots of improvements and bugfixes -- improved HTML parse rules to align with modern browsers, improved XML namespace handling, and a redesigned HTML pretty-printer for better consistency and customizability. This release also delivers performance optimizations, new API enhancements such as flexible tag definitions via TagSet, concise CSS selectors, and parser thread-safety improvements.
Big thanks to everyone who helped out.
Good news everybody! I just released jsoup v1.19.1. It adds http/2 request support, and has a bunch of other improvements and bug fixes.
Cool to see how #jsoup is used for Marginalia Search!
https://github.com/search?q=repo%3AMarginaliaSearch%2FMarginaliaSearch%20jsoup&type=code
The upcoming version of #jsoup, 1.19.1 will (finally!) support making http/2 requests, if you're running on Java 11+. It still works down to Java 8 if you need that.
It's a drop-in update with no changes required for existing Jsoup.connect() code, other than setting a system property (jsoup.useHttpClient) to enable.
The implementation uses Java's multi-release JAR feature to make requests via the HttpClient impl if it's available, or will fallback to the current HttpURLConnection. This also gives a path to http/3 support when that PEP lands in Java.
ksoup v0.2.1 – Faster, Modular HTML Parsing for Kotlin Multiplatform
#html #java #jsoup #kmp #kotlin #multiplatform #parsing
https://github.com/fleeksoft/ksoup/releases/tag/0.2.1?utm_medium=erik.in&utm_source=mastodon
Have you listened to #74 yet?
@javajuneau @dhinojosa @ianhlavats and @kito99 are joined by pilot and #OSS contributor, @lprimak. They discuss #microprofile, #htmlx, @devoxx Genie, #IntelliJ #AI Assistant, #Apache #Shiro, #Arquillian, #PrimeVue, #PrimeNG, #Angular, #BDD, #Jsoup, #JBake, #TestContainers, and more! https://www.pubhouse.net/2024/10/stackd74-but-its-soup.html
On your next outing, listen to #74: @javajuneau @dhinojosa @ianhlavats and @kito99@mastadon.social are joined by pilot and #OSS contributor, @lprimak. They discuss #microprofile, #htmlx, @devoxx Genie, #IntelliJ #AI Assistant, #Apache #Shiro, #Arquillian, #PrimeVue, #PrimeNG, #Angular, #BDD, #Jsoup, #JBake, #TestContainers, Apple #VisionPro and much more! https://www.pubhouse.net/2024/10/stackd74-but-its-soup.html
#74: But it’s soup
After a long hiatus, the whole gang is back! @javajuneau @dhinojosa @ianhlavats and @kito99@mastadon.social are joined by pilot and #OSS contributor, lprimak@mastodon.social. They discuss #microprofile, #htmlx, @devoxx Genie, #IntelliJ #AI Assistant, #Apache #Shiro, #Arquillian, #PrimeVue, #PrimeNG, #Angular, #BDD, #Jsoup, #JBake, #TestContainers, and much more! https://www.pubhouse.net/2024/10/stackd74-but-its-soup.html
#74: But it’s soup
After a long hiatus, the whole gang is back! @javajuneau @dhinojosa @ianhlavats and @kito99 are joined by pilot and #OSS contributor, @lprimak. They discuss #microprofile, #htmlx, @devoxx Genie, #IntelliJ #AI Assistant, #Apache #Shiro, #Arquillian, #PrimeVue, #PrimeNG, #Angular, #BDD, #Jsoup, #JBake, #TestContainers, and much more! https://www.pubhouse.net/2024/10/stackd74-but-its-soup.html
I've been working on a new feature for jsoup that I think is pretty cool: the new StreamParser lets you parse a document progressively with stream()
, or lazily with selectNext(query)
. Elements are parsed from the backing input stream on demand, and when emitted will include all their children. This gives the benefits and simplicity of a DOM parser, but also enables chunked parsing that would otherwise cause out of memory exceptions, or to terminate the parse early.
The actual parse tree is backed by the full HTML or XML parser, and so all that functionality remains (like implicit elements, source position tracking, error tracking, etc).
If you're interested in this, please take a look at the implementation, and try it out by installing a snapshot. It would be great to incorporate any initial feedback / bug-fixes prior to releasing it in the next version of #jsoup.
I just released jsoup 1.17.2! Mostly bug fixes this round.
I'm very happy to announce the launch of #jsoup version 1.17.1!
Out now with support for request-level authentication, attribute name & value source ranges, stream() iterable support, the :is() selector, and a bunch of other improvements and bug fixes.