jsoup 1.10.3 发布,Java 的 HTML 解析器

栏目: Html · 发布时间: 7年前

内容简介:jsoup 1.10.3 发布,Java 的 HTML 解析器

jsoup 1.10.3 发布了,该版本带来了更好的 CSS 选择器性能,Jsoup.Connection 改进和其他 bug 修复。

详情包括:

Improvements

  • Added Elements.eachText() and  Elements.eachAttr() , which return a list of an  Element's text or attribute values, respectively. This makes it simpler to for example get a list of each URL on a page:  List<String> urls = doc.select("a").eachAttr("abs:href"");

  • Improved selector validation for :contains(...) with unbalanced quotes.

  • Improved the speed of index based CSS selectors and other methods that use elementSiblingIndex, by a factor of 34x.

  • Added Node.clearAttributes() , to simplify removing of all attributes of a  NodeElement .

Fixes

  • Bugfix: if an attribute name started or ended with a control character, the parse would fail with a validation exception.

  • Bugfix: Element.hasClass() and the  .classname selector would not find the class attribute case-insensitively.

  • Bugfix: In Jsoup.Connection , if a redirect contained a query string with  %xx escapes, they would be double escaped before the redirect was followed, leading to fetching an incorrect location.

  • Bugfix: In Jsoup.Connection , if a request body was set and the connection was redirected, the body would incorrectly still be sent.

  • Bugfix: In DataUtil when detecting the character set from meta data, and there are two Content-Types defined, use the one that defines a character set.

  • Bugfix: when parsing unknown tags in case-sensitive HTML mode, end tags would not close scope correctly.

  • In Jsoup.Connection , ensure there is no Content-Type set when being redirected to a GET.

  • Bugfix: in certain locales (Turkish specifically), lowercasing and case insensitivity could fail for specific items.

下载地址: https://jsoup.org/download


以上就是本文的全部内容,希望本文的内容对大家的学习或者工作能带来一定的帮助,也希望大家多多支持 码农网

查看所有标签

猜你喜欢:

本站部分资源来源于网络,本站转载出于传递更多信息之目的,版权归原作者或者来源机构所有,如转载稿涉及版权问题,请联系我们

创业维艰

创业维艰

本·霍洛维茨 Ben Horowitz / 杨晓红、钟莉婷 / 中信出版社 / 2015-2 / 49

本·霍洛维茨,硅谷顶级投资人,与网景之父马克·安德森联手合作18年,有着丰富的创业和管理经验。2009年创立风险投资公司A16Z,被外媒誉为“硅谷最牛的50个天使投资人”之一,先后在初期投资了Facebook、Twitter、Groupon、Skype,是诸多硅谷新贵的创业导师。 在《创业维艰》中,本·霍洛维茨从自己的创业经历讲起,以自己在硅谷近20余年的创业、管理和投资经验,对创业公司(尤......一起来看看 《创业维艰》 这本书的介绍吧!

JS 压缩/解压工具
JS 压缩/解压工具

在线压缩/解压 JS 代码

URL 编码/解码
URL 编码/解码

URL 编码/解码

XML 在线格式化
XML 在线格式化

在线 XML 格式化压缩工具