jsoup-1.11.2
Category: Graphics and image processing
Development tool: Pascal
File size: 434KB
Downloads: 0
Upload date: 2019-03-11 07:30:59
Uploader: helonan
Description: useful for Windows programming
File list:
.travis.yml (115, 2017-12-24)
CHANGES (49471, 2017-12-24)
LICENSE (1102, 2017-12-24)
pom.xml (8191, 2017-12-24)
src (0, 2017-12-24)
src\main (0, 2017-12-24)
src\main\java (0, 2017-12-24)
src\main\java\org (0, 2017-12-24)
src\main\java\org\jsoup (0, 2017-12-24)
src\main\java\org\jsoup\Connection.java (28531, 2017-12-24)
src\main\java\org\jsoup\HttpStatusException.java (655, 2017-12-24)
src\main\java\org\jsoup\Jsoup.java (10815, 2017-12-24)
src\main\java\org\jsoup\SerializationException.java (1572, 2017-12-24)
src\main\java\org\jsoup\UncheckedIOException.java (269, 2017-12-24)
src\main\java\org\jsoup\UnsupportedMimeTypeException.java (677, 2017-12-24)
src\main\java\org\jsoup\examples (0, 2017-12-24)
src\main\java\org\jsoup\examples\HtmlToPlainText.java (5467, 2017-12-24)
src\main\java\org\jsoup\examples\ListLinks.java (1804, 2017-12-24)
src\main\java\org\jsoup\examples\Wikipedia.java (750, 2017-12-24)
src\main\java\org\jsoup\examples\package-info.java (146, 2017-12-24)
src\main\java\org\jsoup\helper (0, 2017-12-24)
src\main\java\org\jsoup\helper\ChangeNotifyingArrayList.java (1815, 2017-12-24)
src\main\java\org\jsoup\helper\DataUtil.java (11350, 2017-12-24)
src\main\java\org\jsoup\helper\HttpConnection.java (45926, 2017-12-24)
src\main\java\org\jsoup\helper\StringUtil.java (9303, 2017-12-24)
src\main\java\org\jsoup\helper\Validate.java (3118, 2017-12-24)
src\main\java\org\jsoup\helper\W3CDom.java (6981, 2017-12-24)
src\main\java\org\jsoup\internal (0, 2017-12-24)
src\main\java\org\jsoup\internal\ConstrainableInputStream.java (4264, 2017-12-24)
src\main\java\org\jsoup\internal\Normalizer.java (435, 2017-12-24)
src\main\java\org\jsoup\internal\package-info.java (162, 2017-12-24)
src\main\java\org\jsoup\nodes (0, 2017-12-24)
src\main\java\org\jsoup\nodes\Attribute.java (6844, 2017-12-24)
src\main\java\org\jsoup\nodes\Attributes.java (13627, 2017-12-24)
src\main\java\org\jsoup\nodes\BooleanAttribute.java (489, 2017-12-24)
src\main\java\org\jsoup\nodes\CDataNode.java (1010, 2017-12-24)
src\main\java\org\jsoup\nodes\Comment.java (1292, 2017-12-24)
... ...
# jsoup: Java HTML Parser
**jsoup** is a Java library for working with real-world HTML. It provides a very convenient API for extracting and manipulating data, using the best of DOM, CSS, and jQuery-like methods.
**jsoup** implements the [WHATWG HTML5](http://whatwg.org/html) specification, and parses HTML to the same DOM as modern browsers do.
* scrape and [parse](https://jsoup.org/cookbook/input/parse-document-from-string) HTML from a URL, file, or string
* find and [extract data](https://jsoup.org/cookbook/extracting-data/selector-syntax), using DOM traversal or CSS selectors
* manipulate the [HTML elements](https://jsoup.org/cookbook/modifying-data/set-html), attributes, and text
* [clean](https://jsoup.org/cookbook/cleaning-html/whitelist-sanitizer) user-submitted content against a safe white-list, to prevent XSS attacks
* output tidy HTML
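The capabilities listed above can be sketched in one small class. This is a minimal illustration rather than an official sample: the class name `JsoupTour`, its helper methods, and the sample HTML strings are hypothetical, and it assumes the jsoup jar from this release is on the classpath. It uses `org.jsoup.safety.Whitelist`, which is the class name in this version; later jsoup releases renamed it to `Safelist`.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;
import org.jsoup.nodes.Element;
import org.jsoup.safety.Whitelist;

// Hypothetical demo class, not part of jsoup itself.
public class JsoupTour {
    // Parse: build a Document from a string. The base URI is used
    // later to resolve relative links.
    static Document parse(String html, String baseUri) {
        return Jsoup.parse(html, baseUri);
    }

    // Extract: absolute URL of the first link, or "" if there is none.
    static String firstLinkUrl(Document doc) {
        Element link = doc.select("a[href]").first();
        return link == null ? "" : link.absUrl("href");
    }

    // Manipulate: set rel="nofollow" on every link in the document.
    static void markLinksNoFollow(Document doc) {
        doc.select("a[href]").attr("rel", "nofollow");
    }

    // Clean: keep only the text-formatting and link tags that
    // Whitelist.basic() permits; everything else is dropped.
    static String sanitize(String untrustedHtml) {
        return Jsoup.clean(untrustedHtml, Whitelist.basic());
    }

    public static void main(String[] args) {
        Document doc = parse(
            "<p>See the <a href='/cookbook/'>cookbook</a>.</p>",
            "https://jsoup.org/");
        System.out.println(firstLinkUrl(doc));

        markLinksNoFollow(doc);
        // Output: serialize the modified body back to HTML.
        System.out.println(doc.body().html());

        System.out.println(sanitize(
            "<p onclick='steal()'>Hello</p><script>alert(1)</script>"));
    }
}
```

Passing a base URI to `Jsoup.parse` is what allows `absUrl("href")` to turn `/cookbook/` into a full URL; without it, `absUrl` returns an empty string for relative links.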
jsoup is designed to deal with all varieties of HTML found in the wild: from pristine and validating to invalid tag soup, jsoup will create a sensible parse tree.
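As a rough illustration of that tolerance, the sketch below feeds jsoup a fragment with unclosed tags and no document skeleton. The class name `TagSoupDemo`, the helper `parseSoup`, and the input string are made up for this example; it assumes the jsoup jar is on the classpath.

```java
import org.jsoup.Jsoup;
import org.jsoup.nodes.Document;

// Hypothetical demo class, not part of jsoup itself.
public class TagSoupDemo {
    // Parse arbitrary, possibly malformed, HTML into a Document.
    static Document parseSoup(String html) {
        return Jsoup.parse(html);
    }

    public static void main(String[] args) {
        // Unclosed <p> and <b> tags; no <html>, <head>, or <body>.
        Document doc = parseSoup("<p>One<p>Two <b>bold");

        // The second <p> implicitly closes the first, so two
        // paragraph elements end up in the tree.
        System.out.println(doc.select("p").size());

        // The serialized document includes the html, head, and body
        // elements that the parser created.
        System.out.println(doc.html());
    }
}
```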
See [**jsoup.org**](https://jsoup.org/) for downloads and the full [API documentation](https://jsoup.org/apidocs/).
## Example
Fetch the [Wikipedia](http://en.wikipedia.org/wiki/Main_Page) homepage, parse it to a [DOM](https://developer.mozilla.org/en-US/docs/Web/API/Document_Object_Model/Introduction), and select the headlines from the *In the News* section into a list of [Elements](https://jsoup.org/apidocs/index.html?org/jsoup/select/Elements.html) ([online sample](https://try.jsoup.org/~LGB7rk_atM2roavV0d-czMt3J_g), [full source](https://github.com/jhy/jsoup/blob/master/src/main/java/org/jsoup/examples/Wikipedia.java)):
```java
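// Fetch and parse the page over HTTP
Document doc = Jsoup.connect("http://en.wikipedia.org/").get();
log(doc.title());

// Select the headline links in the "In the News" box
Elements newsHeadlines = doc.select("#mp-itn b a");
for (Element headline : newsHeadlines) {
    // log(...) is a small printf-style helper defined in the full source
    log("%s\n\t%s",
        headline.attr("title"), headline.absUrl("href"));
}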
```
## Open source
jsoup is an open source project distributed under the liberal [MIT license](https://jsoup.org/license). The source code is available at [GitHub](https://github.com/jhy/jsoup/tree/master/src/main/java/org/jsoup).
## Getting started
1. [Download](https://jsoup.org/download) the latest jsoup jar (or add it to your Maven/Gradle build)
2. Read the [cookbook](https://jsoup.org/cookbook/)
3. Enjoy!
## Development and support
If you have any questions on how to use jsoup, or have ideas for future development, please get in touch via the [mailing list](https://jsoup.org/discussion).
If you find any issues, please file a [bug](https://jsoup.org/bugs) after checking for duplicates.
The [colophon](https://jsoup.org/colophon) talks about the history of and tools used to build jsoup.
## Status
jsoup is in general release and is stable.