Package | Description |
---|---|
org.jsoup |
Contains the main
Jsoup class, which provides convenient static access to the jsoup functionality. |
org.jsoup.helper | |
org.jsoup.nodes |
HTML document structure nodes.
|
org.jsoup.parser |
Contains the HTML parser, tag specifications, and HTML tokeniser.
|
org.jsoup.safety |
Contains the jsoup HTML cleaner, and whitelist definitions.
|
Modifier and Type | Method | Description |
---|---|---|
Document |
Connection.get() |
Execute the request as a GET, and parse the result.
|
Document |
Connection.Response.parse() |
Read and parse the body of the response as a Document.
|
static Document |
Jsoup.parse(File in,
String charsetName) |
Parse the contents of a file as HTML.
|
static Document |
Jsoup.parse(File in,
String charsetName,
String baseUri) |
Parse the contents of a file as HTML.
|
static Document |
Jsoup.parse(InputStream in,
String charsetName,
String baseUri) |
Read an input stream, and parse it to a Document.
|
static Document |
Jsoup.parse(InputStream in,
String charsetName,
String baseUri,
Parser parser) |
Read an input stream, and parse it to a Document.
|
static Document |
Jsoup.parse(String html) |
Parse HTML into a Document.
|
static Document |
Jsoup.parse(String html,
String baseUri) |
Parse HTML into a Document.
|
static Document |
Jsoup.parse(String html,
String baseUri,
Parser parser) |
Parse HTML into a Document, using the provided Parser.
|
static Document |
Jsoup.parse(URL url,
int timeoutMillis) |
Fetch a URL, and parse it as HTML.
|
static Document |
Jsoup.parseBodyFragment(String bodyHtml) |
Parse a fragment of HTML, with the assumption that it forms the
body of the HTML. |
static Document |
Jsoup.parseBodyFragment(String bodyHtml,
String baseUri) |
Parse a fragment of HTML, with the assumption that it forms the
body of the HTML. |
Document |
Connection.post() |
Execute the request as a POST, and parse the result.
|
Modifier and Type | Method | Description |
---|---|---|
Document |
HttpConnection.get() |
|
static Document |
DataUtil.load(File in,
String charsetName,
String baseUri) |
Loads a file to a Document.
|
static Document |
DataUtil.load(InputStream in,
String charsetName,
String baseUri) |
Parses a Document from an input steam.
|
static Document |
DataUtil.load(InputStream in,
String charsetName,
String baseUri,
Parser parser) |
Parses a Document from an input steam, using the provided Parser.
|
Document |
HttpConnection.Response.parse() |
|
Document |
HttpConnection.post() |
Modifier and Type | Method | Description |
---|---|---|
void |
W3CDom.convert(Document in,
Document out) |
Converts a jsoup document into the provided W3C Document.
|
Document |
W3CDom.fromJsoup(Document in) |
Convert a jsoup Document to a W3C Document.
|
Modifier and Type | Method | Description |
---|---|---|
Document |
Document.clone() |
|
static Document |
Document.createShell(String baseUri) |
Create a valid, empty shell of a document, suitable for adding more elements to.
|
Document |
Document.normalise() |
Normalise the document.
|
Document |
Document.outputSettings(Document.OutputSettings outputSettings) |
Set the document's output settings.
|
Document |
Node.ownerDocument() |
Gets the Document associated with this Node.
|
Document |
Document.quirksMode(Document.QuirksMode quirksMode) |
Modifier and Type | Method | Description |
---|---|---|
static Document |
Parser.parse(String html,
String baseUri) |
Parse HTML into a Document.
|
static Document |
Parser.parseBodyFragment(String bodyHtml,
String baseUri) |
Parse a fragment of HTML into the
body of a Document. |
static Document |
Parser.parseBodyFragmentRelaxed(String bodyHtml,
String baseUri) |
|
Document |
Parser.parseInput(Reader inputHtml,
String baseUri) |
|
Document |
Parser.parseInput(String html,
String baseUri) |
Modifier and Type | Method | Description |
---|---|---|
Document |
Cleaner.clean(Document dirtyDocument) |
Creates a new, clean document, from the original dirty document, containing only elements allowed by the whitelist.
|
Modifier and Type | Method | Description |
---|---|---|
Document |
Cleaner.clean(Document dirtyDocument) |
Creates a new, clean document, from the original dirty document, containing only elements allowed by the whitelist.
|
boolean |
Cleaner.isValid(Document dirtyDocument) |
Determines if the input document bodyis valid, against the whitelist.
|
Copyright © 2009–2018 Jonathan Hedley. All rights reserved.