Similar (i.e. a dynamic equivalent) to rvest::read_html.

render_html(
  splash_obj = splash_local,
  url,
  base_url,
  timeout = 30,
  resource_timeout,
  wait = 0,
  proxy,
  js,
  js_src,
  filters,
  allowed_domains,
  allowed_content_types,
  forbidden_content_types,
  viewport = "1024x768",
  images,
  headers,
  body,
  http_method,
  save_args,
  load_args,
  http2 = FALSE,
  engine = c("webkit", "chromium"),
  raw_html = FALSE
)

Arguments

splash_obj

Object created by a call to splash()

url

The URL to render (required)

base_url

The base URL to render the page with.

timeout

A timeout (in seconds) for the render (defaults to 30). Without re-configuring the start-up parameters of the Splash server (not this package) the maximum allowed value for the timeout is 60 seconds.

resource_timeout

A timeout (in seconds) for individual network requests.

wait

Time (in seconds) to wait for updates after page is loaded (defaults to 0).

proxy

Proxy profile name or proxy URL.

js

Javascript profile name.

js_src

JavaScript code to be executed in page context.

filters

Comma-separated list of request filter names.

allowed_domains

Comma-separated list of allowed domain names. If present, Splash won’t load anything neither from domains not in this list nor from subdomains of domains not in this list.

allowed_content_types

Comma-separated list of allowed content types. If present, Splash will abort any request if the response’s content type doesn’t match any of the content types in this list. Wildcards are supported.

forbidden_content_types

Comma-separated list of forbidden content types. If present, Splash will abort any request if the response’s content type matches any of the content types in this list. Wildcards are supported.

viewport

View width and height (in pixels) of the browser viewport to render the web page. Format is “width>xheight”, e.g. 800x600. Default value is "full".

images

Whether to download images.

headers

HTTP headers to set for the first outgoing request.

body

Body of HTTP POST request to be sent if method is POST.

http_method

HTTP method of outgoing Splash request.

save_args

A list of argument names to put in cache.

load_args

Parameter values to load from cache

http2

Enable or disable HTTP2 support. TRUE to enable; FALSE to disable; defaults to FALSE when engine is webkit due to malformed behaviour in 3.4.x of Splash

engine

one of webkit or chromium; defaults to webkit

raw_html

if TRUE then return a character vector vs an XML document. Only valid for render_html

Value

An XML document. Note that this is processed by xml2::read_html() so it will not be the pristine, raw, rendered HTML from the site. Use raw_html=TRUE if you do not want it to be processed first by xml2. If you choose raw_html=TRUE you'll get back a character vector.

References

Splash docs

See also

Other splash_renderers: execute_lua(), render_har(), render_jpeg(), render_json(), render_png()