Extract attributes, text and tag name from webclient page html content

wc_html_text(dom_node, trim = FALSE)

wc_html_attr(dom_node, attr)

wc_html_name(dom_node)

Arguments

dom_node

a webclient page DOM node (likely produced by wc_html_nodes())

trim

if TRUE will trim leading/trailing white space

attr

name of attribute to retrieve

Examples

if (FALSE) {
wc <- web_client()

wc %>% wc_go("https://usa.gov/")

wc %>%
  wc_html_nodes("a") %>%
  sapply(wc_html_text)

wc %>%
  wc_html_nodes(xpath=".//a") %>%
  sapply(wc_html_text)

wc %>%
  wc_html_nodes(xpath=".//a") %>%
  sapply(wc_html_attr, "href")
}