summarise_url.Rd
Fetches the HTML from x and returns its essential components, including:
url
original_url
title
description
site_name
theme_color
amp_url
canonical_url
image_url
video_url
feed_url
favicon_url
reading_time
text (the reduced, plain text)
If any components cannot be derived from the contents of the URL, they will be NA.
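Because underivable components come back as NA, callers will usually want to check for them before use. The following is a minimal sketch only: it assumes the result behaves like a named list, and `res` is a hypothetical hand-built stand-in rather than real output from summarise_url().

```r
# Hypothetical stand-in for a summarise_url() result.
# Assumption: the real return value can be treated as a named list.
res <- list(
  url = "https://example.com/post",
  title = "An example post",
  description = NA_character_,
  amp_url = NA_character_,
  reading_time = NA
)

# Names of the components that could not be derived (i.e. are NA)
missing_fields <- names(res)[vapply(res, function(v) all(is.na(v)), logical(1))]

# Fall back to the URL when no title is available
label <- if (is.na(res$title)) res$url else res$title
```

Checking with is.na() before use avoids passing NA into downstream string or URL handling.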
summarise_url(x)
| x | URL |
|---|---|
# NOT RUN {
ex_url <- "https://techcrunch.com/2019/02/28/thailand-passes-controversial-cybersecurity-law/"
str(summarise_url(ex_url), 1)
# }