B0_std.Url
URLs
authority u
tries to extract an URL authority (HOST:PORT
) part from u
.
path_and_query u
tries to extract an URL path and query part from u
.
list_of_text_scrape s
roughly finds URLs and relative or absolute paths in s
by looking in order:
href
or src
substring then tries to parses the content of an HTML attribute. This may result in relative or absolute paths.http
substrings in s
and then delimits an URL depending on the previous characters and checks that the delimited URL starts with http://
or https://
.