分析网页结构

西域网的特种扳手品类:

基本确定http://www.ehsy.com/category-16883?p=i的格式即为最后的格式!

读取网页

读取第一个网页(测试)

## {html_document}
## <html>
## [1] <head>\n<meta http-equiv="Content-Type" content="text/html; charset=UTF-8 ...
## [2] <body class="layout-body category-result">\n<div class="layout-header-con ...
## # A tibble: 2 x 2
##   encoding     confidence
##   <chr>             <dbl>
## 1 UTF-8              1   
## 2 windows-1252       0.42
## Best guess: UTF-8 (100% confident)
##  [1] "有库存" "有库存" "有库存" "有库存" "有库存" "有库存" "有库存" "有库存"
##  [9] "有库存" "有库存" "有库存" "有库存" "有库存" "有库存" "有库存" "有库存"
## [17] "有库存" "有库存" "有库存" "有库存" "有库存" "有库存" "有库存" "有库存"
## [25] "有库存" "有库存" "有库存" "有库存" "有库存" "有库存" "有库存" "有库存"
## [33] "有库存" "有库存" "有库存" "有库存"