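Update the crawler design document and add a summary extraction module that generates document summaries; improve the base crawler class's title extraction logic to support multiple selectors, and adjust content processing to remove duplicate titles.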
.gitignore
@@ -32,6 +32,7 @@ wheels/
# Output files
output/
output_post/

# Temporary files
*.tmp