chore: move zhongfali crawler to one_off_sites

This commit is contained in:
hello-dd-code
2026-03-04 09:43:35 +08:00
parent 19cf9ce901
commit bc4a2aa4d5
2 changed files with 445 additions and 0 deletions
+13
View File
@@ -5,6 +5,7 @@
## 目录
- `common_sites/`:大律师、找法网、法律快车、律图、华律 5 个采集脚本
- `one_off_sites/`:一次性/临时站点采集脚本(不纳入常用站点批量启动)
- `request/proxy_config.py`:代理配置加载逻辑
- `request/proxy_settings.json`:代理配置文件
- `Db.py`:数据库连接与基础操作
@@ -60,3 +61,15 @@ DLS_DIRECT=1 DLS_NO_DB=1 ./common_sites/start.sh
# 如果不需要解析 params 扩展信息
./.venv/bin/python ./common_sites/export_lawyers_excel.py --no-parse-params
```
## 一次性站点(众法利)
脚本:`one_off_sites/zhongfali_single.py`
```bash
# 仅采集写 JSON(默认输出到 data/one_off_sites/
./.venv/bin/python ./one_off_sites/zhongfali_single.py --direct --no-db
# 采集并写入 lawyer 表(domain=众法利单页)
./.venv/bin/python ./one_off_sites/zhongfali_single.py --direct
```