cdZWj e4751ed620 更新 'README.md' | 3 years ago | |
---|---|---|
.idea | 3 years ago | |
__pycache__ | 3 years ago | |
data | 3 years ago | |
img_folder | 3 years ago | |
logs | 3 years ago | |
res_folder | 3 years ago | |
structure | 3 years ago | |
templates | 3 years ago | |
utils | 3 years ago | |
README.md | 3 years ago | |
ans_structrue_old.py | 3 years ago | |
ceshi.py | 3 years ago | |
configs.py | 3 years ago | |
math_server.py | 3 years ago | |
parse_chunk.py | 3 years ago | |
photo_upload.py | 3 years ago | |
photo_upload_qcloud.py | 3 years ago | |
photo_upload_qcloud2.py | 3 years ago | |
requirements.txt | 3 years ago | |
server3.py | 3 years ago | |
server_new.py | 3 years ago | |
server_phy.py | 3 years ago | |
server_phy2.py | 3 years ago | |
server_tools.py | 3 years ago | |
server_tools2.py | 3 years ago | |
test.py | 3 years ago |
对word格式(doc, docx)的理科试卷进行解析结构化
主要支持3大类型:1>>模板格式的教师类用卷(每道题目下面含答案和解析)
2>> 题文和答案分开的形式,即题文单独放一起,答案单独放一起
3>> 只含题文,或题文下只含答案或解析
要求:
1>>排版规范,每道题或其答案从前往后,从小到大排列,题号连续不重复;
2>>与题文无关内容删除,特别是试卷中间和结尾的无用信息;
3>>题型行尽量明确;
4>>题文和答案分开的形式中,答案的标题要明显有“参考答案”类似字样,后面无用部分删除;
5>>本文所述试卷仅包含题型行、题干、答案、解析、分析、点睛、点评等,像每个题后面插个变式训练类型的非正式试卷不支持!
结构化返回形式:
解析流程:
上线服务器: 182 和 185
所需配套环境或服务:office word 、wordbin 、mathtype6