tujintao d9e302c832 revise | 10 months ago | |
---|---|---|
__pycache__ | 11 months ago | |
logs | 1 year ago | |
main_clear | 11 months ago | |
model_data | 10 months ago | |
README.md | 10 months ago | |
comparison.py | 10 months ago | |
comprehensive_score.py | 10 months ago | |
config.py | 10 months ago | |
data_preprocessing.py | 11 months ago | |
db_train_app.py | 10 months ago | |
dim_classify.py | 11 months ago | |
dim_classify_app.py | 11 months ago | |
formula_process.py | 1 year ago | |
guc_conf.py | 1 year ago | |
heap_sort.py | 10 months ago | |
hm_ir_train_app.py | 1 year ago | |
hnsw_app.py | 11 months ago | |
hnsw_model.py | 1 year ago | |
hnsw_model_train.py | 1 year ago | |
hnsw_retrieval.py | 11 months ago | |
info_retrieval.py | 10 months ago | |
ir_db_establish.py | 1 year ago | |
log_config.py | 1 year ago | |
physical_quantity_extract.py | 11 months ago | |
restart_server.py | 11 months ago | |
retrieval_app.py | 11 months ago | |
retrieval_monitor.py | 11 months ago | |
server_start.sh | 1 year ago | |
setup.py | 1 year ago | |
word_segment.py | 1 year ago |
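Documentation for the examination authority (考试院) duplicate-detection service:
Initialization:
Note: if keyword_mapping.json does not exist, first run
python comparison.py # compute the knowledge-point / physical-quantity ID mapping
python db_train_app.py # clean and vectorize the MongoDB data / compute physical quantities / convert knowledge points to IDs / compute solution types
python hm_ir_train_app.py # initialize the HNSW model / keyword retrieval / formula duplicate-detection model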
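The initialization order can be captured in a small driver script. The following is only a minimal sketch, not part of the repository: the helper names `init_commands` and `run_init` are hypothetical, and it assumes that keyword_mapping.json is looked up relative to the current working directory and that the three scripts take no arguments.

```python
import os
import subprocess
import sys


def init_commands(mapping_path="keyword_mapping.json", python=sys.executable):
    """Return the initialization commands in the order the README lists them.

    Hypothetical helper: the location of keyword_mapping.json is an assumption.
    """
    commands = []
    # comparison.py is only needed when the mapping file is missing.
    if not os.path.exists(mapping_path):
        commands.append([python, "comparison.py"])
    # Data cleaning / vectorization, then model initialization.
    commands.append([python, "db_train_app.py"])
    commands.append([python, "hm_ir_train_app.py"])
    return commands


def run_init(mapping_path="keyword_mapping.json"):
    """Run each step in order; raise CalledProcessError on the first failure."""
    for cmd in init_commands(mapping_path):
        subprocess.run(cmd, check=True)
```

Calling `run_init()` from the repository root (with the `dup_search` conda environment active) would run the steps in sequence and stop as soon as one of them exits with a non-zero status.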
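Startup:
1. Restart all services
conda activate dup_search
python restart_server.py
2. Restart a single service
conda activate dup_search
python restart_server.py 0/1/2/3
where the argument selects the service to restart:
0 restarts the examination-authority question-bank duplicate-detection service
1 restarts the examination-authority question-bank HNSW model retrieval service
2 restarts the multi-dimensional (solution type / difficulty) classification model service
3 restarts the service monitor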
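The argument convention above can be illustrated with a short dispatcher. This is a hedged sketch and not the actual contents of restart_server.py: the `SERVICES` table and the `select_services` function are hypothetical names, and the sketch assumes that at most one index is passed per invocation.

```python
# Service indices as documented in the README; the descriptions are paraphrases.
SERVICES = {
    0: "question-bank duplicate detection",
    1: "question-bank HNSW model retrieval",
    2: "multi-dimensional (solution type / difficulty) classification",
    3: "service monitoring",
}


def select_services(argv):
    """Map command-line arguments to the list of service indices to restart.

    Hypothetical helper: no argument means "restart everything",
    a single index means "restart only that service".
    """
    if not argv:
        return sorted(SERVICES)
    if len(argv) != 1:
        raise ValueError("expected at most one service index")
    try:
        index = int(argv[0])
    except ValueError:
        raise ValueError(f"service index must be an integer, got {argv[0]!r}") from None
    if index not in SERVICES:
        raise ValueError(f"unknown service index {index}; expected one of {sorted(SERVICES)}")
    return [index]
```

In a real script, the result of `select_services(sys.argv[1:])` would drive whatever stop-and-start logic each service needs; that part is omitted here because the README does not describe it.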
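The duplicate-detection functionality consists of four main parts (formula duplicate detection, keyword retrieval, text duplicate detection, and semantic duplicate detection), preceded by a data-initialization step:
0. Data initialization
1. Formula duplicate detection
2. Keyword retrieval
3. Text duplicate detection
4. Semantic duplicate detection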