# UltraSearch **Repository Path**: yangjingla_admin/UltraSearch ## Basic Information - **Project Name**: UltraSearch - **Description**: 世上本没有奥特曼 - **Primary Language**: Unknown - **License**: GPL-3.0 - **Default Branch**: master - **Homepage**: None - **GVP Project**: No ## Statistics - **Stars**: 2 - **Forks**: 2 - **Created**: 2021-05-07 - **Last Updated**: 2022-10-25 ## Categories & Tags **Categories**: Uncategorized **Tags**: None ## README # UltraSearch ## 1. 编译安装 ``` git clone https://gitee.com/yangjingla/UltraSearch.git cd UltraSearch && mkdir build && cd build qmake .. && make sudo make install ``` ## 2. 环境准备 >Ultrasearch环境准备分为Qt和python两个部分 - Qt-5.12.8 ``` sudo apt-get install qt5-default sudo apt-get install debhelper pkgconf libgsettings-qt-dev qtbase5-dev qt5-qmake qtchooser qtscript5-dev qttools5-dev-tools libxapian-dev libquazip5-dev libglib2.0-dev libkf5windowsystem-dev libgsettings-qt-dev libqt5x11extras5-dev libuchardet-dev libpoppler-qt5-dev libukui-log4qt-dev openjdk-8-jdk ``` ------------------------- ### Python环境准备 - STEP1: - 准备python 3.6(建议使用conda安装环境)与java环境 ``` conda create -name ultra python=3.6 conda install -c conda-forge openjdk jpype1=0.7.0 -y ``` - 由于pyhannlp依赖于java环境,如果本级没有需要安装java,测试机环境为: ``` openjdk version "1.8.0_292" OpenJDK Runtime Environment (build 1.8.0_292-8u292-b10-0ubuntu1~20.10-b10) OpenJDK 64-Bit Server VM (build 25.292-b10, mixed mode) ``` - STEP2: - 安装python依赖包(在初次运行server.py的时候会需要网络下载预训练模型): ``` pip install -r requirements.txt ``` - requirements.txt ``` tensorflow==1.12 googletrans==4.0.0rc1 pyhanlp==0.1.79 pdfminer.six fitz==0.0.1.dev2 PyMuPDF==1.18.15 Pillow==8.1.0 python-docx==0.8.11 paddleocr==2.3.0.1 paddlepaddle==2.1.1 LAC==2.1.2 shapely scikit-image==0.17.2 imgaug==0.4.0 pyclipper lmdb tqdm numpy visualdl python-Levenshtein opencv-contrib-python==4.4.0.46 lxml premailer openpyxl ``` - STEP3: - im2txt预训练模型下载( 此im2txt.zip解压后,将文件夹名字改为model,放于/UltraSearch/im2txt/目录下) - [百度网盘提取码链接:khtx](https://pan.baidu.com/share/init?surl=nYlWBoOQjzgP5-GYfutMQA) - https://pan.baidu.com/share/init?surl=nYlWBoOQjzgP5-GYfutMQA - - STEP4: - python server.py - STEP5: - 执行 .imageapp ## 3. 使用说明 >在使用本搜索软件前请先将优麒麟20.04自带的“搜索”应用卸载。 ### 1)运行本地服务器 ``` python3 UltraSearch/im2txt/server.py ``` 等待出现“本地服务端启动完毕” ### 2)运行编译后的可执行文件 ``` /usr/bin/UltraSearch ``` ### 3)在弹出对话框中确认建立索引 ## 4. To do - 增强摘要模型检查的准确性和流畅性 - 支持更加丰富的文档类型,如 etc. - 支持视频及其他多媒体搜索 - 减少软件依赖,优化代码结构