NotOneWest/sifter-mrc-search-engine

✨ MRC training modules : post-train & fine-tune ✨

mrc-search-engine

mrc-search-engine is a project that develops a Machine Reading Comprehension (MRC) module for use together with a search engine.


Service module run guide

Install Docker

wget -qO- https://get.docker.com/ | sudo sh

Install Nvidia-docker

Set up the repository and GPG key

distribution=$(. /etc/os-release;echo $ID$VERSION_ID) \
   && curl -s -L https://nvidia.github.io/nvidia-docker/gpgkey | sudo apt-key add - \
   && curl -s -L https://nvidia.github.io/nvidia-docker/$distribution/nvidia-docker.list | sudo tee /etc/apt/sources.list.d/nvidia-docker.list

Install Nvidia-docker

sudo apt-get update
sudo apt-get install -y nvidia-docker2

Restart the docker service

sudo systemctl restart docker

Load the Docker image

docker load -i mrc.tar

Run the Docker image (with GPU)

docker run -it --gpus all -p 8000:8000 mrc
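Once the container is running, it can help to confirm that the published port accepts connections before pointing a client at it. The sketch below is not part of this repository: `is_port_open` is a hypothetical helper, and it only assumes that the service listens on host port 8000, as implied by `-p 8000:8000`. It checks TCP reachability only, since the API paths are not listed here.

```python
import socket


def is_port_open(host: str, port: int, timeout: float = 2.0) -> bool:
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and unreachable hosts.
        return False


if __name__ == "__main__":
    # 8000 is the host port published by `-p 8000:8000` (assumed from the run command).
    status = "reachable" if is_port_open("127.0.0.1", 8000) else "not reachable"
    print(f"MRC service port 8000 is {status}")
```

A successful connection only shows that something is listening on the port; it does not show that the model has finished loading.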

Setup without Docker (on an Ubuntu distribution)

Install a JDK (version 1.7 or later) and set the JAVA_HOME environment variable

apt-get install openjdk-17-jdk

Install Mecab

wget https://bitbucket.org/eunjeon/mecab-ko/downloads/mecab-0.996-ko-0.9.2.tar.gz && \
    tar xvfz mecab-0.996-ko-0.9.2.tar.gz && \
    cd mecab-0.996-ko-0.9.2 && \
    ./configure && \
    make && \
    make check && \
    make install && \
    ldconfig

Install Mecab-dictionary

wget https://bitbucket.org/eunjeon/mecab-ko-dic/downloads/mecab-ko-dic-2.1.1-20180720.tar.gz && \
    tar xvfz mecab-ko-dic-2.1.1-20180720.tar.gz && \
    cd mecab-ko-dic-2.1.1-20180720 && \
    ./configure && \
    make && \
    make install

Install Python libraries

pip install -r modules/mrc_service/requirements.txt
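After the manual setup steps above, a quick preflight check can catch a missing prerequisite before the service is started. This is a minimal sketch, not code from the repository: `check_prerequisites` is a hypothetical helper, and it assumes that the JDK provides a `java` executable and that the Mecab build installs a `mecab` executable on the PATH. It does not verify that the dictionary was installed.

```python
import os
import shutil
from typing import Callable, Dict, Mapping, Optional


def check_prerequisites(
    env: Mapping[str, str] = os.environ,
    which: Callable[[str], Optional[str]] = shutil.which,
) -> Dict[str, bool]:
    """Report which manual-setup prerequisites appear to be present."""
    return {
        "java on PATH": which("java") is not None,
        "JAVA_HOME set": bool(env.get("JAVA_HOME")),
        "mecab on PATH": which("mecab") is not None,
    }


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{'OK     ' if ok else 'MISSING'}  {name}")
```

The `env` and `which` parameters exist only so the check can be exercised without touching the real environment; calling `check_prerequisites()` with no arguments inspects the current shell's environment.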

Development environment setup guide

Create a virtual environment (Conda)

conda create --name (name) python=3.8

Activate/deactivate the virtual environment (Conda)

conda activate (name)
conda deactivate

Install packages (CPU)

pip install -r requirements.txt

Install packages (GPU)

pip install -r requirements_gpu.txt

Generate requirements.txt

pip list --format=freeze > requirements.txt

How do I get started?

cd modules
python main.py post-training

To change how training runs, you can pass additional arguments. List them with the following command.

python main.py --help

Service module

This project also provides an API server for using the trained model. For details on the service program, see the service module.

About

Capstone Design Project MRC service module
