LogicKor

한국어 언어모델 다분야 사고력 벤치마크

Benchmark Website

https://lk.instruct.kr/

Note

pr 적극 환영합니다. 벤치마크 결과 Self-Report도 받습니다. issue나 pr 부탁드립니다. 💕

Repository

본 Repo는 LogicKor 벤치마크의 추론 및 평가 코드, 데이터셋을 담고 있습니다.

Evalutation Example

EEVE 템플릿, GPU 0,1 사용, model_len 4096

python generator.py --model yanolja/EEVE-Korean-Instruct-10.8B-v1.0 --template templates/template-EEVE.json --gpu_devices 0,1 --model_len 4096
python judgement.py --model-output yanolja_EEVE-Korean-Instruct-10.8B-v1.0.jsonl --openai-api-key sk-somethingsomething --threads 30

Name		Name	Last commit message	Last commit date
Latest commit History 59 Commits
results		results
templates		templates
README.md		README.md
generator.py		generator.py
generator_vllm.py		generator_vllm.py
judge_template.jsonl		judge_template.jsonl
judgement.py		judgement.py
questions.jsonl		questions.jsonl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

LogicKor

Benchmark Website

Note

Repository

Evalutation Example

About

Releases

Packages

Languages

Kesta-bos/LogicKor

Folders and files

Latest commit

History

Repository files navigation

LogicKor

Benchmark Website

Note

Repository

Evalutation Example

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages