GitHub

There are two python programs here (-h for usage):

-decode translates input sentences from French to English. -grade computes the model score of a translated sentence.

These commands work in a pipeline. For example:

> python decode | python compute-model-score

There is also a module:

-model.py implements very simple interfaces for language models and translation models, so you don't have to.

You can finish the assignment without modifying this file at all. You should look at it if you need to understand the interface to the translation and language model.

The data directory contains files derived from the Canadian Hansards, originally aligned by Ulrich Germann:

-input: French sentences to translate.

-tm: a phrase-based translation model. Each line is in the form:

French phrase ||| English phrase ||| log_10(translation_prob)

-lm: a trigram language model file in ARPA format.

log_10(ngram_prob)   ngram   log_10(backoff_prob)

The language model and translation model are computed from the data in the align directory, using alignments from the Berkeley aligner.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
data		data
Beam.py		Beam.py
Beam_backup.py		Beam_backup.py
LR.py		LR.py
LR_Beam.py		LR_Beam.py
LR_test.py		LR_test.py
OptBeam.py		OptBeam.py
README.md		README.md
answer ver1		answer ver1
compute-model-score		compute-model-score
debug.scala		debug.scala
decode		decode
example		example
input backup		input backup
iter10		iter10
iter100		iter100
iter1000		iter1000
iter_10		iter_10
iter_100		iter_100
iter_20		iter_20
lr_output		lr_output
lr_output1000		lr_output1000
lr_output_eta0.25		lr_output_eta0.25
lr_output_eta0.5		lr_output_eta0.5
lr_output_k1000		lr_output_k1000
lr_output_luckey150		lr_output_luckey150
lr_output_lucky.25		lr_output_lucky.25
lr_output_lucky125		lr_output_lucky125
lr_output_test		lr_output_test
models.py		models.py
models.pyc		models.pyc
test.py		test.py
test_output		test_output
testinput		testinput

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

About

Releases

Packages

Languages

lhuizhan/HW2_decoder

Folders and files

Latest commit

History

Repository files navigation

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages