Experiments on French-English
Jetic Gu edited this page Jul 20, 2017 · 7 revisions
CPU: 3.5 GHz 6-Core Intel Xeon E5 with 12 MB L3 cache (Turbo Boost up to 3.9 GHz)
Memory: 32 GB 1866 MHz DDR3 ECC
Data size: 654k sentences for training, 447 sentences for testing.
Source words/tags: [84951]
Target words/tags: [38319]
The table below shows the Average Error Rate for each fraction of the training data (rows) after each training iteration (columns).
All runs use the cHMM model, initialised with a fully trained IBM Model 1 translation table (10 iterations, trained on the same data as the HMM).
Data\Iter | 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 |
---|---|---|---|---|---|---|---|---|---|---|
10% | 0.2412 | 0.2186 | 0.2061 | 0.2005 | 0.1953 | 0.1941 | 0.1949 | 0.1942 | 0.1937 | 0.1951 |
20% | 0.2297 | 0.2082 | 0.1975 | 0.1914 | 0.1889 | 0.1901 | 0.1914 | 0.1896 | 0.1904 | 0.1902 |
30% | 0.2291 | 0.2055 | 0.1962 | 0.1925 | 0.1871 | 0.1889 | 0.1886 | 0.1888 | 0.1872 | 0.1868 |
40% | 0.2244 | 0.2049 | 0.1955 | 0.1904 | 0.1864 | 0.1873 | 0.1882 | 0.1877 | 0.1873 | 0.1882 |
50% | 0.2257 | 0.2052 | 0.1945 | 0.1897 | 0.1875 | 0.1874 | 0.1870 | 0.1855 | 0.1851 | 0.1858 |
60% | 0.2280 | 0.2054 | 0.1959 | 0.1920 | 0.1860 | 0.1889 | 0.1874 | 0.1884 | 0.1875 | 0.1866 |
70% | 0.2269 | 0.2028 | 0.1937 | 0.1889 | 0.1868 | 0.1873 | 0.1875 | 0.1870 | 0.1850 | 0.1848 |
80% | 0.2278 | 0.2015 | 0.1943 | 0.1891 | 0.1857 | 0.1869 | 0.1854 | 0.1852 | 0.1839 | 0.1842 |
90% | 0.2270 | 0.2027 | 0.1945 | 0.1886 | 0.1849 | 0.1854 | 0.1854 | 0.1841 | 0.1832 | 0.1836 |
100% | 0.2262 | 0.2018 | 0.1951 | 0.1891 | 0.1854 | 0.1857 | 0.1850 | 0.1852 | 0.1849 | 0.1848 |
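
To make the table easier to read at a glance, the following is a minimal sketch that finds the iteration with the lowest error rate for a few of the rows. The values are copied from the table above; the names `RESULTS`, `best_iteration`, and `BEST` are illustrative only and are not part of the aligner's code.

```python
# Rows copied from the table above:
# data fraction -> error rate after iterations 1..10.
RESULTS = {
    "10%": [0.2412, 0.2186, 0.2061, 0.2005, 0.1953,
            0.1941, 0.1949, 0.1942, 0.1937, 0.1951],
    "50%": [0.2257, 0.2052, 0.1945, 0.1897, 0.1875,
            0.1874, 0.1870, 0.1855, 0.1851, 0.1858],
    "100%": [0.2262, 0.2018, 0.1951, 0.1891, 0.1854,
             0.1857, 0.1850, 0.1852, 0.1849, 0.1848],
}


def best_iteration(rates):
    """Return (1-based iteration, error rate) of the lowest value.

    Ties are resolved in favour of the earliest iteration.
    """
    idx = min(range(len(rates)), key=rates.__getitem__)
    return idx + 1, rates[idx]


# Hypothetical summary structure: data fraction -> (iteration, error rate).
BEST = {fraction: best_iteration(rates) for fraction, rates in RESULTS.items()}

for fraction, (iteration, rate) in BEST.items():
    print(f"{fraction:>4}: lowest error at iteration {iteration} ({rate:.4f})")
```

For these three rows the lowest value falls at iteration 9 or 10, but the gain over iteration 5 is under 0.003 in each case, so most of the improvement is already reached by the fifth iteration.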