Most people (more than 95%) type Bhartiya languages using transliteration. There are many disadvantages of typing using transliteration compared to direct typing using inscript keyboard. This repo is for comparision between transliteration and InScript method of typing.
This repo calculates the average number of keystrokes required to accurately type the given text of Indian Languages.
- We first convert a word from our native script to Roman equivalent and then transliterator converts it back into Bhartiya Scripts.
- Transliterators are not 100% accuracte. In fact, it becomes highly inacurate when we use uncommon words.
- Some words just cannnot be written with transliteration. Try writing our national song : 'Vande Mataram' using it. We can only write words which are in the training dataset of some tech company. We cannot create new vocabulary in our languages. In addition, we cannot share words among Bhartiya Bhashas, e.g. I cannot write word 'आत्मोचित'(meaning Appropiate for Atma and which is proper word is Hindi and Samskrit) - in Gujarati Script using Microsoft Window's transliteration.
- We also make mistakes in mental conversion of native script to roman script.
- We need to use more keystrokes to write same word. Ex. To write चिराग using Inscript Keyboard, I need 5 keystrokes (च,ि,र,ा,ग). Transliteration would require 6 keystrokes (c,h,i,r,a,g).
- This method of Input is not entierly free. It is free only as long as you are on specific platform. The platform company has other ways of making money while you are on their platform. For example:
-
Microsoft provides transliteration service free as long as you are using Windows. But Microsoft charges upfront to let us install Windows. If we want to use it outside the windows, then we would be required to use their paid API : https://azure.microsoft.com/en-in/pricing/details/cognitive-services/translator/
As of now it will cost us ₹815 per 1 million character. The 167 line document that I have uploaded would cost us around ₹18 to type. -
Similarly, Google's transliteration, Google Input Tools https://www.google.com/inputtools/try works only within Google's platforms and Google Apps. Each character we type on these apps/platforms goes to Google's servers. Google knows how to make money using our data.
Until few years ago, Google used to give their Input Tools as a free standalone application on Windows. Google has stopped the windows app as it no longer suits their business model (or they have collected enought data, why give it free now). You need to use Chrome if you want to type using Google Input Tools. Again, Chrome is a commercial product.
-
- We will get inconsistent typing experince if we switch platforms. e.g. As of now(11/02/2023): For character sequence 'j a n', Microsoft gives "જન (जन)" as a first prediction, where as Google Input Tools gives 'જાણ (जाण)'. Once we get used to typing with specific platform, it will act as a deterrent to our wish to switch platform.
- The platform companies are pushing transliteration instead of giving hardware solutions because they wants us to get addicted to the their platform's transliteration. They can make money as long as we are on their platform.
If we start using direct typing using Inscript Keyboards then we would not face any of the issue.
- InScript (short for Indic Script) is the decreed standard keyboard layout for Indian scripts using a standard 104- or 105-key layout.
- This keyboard layout was standardised by the Government of India for inputting text in languages of India written in Brahmic scripts, as well as the Santali language, written in the non-Brahmic Ol Chiki script.
- It was developed by the Indian Government and supported by several public and private organisations.
- This is the standard keyboard for 12 Indian scripts including Devanagari, Bengali, Gujarati, Gurmukhi, Kannada, Malayalam, Odia, Tamil and Telugu, among others.
- The InScript layout is built into most of the major operating systems including Windows (2000 and later), and most Linux and Mac OS systems.
The InScript keyboard layout looks like below:
- It requires less key strokes compared to transliteration.
- It is 100% accurate. It is direct typing method. It will type what you want. It can type our national song easily.
- Inscript is designed specifically for Bharatiya languages. Indian Languages follow consonent-vowel sequence. Left side of keyboard has Swaras and right side has Vyanjanas. Therefore, we can type faster with alternate hand action.
- The position of alphabets is same across all the Indian languages. Therefore, if one learn to type in Telugu, s/he can type in Marathi also.
- It is very easy to learn as it uses inherent logic of Bharatiya alphabets.
For Example, look at the below image :- Consonents are arranged in verticle postion as per their Varga (वर्ग) as shown using red line. प Varg = (प,फ,ब,भ), क Varg = (क,ख,ग,घ), त Varg = (त,थ,द,ध), च Varg = (च,छ,ज,झ), ट Varg = (ट,ठ,ड,ढ).
- महाप्राण (aspirated) consonents are positioned in the shifted position of their corresponding अल्पप्राण (non-aspirated) consonents. e.g. (क,ख), (प,फ), and (स,श) etc.
- Short and long vowel signs (चिह्व) are arranged in verticle position as shown in blue line. e.g. (्,ा),(ि,ी), (ु,ू),(े,ै) and (ो,औ).
- The Vowel character is located in shifted position of their corresponding sign. e.g. (्,अ), (ा,आ), (ि,इ), (ी,ई), (ु,उ), (ू,ऊ), (े,ए), (ै,ऐ), and (ो,ओ) (ौ,औ).
- Less mental effort as it is no longer required to convert every word into roman script in mind.
In my testing with Gujarati, transliteration required 21.58% more keystrokes.
Typing time and number of keystrokes are highly correlated. Therefore, we can conclude that - "typing with transliteration requires 21.58% more time."
Let us remember, that typing with transliteration also requires us to do mental conversion of Bhartiya Script to Roman Script. We can assume that this conversion requires non zero amount of time as with the any mental process. Therefore, we would now have to update our conclusion:
Conclusion: Typing with transliteration would require at least 21.58% more time.
Why we don't have Bharatiya alphabets on keyboards? Rest of the world have it. Almost all countries have their own keyboard layouts. The image shown in title is Thai Magic Keyboard from Apple.
This is a classic argument made by pro-english when it comes to Bharat's Languages.
The argument is : Since India is a multilingula country, English serves as a neutral language. Developing anything in more than one language is not possible.
The inherent assumption in this argument is each nation should have exactly one keyboard layout. Also they assume that the same principle (one nation one keyboard layout) is followed in the other countries as it is not practical for a manufacturer to sell keyboard with more than one layout in one country. Nothing can be farther from truth than this belief.
Let's verify this belief taking the example of Apple which is known to be the least customization-friendly.
Apple sells macbooks with 16 keyboard layout options in USA. https://www.apple.com/shop/buy-mac/macbook-air/midnight-apple-m2-chip-with-8-core-cpu-and-8-core-gpu-256gb
And it does not end here: It sells macbook with 17 keyboard layout options in Belgium, 10 in Czech Republic, 16 in Denmark, 18 in Germany, 17 in Spain, 17 in France, 18 in Ireland, 17 in Italy, 17 in Luxeumbourg, 11 in Hungary, 16 in Netherlands, 15 in Norway, 16 in Austria, 5 in Poland, 16 in Portugal, 17 in Switzerland, 15 in Finland, 16 in Sweden, 4 in Turkey, 18 in UK, 7 in Australia, 8 in China, 7 in Hong Kong, 7 in Japan, 4 in Korea, 5 in Malasia, 7 in New Zealand, 5 in Singapore, 5 in Taiwan, and 4 in Thailand, 4 in UAE.
According to https://support.apple.com/en-in/HT201794, Apple sells their laptops with 40 distinct keyboard layouts.
Figures from only Apple are shown here as an illustration only. It is not the only company which customize their product based on local needs. Other manufactureres are also localizing their products.
Before someone argue that India is a poor country and beggers have no choice, let me state that Maharashtra's GDP is more than double of Hungary's GDP (11 layout options). Even Gujarat has higher GDP than Hungary. Before you think that our laptop market is small, let me infrom you that India sells more PCs in 3 months than the populations of Norway. https://www.canalys.com/newsroom/india-PC-market-Q1-2022
No. While it is true that manufatureres sometime exploit their monopoly position in the market, they do responds to local needs. In our case, there is no demand for bilingual Keyboards. I am not sure if Indians would appreciate the presence of their mother tongue characters on Keyboards. We have internalized the myth that anything modern has to be in English. Manufactueres are just exploiting our colonized mind.
- Mesaure the accuracy of Microsoft Transliteration.
- Calculate the character frequency of Indic characters: unigram and bigram
- Adjust the count for few characters
- Add more languages.