For the first evaluation, the 20 most spoken international languages shall be used. Data is from Wikipedia: https://en.wikipedia.org/wiki/List_of_l ... f_speakers and Wikipedia uses data from the 2019 edition of the Ethnologue https://en.wikipedia.org/wiki/Ethnologue
Not only the mother tongue but also the second language should be taken into account. However, the second language should be considered less important than the mother tongue. I decided to consider second languages as 50%.
I corrected the data from the Ethnologue 2019. I had to correct many languages.
The African language Hausa would be placed 18th. But unfortunately I did not find text corpora in Hausa. If anyone finds some, we could use it. As I had to exclude Hausa, Cantonese is now placed 20th.
Optin - Languages Included
Re: Optin - Languages Included
Current Progress: