I am a researcher of economics. I need to get specified the Turkish people names from a broader global people names list.
I expect it to be done in one week.
There are two huge name lists in txt format. One is a complete name and surname list of Turkish people. The other list is a list of names of people from different countries. I would like to know which people in the second list have Turkish names - surnames. The Turkish surnames are unique. Names are not that much unique. So matching the surnames would work. And I need the ratio of male to female population for each names (not surnames) (there is gender info in the Turkish citizens list). I also need the frequencies of both names and surnames (separately) in the Turkish citizens list. There is also a problem needed to be solved. Some of the Turkish names in the Global names list are written with English characters and the same names can also appear in Turkish characters. For example: There exists the name "Gokhan (with English characters)" and also the same name may exist as "Gökhan (with Turkish character of "ö")". And actually these are the different people with a same name. The problem is the names in the Turkish names list are all written with Turkish characters. For example there is not a name such as "Gokhan" in the Turkish names list (which there is in the global names list) but there is "Gökhan" in the Turkish names list. Actually there are only 7 letters in Turkish alphabet that don't appear in the English alphabet but all of them has only one unique equivalent letter in English alphabet. For example the equivalent letter for "ö" is always "o".
21 freelance ont fait une offre moyenne de 144 $ pour ce travail
Hi. I am a Turkish native. During my career I handled many contact lists like yours. I am an expert in Turkish names, and I am aware of the Turkish characters trap. I can fix your lists at requested.