Stories
Slash Boxes
Comments

News for nerds, stuff that matters

Slashdot Log In

Log In

Create Account  |  Retrieve Password

Russia Weighs Going Cyrillic For DNS

Posted by kdawson on Thu Jan 03, 2008 12:57 PM
from the ru-serious dept.
An anonymous reader writes "The Guardian reports that the Kremlin may start an alternate top-level domain, .rf. According to the story, .ru in Cyrillic translates to .py, the top-level domain for Paraguay, which the Russian government claims leads to confusion. This is similar to a move by China, which has their own .net and .com top-level domains in their native character set along with .cn, .com, and .net in ASCII." Hindering Paraguayan hackers may matter less to the Russian government than establishing greater control over a walled-off Internet.
+ -
story

Related Stories

[+] Technology: The Kremlin Tightens Its Grip on the Internet 280 comments
reporter writes "According to a report just published by "The Washington Post", the percentage of Russian adults having access to the Internet has risen from 8% in 2002 to 25% in 2007. This growth has attracted the attention of the Kremlin. Its allies are creating pro-Kremlin web sites and are purchasing web sites known for high-quality independent journalism. Pro-Kremlin bloggers have used their skills to bury news about anti-Kremlin demonstrations: at Russian news portals, web links to news about pro-Kremlin rallies consistently rank higher than web links to news about anti-Kremlin demonstrations. The most disturbing development is that the Kremlin intends to develop a Russian Internet which is separate from the global Internet. Russian officials are studying the techniques that the Chinese use to censor the Internet."
This discussion has been archived. No new comments can be posted.
The Fine Print: The following comments are owned by whoever posted them. We are not responsible for them in any way.
 Full
 Abbreviated
 Hidden
More
Loading... please wait.
  • Great!!! (Score:2, Interesting)

    by Anonymous Coward
    It's great that nations can use their own languages instead of being forced to use alien Latin-English characters.
    • Re: (Score:2, Informative)

      It's great that nations can use their own languages instead of being forced to use alien Latin-English characters.

      In this case, the characters are exactly the same. It's just that 'p' (pronounced 'pee' in English) is the letter 'er' in Russian, and 'y' (pronouced 'why' in English) is the letter 'oo' in Russian. So .ru to us is literally .py to them.

      Cyrillics has a number of Greek letters sprinkled in, but in this instance it is of no help.

      • Re: (Score:3, Interesting)

        they are not the same, they just look very similar
          != py
        • Re:Great!!! (Score:4, Informative)

          by Maimun (631984) on Thursday January 03 2008, @02:27PM (#21899060)
          They ARE the same. Trust me, I am Bulgarian and we also use the Cyrillic alphabet. The Cyrillic alphabet was created in the 9th century by Constantine, a Byzantine friar (I dunno if this is the correct term) serving the emperor in Constantinopol. The church name of Constatine was Cyrill, that is where the name of the alphabet came from. At that time, both Rome and Constantinopol were trying to convert the Slavic states to Christianity. The Eastern Roman Empire, a.k.a. Byzantia, was more flexible than the Catholics: she offered Christianity in the native Slavic languages, while the Catholics insisted on using Latin. The Cyrillic alphabet was introduced precisely for that purpose. It was modified Greek alphabet (Greek was, of course, was the language of the East Roman Empire) with symbols added for those Slavic sounds that had no Greek equivalent. Intially it was adopted in Bulgaria and after about a century or two it was adopted by the Russian proto-state -- in contrast to the Russian myths that the Cyrillic alphabet was first introduced in Russia and even invented in Russia.

          The initial Cyrillic alphabet looked quite different from what is used today in Russia and Bulgaria; the appearance of the modern Cyrillic alphabet is due to a reform by Tzar Peter I of Russia. Peter I imposed visual style similar to the one of the Roman font.

          BTW, the Cyrillic alphabet was not the only creation of Constantine-Cyrill. He had invented another alphabet to be used by the Slavs which was called "glagolitsa" and visually was totally different from the Cyrillic one. This radical design was not very successful, although I've heard it had been used in Croatia until 2-3 centuries ago.

          Here is a four-column table of the original Cyrillic alphabet [wikimedia.org] and the Glagolic one ("glagolitsa"). The first column is the name of each letter (yes, each one had a name; if the names are read sequentially they form a saying, quite deep and meaningful at that), the second is the cyrillic glyph, the third is the glagolic glyph, the fourth is the numeric value.

          • Re: (Score:3, Interesting)

            I live in Poland, more specifically in Przemysl on the Ukrainian border so I'm exposed to both alphabets more or less daily. I must confess, I envy the Easterners! The Latin alphabet is really not suited to Slavic tongues and I think the Cyrillic one is a far superior way to render them. For example, in Cyrillic you get one nice little letter looking like w with a tail, whereas we get szcz... if you're an English speaker, it'd be something like the sh ch between freSH CHeese. Anyway, the inadequacies of the
          • Re: (Score:3, Interesting)

            Sts Cyril and Methodius did not invent the Cyrillic alphabet. They invented only the Glagolitic alphabet. The Cyrillic alphabet was invented in the Kingdom of Bulgaria nearly a century later.
          • Re:Great!!! (Score:5, Funny)

            by Dogtanian (588974) on Thursday January 03 2008, @01:52PM (#21898448) Homepage

            Actually one of big advantages of Microsoft was internalization.
            You mean that it was possible to shove them up your ass?
          • Re: (Score:3, Informative)

            Slashdot is lame like U**x in 1980 and ate the characters you typed.

            Actually, Slash (the engine behind Slashdot) does exactly the right thing, converting any out-of-latin-1 characters into HTML-encoded characters such as &#041F;

            However, it also eliminates these from display because of the confusion that people use them to inject (e.g. mis-spelling a domain name with Cyrillic characters so that when someone cuts-and-pastes it, their session can be hijacked). It's a specific security feature used on MANY sites which are intended for English-language discussion.

            Actually one of big advantages of Microsoft was internalization.

            MS jumped

            • Re: (Score:3, Informative)

              The characters are not displayed in the same way

              As I said, it depends on your font. In Arial, they are pixel for pixel. In Courier, they have slightly different shapes. Either way, it doesn't really matter. Very few people will notice the font differences. Why? Because they are the same characters. The fact that a computer provides two copies of the same character, actually causes as many problems as it solves.
            • Re: (Score:3, Interesting)

              they are not the same character. Not historically

              And yes, they are the same character, historically speaking. Both characters were borrowed from a common Greek/Semitic ancestry. Cross pollination of Latin and Cyrillic languages have lead to Cyrillic renderings of the letter that are more or less the same as the Latin rendering.

              http://en.wikipedia.org/wiki/Y [wikipedia.org]
              http://en.wikipedia.org/wiki/%D0%A3 [wikipedia.org]

              http://en.wikipedia.org/wiki/P [wikipedia.org]
              http://en.wikipedia.org/wiki/%D0%A0 [wikipedia.org]

      • Re: (Score:2, Informative)

        by Anonymous Coward
        No, the characters only look the same to a human eye. To a computer they would look quite different:

        English "py" is keycode U+0070, U+0079
        Russian "py" is keycode U+0440, U+0443

        Of course, the whole internationalization issue wouldn't be an issue if ICANN didn't have their head up their collective ass.
        • Re:Great!!! (Score:5, Interesting)

          by Maimun (631984) on Thursday January 03 2008, @02:38PM (#21899248)

          No, the characters only look the same to a human eye. To a computer they would look quite different:
          This is precisely why Cyrillic symbols are not used in DNS. It is possible to have two URLs, one having latin letters only, the other one latin and cyrillic, that look exactly the same in most fonts but are completely different as strings, so if they are resolved by DNS they'd resolve to distinct IP addresses. This is just perfect for phishing attacks: you can't tell whether www.mybank.com is the URL of your bank "MyBank", or it has a Cyrillic "a" and is registered by the attacker, by simply lookong at it. To tell if the URL is genuine one must examine it with hex editor ro something...
    • Re:Great!!! (Score:5, Interesting)

      by Sigismundo (192183) on Thursday January 03 2008, @01:27PM (#21898042)

      Not sure why the parent has been modded flamebait. It's probably the phrase "alien Latin-English characters", but it's actually an accurate description of how a domain name might appear to speakers of non-European languages.

      I wasn't aware that China had already began experimenting with Chinese characters in domain names, so I did some Googling. Here is a link [cnnic.cn] (in English) that describes how to register a Chinese Domain Name (CDN). It makes for a pretty interesting read. It includes the predictable clause that you can't register CDNs that "harm the glory of the state." Users of CDNs are encouraged to use "Official Client-end CDN Software" to make access more convenient. I wonder exactly what this does.

      In general I think it's pretty cool to be able to have non-ASCII characters in domain names, but it seems to introduce a lot of extra compexity into DNS. Also, it seems like it could open the door for more governmental control of the internet, as TFA mentions.

      • I take it there is some good reason why a new but backwards-compatible version of DNS can't be released that uses unicode? Never mind Cyrillic, or Chinese characters, I want a domain name in Tengwar!
  • by mr_mischief (456295) on Thursday January 03 2008, @01:04PM (#21897528) Journal
    You can't really translate between 'r' and rho. It's a character set issue. It's a straight equivalency of sounds. Cyrillic is based on the Greek alphabet and the English alphabet is based on the Latin alphabet. It could be confused with Paraguay because of the character encoding, but it's not really the same letters.
    • From the user standpoint, it's a distinction without a difference. In most fonts, Latin "py" is not readily distinguishable from Cyrillic "ru." However, I would argue that confusion is more likely with the proposed Cyrillic domain names than with the current all-ASCII system. I am sympathetic to the desire for more localization, but the ramifications of a change like this should be considered very carefully.
      • Re: (Score:2, Informative)

        I'm not sure what you're asking, but I've always heard of conversion between scripts (i.e. writing systems) being called transliteration.
  • by savuporo (658486) on Thursday January 03 2008, @01:04PM (#21897532)
    i think this is a specially engineered news post to bring out the lamest "in soviet russia" jokes of slashdot. bring it on!
  • by trolltalk.com (1108067) on Thursday January 03 2008, @01:08PM (#21897598) Homepage Journal

    In Soviet Russia, DNS blocks YOU.

    ... which is the whole point of "greater control".

    • Re: (Score:3, Funny)

      In Soviet Russia, they are so tired of this joke.
      • If they can block links to myminicity, I for one would welcome our new Russian DNS Overlords!

        Better yet, lets give them myminicity.ro or rf, or whatever, and let let them build their own gulags.

  • In Soviet Russia, the domains name you!
  • by edwardpickman (965122) on Thursday January 03 2008, @01:13PM (#21897692)
    and prevent foreign outsourcing of Russian web site construction they plan to launch a version of HTML in Cyrillic. Soon to be followed by C++ in Cyrillic. Microsoft decided it was a niffty idea so they plan to start a Pig Latin based coding language called "Squeal Like".
    • HTML in Cyrillic...

      You can write XML with Cyrillic tags. XML with tags in Mandarin Chinese shows up now and then.

    • Re: (Score:3, Interesting)

      This is why we need "common" as a language choice! Go ahead and keep your individual languages (English, French, Goblin) but also have a "Common" language for all people. Like in Firefly everyone spoke a little English and a little Chinese to create a language of the people...

      I fear that it would create more and bloodier Wars than ever before though.
    • Funny you should mention outsourcing. Last time I checked, Russia annually issues about 7,000,000 work visas for foreigners.

      Now compare that to all the bitching about 60k H1Bs...
    • Soon to be followed by C++ in Cyrillic.

      When we studied programming in high school, we used a language called "Ershov" (last name of the textbook's author), which was really Pascal translated to Russian.

      I don't think, there was an actual compiler, though — nor did we have (enough) computers. Our little code-snippets were checked by the teacher by hand...

      "One laptop per child"? Right...

      In the American college, our professor was quite fond of (then brand new) Java. Among the advantages, he listed

  • to put a wall up across the internet.
    Also the reason I do not want changes to how the internet 'works'.
    It seems every change someone comes up with is designed to put a wall up someplace.

    • How?
      It is just a nameservice. If russia decides they want a top level .manyspecialcharacters, google will buy/register the domain name google.manyspecialcharacters, just like they bought/registered google.ru. Russia will get some money, and everyone is happy - especially Russia. You can still call it google.com, or just 72.14.207.99.
  • How long? (Score:5, Funny)

    by A beautiful mind (821714) on Thursday January 03 2008, @01:17PM (#21897798)
    How long until someon registers rm.rf ?
    • My first thought was 'tm.rf'--in Soviet Russia, The Manual, um, Reads... wait...
    • Re: (Score:3, Insightful)

      Bah, I can think up some that are way cooler. Let's see here:

      rt.fm
      poop-s.coop (a real TLD by the way)
      pen.is (BIC's homepage in Iceland?)
      vagi.na
      got.root (also real)
      Eat-sh.it
      sniff.co.ck (real TLD)
      Give-a-fu.ck
      por.no
      s.cat
      free.blow.jobs
      felat.io
      sc.um

      goat.se (deserves an honorable mention I guess).
  • Just me (Score:4, Interesting)

    by Rinisari (521266) on Thursday January 03 2008, @01:18PM (#21897814) Homepage Journal
    Is it just me, or does it seem like the article is really blowing this out of proportion? From my understanding, the Russian government just wants to add a .rf (well, . if I'm remembering Cyrillic correctly). That's it. Users with Cyrillic keyboards will be able to access those sites without a problem, and those of us with non-Cyrillic keyboards will have to either use a character map program or temporarily switch keyboard layouts (as I just did).

    Is that it, or am I missing something?
  • by gstoddart (321705) on Thursday January 03 2008, @01:23PM (#21897940) Homepage
    As it is I see spam which has Chinese characters embedded in what appears to be a google URL, but which I strongly suspect isn't.

    I fear the more we see unicode bytes in URLs the more it will open up people to vulnerabilities as they click on very innocent looking links.

    Hopefully the browsers can keep up with this.

    Cheers
    • There have already been some browser fixes, mainly triggering cases where characters from different scripts appear next to each other. That certainly breaks some valid cases as well, but I guess it's bearable. (So you can't just switch a single o in some domain name to a Cyrillic o and get it to show almost indistinguishably... or at least that's the idea.)
  • by athloi (1075845) on Thursday January 03 2008, @01:23PM (#21897960) Homepage Journal
    It's a smart move. Russia has already demonstrated that it wants to be a superpower again, which means that its main competition is China and the USA.

    It has to keep up with China's level of control, and not leave the internet in the hands of the USA, if it can.

    Again Putin demonstrates a smart interpretation of Machiavellian Realpolitik while no one else yet realizes the Cold War is back on.
    • by dusanv (256645) on Thursday January 03 2008, @02:35PM (#21899208)
      Or maybe, just maybe, they only want Cyrillic characters in URLS. ASCII isn't suitable for majority of the world so brace yourself for more of this in future.

      The article is loaded with bs like this brownish pearl:
      Kleinwachter says the speculation is that people will need a password authorised by government agencies to use the global internet.

      How the fsck did he deduce that from introduction of Cyrillic DNS?

  • Icons for Victory (Score:4, Interesting)

    by Doc Ruby (173196) on Thursday January 03 2008, @01:38PM (#21898196) Homepage Journal
    I'd like the URLs in my GUIs to be displayed in their frame with an icon indicating their character set, and colored if in a character set different from my GUI default. If I had that, I'd like to see "native" glyphs without fear that they're decoys. Even though such a system would no longer force most content publishers to deliver content in my own privileged native character set.
  • internet walls (Score:3, Insightful)

    by pembo13 (770295) on Thursday January 03 2008, @01:44PM (#21898280) Homepage

    Hindering Paraguayan hackers may matter less to the Russian government than establishing greater control over a walled-off Internet.

    I don't really have a problem with government's filtering the internet of their own citizens -- let their citizens deal with that. When I don't like it is when a government want to control/monitor the the internet usage of other citizens.

  • Trouble ahead? (Score:3, Interesting)

    by Duncan Blackthorne (1095849) on Thursday January 03 2008, @01:50PM (#21898390)
    I may not be looking at the whole picture here, but isn't this sort of decision going to have a tower-of-babel-like effect? Are search engines going to be able to index sites using the alternative character sets? Isn't there at least some risk of two different sites at least appearing to have identical URLs? Or is this really an attempt by countries like Russia and China to selectively cut their populations off from the public internet while not in actuality doing so? Don't get me wrong, I'm not saying that American English should be imposed on the rest of the world (I'm not that guy!), but the system in place was founded on such and I see this really mucking up the works..
  • by Quiet_Desperation (858215) on Thursday January 03 2008, @01:58PM (#21898564)
    I'm registering my next domain in Klingon.

  • by DaleGlass (1068434) on Thursday January 03 2008, @03:08PM (#21899716) Homepage
    If the domain name contains characters not from the system's character set, highlight them (with another color say), and warn the user.

    It's not a new problem either, "slashdot", and "sIashdot" will look the same in many fonts.
    • Re:Further Proof (Score:5, Insightful)

      by jacquesm (154384) <j AT ww DOT com> on Thursday January 03 2008, @01:34PM (#21898136) Homepage
      Hm, troll ? Maybe, maybe not. When I was 14 or so one of my main motivations in learning english was to be able to work better with computers, all the books I could find where in english. In the early 80's when everybody was too busy solving problems instead of customizing their desktop and putting the right accents on letters that are unambiguous anyway.

      The PC, the web and the laser printer changed all that. Mainframe printers were mostly 'chain' printers with a very limited (EBCDIC) character set, not much chance to get your fancy local script there, so people worked around it and on the whole were ok with the solutions.

      Now we get top level domains with all kinds of accents in them and completely local scripts. This 'internationalization' of computing is a good thing for many people because they can now access the digital world in their own language, but at the same time it removes us one step from having a universal language, and the web could have easily given us that holy grail. Because not to be part of the cyber community or learning English ? It would have been an easy choice for most, one or two generations and English would have become a de-facto world standard.

      The situation we have right now will long term probably mean that the amount of content on the net will be proportionally spread out over the various languages, with English only being a (slightly) disproportionally high fraction.

      That universal language window of opportunity is probably lost for a long time, whether it ever was a serious possibility if of course open to debate, I for one had some hope that it was.

      • I agree.....granted, I speak English natively (and have no secondary language), I would have liked to see a universal language (not required to be English, but it is sort of the de facto standard) emerge and the web would have been the perfect vehicle for forcing it.

        Layne
    • While the TCP/IP protocol suite was largely developed by DARPA, much of what the Internet is today (WWW) started at CERN [www.cern.ch] in Switzerland.

      So there.

      ...laura