The Hotness
Games|People|Company
Dungeon Crawl Classics Role Playing Game
Briarwood Castle
Parsec
Forgotten Realms Campaign Set
Pokéthulhu Adventure Game (2nd Edition)
Pathfinder Roleplaying Game: Beginner Box
The One Ring: Adventures over the Edge of the Wild
Reforger
Run out the Guns!
Pathfinder Roleplaying Game Advanced Race Guide
Shards of the Shattered Universe Core Rules
Chill Horror Role-Playing Game
The Slayer's Guide to Demons
Horrors of the Z'bri
Shadows of Yog-Sothoth (2nd Edition)
Pathfinder Roleplaying Game Bestiary
Into the Shadowhaunt
Lamentations of the Flame Princess (Grindhouse Edition)
Marvel Heroic Roleplaying: Basic Game
Busca Final
Yiffpunk
Call of Cthulhu (6th Edition)
Dungeons & Dragons Set 1: Basic Rules
X1: The Isle of Dread
ΑΓΩΝ
World of Darkness
Dice
Hellfrost Player's Guide
Book 02: Fire on the Water
B5: Horror on the Hill
Citybook VI: Up Town
Pathfinder Roleplaying Game Core Rulebook
Castle Falkenstein
Dave Arneson's Blackmoor: The First Campaign
Dark Dungeons
Legends of the Ancient World
Diaspora
PC Pearls: A Collection of Character Inspiration
Warhammer Fantasy Roleplay Core Set
Usagi Yojimbo
Savage Worlds: Test Drive V6!
In Search of Adventure
Monstercology: Orcs
Sartar: Kingdom of Heroes
Dragon Age RPG, Set 1
GURPS Ultra-Lite
The Edge of Night
By the Sword: Dueling in Realms of Fantasy
Floor Plan 2: The Great Salt Flats
Mouse Guard Roleplaying Game Box Set
Recommend
5 
 Thumb up
 Thumb up
16 Posts

RPG» Forums » RPGG Related » RPGG Bugs

Subject: (minor) No support for non-latin URLs? rss

Your Tags: Add tags
Popular Tags: [View All]
peter pan
Poland
Warsaw
mazowieckie
Avatar
mbmbmbmbmb
http://en.wikipedia.org/wiki/Inne_pieśni

[url= http://en.wikipedia.org/wiki/Inne_pieśni]the same situation[/url]

This works http://en.wikipedia.org/wiki/Inne_pie%C5%9Bni but that's not that easy to build without prior knowledge.
2 
 Thumb up
 tip
 Thumb up
Chris Talbot
Canada
Fort Smith
Northwest Territories
flag msg tools
Be seeing you... -Alphonse
badge
Avatar
mbmbmbmbmb
Just link without the accent.

http://en.wikipedia.org/wiki/Inne_piesni

Goes to the exact same page.

Chris
2 
 Thumb up
 tip
 Thumb up
peter pan
Poland
Warsaw
mazowieckie
Avatar
mbmbmbmbmb
That's workaround, not a solution. It works only because there actually exists a page for Inne pieśni without accent and it is a redirection. Wikipedia is an example which shows the underlying problem.

Try this:

http://pl.wikipedia.org/wiki/Inne_piesni
and this
http://pl.wikipedia.org/wiki/Inne_pieśni
and this
http://pl.wikipedia.org/wiki/Inne_pie%C5%9Bni

 
 Thumb up
 tip
 Thumb up
  • Last edited Wed Apr 27, 2011 12:30 am (Total Number of Edits: 1)
  • Posted Wed Apr 27, 2011 12:29 am
    • Choose your Dice
      • Roll
      • Comment (Optional)
    • QuickReply
    •  
    • QuickQuote
    •  
    • Reply
    •  
    • Quote
William Hostman
United States
Eagle River
Alaska
designer
Gaming in Greater Anchorage area, Alaska since 1978. Looking for Indy-willing RPG players in Eagle River (or willing to drive to Eagle River). Geekmail me if interested.
badge
Yes, this really is what I looked like when I uploaded that avatar. Not that it's quite current anymore.
Avatar
mbmbmbmbmb
Only the version with the %-codes is a valid url; the others are pseudo-URLs, and there is no HTML requirement to ever support them. Most URL handlers now do automatically convert the characters to the %-codes, but the internet protocols don't actually use non-latin nor accented latin characters.

Internet addresses are, properly, 7-bit ascii.
3 
 Thumb up
 tip
 Thumb up
peter pan
Poland
Warsaw
mazowieckie
Avatar
mbmbmbmbmb
Impressive.

I'm saying that the expected copy-paste behaviour does not work here.
You are saying that's OK because it is not required
You see that you are missing the point here, don't you? The UI is about friendliness and ease of use, not about doing only what is necessary.
5 
 Thumb up
1.00
 tip
 Thumb up
Dave Bernazzani (@rpggeek)
United States
Plainville
Massachusetts
flag msg tools
I wish to provide legendary service to the RPG community to help grow our hobby and enrich the lives of gamers everywhere.
badge
Avatar
mbmbmbmbmb
nimdil wrote:
The UI is about friendliness and ease of use, not about doing only what is necessary.

This may be my new motto around here! Well said.

-Dave
2 
 Thumb up
 tip
 Thumb up
William Hostman
United States
Eagle River
Alaska
designer
Gaming in Greater Anchorage area, Alaska since 1978. Looking for Indy-willing RPG players in Eagle River (or willing to drive to Eagle River). Geekmail me if interested.
badge
Yes, this really is what I looked like when I uploaded that avatar. Not that it's quite current anymore.
Avatar
mbmbmbmbmb
nimdil wrote:
Impressive.

I'm saying that the expected copy-paste behaviour does not work here.
You are saying that's OK because it is not required
You see that you are missing the point here, don't you? The UI is about friendliness and ease of use, not about doing only what is necessary.


I'm saying "It's OK because you're trying to use things which aren't actually URL's as URL's because you've gotten sloppy due to overly friendly UI's in your browser."

From a programming standpoint, any character outside the range ASCII 30-127 is a terminator for a URL by the W3C/industry-wide definition. The W3C has standardized a %-code system so that they can be substituted in on compliant browsers and internet applications.

The board software shouldn't detect "naked" pseudo-URLs with odd characters, but should convert something inside [url][/url] tags at submit.
2 
 Thumb up
 tip
 Thumb up
peter pan
Poland
Warsaw
mazowieckie
Avatar
mbmbmbmbmb
aramis wrote:
nimdil wrote:
Impressive.

I'm saying that the expected copy-paste behaviour does not work here.
You are saying that's OK because it is not required
You see that you are missing the point here, don't you? The UI is about friendliness and ease of use, not about doing only what is necessary.


I'm saying "It's OK because you're trying to use things which aren't actually URL's as URL's because you've gotten sloppy due to overly friendly UI's in your browser."

From a programming standpoint, any character outside the range ASCII 30-127 is a terminator for a URL by the W3C/industry-wide definition. The W3C has standardized a %-code system so that they can be substituted in on compliant browsers and internet applications.

The board software shouldn't detect "naked" pseudo-URLs with odd characters, but should convert something inside [url][/url] tags at submit.

Impressive.
You are still missing the point. Being aware of this documents I still think you are thinking the wrong way.

Your idea of friendly interface is to force users to
1 take URL from the Browser - i.e. http://pl.wikipedia.org/wiki/Inne_pieśni
2 google to look for the website which will transform the URL to html entities online (or do it manually or whatever)
3 perform transformation
4 use the result URL
? devil Maybe there is a shortcut here but I don't really see it right now. Plus the common user is probably not even aware of the problem.

Programming standpoint of view is one thing, friendly UI is not exactly the same.

I'm not saying that geek shouldn't convert it to the entities, although I hardly see reason for this as probably any user agent would do it automatically. The only theoretical advantage of doing so is to make website perfectly valid by W3C standards. But it seems geek is not valid anyway and apparently this issue is - as of present - so minor the W3C validator does not mark it with warning, much less error, when parsing HTML page with URLs containing bytes with value higher than 7F.

So OK. You are right but only technically. From user's point of view this is an error/bug/inconvenience/nuisance. Do you really think it is OK?
3 
 Thumb up
 tip
 Thumb up
William Hostman
United States
Eagle River
Alaska
designer
Gaming in Greater Anchorage area, Alaska since 1978. Looking for Indy-willing RPG players in Eagle River (or willing to drive to Eagle River). Geekmail me if interested.
badge
Yes, this really is what I looked like when I uploaded that avatar. Not that it's quite current anymore.
Avatar
mbmbmbmbmb
nimdil wrote:

So OK. You are right but only technically. From user's point of view this is an error/bug/inconvenience/nuisance. Do you really think it is OK?


As I said, I think it's a user ignorance issue, and that the parser for the URL tags should convert it ONCE, when either preview or submit is pressed, to the proper escaped (%-coded) form.
 
 Thumb up
 tip
 Thumb up
フィル
Australia
Newtown
NSW
flag msg tools
designer
Avatar
mbmbmbmbmb
The user shouldn't have to read an RFC just to link to Wikipedia. Instead, the software should encode non-ASCII characters behind the scenes so that the front end is usable and the back end is compliant.
7 
 Thumb up
 tip
 Thumb up
Dave Bernazzani (@rpggeek)
United States
Plainville
Massachusetts
flag msg tools
I wish to provide legendary service to the RPG community to help grow our hobby and enrich the lives of gamers everywhere.
badge
Avatar
mbmbmbmbmb
sbszine wrote:
The user shouldn't have to read an RFC just to link to Wikipedia. Instead, the software should encode non-ASCII characters behind the scenes so that the front end is usable and the back end is compliant.

I agree. I'd have no idea how to encode it - and wouldn't even know I was supposed to encode it. Paweł's original URL works in both my browsers (FF and IE) and I can enter it directly into a any of the major forms I use (translators mostly) that take a URL. The Geek doesn't deal with it - and it should. And no, that has nothing to do with that techo-mumbo-jumbo which is of almost no importance to us users.

Having said that, we've got a pile of changes and fixes needed and the list grows weekly. I'm guessing this won't be very high on the list. If it's super easy, I'm sure it will be done fast but if not, it's likely to be back-burner. If one of the devs doesn't chime in on this thread in a few days time, I'll log it into the JIRA system they are using to track issues needing resolution.

-Dave
5 
 Thumb up
 tip
 Thumb up
フィル
Australia
Newtown
NSW
flag msg tools
designer
Avatar
mbmbmbmbmb
You could just tweak the parser to terminate URLs only on a space or square bracket and that would be good enough. The actual encoding is probably already handled by most browsers en passant.
3 
 Thumb up
 tip
 Thumb up
peter pan
Poland
Warsaw
mazowieckie
Avatar
mbmbmbmbmb
aramis wrote:
nimdil wrote:

So OK. You are right but only technically. From user's point of view this is an error/bug/inconvenience/nuisance. Do you really think it is OK?


As I said, I think it's a user ignorance issue, and that the parser for the URL tags should convert it ONCE, when either preview or submit is pressed, to the proper escaped (%-coded) form.

Actually it probably should do it when transforming tags to html, so when editing the post the user will not encounter percent-entities in place of non-ascii characters.
3 
 Thumb up
 tip
 Thumb up
paz AKA Matt Lewis
United Kingdom
Great Sutton
Cheshire
flag msg tools
All hail Lord Fudge!
badge
[PFRPG PBF Games] GMing: Crypt of the Everflame, 2x We Be Goblins! · Playing: Vestige's Margreve 'Hollow', Bearpaw's Pathfinder Society
Avatar
mb
Sorry to butt in, but Wayback Machine links have similar issues:

Without URL tags:
http://classic-web.archive.org/web/20080506030618/http:/ www.wizards.com/default.asp?x=dnd/dnd/downloads

With URL tags:
www.wizards.com/default.asp?x=dnd/dnd/downloads" target="_blank" class="postlink" rel="nofollow">link

(Should go to http://bit.ly/lHBvrs)
3 
 Thumb up
 tip
 Thumb up
Young & Lawful Good
United Kingdom
Redhill
Surrey
flag msg tools
admin
Support comes in many forms: community involvement, forum posts, submitting data, word-of-mouth advertising, financial donations... All of these are vital to this site, and you have my sincere thanks for participating in any of them.
badge
Currently: banned as per http://rpggeek.com/article/9104203#9104203
Avatar
mbmbmbmbmb
We've had non-Latin domain names for over a year now. This article provides history and background to the issue:
http://en.wikipedia.org/wiki/Internationalized_domain_name

Some examples for testing...

http://موقع.وزارة-الأتصالات.مصر/ar/default.aspx
http://عربي.امارات/
http://кц.рф/en/
http://www.數位聯合.網路.tw/
2 
 Thumb up
 tip
 Thumb up
peter pan
Poland
Warsaw
mazowieckie
Avatar
mbmbmbmbmb
I would like to show some other weird situation.

How to reproduce another problem with URLs?

go to translate.google.com
paste http://www.wydawnictwoportal.pl/produkty/neuroshima-tactics/
go there
try to use URL of google translated version of the website. It sort of does not work (see my blog for the result).
 
 Thumb up
 tip
 Thumb up
Front Page | Welcome | Contact | Privacy Policy | Terms of Service | Advertise | Support BGG | Feeds RSS
Geekdo, BoardGameGeek, the Geekdo logo, and the BoardGameGeek logo are trademarks of BoardGameGeek, LLC.