Welcome to MSDN Blogs Sign in | Join | Help

Browse by Tags

All Tags » Unicode/standards   (RSS)
One can really never get enough of puns about the BOM (Byte Order Mark) and TSA. And when I say one, I mean I. :-) Just think back to blogs like Don't sneak a BOM in on someone who promises to ignore free space or Everyone seems averse to the BOM these Read More...
Regular reader Pavanaja U B asked over in the Suggestion Box: In Word 2003 and prior, there was an option "not to display the font name in that font". This was very useful while using hacked non-English TTF fonts which were using the ASCII codes meant Read More...
After reading It isn't really RED versus GREEN , Cheong asked: I've searched for a while in charmap.exe and yet to find a font that'll display these characters in WinXP. That brings me to a question: Is there a font that ships with Windows (not necessarily Read More...
WARNING: This blog has nothing whatsoever ¹ do with Nordic sex. Regular reader Santhosh Pillai had a question not too long ago that I found to be rather kick ass and cool, professionally speaking. It was: Hi, I am trying to figure out how these characters Read More...
In the past 10 days I have had four people ask me the same question. Here is one if the most recent ones: Hi Michael, One of my friends was asking if there is an ICU for .NET. icu4c-4_0-Win32-msvc8.zip that he got from http://www.icu-project.org/download/4.0.html Read More...
Previous blogs in this series of blogs on this Blog: Part 0: The intro, sans content Part 1: Getting the obvious out of the way Part 2: A&P of a 'linguistic character' Part 3: It starts with cursor movement (where MS simultaneously gets better and Read More...
Over in the Suggestion Box, Aaron asked: Hi again - question about one of your favorite codepages - 1258 (Vietnamese) and combining diacritics in regards to Unicode character U+1EB7 (LATIN SMALL LETTER A WITH BREVE AND DOT BELOW - http://www.fileformat.info/info/unicode/char/1eb7/index.htm) Read More...
Previous blogs in this series of blogs on this Blog: Part 0: The intro, sans content Part 1: Getting the obvious out of the way Part 2: A&P of a 'linguistic character' Part 3: It starts with cursor movement (where MS simultaneously gets better and Read More...
Jason's question was: The Unicode "Lt" character category -- this is the "title case" character category. It looks like the .Net BCL exposes this but the Win32 API doesn't. Am I correct in this? Thanks. Yes, Jason is spot on here. Like much of the Unicode Read More...
Previous blogs in this series of blogs on this Blog: Part 0: The intro, sans content Part 1: Getting the obvious out of the way Part 2: A&P of a 'linguistic character' Part 3: It starts with cursor movement (where MS simultaneously gets better and Read More...
So it all started in a conversation with some of the folks from the SQL Server team when I was at PASS. What did they expect? Send 600 Microsoft employees to an event and you are bound to run into some of them just in the act of walking around! They were Read More...
Previous blogs in this series of blogs on this Blog: Part 0: The intro, sans content Part 1: Getting the obvious out of the way Part 2: A&P of a 'linguistic character' Part 3: It starts with cursor movement (where MS simultaneously gets better and Read More...
Warning: although slightly technical, this blog is mostly non-technical, and/or technical about stuff related to the iBOT. If the technical issues related to SQL Server and/or PASS interest you then they will probably show up in future blogs ... Prior Read More...
The title of this blog is an allusion to Coppola's Apocalypse Now , and eventually I'll be quoting a bit of the Herr-provided narration (those are the pieces Martin Sheen read)... It all started with a seemingly innocent question the other day. It went Read More...
The question from, the other day was an interesting one. It was something like this: I’m trying to do a word-boundary check, and I noticed regex doesn’t handle boundaries correctly for some extended characters (░╤╞╬═╣etc.). A simple example is “\b░” which Read More...
More Posts Next page »
 
Page view tracker