Re: Sinhala GNU/Linux

From: Donald Gaminitillake <email-not-shown>
Date: Mon Dec 06 2004 - 19:54:19 LKT
To: harshula <email-not-shown>
CC: Anuradha Ratnaweera <email-not-shown>, Delan Silva <lakfoil@slt.lk>




  
  


Dear Harsshula


I am Still at the same place: ( as you wrote -- word by word)

SLS 1134 = Unicode  0d80-0dff  locations (this is a the incomplete sinhala alphabet)


Is this correct? 

I find your "set b" contains General Punctuation Range: 2000–206F   this include 

Spaces , Dashes,EM SPACE  FOUR-PER-EM SPACE  DOUBLE VERTICAL LINEs etc ,   LINE SEPARATORs , PER MILLE SIGN etc , DOUBLE DAGGERs etc, QUESTION EXCLAMATION MARKs etc,   COMMERCIAL MINUS SIGNs etc , REVERSED SEMICOLON etc. QUADRUPLE PRIME,  Greek enotikon, Urdu paragraph separator .Japanese kome, INVISIBLE SEPARATOR etc and

Formatting characters
200C   ZERO WIDTH NON-JOINER
= ZWNJ
200D   ZERO WIDTH JOINER
= ZWJ
200E   LEFT-TO-RIGHT MARK
= LRM
200F   RIGHT-TO-LEFT MARK

Where are the unicode locations and/or SLSI locations for sinhala language byond this?


Which you refers as  Set C = Cartesian product of A2 and A3 &  Set D = a subset of the Cartesian product of B, A3, A2 and C2

Can you give me the similar locations to download from UNICODE.

I will be in and out of Colombo during the next fewe days and E mails may get delayed

Best

Donald




harshula wrote:
Hi Donald,

Set A = Set A1 Union Set A2 Union Set A3 = 0d80-0dff

Set B = 200C-200D

Set C = Cartesian product of A2 and A3

(Set C2 = Union of C and A2)

Set D = a subset of the Cartesian product of B, A3, A2 and C2 

You seem to be very interested in Set B, so I assume you already
understand and acknowledge Set C, as it's derivable from Set A2 and A3.

Set B consists of two elements, the magical ZWNJ and ZWJ:
http://www.unicode.org/charts/PDF/U2000.pdf

Regards,
Harshula

On Sun, 2004-12-05 at 17:27 +0600, Donald Gaminitillake wrote:
  
Dear Harshula

I must get the things in right order

SLS 1134  =  Unicode range  0d80-0dff =  set A  (is this correct)

You have defined this as set A.

You have defined a set B =  200C - 200D 


  I do have the set A and "a" is 0D85  and I know every character
location up to 0dff.

Can you give me the images of set B or a URL to down load it.


Best

Donald
    

-------- Message --------
From: harshula <hash@jayasolutions.cjb.net>
To: Donald Gaminitillake
Cc: Anuradha Ratnaweera <gnu.slash.linux@gmail.com>, Delan Silva
<lakfoil@slt.lk>
Subject: Re: Sinhala GNU/Linux
Date: Sun, 05 Dec 2004 19:48:57 +1100 
Hi Donald,

You seem to be confused. You need to read my email again.

I explicitly stated that:

-----------------------------------
I assert that the Unicode code range:

0d80-0dff (hereby referred to as Set A1, A2 and A3)

and 

200C-200D (hereby referred to as Set B)

allows us to encode the Sinhala letters.

...

In our case it is roughly the union of:

* A1 
* A2 
* C (= Cartesian product of A2 and A3)
* D (= a subset of the Cartesian product of B, A3, A2 and C2 (C2 = Union
of C and A2))

This union produces a set of encodings containing all the basic elements
(letters).
-----------------------------------

So once you understand and acknowledge the existence of Set C and D,
then we can continue our discussion. If you do not understand the
mathematical terms, please tell me, and I'll find an explanation for
you.

What you wrote in your email does not even remotely correspond to what I
wrote. I have taken the liberty to attach my email to the bottom of this
email, as you seem to have replied to Anuradha's email and not mine.

Regards,
Harshula

  
Received on Mon Dec 06 19:54:19 2004

This archive was generated by hypermail 2.1.8 : Wed Dec 08 2004 - 17:56:45 LKT