Changeset 1411

Timestamp:
09/10/08 11:10:23
Author:
thomase
Message:

fixed xi handling problem and added prose about expectations and deficiencies with respect to BAtlas/Classical Atlas Project transliteration forms

Files:

  • PleiadesEntity/trunk/Extensions/ws_grek.py

    r746 r1411  
@@ -1,2 +1,18 @@
+
+# the assumption is that this transliterator should work according to the rules of the
+# Barrington Atlas. In the introduction to the Map-by-Map Directory (p. vii) its
+# Greek transliteration rules are described as follows:
+
+# Ancient Greek names are transcribed literally, though without marking accents or
+# long vowels.  Th is used for theta, k for kappa, x for xi, ch for chi, ps for psi.
+# Combinations with gamma such as gg or gk become ng and nk, etc.  Upsilon with
+# another vowel remains u (thus au for alpha + upsilon); otherwise it normally becomes
+# y.  Rho’s rough breathing is optional for initial rho (which can thus appear as either Rh
+# or R), but elsewhere in the word it is omitted in the case of names attested only in Greek.
+# Subscripts are ignored.
+
+# the current code does some of this, but doesn't live up to all aspects, especially dealing
+# with two-letter combinations; needs a major rework including some enforced
+# assumptions about unicode normalization form, capitalization and so forth
 
 grek_unaccented_capital = {
@@ -14,5 +30,5 @@
     u"\u039C" : "M",    # Greek capital letter mu
     u"\u039D" : "N",    # Greek capital letter nu
-    u"\u039E" : "KS",   # Greek capital letter xi
+    u"\u039E" : "X",   # Greek capital letter xi
     u"\u039F" : "O",    # Greek capital letter omicron
     u"\u03A0" : "P",    # Greek capital letter pi
@@ -42,5 +58,5 @@
     u"\u03BC" : "m",    # Greek small letter mu
     u"\u03BD" : "n",    # Greek small letter nu
-    u"\u03BE" : "ks",   # Greek small letter xi
+    u"\u03BE" : "x",   # Greek small letter xi
     u"\u03BF" : "o",    # Greek small letter omicron
     u"\u03C0" : "p",    # Greek small letter pi
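
The comment added in this changeset says the per-letter tables do not yet handle two-letter combinations or enforce a Unicode normalization form. The following is a minimal sketch of how those rules might be applied. It is not the code in ws_grek.py. It is written for Python 3, and the names `BASE`, `VOWELS`, `VELARS`, `strip_marks` and `transliterate` are hypothetical. It assumes that decomposing to NFD and dropping every combining mark is acceptable, which removes accents and iota subscripts but also all breathings, so initial rho always comes out as R and a rough breathing on a vowel is not rendered. The "ph" for phi and the choice of which letters count as "etc." after gamma are also assumptions, since the quoted rules do not spell them out.

```python
import unicodedata

# Hypothetical lowercase table; xi maps to "x" as in r1411.
# "ph" for phi is an assumption: the quoted rules do not mention phi.
BASE = {
    "α": "a", "β": "b", "γ": "g", "δ": "d", "ε": "e", "ζ": "z",
    "η": "e", "θ": "th", "ι": "i", "κ": "k", "λ": "l", "μ": "m",
    "ν": "n", "ξ": "x", "ο": "o", "π": "p", "ρ": "r", "σ": "s",
    "ς": "s", "τ": "t", "υ": "y", "φ": "ph", "χ": "ch", "ψ": "ps",
    "ω": "o",
}
VOWELS = set("αεηιουω")
# Assumed reading of "gg or gk ... etc.": gamma before gamma, kappa, xi or chi.
VELARS = set("γκξχ")


def strip_marks(text):
    """Decompose to NFD and drop combining marks (accents, breathings, iota subscript)."""
    decomposed = unicodedata.normalize("NFD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def transliterate(name):
    """Transliterate a Greek name using the context rules sketched above."""
    text = strip_marks(name)
    lowered = [ch.lower() for ch in text]  # per-character, keeps indexes aligned
    out = []
    for i, ch in enumerate(lowered):
        prev = lowered[i - 1] if i > 0 else ""
        nxt = lowered[i + 1] if i + 1 < len(lowered) else ""
        if ch == "γ" and nxt in VELARS:
            piece = "n"            # gg -> ng, gk -> nk, ...
        elif ch == "υ" and (prev in VOWELS or nxt in VOWELS):
            piece = "u"            # upsilon next to another vowel stays u
        else:
            piece = BASE.get(ch, ch)  # unknown characters pass through unchanged
        if text[i].isupper():
            piece = piece.capitalize()  # "th" -> "Th", "ps" -> "Ps"
        out.append(piece)
    return "".join(out)


print(transliterate("Νάξος"))
print(transliterate("Ἄγκυρα"))
```

Under these assumptions the two calls print Naxos and Ankyra. Working on the decomposed, mark-free string means a single lowercase table can replace separate accented and unaccented tables, at the cost of losing the breathing information noted above.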