TSE: Can you give conversion tables to convert regular expressions from Perl to TSE? [POSIX]

Nov 20th, 2006 12:53
Knud van Eeden,


----------------------------------------------------------------------
--- Knud van Eeden --- 19 November 2020 - 07:49 pm -------------------
Conversion tables for converting Perl to TSE regular expressions
===
-----------------------------------------------------------------------
TABLE: RANGE
-----------------------------------------------------------------------
|DESCRIPTION    |PERL                 |TSE              |POSIX         |PERL FN        |
-----------------------------------------------------------------------
control         [\x00-\x1F\x7F]      [\x00-\x1F\x7F]  [[:cntrl:]]   IsCntrl
white space+tab [ \t]                [ \t]            [[:blank:]]   IsBlank
whitespace      [ \t\n\r\f\x0B]      [ \f\n\r\t\v]    [[:space:]]   IsSpace
whitespacePerl  \s                   [ \f\n\r\t\v]    [[:space:]]   IsSpacePerl
punctuation     [!-/:-@[-`{-~]       [!-/:-@[-`{-~]   [[:punct:]]   IsPunct
decimal digit   \d                   [0-9]            [[:digit:]]   IsDigit
hexadecimal     [0-9A-Fa-f]          [0-9A-Fa-f]      [[:xdigit:]]  IsXDigit
upper           [A-Z]                [A-Z]            [[:upper:]]   IsUpper
lower           [a-z]                [a-z]            [[:lower:]]   IsLower
upper+lower     [A-Za-z]             [A-Za-z]         [[:alpha:]]   IsAlpha
alphanumeric    [A-Za-z0-9]          [A-Za-z0-9]      [[:alnum:]]   IsAlnum
alphanumeric+_  \w                   [A-Za-z0-9_]     [[:word:]]    IsWord
printable       [!-~]                [!-~]            [[:graph:]]   IsGraph
printable+space [ -~]                [ -~]            [[:print:]]   IsPrint
any ASCII       [\x00-\x7F]          [\x00-\x7F]      [[:ascii:]]   IsASCII
any Unicode     [\x{0000}-\x{FFFF}]  not available?   ?             ?
not control     [^\x00-\x1F\x7F]     [~\x00-\x1F\x7F] [^[:cntrl:]]  \P{IsCntrl}
not space+tab   [^ \t]               [~ \t]           [^[:blank:]]  \P{IsBlank}
not whitespace  [^ \t\n\r\f\x0B]     [~ \f\n\r\t\v]   [^[:space:]]  \P{IsSpace}
not whtspc Perl \S                   [~ \f\n\r\t\v]   [^[:space:]]  \P{IsSpacePerl}
not punctuation [^!-/:-@[-`{-~]      [~!-/:-@[-`{-~]  [^[:punct:]]  \P{IsPunct}
not decimal dig \D                   [~0-9]           [^[:digit:]]  \P{IsDigit}
not hexadecimal [^0-9A-Fa-f]         [~0-9A-Fa-f]     [^[:xdigit:]] \P{IsXDigit}
not upper       [^A-Z]               [~A-Z]           [^[:upper:]]  \P{IsUpper}
not lower       [^a-z]               [~a-z]           [^[:lower:]]  \P{IsLower}
not upper+lower [^A-Za-z]            [~A-Za-z]        [^[:alpha:]]  \P{IsAlpha}
not alphanumc   [^A-Za-z0-9]         [~A-Za-z0-9]     [^[:alnum:]]  \P{IsAlnum}
not alphnumc+_  \W                   [~A-Za-z0-9_]    [^[:word:]]   \P{IsWord}
not printable   [^!-~]               [~!-~]           [^[:graph:]]  \P{IsGraph}
not prt+space   [^ -~]               [~ -~]           [^[:print:]]  \P{IsPrint}
not any ASCII   [^\x00-\x7F]         [~\x00-\x7F]     [^[:ascii:]]  \P{IsASCII}
not any Unicode [^\x{0000}-\x{FFFF}] not available?   ?             ?
----------------------------------------------------------------------
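The shorthand rows of the RANGE table (\d, \w, \s and their negations) can be
applied mechanically. Below is a minimal Python sketch of that step, assuming
the TSE column above is accurate. The names SHORTHAND_TO_TSE and
expand_shorthand are hypothetical and not part of Perl or TSE, and the sketch
leaves shorthands inside a [...] class untouched.

```python
# Hypothetical helper: expand Perl shorthand classes into the explicit
# TSE classes listed in the RANGE table. Not part of Perl or TSE.
SHORTHAND_TO_TSE = {
    "d": "[0-9]",
    "D": "[~0-9]",
    "w": "[A-Za-z0-9_]",
    "W": "[~A-Za-z0-9_]",
    "s": r"[ \f\n\r\t\v]",
    "S": r"[~ \f\n\r\t\v]",
}


def expand_shorthand(perl_pattern):
    """Replace \\d, \\w, \\s (and negations) found outside [...] classes."""
    out = []
    i = 0
    in_class = False
    while i < len(perl_pattern):
        ch = perl_pattern[i]
        if ch == "\\" and i + 1 < len(perl_pattern):
            nxt = perl_pattern[i + 1]
            if not in_class and nxt in SHORTHAND_TO_TSE:
                out.append(SHORTHAND_TO_TSE[nxt])
            else:
                out.append(ch + nxt)  # keep every other escape untouched
            i += 2
            continue
        if ch == "[" and not in_class:
            in_class = True
        elif ch == "]" and in_class:
            in_class = False
        out.append(ch)
        i += 1
    return "".join(out)
```

For example, the Perl pattern \d\d:\d\d would come out as
[0-9][0-9]:[0-9][0-9], which TSE should accept as a regular expression.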
===
-----------------------------------------------------------------------
TABLE: OPERATOR: ALTERNATION / CLOSURE / CONCATENATION / GROUPING
-----------------------------------------------------------------------
| PERL  |TSE           |NAME       |CLASS: TYPE  |CLASS: TYPE: SUB    |
-----------------------------------------------------------------------
  $&,$1-$9 \0..\9       tag         -             -
  |      |              or          alternation   -
  ?      not available  0 or 1      closure       maximum: greedy
  *      @              0 or more   closure       maximum: greedy
  +      #              1 or more   closure       maximum: greedy
  ?+     not available  0 or 1      closure       maximum: possessive
  *+     not available  0 or more   closure       maximum: possessive
  ++     not available  1 or more   closure       maximum: possessive
  ??     ?              0 or 1      closure       minimum
  *?     *              0 or more   closure       minimum
  +?     +              1 or more   closure       minimum
  {n,m}  not available? n to m      closure       maximum: greedy
  {n,}   not available? n or more   closure       maximum: greedy
  {0,n}  not available? 0 to n      closure       maximum: greedy
  {n}    not available? n times     closure       maximum: greedy
  {n,m}? not available? n to m      closure       minimum
  {n,}?  not available? n or more   closure       minimum
  {0,n}? not available? 0 to n      closure       minimum
  {n}?   not available? n times     closure       minimum
  [...]  [...]          class       concatenation character: single: 1
  [^...] [~...]         class not   concatenation character: single: 1
  .      .              any char    concatenation character: single: 1
  \      \              escape char concatenation character: single: 1
  \xHH   \xHH           hexadecimal concatenation character: single: 1
  \ooo   \oOOO          octal       concatenation character: single: 1
  -      \dDDD          decimal     concatenation character: single: 1
  [.-.]  [.-.]          range       concatenation character: single: 1
  (...)  {...}          group       group         -
-----------------------------------------------------------------------
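The operator rows can be read as a small rewriting scheme: swap the closure
symbols, turn (...) into {...}, and turn [^...] into [~...]. The following
Python sketch illustrates that reading, assuming the TSE column above is
accurate. The function name perl_to_tse_operators is hypothetical, and the
sketch raises an error for every Perl construct that the table marks as
"not available" instead of guessing a substitute. It also escapes literal
@ and #, on the assumption that these must be escaped in TSE because they
are closure operators there.

```python
# Hypothetical translator for the operator rows above. It assumes the TSE
# column of the OPERATOR table is accurate; it is not an official tool.
OPERATORS = [            # longest Perl form first; None = "not available"
    ("*+", None), ("++", None), ("?+", None),   # possessive closures
    ("*?", "*"), ("+?", "+"), ("??", "?"),      # minimum closures
    ("*", "@"), ("+", "#"),                     # maximum (greedy) closures
    ("?", None),                                # greedy 0 or 1
    ("(?", None),                               # Perl extended groups
    ("(", "{"), (")", "}"),                     # plain grouping
    ("{", None), ("}", None),                   # {n,m} style counts
]


def perl_to_tse_operators(perl_pattern):
    """Rewrite closures, groups and negated classes; leave the rest alone."""
    out = []
    i = 0
    in_class = False
    while i < len(perl_pattern):
        ch = perl_pattern[i]
        if ch == "\\" and i + 1 < len(perl_pattern):
            out.append(perl_pattern[i:i + 2])    # escaped pair, copied as is
            i += 2
        elif in_class:
            in_class = ch != "]"
            out.append(ch)
            i += 1
        elif ch == "[":
            in_class = True
            negated = perl_pattern.startswith("[^", i)
            out.append("[~" if negated else "[")
            i += 2 if negated else 1
        elif ch in "@#":
            out.append("\\" + ch)                # literal in Perl, operator in TSE
            i += 1
        else:
            for perl_op, tse_op in OPERATORS:
                if perl_pattern.startswith(perl_op, i):
                    if tse_op is None:
                        raise ValueError("no TSE equivalent for " + perl_op)
                    out.append(tse_op)
                    i += len(perl_op)
                    break
            else:
                out.append(ch)                   # ordinary character
                i += 1
    return "".join(out)
```

With this sketch, (ab)+c* becomes {ab}#c@ and a.*?b becomes a.*b, while a
pattern such as colou?r is rejected because the table lists no TSE
counterpart for the greedy ? closure.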
===
-----------------------------------------------------------------------
TABLE: ANCHOR
-----------------------------------------------------------------------
| OBJECT      | POSITION     | PERL       | TSE                       |
-----------------------------------------------------------------------
  cursor        after          do not know  \c
  line          begin          ^            ^
  line          end            $            $
  string        begin          \A           do not know
  match         previous end   \G           do not know
  string        end            \Z           do not know
  string        end            \z           do not know
  word          begin or end   \b           do not know
-----------------------------------------------------------------------
===
Note:
The &#....; values in the table below should be single graphical characters.
They show the HTML Unicode entities for these single characters instead.
This conversion happened when the original table was pasted.
===
The table below corresponds to the characters as shown
when using the font 'Terminal'
===
-----------------------------------------------------------------------
TABLE: ASCII and HEXADECIMAL: PERL: TSE
-----------------------------------------------------------------------
| ASCII | HEXADECIMAL| PERL | TSE  | LETTER                           |
-----------------------------------------------------------------------
  000     \x00         \x00   \x00   -
  001     \x01         \x01   \x01   ^A
  002     \x02         \x02   \x02   ^B
  003     \x03         \x03   \x03   ^C
  004     \x04         \x04   \x04   ^D
  005     \x05         \x05   \x05   ^E
  006     \x06         \x06   \x06   ^F
  007     \x07         \a     \a     ^G
  008     \x08         \x08   \b     ^H
  009     \x09         \t     \t     ^I
  010     \x0A         \n     \n     ^J
  011     \x0B         \x0B   \v     ^K
  012     \x0C         \f     \f     ^L
  013     \x0D         \r     \r     ^M
  014     \x0E         \x0E   \x0E   ^N
  015     \x0F         \x0F   \x0F   ^O
  016     \x10         \x10   \x10   ^P
  017     \x11         \x11   \x11   ^Q
  018     \x12         \x12   \x12   ^R
  019     \x13         \x13   \x13   ^S
  020     \x14         \x14   \x14   ^T
  021     \x15         \x15   \x15   ^U
  022     \x16         \x16   \x16   ^V
  023     \x17         \x17   \x17   ^W
  024     \x18         \x18   \x18   ^X
  025     \x19         \x19   \x19   ^Y
  026     \x1A         \x1A   \x1A   ^Z
  027     \x1B         \e     \x1B   ^[
  028     \x1C         \x1C   \x1C   ^\
  029     \x1D         \x1D   \x1D   ^]
  030     \x1E         \x1E   \x1E   ^^
  031     \x1F         \x1F   \x1F   ^_
  032     \x20         \x20   \x20   ?
  033     \x21         !      !      ?
  034     \x22         "      "      ?
  035     \x23         #      #      ?
  036     \x24         $      $      ?
  037     \x25         %      %      ?
  038     \x26         &      &      ?
  039     \x27         '      '      ?
  040     \x28         (      (      ?
  041     \x29         )      )      ?
  042     \x2A         *      *      ?
  043     \x2B         +      +      ?
  044     \x2C         ,      ,      ?
  045     \x2D         -      -      ?
  046     \x2E         .      .      ?
  047     \x2F         /      /      ?
  048     \x30         0      0      ?
  049     \x31         1      1      ?
  050     \x32         2      2      ?
  051     \x33         3      3      ?
  052     \x34         4      4      ?
  053     \x35         5      5      ?
  054     \x36         6      6      ?
  055     \x37         7      7      ?
  056     \x38         8      8      ?
  057     \x39         9      9      ?
  058     \x3A         :      :      ?
  059     \x3B         ;      ;      ?
  060     \x3C         <      <      ?
  061     \x3D         =      =      ?
  062     \x3E         >      >      ?
  063     \x3F         ?      ?      ?
  064     \x40         @      @      ?
  065     \x41         A      A      ?
  066     \x42         B      B      ?
  067     \x43         C      C      ?
  068     \x44         D      D      ?
  069     \x45         E      E      ?
  070     \x46         F      F      ?
  071     \x47         G      G      ?
  072     \x48         H      H      ?
  073     \x49         I      I      ?
  074     \x4A         J      J      ?
  075     \x4B         K      K      ?
  076     \x4C         L      L      ?
  077     \x4D         M      M      ?
  078     \x4E         N      N      ?
  079     \x4F         O      O      ?
  080     \x50         P      P      ?
  081     \x51         Q      Q      ?
  082     \x52         R      R      ?
  083     \x53         S      S      ?
  084     \x54         T      T      ?
  085     \x55         U      U      ?
  086     \x56         V      V      ?
  087     \x57         W      W      ?
  088     \x58         X      X      ?
  089     \x59         Y      Y      ?
  090     \x5A         Z      Z      ?
  091     \x5B         [      [      ?
  092     \x5C         \      \      ?
  093     \x5D         ]      ]      ?
  094     \x5E         ^      ^      ?
  095     \x5F         _      _      ?
  096     \x60         `      `      ?
  097     \x61         a      a      ?
  098     \x62         b      b      ?
  099     \x63         c      c      ?
  100     \x64         d      d      ?
  101     \x65         e      e      ?
  102     \x66         f      f      ?
  103     \x67         g      g      ?
  104     \x68         h      h      ?
  105     \x69         i      i      ?
  106     \x6A         j      j      ?
  107     \x6B         k      k      ?
  108     \x6C         l      l      ?
  109     \x6D         m      m      ?
  110     \x6E         n      n      ?
  111     \x6F         o      o      ?
  112     \x70         p      p      ?
  113     \x71         q      q      ?
  114     \x72         r      r      ?
  115     \x73         s      s      ?
  116     \x74         t      t      ?
  117     \x75         u      u      ?
  118     \x76         v      v      ?
  119     \x77         w      w      ?
  120     \x78         x      x      ?
  121     \x79         y      y      ?
  122     \x7A         z      z      ?
  123     \x7B         {      {      ?
  124     \x7C         |      |      ?
  125     \x7D         }      }      ?
  126     \x7E         ~      ~      ?
  127     \x7F         \x7F   \x7F   ?
  128     \x80         Ç      Ç      ?
  129     \x81         ü      ü      ?
  130     \x82         é      é      ?
  131     \x83         â      â      ?
  132     \x84         ä      ä      ?
  133     \x85         à      à      ?
  134     \x86         å      å      ?
  135     \x87         ç      ç      ?
  136     \x88         ê      ê      ?
  137     \x89         ë      ë      ?
  138     \x8A         è      è      ?
  139     \x8B         ï      ï      ?
  140     \x8C         î      î      ?
  141     \x8D         ì      ì      ?
  142     \x8E         Ä      Ä      ?
  143     \x8F         Å      Å      ?
  144     \x90         É      É      ?
  145     \x91         æ      æ      ?
  146     \x92         Æ      Æ      ?
  147     \x93         ô      ô      ?
  148     \x94         ö      ö      ?
  149     \x95         ò      ò      ?
  150     \x96         û      û      ?
  151     \x97         ù      ù      ?
  152     \x98         ÿ      ÿ      ?
  153     \x99         Ö      Ö      ?
  154     \x9A         Ü      Ü      ?
  155     \x9B         ¢      ¢      ?
  156     \x9C         £      £      ?
  157     \x9D         ¥      ¥      ?
  158     \x9E         ₧      ₧      ?
  159     \x9F         ƒ      ƒ      ?
  160     \xA0         á      á      ?
  161     \xA1         í      í      ?
  162     \xA2         ó      ó      ?
  163     \xA3         ú      ú      ?
  164     \xA4         ñ      ñ      ?
  165     \xA5         Ñ      Ñ      ?
  166     \xA6         ª      ª      ?
  167     \xA7         º      º      ?
  168     \xA8         ¿      ¿      ?
  169     \xA9         ⌐      ⌐      ?
  170     \xAA         ¬      ¬      ?
  171     \xAB         ½      ½      ?
  172     \xAC         ¼      ¼      ?
  173     \xAD         ¡      ¡      ?
  174     \xAE         «      «      ?
  175     \xAF         »      »      ?
  176     \xB0         ░      ░      ?
  177     \xB1         ▒      ▒      ?
  178     \xB2         ▓      ▓      ?
  179     \xB3         │      │      ?
  180     \xB4         ┤      ┤      ?
  181     \xB5         ╡      ╡      ?
  182     \xB6         ╢      ╢      ?
  183     \xB7         ╖      ╖      ?
  184     \xB8         ╕      ╕      ?
  185     \xB9         ╣      ╣      ?
  186     \xBA         ║      ║      ?
  187     \xBB         ╗      ╗      ?
  188     \xBC         ╝      ╝      ?
  189     \xBD         ╜      ╜      ?
  190     \xBE         ╛      ╛      ?
  191     \xBF         ┐      ┐      ?
  192     \xC0         └      └      ?
  193     \xC1         ┴      ┴      ?
  194     \xC2         ┬      ┬      ?
  195     \xC3         ├      ├      ?
  196     \xC4         ─      ─      ?
  197     \xC5         ┼      ┼      ?
  198     \xC6         ╞      ╞      ?
  199     \xC7         ╟      ╟      ?
  200     \xC8         ╚      ╚      ?
  201     \xC9         ╔      ╔      ?
  202     \xCA         ╩      ╩      ?
  203     \xCB         ╦      ╦      ?
  204     \xCC         ╠      ╠      ?
  205     \xCD         ═      ═      ?
  206     \xCE         ╬      ╬      ?
  207     \xCF         ╧      ╧      ?
  208     \xD0         ╨      ╨      ?
  209     \xD1         ╤      ╤      ?
  210     \xD2         ╥      ╥      ?
  211     \xD3         ╙      ╙      ?
  212     \xD4         ╘      ╘      ?
  213     \xD5         ╒      ╒      ?
  214     \xD6         ╓      ╓      ?
  215     \xD7         ╫      ╫      ?
  216     \xD8         ╪      ╪      ?
  217     \xD9         ┘      ┘      ?
  218     \xDA         ┌      ┌      ?
  219     \xDB         █      █      ?
  220     \xDC         ▄      ▄      ?
  221     \xDD         ▌      ▌      ?
  222     \xDE         ▐      ▐      ?
  223     \xDF         ▀      ▀      ?
  224     \xE0         α      α      ?
  225     \xE1         ß      ß      ?
  226     \xE2         Γ      Γ      ?
  227     \xE3         π      π      ?
  228     \xE4         Σ      Σ      ?
  229     \xE5         σ      σ      ?
  230     \xE6         µ      µ      ?
  231     \xE7         τ      τ      ?
  232     \xE8         Φ      Φ      ?
  233     \xE9         Θ      Θ      ?
  234     \xEA         Ω      Ω      ?
  235     \xEB         δ      δ      ?
  236     \xEC         ∞      ∞      ?
  237     \xED         φ      φ      ?
  238     \xEE         ε      ε      ?
  239     \xEF         ∩      ∩      ?
  240     \xF0         ≡      ≡      ?
  241     \xF1         ±      ±      ?
  242     \xF2         ≥      ≥      ?
  243     \xF3         ≤      ≤      ?
  244     \xF4         ⌠      ⌠      ?
  245     \xF5         ⌡      ⌡      ?
  246     \xF6         ÷      ÷      ?
  247     \xF7         ≈      ≈      ?
  248     \xF8         °      °      ?
  249     \xF9         ∙      ∙      ?
  250     \xFA         ·      ·      ?
  251     \xFB         √      √      ?
  252     \xFC         ⁿ      ⁿ      ?
  253     \xFD         ²      ²      ?
  254     \xFE         ■      ■      ?
  255     \xFF         \xFF   \xFF   ?
-----------------------------------------------------------------------
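The characters from 128 upward in this table agree with what Python's cp437
codec produces. That suggests, as an assumption only, that the 'Terminal'
font shows OEM code page 437; the entry itself does not name the code page.
Under that assumption, the following sketch regenerates rows in the same
layout, following the TSE column. The names glyph and table_row are
hypothetical.

```python
# Sketch: rebuild rows of the table from the byte value alone.
# Assumption: the 'Terminal' font displays OEM code page 437.
NAMED_ESCAPES = {7: r"\a", 8: r"\b", 9: r"\t", 10: r"\n",
                 11: r"\v", 12: r"\f", 13: r"\r"}


def glyph(code):
    """Return the code page 437 character for a byte value 0..255."""
    return bytes([code]).decode("cp437")


def table_row(code):
    """Format one row: decimal value, hexadecimal escape, pattern text."""
    hex_escape = "\\x%02X" % code
    if code in NAMED_ESCAPES:
        text = NAMED_ESCAPES[code]
    elif code <= 32 or code in (127, 255):
        text = hex_escape          # no visible glyph, keep the escape
    else:
        text = glyph(code)
    return "  %03d     %-12s %s" % (code, hex_escape, text)
```

Calling table_row(65) yields a line in the same layout as row 065 above,
and glyph(128) returns the upper case C cedilla shown in row 128.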
===
-----------------------------------------------------------------------
TABLE: ASCII and HEXADECIMAL: NAME
-----------------------------------------------------------------------
| ASCII | HEXADECIMAL| PERL | TSE  | NAME                             |
-----------------------------------------------------------------------
  000     \x00         \x00   \x00   null, idle
  001     \x01         \x01   \x01   start of heading
  002     \x02         \x02   \x02   start of text
  003     \x03         \x03   \x03   end of text
  004     \x04         \x04   \x04   end of transmission
  005     \x05         \x05   \x05   enquiry
  006     \x06         \x06   \x06   acknowledge
  007     \x07         \a     \a     bell, beep, alert
  008     \x08         \x08   \b     backspace
  009     \x09         \t     \t     horizontal tab
  010     \x0A         \n     \n     new line, linefeed
  011     \x0B         \x0B   \v     vertical tab, home
  012     \x0C         \f     \f     form feed
  013     \x0D         \r     \r     carriage return
  014     \x0E         \x0E   \x0E   shift out
  015     \x0F         \x0F   \x0F   shift in
  016     \x10         \x10   \x10   data link escape
  017     \x11         \x11   \x11   device control 1 (xon)
  018     \x12         \x12   \x12   device control 2
  019     \x13         \x13   \x13   device control 3 (xoff)
  020     \x14         \x14   \x14   device control 4
  021     \x15         \x15   \x15   negative acknowledge
  022     \x16         \x16   \x16   synchronous idle
  023     \x17         \x17   \x17   end transmission block
  024     \x18         \x18   \x18   cancel
  025     \x19         \x19   \x19   end of medium
  026     \x1A         \x1A   \x1A   substitute
  027     \x1B         \e     \x1B   escape
  028     \x1C         \x1C   \x1C   file separator
  029     \x1D         \x1D   \x1D   group separator
  030     \x1E         \x1E   \x1E   record separator
  031     \x1F         \x1F   \x1F   unit separator
  032     \x20         \x20   \x20   space
  033     \x21         !      !      exclamation mark
  034     \x22         "      "      double quotes
  035     \x23         #      #      hash
  036     \x24         $      $      dollar sign
  037     \x25         %      %      percentage sign
  038     \x26         &      &      ampersand
  039     \x27         '      '      apostrophe
  040     \x28         (      (      left parenthesis
  041     \x29         )      )      right parenthesis
  042     \x2A         *      *      asterisk
  043     \x2B         +      +      plus sign
  044     \x2C         ,      ,      comma
  045     \x2D         -      -      minus sign
  046     \x2E         .      .      period
  047     \x2F         /      /      forward slash
  048     \x30         0      0      zero
  049     \x31         1      1      one
  050     \x32         2      2      two
  051     \x33         3      3      three
  052     \x34         4      4      four
  053     \x35         5      5      five
  054     \x36         6      6      six
  055     \x37         7      7      seven
  056     \x38         8      8      eight
  057     \x39         9      9      nine
  058     \x3A         :      :      colon
  059     \x3B         ;      ;      semicolon
  060     \x3C         <      <      less than sign
  061     \x3D         =      =      equal sign
  062     \x3E         >      >      greater than sign
  063     \x3F         ?      ?      question mark
  064     \x40         @      @      at sign
  065     \x41         A      A      A
  066     \x42         B      B      B
  067     \x43         C      C      C
  068     \x44         D      D      D
  069     \x45         E      E      E
  070     \x46         F      F      F
  071     \x47         G      G      G
  072     \x48         H      H      H
  073     \x49         I      I      I
  074     \x4A         J      J      J
  075     \x4B         K      K      K
  076     \x4C         L      L      L
  077     \x4D         M      M      M
  078     \x4E         N      N      N
  079     \x4F         O      O      O
  080     \x50         P      P      P
  081     \x51         Q      Q      Q
  082     \x52         R      R      R
  083     \x53         S      S      S
  084     \x54         T      T      T
  085     \x55         U      U      U
  086     \x56         V      V      V
  087     \x57         W      W      W
  088     \x58         X      X      X
  089     \x59         Y      Y      Y
  090     \x5A         Z      Z      Z
  091     \x5B         [      [      left square bracket
  092     \x5C         \      \      back slash
  093     \x5D         ]      ]      right square bracket
  094     \x5E         ^      ^      caret
  095     \x5F         _      _      underscore
  096     \x60         `      `      left apostrophe
  097     \x61         a      a      a
  098     \x62         b      b      b
  099     \x63         c      c      c
  100     \x64         d      d      d
  101     \x65         e      e      e
  102     \x66         f      f      f
  103     \x67         g      g      g
  104     \x68         h      h      h
  105     \x69         i      i      i
  106     \x6A         j      j      j
  107     \x6B         k      k      k
  108     \x6C         l      l      l
  109     \x6D         m      m      m
  110     \x6E         n      n      n
  111     \x6F         o      o      o
  112     \x70         p      p      p
  113     \x71         q      q      q
  114     \x72         r      r      r
  115     \x73         s      s      s
  116     \x74         t      t      t
  117     \x75         u      u      u
  118     \x76         v      v      v
  119     \x77         w      w      w
  120     \x78         x      x      x
  121     \x79         y      y      y
  122     \x7A         z      z      z
  123     \x7B         {      {      left curly bracket
  124     \x7C         |      |      vertical line
  125     \x7D         }      }      right curly bracket
  126     \x7E         ~      ~      tilde
  127     \x7F         \x7F   \x7F   del
  128     \x80         Ç      Ç      upper case C cedilla
  129     \x81         ü      ü      lower case u umlaut
  130     \x82         é      é      lower case e acute
  131     \x83         â      â      lower case a hat
  132     \x84         ä      ä      lower case a umlaut
  133     \x85         à      à      lower case a grave
  134     \x86         å      å      lower case a ring
  135     \x87         ç      ç      lower case c cedilla
  136     \x88         ê      ê      lower case e hat
  137     \x89         ë      ë      lower case e umlaut
  138     \x8A         è      è      lower case e grave
  139     \x8B         ï      ï      lower case i umlaut
  140     \x8C         î      î      lower case i hat
  141     \x8D         ì      ì      lower case i grave
  142     \x8E         Ä      Ä      upper case A umlaut
  143     \x8F         Å      Å      upper case A ring
  144     \x90         É      É      upper case E acute
  145     \x91         æ      æ      lower case ae digraph
  146     \x92         Æ      Æ      upper case AE digraph
  147     \x93         ô      ô      lower case o hat
  148     \x94         ö      ö      lower case o umlaut
  149     \x95         ò      ò      lower case o grave
  150     \x96         û      û      lower case u hat
  151     \x97         ù      ù      lower case u grave
  152     \x98         ÿ      ÿ      lower case y umlaut
  153     \x99         Ö      Ö      upper case O umlaut
  154     \x9A         Ü      Ü      upper case U umlaut
  155     \x9B         ¢      ¢      cent sign
  156     \x9C         £      £      pound sign
  157     \x9D         ¥      ¥      yen sign
  158     \x9E         ₧      ₧      peseta sign
  159     \x9F         ƒ      ƒ      florin sign
  160     \xA0         á      á      lower case a acute
  161     \xA1         í      í      lower case i acute
  162     \xA2         ó      ó      lower case o acute
  163     \xA3         ú      ú      lower case u acute
  164     \xA4         ñ      ñ      lower case n tilde
  165     \xA5         Ñ      Ñ      upper case N tilde
  166     \xA6         ª      ª      feminine ordinal sign
  167     \xA7         º      º      masculine ordinal sign
  168     \xA8         ¿      ¿      inverted question mark
  169     \xA9         ⌐      ⌐      reversed not sign
  170     \xAA         ¬      ¬      logical not sign
  171     \xAB         ½      ½      one half sign
  172     \xAC         ¼      ¼      one fourth sign
  173     \xAD         ¡      ¡      inverted exclamation mark
  174     \xAE         «      «      open french quotation mark
  175     \xAF         »      »      close french quotation mark
  176     \xB0         ░      ░      graphblock light shade
  177     \xB1         ▒      ▒      graphblock medium shade
  178     \xB2         ▓      ▓      graphblock dark shade
  179     \xB3         │      │      graphsingle vertical
  180     \xB4         ┤      ┤      graphsingle east
  181     \xB5         ╡      ╡      graphdouble top east
  182     \xB6         ╢      ╢      graphdouble side east
  183     \xB7         ╖      ╖      graphdouble side north east
  184     \xB8         ╕      ╕      graphdouble top north east
  185     \xB9         ╣      ╣      graphdouble both east
  186     \xBA         ║      ║      graphdouble both vertical
  187     \xBB         ╗      ╗      graphdouble both north east
  188     \xBC         ╝      ╝      graphdouble both south east
  189     \xBD         ╜      ╜      graphdouble side south east
  190     \xBE         ╛      ╛      graphdouble top south east
  191     \xBF         ┐      ┐      graphsingle north east
  192     \xC0         └      └      graphsingle south west
  193     \xC1         ┴      ┴      graphsingle south
  194     \xC2         ┬      ┬      graphsingle north
  195     \xC3         ├      ├      graphsingle west
  196     \xC4         ─      ─      graphsingle horizontal
  197     \xC5         ┼      ┼      graphsingle middle
  198     \xC6         ╞      ╞      graphdouble top west
  199     \xC7         ╟      ╟      graphdouble side west
  200     \xC8         ╚      ╚      graphdouble both south west
  201     \xC9         ╔      ╔      graphdouble both north west
  202     \xCA         ╩      ╩      graphdouble both south
  203     \xCB         ╦      ╦      graphdouble both north
  204     \xCC         ╠      ╠      graphdouble both west
  205     \xCD         ═      ═      graphdouble both horizontal
  206     \xCE         ╬      ╬      graphdouble both middle
  207     \xCF         ╧      ╧      graphdouble top south
  208     \xD0         ╨      ╨      graphdouble side south
  209     \xD1         ╤      ╤      graphdouble top north
  210     \xD2         ╥      ╥      graphdouble side north
  211     \xD3         ╙      ╙      graphdouble side south west
  212     \xD4         ╘      ╘      graphdouble top south west
  213     \xD5         ╒      ╒      graphdouble top north west
  214     \xD6         ╓      ╓      graphdouble side north west
  215     \xD7         ╫      ╫      graphdouble side middle
  216     \xD8         ╪      ╪      graphdouble top middle
  217     \xD9         ┘      ┘      graphsingle south east
  218     \xDA         ┌      ┌      graphsingle north west
  219     \xDB         █      █      graphblock full
  220     \xDC         ▄      ▄      graphblock lower half
  221     \xDD         ▌      ▌      graphblock left half
  222     \xDE         ▐      ▐      graphblock right half
  223     \xDF         ▀      ▀      graphblock upper half
  224     \xE0         α      α      lower case alpha
  225     \xE1         ß      ß      lower case beta
  226     \xE2         Γ      Γ      upper case GAMMA
  227     \xE3         π      π      lower case pi
  228     \xE4         Σ      Σ      upper case SIGMA
  229     \xE5         σ      σ      lower case sigma
  230     \xE6         µ      µ      lower case mu
  231     \xE7         τ      τ      lower case tau
  232     \xE8         Φ      Φ      upper case PHI
  233     \xE9         Θ      Θ      upper case THETA
  234     \xEA         Ω      Ω      upper case OMEGA
  235     \xEB         δ      δ      lower case delta
  236     \xEC         ∞      ∞      infinity sign
  237     \xED         φ      φ      lower case phi
  238     \xEE         ε      ε      lower case epsilon
  239     \xEF         ∩      ∩      intersection
  240     \xF0         ≡      ≡      mathematical identity sign
  241     \xF1         ±      ±      plus minus sign
  242     \xF2         ≥      ≥      greater than or equal sign
  243     \xF3         ≤      ≤      less than or equal sign
  244     \xF4         ⌠      ⌠      upper integral sign
  245     \xF5         ⌡      ⌡      lower integral sign
  246     \xF6         ÷      ÷      division sign
  247     \xF7         ≈      ≈      approximately equal sign
  248     \xF8         °      °      degree sign
  249     \xF9         ∙      ∙      bullet operator
  250     \xFA         ·      ·      middle dot
  251     \xFB         √      √      square root sign
  252     \xFC         ⁿ      ⁿ      superscript n sign
  253     \xFD         ²      ²      superscript 2 sign
  254     \xFE         ■      ■      filled square
  255     \xFF         \xFF   \xFF   ?
-----------------------------------------------------------------------
===
-----------------------------------------------------------------------
TABLE: ASCII and HEXADECIMAL: CLASS: RANGE
-----------------------------------------------------------------------
ASCII|HEX| C| CLASS                                                   |
-----------------------------------------------------------------------
000  \x00  - control,ascii,unicode
001  \x01  - control,ascii,unicode
002  \x02  - control,ascii,unicode
003  \x03  - control,ascii,unicode
004  \x04  - control,ascii,unicode
005  \x05  - control,ascii,unicode
006  \x06  - control,ascii,unicode
007  \x07  - control,ascii,unicode
008  \x08  - control,ascii,unicode
009  \x09  - control,whitespace,ascii,unicode
010  \x0A  - control,whitespace,ascii,unicode
011  \x0B  - control,whitespace,ascii,unicode
012  \x0C  - control,whitespace,ascii,unicode
013  \x0D  - control,whitespace,ascii,unicode
014  \x0E  - control,ascii,unicode
015  \x0F  - control,ascii,unicode
016  \x10  - control,ascii,unicode
017  \x11  - control,ascii,unicode
018  \x12  - control,ascii,unicode
019  \x13  - control,ascii,unicode
020  \x14  - control,ascii,unicode
021  \x15  - control,ascii,unicode
022  \x16  - control,ascii,unicode
023  \x17  - control,ascii,unicode
024  \x18  - control,ascii,unicode
025  \x19  - control,ascii,unicode
026  \x1A  - control,ascii,unicode
027  \x1B  - control,ascii,unicode
028  \x1C  - control,ascii,unicode
029  \x1D  - control,ascii,unicode
030  \x1E  - control,ascii,unicode
031  \x1F  - control,ascii,unicode
032  \x20  - whitespace,print+space,ascii,unicode
033  \x21  ! punctuation,print,print+space,ascii,unicode
034  \x22  " punctuation,print,print+space,ascii,unicode
035  \x23  # punctuation,print,print+space,ascii,unicode
036  \x24  $ punctuation,print,print+space,ascii,unicode
037  \x25  % punctuation,print,print+space,ascii,unicode
038  \x26  & punctuation,print,print+space,ascii,unicode
039  \x27  ' punctuation,print,print+space,ascii,unicode
040  \x28  ( punctuation,print,print+space,ascii,unicode
041  \x29  ) punctuation,print,print+space,ascii,unicode
042  \x2A  * punctuation,print,print+space,ascii,unicode
043  \x2B  + punctuation,print,print+space,ascii,unicode
044  \x2C  , punctuation,print,print+space,ascii,unicode
045  \x2D  - punctuation,print,print+space,ascii,unicode
046  \x2E  . punctuation,print,print+space,ascii,unicode
047  \x2F  / punctuation,print,print+space,ascii,unicode
048  \x30  0 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
049  \x31  1 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
050  \x32  2 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
051  \x33  3 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
052  \x34  4 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
053  \x35  5 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
054  \x36  6 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
055  \x37  7 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
056  \x38  8 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
057  \x39  9 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
058  \x3A  : punctuation,print,print+space,ascii,unicode
059  \x3B  ; punctuation,print,print+space,ascii,unicode
060  \x3C  < punctuation,print,print+space,ascii,unicode
061  \x3D  = punctuation,print,print+space,ascii,unicode
062  \x3E  > punctuation,print,print+space,ascii,unicode
063  \x3F  ? punctuation,print,print+space,ascii,unicode
064  \x40  @ punctuation,print,print+space,ascii,unicode
065  \x41  A upper,hex,alphan,alphan+_,print,print+space,ascii,unicode
066  \x42  B upper,hex,alphan,alphan+_,print,print+space,ascii,unicode
067  \x43  C upper,hex,alphan,alphan+_,print,print+space,ascii,unicode
068  \x44  D upper,hex,alphan,alphan+_,print,print+space,ascii,unicode
069  \x45  E upper,hex,alphan,alphan+_,print,print+space,ascii,unicode
070  \x46  F upper,hex,alphan,alphan+_,print,print+space,ascii,unicode
071  \x47  G upper,alphan,alphan+_,print,print+space,ascii,unicode
072  \x48  H upper,alphan,alphan+_,print,print+space,ascii,unicode
073  \x49  I upper,alphan,alphan+_,print,print+space,ascii,unicode
074  \x4A  J upper,alphan,alphan+_,print,print+space,ascii,unicode
075  \x4B  K upper,alphan,alphan+_,print,print+space,ascii,unicode
076  \x4C  L upper,alphan,alphan+_,print,print+space,ascii,unicode
077  \x4D  M upper,alphan,alphan+_,print,print+space,ascii,unicode
078  \x4E  N upper,alphan,alphan+_,print,print+space,ascii,unicode
079  \x4F  O upper,alphan,alphan+_,print,print+space,ascii,unicode
080  \x50  P upper,alphan,alphan+_,print,print+space,ascii,unicode
081  \x51  Q upper,alphan,alphan+_,print,print+space,ascii,unicode
082  \x52  R upper,alphan,alphan+_,print,print+space,ascii,unicode
083  \x53  S upper,alphan,alphan+_,print,print+space,ascii,unicode
084  \x54  T upper,alphan,alphan+_,print,print+space,ascii,unicode
085  \x55  U upper,alphan,alphan+_,print,print+space,ascii,unicode
086  \x56  V upper,alphan,alphan+_,print,print+space,ascii,unicode
087  \x57  W upper,alphan,alphan+_,print,print+space,ascii,unicode
088  \x58  X upper,alphan,alphan+_,print,print+space,ascii,unicode
089  \x59  Y upper,alphan,alphan+_,print,print+space,ascii,unicode
090  \x5A  Z upper,alphan,alphan+_,print,print+space,ascii,unicode
091  \x5B  [ punctuation,print,print+space,ascii,unicode
092  \x5C  \ punctuation,print,print+space,ascii,unicode
093  \x5D  ] punctuation,print,print+space,ascii,unicode
094  \x5E  ^ punctuation,print,print+space,ascii,unicode
095  \x5F  _ punctuation,alphan+_,print,print+space,ascii,unicode
096  \x60  ` punctuation,print,print+space,ascii,unicode
097  \x61  a lower,hex,alphan,alphan+_,print,print+space,ascii,unicode
098  \x62  b lower,hex,alphan,alphan+_,print,print+space,ascii,unicode
099  \x63  c lower,hex,alphan,alphan+_,print,print+space,ascii,unicode
100  \x64  d lower,hex,alphan,alphan+_,print,print+space,ascii,unicode
101  \x65  e lower,hex,alphan,alphan+_,print,print+space,ascii,unicode
102  \x66  f lower,hex,alphan,alphan+_,print,print+space,ascii,unicode
103  \x67  g lower,alphan,alphan+_,print,print+space,ascii,unicode
104  \x68  h lower,alphan,alphan+_,print,print+space,ascii,unicode
105  \x69  i lower,alphan,alphan+_,print,print+space,ascii,unicode
106  \x6A  j lower,alphan,alphan+_,print,print+space,ascii,unicode
107  \x6B  k lower,alphan,alphan+_,print,print+space,ascii,unicode
108  \x6C  l lower,alphan,alphan+_,print,print+space,ascii,unicode
109  \x6D  m lower,alphan,alphan+_,print,print+space,ascii,unicode
110  \x6E  n lower,alphan,alphan+_,print,print+space,ascii,unicode
111  \x6F  o lower,alphan,alphan+_,print,print+space,ascii,unicode
112  \x70  p lower,alphan,alphan+_,print,print+space,ascii,unicode
113  \x71  q lower,alphan,alphan+_,print,print+space,ascii,unicode
114  \x72  r lower,alphan,alphan+_,print,print+space,ascii,unicode
115  \x73  s lower,alphan,alphan+_,print,print+space,ascii,unicode
116  \x74  t lower,alphan,alphan+_,print,print+space,ascii,unicode
117  \x75  u lower,alphan,alphan+_,print,print+space,ascii,unicode
118  \x76  v lower,alphan,alphan+_,print,print+space,ascii,unicode
119  \x77  w lower,alphan,alphan+_,print,print+space,ascii,unicode
120  \x78  x lower,alphan,alphan+_,print,print+space,ascii,unicode
121  \x79  y lower,alphan,alphan+_,print,print+space,ascii,unicode
122  \x7A  z lower,alphan,alphan+_,print,print+space,ascii,unicode
123  \x7B  { punctuation,print,print+space,ascii,unicode
124  \x7C  | punctuation,print,print+space,ascii,unicode
125  \x7D  } punctuation,print,print+space,ascii,unicode
126  \x7E  ~ punctuation,print,print+space,ascii,unicode
127  \x7F  - control,ascii,unicode
128  \x80  Ç ascii,unicode
129  \x81  ü ascii,unicode
130  \x82  é ascii,unicode
131  \x83  â ascii,unicode
132  \x84  ä ascii,unicode
133  \x85  à ascii,unicode
134  \x86  å ascii,unicode
135  \x87  ç ascii,unicode
136  \x88  ê ascii,unicode
137  \x89  ë ascii,unicode
138  \x8A  è ascii,unicode
139  \x8B  ï ascii,unicode
140  \x8C  î ascii,unicode
141  \x8D  ì ascii,unicode
142  \x8E  Ä ascii,unicode
143  \x8F  Å ascii,unicode
144  \x90  É ascii,unicode
145  \x91  æ ascii,unicode
146  \x92  Æ ascii,unicode
147  \x93  ô ascii,unicode
148  \x94  ö ascii,unicode
149  \x95  ò ascii,unicode
150  \x96  û ascii,unicode
151  \x97  ù ascii,unicode
152  \x98  ÿ ascii,unicode
153  \x99  Ö ascii,unicode
154  \x9A  Ü ascii,unicode
155  \x9B  ¢ ascii,unicode
156  \x9C  £ ascii,unicode
157  \x9D  ¥ ascii,unicode
158  \x9E  ₧ ascii,unicode
159  \x9F  ƒ ascii,unicode
160  \xA0  á ascii,unicode
161  \xA1  í ascii,unicode
162  \xA2  ó ascii,unicode
163  \xA3  ú ascii,unicode
164  \xA4  ñ ascii,unicode
165  \xA5  Ñ ascii,unicode
166  \xA6  ª ascii,unicode
167  \xA7  º ascii,unicode
168  \xA8  ¿ ascii,unicode
169  \xA9  ⌐ ascii,unicode
170  \xAA  ¬ ascii,unicode
171  \xAB  ½ ascii,unicode
172  \xAC  ¼ ascii,unicode
173  \xAD  ¡ ascii,unicode
174  \xAE  « ascii,unicode
175  \xAF  » ascii,unicode
176  \xB0  ░ ascii,unicode
177  \xB1  ▒ ascii,unicode
178  \xB2  ▓ ascii,unicode
179  \xB3  │ ascii,unicode
180  \xB4  ┤ ascii,unicode
181  \xB5  ╡ ascii,unicode
182  \xB6  ╢ ascii,unicode
183  \xB7  ╖ ascii,unicode
184  \xB8  ╕ ascii,unicode
185  \xB9  ╣ ascii,unicode
186  \xBA  ║ ascii,unicode
187  \xBB  ╗ ascii,unicode
188  \xBC  ╝ ascii,unicode
189  \xBD  ╜ ascii,unicode
190  \xBE  ╛ ascii,unicode
191  \xBF  ┐ ascii,unicode
192  \xC0  └ ascii,unicode
193  \xC1  ┴ ascii,unicode
194  \xC2  ┬ ascii,unicode
195  \xC3  ├ ascii,unicode
196  \xC4  ─ ascii,unicode
197  \xC5  ┼ ascii,unicode
198  \xC6  ╞ ascii,unicode
199  \xC7  ╟ ascii,unicode
200  \xC8  ╚ ascii,unicode
201  \xC9  ╔ ascii,unicode
202  \xCA  ╩ ascii,unicode
203  \xCB  ╦ ascii,unicode
204  \xCC  ╠ ascii,unicode
205  \xCD  ═ ascii,unicode
206  \xCE  ╬ ascii,unicode
207  \xCF  ╧ ascii,unicode
208  \xD0  ╨ ascii,unicode
209  \xD1  ╤ ascii,unicode
210  \xD2  ╥ ascii,unicode
211  \xD3  ╙ ascii,unicode
212  \xD4  ╘ ascii,unicode
213  \xD5  ╒ ascii,unicode
214  \xD6  ╓ ascii,unicode
215  \xD7  ╫ ascii,unicode
216  \xD8  ╪ ascii,unicode
217  \xD9  ┘ ascii,unicode
218  \xDA  ┌ ascii,unicode
219  \xDB  █ ascii,unicode
220  \xDC  ▄ ascii,unicode
221  \xDD  ▌ ascii,unicode
222  \xDE  ▐ ascii,unicode
223  \xDF  ▀ ascii,unicode
224  \xE0  α ascii,unicode
225  \xE1  ß ascii,unicode
226  \xE2  Γ ascii,unicode
227  \xE3  π ascii,unicode
228  \xE4  Σ ascii,unicode
229  \xE5  σ ascii,unicode
230  \xE6  µ ascii,unicode
231  \xE7  τ ascii,unicode
232  \xE8  Φ ascii,unicode
233  \xE9  Θ ascii,unicode
234  \xEA  Ω ascii,unicode
235  \xEB  δ ascii,unicode
236  \xEC  ∞ ascii,unicode
237  \xED  φ ascii,unicode
238  \xEE  ε ascii,unicode
239  \xEF  ∩ ascii,unicode
240  \xF0  ≡ ascii,unicode
241  \xF1  ± ascii,unicode
242  \xF2  ≥ ascii,unicode
243  \xF3  ≤ ascii,unicode
244  \xF4  ⌠ ascii,unicode
245  \xF5  ⌡ ascii,unicode
246  \xF6  ÷ ascii,unicode
247  \xF7  ≈ ascii,unicode
248  \xF8  ° ascii,unicode
249  \xF9  ∙ ascii,unicode
250  \xFA  · ascii,unicode
251  \xFB  √ ascii,unicode
252  \xFC  ⁿ ascii,unicode
253  \xFD  ² ascii,unicode
254  \xFE  ■ ascii,unicode
255  \xFF  - ascii,unicode
-----------------------------------------------------------------------
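The CLASS column can be derived from the bracket expressions in the
TSE column of the RANGE table. The following is a minimal sketch, not
the method used to build the table: it uses Python's re module as a
stand-in for Perl/TSE, covers only a subset of the classes, and the
names CLASSES and classes_of are hypothetical.

```python
import re

# Bracket expressions copied from the TSE column of the RANGE table.
# Only a subset of the classes is listed. The "[" inside the
# punctuation class is escaped for Python.
CLASSES = [
    ("punctuation", r"[!-/:-@\[-`{-~]"),
    ("digit",       r"[0-9]"),
    ("upper",       r"[A-Z]"),
    ("lower",       r"[a-z]"),
    ("hex",         r"[0-9A-Fa-f]"),
    ("alphan",      r"[A-Za-z0-9]"),
    ("alphan+_",    r"[A-Za-z0-9_]"),
    ("print",       r"[!-~]"),
    ("print+space", r"[ -~]"),
]

def classes_of(code):
    """Return the names of all listed classes that match character code."""
    ch = chr(code)
    return [name for name, pattern in CLASSES if re.fullmatch(pattern, ch)]

print(",".join(classes_of(0x41)))  # upper,hex,alphan,alphan+_,print,print+space
print(",".join(classes_of(0x30)))  # digit,hex,alphan,alphan+_,print,print+space
print(",".join(classes_of(0x21)))  # punctuation,print,print+space
```

Because each class is just a bracket expression, the same check can
be done in TSE or Perl by searching a single character against the
corresponding expression.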
===
Internet: see also:
---
TSE: Search/Replace: Regular expression: Link: Can you give overview 
links regular expressions?
http://www.faqts.com/knowledge_base/view.phtml/aid/31433/fid/865
----------------------------------------------------------------------