TSE: Can you give conversion tables to convert regular expressions from Perl to TSE? [POSIX]
Nov 20th, 2006 12:53
Knud van Eeden,
----------------------------------------------------------------------
--- Knud van Eeden --- 19 November 2020 - 07:49 pm -------------------
TSE: Can you give conversion tables to convert regular expressions
from Perl to TSE? [POSIX]
---
Conversion tables for converting Perl to TSE regular expressions
===
-----------------------------------------------------------------------
TABLE: RANGE
-----------------------------------------------------------------------
|DESCRIPTION     |PERL                 |TSE              |POSIX         |PERL FN          |
-----------------------------------------------------------------------
 control          [\x00-\x1F\x7F]       [\x00-\x1F\x7F]   [[:cntrl:]]    IsCntrl
 white space+tab  [ \t]                 [ \t]             [[:blank:]]    IsBlank
 whitespace       [ \f\t\x0B]           [ \f\t\v]         [[:space:]]    IsSpace
 whitespace Perl  \s                    [ \f\n\r\t\v]     [[:space:]]    IsSpacePerl
 punctuation      [!-/:-@[-`{-~]        [!-/:-@[-`{-~]    [[:punct:]]    IsPunct
 decimal digit    \d                    [0-9]             [[:digit:]]    IsDigit
 hexadecimal      [0-9A-Fa-f]           [0-9A-Fa-f]       [[:xdigit:]]   IsXDigit
 upper            [A-Z]                 [A-Z]             [[:upper:]]    IsUpper
 lower            [a-z]                 [a-z]             [[:lower:]]    IsLower
 upper+lower      [A-Za-z]              [A-Za-z]          [[:alpha:]]    IsAlpha
 alphanumeric     [A-Za-z0-9]           [A-Za-z0-9]       [[:alnum:]]    IsAlnum
 alphanumeric+_   \w                    [A-Za-z0-9_]      [[:word:]]     IsWord
 printable        [!-~]                 [!-~]             [[:graph:]]    IsGraph
 printable+space  [ -~]                 [ -~]             [[:print:]]    IsPrint
 any ASCII        [\x00-\xFF]           [\x00-\xFF]       [[:ascii:]]    IsASCII
 any Unicode      [\x{0000}-\x{FFFF}]   not available?    ?              ?
 not control      [^\x00-\x1F\x7F]      [~\x00-\x1F\x7F]  [^[:cntrl:]]   \P{IsCntrl}
 not space+tab    [^ \t]                [~ \t]            [^[:blank:]]   \P{IsBlank}
 not whitespace   [^ \f\t\x0B]          [~ \f\t\v]        [^[:space:]]   \P{IsSpace}
 not whtspc Perl  \S                    [~ \f\n\r\t\v]    [^[:space:]]   \P{IsSpacePerl}
 not punctuation  [^!-/:-@[-`{-~]       [~!-/:-@[-`{-~]   [^[:punct:]]   \P{IsPunct}
 not decimal dig  \D                    [~0-9]            [^[:digit:]]   \P{IsDigit}
 not hexadecimal  [^0-9A-Fa-f]          [~0-9A-Fa-f]      [^[:xdigit:]]  \P{IsXDigit}
 not upper        [^A-Z]                [~A-Z]            [^[:upper:]]   \P{IsUpper}
 not lower        [^a-z]                [~a-z]            [^[:lower:]]   \P{IsLower}
 not upper+lower  [^A-Za-z]             [~A-Za-z]         [^[:alpha:]]   \P{IsAlpha}
 not alphanumc    [^A-Za-z0-9]          [~A-Za-z0-9]      [^[:alnum:]]   \P{IsAlnum}
 not alphnumc+_   \W                    [~A-Za-z0-9_]     [^[:word:]]    \P{IsWord}
 not printable    [^!-~]                [~!-~]            [^[:graph:]]   \P{IsGraph}
 not prt+space    [^ -~]                [~ -~]            [^[:print:]]   \P{IsPrint}
 not any ASCII    [^\x00-\xFF]          [~\x00-\xFF]      [^[:ascii:]]   \P{IsASCII}
 not any Unicode  [^\x{0000}-\x{FFFF}]  not available?    ?              ?
----------------------------------------------------------------------
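As a minimal sketch of how the shorthand rows of the RANGE table could be
applied mechanically, the Python function below rewrites \d, \w, \s and their
negations into the TSE bracket classes listed above. The names
PERL_TO_TSE_CLASS and translate_shorthand are hypothetical. The sketch assumes
the shorthand does not occur inside a [...] class (a nested bracket would be
wrong there), and it leaves quantifiers and grouping untouched.

```python
import re

# Perl shorthand -> TSE bracket class, copied from the RANGE table.
# TSE negates a class with "~" where Perl uses "^".
PERL_TO_TSE_CLASS = {
    r"\d": "[0-9]",
    r"\D": "[~0-9]",
    r"\w": "[A-Za-z0-9_]",
    r"\W": "[~A-Za-z0-9_]",
    r"\s": r"[ \f\n\r\t\v]",
    r"\S": r"[~ \f\n\r\t\v]",
}


def translate_shorthand(perl_pattern: str) -> str:
    """Replace Perl shorthand classes with the TSE classes from the table.

    Assumption: the shorthand is not used inside a [...] class.
    """
    def swap(match):
        token = match.group(0)
        # Unknown escapes (for example \. or \\) are copied unchanged.
        return PERL_TO_TSE_CLASS.get(token, token)

    # Consume escapes pairwise so that "\\d" (escaped backslash, then the
    # letter d) is not mistaken for the shorthand \d.
    return re.sub(r"\\.", swap, perl_pattern)


print(translate_shorthand(r"\d\d-\w"))
```

The pairwise scan is a design choice: it keeps an escaped backslash followed
by a letter from being misread as a shorthand class.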
===
-----------------------------------------------------------------------
TABLE: OPERATOR: ALTERNATION / CLOSURE / CONCATENATION / GROUPING
-----------------------------------------------------------------------
|PERL      |TSE            |NAME        |CLASS: TYPE   |CLASS: TYPE: SUB      |
-----------------------------------------------------------------------
 $&,$1-$9   \0..\9          tag          -              -
 |          |               or           alternation    -
 ?          not available   0 or 1       closure        maximum: greedy
 *          @               0 or more    closure        maximum: greedy
 +          #               1 or more    closure        maximum: greedy
 ?+         not available   0 or 1       closure        maximum: possessive
 *+         not available   0 or more    closure        maximum: possessive
 ++         not available   1 or more    closure        maximum: possessive
 ??         ?               0 or 1       closure        minimum
 *?         *               0 or more    closure        minimum
 +?         +               1 or more    closure        minimum
 {n,m}      not available?  n to m       closure        maximum: greedy
 {n,}       not available?  n or more    closure        maximum: greedy
 {,n}       not available?  0 to n       closure        maximum: greedy
 {n}        not available?  n times      closure        maximum: greedy
 {n,m}?     not available?  n to m       closure        minimum
 {n,}?      not available?  n or more    closure        minimum
 {,n}?      not available?  0 to n       closure        minimum
 {n}?       not available?  n times      closure        minimum
 [...]      [...]           class        concatenation  character: single: 1
 [^...]     [~...]          class not    concatenation  character: single: 1
 .          .               any char     concatenation  character: single: 1
 \          \               escape char  concatenation  character: single: 1
 \xHH       \xHH            hexadecimal  concatenation  character: single: 1
 \ooo       \oOOO           octal        concatenation  character: single: 1
 -          \dDDD           decimal      concatenation  character: single: 1
 [.-.]      [.-.]           range        concatenation  character: single: 1
 (...)      {...}           group        group          -
-----------------------------------------------------------------------
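The operator rows above can be sketched as a small character-by-character
translator. This is an illustration under stated assumptions, not a complete
converter: the function name perl_to_tse_operators is hypothetical, constructs
the table marks as "not available" are rejected with an error, (?...) groups
and POSIX classes inside brackets are not handled, and literal @, #, { or } in
the Perl pattern (which are operators in TSE) are not escaped.

```python
def perl_to_tse_operators(pattern: str) -> str:
    """Translate Perl closure, class-negation and grouping operators to TSE."""
    out = []
    i = 0
    n = len(pattern)
    while i < n:
        ch = pattern[i]
        nxt = pattern[i + 1] if i + 1 < n else ""
        if ch == "\\":
            # Escaped character: copy the backslash and the next character.
            out.append(pattern[i:i + 2])
            i += 2
        elif ch == "[":
            # Character class: copy it whole, turning a leading ^ into ~.
            j = i + 1
            negated = j < n and pattern[j] == "^"
            if negated:
                j += 1
            start = j
            if j < n and pattern[j] == "]":
                j += 1  # a ] directly after [ or [^ is a literal member
            while j < n and pattern[j] != "]":
                j += 2 if pattern[j] == "\\" else 1
            if j >= n:
                raise ValueError("unterminated character class")
            out.append("[" + ("~" if negated else "") + pattern[start:j] + "]")
            i = j + 1
        elif ch in "?*+":
            if nxt == "+":
                raise ValueError("possessive closure is not available in TSE")
            if nxt == "?":
                out.append(ch)  # minimum closure: ?? -> ?, *? -> *, +? -> +
                i += 2
            elif ch == "?":
                raise ValueError("greedy ? is listed as not available in TSE")
            else:
                out.append("@" if ch == "*" else "#")  # maximum closure
                i += 1
        elif ch == "(":
            if nxt == "?":
                raise ValueError("(?...) constructs are not covered here")
            out.append("{")
            i += 1
        elif ch == ")":
            out.append("}")
            i += 1
        elif ch == "{":
            raise ValueError("{n,m} counts are listed as 'not available?'")
        else:
            out.append(ch)
            i += 1
    return "".join(out)


print(perl_to_tse_operators(r"(ab)*c+"))
print(perl_to_tse_operators(r"[^0-9]+x*?"))
```

Raising an error for unsupported constructs, instead of silently passing them
through, keeps a Perl pattern from being turned into a TSE pattern with a
different meaning.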
===
-----------------------------------------------------------------------
TABLE: ANCHOR
-----------------------------------------------------------------------
|OBJECT  |POSITION      |PERL        |TSE         |
-----------------------------------------------------------------------
 cursor   after          do not know  \c
 line     begin          ^            ^
 line     end            $            $
 string   begin          \A           do not know
 match    previous end   \G           do not know
 string   end            \Z           do not know
 string   end            \z           do not know
 word     begin or end   \b           do not know
-----------------------------------------------------------------------
===
Note:
The ....; values in this table should be single graphical characters.
The table shows the HTML Unicode entities for these single ASCII
characters instead. This conversion happened after pasting the
original table.
===
The table below corresponds to the characters as shown
when using the font 'Terminal'
===
-----------------------------------------------------------------------
TABLE: ASCII and HEXADECIMAL: PERL: TSE
-----------------------------------------------------------------------
| ASCII | HEXADECIMAL| PERL | TSE | LETTER |
-----------------------------------------------------------------------
000 \x00 \x00 \x00 -
001 \x01 \x01 \x01 ^A
002 \x02 \x02 \x02 ^B
003 \x03 \x03 \x03 ^C
004 \x04 \x04 \x04 ^D
005 \x05 \x05 \x05 ^E
006 \x06 \x06 \x06 ^F
007 \x07 \a \a ^G
008 \x08 \x08 \b ^H
009 \x09 \t \t ^I
010 \x0A \n \n ^J
011 \x0B \x0B \v ^K
012 \x0C \f \f ^L
013 \x0D \r \r ^M
014 \x0E \x0E \x0E ^N
015 \x0F \x0F \x0F ^O
016 \x10 \x10 \x10 ^P
017 \x11 \x11 \x11 ^Q
018 \x12 \x12 \x12 ^R
019 \x13 \x13 \x13 ^S
020 \x14 \x14 \x14 ^T
021 \x15 \x15 \x15 ^U
022 \x16 \x16 \x16 ^V
023 \x17 \x17 \x17 ^W
024 \x18 \x18 \x18 ^X
025 \x19 \x19 \x19 ^Y
026 \x1A \x1A \x1A ^Z
027 \x1B \e \x1B ^[
028 \x1C \x1C \x1C ^\
029 \x1D \x1D \x1D ^]
030 \x1E \x1E \x1E ^^
031 \x1F \x1F \x1F ^_
032 \x20 \x20 \x20 ?
033 \x21 ! ! !
034 \x22 " " "
035 \x23 # # #
036 \x24 $ $ $
037 \x25 % % %
038 \x26 & & &
039 \x27 ' ' '
040 \x28 ( ( (
041 \x29 ) ) )
042 \x2A * * *
043 \x2B + + +
044 \x2C , , ,
045 \x2D - - -
046 \x2E . . .
047 \x2F / / /
048 \x30 0 0 0
049 \x31 1 1 1
050 \x32 2 2 2
051 \x33 3 3 3
052 \x34 4 4 4
053 \x35 5 5 5
054 \x36 6 6 6
055 \x37 7 7 7
056 \x38 8 8 8
057 \x39 9 9 9
058 \x3A : : :
059 \x3B ; ; ;
060 \x3C < < <
061 \x3D = = =
062 \x3E > > >
063 \x3F ? ? ?
064 \x40 @ @ @
065 \x41 A A A
066 \x42 B B B
067 \x43 C C C
068 \x44 D D D
069 \x45 E E E
070 \x46 F F F
071 \x47 G G G
072 \x48 H H H
073 \x49 I I I
074 \x4A J J J
075 \x4B K K K
076 \x4C L L L
077 \x4D M M M
078 \x4E N N N
079 \x4F O O O
080 \x50 P P P
081 \x51 Q Q Q
082 \x52 R R R
083 \x53 S S S
084 \x54 T T T
085 \x55 U U U
086 \x56 V V V
087 \x57 W W W
088 \x58 X X X
089 \x59 Y Y Y
090 \x5A Z Z Z
091 \x5B [ [ [
092 \x5C \ \ \
093 \x5D ] ] ]
094 \x5E ^ ^ ^
095 \x5F _ _ _
096 \x60 ` ` `
097 \x61 a a a
098 \x62 b b b
099 \x63 c c c
100 \x64 d d d
101 \x65 e e e
102 \x66 f f f
103 \x67 g g g
104 \x68 h h h
105 \x69 i i i
106 \x6A j j j
107 \x6B k k k
108 \x6C l l l
109 \x6D m m m
110 \x6E n n n
111 \x6F o o o
112 \x70 p p p
113 \x71 q q q
114 \x72 r r r
115 \x73 s s s
116 \x74 t t t
117 \x75 u u u
118 \x76 v v v
119 \x77 w w w
120 \x78 x x x
121 \x79 y y y
122 \x7A z z z
123 \x7B { { {
124 \x7C | | |
125 \x7D } } }
126 \x7E ~ ~ ~
127 \x7F \x7F \x7F ^?
128 \x80 Ç Ç Ç
129 \x81 ü ü ü
130 \x82 é é é
131 \x83 â â â
132 \x84 ä ä ä
133 \x85 à à à
134 \x86 å å å
135 \x87 ç ç ç
136 \x88 ê ê ê
137 \x89 ë ë ë
138 \x8A è è è
139 \x8B ï ï ï
140 \x8C î î î
141 \x8D ì ì ì
142 \x8E Ä Ä Ä
143 \x8F Å Å Å
144 \x90 É É É
145 \x91 æ æ æ
146 \x92 Æ Æ Æ
147 \x93 ô ô ô
148 \x94 ö ö ö
149 \x95 ò ò ò
150 \x96 û û û
151 \x97 ù ù ù
152 \x98 ÿ ÿ ÿ
153 \x99 Ö Ö Ö
154 \x9A Ü Ü Ü
155 \x9B ¢ ¢ ¢
156 \x9C £ £ £
157 \x9D ¥ ¥ ¥
158 \x9E ₧ ₧ ₧
159 \x9F ƒ ƒ ƒ
160 \xA0 á á á
161 \xA1 í í í
162 \xA2 ó ó ó
163 \xA3 ú ú ú
164 \xA4 ñ ñ ñ
165 \xA5 Ñ Ñ Ñ
166 \xA6 ª ª ª
167 \xA7 º º º
168 \xA8 ¿ ¿ ¿
169 \xA9 ⌐ ⌐ ⌐
170 \xAA ¬ ¬ ¬
171 \xAB ½ ½ ½
172 \xAC ¼ ¼ ¼
173 \xAD ¡ ¡ ¡
174 \xAE « « «
175 \xAF » » »
176 \xB0 ░ ░ ░
177 \xB1 ▒ ▒ ▒
178 \xB2 ▓ ▓ ▓
179 \xB3 │ │ │
180 \xB4 ┤ ┤ ┤
181 \xB5 ╡ ╡ ╡
182 \xB6 ╢ ╢ ╢
183 \xB7 ╖ ╖ ╖
184 \xB8 ╕ ╕ ╕
185 \xB9 ╣ ╣ ╣
186 \xBA ║ ║ ║
187 \xBB ╗ ╗ ╗
188 \xBC ╝ ╝ ╝
189 \xBD ╜ ╜ ╜
190 \xBE ╛ ╛ ╛
191 \xBF ┐ ┐ ┐
192 \xC0 └ └ └
193 \xC1 ┴ ┴ ┴
194 \xC2 ┬ ┬ ┬
195 \xC3 ├ ├ ├
196 \xC4 ─ ─ ─
197 \xC5 ┼ ┼ ┼
198 \xC6 ╞ ╞ ╞
199 \xC7 ╟ ╟ ╟
200 \xC8 ╚ ╚ ╚
201 \xC9 ╔ ╔ ╔
202 \xCA ╩ ╩ ╩
203 \xCB ╦ ╦ ╦
204 \xCC ╠ ╠ ╠
205 \xCD ═ ═ ═
206 \xCE ╬ ╬ ╬
207 \xCF ╧ ╧ ╧
208 \xD0 ╨ ╨ ╨
209 \xD1 ╤ ╤ ╤
210 \xD2 ╥ ╥ ╥
211 \xD3 ╙ ╙ ╙
212 \xD4 ╘ ╘ ╘
213 \xD5 ╒ ╒ ╒
214 \xD6 ╓ ╓ ╓
215 \xD7 ╫ ╫ ╫
216 \xD8 ╪ ╪ ╪
217 \xD9 ┘ ┘ ┘
218 \xDA ┌ ┌ ┌
219 \xDB █ █ █
220 \xDC ▄ ▄ ▄
221 \xDD ▌ ▌ ▌
222 \xDE ▐ ▐ ▐
223 \xDF ▀ ▀ ▀
224 \xE0 α α α
225 \xE1 ß ß ß
226 \xE2 Γ Γ Γ
227 \xE3 π π π
228 \xE4 Σ Σ Σ
229 \xE5 σ σ σ
230 \xE6 µ µ µ
231 \xE7 τ τ τ
232 \xE8 Φ Φ Φ
233 \xE9 Θ Θ Θ
234 \xEA Ω Ω Ω
235 \xEB δ δ δ
236 \xEC ∞ ∞ ∞
237 \xED φ φ φ
238 \xEE ε ε ε
239 \xEF ∩ ∩ ∩
240 \xF0 ≡ ≡ ≡
241 \xF1 ± ± ±
242 \xF2 ≥ ≥ ≥
243 \xF3 ≤ ≤ ≤
244 \xF4 ⌠ ⌠ ⌠
245 \xF5 ⌡ ⌡ ⌡
246 \xF6 ÷ ÷ ÷
247 \xF7 ≈ ≈ ≈
248 \xF8 ° ° °
249 \xF9 ∙ ∙ ∙
250 \xFA · · ·
251 \xFB √ √ √
252 \xFC ⁿ ⁿ ⁿ
253 \xFD ² ² ²
254 \xFE ■ ■ ■
255 \xFF \xFF \xFF ?
-----------------------------------------------------------------------
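The rows of this table can be regenerated with a short Python sketch. It
assumes that the 'Terminal' font shows the OEM code page 437 glyph set, which
agrees with the characters listed for 128 to 254. The function name
byte_escapes is hypothetical; the \xHH, \dDDD and \oOOO forms are the TSE
escapes from the operator table.

```python
def byte_escapes(code: int) -> dict:
    """Return the escape forms and the code page 437 glyph for one byte."""
    if not 0 <= code <= 255:
        raise ValueError("code must be in the range 0..255")
    return {
        "hex": "\\x%02X" % code,   # \xHH, valid in both Perl and TSE
        "dec": "\\d%03d" % code,   # \dDDD, TSE only
        "oct": "\\o%03o" % code,   # \oOOO, TSE notation for octal
        # Assumption: the Terminal font corresponds to code page 437.
        "glyph": bytes([code]).decode("cp437"),
    }


for code in (65, 128, 251):
    row = byte_escapes(code)
    print("%03d %s %s %s %s" % (code, row["hex"], row["dec"], row["oct"], row["glyph"]))
```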
===
-----------------------------------------------------------------------
TABLE: ASCII and HEXADECIMAL: NAME
-----------------------------------------------------------------------
| ASCII | HEXADECIMAL| PERL | TSE | NAME |
-----------------------------------------------------------------------
000 \x00 \x00 \x00 null, idle
001 \x01 \x01 \x01 start of heading
002 \x02 \x02 \x02 start of text
003 \x03 \x03 \x03 end of text
004 \x04 \x04 \x04 end of transmission
005 \x05 \x05 \x05 enquiry
006 \x06 \x06 \x06 acknowledge
007 \x07 \a \a bell, beep, alert
008 \x08 \x08 \b backspace
009 \x09 \t \t horizontal tab
010 \x0A \n \n new line, linefeed
011 \x0B \x0B \v vertical tab, home
012 \x0C \f \f form feed
013 \x0D \r \r carriage return
014 \x0E \x0E \x0E shift out
015 \x0F \x0F \x0F shift in
016 \x10 \x10 \x10 data link escape
017 \x11 \x11 \x11 device control 1 (xon)
018 \x12 \x12 \x12 device control 2
019 \x13 \x13 \x13 device control 3 (xoff)
020 \x14 \x14 \x14 device control 4
021 \x15 \x15 \x15 negative acknowledge
022 \x16 \x16 \x16 synchronous idle
023 \x17 \x17 \x17 end transmission block
024 \x18 \x18 \x18 cancel
025 \x19 \x19 \x19 end of medium
026 \x1A \x1A \x1A substitute
027 \x1B \e \x1B escape sequence
028 \x1C \x1C \x1C cursor right
029 \x1D \x1D \x1D cursor left
030 \x1E \x1E \x1E cursor up
031 \x1F \x1F \x1F cursor down
032 \x20 \x20 \x20 space
033 \x21 ! ! exclamation mark
034 \x22 " " double quotes
035 \x23 # # hash
036 \x24 $ $ dollar sign
037 \x25 % % percentage sign
038 \x26 & & ampersand
039 \x27 ' ' apostrophe
040 \x28 ( ( left parenthesis
041 \x29 ) ) right parenthesis
042 \x2A * * asterisk
043 \x2B + + plus sign
044 \x2C , , comma
045 \x2D - - minus sign
046 \x2E . . period
047 \x2F / / forward slash
048 \x30 0 0 zero
049 \x31 1 1 one
050 \x32 2 2 two
051 \x33 3 3 three
052 \x34 4 4 four
053 \x35 5 5 five
054 \x36 6 6 six
055 \x37 7 7 seven
056 \x38 8 8 eight
057 \x39 9 9 nine
058 \x3A : : colon
059 \x3B ; ; semicolon
060 \x3C < < less than sign
061 \x3D = = equal sign
062 \x3E > > greater than sign
063 \x3F ? ? question mark
064 \x40 @ @ at sign
065 \x41 A A A
066 \x42 B B B
067 \x43 C C C
068 \x44 D D D
069 \x45 E E E
070 \x46 F F F
071 \x47 G G G
072 \x48 H H H
073 \x49 I I I
074 \x4A J J J
075 \x4B K K K
076 \x4C L L L
077 \x4D M M M
078 \x4E N N N
079 \x4F O O O
080 \x50 P P P
081 \x51 Q Q Q
082 \x52 R R R
083 \x53 S S S
084 \x54 T T T
085 \x55 U U U
086 \x56 V V V
087 \x57 W W W
088 \x58 X X X
089 \x59 Y Y Y
090 \x5A Z Z Z
091 \x5B [ [ left square bracket
092 \x5C \ \ back slash
093 \x5D ] ] right square bracket
094 \x5E ^ ^ caret
095 \x5F _ _ underscore
096 \x60 ` ` left apostrophe
097 \x61 a a a
098 \x62 b b b
099 \x63 c c c
100 \x64 d d d
101 \x65 e e e
102 \x66 f f f
103 \x67 g g g
104 \x68 h h h
105 \x69 i i i
106 \x6A j j j
107 \x6B k k k
108 \x6C l l l
109 \x6D m m m
110 \x6E n n n
111 \x6F o o o
112 \x70 p p p
113 \x71 q q q
114 \x72 r r r
115 \x73 s s s
116 \x74 t t t
117 \x75 u u u
118 \x76 v v v
119 \x77 w w w
120 \x78 x x x
121 \x79 y y y
122 \x7A z z z
123 \x7B { { left curly bracket
124 \x7C | | vertical line
125 \x7D } } right curly bracket
126 \x7E ~ ~ tilde
127 \x7F \x7F \x7F del
128 \x80 Ç Ç upper case C cedilla
129 \x81 ü ü lower case u umlaut
130 \x82 é é lower case e acute
131 \x83 â â lower case a hat
132 \x84 ä ä lower case a umlaut
133 \x85 à à lower case a grave
134 \x86 å å lower case a ring
135 \x87 ç ç lower case c cedilla
136 \x88 ê ê lower case e hat
137 \x89 ë ë lower case e umlaut
138 \x8A è è lower case e grave
139 \x8B ï ï lower case i umlaut
140 \x8C î î lower case i hat
141 \x8D ì ì lower case i grave
142 \x8E Ä Ä upper case A umlaut
143 \x8F Å Å upper case A ring
144 \x90 É É upper case E acute
145 \x91 æ æ lower case ae digraph
146 \x92 Æ Æ upper case AE digraph
147 \x93 ô ô lower case o hat
148 \x94 ö ö lower case o umlaut
149 \x95 ò ò lower case o grave
150 \x96 û û lower case u hat
151 \x97 ù ù lower case u grave
152 \x98 ÿ ÿ lower case y umlaut
153 \x99 Ö Ö upper case O umlaut
154 \x9A Ü Ü upper case U umlaut
155 \x9B ¢ ¢ cent sign
156 \x9C £ £ pound sign
157 \x9D ¥ ¥ yen sign
158 \x9E ₧ ₧ peseta sign
159 \x9F ƒ ƒ florin sign
160 \xA0 á á lower case a acute
161 \xA1 í í lower case i acute
162 \xA2 ó ó lower case o acute
163 \xA3 ú ú lower case u acute
164 \xA4 ñ ñ lower case n tilde
165 \xA5 Ñ Ñ upper case N tilde
166 \xA6 ª ª feminine ordinal sign
167 \xA7 º º masculine ordinal sign
168 \xA8 ¿ ¿ inverted question mark
169 \xA9 ⌐ ⌐ reversed not sign
170 \xAA ¬ ¬ logical not sign
171 \xAB ½ ½ one half sign
172 \xAC ¼ ¼ one fourth sign
173 \xAD ¡ ¡ inverted exclamation mark
174 \xAE « « open french quotation mark
175 \xAF » » close french quotation mark
176 \xB0 ░ ░ graphblock invert
177 \xB1 ▒ ▒ graphblock invert
178 \xB2 ▓ ▓ graphblock invert
179 \xB3 │ │ graphsingle vertical
180 \xB4 ┤ ┤ graphsingle east
181 \xB5 ╡ ╡ graphdouble top east
182 \xB6 ╢ ╢ graphdouble side east
183 \xB7 ╖ ╖ graphdouble side north east
184 \xB8 ╕ ╕ graphdouble top north east
185 \xB9 ╣ ╣ graphdouble both east
186 \xBA ║ ║ graphdouble both vertical
187 \xBB ╗ ╗ graphdouble both north east
188 \xBC ╝ ╝ graphdouble both south east
189 \xBD ╜ ╜ graphdouble side south east
190 \xBE ╛ ╛ graphdouble top south east
191 \xBF ┐ ┐ graphsingle north east
192 \xC0 └ └ graphsingle south west
193 \xC1 ┴ ┴ graphsingle south
194 \xC2 ┬ ┬ graphsingle north
195 \xC3 ├ ├ graphsingle west
196 \xC4 ─ ─ graphsingle horizontal
197 \xC5 ┼ ┼ graphsingle middle
198 \xC6 ╞ ╞ graphdouble top west
199 \xC7 ╟ ╟ graphdouble side west
200 \xC8 ╚ ╚ graphdouble both south west
201 \xC9 ╔ ╔ graphdouble both north west
202 \xCA ╩ ╩ graphdouble both south
203 \xCB ╦ ╦ graphdouble both north
204 \xCC ╠ ╠ graphdouble both west
205 \xCD ═ ═ graphdouble both horizontal
206 \xCE ╬ ╬ graphdouble both middle
207 \xCF ╧ ╧ graphdouble top south
208 \xD0 ╨ ╨ graphdouble side south
209 \xD1 ╤ ╤ graphdouble top north
210 \xD2 ╥ ╥ graphdouble side north
211 \xD3 ╙ ╙ graphdouble side south west
212 \xD4 ╘ ╘ graphdouble top south west
213 \xD5 ╒ ╒ graphdouble top north west
214 \xD6 ╓ ╓ graphdouble side north west
215 \xD7 ╫ ╫ graphdouble side middle
216 \xD8 ╪ ╪ graphdouble top middle
217 \xD9 ┘ ┘ graphsingle south east
218 \xDA ┌ ┌ graphsingle north west
219 \xDB █ █ graphblock middle
220 \xDC ▄ ▄ graphblock horizontal
221 \xDD ▌ ▌ graphblock vertical
222 \xDE ▐ ▐ graphblock north west
223 \xDF ▀ ▀ graphblock south east
224 \xE0 α α lower case alpha
225 \xE1 ß ß lower case beta
226 \xE2 Γ Γ upper case GAMMA
227 \xE3 π π lower case pi
228 \xE4 Σ Σ upper case SIGMA
229 \xE5 σ σ lower case sigma
230 \xE6 µ µ lower case mu
231 \xE7 τ τ lower case tau
232 \xE8 Φ Φ upper case PHI
233 \xE9 Θ Θ upper case THETA
234 \xEA Ω Ω upper case OMEGA
235 \xEB δ δ lower case delta
236 \xEC ∞ ∞ infinity sign
237 \xED φ φ lower case phi
238 \xEE ε ε lower case epsilon
239 \xEF ∩ ∩ intersection
240 \xF0 ≡ ≡ mathematical identity sign
241 \xF1 ± ± plus minus sign
242 \xF2 ≥ ≥ greater than or equal sign
243 \xF3 ≤ ≤ less than or equal sign
244 \xF4 ⌠ ⌠ upper integral sign
245 \xF5 ⌡ ⌡ lower integral sign
246 \xF6 ÷ ÷ division sign
247 \xF7 ≈ ≈ approximately equal sign
248 \xF8 ° ° degree sign
249 \xF9 ∙ ∙ bullet operator
250 \xFA · · decimal point
251 \xFB √ √ square root sign
252 \xFC ⁿ ⁿ superscript n sign
253 \xFD ² ² superscript 2 sign
254 \xFE ■ ■ bullet
255 \xFF \xFF \xFF ?
-----------------------------------------------------------------------
===
-----------------------------------------------------------------------
TABLE: ASCII and HEXADECIMAL: CLASS: RANGE
-----------------------------------------------------------------------
ASCII|HEX| C| CLASS |
-----------------------------------------------------------------------
000 \x00 - control,ascii,unicode
001 \x01 - control,ascii,unicode
002 \x02 - control,ascii,unicode
003 \x03 - control,ascii,unicode
004 \x04 - control,ascii,unicode
005 \x05 - control,ascii,unicode
006 \x06 - control,ascii,unicode
007 \x07 - control,ascii,unicode
008 \x08 - control,ascii,unicode
009 \x09 - control,whitespace,ascii,unicode
010 \x0A - control,ascii,unicode
011 \x0B - control,whitespace,ascii,unicode
012 \x0C - control,whitespace,ascii,unicode
013 \x0D - control,ascii,unicode
014 \x0E - control,ascii,unicode
015 \x0F - control,ascii,unicode
016 \x10 - control,ascii,unicode
017 \x11 - control,ascii,unicode
018 \x12 - control,ascii,unicode
019 \x13 - control,ascii,unicode
020 \x14 - control,ascii,unicode
021 \x15 - control,ascii,unicode
022 \x16 - control,ascii,unicode
023 \x17 - control,ascii,unicode
024 \x18 - control,ascii,unicode
025 \x19 - control,ascii,unicode
026 \x1A - control,ascii,unicode
027 \x1B - control,ascii,unicode
028 \x1C - control,ascii,unicode
029 \x1D - control,ascii,unicode
030 \x1E - control,ascii,unicode
031 \x1F - control,ascii,unicode
032 \x20 - whitespace,print+space,ascii,unicode
033 \x21 ! punctuation,print,print+space,ascii,unicode
034 \x22 " punctuation,print,print+space,ascii,unicode
035 \x23 # punctuation,print,print+space,ascii,unicode
036 \x24 $ punctuation,print,print+space,ascii,unicode
037 \x25 % punctuation,print,print+space,ascii,unicode
038 \x26 & punctuation,print,print+space,ascii,unicode
039 \x27 ' punctuation,print,print+space,ascii,unicode
040 \x28 ( punctuation,print,print+space,ascii,unicode
041 \x29 ) punctuation,print,print+space,ascii,unicode
042 \x2A * punctuation,print,print+space,ascii,unicode
043 \x2B + punctuation,print,print+space,ascii,unicode
044 \x2C , punctuation,print,print+space,ascii,unicode
045 \x2D - punctuation,print,print+space,ascii,unicode
046 \x2E . punctuation,print,print+space,ascii,unicode
047 \x2F / punctuation,print,print+space,ascii,unicode
048 \x30 0 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
049 \x31 1 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
050 \x32 2 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
051 \x33 3 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
052 \x34 4 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
053 \x35 5 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
054 \x36 6 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
055 \x37 7 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
056 \x38 8 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
057 \x39 9 digit,hex,alphan,alphan+_,print,print+space,ascii,unicode
058 \x3A : punctuation,print,print+space,ascii,unicode
059 \x3B ; punctuation,print,print+space,ascii,unicode
060 \x3C < punctuation,print,print+space,ascii,unicode
061 \x3D = punctuation,print,print+space,ascii,unicode
062 \x3E > punctuation,print,print+space,ascii,unicode
063 \x3F ? punctuation,print,print+space,ascii,unicode
064 \x40 @ punctuation,print,print+space,ascii,unicode
065 \x41 A upper,hex,alphan,alphan+_,print,print+space,ascii,unicode
066 \x42 B upper,hex,alphan,alphan+_,print,print+space,ascii,unicode
067 \x43 C upper,hex,alphan,alphan+_,print,print+space,ascii,unicode
068 \x44 D upper,hex,alphan,alphan+_,print,print+space,ascii,unicode
069 \x45 E upper,hex,alphan,alphan+_,print,print+space,ascii,unicode
070 \x46 F upper,hex,alphan,alphan+_,print,print+space,ascii,unicode
071 \x47 G upper,alphan,alphan+_,print,print+space,ascii,unicode
072 \x48 H upper,alphan,alphan+_,print,print+space,ascii,unicode
073 \x49 I upper,alphan,alphan+_,print,print+space,ascii,unicode
074 \x4A J upper,alphan,alphan+_,print,print+space,ascii,unicode
075 \x4B K upper,alphan,alphan+_,print,print+space,ascii,unicode
076 \x4C L upper,alphan,alphan+_,print,print+space,ascii,unicode
077 \x4D M upper,alphan,alphan+_,print,print+space,ascii,unicode
078 \x4E N upper,alphan,alphan+_,print,print+space,ascii,unicode
079 \x4F O upper,alphan,alphan+_,print,print+space,ascii,unicode
080 \x50 P upper,alphan,alphan+_,print,print+space,ascii,unicode
081 \x51 Q upper,alphan,alphan+_,print,print+space,ascii,unicode
082 \x52 R upper,alphan,alphan+_,print,print+space,ascii,unicode
083 \x53 S upper,alphan,alphan+_,print,print+space,ascii,unicode
084 \x54 T upper,alphan,alphan+_,print,print+space,ascii,unicode
085 \x55 U upper,alphan,alphan+_,print,print+space,ascii,unicode
086 \x56 V upper,alphan,alphan+_,print,print+space,ascii,unicode
087 \x57 W upper,alphan,alphan+_,print,print+space,ascii,unicode
088 \x58 X upper,alphan,alphan+_,print,print+space,ascii,unicode
089 \x59 Y upper,alphan,alphan+_,print,print+space,ascii,unicode
090 \x5A Z upper,alphan,alphan+_,print,print+space,ascii,unicode
091 \x5B [ punctuation,print,print+space,ascii,unicode
092 \x5C \ punctuation,print,print+space,ascii,unicode
093 \x5D ] punctuation,print,print+space,ascii,unicode
094 \x5E ^ punctuation,print,print+space,ascii,unicode
095 \x5F _ punctuation,alphan+_,print,print+space,ascii,unicode
096 \x60 ` punctuation,print,print+space,ascii,unicode
097 \x61 a lower,hex,alphan,alphan+_,print,print+space,ascii,unicode
098 \x62 b lower,hex,alphan,alphan+_,print,print+space,ascii,unicode
099 \x63 c lower,hex,alphan,alphan+_,print,print+space,ascii,unicode
100 \x64 d lower,hex,alphan,alphan+_,print,print+space,ascii,unicode
101 \x65 e lower,hex,alphan,alphan+_,print,print+space,ascii,unicode
102 \x66 f lower,hex,alphan,alphan+_,print,print+space,ascii,unicode
103 \x67 g lower,alphan,alphan+_,print,print+space,ascii,unicode
104 \x68 h lower,alphan,alphan+_,print,print+space,ascii,unicode
105 \x69 i lower,alphan,alphan+_,print,print+space,ascii,unicode
106 \x6A j lower,alphan,alphan+_,print,print+space,ascii,unicode
107 \x6B k lower,alphan,alphan+_,print,print+space,ascii,unicode
108 \x6C l lower,alphan,alphan+_,print,print+space,ascii,unicode
109 \x6D m lower,alphan,alphan+_,print,print+space,ascii,unicode
110 \x6E n lower,alphan,alphan+_,print,print+space,ascii,unicode
111 \x6F o lower,alphan,alphan+_,print,print+space,ascii,unicode
112 \x70 p lower,alphan,alphan+_,print,print+space,ascii,unicode
113 \x71 q lower,alphan,alphan+_,print,print+space,ascii,unicode
114 \x72 r lower,alphan,alphan+_,print,print+space,ascii,unicode
115 \x73 s lower,alphan,alphan+_,print,print+space,ascii,unicode
116 \x74 t lower,alphan,alphan+_,print,print+space,ascii,unicode
117 \x75 u lower,alphan,alphan+_,print,print+space,ascii,unicode
118 \x76 v lower,alphan,alphan+_,print,print+space,ascii,unicode
119 \x77 w lower,alphan,alphan+_,print,print+space,ascii,unicode
120 \x78 x lower,alphan,alphan+_,print,print+space,ascii,unicode
121 \x79 y lower,alphan,alphan+_,print,print+space,ascii,unicode
122 \x7A z lower,alphan,alphan+_,print,print+space,ascii,unicode
123 \x7B { punctuation,print,print+space,ascii,unicode
124 \x7C | punctuation,print,print+space,ascii,unicode
125 \x7D } punctuation,print,print+space,ascii,unicode
126 \x7E ~ punctuation,print,print+space,ascii,unicode
127 \x7F - control,ascii,unicode
128 \x80 Ç ascii,unicode
129 \x81 ü ascii,unicode
130 \x82 é ascii,unicode
131 \x83 â ascii,unicode
132 \x84 ä ascii,unicode
133 \x85 à ascii,unicode
134 \x86 å ascii,unicode
135 \x87 ç ascii,unicode
136 \x88 ê ascii,unicode
137 \x89 ë ascii,unicode
138 \x8A è ascii,unicode
139 \x8B ï ascii,unicode
140 \x8C î ascii,unicode
141 \x8D ì ascii,unicode
142 \x8E Ä ascii,unicode
143 \x8F Å ascii,unicode
144 \x90 É ascii,unicode
145 \x91 æ ascii,unicode
146 \x92 Æ ascii,unicode
147 \x93 ô ascii,unicode
148 \x94 ö ascii,unicode
149 \x95 ò ascii,unicode
150 \x96 û ascii,unicode
151 \x97 ù ascii,unicode
152 \x98 ÿ ascii,unicode
153 \x99 Ö ascii,unicode
154 \x9A Ü ascii,unicode
155 \x9B ¢ ascii,unicode
156 \x9C £ ascii,unicode
157 \x9D ¥ ascii,unicode
158 \x9E ₧ ascii,unicode
159 \x9F ƒ ascii,unicode
160 \xA0 á ascii,unicode
161 \xA1 í ascii,unicode
162 \xA2 ó ascii,unicode
163 \xA3 ú ascii,unicode
164 \xA4 ñ ascii,unicode
165 \xA5 Ñ ascii,unicode
166 \xA6 ª ascii,unicode
167 \xA7 º ascii,unicode
168 \xA8 ¿ ascii,unicode
169 \xA9 ⌐ ascii,unicode
170 \xAA ¬ ascii,unicode
171 \xAB ½ ascii,unicode
172 \xAC ¼ ascii,unicode
173 \xAD ¡ ascii,unicode
174 \xAE « ascii,unicode
175 \xAF » ascii,unicode
176 \xB0 ░ ascii,unicode
177 \xB1 ▒ ascii,unicode
178 \xB2 ▓ ascii,unicode
179 \xB3 │ ascii,unicode
180 \xB4 ┤ ascii,unicode
181 \xB5 ╡ ascii,unicode
182 \xB6 ╢ ascii,unicode
183 \xB7 ╖ ascii,unicode
184 \xB8 ╕ ascii,unicode
185 \xB9 ╣ ascii,unicode
186 \xBA ║ ascii,unicode
187 \xBB ╗ ascii,unicode
188 \xBC ╝ ascii,unicode
189 \xBD ╜ ascii,unicode
190 \xBE ╛ ascii,unicode
191 \xBF ┐ ascii,unicode
192 \xC0 └ ascii,unicode
193 \xC1 ┴ ascii,unicode
194 \xC2 ┬ ascii,unicode
195 \xC3 ├ ascii,unicode
196 \xC4 ─ ascii,unicode
197 \xC5 ┼ ascii,unicode
198 \xC6 ╞ ascii,unicode
199 \xC7 ╟ ascii,unicode
200 \xC8 ╚ ascii,unicode
201 \xC9 ╔ ascii,unicode
202 \xCA ╩ ascii,unicode
203 \xCB ╦ ascii,unicode
204 \xCC ╠ ascii,unicode
205 \xCD ═ ascii,unicode
206 \xCE ╬ ascii,unicode
207 \xCF ╧ ascii,unicode
208 \xD0 ╨ ascii,unicode
209 \xD1 ╤ ascii,unicode
210 \xD2 ╥ ascii,unicode
211 \xD3 ╙ ascii,unicode
212 \xD4 ╘ ascii,unicode
213 \xD5 ╒ ascii,unicode
214 \xD6 ╓ ascii,unicode
215 \xD7 ╫ ascii,unicode
216 \xD8 ╪ ascii,unicode
217 \xD9 ┘ ascii,unicode
218 \xDA ┌ ascii,unicode
219 \xDB █ ascii,unicode
220 \xDC ▄ ascii,unicode
221 \xDD ▌ ascii,unicode
222 \xDE ▐ ascii,unicode
223 \xDF ▀ ascii,unicode
224 \xE0 α ascii,unicode
225 \xE1 ß ascii,unicode
226 \xE2 Γ ascii,unicode
227 \xE3 π ascii,unicode
228 \xE4 Σ ascii,unicode
229 \xE5 σ ascii,unicode
230 \xE6 µ ascii,unicode
231 \xE7 τ ascii,unicode
232 \xE8 Φ ascii,unicode
233 \xE9 Θ ascii,unicode
234 \xEA Ω ascii,unicode
235 \xEB δ ascii,unicode
236 \xEC ∞ ascii,unicode
237 \xED φ ascii,unicode
238 \xEE ε ascii,unicode
239 \xEF ∩ ascii,unicode
240 \xF0 ≡ ascii,unicode
241 \xF1 ± ascii,unicode
242 \xF2 ≥ ascii,unicode
243 \xF3 ≤ ascii,unicode
244 \xF4 ⌠ ascii,unicode
245 \xF5 ⌡ ascii,unicode
246 \xF6 ÷ ascii,unicode
247 \xF7 ≈ ascii,unicode
248 \xF8 ° ascii,unicode
249 \xF9 ∙ ascii,unicode
250 \xFA · ascii,unicode
251 \xFB √ ascii,unicode
252 \xFC ⁿ ascii,unicode
253 \xFD ² ascii,unicode
254 \xFE ■ ascii,unicode
255 \xFF \xFF ascii,unicode
-----------------------------------------------------------------------
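The CLASS column can be cross-checked against the bracket expressions of the
RANGE table with a short Python sketch. The names CLASSES and classes_of are
hypothetical, the patterns use Python's re syntax (which accepts these
Perl-style classes), and "control" is taken as the standard ASCII control
range \x00-\x1F plus \x7F.

```python
import re

# Class name -> bracket expression, following the RANGE table.
CLASSES = {
    "control": r"[\x00-\x1F\x7F]",
    "whitespace": r"[ \f\t\v]",
    "punctuation": r"[!-/:-@\[-`{-~]",
    "digit": r"[0-9]",
    "hex": r"[0-9A-Fa-f]",
    "upper": r"[A-Z]",
    "lower": r"[a-z]",
    "alphan": r"[A-Za-z0-9]",
    "alphan+_": r"[A-Za-z0-9_]",
    "print": r"[!-~]",
    "print+space": r"[ -~]",
}


def classes_of(code: int) -> list:
    """List the names of all classes that contain the given character code."""
    ch = chr(code)
    return [name for name, pat in CLASSES.items() if re.fullmatch(pat, ch)]


for code in (0x09, 0x41, 0x5F, 0x66):
    print("%03d \\x%02X %s" % (code, code, ",".join(classes_of(code))))
```

Comparing this output with the table rows is a quick way to spot a character
that was assigned to the wrong class.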
===
Internet: see also:
---
TSE: Search/Replace: Regular expression: Link: Can you give overview
links regular expressions?
http://www.faqts.com/knowledge_base/view.phtml/aid/31433/fid/865
----------------------------------------------------------------------