├── LICENSE
├── README.md
├── Syntax.rules.txt
├── _config.yml
├── bnf2html.perl.txt
├── bnf2html.pl
├── bnf2yacc.perl.txt
├── bnf2yacc.pl
├── index.html
├── outer-joins.html
├── sql-2003-1.bnf
├── sql-2003-1.bnf.html
├── sql-2003-2.bnf
├── sql-2003-2.bnf.html
├── sql-2003-2.ebnf
├── sql-2003-2.ebnf.readme
├── sql-2003-core-features.html
├── sql-2003-noncore-features.html
├── sql-2016.ebnf
├── sql-2016.ebnf.readme
├── sql-92.bnf
├── sql-92.bnf.html
├── sql-99.bnf
├── sql-99.bnf.html
├── sql-bnf.mk
└── webcode-1.09.tgz


/LICENSE:
--------------------------------------------------------------------------------
 1 | MIT License
 2 | 
 3 | Copyright (c) 2017 Ron Savage
 4 | 
 5 | Permission is hereby granted, free of charge, to any person obtaining a copy
 6 | of this software and associated documentation files (the "Software"), to deal
 7 | in the Software without restriction, including without limitation the rights
 8 | to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
 9 | copies of the Software, and to permit persons to whom the Software is
10 | furnished to do so, subject to the following conditions:
11 | 
12 | The above copyright notice and this permission notice shall be included in all
13 | copies or substantial portions of the Software.
14 | 
15 | THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
16 | IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
17 | FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
18 | AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
19 | LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
20 | OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
21 | SOFTWARE.
22 | 


--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
  1 | # BNF Grammars for SQL-92, SQL-99 and SQL-2003
  2 | 
  3 | This repository contains the BNF (Backus-Naur Form) grammars for three versions of standard SQL — SQL-92, SQL-99 and SQL-2003.
  4 | 
  5 | You should be able to find a version of this site with 'active HTML' at:
  6 | 
  7 | * https://ronsavage.github.io/SQL/
  8 | 
  9 | It may not be the most recent release, but the technical content is mostly valid.
 10 | The download link is not functional — you can obtain the material for the latest
 11 | release from https://github.com/ronsavage/SQL/releases/latest.
 12 | 
 13 | ** !! Syntax Rules
 14 | 
 15 | Regarding the text '!! See the Syntax Rules': That is literally what it says in the PDF
 16 | containing the standard.
 17 | 
 18 | For an extract of the standard about these rules see the file 'Syntax.rules.txt'.
 19 | 
 20 | *This project is still in transition to GitHub.
 21 | The links in this README.md file lead to the pages in the GitHub source tree.
 22 | Most of them will display the HTML source — not a rendered HTML image.
 23 | There probably are ways around that; we're learning GitHub as we go.*
 24 | 
 25 | For a long time, this material was hosted by Ron Savage at
 26 | [http://savage.net.au/SQL](http://savage.net.au) — many thanks, Ron! —
 27 | but that site now points to here.
 28 | 
 29 | At the moment, the suggested method of operation is:
 30 | 
 31 | * Clone this repository to your machine — e.g. into the `/home/somebody/SQL` directory
 32 | * Point your browser to `file:///home/somebody/SQL/index.html`.
 33 | 
 34 | This should give you full HTML access to the material.
 35 | Alternatively, you can download the latest release of this material
 36 | (instead of cloning the repo), and then extract that into a directory
 37 | and point your browser to the `index.html` file in that directory.
 38 | 
 39 | Yes: it is sub-optimal.
 40 | Yes: we'll fix it when we know how to fix it.
 41 | 
 42 | ## SQL-92
 43 | 
 44 | The file [`sql-92.bnf.html`](sql-92.bnf.html) is a heavily hyperlinked HTML
 45 | version of the BNF grammar for SQL-92 (ISO/IEC 9075:1992 - Database Language -
 46 | SQL).
 47 | 
 48 | The plain text file [`sql-92.bnf`](sql-92.bnf), from which it was
 49 | automatically converted, is more useful (read legible) for reading
 50 | without a browser.
 51 | 
 52 | ## SQL-99
 53 | 
 54 | The file [`sql-99.bnf.html`](sql-99.bnf.html) is a heavily hyperlinked HTML
 55 | version of the BNF grammar for SQL-99 (ISO/IEC 9075-2:1999 - Database
 56 | Languages - SQL - Part 2: Foundation (SQL/Foundation)).
 57 | 
 58 | The plain text file [`sql-99.bnf`](sql-99.bnf), from which it was
 59 | automatically converted, is more useful (read legible) for reading
 60 | without a browser.
 61 | 
 62 | ## SQL-2003
 63 | 
 64 | The file [`sql-2003-2.bnf.html`](sql-2003-2.bnf.html) is a heavily hyperlinked HTML
 65 | version of the BNF grammar for SQL-2003 (ISO/IEC 9075-2:2003 - Database
 66 | Languages - SQL - Part 2: Foundation (SQL/Foundation)).
 67 | 
 68 | The plain text file [`sql-2003-2.bnf`](sql-2003-2.bnf), from which it was
 69 | automatically converted, is more useful (read legible) for reading
 70 | without a browser.
 71 | 
 72 | 
 73 | There is a separate file [`sql-2003-1.bnf.html`](sql-2003-1.bnf.html) for
 74 | the information from ISO/IEC 9075-1:2003 - Database Languages - SQL - Part
 75 | 1: Framework (SQL/Framework).
 76 | 
 77 | It was automatically converted from the plain text file [`sql-2003-1.bnf`](sql-2003-1.bnf),
 78 | which is more useful (read legible) for reading without a browser.
 79 | 
 80 | 
 81 | Also available:
 82 | <bl>
 83 | <li> <a href="sql-2003-core-features.html"> SQL 2003 Core Features </a> </li>
 84 | <li> <a href="sql-2003-noncore-features.html"> SQL 2003 Non-Core Features </a> </li>
 85 | </bl>
 86 | 
 87 | ## Informix OUTER Join Syntax
 88 | 
 89 | The file [`outer-joins.html`](outer-joins.html) is an explanation of the
 90 | non-standard Informix OUTER join syntax and semantics.
 91 | 
 92 | ## Conversion tools
 93 | 
 94 | 
 95 | The plain text was converted to HTML by the Perl script
 96 | [`bnf2html`](bnf2html.perl.txt) which you may use if you wish.
 97 | The `bnf2html` script also uses the C program
 98 | WEBCODE version 1.09
 99 | which you can download as a [gzipped tar file](webcode-1.09.tgz).
100 | 
101 | See also [`bnf2yacc`](bnf2yacc.perl.txt), an experimental
102 | script to convert BNF into an outline Yacc grammar.
103 | The generated grammar typically includes some unacceptable tokens, such
104 | as _`%token 0`_, that should be handled by the lexical analyzer
105 | rather than the grammar.
106 | The SQL standard includes such rules as grammar rules; consequently, you won't
107 | get a clean Yacc grammar from the SQL BNF files.
108 | 
109 | _(The Perl scripts should normally be renamed after downloading.)_
110 | 
111 | ## Download
112 | 
113 | You should be able to get the downloadable version of the latest release of this
114 | repository from the releases area:
115 | 
116 | * https://github.com/ronsavage/SQL/releases/latest
117 | 
118 | ## SQL 2016 Released
119 | 
120 | [ISO/IEC JTC 1/SC 32 Publishes Updated SQL Database Language Standard](https://www.ansi.org/news_publications/news_story?menuid=7&articleid=753a952d-1244-415b-bb92-0010750bb8cd) — SQL 2016.
121 | 
122 | 
123 | <hr>
124 | Please send feedback to Jonathan Leffler
125 | (<a href="mailto:jonathan.leffler@gmail.com"> jonathan.leffler@gmail.com </a>) _and_
126 | Ron Savage (<a href="mailto:ron@savage.net.au"> ron@savage.net.au </a>).
127 | 
128 | Last modified:
129 | 13th March 2017
130 | 


--------------------------------------------------------------------------------
/Syntax.rules.txt:
--------------------------------------------------------------------------------
  1 | That (!! See the Syntax Rules) is literally what it says in the PDF
  2 | containing the standard. And the Syntax Rules are one part of the verbiage
  3 | in the standard supporting the grammar — specifying what it means. The
  4 | first such place where it occurs is:
  5 | 
  6 | <space> <#xref-space> ::= !! See the Syntax Rules.
  7 | 
  8 | And if I go to the full pDF, I find 5.1 <SQL terminal character> says:
  9 | 
 10 | Information technology — Database languages — SQL — Part 2: Foundation
 11 | (SQL/Foundation)
 12 | 
 13 | Syntax Rules
 14 | 
 15 | 1) Every character set shall contain a <space> character that is equivalent
 16 | to U+0020.
 17 | 
 18 | Access Rules
 19 | 
 20 | None.
 21 | 
 22 | General Rules
 23 | 
 24 | 1) There is a one-to-one correspondence between the symbols contained in
 25 | <simple Latin upper case letter> and the symbols contained in <simple Latin
 26 | lower case letter> such that, for all i, the symbol defined as the i-th
 27 | alternative for <simple Latin upper case letter> corresponds to the symbol
 28 | defined as the i-th alternative for <simple Latin lower case letter>.
 29 | 
 30 | Conformance Rules
 31 | 
 32 | None.
 33 | And, in this case, that's all it has to say. Each section of the standard
 34 | has sub-headings 'Function', 'Format' (containing the BNF), 'Syntax Rules',
 35 | 'Access Rules', 'General Rules' (usually the biggest section), and
 36 | 'Conformance Rules'. The next pair of occurrences are:
 37 | 
 38 | <identifier part> ::=
 39 | 
 40 | <identifier start>
 41 | 
 42 | | <identifier extend>
 43 | 
 44 | <identifier start> ::= !! See the Syntax Rules
 45 | 
 46 | <identifier extend> ::= !! See the Syntax Rules
 47 | 
 48 | 
 49 | 
 50 | That's pulled from the PDF, not the HTML. This time, we find:
 51 | 
 52 | Syntax Rules
 53 | 
 54 | 1) An <identifier start> is any character in the Unicode General Category
 55 | classes "Lu", "Ll", "Lt", "Lm",
 56 | 
 57 | "Lo", or "Nl".
 58 | 
 59 | NOTE 58 — The Unicode General Category classes "Lu", "Ll", "Lt", "Lm",
 60 | "Lo", and "Nl" are assigned to Unicode characters
 61 | 
 62 | that are, respectively, upper-case letters, lower-case letters, title-case
 63 | letters, modifier letters, other letters, and letter numbers.
 64 | 
 65 | 2) An <identifier extend> is U+00B7, "Middle Dot", or any character in the
 66 | Unicode General Category classes
 67 | 
 68 | "Mn", "Mc", "Nd", "Pc", or "Cf".
 69 | 
 70 | NOTE 59 — The Unicode General Category classes "Mn", "Mc", "Nd", "Pc", and
 71 | "Cf" are assigned to Unicode characters that
 72 | 
 73 | are, respectively, nonspacing marks, spacing combining marks, decimal
 74 | numbers, connector punctuations, and formatting codes.
 75 | 
 76 | Very detailed specification stuff — but not something you can easily put
 77 | into the BNF. It belongs in the lexical analyzer, probably.
 78 | 
 79 | Another example — not to do with characters this time:
 80 | <preparable implementation-defined statement> <#xref-preparable
 81 | implementation-defined statement> ::= !! See the Syntax Rules.
 82 | 
 83 | Here the further information is:
 84 | 
 85 | 3) The Format and Syntax Rules for <preparable implementation-defined
 86 | statement> are implementation-defined.
 87 | 
 88 | And another pair of them:
 89 | 
 90 | <SQLSTATE class value> <#xref-SQLSTATE class value> ::= <SQLSTATE char>
 91 | <#SQLSTATE char> <SQLSTATE char> <#SQLSTATE char> !! See the Syntax Rules.
 92 | 
 93 | <SQLSTATE subclass value> <#xref-SQLSTATE subclass value> ::= <SQLSTATE
 94 | char> <#SQLSTATE char> <SQLSTATE char> <#SQLSTATE char> <SQLSTATE char>
 95 | <#SQLSTATE char> !! See the Syntax Rules.
 96 | The Syntax Rules say:
 97 | 
 98 | 3) In the values of <SQLSTATE class value> and <SQLSTATE subclass value>,
 99 | there shall be no <separator>
100 | 
101 | between the <SQLSTATE char>s.
102 | 
103 | 4) The values of <SQLSTATE class value> and <SQLSTATE subclass value> shall
104 | correspond to class values
105 | 
106 | and subclass values, respectively, specified in Table 32, "SQLSTATE class
107 | and subclass values".
108 | 
109 | Expanding on this last example, here is the copy'n'paste of the Syntax
110 | Rules through the end of the section:
111 | 
112 | Syntax Rules
113 | 
114 | 1) SQLWARNING, NOT FOUND, and SQLEXCEPTION correspond to SQLSTATE class
115 | values corresponding
116 | 
117 | to categories W, N, and X in Table 32, "SQLSTATE class and subclass values",
118 | respectively.
119 | 
120 | ©ISO/IEC 2003 – All rights reserved Embedded SQL 1001
121 | 
122 | ISO/IEC 9075-2:2003 (E)
123 | 
124 | 20.2 <embedded exception declaration>
125 | 
126 | 2) An <embedded exception declaration> contained in an <embedded SQL host
127 | program> applies to an <SQL
128 | 
129 | procedure statement> contained in that <embedded SQL host program> if and
130 | only if the <SQL procedure
131 | 
132 | statement> appears after the <embedded exception declaration> that has
133 | condition C in the text sequence
134 | 
135 | of the <embedded SQL host program> and no other <embedded exception
136 | declaration> E that satisfies one
137 | 
138 | of the following conditions appears between the <embedded exception
139 | declaration> and the <SQL procedure
140 | 
141 | statement> in the text sequence of the <embedded SQL host program>.
142 | 
143 | Let D be the <condition> contained in E.
144 | 
145 | a) D is the same as C.
146 | 
147 | b) D is a <major category> and belongs to the same class to which C belongs.
148 | 
149 | c) D contains an <SQLSTATE class value>, but does not contain an <SQLSTATE
150 | subclass value>, and
151 | 
152 | E contains the same <SQLSTATE class value> that C contains.
153 | 
154 | d) D contains the <SQLSTATE class value> that corresponds to integrity
155 | constraint violation and C
156 | 
157 | contains CONSTRAINT.
158 | 
159 | 3) In the values of <SQLSTATE class value> and <SQLSTATE subclass value>,
160 | there shall be no <separator>
161 | 
162 | between the <SQLSTATE char>s.
163 | 
164 | 4) The values of <SQLSTATE class value> and <SQLSTATE subclass value> shall
165 | correspond to class values
166 | 
167 | and subclass values, respectively, specified in Table 32, "SQLSTATE class
168 | and subclass values".
169 | 
170 | 5) If an <embedded exception declaration> specifies a <go to>, then the
171 | <host label identifier>, <host PL/I
172 | 
173 | label variable>, or <unsigned integer> of the <go to> shall be such that a
174 | host language GO TO statement
175 | 
176 | specifying that <host label identifier>, <host PL/I label variable>, or
177 | <unsigned integer> is valid at every
178 | 
179 | <SQL procedure statement> to which the <embedded exception declaration>
180 | applies.
181 | 
182 | NOTE 445 —
183 | 
184 | If an <embedded exception declaration> is contained in an <embedded SQL Ada
185 | program>, then the <goto target> of a <go
186 | 
187 | to> should specify a <host label identifier> that is a label_name in the
188 | containing <embedded SQL Ada program>.
189 | 
190 | If an <embedded exception declaration> is contained in an <embedded SQL C
191 | program>, then the <goto target> of a <go to>
192 | 
193 | should specify a <host label identifier> that is a label in the containing
194 | <embedded SQL C program>.
195 | 
196 | If an <embedded exception declaration> is contained in an <embedded SQL
197 | COBOL program>, then the <goto target> of a
198 | 
199 | <go to> should specify a <host label identifier> that is a section-name or
200 | an unqualified paragraph-name in the containing
201 | 
202 | <embedded SQL COBOL program>.
203 | 
204 | If an <embedded exception declaration> is contained in an <embedded SQL
205 | Fortran program>, then the <goto target> of a <go
206 | 
207 | to> should be an <unsigned integer> that is the statement label of an
208 | executable statement that appears in the same program
209 | 
210 | unit as the <go to>.
211 | 
212 | If an <embedded exception declaration> is contained in an <embedded SQL
213 | MUMPS program>, then the <goto target> of a
214 | 
215 | <go to> should be a gotoargument that is the statement label of an
216 | executable statement that appears in the same <embedded
217 | 
218 | SQL MUMPS program>.
219 | 
220 | If an <embedded exception declaration> is contained in an <embedded SQL
221 | Pascal program>, then the <goto target> of a <go
222 | 
223 | to> should be an <unsigned integer> that is a label.
224 | 
225 | If an <embedded exception declaration> is contained in an <embedded SQL
226 | PL/I program>, then the <goto target> of a <go
227 | 
228 | to> should specify either a <host label identifier> or a <host PL/I label
229 | variable>.
230 | 
231 | Case:
232 | 
233 | — If <host label identifier> is specified, then the <host label identifier>
234 | should be a label constant in the containing
235 | 
236 | <embedded SQL PL/I program>.
237 | 
238 | ISO/IEC 9075-2:2003 (E)
239 | 
240 | 20.2 <embedded exception declaration>
241 | 
242 | 1002 Foundation (SQL/Foundation) ©ISO/IEC 2003 – All rights reserved
243 | 
244 | — If <host PL/I label variable> is specified, then the <host PL/I label
245 | variable> should be a PL/I label variable declared in
246 | 
247 | the containing <embedded SQL PL/I program>.
248 | 
249 | Access Rules
250 | 
251 | None.
252 | 
253 | General Rules
254 | 
255 | 1) Immediately after the execution of an <SQL procedure statement> STMT in
256 | an <embedded SQL host program>
257 | 
258 | that returns an SQLSTATE value other than successful completion:
259 | 
260 | a) Let E be the set of <embedded exception declaration>s that are contained
261 | in the <embedded SQL host
262 | 
263 | program> containing STMT, that applies to STMT, and that specifies a
264 | <condition action> that is <go
265 | 
266 | to>.
267 | 
268 | b) Let CV and SCV be respectively the values of the class and subclass of
269 | the SQLSTATE value that
270 | 
271 | indicates the result of the <SQL procedure statement>.
272 | 
273 | c) If the execution of the <SQL procedure statement> caused the violation
274 | of one or more constraints or
275 | 
276 | assertions, then:
277 | 
278 | i) Let ECN be the set of <embedded exception declaration>s in E that
279 | specify CONSTRAINT and
280 | 
281 | the <constraint name> of a constraint that was violated by execution of
282 | STMT.
283 | 
284 | ii) If ECN contains more than one <embedded exception declaration>, then an
285 | implementationdependent
286 | 
287 | <embedded exception declaration> is chosen from ECN; otherwise, the single
288 | 
289 | <embedded exception declaration> in ECN is chosen.
290 | 
291 | iii) A GO TO statement of the host language is performed, specifying the
292 | <host label identifier>,
293 | 
294 | <host PL/I label variable>, or <unsigned integer> of the <go to> specified
295 | in the <embedded
296 | 
297 | exception declaration> chosen from ECN.
298 | 
299 | d) Otherwise:
300 | 
301 | i) Let ECS be the set of <embedded exception declaration>s in E that
302 | specify SQLSTATE, an
303 | 
304 | <SQLSTATE class value>, and an <SQLSTATE subclass value>.
305 | 
306 | ii) If ECS contains an <embedded exception declaration> EY that specifies
307 | an <SQLSTATE class
308 | 
309 | value> identical to CV and an <SQLSTATE subclass value> identical to SCV,
310 | then a GO TO
311 | 
312 | statement of the host language is performed, specifying the <host label
313 | identifier>, <host PL/I
314 | 
315 | label variable>, or <unsigned integer> of the <go to> specified in the
316 | <embedded exception
317 | 
318 | declaration> EY.
319 | 
320 | iii) Otherwise:
321 | 
322 | 1) Let EC be the set of <embedded exception declaration>s in E that specify
323 | SQLSTATE and
324 | 
325 | an <SQLSTATE class value> without an <SQLSTATE subclass value>.
326 | 
327 | 2) If EC contains an <embedded exception declaration> EY that specifies an
328 | <SQLSTATE
329 | 
330 | class value> identical to CV, then a GO TO statement of the host language
331 | is performed,
332 | 
333 | ©ISO/IEC 2003 – All rights reserved Embedded SQL 1003
334 | 
335 | ISO/IEC 9075-2:2003 (E)
336 | 
337 | 20.2 <embedded exception declaration>
338 | 
339 | specifying the <host label identifier>, <host PL/I label variable>, or
340 | <unsigned integer> of
341 | 
342 | the <go to> specified in the <embedded exception declaration> EY.
343 | 
344 | 3) Otherwise:
345 | 
346 | A) Let EX be the set of <embedded exception declaration>s in E that specify
347 | SQLEXCEPTION.
348 | 
349 | B) If EX contains an <embedded exception declaration> EY and CV belongs to
350 | Category
351 | 
352 | X in Table 32, "SQLSTATE class and subclass values", then a GO TO statement
353 | of the
354 | 
355 | host language is performed, specifying the <host label identifier>, <host
356 | PL/I label
357 | 
358 | variable>, or <unsigned integer> of the <go to> specified in the <embedded
359 | exception
360 | 
361 | declaration> EY.
362 | 
363 | C) Otherwise:
364 | 
365 | I) Let EW be the set of <embedded exception declaration>s in E that specify
366 | SQLWARNING.
367 | 
368 | II) If EW contains an <embedded exception declaration> EY and CV belongs to
369 | 
370 | Category W in Table 32, "SQLSTATE class and subclass values", then a GO
371 | 
372 | TO statement of the host language is performed, specifying the <host label
373 | 
374 | identifier>, <host PL/I label variable>, or <unsigned integer> of the <go
375 | to>
376 | 
377 | specified in the <embedded exception declaration> EY.
378 | 
379 | III) Otherwise, let ENF be the set of <embedded exception declaration>s in
380 | E that
381 | 
382 | specify NOT FOUND. If ENF contains an <embedded exception declaration>
383 | 
384 | EY and CV belongs to Category N in Table 32, "SQLSTATE class and subclass
385 | 
386 | values", then a GO TO statement of the host language is performed,
387 | specifying
388 | 
389 | the <host label identifier>, <host PL/I label variable>, or <unsigned
390 | integer> of
391 | 
392 | the <go to> specified in the <embedded exception declaration> EY.
393 | 
394 | Conformance Rules
395 | 
396 | 1) Without Feature B041, "Extensions to embedded SQL exception
397 | declarations", conforming SQL language
398 | 
399 | shall not contain an <SQL condition> that contains either SQLSTATE or
400 | CONSTRAINT.
401 | 
402 | 2) Without Feature F491, "Constraint management", conforming SQL language
403 | shall not contain an <SQL
404 | 
405 | condition> that contains a <constraint name>.
406 | 
407 | ISO/IEC 9075-2:2003 (E)
408 | 
409 | 20.2 <embedded exception declaration>
410 | 
411 | 1004
412 | 


--------------------------------------------------------------------------------
/_config.yml:
--------------------------------------------------------------------------------
1 | theme: jekyll-theme-merlot


--------------------------------------------------------------------------------
/bnf2html.perl.txt:
--------------------------------------------------------------------------------
  1 | #!/usr/bin/env perl
  2 | #
  3 | # @(#)$Id: bnf2html.pl,v 3.16 2017/11/14 06:53:22 jleffler Exp $
  4 | #
  5 | # Convert SQL-92, SQL-99 BNF plain text file into hyperlinked HTML.
  6 | 
  7 | use strict;
  8 | use warnings;
  9 | use POSIX qw(strftime);
 10 | #use Data::Dumper;
 11 | 
 12 | use constant debug => 0;
 13 | 
 14 | my(%rules);     # Indexed by rule names w/o angle-brackets; each entry is a ref to a hash.
 15 | my(%keywords);  # Index by keywords; each entry is a ref to a hash.
 16 | my(%names);     # Indexed by rule names w/o angle-brackets; each entry is a ref to an array of line numbers
 17 | 
 18 | sub top
 19 | {
 20 | print "<p><a href='#top'>Top</a></p>\n\n";
 21 | }
 22 | 
 23 | # Usage: add_rule_name(\%names, $rulename, $.);
 24 | sub add_rule_name
 25 | {
 26 |     my($reflist, $lhs, $line) = @_;
 27 |     #print "\nrulename = $lhs; line = $line\n";
 28 |     if (defined ${$reflist}{$lhs})
 29 |     {
 30 |         #print Data::Dumper->Dump([ ${$reflist}{$lhs} ], qw[ ${$reflist}{$lhs} ]);
 31 |         #print Data::Dumper->Dump([ \@{${$reflist}{$lhs}} ], qw[ \@{${$reflist}{$lhs}} ]);
 32 |         my @lines = @{${$reflist}{$lhs}};
 33 |         print STDERR "\n$0: Rule <$lhs> at line $line already seen at line(s) ", join(", ", @lines), "\n\n";
 34 |     }
 35 |     else
 36 |     {
 37 |         ${$reflist}{$lhs} = [];
 38 |     }
 39 |     push @{${$reflist}{$lhs}}, $line;
 40 | }
 41 | 
 42 | # Usage: add_entry(\%keywords, $keyword, $rule);
 43 | # Usage: add_entry(\%rules, $rhs, $rule);
 44 | sub add_entry
 45 | {
 46 |     my($reflist, $lhs, $rhs) = @_;
 47 |     ${$reflist}{$lhs} = {} unless defined ${$reflist}{$lhs};
 48 |     ${$reflist}{$lhs}{$rhs} = 1;
 49 | }
 50 | 
 51 | sub add_refs
 52 | {
 53 |     my($def, $tail) = @_;
 54 |     print "\n<!-- ADD REFS ($def) ($tail) -->\n" if debug;
 55 |     return if $tail =~ m/^!!/;
 56 |     return if $tail =~ m/^&(?:lt|gt|amp);$/;
 57 |     while ($tail)
 58 |     {
 59 |         $tail =~ s/^\s*//;
 60 |         if ($tail =~ m%^\&lt;([-:/\w\s]+)\&gt;%)
 61 |         {
 62 |             print "<!-- Rule - LHS: $def - RHS $1 -->\n" if debug;
 63 |             add_entry(\%rules, $1, $def);
 64 |             $tail =~ s%^\&lt;([-:/\w\s]+)\&gt;%%;
 65 |         }
 66 |         elsif ($tail =~ m%^([-:/\w]+)%)
 67 |         {
 68 |             my($token) = $1;
 69 |             print "<!-- KyWd - LHS: $def - RHS $token -->\n" if debug;
 70 |             add_entry(\%keywords, $token, $def) if $token =~ m%[[:alpha:]][[:alpha:]]% || $token eq 'C';
 71 |             $tail =~ s%^[-:/\w]+%%;
 72 |         }
 73 |         else
 74 |         {
 75 |             # Otherwise, it is punctuation (such as the BNF metacharacters).
 76 |             $tail =~ s%^[^-:/\w]%%;
 77 |         }
 78 |     }
 79 | }
 80 | 
 81 | # NB: webcode replaces tabs with blanks!
 82 | open( my $WEBCODE, "-|", "webcode @ARGV") or die "$!";
 83 | 
 84 | # Read first line of file - use as title in head and in H1 heading in body
 85 | $_ = <$WEBCODE>;
 86 | exit 0 unless defined($_);
 87 | chomp;
 88 | 
 89 | # Is it wicked to use double quoting with single quotes, as in qq'text'?
 90 | # It is used quite extensively in this script - beware!
 91 | print qq'<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">\n';
 92 | print "<!-- Generated HTML - Modify at your own peril! -->\n";
 93 | print "<html>\n<head>\n";
 94 | print "<title> $_ </title>\n</head>\n<body>\n\n";
 95 | print "<h1> $_ </h1>\n\n";
 96 | print qq'<a name="top">&nbsp;</a>\n';
 97 | 
 98 | print "<br>\n";
 99 | print qq'<a href="#xref-rules"> Cross-Reference: rules </a>\n';
100 | print "<br>\n";
101 | print qq'<a href="#xref-keywords"> Cross-Reference: keywords </a>\n';
102 | print "<br>\n";
103 | 
104 | sub rcs_id
105 | {
106 |     my($id) = @_;
107 |     $id =~ s%^(@\(#\))?\$[I]d: %%o;
108 |     $id =~ s% \$$%%o;
109 |     $id =~ s%,v % %o;
110 |     $id =~ s%\w+ Exp( \w+)?$%%o;
111 |     my(@words) = split / /, $id;
112 |     my($version) = "file $words[0] version $words[1] dated $words[2] $words[3]";
113 |     return $version;
114 | }
115 | 
116 | sub iso8601_format
117 | {
118 |     my($tm) = @_;
119 |     my $today = strftime("%Y-%m-%d %H:%M:%S+00:00", gmtime($tm));
120 |     return($today);
121 | }
122 | 
123 | # Print hrefs for non-terminals and keywords.
124 | # Also substitute /* Nothing */ for an absence of productions between alternatives.
125 | sub print_tail
126 | {
127 |     my($tail, $tcount) = @_;
128 |     while ($tail)
129 |     {
130 |         my($newtail);
131 |         if ($tail =~ m%^\s+%)
132 |         {
133 |             my($spaces) = $&;
134 |             $newtail = $';
135 |             print "<!-- print_tail: SPACES = '$spaces', NEWTAIL = '$newtail' -->\n" if debug;
136 |             $spaces =~ s% {4,8}%&nbsp;&nbsp;&nbsp;&nbsp;%g;
137 |             print $spaces;
138 |             # Spaces are not a token - don't count them!
139 |         }
140 |         elsif ($tail =~ m%^'[^']*'% || $tail =~ m%^"[^"]*"% || $tail =~ m%^!!.*$%)
141 |         {
142 |             # Quoted literal - print and ignore.
143 |             # Or meta-expression...
144 |             my($quote) = $&;
145 |             $newtail = $';
146 |             print "<!-- print_tail: QUOTE = <$quote>, NEWTAIL = '$newtail' -->\n" if debug;
147 |             $quote =~ s%!!.*%<font color="red"> $quote </font>%;
148 |             print $quote;
149 |             $tcount++;
150 |         }
151 |         elsif ($tail =~ m%^\&lt;([-:/\w\s]+)\&gt;%)
152 |         {
153 |             my($nonterm) = $&;
154 |             $newtail = $';
155 |             print "<!-- print_tail: NONTERM = '$nonterm', NEWTAIL = '$newtail' -->\n" if debug;
156 |             $nonterm =~ s%\&lt;([-:/\w\s]+)\&gt;%<a href='#$1'>\&lt;$1\&gt;</a>%;
157 |             print " $nonterm";
158 |             $tcount++;
159 |         }
160 |         elsif ($tail =~ m%^[\w_]([-._\w]*[\w_])?%)
161 |         {
162 |             # Keyword
163 |             my($keyword) = $&;
164 |             $newtail = $';
165 |             print "<!-- print_tail: KEYWORD = '$keyword', NEWTAIL = '$newtail' -->\n" if debug;
166 |             print(($keyword =~ m/^\d\d+$/) ? $keyword : qq' <a href="#xref-$keyword"> $keyword </a>');
167 |             $tcount++;
168 |         }
169 |         else
170 |         {
171 |             # Metacharacter, string literal, etc.
172 |             $tail =~ m%\S+%;
173 |             my($symbol) = $&;
174 |             $newtail = $';
175 |             print "<!-- print_tail: SYMBOL = '$symbol', NEWTAIL = '$newtail' -->\n" if debug;
176 |             if ($symbol eq '|')
177 |             {
178 |                 print "<font color=red>/* Nothing */</font> " if $tcount == 0;
179 |                 $tcount = 0;
180 |             }
181 |             else
182 |             {
183 |                 $symbol =~ s%...omitted...%<font color=red>/* $& */</font>%i;
184 |                 $tcount++;
185 |             }
186 |             print " $symbol";
187 |         }
188 |         $tail = $newtail;
189 |     }
190 |     return($tcount);
191 | }
192 | 
193 | sub undo_web_coding
194 | {
195 |     my($line) = @_;
196 |     $line =~ s%&gt;%>%g;
197 |     $line =~ s%&lt;%<%g;
198 |     $line =~ s%&amp;%&%g;
199 |     return $line;
200 | }
201 | 
202 | my $hr_count = 0;
203 | my $tcount = 0;                 # Ick!
204 | my $def;                        # Current rule
205 | 
206 | # Don't forget - the input has been web-encoded!
207 | 
208 | while (<$WEBCODE>)
209 | {
210 |     chomp;
211 |     next if /^===*$/o;
212 |     s/\s+$//o;  # Remove trailing white space
213 |     if (/^$/)
214 |     {
215 |         print "\n";
216 |     }
217 |     elsif (/^---*$/)
218 |     {
219 |         print "<hr>\n";
220 |     }
221 |     elsif (/^--@@\s*(.*)$/)
222 |     {
223 |         my $comment = undo_web_coding($1);
224 |         print "<!-- $comment -->\n";
225 |     }
226 |     elsif (/^@.#..Id:/)
227 |     {
228 |         # Convert what(1) string identifier into version information
229 |         my $id = '$Id: bnf2html.pl,v 3.16 2017/11/14 06:53:22 jleffler Exp $';
230 |         my($v1) = rcs_id($_);
231 |         my $v2 = rcs_id($id);
232 |         print "<p><font color=green><i><small>\n";
233 |         print "Derived from $v1\n";
234 |         my $today = iso8601_format(time);
235 |         print "<br>\n";
236 |         print "Generated on $today by $v2\n";
237 |         print "</small></i></font></p>\n";
238 |     }
239 |     elsif (/\s+::=/)
240 |     {
241 |         # Definition line
242 |         $def = $_;
243 |         $def =~ s%\&lt;([-:/()\w\s]+)\&gt;.*%$1%;
244 |         my($tail) = $_;
245 |         $tail =~ s%.*::=\s*%%;
246 |         print qq'<p><a href="#xref-$def" name="$def"> &lt;$def&gt; </a>&nbsp;&nbsp;&nbsp;::=';
247 |         $tcount = 0;
248 |         add_rule_name(\%names, $def, $.);
249 |         if ($def eq "vertical bar")
250 |         {
251 |             # Needs special case attention to avoid a /* Nothing */ comment appearing.
252 |             # Problem pointed out by Jens Odborg (jho1965us@gmail.com) 2016-04-14.
253 |             # This builds knowledge of the SQL language definition into this script;
254 |             # ugly, but trying to fix it in the print_tail function is probably worse.
255 |             print "&nbsp;&nbsp;|";
256 |         }
257 |         elsif ($tail)
258 |         {
259 |             add_refs($def, $tail);
260 |             print "&nbsp;&nbsp;";
261 |             $tcount = print_tail($tail, $tcount);
262 |         }
263 |         print "\n";
264 |     }
265 |     elsif (/^\s/)
266 |     {
267 |         # Expansion line
268 |         add_refs($def, $_);
269 |         print "<br>";
270 |         $tcount = print_tail($_, $tcount);
271 |     }
272 |     elsif (m/^--[\/]?(\w+)/)
273 |     {
274 |         # Pseudo-directive line in lower-case
275 |         # Print a 'Top' link before <hr> tags except first.
276 |         top if /--hr/ && $hr_count++ > 0;
277 |         s%--(/?[a-z][a-z\d]*)%<$1>%;
278 |         s%\&lt;([-:/\w\s]+)\&gt;%<a href='#$1'>\&lt;$1\&gt;</a>%g;
279 |         print "$_\n";
280 |     }
281 |     elsif (m%^--##%)
282 |     {
283 |         $_ = undo_web_coding($_);
284 |         s%^--##\s*%%;
285 |         print "$_\n";
286 |     }
287 |     elsif (m/^--%start\s+(\w+)/)
288 |     {
289 |         # Designated start symbol
290 |         my $start = $1;
291 |         print qq'<p><b>Start symbol: </b> <a href="#$start"> $start </a></p>\n';
292 |     }
293 |     else
294 |     {
295 |         # Anything unrecognized passed through unchanged!
296 |         print "$_\n";
297 |     }
298 | }
299 | 
300 | close $WEBCODE;
301 | 
302 | # Print index of initial letters for keywords.
303 | sub print_index_key
304 | {
305 |     my($prefix, @keys) = @_;
306 |     my %letters = ();
307 |     foreach my $keyword (@keys)
308 |     {
309 |         my $initial = uc substr $keyword, 0, 1;
310 |         $letters{$initial} = 1;
311 |     }
312 |     foreach my $letter ('A' .. 'Z')
313 |     {
314 |         if (defined($letters{$letter}))
315 |         {
316 |             print qq'<a href="#$prefix-$letter"> $letter </a>\n';
317 |         }
318 |         else
319 |         {
320 |             print qq'$letter\n';
321 |         }
322 |     }
323 |     print "\n";
324 | }
325 | 
326 | ### Generate cross-reference tables
327 | 
328 | {
329 | print "<br>\n\n";
330 | print "<hr>\n";
331 | print qq'<a name="xref-rules"></a>\n';
332 | print "<h2> Cross-Reference Table: Rules </h2>\n";
333 | 
334 | print_index_key("rules", keys %rules);
335 | 
336 | print "<table border=1>\n";
337 | print "<tr> <th> Rule (non-terminal) </th> <th> Rules using it </th> </tr>\n";
338 | my %letters = ();
339 | 
340 | foreach my $rule (sort { uc $a cmp uc $b } keys %rules)
341 | {
342 |     my $initial = uc substr $rule, 0, 1;
343 |     my $label = "";
344 |     if (!defined($letters{$initial}))
345 |     {
346 |         $letters{$initial} = 1;
347 |         $label = qq'<a name="rules-$initial"> </a>';
348 |     }
349 |     print qq'<tr> <td> $label <a href="#$rule" name="xref-$rule"> $rule </a> </td>\n     <td> ';
350 |     my $pad = "";
351 |     foreach my $ref (sort { uc $a cmp uc $b } keys %{$rules{$rule}})
352 |     {
353 |         print qq'$pad<a href="#$ref"> &lt;$ref&gt; </a>\n';
354 |         $pad = "          ";
355 |     }
356 |     print "     </td>\n</tr>\n";
357 | }
358 | print "</table>\n";
359 | print "<br>\n";
360 | top;
361 | }
362 | 
363 | {
364 | print "<hr>\n";
365 | print qq'<a name="xref-keywords"></a>\n';
366 | print "<h2> Cross-Reference Table: Keywords </h2>\n";
367 | 
368 | print_index_key("keywords", keys %keywords);
369 | 
370 | print "<table border=1>\n";
371 | print "<tr> <th> Keyword </th> <th> Rules using it </th> </tr>\n";
372 | my %letters = ();
373 | foreach my $keyword (sort { uc $a cmp uc $b } keys %keywords)
374 | {
375 |     my $initial = uc substr $keyword, 0, 1;
376 |     my $label = "";
377 |     if (!defined($letters{$initial}))
378 |     {
379 |         $letters{$initial} = 1;
380 |         $label = qq'<a name="keywords-$initial"> </a>';
381 |     }
382 |     print qq'<tr> <td> $label <a name="xref-$keyword"> </a> $keyword </td>\n     <td> ';
383 |     my $pad = "";
384 |     foreach my $ref (sort { uc $a cmp uc $b } keys %{$keywords{$keyword}})
385 |     {
386 |         print qq'$pad<a href="#$ref"> &lt;$ref&gt; </a>\n';
387 |         $pad = "          ";
388 |     }
389 |     print "     </td>\n</tr>\n";
390 | }
391 | print "</table>\n";
392 | print "<br>\n";
393 | top;
394 | print "<hr>\n";
395 | }
396 | 
397 | printf "%s\n", q'Please send feedback to Jonathan Leffler:';
398 | printf "%s\n", q'<a href="mailto:jonathan.leffler@gmail.com"> jonathan.leffler@gmail.com </a>.';
399 | 
400 | print "\n</body>\n</html>\n";
401 | 
402 | __END__
403 | 
404 | =pod
405 | 
406 | =head1 PROGRAM
407 | 
408 | bnf2html - Convert (ISO SQL) BNF Notation to Hyperlinked HTML
409 | 
410 | =head1 SYNTAX
411 | 
412 | bnf2html [file ...]
413 | 
414 | =head1 DESCRIPTION
415 | 
416 | The bnf2html filters the annotated BNF (Backus-Naur Form) from its input
417 | files and converts it into HTML on standard output.
418 | 
419 | The HTML is heavily hyperlinked.
420 | Each rule (LHS) links to a table of other rules where it is used on the
421 | RHS.
422 | Similarly, each symbol on the RHS is linked to the rule that defines it.
423 | Thus, it is possible to find where items are used and defined quite
424 | easily.
425 | 
426 | =head1 INPUT FORMAT
427 | 
428 | This script is adapted to the BNF notation using in the SQL standard
429 | (ISO/IEC 9075:2003, for example).
430 | It also takes various forms of annotations.
431 | 
432 | The first line of the file is used as the title in the head section.
433 | It is also used as the text for a H1 header at the top of the body.
434 | 
435 | Lines consisting of two or more equal signs are ignored.
436 | 
437 | Lines consisting of two or more dashes are converted to a horizontal
438 | rule.
439 | 
440 | Lines starting with the SCCS identification string '@(#)' are used to
441 | print version information about the file converted and the script doing
442 | the converting.
443 | 
444 | Lines containing space, colon, colon, equals are treated as rules.
445 | 
446 | Lines starting with white space are treated as continuations of a rule.
447 | 
448 | Lines starting dash, dash, (optionally a slash) and then one or more tag
449 | letters are converted into an HTML start or end tag.
450 | 
451 | Any line starting dash, dash, hash, hash has any HTML entities
452 | introduced by the WEBCODE program removed.
453 | 
454 | The should be at most one line starting '--%start'; this indicates the
455 | start symbol for the bnf2yacc converter, but is effectively ignored by
456 | bnf2html.
457 | 
458 | Any other line is passed through verbatim.
459 | 
460 | =head1 AUTHOR
461 | 
462 | Jonathan Leffler <jonathan.leffler@gmail.com>
463 | 
464 | =cut
465 | 


--------------------------------------------------------------------------------
/bnf2html.pl:
--------------------------------------------------------------------------------
  1 | #!/usr/bin/env perl
  2 | #
  3 | # @(#)$Id: bnf2html.pl,v 3.16 2017/11/14 06:53:22 jleffler Exp $
  4 | #
  5 | # Convert SQL-92, SQL-99 BNF plain text file into hyperlinked HTML.
  6 | 
  7 | use strict;
  8 | use warnings;
  9 | use POSIX qw(strftime);
 10 | #use Data::Dumper;
 11 | 
 12 | use constant debug => 0;
 13 | 
 14 | my(%rules);     # Indexed by rule names w/o angle-brackets; each entry is a ref to a hash.
 15 | my(%keywords);  # Index by keywords; each entry is a ref to a hash.
 16 | my(%names);     # Indexed by rule names w/o angle-brackets; each entry is a ref to an array of line numbers
 17 | 
 18 | sub top
 19 | {
 20 | print "<p><a href='#top'>Top</a></p>\n\n";
 21 | }
 22 | 
 23 | # Usage: add_rule_name(\%names, $rulename, $.);
 24 | sub add_rule_name
 25 | {
 26 |     my($reflist, $lhs, $line) = @_;
 27 |     #print "\nrulename = $lhs; line = $line\n";
 28 |     if (defined ${$reflist}{$lhs})
 29 |     {
 30 |         #print Data::Dumper->Dump([ ${$reflist}{$lhs} ], qw[ ${$reflist}{$lhs} ]);
 31 |         #print Data::Dumper->Dump([ \@{${$reflist}{$lhs}} ], qw[ \@{${$reflist}{$lhs}} ]);
 32 |         my @lines = @{${$reflist}{$lhs}};
 33 |         print STDERR "\n$0: Rule <$lhs> at line $line already seen at line(s) ", join(", ", @lines), "\n\n";
 34 |     }
 35 |     else
 36 |     {
 37 |         ${$reflist}{$lhs} = [];
 38 |     }
 39 |     push @{${$reflist}{$lhs}}, $line;
 40 | }
 41 | 
 42 | # Usage: add_entry(\%keywords, $keyword, $rule);
 43 | # Usage: add_entry(\%rules, $rhs, $rule);
 44 | sub add_entry
 45 | {
 46 |     my($reflist, $lhs, $rhs) = @_;
 47 |     ${$reflist}{$lhs} = {} unless defined ${$reflist}{$lhs};
 48 |     ${$reflist}{$lhs}{$rhs} = 1;
 49 | }
 50 | 
 51 | sub add_refs
 52 | {
 53 |     my($def, $tail) = @_;
 54 |     print "\n<!-- ADD REFS ($def) ($tail) -->\n" if debug;
 55 |     return if $tail =~ m/^!!/;
 56 |     return if $tail =~ m/^&(?:lt|gt|amp);$/;
 57 |     while ($tail)
 58 |     {
 59 |         $tail =~ s/^\s*//;
 60 |         if ($tail =~ m%^\&lt;([-:/\w\s]+)\&gt;%)
 61 |         {
 62 |             print "<!-- Rule - LHS: $def - RHS $1 -->\n" if debug;
 63 |             add_entry(\%rules, $1, $def);
 64 |             $tail =~ s%^\&lt;([-:/\w\s]+)\&gt;%%;
 65 |         }
 66 |         elsif ($tail =~ m%^([-:/\w]+)%)
 67 |         {
 68 |             my($token) = $1;
 69 |             print "<!-- KyWd - LHS: $def - RHS $token -->\n" if debug;
 70 |             add_entry(\%keywords, $token, $def) if $token =~ m%[[:alpha:]][[:alpha:]]% || $token eq 'C';
 71 |             $tail =~ s%^[-:/\w]+%%;
 72 |         }
 73 |         else
 74 |         {
 75 |             # Otherwise, it is punctuation (such as the BNF metacharacters).
 76 |             $tail =~ s%^[^-:/\w]%%;
 77 |         }
 78 |     }
 79 | }
 80 | 
 81 | # NB: webcode replaces tabs with blanks!
 82 | open( my $WEBCODE, "-|", "webcode @ARGV") or die "$!";
 83 | 
 84 | # Read first line of file - use as title in head and in H1 heading in body
 85 | $_ = <$WEBCODE>;
 86 | exit 0 unless defined($_);
 87 | chomp;
 88 | 
 89 | # Is it wicked to use double quoting with single quotes, as in qq'text'?
 90 | # It is used quite extensively in this script - beware!
 91 | print qq'<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">\n';
 92 | print "<!-- Generated HTML - Modify at your own peril! -->\n";
 93 | print "<html>\n<head>\n";
 94 | print "<title> $_ </title>\n</head>\n<body>\n\n";
 95 | print "<h1> $_ </h1>\n\n";
 96 | print qq'<a name="top">&nbsp;</a>\n';
 97 | 
 98 | print "<br>\n";
 99 | print qq'<a href="#xref-rules"> Cross-Reference: rules </a>\n';
100 | print "<br>\n";
101 | print qq'<a href="#xref-keywords"> Cross-Reference: keywords </a>\n';
102 | print "<br>\n";
103 | 
104 | sub rcs_id
105 | {
106 |     my($id) = @_;
107 |     $id =~ s%^(@\(#\))?\$[I]d: %%o;
108 |     $id =~ s% \$$%%o;
109 |     $id =~ s%,v % %o;
110 |     $id =~ s%\w+ Exp( \w+)?$%%o;
111 |     my(@words) = split / /, $id;
112 |     my($version) = "file $words[0] version $words[1] dated $words[2] $words[3]";
113 |     return $version;
114 | }
115 | 
116 | sub iso8601_format
117 | {
118 |     my($tm) = @_;
119 |     my $today = strftime("%Y-%m-%d %H:%M:%S+00:00", gmtime($tm));
120 |     return($today);
121 | }
122 | 
123 | # Print hrefs for non-terminals and keywords.
124 | # Also substitute /* Nothing */ for an absence of productions between alternatives.
125 | sub print_tail
126 | {
127 |     my($tail, $tcount) = @_;
128 |     while ($tail)
129 |     {
130 |         my($newtail);
131 |         if ($tail =~ m%^\s+%)
132 |         {
133 |             my($spaces) = $&;
134 |             $newtail = $';
135 |             print "<!-- print_tail: SPACES = '$spaces', NEWTAIL = '$newtail' -->\n" if debug;
136 |             $spaces =~ s% {4,8}%&nbsp;&nbsp;&nbsp;&nbsp;%g;
137 |             print $spaces;
138 |             # Spaces are not a token - don't count them!
139 |         }
140 |         elsif ($tail =~ m%^'[^']*'% || $tail =~ m%^"[^"]*"% || $tail =~ m%^!!.*$%)
141 |         {
142 |             # Quoted literal - print and ignore.
143 |             # Or meta-expression...
144 |             my($quote) = $&;
145 |             $newtail = $';
146 |             print "<!-- print_tail: QUOTE = <$quote>, NEWTAIL = '$newtail' -->\n" if debug;
147 |             $quote =~ s%!!.*%<font color="red"> $quote </font>%;
148 |             print $quote;
149 |             $tcount++;
150 |         }
151 |         elsif ($tail =~ m%^\&lt;([-:/\w\s]+)\&gt;%)
152 |         {
153 |             my($nonterm) = $&;
154 |             $newtail = $';
155 |             print "<!-- print_tail: NONTERM = '$nonterm', NEWTAIL = '$newtail' -->\n" if debug;
156 |             $nonterm =~ s%\&lt;([-:/\w\s]+)\&gt;%<a href='#$1'>\&lt;$1\&gt;</a>%;
157 |             print " $nonterm";
158 |             $tcount++;
159 |         }
160 |         elsif ($tail =~ m%^[\w_]([-._\w]*[\w_])?%)
161 |         {
162 |             # Keyword
163 |             my($keyword) = $&;
164 |             $newtail = $';
165 |             print "<!-- print_tail: KEYWORD = '$keyword', NEWTAIL = '$newtail' -->\n" if debug;
166 |             print(($keyword =~ m/^\d\d+$/) ? $keyword : qq' <a href="#xref-$keyword"> $keyword </a>');
167 |             $tcount++;
168 |         }
169 |         else
170 |         {
171 |             # Metacharacter, string literal, etc.
172 |             $tail =~ m%\S+%;
173 |             my($symbol) = $&;
174 |             $newtail = $';
175 |             print "<!-- print_tail: SYMBOL = '$symbol', NEWTAIL = '$newtail' -->\n" if debug;
176 |             if ($symbol eq '|')
177 |             {
178 |                 print "<font color=red>/* Nothing */</font> " if $tcount == 0;
179 |                 $tcount = 0;
180 |             }
181 |             else
182 |             {
183 |                 $symbol =~ s%...omitted...%<font color=red>/* $& */</font>%i;
184 |                 $tcount++;
185 |             }
186 |             print " $symbol";
187 |         }
188 |         $tail = $newtail;
189 |     }
190 |     return($tcount);
191 | }
192 | 
193 | sub undo_web_coding
194 | {
195 |     my($line) = @_;
196 |     $line =~ s%&gt;%>%g;
197 |     $line =~ s%&lt;%<%g;
198 |     $line =~ s%&amp;%&%g;
199 |     return $line;
200 | }
201 | 
202 | my $hr_count = 0;
203 | my $tcount = 0;                 # Ick!
204 | my $def;                        # Current rule
205 | 
206 | # Don't forget - the input has been web-encoded!
207 | 
208 | while (<$WEBCODE>)
209 | {
210 |     chomp;
211 |     next if /^===*$/o;
212 |     s/\s+$//o;  # Remove trailing white space
213 |     if (/^$/)
214 |     {
215 |         print "\n";
216 |     }
217 |     elsif (/^---*$/)
218 |     {
219 |         print "<hr>\n";
220 |     }
221 |     elsif (/^--@@\s*(.*)$/)
222 |     {
223 |         my $comment = undo_web_coding($1);
224 |         print "<!-- $comment -->\n";
225 |     }
226 |     elsif (/^@.#..Id:/)
227 |     {
228 |         # Convert what(1) string identifier into version information
229 |         my $id = '$Id: bnf2html.pl,v 3.16 2017/11/14 06:53:22 jleffler Exp $';
230 |         my($v1) = rcs_id($_);
231 |         my $v2 = rcs_id($id);
232 |         print "<p><font color=green><i><small>\n";
233 |         print "Derived from $v1\n";
234 |         my $today = iso8601_format(time);
235 |         print "<br>\n";
236 |         print "Generated on $today by $v2\n";
237 |         print "</small></i></font></p>\n";
238 |     }
239 |     elsif (/\s+::=/)
240 |     {
241 |         # Definition line
242 |         $def = $_;
243 |         $def =~ s%\&lt;([-:/()\w\s]+)\&gt;.*%$1%;
244 |         my($tail) = $_;
245 |         $tail =~ s%.*::=\s*%%;
246 |         print qq'<p><a href="#xref-$def" name="$def"> &lt;$def&gt; </a>&nbsp;&nbsp;&nbsp;::=';
247 |         $tcount = 0;
248 |         add_rule_name(\%names, $def, $.);
249 |         if ($def eq "vertical bar")
250 |         {
251 |             # Needs special case attention to avoid a /* Nothing */ comment appearing.
252 |             # Problem pointed out by Jens Odborg (jho1965us@gmail.com) 2016-04-14.
253 |             # This builds knowledge of the SQL language definition into this script;
254 |             # ugly, but trying to fix it in the print_tail function is probably worse.
255 |             print "&nbsp;&nbsp;|";
256 |         }
257 |         elsif ($tail)
258 |         {
259 |             add_refs($def, $tail);
260 |             print "&nbsp;&nbsp;";
261 |             $tcount = print_tail($tail, $tcount);
262 |         }
263 |         print "\n";
264 |     }
265 |     elsif (/^\s/)
266 |     {
267 |         # Expansion line
268 |         add_refs($def, $_);
269 |         print "<br>";
270 |         $tcount = print_tail($_, $tcount);
271 |     }
272 |     elsif (m/^--[\/]?(\w+)/)
273 |     {
274 |         # Pseudo-directive line in lower-case
275 |         # Print a 'Top' link before <hr> tags except first.
276 |         top if /--hr/ && $hr_count++ > 0;
277 |         s%--(/?[a-z][a-z\d]*)%<$1>%;
278 |         s%\&lt;([-:/\w\s]+)\&gt;%<a href='#$1'>\&lt;$1\&gt;</a>%g;
279 |         print "$_\n";
280 |     }
281 |     elsif (m%^--##%)
282 |     {
283 |         $_ = undo_web_coding($_);
284 |         s%^--##\s*%%;
285 |         print "$_\n";
286 |     }
287 |     elsif (m/^--%start\s+(\w+)/)
288 |     {
289 |         # Designated start symbol
290 |         my $start = $1;
291 |         print qq'<p><b>Start symbol: </b> <a href="#$start"> $start </a></p>\n';
292 |     }
293 |     else
294 |     {
295 |         # Anything unrecognized passed through unchanged!
296 |         print "$_\n";
297 |     }
298 | }
299 | 
300 | close $WEBCODE;
301 | 
302 | # Print index of initial letters for keywords.
303 | sub print_index_key
304 | {
305 |     my($prefix, @keys) = @_;
306 |     my %letters = ();
307 |     foreach my $keyword (@keys)
308 |     {
309 |         my $initial = uc substr $keyword, 0, 1;
310 |         $letters{$initial} = 1;
311 |     }
312 |     foreach my $letter ('A' .. 'Z')
313 |     {
314 |         if (defined($letters{$letter}))
315 |         {
316 |             print qq'<a href="#$prefix-$letter"> $letter </a>\n';
317 |         }
318 |         else
319 |         {
320 |             print qq'$letter\n';
321 |         }
322 |     }
323 |     print "\n";
324 | }
325 | 
326 | ### Generate cross-reference tables
327 | 
328 | {
329 | print "<br>\n\n";
330 | print "<hr>\n";
331 | print qq'<a name="xref-rules"></a>\n';
332 | print "<h2> Cross-Reference Table: Rules </h2>\n";
333 | 
334 | print_index_key("rules", keys %rules);
335 | 
336 | print "<table border=1>\n";
337 | print "<tr> <th> Rule (non-terminal) </th> <th> Rules using it </th> </tr>\n";
338 | my %letters = ();
339 | 
340 | foreach my $rule (sort { uc $a cmp uc $b } keys %rules)
341 | {
342 |     my $initial = uc substr $rule, 0, 1;
343 |     my $label = "";
344 |     if (!defined($letters{$initial}))
345 |     {
346 |         $letters{$initial} = 1;
347 |         $label = qq'<a name="rules-$initial"> </a>';
348 |     }
349 |     print qq'<tr> <td> $label <a href="#$rule" name="xref-$rule"> $rule </a> </td>\n     <td> ';
350 |     my $pad = "";
351 |     foreach my $ref (sort { uc $a cmp uc $b } keys %{$rules{$rule}})
352 |     {
353 |         print qq'$pad<a href="#$ref"> &lt;$ref&gt; </a>\n';
354 |         $pad = "          ";
355 |     }
356 |     print "     </td>\n</tr>\n";
357 | }
358 | print "</table>\n";
359 | print "<br>\n";
360 | top;
361 | }
362 | 
363 | {
364 | print "<hr>\n";
365 | print qq'<a name="xref-keywords"></a>\n';
366 | print "<h2> Cross-Reference Table: Keywords </h2>\n";
367 | 
368 | print_index_key("keywords", keys %keywords);
369 | 
370 | print "<table border=1>\n";
371 | print "<tr> <th> Keyword </th> <th> Rules using it </th> </tr>\n";
372 | my %letters = ();
373 | foreach my $keyword (sort { uc $a cmp uc $b } keys %keywords)
374 | {
375 |     my $initial = uc substr $keyword, 0, 1;
376 |     my $label = "";
377 |     if (!defined($letters{$initial}))
378 |     {
379 |         $letters{$initial} = 1;
380 |         $label = qq'<a name="keywords-$initial"> </a>';
381 |     }
382 |     print qq'<tr> <td> $label <a name="xref-$keyword"> </a> $keyword </td>\n     <td> ';
383 |     my $pad = "";
384 |     foreach my $ref (sort { uc $a cmp uc $b } keys %{$keywords{$keyword}})
385 |     {
386 |         print qq'$pad<a href="#$ref"> &lt;$ref&gt; </a>\n';
387 |         $pad = "          ";
388 |     }
389 |     print "     </td>\n</tr>\n";
390 | }
391 | print "</table>\n";
392 | print "<br>\n";
393 | top;
394 | print "<hr>\n";
395 | }
396 | 
397 | printf "%s\n", q'Please send feedback to Jonathan Leffler:';
398 | printf "%s\n", q'<a href="mailto:jonathan.leffler@gmail.com"> jonathan.leffler@gmail.com </a>.';
399 | 
400 | print "\n</body>\n</html>\n";
401 | 
402 | __END__
403 | 
404 | =pod
405 | 
406 | =head1 PROGRAM
407 | 
408 | bnf2html - Convert (ISO SQL) BNF Notation to Hyperlinked HTML
409 | 
410 | =head1 SYNTAX
411 | 
412 | bnf2html [file ...]
413 | 
414 | =head1 DESCRIPTION
415 | 
416 | The bnf2html filters the annotated BNF (Backus-Naur Form) from its input
417 | files and converts it into HTML on standard output.
418 | 
419 | The HTML is heavily hyperlinked.
420 | Each rule (LHS) links to a table of other rules where it is used on the
421 | RHS.
422 | Similarly, each symbol on the RHS is linked to the rule that defines it.
423 | Thus, it is possible to find where items are used and defined quite
424 | easily.
425 | 
426 | =head1 INPUT FORMAT
427 | 
428 | This script is adapted to the BNF notation using in the SQL standard
429 | (ISO/IEC 9075:2003, for example).
430 | It also takes various forms of annotations.
431 | 
432 | The first line of the file is used as the title in the head section.
433 | It is also used as the text for a H1 header at the top of the body.
434 | 
435 | Lines consisting of two or more equal signs are ignored.
436 | 
437 | Lines consisting of two or more dashes are converted to a horizontal
438 | rule.
439 | 
440 | Lines starting with the SCCS identification string '@(#)' are used to
441 | print version information about the file converted and the script doing
442 | the converting.
443 | 
444 | Lines containing space, colon, colon, equals are treated as rules.
445 | 
446 | Lines starting with white space are treated as continuations of a rule.
447 | 
448 | Lines starting dash, dash, (optionally a slash) and then one or more tag
449 | letters are converted into an HTML start or end tag.
450 | 
451 | Any line starting dash, dash, hash, hash has any HTML entities
452 | introduced by the WEBCODE program removed.
453 | 
454 | The should be at most one line starting '--%start'; this indicates the
455 | start symbol for the bnf2yacc converter, but is effectively ignored by
456 | bnf2html.
457 | 
458 | Any other line is passed through verbatim.
459 | 
460 | =head1 AUTHOR
461 | 
462 | Jonathan Leffler <jonathan.leffler@gmail.com>
463 | 
464 | =cut
465 | 


--------------------------------------------------------------------------------
/bnf2yacc.perl.txt:
--------------------------------------------------------------------------------
  1 | #!/usr/bin/perl -w
  2 | #
  3 | # @(#)$Id: bnf2yacc.pl,v 1.16 2017/11/14 06:53:22 jleffler Exp $
  4 | #
  5 | # Convert SQL-92, SQL-99 BNF plain text file into YACC grammar.
  6 | 
  7 | use strict;
  8 | $| = 1;
  9 | 
 10 | use constant debug => 0;
 11 | 
 12 | my $heading = "";
 13 | my %tokens;
 14 | my %nonterminals;
 15 | my %rules;
 16 | my %used;
 17 | my $start;
 18 | my @grammar;
 19 | 
 20 | my $nt_number = 0;
 21 | 
 22 | # Generate a new non-terminal identifier
 23 | sub new_non_terminal
 24 | {
 25 |     my($prefix) = @_;
 26 |     $prefix = "" unless defined $prefix;
 27 |     return sprintf "${prefix}nt_%03d", ++$nt_number;
 28 | }
 29 | 
 30 | # map_non_terminal converts names that are not acceptable to Yacc into names that are.
 31 | # Non-identifier characters are converted to underscores.
 32 | # If the first character is not alphabetic, prefix 'j_'.
 33 | # Case-convert to lower case.
 34 | sub map_non_terminal
 35 | {
 36 |     my($nt) = @_;
 37 |     $nt =~ s/\W+/_/go;
 38 |     $nt = "j_$nt" unless $nt =~ m/^[a-zA-Z]/o;
 39 |     $nt =~ tr/[A-Z]/[a-z]/;
 40 |     $nt =~ s/__+/_/go;
 41 |     return $nt;
 42 | }
 43 | 
 44 | # scan_rhs breaks up the RHS of a rule into a token stream
 45 | # Keywords (terminals) are prefixed with a '#' marker.
 46 | sub scan_rhs
 47 | {
 48 |     my($tail) = @_;
 49 |     my(@rhs);
 50 |     while ($tail)
 51 |     {
 52 |         print "RHS: $tail\n" if debug;
 53 |         my $name;
 54 |         if ($tail =~ m%^(\s*<([-:/()_\w\s]+)>\s*)%o)
 55 |         {
 56 |             # Simpler regex for non-terminal: <[^>]+>
 57 |             # Non-terminal
 58 |             my $n = $2;
 59 |             print "N: $n\n" if debug;
 60 |             $tail = substr $tail, length($1);
 61 |             $name = map_non_terminal($n);
 62 |             $nonterminals{$name} = 1;
 63 |             $used{$name} = 1;
 64 |             push @rhs, $name;
 65 |         }
 66 |         elsif ($tail =~ m%^(\s*(\w[-\w\d_.]*)\s*)%o)
 67 |         {
 68 |             # Terminal (keyword)
 69 |             # Dot '.' is used in Interfaces.SQL in Ada syntax
 70 |             # Dash '-' is used in EXEC-SQL in the keywords.
 71 |             my $t = $2;
 72 |             print "T: $t\n" if debug;
 73 |             $tail = substr $tail, length($1);
 74 |             $name = $t;
 75 |             $tokens{$name} = 1;
 76 |             push @rhs, "#$name";
 77 |         }
 78 |         elsif ($tail =~ m%^\s*(\.\.\.omitted\.\.\.)\s*%o)
 79 |         {
 80 |             # Something omitted from the grammar.
 81 |             # Triple punctuation detected before double.
 82 |             my $str = "/* $1 */";
 83 |             push @rhs, $str;
 84 |             last;
 85 |         }
 86 |         elsif ($tail =~ m{^(\s*([-.<=>|]{2})\s*)$}o)
 87 |         {
 88 |             # Double-punctuation (non-metacharacters)
 89 |             # .., <=, >=, <>, ||, ->
 90 |             my $p = $2;
 91 |             print "DP: $p\n" if debug;
 92 |             $tail = substr $tail, length($1);
 93 |             $name = "'$p'";
 94 |             push @rhs, $name;
 95 |         }
 96 |         elsif ($tail =~ m{^(\s*([][{}"'%&()*+,-./:;<=>?^_|])\s*)$}o)
 97 |         {
 98 |             # Punctuation (non-metacharacters)
 99 |             # Note that none of '@', '~', '!' or '\' have any significance in SQL
100 |             my $p = $2;
101 |             print "P: $p\n" if debug;
102 |             $tail = substr $tail, length($1);
103 |             $p = "\\'" if $p eq "'";
104 |             $name = "'$p'";
105 |             push @rhs, $name;
106 |         }
107 |         elsif ($tail =~ m%^(\s*('[^']*'))\s*%o ||
108 |                $tail =~ m%^(\s*("[^"]*"))\s*%o)
109 |         {
110 |             # Terminal in quotes - single or double.
111 |             # (Possibly a multi-character string).
112 |             my $q = $2;
113 |             print "Q: $q\n" if debug;
114 |             $tail = substr $tail, length($1);
115 |             $q =~ m%^(['"])(.+)['"]$%o;
116 |             # Expand multi-character string constants.
117 |             # into repeated single-character constants.
118 |             my($o) = $1;
119 |             my($l) = $2;
120 |             while (length($l))
121 |             {
122 |                 my($c) = substr $l, 0, 1;
123 |                 $name = "$o$c$o";
124 |                 $l = substr $l, 1, length($l)-1;
125 |                 push @rhs, $name;
126 |             }
127 |         }
128 |         elsif ($tail =~ m%^(\s*([{}\|\[\]]|\.\.\.)\s*)%o)
129 |         {
130 |             # Punctuation (metacharacters)
131 |             my $p = $2;
132 |             print "M: $p\n" if debug;
133 |             $tail = substr $tail, length($1);
134 |             $name = $p;
135 |             push @rhs, $name;
136 |         }
137 |         elsif ($tail =~ m%^\s*!!%o)
138 |         {
139 |             # Exhortation to see the syntax rules - usually.
140 |             my $str = "/* $tail */";
141 |             push @rhs, $str;
142 |             last;
143 |         }
144 |         else
145 |         {
146 |             # Unknown!
147 |             print "/* UNK: $tail */\n";
148 |             print STDERR "UNK:$.: $tail\n";
149 |             last;
150 |         }
151 |     }
152 |     return(@rhs);
153 | }
154 | 
155 | # Format a Yacc rule given LHS and RHS array
156 | sub record_rule
157 | {
158 |     my($lhs, $comment, @rule) = @_;
159 |     my($production) = "";
160 |     print "==>> record_rule ($lhs : @rule)\n" if debug;
161 |     $production .= "/*\n" if $comment;
162 |     $production .= "$lhs\n\t:\t";
163 |     my $pad = "";
164 |     my $br_count = 0;
165 |     for (my $i = 0; $i <= $#rule; $i++)
166 |     {
167 |         my $item = $rule[$i];
168 |         print "==== item $item\n" if debug;
169 |         if ($item eq "|" && $br_count == 0)
170 |         {
171 |             $production .= "\n\t|\t";
172 |             $pad = "";
173 |         }
174 |         else
175 |         {
176 |             $production .= "$pad$item";
177 |             $pad = " ";
178 |             $br_count++ if ($item eq '[' or $item eq '{');
179 |             $br_count-- if ($item eq ']' or $item eq '}');
180 |         }
181 |     }
182 |     $production .= "\n\t;\n";
183 |     $production .= "*/\n" if $comment;
184 |     $production .= "\n";
185 |     print "$production" if debug;
186 |     push @grammar, $production;
187 |     print "<<== record_rule\n" if debug;
188 | }
189 | 
190 | sub print_iterator
191 | {
192 |     my($lhs,$rhs) = @_;
193 |     my($production) = "";
194 |     print "==>> print_iterator ($lhs $rhs)\n" if debug;
195 |     $production .= "$lhs\n\t:\t$rhs\n\t|\t$lhs $rhs\n\t;\n\n";
196 |     print "<<== print_iterator\n" if debug;
197 |     push @grammar, $production;
198 | }
199 | 
200 | # Process an optional item enclosed in square brackets
201 | sub find_balanced_bracket
202 | {
203 |     my($lhs,@rhs) = @_;
204 |     my(@rule) = ( "/* Nothing */", "|");
205 |     print "==>> find_balanced_bracket ($lhs : @rhs)\n" if debug;
206 |     while (my $name = shift @rhs)
207 |     {
208 |         print "     name = $name\n" if debug;
209 |         if ($name eq ']')
210 |         {
211 |             # Found closing bracket
212 |             # Terminate search
213 |             last;
214 |         }
215 |         elsif ($name eq '[')
216 |         {
217 |             # Found nested optional clause
218 |             my $tag = new_non_terminal('opt_');
219 |             @rhs = find_balanced_bracket($tag, @rhs);
220 |             push @rule, $tag;
221 |         }
222 |         elsif ($name eq '{')
223 |         {
224 |             # Found start of sequence
225 |             my $tag = new_non_terminal('seq_');
226 |             @rhs = find_balanced_brace($tag, @rhs);
227 |             push @rule, $tag;
228 |         }
229 |         elsif ($name eq '}')
230 |         {
231 |             # Found unbalanced close brace.
232 |             # Error!
233 |         }
234 |         elsif ($name eq '...')
235 |         {
236 |             # Found iteration.
237 |             my $tag = new_non_terminal('lst_');
238 |             print "==== find_balanced_bracket: iterator (@rule)\n" if debug;
239 |             my($old) = pop @rule;
240 |             push @rule, $tag;
241 |             print "==== find_balanced_bracket: iterator ($tag/$old - @rule)\n" if debug;
242 |             print_iterator($tag, $old);
243 |         }
244 |         else
245 |         {
246 |             $name =~ s/^#//;
247 |             push @rule, $name;
248 |             $used{$name} = 1;
249 |         }
250 |     }
251 |     record_rule($lhs, 0, @rule);
252 |     print "<<== find_balanced_bracket: @rhs)\n" if debug;
253 |     return(@rhs);
254 | }
255 | 
256 | # Process an sequence item enclosed in curly braces
257 | sub find_balanced_brace
258 | {
259 |     my($lhs,@rhs) = @_;
260 |     my(@rule);
261 |     print "==>> find_balanced_brace ($lhs : @rhs)\n" if debug;
262 |     while (my $name = shift @rhs)
263 |     {
264 |         print "     name = $name\n" if debug;
265 |         if ($name eq '}')
266 |         {
267 |             # Found closing brace
268 |             # Terminate search
269 |             last;
270 |         }
271 |         elsif ($name eq '[')
272 |         {
273 |             # Found nested optional clause
274 |             my $tag = new_non_terminal('opt_');
275 |             @rhs = find_balanced_bracket($tag, @rhs);
276 |             push @rule, $tag;
277 |         }
278 |         elsif ($name eq '{')
279 |         {
280 |             # Found start of sequence
281 |             my $tag = new_non_terminal('seq_');
282 |             @rhs = find_balanced_brace($tag, @rhs);
283 |             push @rule, $tag;
284 |         }
285 |         elsif ($name eq ']')
286 |         {
287 |             # Found unbalanced close brace.
288 |             # Error!
289 |         }
290 |         elsif ($name eq '...')
291 |         {
292 |             # Found iteration.
293 |             my $tag = new_non_terminal('lst_');
294 |             print "==== find_balanced_brace: iterator (@rule)\n" if debug;
295 |             my($old) = pop @rule;
296 |             push @rule, $tag;
297 |             print "==== find_balanced_brace: iterator ($tag/$old - @rule)\n" if debug;
298 |             print_iterator($tag, $old);
299 |         }
300 |         else
301 |         {
302 |             $name =~ s/^#//;
303 |             push @rule, $name;
304 |             $used{$name} = 1;
305 |         }
306 |     }
307 |     record_rule($lhs, 0, @rule);
308 |     print "<<== find_balanced_brace: @rhs)\n" if debug;
309 |     return(@rhs);
310 | }
311 | 
312 | # Note that the [ and { parts are nice and easy because they are
313 | # balanced operators.  The iteration operator ... is much harder to
314 | # process because it is a trailing modifier.  When processing the list
315 | # of symbols, you need to establish whether there is a trailing iterator
316 | # after the current symbol, and modify the behaviour appropriately.
317 | sub process_rhs
318 | {
319 |     my($lhs, $tail) = @_;
320 |     my(@rhs) = scan_rhs($tail);
321 |     print "==>> process_rhs ($lhs : @rhs)\n" if debug;
322 |     # List parsed rule in output only if debugging.
323 |     record_rule($lhs, 1, @rhs) if debug;
324 |     my(@rule);
325 |     while (my $name = shift @rhs)
326 |     {
327 |         print "name = $name\n" if debug;
328 |         if ($name eq '[')
329 |         {
330 |             my $tag = new_non_terminal('opt_');
331 |             @rhs = find_balanced_bracket($tag, @rhs);
332 |             push @rule, $tag;
333 |         }
334 |         elsif ($name eq ']')
335 |         {
336 |             # Found a close bracket for something unbalanced.
337 |             # Error!
338 |         }
339 |         elsif ($name eq '{')
340 |         {
341 |             # Start of mandatory sequence of items, possibly containing alternatives.
342 |             my $tag = new_non_terminal('seq_');
343 |             @rhs = find_balanced_brace($tag, @rhs);
344 |             push @rule, $tag;
345 |         }
346 |         elsif ($name eq '}')
347 |         {
348 |             # Found a close brace for something unbalanced.
349 |             # Error!
350 |         }
351 |         elsif ($name eq '|')
352 |         {
353 |             # End of one alternative and start of a new one.
354 |             print "==== process_rhs: alternative $name\n" if debug;
355 |             push @rule, $name;
356 |         }
357 |         elsif ($name eq '...')
358 |         {
359 |             # Found iteration.
360 |             my $tag = new_non_terminal('lst_');
361 |             my($old) = pop @rule;
362 |             push @rule, $tag;
363 |             print "==== process_rhs: iterator\n" if debug;
364 |             print_iterator($tag, $old);
365 |         }
366 |         elsif ($name =~ m/^#/)
367 |         {
368 |             # Keyword token
369 |             print "==== process_rhs: token $name\n" if debug;
370 |             $name =~ s/^#//;
371 |             push @rule, $name;
372 |         }
373 |         else
374 |         {
375 |             # Non-terminal (or comment)
376 |             print "==== process_rhs: non-terminal $name\n" if debug;
377 |             push @rule, $name;
378 |         }
379 |     }
380 |     print "==== process_rhs: @rule\n" if debug;
381 |     record_rule($lhs, 0, @rule);
382 |     print "<<== process_rhs\n" if debug;
383 | }
384 | 
385 | sub count_unmatched_keys
386 | {
387 |     my($ref1, $ref2) = @_;
388 |     my(%keys) = %$ref1;
389 |     my(%match) = %$ref2;
390 |     my($count) = 0;
391 |     foreach my $key (keys %keys)
392 |     {
393 |         $count++ unless defined $match{$key};
394 |     }
395 |     return $count;
396 | }
397 | 
398 | # ------------------------------------------------------------
399 | 
400 | open INPUT, "cat @ARGV |" or die "$!";
401 | $_ = <INPUT>;
402 | exit 0 unless defined($_);
403 | chomp;
404 | $heading = "%{\n/*\n** $_\n*/\n%}\n\n" unless m/^\s*$/;
405 | 
406 | # Commentary appears in column 1.
407 | # Continuations of rules have a blank in column 1.
408 | # Blank lines, dash lines and equals lines separate rules (are not embedded within them)..
409 | 
410 | while (<INPUT>)
411 | {
412 |     chomp;
413 |     print "DBG:$.: $_\n" if debug;
414 |     next if /^===*$/o;
415 |     next if /^\s*$/o;	# Blank lines
416 |     next if /^---*$/o;	# Horizontal lines
417 |     if (/^--/o)
418 |     {
419 |         # Various HTML pseudo-directives
420 |         if (m%^--/?\w+\b%)
421 |         {
422 |             print "/* $' */\n" if $';
423 |         }
424 |         elsif (/^--%start (\w+)/)
425 |         {
426 |             $start = $1;
427 |             print "/* Start symbol - $start */\n";
428 |         }
429 |         elsif (/^--##/)
430 |         {
431 |             print "/* $_ */\n";
432 |         }
433 |         else
434 |         {
435 |             print "/* Unrecognized 2: $_ */\n";
436 |         }
437 |     }
438 |     elsif (/^@.#..Id:/)
439 |     {
440 |         # Convert what(1) string identifier into version information
441 |         s%^@.#..Id: %%;
442 |         s% \$$%%;
443 |         s%,v % %;
444 |         s%\w+ Exp( \w+)?$%%;
445 |         my @words = split;
446 |         print "/*\n";
447 |         print "** Derived from file $words[0] version $words[1] dated $words[2] $words[3]\n";
448 |         print "*/\n";
449 |     }
450 |     elsif (/ ::=/)
451 |     {
452 |         # Definition line
453 |         my $def = $_;
454 |         $def =~ s%<([-:/()\w\s]+)>.*%$1%o;
455 |         $def = map_non_terminal($def);
456 |         $rules{$def} = 1;
457 |         $nonterminals{$def} = 1;
458 |         my $tail = $_;
459 |         $tail =~ s%.*::=\s*%%;	# Remove LHS of statement
460 |         while (<INPUT>)
461 |         {
462 |             chomp;
463 |             last unless /^\s/;
464 |             $tail .= $_;
465 |         }
466 |         process_rhs($def, $tail);
467 |     }
468 |     else
469 |     {
470 |         # Anything unrecognized passed through as a comment!
471 |         print "/* $_ */\n";
472 |     }
473 | }
474 | 
475 | close INPUT;
476 | 
477 | print "==== End of input phase ====\n" if debug;
478 | 
479 | print $heading if $heading;
480 | 
481 | # List of tokens
482 | foreach my $token (sort keys %tokens)
483 | {
484 |     print "\%token $token\n";
485 | }
486 | print "\n";
487 | 
488 | # Undefined non-terminals might need to be treated as tokens
489 | if (count_unmatched_keys(\%nonterminals, \%rules) > 0)
490 | {
491 |     print "/* The following non-terminals were not defined */\n";
492 |     foreach my $nt (sort keys %nonterminals)
493 |     {
494 |         print "%token $nt\n" unless defined $rules{$nt};
495 |     }
496 |     print "/* End of undefined non-terminals */\n\n";
497 | }
498 | 
499 | # List the rules that are defined in the original grammar.
500 | # Do not list the rules defined by this conversion process.
501 | print "/*\n";
502 | foreach my $nt (sort keys %nonterminals)
503 | {
504 |     print "\%rule $nt\n";
505 | }
506 | print "*/\n\n";
507 | 
508 | 
509 | if (defined $start)
510 | {
511 |     print "%start $start\n\n";
512 |     print "%%\n\n";
513 | }
514 | else
515 | {
516 |     # No start symbol defined - let's see if we can work out what to use.
517 |     # If there's more than one unused non-terminal, then treat them
518 |     # all as simple alternatives to a list of statements.
519 |     my $count = count_unmatched_keys(\%nonterminals, \%used);
520 | 
521 |     if ($count > 1)
522 |     {
523 |         my $prog = "bnf_program";
524 |         my $stmt = "bnf_statement";
525 |         print "%start $prog\n\n";
526 |         print "%%\n\n";
527 |         print "$prog\n\t:\t$stmt\n\t|\t$prog $stmt\n\t;\n\n";
528 |         print "$stmt\n";
529 |         my $pad = "\t:\t";
530 |         foreach my $nt (sort keys %nonterminals)
531 |         {
532 |             unless (defined $used{$nt})
533 |             {
534 |                 print "$pad$nt\n";
535 |                 $pad = "\t|\t";
536 |             }
537 |         }
538 |         print "\t;\n\n";
539 |     }
540 |     elsif ($count == 1)
541 |     {
542 |         foreach my $nt (sort keys %nonterminals)
543 |         {
544 |             print "%start $nt" unless defined $used{$nt};
545 |         }
546 |         print "%%\n\n";
547 |     }
548 |     else
549 |     {
550 |         # No single start symbol - loop?
551 |         # Error!
552 |         print STDERR "$0: no start symbol recognized!\n";
553 |         print "%%\n\n";
554 |     }
555 | }
556 | 
557 | # Output the complete grammar
558 | while (my $line = shift @grammar)
559 | {
560 |     print $line;
561 | }
562 | 
563 | print "\n%%\n\n";
564 | 
565 | __END__
566 | 
567 | =pod
568 | 
569 | Given a rule:
570 | 
571 |   abc:  def ghi jkl
572 | 
573 | The Yacc output is:
574 | 
575 |   abc
576 |       : def ghi jkl
577 |       ;
578 | 
579 | Given a rule:
580 | 
581 |   abc:  def [ ghi ] jkl
582 | 
583 | The Yacc output is:
584 | 
585 |   abc
586 |       : def opt_nt_0001 jkl
587 |       ;
588 | 
589 |   opt_nt_0001
590 |       : /* Nothing */
591 |       | ghi
592 |       ;
593 | 
594 | Given a rule:
595 | 
596 |   abc:  def { ghi } jkl
597 | 
598 | The Yacc output is:
599 | 
600 |   abc
601 |       : def seq_nt_0002 jkl
602 |       ;
603 | 
604 |   seq_nt_0002
605 |       : ghi
606 |       ;
607 | 
608 | Note that such rules are seldom used in isolation; either the contents
609 | of the '{' to '}' contains alternatives, or the construct as a whole is
610 | followed by a repetition.
611 | 
612 | Given a rule:
613 | 
614 |   abc: def | ghi
615 | 
616 | The Yacc output is:
617 | 
618 |   abc
619 |       : def
620 |       | ghi
621 |       ;
622 | 
623 | Given a rule:
624 | 
625 |   abc: def ghi... jkl
626 | 
627 | The Yacc output is:
628 | 
629 |   abc
630 |       : def lst_nt_0003 jkl
631 |       ;
632 | 
633 |   lst_nt_0003
634 |       : ghi
635 |       | lst_nt_0003 ghi
636 |       ;
637 | 
638 | These rules can be, and often are, combined.  The following examples
639 | come from the SQL-99 grammar which is the target of this effort.  The
640 | target of this program is to produce Yacc rules equivalent to those
641 | which follow each fragment.  Note that keywords (equivalently,
642 | terminals) are in upper case only; mixed case or lower case symbols are
643 | non-terminals.
644 | 
645 |   <SQL-client module definition> ::=
646 |                   <module name clause>
647 |                   <language clause>
648 |                   <module authorization clause>
649 |                   [ <module path specification> ]
650 |                   [ <module transform group specification> ]
651 |                   [ <temporary table declaration>... ]
652 |                   <module contents>...
653 | 
654 |   SQL_client_module_definition
655 |         : module_name_clause language_clause module_authorization_clause opt_nt_0001 opt_nt_0002 opt_nt_0003 lst_nt_0004
656 |         ;
657 |   opt_nt_0001
658 |         : /* Nothing */
659 |         | module_path_specification
660 |         ;
661 |   opt_nt_0002
662 |         : /* Nothing */
663 |         | module_transform_group_specification
664 |         ;
665 |   opt_nt_0003
666 |         : /* Nothing */
667 |         | lst_nt_0005
668 |         ;
669 |   lst_nt_0004
670 |         : module_contents
671 |         | lst_nt_0004 module_contents
672 |         ;
673 |   lst_nt_0005
674 |         : temporary_table_declaration
675 |         | lst_nt_0005 temporary_table_declaration
676 |         ;
677 | 
678 | The next example is interesting - it is fairly typical of the grammar,
679 | but is not minimal.  The rule could be written '<identifier body> ::=
680 | <identifier start> [ <identifier part> ... ]' without altering the
681 | meaning.  It is not clear whether this program should apply this
682 | transformation automatically.
683 | 
684 |   <identifier body> ::= <identifier start> [ { <identifier part> }... ]
685 | 
686 |   identifier_body
687 |         : identifier_start opt_nt_0006
688 |         ;
689 |   opt_nt_0006
690 |         : /* Nothing */
691 |         | lst_nt_0007
692 |         ;
693 |   lst_nt_0007
694 |         : seq_nt_0008
695 |         | lst_nt_0007 seq_nt_0008
696 |         ;
697 |   seq_nt_0008
698 |         : identifier_part
699 |         ;
700 | 
701 |   /* Optimized alternative to lst_nt_0007 */
702 |   lst_nt_0007
703 |         : identifier_part
704 |         | lst_nt_0007 identifier_part
705 |         ;
706 | 
707 |   <SQL language identifier> ::=
708 |                   <SQL language identifier start> [ { <underscore> | <SQL language identifier part> }... ]
709 | 
710 |   sql_language_identifier
711 |         : sql_language_identifier_start opt_nt_0009
712 |         ;
713 |   opt_nt_0009
714 |         : /* Nothing */
715 |         | lst_nt_0010
716 |         ;
717 |   lst_nt_0010
718 |         : seq_nt_0011
719 |         | lst_nt_0010 seq_nt_0011
720 |         ;
721 |   seq_nt_0011
722 |         : underscore
723 |         | sql_language_identifier_part
724 |         ;
725 | 
726 | The next rule is the first example with keywords.
727 | 
728 |   <module authorization clause> ::=
729 |                 SCHEMA <schema name>
730 |           |     AUTHORIZATION <module authorization identifier>
731 |           |     SCHEMA <schema name> AUTHORIZATION <module authorization identifier>
732 | 
733 |   module_authorization_clause
734 |         : SCHEMA schema_name
735 |         | AUTHORIZATION module_authorization_identifier
736 |         | SCHEMA schema_name AUTHORIZATION module_authorization_identifier
737 |         ;
738 | 
739 |   <transform group specification> ::=
740 |                   TRANSFORM GROUP { <single group specification> | <multiple group specification> }
741 | 
742 |   transform_group_specification
743 |         : TRANSFORM GROUP seq_nt_0012
744 |         ;
745 |   seq_nt_0012
746 |         : single_group_specification
747 |         | multiple_group_specification
748 |         ;
749 | 
750 |   <multiple group specification> ::= <group specification> [ { <comma> <group specification> }... ]
751 | 
752 |   multiple_group_specification
753 |         : group_specification opt_nt_0013
754 |         ;
755 |   opt_nt_0013
756 |         : /* Nothing */
757 |         | lst_nt_0014
758 |         ;
759 |   lst_nt_0014
760 |         : seq_nt_0015
761 |         | lst_nt_0014 seq_nt_0015
762 |         ;
763 |   seq_nt_0015
764 |         : comma group_specification
765 |         ;
766 | 
767 | Except for the presence of a token (<right paren>) after the optional
768 | list, the next example is equivalent to the previous one.  It does show,
769 | however, that there is an element of lookahead required to tell whether
770 | an optional item contains a list or a sequence or a simple list of
771 | terminals and non-terminals.
772 | 
773 |   <table element list> ::=
774 |                   <left paren> <table element> [ { <comma> <table element> }... ] <right paren>
775 | 
776 |   table_element_list
777 |         : left_paren table_element opt_nt_0016 right_paren
778 |         ;
779 |   opt_nt_0016
780 |         : /* Nothing */
781 |         | lst_nt_0017
782 |         ;
783 |   lst_nt_0017
784 |         : seq_nt_0018
785 |         | lst_nt_0017 seq_nt_0018
786 |         ;
787 |   seq_nt_0018
788 |         : comma table_element
789 |         ;
790 | 
791 | The next example is interesting because the sequence item contains
792 | alternatives with no optionality or iteration.  It suggests that the
793 | term 'sequence' is not necessarily the 'mot juste'.
794 | 
795 |   <column definition> ::=
796 |                   <column name>
797 |                   { <data type> | <domain name> }
798 |                   [ <reference scope check> ]
799 |                   [ <default clause> ]
800 |                   [ <column constraint definition>... ]
801 |                   [ <collate clause> ]
802 | 
803 |   column_definition
804 |         : column_name seq_nt_0019 opt_nt_0020 opt_nt_0021 opt_nt_0022 opt_nt_0023
805 |         ;
806 |   seq_nt_0019
807 |         : data_type
808 |         | domain_name
809 |         ;
810 |   opt_nt_0020
811 |         : /* Nothing */
812 |         | reference_scope_check
813 |         ;
814 |   opt_nt_0021
815 |         : /* Nothing */
816 |         | default_clause
817 |         ;
818 |   opt_nt_0022
819 |         : /* Nothing */
820 |         | lst_nt_0024
821 |         ;
822 |   opt_nt_0023
823 |         : /* Nothing */
824 |         | collate_clause
825 |         ;
826 |   lst_nt_0024
827 |         : column_constraint_definition
828 |         | lst_nt_0024 column_constraint_definition
829 |         ;
830 | 
831 | 
832 |   <select list> ::= <asterisk> | <select sublist> [ { <comma> <select sublist> }... ]
833 | 
834 |   select_list
835 |         : asterisk
836 |         | select_sublist opt_nt_0025
837 |         ;
838 |   opt_nt_0025
839 |         : /* Nothing */
840 |         | lst_nt_0026
841 |         ;
842 |   lst_nt_0026
843 |         : seq_nt_0027
844 |         | lst_nt_0026 seq_nt_0027
845 |         ;
846 |   seq_nt_0027
847 |         : comma select_sublist
848 |         ;
849 | 
850 | The next statement does not introduce any new grammatical features.  It
851 | does, however, trigger a shift/reduce conflict because an LALR(1)
852 | grammar cannot resolve with one lookahead token whether the token WITH
853 | is part of the WITH HIERARCHY OPTION or part of the WITH GRANT OPTION.
854 | Note that should use a non-terminal such as <non-empty comma list of
855 | grantees>, but such structural changes cannot readily be done by this
856 | program.
857 | 
858 |   <grant privilege statement> ::=
859 |                   GRANT <privileges> TO <grantee> [ { <comma> <grantee> }... ]
860 |                   [ WITH HIERARCHY OPTION ] [ WITH GRANT OPTION ] [ GRANTED BY <grantor> ]
861 | 
862 |   grant_privilege_statement
863 |         : GRANT privileges TO grantee opt_nt_0028 opt_nt_0029 opt_nt_0030 opt_nt_0031
864 |         ;
865 |   opt_nt_0028
866 |         : /* Nothing */
867 |         | lst_nt_0032
868 |         ;
869 |   opt_nt_0029
870 |         : /* Nothing */
871 |         | WITH HIERARCHY OPTION
872 |         ;
873 |   opt_nt_0030
874 |         : /* Nothing */
875 |         | WITH GRANT OPTION
876 |         ;
877 |   opt_nt_0031
878 |         : /* Nothing */
879 |         | GRANTED BY grantor
880 |         ;
881 |   lst_nt_0032
882 |         : seq_nt_0033
883 |         | lst_nt_0032 seq_nt_0033
884 |         ;
885 |   seq_nt_0033
886 |         : comma grantee
887 |         ;
888 | 
889 | The next statement reuses material introduced previously, but in a
890 | slightly more complex manner.
891 | 
892 |   <set descriptor information> ::=
893 |                 <set header information> [ { <comma> <set header information> }... ]
894 |           |     VALUE <item number> <set item information> [ { <comma> <set item information> }... ]
895 | 
896 |   set_descriptor_information
897 |         : set_header_information opt_nt_0034
898 |         | VALUE item_number set_item_information opt_nt_0035
899 |         ;
900 |   opt_nt_0034
901 |         : /* Nothing */
902 |         | lst_nt_0036
903 |         ;
904 |   opt_nt_0035
905 |         : /* Nothing */
906 |         | lst_nt_0037
907 |         ;
908 |   lst_nt_0036
909 |         : seq_nt_0038
910 |         | lst_nt_0036 seq_nt_0038
911 |         ;
912 |   lst_nt_0037
913 |         : seq_nt_0039
914 |         | lst_nt_0037 seq_nt_0039
915 |         ;
916 |   seq_nt_0038
917 |         : comma set_header_information
918 |         ;
919 |   seq_nt_0039
920 |         : comma set_item_information
921 |         ;
922 | 
923 | The next statement introduces deeper nesting than any of the previous
924 | ones.  The expansion produces two rules (opt_nt_0040 and opt_nt_0044)
925 | that are identical.  This is indicative of problems with the grammar on
926 | which it is working, which would be better written with a couple of new
927 | non-terminals, <possibly initialized c host identifier> and <non-empty
928 | comma list of possibly initialized c host identifiers>.  However, this
929 | is a stylistic change that should also be made in many other places in
930 | the grammar.
931 | 
932 |   <C CLOB locator variable> ::=
933 |                   SQL TYPE IS CLOB AS LOCATOR
934 |                   <C host identifier> [ <C initial value> ] [ { <comma> <C host identifier> [ <C initial value> ] } ... ]
935 | 
936 |   c_blob_locator_variable
937 |         : SQL TYPE IS CLOB AS LOCATOR c_host_identifier opt_nt_0040 opt_nt_0041
938 |         ;
939 |   opt_nt_0040
940 |         : /* Nothing */
941 |         | c_initial_value
942 |         ;
943 |   opt_nt_0041
944 |         : /* Nothing */
945 |         | lst_nt_0042
946 |         ;
947 |   lst_nt_0042
948 |         : seq_nt_0043
949 |         | lst_nt_0042 seq_nt_0043
950 |         ;
951 |   seq_nt_0043
952 |         : comma c_host_identifier opt_nt_0044
953 |         ;
954 |   opt_nt_0044
955 |         : /* Nothing */
956 |         | c_initial_value
957 |         ;
958 | 
959 | =cut
960 | 


--------------------------------------------------------------------------------
/bnf2yacc.pl:
--------------------------------------------------------------------------------
  1 | #!/usr/bin/perl -w
  2 | #
  3 | # @(#)$Id: bnf2yacc.pl,v 1.16 2017/11/14 06:53:22 jleffler Exp $
  4 | #
  5 | # Convert SQL-92, SQL-99 BNF plain text file into YACC grammar.
  6 | 
  7 | use strict;
  8 | $| = 1;
  9 | 
 10 | use constant debug => 0;
 11 | 
 12 | my $heading = "";
 13 | my %tokens;
 14 | my %nonterminals;
 15 | my %rules;
 16 | my %used;
 17 | my $start;
 18 | my @grammar;
 19 | 
 20 | my $nt_number = 0;
 21 | 
 22 | # Generate a new non-terminal identifier
 23 | sub new_non_terminal
 24 | {
 25 |     my($prefix) = @_;
 26 |     $prefix = "" unless defined $prefix;
 27 |     return sprintf "${prefix}nt_%03d", ++$nt_number;
 28 | }
 29 | 
 30 | # map_non_terminal converts names that are not acceptable to Yacc into names that are.
 31 | # Non-identifier characters are converted to underscores.
 32 | # If the first character is not alphabetic, prefix 'j_'.
 33 | # Case-convert to lower case.
 34 | sub map_non_terminal
 35 | {
 36 |     my($nt) = @_;
 37 |     $nt =~ s/\W+/_/go;
 38 |     $nt = "j_$nt" unless $nt =~ m/^[a-zA-Z]/o;
 39 |     $nt =~ tr/[A-Z]/[a-z]/;
 40 |     $nt =~ s/__+/_/go;
 41 |     return $nt;
 42 | }
 43 | 
 44 | # scan_rhs breaks up the RHS of a rule into a token stream
 45 | # Keywords (terminals) are prefixed with a '#' marker.
 46 | sub scan_rhs
 47 | {
 48 |     my($tail) = @_;
 49 |     my(@rhs);
 50 |     while ($tail)
 51 |     {
 52 |         print "RHS: $tail\n" if debug;
 53 |         my $name;
 54 |         if ($tail =~ m%^(\s*<([-:/()_\w\s]+)>\s*)%o)
 55 |         {
 56 |             # Simpler regex for non-terminal: <[^>]+>
 57 |             # Non-terminal
 58 |             my $n = $2;
 59 |             print "N: $n\n" if debug;
 60 |             $tail = substr $tail, length($1);
 61 |             $name = map_non_terminal($n);
 62 |             $nonterminals{$name} = 1;
 63 |             $used{$name} = 1;
 64 |             push @rhs, $name;
 65 |         }
 66 |         elsif ($tail =~ m%^(\s*(\w[-\w\d_.]*)\s*)%o)
 67 |         {
 68 |             # Terminal (keyword)
 69 |             # Dot '.' is used in Interfaces.SQL in Ada syntax
 70 |             # Dash '-' is used in EXEC-SQL in the keywords.
 71 |             my $t = $2;
 72 |             print "T: $t\n" if debug;
 73 |             $tail = substr $tail, length($1);
 74 |             $name = $t;
 75 |             $tokens{$name} = 1;
 76 |             push @rhs, "#$name";
 77 |         }
 78 |         elsif ($tail =~ m%^\s*(\.\.\.omitted\.\.\.)\s*%o)
 79 |         {
 80 |             # Something omitted from the grammar.
 81 |             # Triple punctuation detected before double.
 82 |             my $str = "/* $1 */";
 83 |             push @rhs, $str;
 84 |             last;
 85 |         }
 86 |         elsif ($tail =~ m{^(\s*([-.<=>|]{2})\s*)$}o)
 87 |         {
 88 |             # Double-punctuation (non-metacharacters)
 89 |             # .., <=, >=, <>, ||, ->
 90 |             my $p = $2;
 91 |             print "DP: $p\n" if debug;
 92 |             $tail = substr $tail, length($1);
 93 |             $name = "'$p'";
 94 |             push @rhs, $name;
 95 |         }
 96 |         elsif ($tail =~ m{^(\s*([][{}"'%&()*+,-./:;<=>?^_|])\s*)$}o)
 97 |         {
 98 |             # Punctuation (non-metacharacters)
 99 |             # Note that none of '@', '~', '!' or '\' have any significance in SQL
100 |             my $p = $2;
101 |             print "P: $p\n" if debug;
102 |             $tail = substr $tail, length($1);
103 |             $p = "\\'" if $p eq "'";
104 |             $name = "'$p'";
105 |             push @rhs, $name;
106 |         }
107 |         elsif ($tail =~ m%^(\s*('[^']*'))\s*%o ||
108 |                $tail =~ m%^(\s*("[^"]*"))\s*%o)
109 |         {
110 |             # Terminal in quotes - single or double.
111 |             # (Possibly a multi-character string).
112 |             my $q = $2;
113 |             print "Q: $q\n" if debug;
114 |             $tail = substr $tail, length($1);
115 |             $q =~ m%^(['"])(.+)['"]$%o;
116 |             # Expand multi-character string constants.
117 |             # into repeated single-character constants.
118 |             my($o) = $1;
119 |             my($l) = $2;
120 |             while (length($l))
121 |             {
122 |                 my($c) = substr $l, 0, 1;
123 |                 $name = "$o$c$o";
124 |                 $l = substr $l, 1, length($l)-1;
125 |                 push @rhs, $name;
126 |             }
127 |         }
128 |         elsif ($tail =~ m%^(\s*([{}\|\[\]]|\.\.\.)\s*)%o)
129 |         {
130 |             # Punctuation (metacharacters)
131 |             my $p = $2;
132 |             print "M: $p\n" if debug;
133 |             $tail = substr $tail, length($1);
134 |             $name = $p;
135 |             push @rhs, $name;
136 |         }
137 |         elsif ($tail =~ m%^\s*!!%o)
138 |         {
139 |             # Exhortation to see the syntax rules - usually.
140 |             my $str = "/* $tail */";
141 |             push @rhs, $str;
142 |             last;
143 |         }
144 |         else
145 |         {
146 |             # Unknown!
147 |             print "/* UNK: $tail */\n";
148 |             print STDERR "UNK:$.: $tail\n";
149 |             last;
150 |         }
151 |     }
152 |     return(@rhs);
153 | }
154 | 
155 | # Format a Yacc rule given LHS and RHS array
156 | sub record_rule
157 | {
158 |     my($lhs, $comment, @rule) = @_;
159 |     my($production) = "";
160 |     print "==>> record_rule ($lhs : @rule)\n" if debug;
161 |     $production .= "/*\n" if $comment;
162 |     $production .= "$lhs\n\t:\t";
163 |     my $pad = "";
164 |     my $br_count = 0;
165 |     for (my $i = 0; $i <= $#rule; $i++)
166 |     {
167 |         my $item = $rule[$i];
168 |         print "==== item $item\n" if debug;
169 |         if ($item eq "|" && $br_count == 0)
170 |         {
171 |             $production .= "\n\t|\t";
172 |             $pad = "";
173 |         }
174 |         else
175 |         {
176 |             $production .= "$pad$item";
177 |             $pad = " ";
178 |             $br_count++ if ($item eq '[' or $item eq '{');
179 |             $br_count-- if ($item eq ']' or $item eq '}');
180 |         }
181 |     }
182 |     $production .= "\n\t;\n";
183 |     $production .= "*/\n" if $comment;
184 |     $production .= "\n";
185 |     print "$production" if debug;
186 |     push @grammar, $production;
187 |     print "<<== record_rule\n" if debug;
188 | }
189 | 
190 | sub print_iterator
191 | {
192 |     my($lhs,$rhs) = @_;
193 |     my($production) = "";
194 |     print "==>> print_iterator ($lhs $rhs)\n" if debug;
195 |     $production .= "$lhs\n\t:\t$rhs\n\t|\t$lhs $rhs\n\t;\n\n";
196 |     print "<<== print_iterator\n" if debug;
197 |     push @grammar, $production;
198 | }
199 | 
200 | # Process an optional item enclosed in square brackets
201 | sub find_balanced_bracket
202 | {
203 |     my($lhs,@rhs) = @_;
204 |     my(@rule) = ( "/* Nothing */", "|");
205 |     print "==>> find_balanced_bracket ($lhs : @rhs)\n" if debug;
206 |     while (my $name = shift @rhs)
207 |     {
208 |         print "     name = $name\n" if debug;
209 |         if ($name eq ']')
210 |         {
211 |             # Found closing bracket
212 |             # Terminate search
213 |             last;
214 |         }
215 |         elsif ($name eq '[')
216 |         {
217 |             # Found nested optional clause
218 |             my $tag = new_non_terminal('opt_');
219 |             @rhs = find_balanced_bracket($tag, @rhs);
220 |             push @rule, $tag;
221 |         }
222 |         elsif ($name eq '{')
223 |         {
224 |             # Found start of sequence
225 |             my $tag = new_non_terminal('seq_');
226 |             @rhs = find_balanced_brace($tag, @rhs);
227 |             push @rule, $tag;
228 |         }
229 |         elsif ($name eq '}')
230 |         {
231 |             # Found unbalanced close brace.
232 |             # Error!
233 |         }
234 |         elsif ($name eq '...')
235 |         {
236 |             # Found iteration.
237 |             my $tag = new_non_terminal('lst_');
238 |             print "==== find_balanced_bracket: iterator (@rule)\n" if debug;
239 |             my($old) = pop @rule;
240 |             push @rule, $tag;
241 |             print "==== find_balanced_bracket: iterator ($tag/$old - @rule)\n" if debug;
242 |             print_iterator($tag, $old);
243 |         }
244 |         else
245 |         {
246 |             $name =~ s/^#//;
247 |             push @rule, $name;
248 |             $used{$name} = 1;
249 |         }
250 |     }
251 |     record_rule($lhs, 0, @rule);
252 |     print "<<== find_balanced_bracket: @rhs)\n" if debug;
253 |     return(@rhs);
254 | }
255 | 
256 | # Process an sequence item enclosed in curly braces
257 | sub find_balanced_brace
258 | {
259 |     my($lhs,@rhs) = @_;
260 |     my(@rule);
261 |     print "==>> find_balanced_brace ($lhs : @rhs)\n" if debug;
262 |     while (my $name = shift @rhs)
263 |     {
264 |         print "     name = $name\n" if debug;
265 |         if ($name eq '}')
266 |         {
267 |             # Found closing brace
268 |             # Terminate search
269 |             last;
270 |         }
271 |         elsif ($name eq '[')
272 |         {
273 |             # Found nested optional clause
274 |             my $tag = new_non_terminal('opt_');
275 |             @rhs = find_balanced_bracket($tag, @rhs);
276 |             push @rule, $tag;
277 |         }
278 |         elsif ($name eq '{')
279 |         {
280 |             # Found start of sequence
281 |             my $tag = new_non_terminal('seq_');
282 |             @rhs = find_balanced_brace($tag, @rhs);
283 |             push @rule, $tag;
284 |         }
285 |         elsif ($name eq ']')
286 |         {
287 |             # Found unbalanced close brace.
288 |             # Error!
289 |         }
290 |         elsif ($name eq '...')
291 |         {
292 |             # Found iteration.
293 |             my $tag = new_non_terminal('lst_');
294 |             print "==== find_balanced_brace: iterator (@rule)\n" if debug;
295 |             my($old) = pop @rule;
296 |             push @rule, $tag;
297 |             print "==== find_balanced_brace: iterator ($tag/$old - @rule)\n" if debug;
298 |             print_iterator($tag, $old);
299 |         }
300 |         else
301 |         {
302 |             $name =~ s/^#//;
303 |             push @rule, $name;
304 |             $used{$name} = 1;
305 |         }
306 |     }
307 |     record_rule($lhs, 0, @rule);
308 |     print "<<== find_balanced_brace: @rhs)\n" if debug;
309 |     return(@rhs);
310 | }
311 | 
312 | # Note that the [ and { parts are nice and easy because they are
313 | # balanced operators.  The iteration operator ... is much harder to
314 | # process because it is a trailing modifier.  When processing the list
315 | # of symbols, you need to establish whether there is a trailing iterator
316 | # after the current symbol, and modify the behaviour appropriately.
317 | sub process_rhs
318 | {
319 |     my($lhs, $tail) = @_;
320 |     my(@rhs) = scan_rhs($tail);
321 |     print "==>> process_rhs ($lhs : @rhs)\n" if debug;
322 |     # List parsed rule in output only if debugging.
323 |     record_rule($lhs, 1, @rhs) if debug;
324 |     my(@rule);
325 |     while (my $name = shift @rhs)
326 |     {
327 |         print "name = $name\n" if debug;
328 |         if ($name eq '[')
329 |         {
330 |             my $tag = new_non_terminal('opt_');
331 |             @rhs = find_balanced_bracket($tag, @rhs);
332 |             push @rule, $tag;
333 |         }
334 |         elsif ($name eq ']')
335 |         {
336 |             # Found a close bracket for something unbalanced.
337 |             # Error!
338 |         }
339 |         elsif ($name eq '{')
340 |         {
341 |             # Start of mandatory sequence of items, possibly containing alternatives.
342 |             my $tag = new_non_terminal('seq_');
343 |             @rhs = find_balanced_brace($tag, @rhs);
344 |             push @rule, $tag;
345 |         }
346 |         elsif ($name eq '}')
347 |         {
348 |             # Found a close brace for something unbalanced.
349 |             # Error!
350 |         }
351 |         elsif ($name eq '|')
352 |         {
353 |             # End of one alternative and start of a new one.
354 |             print "==== process_rhs: alternative $name\n" if debug;
355 |             push @rule, $name;
356 |         }
357 |         elsif ($name eq '...')
358 |         {
359 |             # Found iteration.
360 |             my $tag = new_non_terminal('lst_');
361 |             my($old) = pop @rule;
362 |             push @rule, $tag;
363 |             print "==== process_rhs: iterator\n" if debug;
364 |             print_iterator($tag, $old);
365 |         }
366 |         elsif ($name =~ m/^#/)
367 |         {
368 |             # Keyword token
369 |             print "==== process_rhs: token $name\n" if debug;
370 |             $name =~ s/^#//;
371 |             push @rule, $name;
372 |         }
373 |         else
374 |         {
375 |             # Non-terminal (or comment)
376 |             print "==== process_rhs: non-terminal $name\n" if debug;
377 |             push @rule, $name;
378 |         }
379 |     }
380 |     print "==== process_rhs: @rule\n" if debug;
381 |     record_rule($lhs, 0, @rule);
382 |     print "<<== process_rhs\n" if debug;
383 | }
384 | 
385 | sub count_unmatched_keys
386 | {
387 |     my($ref1, $ref2) = @_;
388 |     my(%keys) = %$ref1;
389 |     my(%match) = %$ref2;
390 |     my($count) = 0;
391 |     foreach my $key (keys %keys)
392 |     {
393 |         $count++ unless defined $match{$key};
394 |     }
395 |     return $count;
396 | }
397 | 
398 | # ------------------------------------------------------------
399 | 
400 | open INPUT, "cat @ARGV |" or die "$!";
401 | $_ = <INPUT>;
402 | exit 0 unless defined($_);
403 | chomp;
404 | $heading = "%{\n/*\n** $_\n*/\n%}\n\n" unless m/^\s*$/;
405 | 
406 | # Commentary appears in column 1.
407 | # Continuations of rules have a blank in column 1.
408 | # Blank lines, dash lines and equals lines separate rules (are not embedded within them)..
409 | 
410 | while (<INPUT>)
411 | {
412 |     chomp;
413 |     print "DBG:$.: $_\n" if debug;
414 |     next if /^===*$/o;
415 |     next if /^\s*$/o;	# Blank lines
416 |     next if /^---*$/o;	# Horizontal lines
417 |     if (/^--/o)
418 |     {
419 |         # Various HTML pseudo-directives
420 |         if (m%^--/?\w+\b%)
421 |         {
422 |             print "/* $' */\n" if $';
423 |         }
424 |         elsif (/^--%start (\w+)/)
425 |         {
426 |             $start = $1;
427 |             print "/* Start symbol - $start */\n";
428 |         }
429 |         elsif (/^--##/)
430 |         {
431 |             print "/* $_ */\n";
432 |         }
433 |         else
434 |         {
435 |             print "/* Unrecognized 2: $_ */\n";
436 |         }
437 |     }
438 |     elsif (/^@.#..Id:/)
439 |     {
440 |         # Convert what(1) string identifier into version information
441 |         s%^@.#..Id: %%;
442 |         s% \$$%%;
443 |         s%,v % %;
444 |         s%\w+ Exp( \w+)?$%%;
445 |         my @words = split;
446 |         print "/*\n";
447 |         print "** Derived from file $words[0] version $words[1] dated $words[2] $words[3]\n";
448 |         print "*/\n";
449 |     }
450 |     elsif (/ ::=/)
451 |     {
452 |         # Definition line
453 |         my $def = $_;
454 |         $def =~ s%<([-:/()\w\s]+)>.*%$1%o;
455 |         $def = map_non_terminal($def);
456 |         $rules{$def} = 1;
457 |         $nonterminals{$def} = 1;
458 |         my $tail = $_;
459 |         $tail =~ s%.*::=\s*%%;	# Remove LHS of statement
460 |         while (<INPUT>)
461 |         {
462 |             chomp;
463 |             last unless /^\s/;
464 |             $tail .= $_;
465 |         }
466 |         process_rhs($def, $tail);
467 |     }
468 |     else
469 |     {
470 |         # Anything unrecognized passed through as a comment!
471 |         print "/* $_ */\n";
472 |     }
473 | }
474 | 
475 | close INPUT;
476 | 
477 | print "==== End of input phase ====\n" if debug;
478 | 
479 | print $heading if $heading;
480 | 
481 | # List of tokens
482 | foreach my $token (sort keys %tokens)
483 | {
484 |     print "\%token $token\n";
485 | }
486 | print "\n";
487 | 
488 | # Undefined non-terminals might need to be treated as tokens
489 | if (count_unmatched_keys(\%nonterminals, \%rules) > 0)
490 | {
491 |     print "/* The following non-terminals were not defined */\n";
492 |     foreach my $nt (sort keys %nonterminals)
493 |     {
494 |         print "%token $nt\n" unless defined $rules{$nt};
495 |     }
496 |     print "/* End of undefined non-terminals */\n\n";
497 | }
498 | 
499 | # List the rules that are defined in the original grammar.
500 | # Do not list the rules defined by this conversion process.
501 | print "/*\n";
502 | foreach my $nt (sort keys %nonterminals)
503 | {
504 |     print "\%rule $nt\n";
505 | }
506 | print "*/\n\n";
507 | 
508 | 
509 | if (defined $start)
510 | {
511 |     print "%start $start\n\n";
512 |     print "%%\n\n";
513 | }
514 | else
515 | {
516 |     # No start symbol defined - let's see if we can work out what to use.
517 |     # If there's more than one unused non-terminal, then treat them
518 |     # all as simple alternatives to a list of statements.
519 |     my $count = count_unmatched_keys(\%nonterminals, \%used);
520 | 
521 |     if ($count > 1)
522 |     {
523 |         my $prog = "bnf_program";
524 |         my $stmt = "bnf_statement";
525 |         print "%start $prog\n\n";
526 |         print "%%\n\n";
527 |         print "$prog\n\t:\t$stmt\n\t|\t$prog $stmt\n\t;\n\n";
528 |         print "$stmt\n";
529 |         my $pad = "\t:\t";
530 |         foreach my $nt (sort keys %nonterminals)
531 |         {
532 |             unless (defined $used{$nt})
533 |             {
534 |                 print "$pad$nt\n";
535 |                 $pad = "\t|\t";
536 |             }
537 |         }
538 |         print "\t;\n\n";
539 |     }
540 |     elsif ($count == 1)
541 |     {
542 |         foreach my $nt (sort keys %nonterminals)
543 |         {
544 |             print "%start $nt" unless defined $used{$nt};
545 |         }
546 |         print "%%\n\n";
547 |     }
548 |     else
549 |     {
550 |         # No single start symbol - loop?
551 |         # Error!
552 |         print STDERR "$0: no start symbol recognized!\n";
553 |         print "%%\n\n";
554 |     }
555 | }
556 | 
557 | # Output the complete grammar
558 | while (my $line = shift @grammar)
559 | {
560 |     print $line;
561 | }
562 | 
563 | print "\n%%\n\n";
564 | 
565 | __END__
566 | 
567 | =pod
568 | 
569 | Given a rule:
570 | 
571 |   abc:  def ghi jkl
572 | 
573 | The Yacc output is:
574 | 
575 |   abc
576 |       : def ghi jkl
577 |       ;
578 | 
579 | Given a rule:
580 | 
581 |   abc:  def [ ghi ] jkl
582 | 
583 | The Yacc output is:
584 | 
585 |   abc
586 |       : def opt_nt_0001 jkl
587 |       ;
588 | 
589 |   opt_nt_0001
590 |       : /* Nothing */
591 |       | ghi
592 |       ;
593 | 
594 | Given a rule:
595 | 
596 |   abc:  def { ghi } jkl
597 | 
598 | The Yacc output is:
599 | 
600 |   abc
601 |       : def seq_nt_0002 jkl
602 |       ;
603 | 
604 |   seq_nt_0002
605 |       : ghi
606 |       ;
607 | 
608 | Note that such rules are seldom used in isolation; either the contents
609 | of the '{' to '}' contains alternatives, or the construct as a whole is
610 | followed by a repetition.
611 | 
612 | Given a rule:
613 | 
614 |   abc: def | ghi
615 | 
616 | The Yacc output is:
617 | 
618 |   abc
619 |       : def
620 |       | ghi
621 |       ;
622 | 
623 | Given a rule:
624 | 
625 |   abc: def ghi... jkl
626 | 
627 | The Yacc output is:
628 | 
629 |   abc
630 |       : def lst_nt_0003 jkl
631 |       ;
632 | 
633 |   lst_nt_0003
634 |       : ghi
635 |       | lst_nt_0003 ghi
636 |       ;
637 | 
638 | These rules can be, and often are, combined.  The following examples
639 | come from the SQL-99 grammar which is the target of this effort.  The
640 | target of this program is to produce Yacc rules equivalent to those
641 | which follow each fragment.  Note that keywords (equivalently,
642 | terminals) are in upper case only; mixed case or lower case symbols are
643 | non-terminals.
644 | 
645 |   <SQL-client module definition> ::=
646 |                   <module name clause>
647 |                   <language clause>
648 |                   <module authorization clause>
649 |                   [ <module path specification> ]
650 |                   [ <module transform group specification> ]
651 |                   [ <temporary table declaration>... ]
652 |                   <module contents>...
653 | 
654 |   SQL_client_module_definition
655 |         : module_name_clause language_clause module_authorization_clause opt_nt_0001 opt_nt_0002 opt_nt_0003 lst_nt_0004
656 |         ;
657 |   opt_nt_0001
658 |         : /* Nothing */
659 |         | module_path_specification
660 |         ;
661 |   opt_nt_0002
662 |         : /* Nothing */
663 |         | module_transform_group_specification
664 |         ;
665 |   opt_nt_0003
666 |         : /* Nothing */
667 |         | lst_nt_0005
668 |         ;
669 |   lst_nt_0004
670 |         : module_contents
671 |         | lst_nt_0004 module_contents
672 |         ;
673 |   lst_nt_0005
674 |         : temporary_table_declaration
675 |         | lst_nt_0005 temporary_table_declaration
676 |         ;
677 | 
678 | The next example is interesting - it is fairly typical of the grammar,
679 | but is not minimal.  The rule could be written '<identifier body> ::=
680 | <identifier start> [ <identifier part> ... ]' without altering the
681 | meaning.  It is not clear whether this program should apply this
682 | transformation automatically.
683 | 
684 |   <identifier body> ::= <identifier start> [ { <identifier part> }... ]
685 | 
686 |   identifier_body
687 |         : identifier_start opt_nt_0006
688 |         ;
689 |   opt_nt_0006
690 |         : /* Nothing */
691 |         | lst_nt_0007
692 |         ;
693 |   lst_nt_0007
694 |         : seq_nt_0008
695 |         | lst_nt_0007 seq_nt_0008
696 |         ;
697 |   seq_nt_0008
698 |         : identifier_part
699 |         ;
700 | 
701 |   /* Optimized alternative to lst_nt_0007 */
702 |   lst_nt_0007
703 |         : identifier_part
704 |         | lst_nt_0007 identifier_part
705 |         ;
706 | 
707 |   <SQL language identifier> ::=
708 |                   <SQL language identifier start> [ { <underscore> | <SQL language identifier part> }... ]
709 | 
710 |   sql_language_identifier
711 |         : sql_language_identifier_start opt_nt_0009
712 |         ;
713 |   opt_nt_0009
714 |         : /* Nothing */
715 |         | lst_nt_0010
716 |         ;
717 |   lst_nt_0010
718 |         : seq_nt_0011
719 |         | lst_nt_0010 seq_nt_0011
720 |         ;
721 |   seq_nt_0011
722 |         : underscore
723 |         | sql_language_identifier_part
724 |         ;
725 | 
726 | The next rule is the first example with keywords.
727 | 
728 |   <module authorization clause> ::=
729 |                 SCHEMA <schema name>
730 |           |     AUTHORIZATION <module authorization identifier>
731 |           |     SCHEMA <schema name> AUTHORIZATION <module authorization identifier>
732 | 
733 |   module_authorization_clause
734 |         : SCHEMA schema_name
735 |         | AUTHORIZATION module_authorization_identifier
736 |         | SCHEMA schema_name AUTHORIZATION module_authorization_identifier
737 |         ;
738 | 
739 |   <transform group specification> ::=
740 |                   TRANSFORM GROUP { <single group specification> | <multiple group specification> }
741 | 
742 |   transform_group_specification
743 |         : TRANSFORM GROUP seq_nt_0012
744 |         ;
745 |   seq_nt_0012
746 |         : single_group_specification
747 |         | multiple_group_specification
748 |         ;
749 | 
750 |   <multiple group specification> ::= <group specification> [ { <comma> <group specification> }... ]
751 | 
752 |   multiple_group_specification
753 |         : group_specification opt_nt_0013
754 |         ;
755 |   opt_nt_0013
756 |         : /* Nothing */
757 |         | lst_nt_0014
758 |         ;
759 |   lst_nt_0014
760 |         : seq_nt_0015
761 |         | lst_nt_0014 seq_nt_0015
762 |         ;
763 |   seq_nt_0015
764 |         : comma group_specification
765 |         ;
766 | 
767 | Except for the presence of a token (<right paren>) after the optional
768 | list, the next example is equivalent to the previous one.  It does show,
769 | however, that there is an element of lookahead required to tell whether
770 | an optional item contains a list or a sequence or a simple list of
771 | terminals and non-terminals.
772 | 
773 |   <table element list> ::=
774 |                   <left paren> <table element> [ { <comma> <table element> }... ] <right paren>
775 | 
776 |   table_element_list
777 |         : left_paren table_element opt_nt_0016 right_paren
778 |         ;
779 |   opt_nt_0016
780 |         : /* Nothing */
781 |         | lst_nt_0017
782 |         ;
783 |   lst_nt_0017
784 |         : seq_nt_0018
785 |         | lst_nt_0017 seq_nt_0018
786 |         ;
787 |   seq_nt_0018
788 |         : comma table_element
789 |         ;
790 | 
791 | The next example is interesting because the sequence item contains
792 | alternatives with no optionality or iteration.  It suggests that the
793 | term 'sequence' is not necessarily the 'mot juste'.
794 | 
795 |   <column definition> ::=
796 |                   <column name>
797 |                   { <data type> | <domain name> }
798 |                   [ <reference scope check> ]
799 |                   [ <default clause> ]
800 |                   [ <column constraint definition>... ]
801 |                   [ <collate clause> ]
802 | 
803 |   column_definition
804 |         : column_name seq_nt_0019 opt_nt_0020 opt_nt_0021 opt_nt_0022 opt_nt_0023
805 |         ;
806 |   seq_nt_0019
807 |         : data_type
808 |         | domain_name
809 |         ;
810 |   opt_nt_0020
811 |         : /* Nothing */
812 |         | reference_scope_check
813 |         ;
814 |   opt_nt_0021
815 |         : /* Nothing */
816 |         | default_clause
817 |         ;
818 |   opt_nt_0022
819 |         : /* Nothing */
820 |         | lst_nt_0024
821 |         ;
822 |   opt_nt_0023
823 |         : /* Nothing */
824 |         | collate_clause
825 |         ;
826 |   lst_nt_0024
827 |         : column_constraint_definition
828 |         | lst_nt_0024 column_constraint_definition
829 |         ;
830 | 
831 | 
832 |   <select list> ::= <asterisk> | <select sublist> [ { <comma> <select sublist> }... ]
833 | 
834 |   select_list
835 |         : asterisk
836 |         | select_sublist opt_nt_0025
837 |         ;
838 |   opt_nt_0025
839 |         : /* Nothing */
840 |         | lst_nt_0026
841 |         ;
842 |   lst_nt_0026
843 |         : seq_nt_0027
844 |         | lst_nt_0026 seq_nt_0027
845 |         ;
846 |   seq_nt_0027
847 |         : comma select_sublist
848 |         ;
849 | 
850 | The next statement does not introduce any new grammatical features.  It
851 | does, however, trigger a shift/reduce conflict because an LALR(1)
852 | grammar cannot resolve with one lookahead token whether the token WITH
853 | is part of the WITH HIERARCHY OPTION or part of the WITH GRANT OPTION.
854 | Note that should use a non-terminal such as <non-empty comma list of
855 | grantees>, but such structural changes cannot readily be done by this
856 | program.
857 | 
858 |   <grant privilege statement> ::=
859 |                   GRANT <privileges> TO <grantee> [ { <comma> <grantee> }... ]
860 |                   [ WITH HIERARCHY OPTION ] [ WITH GRANT OPTION ] [ GRANTED BY <grantor> ]
861 | 
862 |   grant_privilege_statement
863 |         : GRANT privileges TO grantee opt_nt_0028 opt_nt_0029 opt_nt_0030 opt_nt_0031
864 |         ;
865 |   opt_nt_0028
866 |         : /* Nothing */
867 |         | lst_nt_0032
868 |         ;
869 |   opt_nt_0029
870 |         : /* Nothing */
871 |         | WITH HIERARCHY OPTION
872 |         ;
873 |   opt_nt_0030
874 |         : /* Nothing */
875 |         | WITH GRANT OPTION
876 |         ;
877 |   opt_nt_0031
878 |         : /* Nothing */
879 |         | GRANTED BY grantor
880 |         ;
881 |   lst_nt_0032
882 |         : seq_nt_0033
883 |         | lst_nt_0032 seq_nt_0033
884 |         ;
885 |   seq_nt_0033
886 |         : comma grantee
887 |         ;
888 | 
889 | The next statement reuses material introduced previously, but in a
890 | slightly more complex manner.
891 | 
892 |   <set descriptor information> ::=
893 |                 <set header information> [ { <comma> <set header information> }... ]
894 |           |     VALUE <item number> <set item information> [ { <comma> <set item information> }... ]
895 | 
896 |   set_descriptor_information
897 |         : set_header_information opt_nt_0034
898 |         | VALUE item_number set_item_information opt_nt_0035
899 |         ;
900 |   opt_nt_0034
901 |         : /* Nothing */
902 |         | lst_nt_0036
903 |         ;
904 |   opt_nt_0035
905 |         : /* Nothing */
906 |         | lst_nt_0037
907 |         ;
908 |   lst_nt_0036
909 |         : seq_nt_0038
910 |         | lst_nt_0036 seq_nt_0038
911 |         ;
912 |   lst_nt_0037
913 |         : seq_nt_0039
914 |         | lst_nt_0037 seq_nt_0039
915 |         ;
916 |   seq_nt_0038
917 |         : comma set_header_information
918 |         ;
919 |   seq_nt_0039
920 |         : comma set_item_information
921 |         ;
922 | 
923 | The next statement introduces deeper nesting than any of the previous
924 | ones.  The expansion produces two rules (opt_nt_0040 and opt_nt_0044)
925 | that are identical.  This is indicative of problems with the grammar on
926 | which it is working, which would be better written with a couple of new
927 | non-terminals, <possibly initialized c host identifier> and <non-empty
928 | comma list of possibly initialized c host identifiers>.  However, this
929 | is a stylistic change that should also be made in many other places in
930 | the grammar.
931 | 
932 |   <C CLOB locator variable> ::=
933 |                   SQL TYPE IS CLOB AS LOCATOR
934 |                   <C host identifier> [ <C initial value> ] [ { <comma> <C host identifier> [ <C initial value> ] } ... ]
935 | 
936 |   c_blob_locator_variable
937 |         : SQL TYPE IS CLOB AS LOCATOR c_host_identifier opt_nt_0040 opt_nt_0041
938 |         ;
939 |   opt_nt_0040
940 |         : /* Nothing */
941 |         | c_initial_value
942 |         ;
943 |   opt_nt_0041
944 |         : /* Nothing */
945 |         | lst_nt_0042
946 |         ;
947 |   lst_nt_0042
948 |         : seq_nt_0043
949 |         | lst_nt_0042 seq_nt_0043
950 |         ;
951 |   seq_nt_0043
952 |         : comma c_host_identifier opt_nt_0044
953 |         ;
954 |   opt_nt_0044
955 |         : /* Nothing */
956 |         | c_initial_value
957 |         ;
958 | 
959 | =cut
960 | 


--------------------------------------------------------------------------------
/index.html:
--------------------------------------------------------------------------------
  1 | <!-- @(#)$Id: index.html,v 1.9 2017/11/13 20:12:50 jleffler Exp $ -->
  2 | 
  3 | <html>
  4 | <head>
  5 | <title> BNF Grammars for SQL-92, SQL-99 and SQL-2003 </title>
  6 | </head>
  7 | 
  8 | <body bgcolor="WHITE">
  9 | <h1> BNF Grammars for SQL-92, SQL-99 and SQL-2003 </h1>
 10 | 
 11 | <h2> SQL-92 </h2>
 12 | 
 13 | Here is a heavily hyperlinked <a href="sql-92.bnf.html"> HTML </a>
 14 | version of the BNF grammar for SQL-92 (ISO/IEC 9075:1992 - Database Language -
 15 | SQL).
 16 | 
 17 | The <a href="sql-92.bnf"> plain text </a> file from which it was
 18 | automatically converted is more useful (read legible) for reading
 19 | without a browser.
 20 | 
 21 | <h2> SQL-99 </h2>
 22 | 
 23 | Here is a heavily hyperlinked <a href="sql-99.bnf.html"> HTML </a>
 24 | version of the BNF grammar for SQL-99 (ISO/IEC 9075-2:1999 - Database
 25 | Languages - SQL - Part 2: Foundation (SQL/Foundation)).
 26 | 
 27 | The <a href="sql-99.bnf"> plain text </a> file from which it was
 28 | automatically converted is more useful (read legible) for reading
 29 | without a browser.
 30 | 
 31 | <h2> SQL-2003 </h2>
 32 | <p>
 33 | Here is a heavily hyperlinked <a href="sql-2003-2.bnf.html"> HTML </a>
 34 | version of the BNF grammar for SQL-2003 (ISO/IEC 9075-2:2003 - Database
 35 | Languages - SQL - Part 2: Foundation (SQL/Foundation)).
 36 | 
 37 | The <a href="sql-2003-2.bnf"> plain text </a> file from which it was
 38 | automatically converted is more useful (read legible) for reading
 39 | without a browser.
 40 | </p>
 41 | 
 42 | <p>
 43 | There is a separate file <a href="sql-2003-1.bnf.html"> HTML </a> for
 44 | the information from ISO/IEC 9075-1:2003 - Database Languages - SQL - Part
 45 | 1: Framework (SQL/Framework).
 46 | 
 47 | It was automatically converted from the <a href="sql-2003-1.bnf"> plain
 48 | text </a> file, which is more useful (read legible) for reading without
 49 | a browser.
 50 | </p>
 51 | 
 52 | <p>
 53 | Also available:
 54 | <bl>
 55 | <li> <a href="sql-2003-core-features.html"> SQL 2003 Core Features </a> </li>
 56 | <li> <a href="sql-2003-noncore-features.html"> SQL 2003 Non-Core Features </a> </li>
 57 | </bl>
 58 | 
 59 | <h2> Informix OUTER Join Syntax </h2>
 60 | 
 61 | Here is an <a href="outer-joins.html"> HTML </a> explanation of the
 62 | Informix OUTER join syntax.
 63 | 
 64 | <h2> Conversion tools </h2>
 65 | 
 66 | <p>
 67 | The plain text was converted to HTML by the Perl script
 68 | <a href="bnf2html.perl.txt"> bnf2html </a> which you may use if you wish.
 69 | The bnf2html script also uses the C program
 70 | <a href="webcode-1.09.tgz"> WEBCODE version 1.09 </a>
 71 | which you can download as a gzipped tar file.
 72 | </p>
 73 | 
 74 | <p>
 75 | See also <a href="bnf2yacc.perl.txt"> bnf2yacc </a>, an experimental
 76 | script to convert BNF into an outline Yacc grammar.
 77 | The generated grammar typically includes some unacceptable tokens, such
 78 | as <i>%token 0</i>, that should be handled by the lexical analyzer
 79 | rather than the grammar.
 80 | The SQL standard includes such rules as grammar rules.
 81 | </p>
 82 | 
 83 | <p>
 84 | <i>(The Perl scripts should normally be renamed after downloading.)</i>
 85 | </p>
 86 | 
 87 | <h2> Download </h2>
 88 | 
 89 | You can download a gzipped tar file containing the raw grammars, the
 90 | HTML versions of those grammars, and the conversion tools as the gzipped
 91 | tar file <a href="sql-bnf.tgz"> sql-bnf.tgz </a>.
 92 | 
 93 | <hr>
 94 | Please send feedback to Jonathan Leffler:
 95 | <a href="mailto:jonathan.leffler@gmail.com"> jonathan.leffler@gmail.com </a>.
 96 | <p>
 97 | Last modified:
 98 | 13th November 2017
 99 | </body>
100 | </html>
101 | 


--------------------------------------------------------------------------------
/sql-2003-1.bnf:
--------------------------------------------------------------------------------
  1 | BNF Grammar for ISO/IEC 9075-1:2003 SQL/Foundation - Database Language SQL (SQL-2003)
  2 | =====================================================================================
  3 | 
  4 | @(#)$Id: sql-2003-1.bnf,v 1.4 2017/11/14 06:53:22 jleffler Exp $
  5 | 
  6 | --p
  7 | Information taken from the Final Committee Draft (FCD) of ISO/IEC 9075-1:2003.
  8 | --/p
  9 | 
 10 | 
 11 | --p
 12 | The plain text version of this grammar is
 13 | --## <a href='sql-2003-1.bnf'> sql-2003-1.bnf </a>.
 14 | --/p
 15 | 
 16 | --hr
 17 | --h2 Identifying the version of SQL in use
 18 | --/h2
 19 | 
 20 | --p
 21 | This material (starting with <SQL object identifier>) is defined in
 22 | section 6.3 "Object Identifier for Database Language SQL" of ISO/IEC
 23 | 9075-1:1999 (SQL Framework).
 24 | It is used to express the capabilities of an implementation.
 25 | The package names are identifiers such as 'PKG001', equivalent to
 26 | 'Enhanced datetime facilities', as defined in the informative Annex B to
 27 | SQL Framework.
 28 | Each such package identifies a number of features that are provided when
 29 | the SQL object identifier claims to provide the package.
 30 | --/p
 31 | 
 32 | --hr
 33 | --h2 6.3 Object identifier for Database Language SQL
 34 | --/h2
 35 | 
 36 | <SQL object identifier> ::= <SQL provenance> <SQL variant>
 37 | 
 38 | <SQL provenance> ::= <arc1> <arc2> <arc3>
 39 | 
 40 | <arc1> ::= iso | 1 | iso <left paren> 1 <right paren>
 41 | 
 42 | <arc2> ::= standard | 0 | standard <left paren> 0 <right paren>
 43 | 
 44 | <arc3> ::= 9075
 45 | 
 46 | <SQL variant> ::= <SQL edition> <SQL conformance>
 47 | 
 48 | <SQL edition> ::= <1987> | <1989> | <1992> | <1999> | <2003>
 49 | 
 50 | <1987> ::= 0 | edition1987 <left paren> 0 <right paren>
 51 | 
 52 | <1989> ::= <1989 base> <1989 package>
 53 | 
 54 | <1989 base> ::= 1 | edition1989 <left paren> 1 <right paren>
 55 | 
 56 | <1989 package> ::= <integrity no> | <integrity yes>
 57 | 
 58 | <integrity no> ::= 0 | IntegrityNo <left paren> 0 <right paren>
 59 | 
 60 | <integrity yes> ::= 1 | IntegrityYes <left paren> 1 <right paren>
 61 | 
 62 | <1992> ::= 2 | edition1992 <left paren> 2 <right paren>
 63 | 
 64 | <SQL conformance> ::= <level> <bindings> <parts> <packages>
 65 | 
 66 | <level> ::= <low> | <intermediate> | <high>
 67 | 
 68 | <low> ::= 0 | Low <left paren> 0 <right paren>
 69 | 
 70 | <intermediate> ::= 1 | Intermediate <left paren> 1 <right paren>
 71 | 
 72 | <high> ::= 2 | High <left paren> 2 <right paren>
 73 | 
 74 | <1999> ::= 3 | edition1999 <left paren> 3 <right paren>
 75 | 
 76 | <2003> ::= 4 | edition2003 <left paren> 4 <right paren>
 77 | 
 78 | <bindings> ::= <module> <embedded> <direct> <invoked routine languages>
 79 | 
 80 | <module> ::= <module no> | <module languages>...
 81 | 
 82 | <module languages> ::=
 83 | 		<module Ada>
 84 | 	|	<module C>
 85 | 	|	<module COBOL>
 86 | 	|	<module Fortran>
 87 | 	|	<module MUMPS>
 88 | 	|	<module Pascal>
 89 | 	|	<module PL/I>
 90 | 
 91 | <module Ada> ::=
 92 | 		1 | moduleAda <left paren> 1 <right paren>
 93 | 
 94 | <module C> ::=
 95 | 		2 | moduleC <left paren> 2 <right paren>
 96 | 
 97 | <module COBOL> ::=
 98 | 		3 | moduleCOBOL <left paren> 3 <right paren>
 99 | 
100 | <module Fortran> ::=
101 | 		4 | moduleFortran <left paren> 4 <right paren>
102 | 
103 | <module MUMPS> ::=
104 | 		5 | moduleMUMPS <left paren> 5 <right paren>
105 | 
106 | <module Pascal> ::=
107 | 		6 | modulePascal <left paren> 6 <right paren>
108 | 
109 | <module PL/I> ::=
110 | 		7 | modulePLI <left paren> 7 <right paren>
111 | 
112 | <embedded> ::= <embedded no> | <embedded languages>...
113 | 
114 | <embedded languages> ::=
115 | 		<embedded Ada>
116 | 	|	<embedded C>
117 | 	|	<embedded COBOL>
118 | 	|	<embedded Fortran>
119 | 	|	<embedded MUMPS>
120 | 	|	<embedded Pascal>
121 | 	|	<embedded PL/I>
122 | 
123 | <embedded Ada> ::=
124 | 		1 | embeddedAda <left paren> 1 <right paren>
125 | 
126 | <embedded C> ::=
127 | 		2 | embeddedC <left paren> 2 <right paren>
128 | 
129 | <embedded COBOL> ::=
130 | 		3 | embeddedCOBOL <left paren> 3 <right paren>
131 | 
132 | <embedded Fortran> ::=
133 | 		4 | embeddedFortran <left paren> 4 <right paren>
134 | 
135 | <embedded MUMPS> ::=
136 | 		5 | embeddedMUMPS <left paren> 5 <right paren>
137 | 
138 | <embedded Pascal> ::=
139 | 		6 | embeddedPascal <left paren> 6 <right paren>
140 | 
141 | <embedded PL/I> ::=
142 | 		7 | embeddedPLI <left paren> 7 <right paren>
143 | 
144 | <direct> ::= <direct yes> | <direct no>
145 | 
146 | <direct yes> ::=
147 | 		1 | directyes <left paren> 1 <right paren>
148 | 
149 | <direct no> ::=
150 | 		0 | directno <left paren> 0 <right paren>
151 | 
152 | <invoked routine languages> ::=
153 | 		<invoked Ada>
154 | 	|	<invoked C>
155 | 	|	<invoked COBOL>
156 | 	|	<invoked Fortran>
157 | 	|	<invoked MUMPS>
158 | 	|	<invoked Pascal>
159 | 	|	<invoked PL/I>
160 | 
161 | <invoked Ada> ::=
162 | 		1 | invokedAda <left paren> 1 <right paren>
163 | 
164 | <invoked C> ::=
165 | 		2 | invokedC <left paren> 2 <right paren>
166 | 
167 | <invoked COBOL> ::=
168 | 		3 | invokedCOBOL <left paren> 3 <right paren>
169 | 
170 | <invoked Fortran> ::=
171 | 		4 | invokedFortran <left paren> 4 <right paren>
172 | 
173 | <invoked MUMPS> ::=
174 | 		5 | invokedMUMPS <left paren> 5 <right paren>
175 | 
176 | <invoked Pascal> ::=
177 | 		6 | invokedPascal <left paren> 6 <right paren>
178 | 
179 | <invoked PL/I> ::=
180 | 		7 | invokedPLI <left paren> 7 <right paren>
181 | 
182 | <parts> ::=
183 | 		<Part 3> <Part 4> <Part 7> <Part 9> <Part 10> <Part 11>
184 | 
185 | <Part n> ::= <Part n no> | <Part n yes>
186 | 
187 | <Part n no> ::= 0 | Part-nNo <left paren> 0 <right paren>
188 | 
189 | <Part n yes> ::= !! as specified in ISO/IEC 9075-n
190 | 
191 | <packages> ::= <Package PKGi>...
192 | 
193 | <Package PKGi> ::=
194 | 		<Package PKGiYes>
195 | 	|	<Package PKGiNo>
196 | 
197 | 
198 | --hr
199 | --h2 Annex B (informative) SQL Packages:
200 | --/h2
201 | 
202 | --## <table border=1>
203 | --## <tr><td> 1 </td><td> PKG001 </td><td> Enhanced datetime facilities </td></tr>
204 | --## <tr><td> 2 </td><td> PKG002 </td><td> Enhanced integrity management </td></tr>
205 | --## <tr><td> 3 </td><td> PKG004 </td><td> PSM </td></tr>
206 | --## <tr><td> 4 </td><td> PKG005 </td><td> CLI </td></tr>
207 | --## <tr><td> 5 </td><td> PKG006 </td><td> Basic object support </td></tr>
208 | --## <tr><td> 6 </td><td> PKG007 </td><td> Enhanced object support </td></tr>
209 | --## <tr><td> 7 </td><td> PKG008 </td><td> Active database </td></tr>
210 | --## <tr><td> 8 </td><td> PKG009 </td><td> SQL/MM support </td></tr>
211 | --## <tr><td> 9 </td><td> PKG010 </td><td> OLAP </td></tr>
212 | --## </table>
213 | 
214 | --hr
215 | --h2 B.1 Enhanced datetime facilities
216 | --/h2
217 | 
218 | --p
219 | The package called "Enhanced datetime facilities" comprises the following features of the SQL
220 | language as specified in the SQL Feature Taxonomy Annex of the various parts of ISO/IEC 9075.
221 | --/p
222 | 
223 | --p
224 | --## <table border=1>
225 | --## <tr> <td> Feature F052 </td> <td> Intervals and datetime arithmetic </td> </tr>
226 | --## <tr> <td> Feature F411 </td> <td> Time zone specification </td> </tr>
227 | --## <tr> <td> Feature F555 </td> <td> Enhanced seconds precision </td> </tr>
228 | --## </table>
229 | --/p
230 | 
231 | --hr
232 | --h2
233 | B.2 Enhanced integrity management
234 | --/h2
235 | 
236 | --p
237 | The package called "Enhanced integrity management" comprises the following features of the SQL
238 | language as specified in the SQL Feature Taxonomy Annex of the various parts of ISO/IEC 9075.
239 | --/p
240 | 
241 | --p
242 | --## <table border=1>
243 | --## <tr> <td> Feature F191 </td> <td> Referential delete actions </td> </tr>
244 | --## <tr> <td> Feature F521 </td> <td> Assertions </td> </tr>
245 | --## <tr> <td> Feature F701 </td> <td> Referential update actions </td> </tr>
246 | --## <tr> <td> Feature F491 </td> <td> Constraint management </td> </tr>
247 | --## <tr> <td> Feature F671 </td> <td> Subqueries in CHECK constraints </td> </tr>
248 | --## <tr> <td> Feature T201 </td> <td> Comparable data types for referential constraints </td> </tr>
249 | --## <tr> <td> Feature T211 </td> <td> Basic trigger capability </td> </tr>
250 | --## <tr> <td> Feature T212 </td> <td> Enhanced trigger capability </td> </tr>
251 | --## <tr> <td> Feature T191 </td> <td> Referential action RESTRICT </td> </tr>
252 | --## </table>
253 | --/p
254 | 
255 | --hr
256 | --h2 B.3 PSM
257 | --/h2
258 | 
259 | --p
260 | The package called "PSM" comprises the following features of the SQL language as specified in the
261 | SQL Feature Taxonomy Annex of the various parts of ISO/IEC 9075.
262 | --/p
263 | 
264 | --p
265 | --## <table border=1>
266 | --## <tr> <td> Feature T322 </td> <td> Overloading of SQL-invoked functions and SQL-invoked procedures </td> </tr>
267 | --## <tr> <td> Feature P001 </td> <td> Stored modules </td> </tr>
268 | --## <tr> <td> Feature P002 </td> <td> Computational completeness </td> </tr>
269 | --## <tr> <td> Feature P003 </td> <td> Information Schema views </td> </tr>
270 | --## </table>
271 | --/p
272 | 
273 | --hr
274 | --h2 B.4 CLI
275 | --/h2
276 | 
277 | --p
278 | The package called "CLI" comprises the following features of the SQL language as specified in the
279 | SQL Feature Taxonomy Annex of the various parts of ISO/IEC 9075.
280 | --/p
281 | 
282 | --p
283 | --## <table border=1>
284 | --## <tr> <td> Feature C011 </td> <td> SQL/CLI </td> </tr>
285 | --## <tr> <td> Feature C021 </td> <td> Automatic population of Implementation Parameter Descriptor </td> </tr>
286 | --## <tr> <td> Feature C041 </td> <td> Information Schema data controlled by current privileges </td> </tr>
287 | --## <tr> <td> Feature C051 </td> <td> GetData extensions </td> </tr>
288 | --## <tr> <td> Feature C061 </td> <td> GetParamData extensions </td> </tr>
289 | --## <tr> <td> Feature C071 </td> <td> Scroll Concurrency </td> </tr>
290 | --## <tr> <td> Feature C081 </td> <td> Read-only data source </td> </tr>
291 | --## </table>
292 | --/p
293 | 
294 | --hr
295 | --h2 B.5 Basic object support
296 | --/h2
297 | 
298 | --p
299 | The package called "basic object support" comprises the following features of the SQL language as
300 | specified in the SQL Feature Taxonomy Annex of the various parts of ISO/IEC 9075.
301 | --/p
302 | 
303 | --p
304 | --## <table border=1>
305 | --## <tr> <td> Feature S023 </td> <td> Basic structured types </td> </tr>
306 | --## <tr> <td> Feature S041 </td> <td> Basic reference types </td> </tr>
307 | --## <tr> <td> Feature S051 </td> <td> Create table of type </td> </tr>
308 | --## <tr> <td> Feature S151 </td> <td> Type predicate </td> </tr>
309 | --## <tr> <td> Feature T041 </td> <td> Basic LOB data type support </td> </tr>
310 | --## </table>
311 | --/p
312 | 
313 | --hr
314 | --h2 B.6 Enhanced object support
315 | --/h2
316 | 
317 | --p
318 | The package called "enhanced object support" comprises all of the features of the package called
319 | (Basic object support), plus the following features of the SQL language as specified in the SQL
320 | Feature Taxonomy Annex of the various parts of ISO/IEC 9075.
321 | --/p
322 | 
323 | --p
324 | --## <table border=1>
325 | --## <tr> <td> Feature S024 </td> <td> Enhanced structured types </td> </tr>
326 | --## <tr> <td> Feature S043 </td> <td> Enhanced reference types </td> </tr>
327 | --## <tr> <td> Feature S071 </td> <td> SQL-paths in function and type name resolution </td> </tr>
328 | --## <tr> <td> Feature S081 </td> <td> Subtables </td> </tr>
329 | --## <tr> <td> Feature S111 </td> <td> ONLY in query expressions </td> </tr>
330 | --## <tr> <td> Feature S161 </td> <td> Subtype treatment </td> </tr>
331 | --## <tr> <td> Feature S211 </td> <td> User-defined cast functions </td> </tr>
332 | --## <tr> <td> Feature S231 </td> <td> Structured type locators </td> </tr>
333 | --## <tr> <td> Feature S241 </td> <td> Transform functions </td> </tr>
334 | --## </table>
335 | --/p
336 | 
337 | --hr
338 | --h2 B.7 Active database
339 | --/h2
340 | 
341 | --p
342 | The package called "Active database" comprises the following features of the SQL language as
343 | specified in the SQL Feature Taxonomy Annex of the various parts of ISO/IEC 9075.
344 | --/p
345 | 
346 | --p
347 | --## <table border=1>
348 | --## <tr> <td> Feature T211 </td> <td> Basic trigger capability </td> </tr>
349 | --## </table>
350 | --/p
351 | 
352 | --hr
353 | --h2 B.8 OLAP
354 | --/h2
355 | 
356 | --p
357 | The package called "OLAP" comprises the following features of the SQL language as specified in the
358 | SQL Feature Taxonomy Annex of the various parts of ISO/IEC 9075.
359 | --/p
360 | 
361 | --p
362 | --## <table border=1>
363 | --## <tr> <td> Feature T431 </td> <td> Extended grouping capabilities </td> </tr>
364 | --## <tr> <td> Feature T611 </td> <td> Elementary OLAP operators </td> </tr>
365 | --## </table>
366 | --/p
367 | 
368 | --hr
369 | --h2 END OF SQL-2003-1 GRAMMAR
370 | --/h2
371 | 
372 | --hr
373 | 
374 | 


--------------------------------------------------------------------------------
/sql-2003-2.ebnf.readme:
--------------------------------------------------------------------------------
 1 | How to use sql-2003-2.ebnf
 2 | ==========================
 3 | This file was extracted manually from sql-2003-2.bnf by Domingo Alvarez Duarte.
 4 | Many thanx Domingo!
 5 | 
 6 | It is the latter file as pure ebnf, i.e. with the original markup stripped.
 7 | 
 8 | The corresponding XHTML + SVG file is sql-2003-2-railroad-diagrams.xhtml.
 9 | 
10 | The grammar can be rendered as railroad diagrams on the website https://www.bottlecaps.de/rr/ui.
11 | Just copy it into the Edit Grammar tab, and click View Diagram. Then wait a moment :-).
12 | 
13 | Domingo notes:
14 | 1. Non reserved words appear with * as suffix, ex: SUBSTRING -> SUBSTRING*.
15 | 
16 | 2. You can navigate through the railroad's grammar by clicking the rectangular boxes.
17 | 
18 | Finally, you can download the rendered railroad as either XHTML + SVG or HTML + PNG.
19 | 


--------------------------------------------------------------------------------
/sql-2003-core-features.html:
--------------------------------------------------------------------------------
  1 | <html>
  2 | <head>
  3 | <title> SQL 2003 Feature Taxonomy and Definition for Core SQL </title>
  4 | </head>
  5 | <body>
  6 | 
  7 | <h1> SQL 2003 (Annex F, Table 34) Feature Taxonomy and Definition for Core SQL </h1>
  8 | 
  9 | Derived from Final Committee Draft (FCD) of ISO/IEC 9075-2:2003.
 10 | <p>
 11 | 
 12 | <table border=1>
 13 | <tr><th> Number </th><th> Feature ID </th><th> Feature Name </th><th> Feature Description </th></tr>
 14 | 
 15 | <tr><td> 1 </td><td> B011 </td><td> Embedded Ada<sup>1</sup> </td><td> - Subclause 20.3, "&lt;embedded SQL Ada program&gt;" </td></tr>
 16 | <tr><td> 2 </td><td> B012 </td><td> Embedded C<sup>1</sup> </td><td> - Subclause 20.4, "&lt;embedded SQL C program&gt;" </td></tr>
 17 | <tr><td> 3 </td><td> B013 </td><td> Embedded COBOL<sup>1</sup> </td><td> - Subclause 20.5, "&lt;embedded SQL COBOL program&gt;" </td></tr>
 18 | <tr><td> 4 </td><td> B014 </td><td> Embedded Fortran<sup>1</sup> </td><td> - Subclause 20.6, "&lt;embedded SQL Fortran program&gt;" </td></tr>
 19 | <tr><td> 5 </td><td> B015 </td><td> Embedded MUMPS<sup>1</sup> </td><td> - Subclause 20.7, "&lt;embedded SQL MUMPS program&gt;" </td></tr>
 20 | <tr><td> 6 </td><td> B016 </td><td> Embedded Pascal<sup>1</sup> </td><td> - Subclause 20.8, "&lt;embedded SQL Pascal program&gt;" </td></tr>
 21 | <tr><td> 7 </td><td> B017 </td><td> Embedded PL/I<sup>1</sup> </td><td> - Subclause 20.9, "&lt;embedded SQL PL/I program&gt;" </td></tr>
 22 | 
 23 | <tr><td colspan=4> <sup>1</sup> A conforming SQL-implementation is required (by Clause 8, "Conformance", in ISO/IEC 9075-1) to support at least one embedded language or to support the SQL-client module binding for at least one host language. </td></tr>
 24 | 
 25 | <tr><td> 8 </td><td> E011 </td><td> Numeric data types </td><td> - Subclause 6.1, "&lt;data type&gt;", &lt;numeric type&gt;, including numeric expressions, numeric literals, numeric comparisons, and numeric assignments </td></tr>
 26 | <tr><td> 9 </td><td> E011-01 </td><td> INTEGER and SMALLINT data types (including all spellings) </td><td> - Subclause 5.2, "&lt;token&gt; and &lt;separator&gt;": The &lt;reserved word&gt;s INT, INTEGER, and SMALLINT </td></tr>
 27 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 5.3, "&lt;literal&gt;": [&lt;sign&gt;] &lt;unsigned integer&gt; </td></tr>
 28 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.1, "&lt;data type&gt;": The INTEGER and SMALLINT &lt;exact numeric type&gt;s </td></tr>
 29 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 13.6, "Data type correspondences": Type correspondences for INTEGER and SMALLINT for all supported languages </td></tr>
 30 | <tr><td> 10 </td><td> E011-02 </td><td> REAL, DOUBLE PRECISON, and FLOAT data types </td><td> - Subclause 5.2, "&lt;token&gt; and &lt;separator&gt;": The &lt;reserved word&gt;s REAL, DOUBLE, FLOAT, and PRECISION </td></tr>
 31 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 5.3, "&lt;literal&gt;": [&lt;sign&gt;] &lt;approximate numeric literal&gt; </td></tr>
 32 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.1, "&lt;data type&gt;": &lt;approximate numeric type&gt; </td></tr>
 33 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 13.6, "Data type correspondences": Type correspondences for REAL, DOUBLE PRECISION, and FLOAT for all supported languages </td></tr>
 34 | 
 35 | <tr><td> 11 </td><td> E011-03 </td><td> DECIMAL and NUMERIC data types </td><td> - Subclause 5.2, "&lt;token&gt; and &lt;separator&gt;": The &lt;reserved word&gt;s DEC, DECIMAL, and NUMERIC </td></tr>
 36 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 5.3, "&lt;literal&gt;": [&lt;sign&gt;] &lt;exact numeric literal&gt; </td></tr>
 37 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.1, "&lt;data type&gt;": The DECIMAL and NUMERIC &lt;exact numeric type&gt;s </td></tr>
 38 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 13.6, "Data type correspondences": Type correspondences for DECIMAL and NUMERIC for all supported languages </td></tr>
 39 | <tr><td> 12 </td><td> E011-04 </td><td> Arithmetic operators </td><td> - Subclause 6.26, "&lt;numeric value expression&gt;": When the &lt;numeric primary&gt; is a &lt;value expression primary&gt; </td></tr>
 40 | <tr><td> 13 </td><td> E011-05 </td><td> Numeric comparison </td><td> - Subclause 8.2, "&lt;comparison predicate&gt;": For the numeric data types, without support for &lt;table subquery&gt; and without support for Feature F131, "Grouped operations" </td></tr>
 41 | <tr><td> 14 </td><td> E011-06 </td><td> Implicit casting among the numeric data types </td><td> - Subclause 8.2, "&lt;comparison predicate&gt;": Values of any of the numeric data types can be compared to each other; such values are compared with respect to their algebraic values </td></tr>
 42 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 9.1, "Retrieval assignment", and Subclause 9.2, "Store assignment": Values of one numeric type can be assigned to another numeric type, subject to rounding, truncation, and out of range conditions </td></tr>
 43 | <tr><td> 15 </td><td> E021 </td><td> Character data types </td><td> - Subclause 6.1, "&lt;data type&gt;": &lt;character string type&gt;, including character expressions, character literals, character comparisons, character assignments, and other operations on character data </td></tr>
 44 | <tr><td> 16 </td><td> E021-01 </td><td> CHARACTER data type (including all its spellings) </td><td> - Subclause 5.2, "&lt;token&gt; and &lt;separator&gt;": The &lt;reserved word&gt;s CHAR and CHARACTER </td></tr>
 45 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.1, "&lt;data type&gt;": The CHARACTER &lt;character string type&gt; </td></tr>
 46 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.28, "&lt;string value expression&gt;": For values of type CHARACTER </td></tr>
 47 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 13.6, "Data type correspondences": Type correspondences for CHARACTER for all supported languages </td></tr>
 48 | 
 49 | <tr><td> 17 </td><td> E021-02 </td><td> CHARACTER VARYING data type (including all its spellings) </td><td> - Subclause 5.2, "&lt;token&gt; and &lt;separator&gt;": The &lt;reserved word&gt;s VARCHAR and VARYING </td></tr>
 50 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.1, "&lt;data type&gt;": The CHARACTER VARYING &lt;character string type&gt; </td></tr>
 51 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.28, "&lt;string value expression&gt;": For values of type CHARACTER VARYING </td></tr>
 52 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 13.6, "Data type correspondences": Type correspondences for CHARACTER VARYING for all supported languages </td></tr>
 53 | <tr><td> 18 </td><td> E021-03 </td><td> Character literals </td><td> - Subclause 5.3, "&lt;literal&gt;": &lt;quote&gt; [ &lt;character representation&gt;... ] &lt;quote&gt; </td></tr>
 54 | <tr><td> 19 </td><td> E021-04 </td><td> CHARACTER_LENGTH function </td><td> - Subclause 6.27, "&lt;numeric value function&gt;": The &lt;char length expression&gt; </td></tr>
 55 | <tr><td> 20 </td><td> E021-05 </td><td> OCTET_LENGTH function </td><td> - Subclause 6.27, "&lt;numeric value function&gt;": The &lt;octet length expression&gt; </td></tr>
 56 | <tr><td> 21 </td><td> E021-06 </td><td> SUBSTRING function </td><td> - Subclause 6.29, "&lt;string value function&gt;": The &lt;character substring function&gt; </td></tr>
 57 | <tr><td> 22 </td><td> E021-07 </td><td> Character concatenation </td><td> - Subclause 6.28, "&lt;string value expression&gt;": The &lt;concatenation&gt; expression </td></tr>
 58 | <tr><td> 23 </td><td> E021-08 </td><td> UPPER and LOWER functions </td><td> - Subclause 6.29, "&lt;string value function&gt;": The &lt;fold&gt; function </td></tr>
 59 | <tr><td> 24 </td><td> E021-09 </td><td> TRIM function </td><td> - Subclause 6.29, "&lt;string value function&gt;": The &lt;trim function&gt; </td></tr>
 60 | <tr><td> 25 </td><td> E021-10 </td><td> Implicit casting among the character data types </td><td> - Subclause 8.2, "&lt;comparison predicate&gt;": Values of either the CHARACTER or CHARACTER VARYING data types can be compared to each other </td></tr>
 61 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 9.1, "Retrieval assignment", and Subclause 9.2, "Store assignment": Values of either the CHARACTER or CHARACTER VARYING data type can be assigned to the other type, subject to truncation conditions </td></tr>
 62 | <tr><td> 26 </td><td> E021-11 </td><td> POSITION function </td><td> - Subclause 6.27, "&lt;numeric value function&gt;": The &lt;position expression&gt; </td></tr>
 63 | <tr><td> 27 </td><td> E021-12 </td><td> Character comparison </td><td> - Subclause 8.2, "&lt;comparison predicate&gt;": For the CHARACTER and CHARACTER VARYING data types, without support for &lt;table subquery&gt; and without support for Feature F131, "Grouped operations" </td></tr>
 64 | <tr><td> 28 </td><td> E031 </td><td> Identifiers </td><td> - Subclause 5.2, "&lt;token&gt; and &lt;separator&gt;": &lt;regular identifier&gt; and &lt;delimited identifier&gt; </td></tr>
 65 | 
 66 | <tr><td> 29 </td><td> E031-01 </td><td> Delimited identifiers </td><td> - Subclause 5.2, "&lt;token&gt; and &lt;separator&gt;": &lt;delimited identifier&gt; </td></tr>
 67 | <tr><td> 30 </td><td> E031-02 </td><td> Lower case identifiers </td><td> - Subclause 5.2, "&lt;token&gt; and &lt;separator&gt;": An alphabetic character in a &lt;regular identifier&gt; can be either lower case or upper case (meaning that non-delimited identifiers need not comprise only upper case letters) </td></tr>
 68 | <tr><td> 31 </td><td> E031-03 </td><td> Trailing underscore </td><td> - Subclause 5.2, "&lt;token&gt; and &lt;separator&gt;": The list &lt;identifier part&gt; in a &lt;regular identifier&gt; can be an &lt;underscore&gt; </td></tr>
 69 | <tr><td> 32 </td><td> E051 </td><td> Basic query specification </td><td> - Subclause 7.12, "&lt;query specification&gt;": When &lt;table reference&gt; is a &lt;table or query name&gt; that is a &lt;table name&gt;, without the support of Feature F131, "Grouped operations" </td></tr>
 70 | <tr><td> 33 </td><td> E051-01 </td><td> SELECT DISTINCT </td><td> - Subclause 7.12, "&lt;query specification&gt;": With a &lt;set quantifier&gt; of DISTINCT, but without subfeatures E051-02 through E051-09 </td></tr>
 71 | <tr><td> 34 </td><td> E051-02 </td><td> GROUP BY clause </td><td> - Subclause 7.4, "&lt;table expression&gt;": &lt;group by clause&gt;, but without subfeatures E051-03 through E051-09 </td></tr>
 72 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 7.9, "&lt;group by clause&gt;": With the restrictions that the &lt;group by clause&gt; must contain all non-aggregated columns in the &lt;select list&gt; and that any column in the &lt;group by clause&gt; must also appear in the &lt;select list&gt; </td></tr>
 73 | <tr><td> 35 </td><td> E051-04 </td><td> GROUP BY can contain columns not in &lt;select list&gt; </td><td> - Subclause 7.9, "&lt;group by clause&gt;": Without the restriction that any column in the &lt;group by clause&gt; must also appear in the &lt;select list&gt; </td></tr>
 74 | <tr><td> 36 </td><td> E051-05 </td><td> Select list items can be renamed </td><td> - Subclause 7.12, "&lt;query specification&gt;": &lt;as clause&gt; </td></tr>
 75 | <tr><td> 37 </td><td> E051-06 </td><td> HAVING clause </td><td> - Subclause 7.4, "&lt;table expression&gt;": &lt;having clause&gt; </td></tr>
 76 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 7.10, "&lt;having clause&gt;" </td></tr>
 77 | <tr><td> 38 </td><td> E051-07 </td><td> Qualified * in select list </td><td> - Subclause 7.12, "&lt;query specification&gt;": &lt;qualified asterisk&gt; </td></tr>
 78 | <tr><td> 39 </td><td> E051-08 </td><td> Correlation names in the FROM clause </td><td> - Subclause 7.6, "&lt;table reference&gt;": [ AS ] &lt;correlation name&gt; </td></tr>
 79 | <tr><td> 40 </td><td> E051-09 </td><td> Rename columns in the FROM clause </td><td> - Subclause 7.6, "&lt;table reference&gt;": [ AS ] &lt;correlation name&gt; [ &lt;left paren&gt; &lt;derived column list&gt; &lt;right paren&gt; ] </td></tr>
 80 | <tr><td> 41 </td><td> E061 </td><td> Basic predicates and search conditions </td><td> - Subclause 8.19, "&lt;search condition&gt;", and Subclause 8.1, "&lt;predicate&gt;" </td></tr>
 81 | 
 82 | <tr><td> 42 </td><td> E061-01 </td><td> Comparison predicate </td><td> - Subclause 8.2, "&lt;comparison predicate&gt;": For supported data types, without support for &lt;table subquery&gt; </td></tr>
 83 | <tr><td> 43 </td><td> E061-02 </td><td> BETWEEN predicate </td><td> - Subclause 8.3, "&lt;between predicate&gt;" </td></tr>
 84 | <tr><td> 44 </td><td> E061-03 </td><td> IN predicate with list of values </td><td> - Subclause 8.4, "&lt;in predicate&gt;": Without support for &lt;table subquery&gt; </td></tr>
 85 | <tr><td> 45 </td><td> E061-04 </td><td> LIKE predicate </td><td> - Subclause 8.5, "&lt;like predicate&gt;": Without [ ESCAPE &lt;escape character&gt; ] </td></tr>
 86 | <tr><td> 46 </td><td> E061-05 </td><td> LIKE predicate: ESCAPE clause </td><td> - Subclause 8.5, "&lt;like predicate&gt;": With [ ESCAPE &lt;escape character&gt; ] </td></tr>
 87 | <tr><td> 47 </td><td> E061-06 </td><td> NULL predicate </td><td> - Subclause 8.7, "&lt;null predicate&gt;": Without Feature F481, "Expanded NULL predicate" </td></tr>
 88 | <tr><td> 48 </td><td> E061-07 </td><td> Quantified comparison predicate </td><td> - Subclause 8.8, "&lt;quantified comparison predicate&gt;": Without support for &lt;table subquery&gt; </td></tr>
 89 | <tr><td> 49 </td><td> E061-08 </td><td> EXISTS predicate </td><td> - Subclause 8.9, "&lt;exists predicate&gt;" </td></tr>
 90 | <tr><td> 50 </td><td> E061-09 </td><td> Subqueries in comparison predicate </td><td> - Subclause 8.2, "&lt;comparison predicate&gt;": For supported data types, with support for &lt;table subquery&gt; </td></tr>
 91 | <tr><td> 51 </td><td> E061-11 </td><td> Subqueries in IN predicate </td><td> - Subclause 8.4, "&lt;in predicate&gt;": With support for &lt;table subquery&gt; </td></tr>
 92 | <tr><td> 52 </td><td> E061-12 </td><td> Subqueries in quantified comparison predicate </td><td> - Subclause 8.8, "&lt;quantified comparison predicate&gt;": With support for &lt;table subquery&gt; </td></tr>
 93 | <tr><td> 53 </td><td> E061-13 </td><td> Correlated subqueries </td><td> - Subclause 8.1, "&lt;predicate&gt;": When a &lt;correlation name&gt; can be used in a &lt;table subquery&gt; as a correlated reference to a column in the outer query </td></tr>
 94 | <tr><td> 54 </td><td> E061-14 </td><td> Search condition </td><td> - Subclause 8.19, "&lt;search condition&gt;" </td></tr>
 95 | <tr><td> 55 </td><td> E071 </td><td> Basic query expressions </td><td> - Subclause 7.13, "&lt;query expression&gt;" </td></tr>
 96 | <tr><td> 56 </td><td> E071-01 </td><td> UNION DISTINCT table operator </td><td> - Subclause 7.13, "&lt;query expression&gt;": With support for UNION [ DISTINCT ] </td></tr>
 97 | <tr><td> 57 </td><td> E071-02 </td><td> UNION ALL table operator </td><td> - Subclause 7.13, "&lt;query expression&gt;": With support for UNION ALL </td></tr>
 98 | <tr><td> 58 </td><td> E071-03 </td><td> EXCEPT DISTINCT table operator </td><td> - Subclause 7.13, "&lt;query expression&gt;": With support for EXCEPT [ DISTINCT ] </td></tr>
 99 | <tr><td> 59 </td><td> E071-05 </td><td> Columns combined via table operators need not have exactly the same data type.  </td><td> - Subclause 7.13, "&lt;query expression&gt;": Columns combined via UNION and EXCEPT need not have exactly the same data type </td></tr>
100 | <tr><td> 60 </td><td> E071-06 </td><td> Table operators in subqueries </td><td> - Subclause 7.13, "&lt;query expression&gt;": &lt;table subquery&gt;s can specify UNION and EXCEPT </td></tr>
101 | <tr><td> 61 </td><td> E081 </td><td> Basic Privileges </td><td> - Subclause 12.3, "&lt;privileges&gt;" </td></tr>
102 | 
103 | <tr><td> 62 </td><td> E081-01 </td><td> SELECT privilege </td><td> - Subclause 12.3, "&lt;privileges&gt;": With &lt;action&gt; of SELECT </td></tr>
104 | <tr><td> 63 </td><td> E081-02 </td><td> DELETE privilege </td><td> - Subclause 12.3, "&lt;privileges&gt;": With &lt;action&gt; of DELETE </td></tr>
105 | <tr><td> 64 </td><td> E081-03 </td><td> INSERT privilege at the table level </td><td> - Subclause 12.3, "&lt;privileges&gt;": With &lt;action&gt; of INSERT without &lt;privilege column list&gt; </td></tr>
106 | <tr><td> 65 </td><td> E081-04 </td><td> UPDATE privilege at the table level </td><td> - Subclause 12.3, "&lt;privileges&gt;": With &lt;action&gt; of UPDATE without &lt;privilege column list&gt; </td></tr>
107 | <tr><td> 66 </td><td> E081-05 </td><td> UPDATE privilege at the column level </td><td> - Subclause 12.3, "&lt;privileges&gt;": With &lt;action&gt; of UPDATE &lt;left paren&gt; &lt;privilege column list&gt; &lt;right paren&gt; </td></tr>
108 | <tr><td> 67 </td><td> E081-06 </td><td> REFERENCES privilege at the table level </td><td> - Subclause 12.3, "&lt;privileges&gt;": with &lt;action&gt; of REFERENCES without &lt;privilege column list&gt; </td></tr>
109 | <tr><td> 68 </td><td> E081-07 </td><td> REFERENCES privilege at the column level </td><td> - Subclause 12.3, "&lt;privileges&gt;": With &lt;action&gt; of REFERENCES &lt;left paren&gt; &lt;privilege column list&gt; &lt;right paren&gt; </td></tr>
110 | <tr><td> 69 </td><td> E081-08 </td><td> WITH GRANT OPTION </td><td> - Subclause 12.2, "&lt;grant privilege statement&gt;": WITH GRANT OPTION </td></tr>
111 | <tr><td> 70 </td><td> E091 </td><td> Set functions </td><td> - Subclause 6.9, "&lt;set function specification&gt;" </td></tr>
112 | <tr><td> 71 </td><td> E091-01 </td><td> AVG </td><td> - Subclause 6.9, "&lt;set function specification&gt;": With &lt;computational operation&gt; of AVG </td></tr>
113 | <tr><td> 72 </td><td> E091-02 </td><td> COUNT </td><td> - Subclause 6.9, "&lt;set function specification&gt;": With &lt;computational operation&gt; of COUNT </td></tr>
114 | <tr><td> 73 </td><td> E091-03 </td><td> MAX </td><td> - Subclause 6.9, "&lt;set function specification&gt;": With &lt;computational operation&gt; of MAX </td></tr>
115 | <tr><td> 74 </td><td> E091-04 </td><td> MIN </td><td> - Subclause 6.9, "&lt;set function specification&gt;": With &lt;computational operation&gt; of MIN </td></tr>
116 | <tr><td> 75 </td><td> E091-05 </td><td> SUM </td><td> - Subclause 6.9, "&lt;set function specification&gt;": With &lt;computational operation&gt; of SUM </td></tr>
117 | <tr><td> 76 </td><td> E091-06 </td><td> ALL quantifier </td><td> - Subclause 6.9, "&lt;set function specification&gt;": With &lt;set quantifier&gt; of ALL </td></tr>
118 | <tr><td> 77 </td><td> E091-07 </td><td> DISTINCT quantifier </td><td> - Subclause 6.9, "&lt;set function specification&gt;": With &lt;set quantifier&gt; of DISTINCT </td></tr>
119 | <tr><td> 78 </td><td> E101 </td><td> Basic data manipulation </td><td> - Clause 14, "Data manipulation": &lt;insert statement&gt;, &lt;delete statement: searched&gt;, and &lt;update statement: searched&gt; </td></tr>
120 | 
121 | <tr><td> 79 </td><td> E101-01 </td><td> INSERT statement </td><td> - Subclause 14.8, "&lt;insert statement&gt;": When a &lt;contextually typed table value constructor&gt; can consist of no more than a single &lt;contextually typed row value expression&gt; </td></tr>
122 | <tr><td> 80 </td><td> E101-03 </td><td> Searched UPDATE statement </td><td> - Subclause 14.11, "&lt;update statement: searched&gt;": But without support either of Feature E153, "Updatable tables with subqueries", or Feature F221, "Explicit defaults" </td></tr>
123 | <tr><td> 81 </td><td> E101-04 </td><td> Searched DELETE statement </td><td> - Subclause 14.7, "&lt;delete statement: searched&gt;" </td></tr>
124 | <tr><td> 82 </td><td> E111 </td><td> Single row SELECT statement </td><td> - Subclause 14.5, "&lt;select statement: single row&gt;": Without support of Feature F131, "Grouped operations" </td></tr>
125 | <tr><td> 83 </td><td> E121 </td><td> Basic cursor support </td><td> - Clause 14, "Data manipulation": &lt;declare cursor&gt;, &lt;open statement&gt;, &lt;fetch statement&gt;, &lt;close statement&gt;, &lt;delete statement: positioned&gt;, and &lt;update statement: positioned&gt; </td></tr>
126 | <tr><td> 84 </td><td> E121-01 </td><td> DECLARE CURSOR </td><td> - Subclause 14.1, "&lt;declare cursor&gt;": When each &lt;value expression&gt; in the &lt;sort key&gt; must be a &lt;column reference&gt; and that &lt;column reference&gt; must also be in the &lt;select list&gt;, and &lt;cursor holdability&gt; is not specified </td></tr>
127 | <tr><td> 85 </td><td> E121-02 </td><td> ORDER BY columns need not be in select list </td><td> - Subclause 14.1, "&lt;declare cursor&gt;": Extend subfeature E121-01 so that &lt;column reference&gt; need not also be in the &lt;select list&gt; </td></tr>
128 | <tr><td> 86 </td><td> E121-03 </td><td> Value expressions in ORDER BY clause </td><td> - Subclause 14.1, "&lt;declare cursor&gt;": Extend subfeature E121-01 so that the &lt;value expression&gt; in the &lt;sort key&gt; need not be a &lt;column reference&gt; </td></tr>
129 | <tr><td> 87 </td><td> E121-04 </td><td> OPEN statement </td><td> - Subclause 14.2, "&lt;open statement&gt;" </td></tr>
130 | <tr><td> 88 </td><td> E121-06 </td><td> Positioned UPDATE statement </td><td> - Subclause 14.10, "&lt;update statement: positioned&gt;": Without support of either Feature E153, "Updateable tables with subqueries" or Feature F221, "Explicit defaults" </td></tr>
131 | <tr><td> 89 </td><td> E121-07 </td><td> Positioned DELETE statement </td><td> - Subclause 14.6, "&lt;delete statement: positioned&gt;" </td></tr>
132 | <tr><td> 90 </td><td> E121-08 </td><td> CLOSE statement </td><td> - Subclause 14.4, "&lt;close statement&gt;" </td></tr>
133 | <tr><td> 91 </td><td> E121-10 </td><td> FETCH statement: implicit NEXT </td><td> - Subclause 14.3, "&lt;fetch statement&gt;" </td></tr>
134 | <tr><td> 92 </td><td> E121-17 </td><td> WITH HOLD cursors </td><td> - Subclause 14.1, "&lt;declare cursor&gt;": Where the &lt;value expression&gt; in the &lt;sort key&gt; need not be a &lt;column reference&gt; and need not be in the &lt;select list&gt;, and &lt;cursor holdability&gt; may be specified </td></tr>
135 | 
136 | <tr><td> 93 </td><td> E131 </td><td> Null value support (nulls in lieu of values) </td><td> - Subclause 4.14, "Columns, fields, and attributes": Nullability characteristic </td></tr>
137 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.5, "&lt;contextually typed value specification&gt;": &lt;null specification&gt; </td></tr>
138 | <tr><td> 94 </td><td> E141 </td><td> Basic integrity constraints </td><td> - Subclause 11.6, "&lt;table constraint definition&gt;": As specified by the subfeatures of this feature in this table </td></tr>
139 | <tr><td> 95 </td><td> E141-01 </td><td> NOT NULL constraints </td><td> - Subclause 11.4, "&lt;column definition&gt;": With &lt;column constraint&gt; of NOT NULL </td></tr>
140 | <tr><td> 96 </td><td> E141-02 </td><td> UNIQUE constraints of NOT NULL columns </td><td> - Subclause 11.4, "&lt;column definition&gt;": With &lt;unique specification&gt; of UNIQUE for columns specified as NOT NULL </td></tr>
141 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 11.7, "&lt;unique constraint definition&gt;": With &lt;unique specification&gt; of UNIQUE </td></tr>
142 | <tr><td> 97 </td><td> E141-03 </td><td> PRIMARY KEY constraints </td><td> - Subclause 11.4, "&lt;column definition&gt;": With &lt;unique specification&gt; of PRIMARY KEY for columns specified as NOT NULL </td></tr>
143 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 11.7, "&lt;unique constraint definition&gt;": With &lt;unique specification&gt; of PRIMARY KEY </td></tr>
144 | <tr><td> 98 </td><td> E141-04 </td><td> Basic FOREIGN KEY constraint with the NO ACTION default for both referential delete action and referential update action.  </td><td> - Subclause 11.4, "&lt;column definition&gt;": With &lt;column constraint&gt; of &lt;references specification&gt; </td></tr>
145 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 11.8, "&lt;referential constraint definition&gt;": Where the columns in the &lt;column name list&gt;, if specified, must be in the same order as the names in the &lt;unique column list&gt; of the applicable &lt;unique constraint definition&gt; and the &lt;data type&gt;s of the matching columns must be the same </td></tr>
146 | <tr><td> 99 </td><td> E141-06 </td><td> CHECK constraints </td><td> - Subclause 11.4, "&lt;column definition&gt;": With &lt;column constraint&gt; of &lt;check constraint definition&gt; </td></tr>
147 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 11.9, "&lt;check constraint definition&gt;" </td></tr>
148 | <tr><td> 100 </td><td> E141-07 </td><td> Column defaults </td><td> - Subclause 11.4, "&lt;column definition&gt;": With &lt;default clause&gt; </td></tr>
149 | <tr><td> 101 </td><td> E141-08 </td><td> NOT NULL inferred on PRIMARY KEY </td><td> - Subclause 11.4, "&lt;column definition&gt;", and Subclause 11.7, "&lt;unique constraint definition&gt;": Remove the restriction in subfeatures E141-02 and E141-03 that NOT NULL be specified along with every PRIMARY KEY and UNIQUE constraint </td></tr>
150 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 11.4, "&lt;column definition&gt;": NOT NULL is implicit on PRIMARY KEY constraints </td></tr>
151 | 
152 | <tr><td> 102 </td><td> E141-10 </td><td> Names in a foreign key can be specified in any order </td><td> - Subclause 11.4, "&lt;column definition&gt;", and Subclause 11.8, "&lt;referential constraint definition&gt;": Extend subfeature E141-04 so that the columns in the &lt;column name list&gt;, if specified, need not be in the same order as the names in the &lt;unique column list&gt; of the applicable &lt;unique constraint definition&gt; </td></tr>
153 | <tr><td> 103 </td><td> E141-11 </td><td> Foreign key"s data types need not be the same as the primary key"s </td><td> - Subclause 11.4, "&lt;column definition&gt;", and Subclause 11.8, "&lt;referential constraint definition&gt;": Extend subfeature E141-04 so that the data types of matching columns need not be the same. </td></tr>
154 | <tr><td> 104 </td><td> E151 </td><td> Transaction support </td><td> - Clause 16, "Transaction management": &lt;commit statement&gt; and &lt;rollback statement&gt; </td></tr>
155 | <tr><td> 105 </td><td> E151-01 </td><td> COMMIT statement </td><td> - Subclause 16.6, "&lt;commit statement&gt;" </td></tr>
156 | <tr><td> 106 </td><td> E151-02 </td><td> ROLLBACK statement </td><td> - Subclause 16.7, "&lt;rollback statement&gt;" </td></tr>
157 | <tr><td> 107 </td><td> E152 </td><td> Basic SET TRANSACTION statement </td><td> - Subclause 16.2, "&lt;set transaction statement&gt;" </td></tr>
158 | <tr><td> 108 </td><td> E152-01 </td><td> SET TRANSACTION statement: ISOLATION LEVEL SERIALIZABLE clause </td><td> - Subclause 16.2, "&lt;set transaction statement&gt;": With &lt;transaction mode&gt; of ISOLATION LEVEL SERIALIZABLE clause </td></tr>
159 | <tr><td> 109 </td><td> E152-02 </td><td> SET TRANSACTION statement: READ ONLY and READ WRITE clauses </td><td> - Subclause 16.2, "&lt;set transaction statement&gt;": with &lt;transaction access mode&gt; of READ ONLY or READ WRITE </td></tr>
160 | <tr><td> 110 </td><td> E153 </td><td> Updatable queries with subqueries </td><td> - Subclause 7.13, "&lt;query expression&gt;": A &lt;query expression&gt; is updatable even though its &lt;where clause&gt; contains a &lt;subquery&gt; </td></tr>
161 | <tr><td> 111 </td><td> E161 </td><td> SQL comments using leading double minus </td><td> - Subclause 5.2, "&lt;token&gt; and &lt;separator&gt;": &lt;simple comment&gt; </td></tr>
162 | <tr><td> 112 </td><td> E171 </td><td> SQLSTATE support </td><td> - Subclause 23.1, "SQLSTATE" </td></tr>
163 | <tr><td> 113 </td><td> E182 </td><td> Module language </td><td> - Clause 13, "SQL-client modules" <br>(NOTE 450 - An SQL-implementation is required to supply at least one binding to a standard host language using either module language, embedded SQL, or both.) </td></tr>
164 | <tr><td> 114 </td><td> F031 </td><td> Basic schema manipulation </td><td> - Clause 11, "Schema definition and manipulation": Selected facilities as indicated by the subfeatures of this Feature </td></tr>
165 | <tr><td> 115 </td><td> F031-01 </td><td> CREATE TABLE statement to create persistent base tables </td><td> - Subclause 11.3, "&lt;table definition&gt;": Not in the context of a &lt;schema definition&gt; </td></tr>
166 | 
167 | <tr><td> 116 </td><td> F031-02 </td><td> CREATE VIEW statement </td><td> - Subclause 11.22, "&lt;view definition&gt;": Not in the context of a &lt;schema definition&gt;, and without support of Feature F081, "UNION and EXCEPT in views" </td></tr>
168 | <tr><td> 117 </td><td> F031-03 </td><td> GRANT statement </td><td> - Subclause 12.1, "&lt;grant statement&gt;": Not in the context of a &lt;schema definition&gt; </td></tr>
169 | <tr><td> 118 </td><td> F031-04 </td><td> ALTER TABLE statement: ADD COLUMN clause </td><td> - Subclause 11.10, "&lt;alter table statement&gt;": The &lt;add column definition&gt; clause </td></tr>
170 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 11.11, "&lt;add column definition&gt;" </td></tr>
171 | <tr><td> 119 </td><td> F031-13 </td><td> DROP TABLE statement: RESTRICT clause </td><td> - Subclause 11.21, "&lt;drop table statement&gt;": With a &lt;drop behavior&gt; of RESTRICT </td></tr>
172 | <tr><td> 120 </td><td> F031-16 </td><td> DROP VIEW statement: RESTRICT clause </td><td> - Subclause 11.23, "&lt;drop view statement&gt;": With a &lt;drop behavior&gt; of RESTRICT </td></tr>
173 | <tr><td> 121 </td><td> F031-19 </td><td> REVOKE statement: RESTRICT clause </td><td> - Subclause 12.7, "&lt;revoke statement&gt;": With a &lt;drop behavior&gt; of RESTRICT, only where the use of this statement can be restricted to the owner of the table being dropped </td></tr>
174 | <tr><td> 122 </td><td> F041 </td><td> Basic joined table </td><td> - Subclause 7.7, "&lt;joined table&gt;" </td></tr>
175 | <tr><td> 123 </td><td> F041-01 </td><td> Inner join (but not necessarily the INNER keyword) </td><td> - Subclause 7.6, "&lt;table reference&gt;": The &lt;joined table&gt; clause, but without support for subfeatures F041-02 through F041-08 </td></tr>
176 | <tr><td> 124 </td><td> F041-02 </td><td> INNER keyword </td><td> - Subclause 7.7, "&lt;joined table&gt;": &lt;join type&gt; of INNER </td></tr>
177 | <tr><td> 125 </td><td> F041-03 </td><td> LEFT OUTER JOIN </td><td> - Subclause 7.7, "&lt;joined table&gt;": &lt;outer join type&gt; of LEFT </td></tr>
178 | <tr><td> 126 </td><td> F041-04 </td><td> RIGHT OUTER JOIN </td><td> - Subclause 7.7, "&lt;joined table&gt;": &lt;outer join type&gt; of RIGHT </td></tr>
179 | <tr><td> 127 </td><td> F041-05 </td><td> Outer joins can be nested </td><td> - Subclause 7.7, "&lt;joined table&gt;": Subfeature F041-1 extended so that a &lt;table reference&gt; within the &lt;joined table&gt; can itself be a &lt;joined table&gt; </td></tr>
180 | <tr><td> 128 </td><td> F041-07 </td><td> The inner table in a left or right outer join can also be used in an inner join </td><td> - Subclause 7.7, "&lt;joined table&gt;": Subfeature F041-1 extended so that a &lt;table name&gt; within a nested &lt;joined table&gt; can be the same as a &lt;table name&gt; in an outer &lt;joined table&gt; </td></tr>
181 | <tr><td> 129 </td><td> F041-08 </td><td> All comparison operators are supported (rather than just =) </td><td> - Subclause 7.7, "&lt;joined table&gt;": Subfeature F041-1 extended so that the &lt;join condition&gt; is not limited to a &lt;comparison predicate&gt; with a &lt;comp op&gt; of &lt;equals operator&gt; </td></tr>
182 | <tr><td> 130 </td><td> F051 </td><td> Basic date and time </td><td> - Subclause 6.1, "&lt;data type&gt;": &lt;datetime type&gt; including datetime literals, datetime comparisons, and datetime conversions </td></tr>
183 | 
184 | <tr><td> 131 </td><td> F051-01 </td><td> DATE data type (including support of DATE literal) </td><td> - Subclause 5.3, "&lt;literal&gt;": The &lt;date literal&gt; form of &lt;datetime literal&gt; </td></tr>
185 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.1, "&lt;data type&gt;": The DATE &lt;datetime type&gt; </td></tr>
186 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.30, "&lt;datetime value expression&gt;": For values of type DATE 132 F051-02 TIME data type (including support of TIME literal) with fractional seconds precision of at least 0. </td></tr>
187 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 5.3, "&lt;literal&gt;": The &lt;time literal&gt; form of &lt;datetime literal&gt;, where the value of &lt;unquoted time string&gt; is simply &lt;time value&gt; that does not include the optional &lt;time zone interval&gt; </td></tr>
188 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.1, "&lt;data type&gt;": The TIME &lt;datetime type&gt; without the &lt;with or without timezone&gt; clause </td></tr>
189 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.30, "&lt;datetime value expression&gt;": For values of type TIME 133 F051-03 TIMESTAMP data type (including support of TIMESTAMP literal) with fractional seconds precision of at least 0 and 6. </td></tr>
190 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 5.3, "&lt;literal&gt;": The &lt;timestamp literal&gt; form of &lt;datetime literal&gt;, where the value of &lt;unquoted timestamp string&gt; is simply &lt;time value&gt; that does not include the optional &lt;time zone interval&gt; </td></tr>
191 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.1, "&lt;data type&gt;": The TIMESTAMP &lt;datetime type&gt; without the &lt;with or without timezone&gt; clause </td></tr>
192 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.30, "&lt;datetime value expression&gt;": For values of type TIMESTAMP </td></tr>
193 | <tr><td> 134 </td><td> F051-04 </td><td> Comparison predicate on DATE, TIME, and TIMESTAMP data types </td><td> - Subclause 8.2, "&lt;comparison predicate&gt;": For comparison between values of the following types: DATE and DATE, TIME and TIME, TIMESTAMP and TIMESTAMP, DATE and TIMESTAMP, and TIME and TIMESTAMP </td></tr>
194 | <tr><td> 135 </td><td> F051-05 </td><td> Explicit CAST between datetime types and character types </td><td> - Subclause 6.12, "&lt;cast specification&gt;": If support for Feature F201, "CAST function" is available, then CASTing between the following types: from character string to DATE, TIME, and TIMESTAMP; from DATE to DATE, TIMESTAMP, and character string; from TIME to TIME, TIMESTAMP, and character string; from TIMESTAMP to DATE, TIME, TIMESTAMP, and character string </td></tr>
195 | <tr><td> 136 </td><td> F051-06 </td><td> CURRENT_DATE </td><td> - Subclause 6.31, "&lt;datetime value function&gt;": The &lt;current date value function&gt; </td></tr>
196 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.30, "&lt;datetime value expression&gt;": When the value is a &lt;current date value function&gt; </td></tr>
197 | 
198 | <tr><td> 137 </td><td> F051-07 </td><td> LOCALTIME </td><td> - Subclause 6.31, "&lt;datetime value function&gt;": The &lt;current local time value function&gt; </td></tr>
199 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.30, "&lt;datetime value expression&gt;": When the value is a &lt;current local time value function&gt; </td></tr>
200 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 11.5, "&lt;default clause&gt;": LOCALTIME option of &lt;datetime value function&gt; </td></tr>
201 | <tr><td> 138 </td><td> F051-08 </td><td> LOCALTIMESTAMP </td><td> - Subclause 6.31, "&lt;datetime value function&gt;": The &lt;current local timestamp value function&gt; </td></tr>
202 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.30, "&lt;datetime value expression&gt;": When the value is a &lt;current local timestamp value function&gt; </td></tr>
203 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 11.5, "&lt;default clause&gt;": LOCALTIMESTAMP option of &lt;datetime value function&gt; </td></tr>
204 | <tr><td> 139 </td><td> F081 </td><td> UNION and EXCEPT in views </td><td> - Subclause 11.22, "&lt;view definition&gt;": A &lt;query expression&gt; in a &lt;view definition&gt; may specify UNION DISTINCT, UNION ALL, EXCEPT, and/or EXCEPT ALL </td></tr>
205 | <tr><td> 140 </td><td> F131 </td><td> Grouped operations </td><td> - A grouped view is a view whose &lt;query expression&gt; contains a &lt;group by clause&gt; </td></tr>
206 | <tr><td> 141 </td><td> F131-01 </td><td> WHERE, GROUP BY, and HAVING clauses supported in queries with grouped views </td><td> - Subclause 7.4, "&lt;table expression&gt;": Even though a table in the &lt;from clause&gt; is a grouped view, the &lt;where clause&gt;, &lt;group by clause&gt;, and &lt;having clause&gt; may be specified </td></tr>
207 | <tr><td> 142 </td><td> F131-02 </td><td> Multiple tables supported in queries with grouped views </td><td> - Subclause 7.5, "&lt;from clause&gt;": Even though a table in the &lt;from clause&gt; is a grouped view, the &lt;from clause&gt; may specify more than one &lt;table reference&gt; </td></tr>
208 | <tr><td> 143 </td><td> F131-03 </td><td> Set functions supported in queries with grouped views </td><td> - Subclause 7.12, "&lt;query specification&gt;": Even though a table in the &lt;from clause&gt; is a grouped view, the &lt;select list&gt; may specify a &lt;set function specification&gt; </td></tr>
209 | <tr><td> 144 </td><td> F131-04 </td><td> Subqueries with GROUP BY and HAVING clauses and grouped views </td><td> - Subclause 7.15, "&lt;subquery&gt;": A &lt;subquery&gt; in a &lt;comparison predicate&gt; is allowed to contain a &lt;group by clause&gt; and/or a &lt;having clause and/or it may identify a grouped view </td></tr>
210 | <tr><td> 145 </td><td> F131-05 </td><td> Single row SELECT with GROUP BY and HAVING clauses and grouped views </td><td> - Subclause 14.5, "&lt;select statement: single row&gt;": The table in a &lt;from clause&gt; can be a grouped view </td></tr>
211 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 14.5, "&lt;select statement: single row&gt;": The &lt;table expression&gt; may specify a &lt;group by clause and/or a &lt;having clause </td></tr>
212 | 
213 | <tr><td> 146 </td><td> F181 </td><td> Multiple module support <br>(NOTE 451 - The ability to associate multiple host compilation units with a single SQL-session at one time.) </td><td> - Subclause 13.1, "&lt;SQL-client module definition&gt;": An SQL-agent can be associated with more than one &lt;SQL-client module definition&gt; <br>(NOTE 452 - With this feature, it is possible to compile &lt;SQL-client module definition&gt;s or &lt;embedded SQL host program&gt;s separately and rely on the SQL-implementation to "link" the together properly at execution time. To ensure portability, applications should adhere to the following limitations: <br><bl><li> Avoid linking modules having cursors with the same &lt;cursor name&gt;. </li> <li> Avoid linking modules that prepare statements using the same &lt;SQL statement name&gt;. </li> <li> Avoid linking modules that allocate descriptors with the same &lt;descriptor name&gt;. </li> <li> Assume that the scope of an &lt;embedded exception declaration&gt; is a single compilation unit. </li> <li> Assume that an &lt;embedded variable name&gt; can be referenced only in the same compilation unit in which it is declared.) </li></bl> </td></tr>
214 | <tr><td> 147 </td><td> F201 </td><td> CAST function <br>(NOTE 453 - This means the support of CAST, where relevant, among all supported data types.) </td><td> - Subclause 6.12, "&lt;cast specification&gt;": For all supported data types </td></tr>
215 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.25, "&lt;value expression&gt;": &lt;cast specification&gt; </td></tr>
216 | <tr><td> 148 </td><td> F221 </td><td> Explicit defaults </td><td> - Subclause 6.5, "&lt;contextually typed value specification&gt;": &lt;default specification&gt; <br>(NOTE 454 - Including its use in UPDATE and INSERT statements.) </td></tr>
217 | <tr><td> 149 </td><td> F261 </td><td> CASE expression </td><td> - Subclause 6.25, "&lt;value expression&gt;": &lt;case expression&gt; </td></tr>
218 | <tr><td> 150 </td><td> F261-01 </td><td> Simple CASE </td><td> - Subclause 6.11, "&lt;case expression&gt;": The &lt;simple case&gt; variation </td></tr>
219 | <tr><td> 151 </td><td> F261-02 </td><td> Searched CASE </td><td> - Subclause 6.11, "&lt;case expression&gt;": The &lt;searched case variation&gt; </td></tr>
220 | <tr><td> 152 </td><td> F261-03 </td><td> NULLIF </td><td> - Subclause 6.11, "&lt;case expression&gt;": The NULLIF &lt;case abbreviation </td></tr>
221 | <tr><td> 153 </td><td> F261-04 </td><td> COALESCE </td><td> - Subclause 6.11, "&lt;case expression&gt;": The COALESCE &lt;case abbreviation </td></tr>
222 | <tr><td> 154 </td><td> F311 </td><td> Schema definition statement </td><td> - Subclause 11.1, "&lt;schema definition&gt;" </td></tr>
223 | 
224 | <tr><td> 155 </td><td> F311-01 </td><td> CREATE SCHEMA </td><td> - Subclause 11.1, "&lt;schema definition&gt;": Support for circular references in that &lt;referential constraint definition&gt;s in two different &lt;table definition&gt;s may reference columns in the other table </td></tr>
225 | <tr><td> 156 </td><td> F311-02 </td><td> CREATE TABLE for persistent base tables </td><td> - Subclause 11.1, "&lt;schema definition&gt;": A &lt;schema element&gt; that is a &lt;table definition&gt; </td></tr>
226 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 11.3, "&lt;table definition&gt;": In the context of a &lt;schema definition&gt; </td></tr>
227 | <tr><td> 157 </td><td> F311-03 </td><td> CREATE VIEW </td><td> - Subclause 11.1, "&lt;schema definition&gt;": A &lt;schema element&gt; that is a &lt;view definition&gt; </td></tr>
228 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 11.22, "&lt;view definition&gt;": In the context of a &lt;schema definition&gt; without the WITH CHECK OPTION clause and without support of Feature F081, "UNION and EXCEPT in views" </td></tr>
229 | <tr><td> 158 </td><td> F311-04 </td><td> CREATE VIEW: WITH CHECK OPTION </td><td> - Subclause 11.22, "&lt;view definition&gt;": The WITH CHECK OPTION clause, in the context of a &lt;schema definition&gt;, but without support of Feature F081, "UNION and EXCEPT in views" </td></tr>
230 | <tr><td> 159 </td><td> F311-05 </td><td> GRANT statement </td><td> - Subclause 11.1, "&lt;schema definition&gt;": A &lt;schema element&gt; that is a &lt;grant statement&gt; </td></tr>
231 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 12.1, "&lt;grant statement&gt;": In the context of a &lt;schema definition&gt; </td></tr>
232 | <tr><td> 160 </td><td> F471 </td><td> Scalar subquery values </td><td> - Subclause 6.25, "&lt;value expression&gt;": A &lt;value expression primary&gt; can be a &lt;scalar subquery&gt; </td></tr>
233 | <tr><td> 161 </td><td> F481 </td><td> Expanded NULL predicate </td><td> - Subclause 8.7, "&lt;null predicate&gt;": The &lt;row value expression&gt; can be something other than a &lt;column reference&gt; </td></tr>
234 | <tr><td> 162 </td><td> F812 </td><td> Basic flagging </td><td> - Part 1, Subclause 8.1.4, "SQL flagger": With "level of flagging" specified to be Core SQL Flagging and "extent of checking" specified to be Syntax Only <br>(NOTE 455 - This form of flagging identifies vendor extensions and other non-standard SQL by checking syntax only without requiring access to the catalog information.) </td></tr>
235 | <tr><td> 163 </td><td> S011 </td><td> Distinct data types </td><td> - Subclause 11.41, "&lt;user-defined type definition&gt;": When &lt;representation&gt; is &lt;predefined type&gt; </td></tr>
236 | 
237 | <tr><td> 164 </td><td> T321 </td><td> Basic SQL-invoked routines </td><td> - Subclause 11.50, "&lt;SQL-invoked routine&gt;" </td></tr>
238 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - If Feature T041, "Basic LOB data type support", is supported, then the &lt;locator indication&gt; clause must also be supported <br>(NOTE 456 - "Routine" is the collective term for functions, methods, and procedures.  This feature requires a conforming SQLimplementation to support both user-defined functions and user-defined procedures.  An SQL-implementation that conforms to Core SQL must support at least one language for writing routines; that language may be SQL. If the language is SQL, then the basic specification capability in Core SQL is the ability to specify a one-statement routine.  Support for overloaded functions and procedures is not part of Core SQL.) </td></tr>
239 | <tr><td> 165 </td><td> T321-01 </td><td> User-defined functions with no overloading </td><td> - Subclause 11.50, "&lt;SQL-invoked routine&gt;": With &lt;function specification&gt; </td></tr>
240 | <tr><td> 166 </td><td> T321-02 </td><td> User-defined stored procedures with no overloading </td><td> - Subclause 11.50, "&lt;SQL-invoked routine&gt;": With &lt;SQL-invoked procedure&gt; </td></tr>
241 | <tr><td> 167 </td><td> T321-03 </td><td> Function invocation </td><td> - Subclause 6.4, "&lt;value specification&gt; and &lt;target specification&gt;": With a &lt;value expression primary&gt; that is a &lt;routine invocation&gt; </td></tr>
242 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 10.4, "&lt;routine invocation&gt;": For user-defined functions </td></tr>
243 | <tr><td> 168 </td><td> T321-04 </td><td> CALL statement </td><td> - Subclause 10.4, "&lt;routine invocation&gt;": Used by &lt;call statement&gt;s </td></tr>
244 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 15.1, "&lt;call statement&gt;" </td></tr>
245 | <tr><td> 169 </td><td> T321-05 </td><td> RETURN statement </td><td> - Subclause 15.2, "&lt;return statement&gt;", if the SQL-implementation supports SQL routines </td></tr>
246 | 
247 | </table>
248 | 
249 | <hr>
250 | <p>
251 | Please send feedback to Jonathan Leffler:
252 | <a href="mailto:jonathan.leffler@gmail.com"> jonathan.leffler@gmail.com </a>.
253 | </p>
254 | 
255 | <p><font color=green><i><small>
256 | @(#)$Id: sql-2003-core-features.html,v 1.3 2017/11/13 20:45:42 jleffler Exp $
257 | </small></i></font></p>
258 | 
259 | </body>
260 | </html>
261 | 


--------------------------------------------------------------------------------
/sql-2003-noncore-features.html:
--------------------------------------------------------------------------------
  1 | <html>
  2 | <head>
  3 | <title>
  4 | SQL 2003 Feature Taxonomy for Features Outside Core SQL
  5 | </title>
  6 | </head>
  7 | <body>
  8 | <h1>
  9 | SQL 2003 (Annex F, Table 35) Feature Taxonomy for Features Outside Core SQL
 10 | </h1>
 11 | 
 12 | Derived from Final Committee Draft (FCD) of ISO/IEC 9075-2:2003.
 13 | <p>
 14 | 
 15 | <table border=1>
 16 | <tr><td> Number </td><td> Feature ID </td><td> Feature Name </td></tr>
 17 | 
 18 | <tr><td> 1 </td><td> B021 </td><td> Direct SQL </td></tr>
 19 | <tr><td> 2 </td><td> B031 </td><td> Basic dynamic SQL </td></tr>
 20 | <tr><td> 3 </td><td> B032 </td><td> Extended dynamic SQL </td></tr>
 21 | <tr><td> 4 </td><td> B032-01 </td><td> &lt;describe input&gt; statement </td></tr>
 22 | 
 23 | <tr><td> 5 </td><td> B033 </td><td> Untyped SQL-invoked function arguments </td></tr>
 24 | <tr><td> 6 </td><td> B034 </td><td> Dynamic specification of cursor attributes </td></tr>
 25 | <tr><td> 7 </td><td> B041 </td><td> Extensions to embedded SQL exception declarations </td></tr>
 26 | <tr><td> 8 </td><td> B051 </td><td> Enhanced execution rights </td></tr>
 27 | <tr><td> 9 </td><td> F032 </td><td> CASCADE drop behavior </td></tr>
 28 | <tr><td> 10 </td><td> F033 </td><td> ALTER TABLE statement: DROP COLUMN clause </td></tr>
 29 | <tr><td> 11 </td><td> F034 </td><td> Extended REVOKE statement </td></tr>
 30 | <tr><td> 12 </td><td> F034-01 </td><td> REVOKE statement performed by other than the owner of a schema object </td></tr>
 31 | <tr><td> 13 </td><td> F034-02 </td><td> REVOKE statement: GRANT OPTION FOR clause </td></tr>
 32 | <tr><td> 14 </td><td> F034-03 </td><td> REVOKE statement to revoke a privilege that the grantee has WITH GRANT OPTION </td></tr>
 33 | <tr><td> 15 </td><td> F052 </td><td> Intervals and datetime arithmetic </td></tr>
 34 | <tr><td> 16 </td><td> F053 </td><td> OVERLAPS predicate </td></tr>
 35 | <tr><td> 17 </td><td> F111 </td><td> Isolation levels other than SERIALIZABLE </td></tr>
 36 | <tr><td> 18 </td><td> F111-01 </td><td> READ UNCOMMITTED isolation level </td></tr>
 37 | <tr><td> 19 </td><td> F111-02 </td><td> READ COMMITTED isolation level </td></tr>
 38 | <tr><td> 20 </td><td> F111-03 </td><td> REPEATABLE READ isolation level </td></tr>
 39 | <tr><td> 21 </td><td> F121 </td><td> Basic diagnostics management </td></tr>
 40 | <tr><td> 22 </td><td> F121-01 </td><td> GET DIAGNOSTICS statement </td></tr>
 41 | <tr><td> 23 </td><td> F121-02 </td><td> SET TRANSACTION statement: DIAGNOSTICS SIZE clause </td></tr>
 42 | <tr><td> 24 </td><td> F171 </td><td> Multiple schemas per user </td></tr>
 43 | <tr><td> 25 </td><td> F191 </td><td> Referential delete actions </td></tr>
 44 | <tr><td> 26 </td><td> F222 </td><td> INSERT statement: DEFAULT VALUES clause </td></tr>
 45 | <tr><td> 27 </td><td> F231 </td><td> Privilege tables </td></tr>
 46 | <tr><td> 28 </td><td> F231-01 </td><td> TABLE_PRIVILEGES view </td></tr>
 47 | <tr><td> 29 </td><td> F231-02 </td><td> COLUMN_PRIVILEGES view </td></tr>
 48 | <tr><td> 30 </td><td> F231-03 </td><td> USAGE_PRIVILEGES view </td></tr>
 49 | <tr><td> 31 </td><td> F251 </td><td> Domain support </td></tr>
 50 | <tr><td> 32 </td><td> F262 </td><td> Extended CASE expression </td></tr>
 51 | <tr><td> 33 </td><td> F271 </td><td> Compound character literals </td></tr>
 52 | <tr><td> 34 </td><td> F281 </td><td> LIKE enhancements </td></tr>
 53 | <tr><td> 35 </td><td> F291 </td><td> UNIQUE predicate </td></tr>
 54 | <tr><td> 36 </td><td> F301 </td><td> CORRESPONDING in query expressions </td></tr>
 55 | 
 56 | <tr><td> 37 </td><td> F302 </td><td> INTERSECT table operator </td></tr>
 57 | <tr><td> 38 </td><td> F302-01 </td><td> INTERSECT DISTINCT table operator </td></tr>
 58 | <tr><td> 39 </td><td> F302-02 </td><td> INTERSECT ALL table operator </td></tr>
 59 | <tr><td> 40 </td><td> F304 </td><td> EXCEPT ALL table operator </td></tr>
 60 | <tr><td> 41 </td><td> F312 </td><td> MERGE statement </td></tr>
 61 | <tr><td> 42 </td><td> F321 </td><td> User authorization </td></tr>
 62 | <tr><td> 43 </td><td> F341 </td><td> Usage tables </td></tr>
 63 | <tr><td> 44 </td><td> F361 </td><td> Subprogram support </td></tr>
 64 | <tr><td> 45 </td><td> F381 </td><td> Extended schema manipulation </td></tr>
 65 | <tr><td> 46 </td><td> F381-01 </td><td> ALTER TABLE statement: ALTER COLUMN clause </td></tr>
 66 | <tr><td> 47 </td><td> F381-02 </td><td> ALTER TABLE statement: ADD CONSTRAINT clause </td></tr>
 67 | <tr><td> 48 </td><td> F381-03 </td><td> ALTER TABLE statement: DROP CONSTRAINT clause </td></tr>
 68 | <tr><td> 49 </td><td> F391 </td><td> Long identifiers </td></tr>
 69 | <tr><td> 50 </td><td> F392 </td><td> Unicode escapes in identifiers </td></tr>
 70 | <tr><td> 51 </td><td> F393 </td><td> Unicode escapes in literals </td></tr>
 71 | <tr><td> 52 </td><td> F401 </td><td> Extended joined table </td></tr>
 72 | <tr><td> 53 </td><td> F401-01 </td><td> NATURAL JOIN </td></tr>
 73 | <tr><td> 54 </td><td> F401-02 </td><td> FULL OUTER JOIN </td></tr>
 74 | <tr><td> 55 </td><td> F401-03 </td><td> UNION JOIN </td></tr>
 75 | <tr><td> 56 </td><td> F401-04 </td><td> CROSS JOIN </td></tr>
 76 | <tr><td> 57 </td><td> F402 </td><td> Named column joins for LOBs, arrays, and multisets </td></tr>
 77 | <tr><td> 58 </td><td> F411 </td><td> Time zone specification </td></tr>
 78 | <tr><td> 59 </td><td> F421 </td><td> National character </td></tr>
 79 | <tr><td> 60 </td><td> F431 </td><td> Read-only scrollable cursors </td></tr>
 80 | <tr><td> 61 </td><td> F431-01 </td><td> FETCH with explicit NEXT </td></tr>
 81 | <tr><td> 62 </td><td> F431-02 </td><td> FETCH FIRST </td></tr>
 82 | <tr><td> 63 </td><td> F431-03 </td><td> FETCH LAST </td></tr>
 83 | <tr><td> 64 </td><td> F431-04 </td><td> FETCH PRIOR </td></tr>
 84 | <tr><td> 65 </td><td> F431-05 </td><td> FETCH ABSOLUTE </td></tr>
 85 | <tr><td> 66 </td><td> F431-06 </td><td> FETCH RELATIVE </td></tr>
 86 | <tr><td> 67 </td><td> F441 </td><td> Extended set function support </td></tr>
 87 | <tr><td> 68 </td><td> F442 </td><td> Mixed column references in set functions </td></tr>
 88 | <tr><td> 69 </td><td> F451 </td><td> Character set definition </td></tr>
 89 | 
 90 | <tr><td> 70 </td><td> F461 </td><td> Named character sets </td></tr>
 91 | <tr><td> 71 </td><td> F491 </td><td> Constraint management </td></tr>
 92 | <tr><td> 72 </td><td> F502 </td><td> Enhanced documentation tables </td></tr>
 93 | <tr><td> 73 </td><td> F502-01 </td><td> SQL_SIZING_PROFILES view </td></tr>
 94 | <tr><td> 74 </td><td> F502-02 </td><td> SQL_IMPLEMENTATION_INFO view </td></tr>
 95 | <tr><td> 75 </td><td> F502-03 </td><td> SQL_PACKAGES view </td></tr>
 96 | <tr><td> 76 </td><td> F521 </td><td> Assertions </td></tr>
 97 | <tr><td> 77 </td><td> F531 </td><td> Temporary tables </td></tr>
 98 | <tr><td> 78 </td><td> F555 </td><td> Enhanced seconds precision </td></tr>
 99 | <tr><td> 79 </td><td> F561 </td><td> Full value expressions </td></tr>
100 | <tr><td> 80 </td><td> F571 </td><td> Truth value tests </td></tr>
101 | <tr><td> 81 </td><td> F591 </td><td> Derived tables </td></tr>
102 | <tr><td> 82 </td><td> F611 </td><td> Indicator data types </td></tr>
103 | <tr><td> 83 </td><td> F641 </td><td> Row and table constructors </td></tr>
104 | <tr><td> 84 </td><td> F651 </td><td> Catalog name qualifiers </td></tr>
105 | <tr><td> 85 </td><td> F661 </td><td> Simple tables </td></tr>
106 | <tr><td> 86 </td><td> F671 </td><td> Subqueries in CHECK </td></tr>
107 | <tr><td> 87 </td><td> F672 </td><td> Retrospective check constraints </td></tr>
108 | <tr><td> 88 </td><td> F691 </td><td> Collation and translation </td></tr>
109 | <tr><td> 89 </td><td> F692 </td><td> Enhanced collation support </td></tr>
110 | <tr><td> 90 </td><td> F693 </td><td> SQL-session and client module collations </td></tr>
111 | <tr><td> 91 </td><td> F701 </td><td> Referential update actions </td></tr>
112 | <tr><td> 92 </td><td> F711 </td><td> ALTER domain </td></tr>
113 | <tr><td> 93 </td><td> F721 </td><td> Deferrable constraints </td></tr>
114 | <tr><td> 94 </td><td> F731 </td><td> INSERT column privileges </td></tr>
115 | <tr><td> 95 </td><td> F741 </td><td> Referential MATCH types </td></tr>
116 | <tr><td> 96 </td><td> F751 </td><td> View CHECK enhancements </td></tr>
117 | <tr><td> 97 </td><td> F761 </td><td> Session management </td></tr>
118 | <tr><td> 98 </td><td> F771 </td><td> Connection management </td></tr>
119 | <tr><td> 99 </td><td> F781 </td><td> Self-referencing operations </td></tr>
120 | <tr><td> 100 </td><td> F791 </td><td> Insensitive cursors </td></tr>
121 | <tr><td> 101 </td><td> F801 </td><td> Full set function </td></tr>
122 | 
123 | 
124 | <tr><td> 102 </td><td> F813 </td><td> Extended flagging - Part 1, Subclause 8.1.4, "SQL flagger": With 'level of flagging' specified to be Core SQL Flagging and 'extent of checking' specified to be Catalog Lookup </td></tr>
125 | <tr><td> 103 </td><td> F821 </td><td> Local table references </td></tr>
126 | <tr><td> 104 </td><td> F831 </td><td> Full cursor update </td></tr>
127 | <tr><td> 105 </td><td> F831-01 </td><td> Updateable scrollable cursors </td></tr>
128 | <tr><td> 106 </td><td> F831-02 </td><td> Updateable ordered cursors </td></tr>
129 | <tr><td> 107 </td><td> S023 </td><td> Basic structured types </td></tr>
130 | <tr><td> 108 </td><td> S024 </td><td> Enhanced structured types </td></tr>
131 | <tr><td> 109 </td><td> S025 </td><td> Final structured types </td></tr>
132 | <tr><td> 110 </td><td> S026 </td><td> Self-referencing structured types </td></tr>
133 | <tr><td> 111 </td><td> S027 </td><td> Create method by specific method name </td></tr>
134 | <tr><td> 112 </td><td> S028 </td><td> Permutable UDT options list </td></tr>
135 | <tr><td> 113 </td><td> S041 </td><td> Basic reference types </td></tr>
136 | <tr><td> 114 </td><td> S043 </td><td> Enhanced reference types </td></tr>
137 | <tr><td> 115 </td><td> S051 </td><td> Create table of type </td></tr>
138 | <tr><td> 116 </td><td> S071 </td><td> SQL paths in function and type name resolution </td></tr>
139 | <tr><td> 117 </td><td> S081 </td><td> Subtables </td></tr>
140 | <tr><td> 118 </td><td> S091 </td><td> Basic array support </td></tr>
141 | <tr><td> 119 </td><td> S091-01 </td><td> Arrays of built-in data types </td></tr>
142 | <tr><td> 120 </td><td> S091-02 </td><td> Arrays of distinct types </td></tr>
143 | <tr><td> 121 </td><td> S091-03 </td><td> Array expressions </td></tr>
144 | <tr><td> 122 </td><td> S092 </td><td> Arrays of user-defined types </td></tr>
145 | <tr><td> 123 </td><td> S094 </td><td> Arrays of reference types </td></tr>
146 | <tr><td> 124 </td><td> S095 </td><td> Array constructors by query </td></tr>
147 | <tr><td> 125 </td><td> S096 </td><td> Optional array bounds </td></tr>
148 | <tr><td> 126 </td><td> S097 </td><td> Array element assignment </td></tr>
149 | <tr><td> 127 </td><td> S111 </td><td> ONLY in query expressions </td></tr>
150 | <tr><td> 128 </td><td> S151 </td><td> Type predicate </td></tr>
151 | <tr><td> 129 </td><td> S161 </td><td> Subtype treatment </td></tr>
152 | <tr><td> 130 </td><td> S162 </td><td> Subtype treatment for references </td></tr>
153 | <tr><td> 131 </td><td> S201 </td><td> SQL-invoked routines on arrays </td></tr>
154 | <tr><td> 132 </td><td> S201-01 </td><td> Array parameters </td></tr>
155 | <tr><td> 133 </td><td> S201-02 </td><td> Array as result type of functions </td></tr>
156 | 
157 | <tr><td> 134 </td><td> S202 </td><td> SQL-invoked routines on multisets </td></tr>
158 | <tr><td> 135 </td><td> S211 </td><td> User-defined cast functions </td></tr>
159 | <tr><td> 136 </td><td> S231 </td><td> Structured type locators </td></tr>
160 | <tr><td> 137 </td><td> S232 </td><td> Array locators </td></tr>
161 | <tr><td> 138 </td><td> S233 </td><td> Multiset locators </td></tr>
162 | <tr><td> 139 </td><td> S241 </td><td> Transform functions </td></tr>
163 | <tr><td> 140 </td><td> S242 </td><td> Alter transform statement </td></tr>
164 | <tr><td> 141 </td><td> S251 </td><td> User-defined orderings </td></tr>
165 | <tr><td> 142 </td><td> S261 </td><td> Specific type method </td></tr>
166 | <tr><td> 143 </td><td> S271 </td><td> Basic multiset support </td></tr>
167 | <tr><td> 144 </td><td> S272 </td><td> Multisets of user-defined types </td></tr>
168 | <tr><td> 145 </td><td> S274 </td><td> Multisets of reference types </td></tr>
169 | <tr><td> 146 </td><td> S275 </td><td> Advanced multiset support </td></tr>
170 | <tr><td> 147 </td><td> S281 </td><td> Nested collection types </td></tr>
171 | <tr><td> 148 </td><td> T011 </td><td> Timestamp in Information Schema </td></tr>
172 | <tr><td> 149 </td><td> T031 </td><td> BOOLEAN data type </td></tr>
173 | <tr><td> 150 </td><td> T041 </td><td> Basic LOB data type support </td></tr>
174 | <tr><td> 151 </td><td> T041-01 </td><td> BLOB data type </td></tr>
175 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 5.2, "&lt;token&gt; and &lt;separator&gt;": The &lt;reserved word&gt;s BINARY, BLOB, LARGE, and OBJECT </td></tr>
176 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 5.3, "&lt;literal&gt;": &lt;binary string literal&gt; </td></tr>
177 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.1, "&lt;data type&gt;": The BINARY LARGE OBJECT data type </td></tr>
178 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.28, "&lt;string value expression&gt;": For values of type BINARY LARGE OBJECT </td></tr>
179 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 13.6, "Data type correspondences": Type correspondences for BINARY LARGE OBJECT for all supported languages </td></tr>
180 | <tr><td> 152 </td><td> T041-02 </td><td> CLOB data type </td></tr>
181 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 5.2, "&lt;token&gt; and &lt;separator&gt;": The &lt;reserved word&gt;s CHARACTER, CLOB, LARGE, and OBJECT </td></tr>
182 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.1, "&lt;data type&gt;": The CHARACTER LARGE OBJECT data type </td></tr>
183 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.28, "&lt;string value expression&gt;": For values of type CHARACTER LARGE OBJECT </td></tr>
184 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 13.6, "Data type correspondences": Type correspondences for CHARACTER LARGE OBJECT for all supported languages </td></tr>
185 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - The automatic casting among the character types supported by subfeature E021-11 is extended to support the CHARACTER LARGE OBJECT type </td></tr>
186 | 
187 | <tr><td> 153 </td><td> T041-03 </td><td> POSITION, LENGTH, LOWER, TRIM, UPPER, and SUBSTRING functions for LOB data types </td></tr>
188 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.27, "&lt;numeric value function&gt;": The &lt;position expression&gt; for expressions of type BINARY LARGE OBJECT and CHARACTER LARGE OBJECT </td></tr>
189 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.27, "&lt;numeric value function&gt;": The &lt;char length function&gt; for expressions of type CHARACTER LARGE OBJECT </td></tr>
190 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.27, "&lt;numeric value function&gt;": The &lt;octet length function&gt; for expressions of type BINARY LARGE OBJECT and CHARACTER LARGE OBJECT </td></tr>
191 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.29, "&lt;string value function&gt;": The &lt;fold&gt; function for expressions of type CHARACTER LARGE OBJECT </td></tr>
192 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.29, "&lt;string value function&gt;": The &lt;trim function&gt; for expressions of type CHARACTER LARGE OBJECT </td></tr>
193 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.29, "&lt;string value function&gt;": The &lt;blob trim function&gt; </td></tr>
194 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.29, "&lt;string value function&gt;": The &lt;character substring function&gt; for expressions of type CHARACTER LARGE OBJECT </td></tr>
195 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.29, "&lt;string value function&gt;": The &lt;blob substring function&gt; </td></tr>
196 | <tr><td> 154 </td><td> T041-04 </td><td> Concatenation of LOB data types </td></tr>
197 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.28, "&lt;string value expression&gt;": The &lt;concatenation&gt; expression for expressions of type CHARACTER LARGE OBJECT </td></tr>
198 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 6.28, "&lt;string value expression&gt;": The &lt;blob concatenation&gt; expression </td></tr>
199 | <tr><td> 155 </td><td> T041-05 </td><td> LOB locator: non-holdable </td></tr>
200 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 13.3, "&lt;externally-invoked procedure&gt;": &lt;locator indication&gt; </td></tr>
201 | <tr><td> &nbsp; </td><td> &nbsp; </td><td> - Subclause 14.14, "&lt;free locator statement&gt;" </td></tr>
202 | <tr><td> 156 </td><td> T042 </td><td> Extended LOB data type support </td></tr>
203 | <tr><td> 157 </td><td> T051 </td><td> Row types </td></tr>
204 | <tr><td> 158 </td><td> T052 </td><td> MAX and MIN for row types </td></tr>
205 | <tr><td> 159 </td><td> T053 </td><td> Explicit aliases for &lt;all fields reference&gt; </td></tr>
206 | <tr><td> 160 </td><td> T061 </td><td> UCS support </td></tr>
207 | <tr><td> 161 </td><td> T071 </td><td> BIGINT data type </td></tr>
208 | <tr><td> 162 </td><td> T111 </td><td> Updatable joins, unions, and columns </td></tr>
209 | <tr><td> 163 </td><td> T121 </td><td> WITH (excluding RECURSIVE) in query expression </td></tr>
210 | <tr><td> 164 </td><td> T131 </td><td> Recursive query </td></tr>
211 | <tr><td> 165 </td><td> T141 </td><td> SIMILAR predicate </td></tr>
212 | <tr><td> 166 </td><td> T151 </td><td> DISTINCT predicate </td></tr>
213 | <tr><td> 167 </td><td> T171 </td><td> LIKE clause in table definition </td></tr>
214 | <tr><td> 168 </td><td> T172 </td><td> AS subquery clause in table definition </td></tr>
215 | <tr><td> 169 </td><td> T173 </td><td> Extended LIKE clause in table definition </td></tr>
216 | <tr><td> 170 </td><td> T174 </td><td> Identity columns </td></tr>
217 | <tr><td> 171 </td><td> T175 </td><td> Generated columns </td></tr>
218 | <tr><td> 172 </td><td> T176 </td><td> Sequence generator support </td></tr>
219 | 
220 | <tr><td> 173 </td><td> T191 </td><td> Referential action RESTRICT </td></tr>
221 | <tr><td> 174 </td><td> T201 </td><td> Comparable data types for referential constraints </td></tr>
222 | <tr><td> 175 </td><td> T211 </td><td> Basic trigger capability </td></tr>
223 | <tr><td> 176 </td><td> T211-01 </td><td> Triggers activated on UPDATE, INSERT, or DELETE of one base table. </td></tr>
224 | <tr><td> 177 </td><td> T211-02 </td><td> BEFORE triggers </td></tr>
225 | <tr><td> 178 </td><td> T211-03 </td><td> AFTER triggers </td></tr>
226 | <tr><td> 179 </td><td> T211-04 </td><td> FOR EACH ROW triggers </td></tr>
227 | <tr><td> 180 </td><td> T211-05 </td><td> Ability to specify a search condition that must be True before the trigger is invoked. </td></tr>
228 | <tr><td> 181 </td><td> T211-06 </td><td> Support for run-time rules for the interaction of triggers and constraints. </td></tr>
229 | <tr><td> 182 </td><td> T211-07 </td><td> TRIGGER privilege </td></tr>
230 | <tr><td> 183 </td><td> T211-08 </td><td> Multiple triggers for the same the event are executed in the order in which they were created in the catalog. </td></tr>
231 | <tr><td> 184 </td><td> T212 </td><td> Enhanced trigger capability </td></tr>
232 | <tr><td> 185 </td><td> T231 </td><td> Sensitive cursors </td></tr>
233 | <tr><td> 186 </td><td> T241 </td><td> START TRANSACTION statement </td></tr>
234 | <tr><td> 187 </td><td> T242 </td><td> Optional transaction modes in START TRANSACTION </td></tr>
235 | <tr><td> 188 </td><td> T251 </td><td> SET TRANSACTION statement: LOCAL option </td></tr>
236 | <tr><td> 189 </td><td> T261 </td><td> Chained transactions </td></tr>
237 | <tr><td> 190 </td><td> T271 </td><td> Savepoints </td></tr>
238 | <tr><td> 191 </td><td> T272 </td><td> Enhanced savepoint management </td></tr>
239 | <tr><td> 192 </td><td> T281 </td><td> SELECT privilege with column granularity </td></tr>
240 | <tr><td> 193 </td><td> T301 </td><td> Functional dependencies </td></tr>
241 | <tr><td> 194 </td><td> T312 </td><td> OVERLAY function </td></tr>
242 | <tr><td> 195 </td><td> T322 </td><td> Overloading of SQL-invoked functions and procedures </td></tr>
243 | <tr><td> 196 </td><td> T323 </td><td> Explicit security for external routines </td></tr>
244 | <tr><td> 197 </td><td> T324 </td><td> Explicit security for SQL routines </td></tr>
245 | <tr><td> 198 </td><td> T325 </td><td> Qualified SQL parameter references </td></tr>
246 | <tr><td> 199 </td><td> T326 </td><td> Table functions </td></tr>
247 | <tr><td> 200 </td><td> T331 </td><td> Basic roles </td></tr>
248 | <tr><td> 201 </td><td> T332 </td><td> Extended roles </td></tr>
249 | <tr><td> 202 </td><td> T351 </td><td> Bracketed SQL comments (/*...*/ comments) </td></tr>
250 | <tr><td> 203 </td><td> T431 </td><td> Extended grouping capabilities </td></tr>
251 | <tr><td> 204 </td><td> T432 </td><td> Nested and concatenated GROUPING SETS </td></tr>
252 | 
253 | <tr><td> 205 </td><td> T433 </td><td> Multiargument GROUPING function </td></tr>
254 | <tr><td> 206 </td><td> T434 </td><td> GROUP BY DISTINCT </td></tr>
255 | <tr><td> 207 </td><td> T441 </td><td> ABS and MOD functions </td></tr>
256 | <tr><td> 208 </td><td> T461 </td><td> Symmetric &lt;between predicate&gt; </td></tr>
257 | <tr><td> 209 </td><td> T471 </td><td> Result sets return value </td></tr>
258 | <tr><td> 210 </td><td> T491 </td><td> LATERAL derived table </td></tr>
259 | <tr><td> 211 </td><td> T501 </td><td> Enhanced EXISTS predicate </td></tr>
260 | <tr><td> 212 </td><td> T511 </td><td> Transaction counts </td></tr>
261 | 
262 | <tr><td> 213 </td><td> T551 </td><td> Optional key words for default syntax </td></tr>
263 | <tr><td> 214 </td><td> T561 </td><td> Holdable locators </td></tr>
264 | <tr><td> 215 </td><td> T571 </td><td> Array-returning external SQL-invoked functions </td></tr>
265 | <tr><td> 216 </td><td> T572 </td><td> Multiset-returning external SQL-invoked functions </td></tr>
266 | <tr><td> 217 </td><td> T581 </td><td> Regular expression substring function </td></tr>
267 | <tr><td> 218 </td><td> T591 </td><td> UNIQUE constraints of possibly null columns </td></tr>
268 | <tr><td> 219 </td><td> T601 </td><td> Local cursor references </td></tr>
269 | <tr><td> 220 </td><td> T611 </td><td> Elementary OLAP operations </td></tr>
270 | <tr><td> 221 </td><td> T612 </td><td> Advanced OLAP operations </td></tr>
271 | <tr><td> 222 </td><td> T613 </td><td> Sampling </td></tr>
272 | <tr><td> 223 </td><td> T621 </td><td> Enhanced numeric functions </td></tr>
273 | <tr><td> 224 </td><td> T631 </td><td> IN predicate with one list element </td></tr>
274 | <tr><td> 225 </td><td> T641 </td><td> Multiple column assignment </td></tr>
275 | <tr><td> 226 </td><td> T651 </td><td> SQL-schema statements in SQL routines </td></tr>
276 | <tr><td> 227 </td><td> T652 </td><td> SQL-dynamic statements in SQL routines </td></tr>
277 | 
278 | </table>
279 | 
280 | <hr>
281 | <p>
282 | Please send feedback to Jonathan Leffler:
283 | <a href="mailto:jonathan.leffler@gmail.com"> jonathan.leffler@gmail.com </a>.
284 | </p>
285 | 
286 | <p><font color=green><i><small>
287 | @(#)$Id: sql-2003-noncore-features.html,v 1.3 2017/11/13 20:45:42 jleffler Exp $
288 | </small></i></font></p>
289 | 
290 | </body>
291 | </html>
292 | 


--------------------------------------------------------------------------------
/sql-2016.ebnf.readme:
--------------------------------------------------------------------------------
 1 | How to use sql-2016.ebnf
 2 | ========================
 3 | This file was created manually by Domingo Alvarez Duarte.
 4 | Many thanx Domingo!
 5 | 
 6 | The corresponding XHTML + SVG file is sql-2016-railroad-diagrams.xhtml.
 7 | 
 8 | The grammar can be rendered as railroad diagrams on the website https://www.bottlecaps.de/rr/ui.
 9 | Just copy it into the Edit Grammar tab, and click View Diagram. Then wait a good minute or so :-).
10 | 
11 | Domingo notes:
12 | 1. Non reserved words appear with * as suffix, ex: SUBSTRING -> SUBSTRING*.
13 | 
14 | 2. You can navigate through the railroad's grammar by clicking the rectangular boxes.
15 | 
16 | Finally, you can download the rendered railroad as either XHTML + SVG or HTML + PNG.
17 | 


--------------------------------------------------------------------------------
/sql-bnf.mk:
--------------------------------------------------------------------------------
 1 | # @(#)$Id: sql-bnf.mk,v 1.18 2017/01/21 16:29:04 jleffler Exp $
 2 | #
 3 | # Makefile for SQL-92, SQL-99 and SQL-2003 BNF and HTML files
 4 | 
 5 | .NO_PENDING_GET:
 6 | 
 7 | WEBCODE.tgz  = webcode-1.09.tgz
 8 | FILE1.bnf    = sql-92.bnf
 9 | FILE2.bnf    = sql-99.bnf
10 | FILE3.bnf    = sql-2003-1.bnf
11 | FILE4.bnf    = sql-2003-2.bnf
12 | FILES.bnf    = ${FILE1.bnf} ${FILE2.bnf} ${FILE3.bnf} ${FILE4.bnf}
13 | FILES.html   = ${FILES.bnf:bnf=bnf.html}
14 | FILE1.aux    = index.html
15 | FILE2.aux    = outer-joins.html
16 | FILE3.aux    = sql-2003-core-features.html
17 | FILE4.aux    = sql-2003-noncore-features.html
18 | FILES.aux    = ${FILE1.aux} ${FILE2.aux} ${FILE3.aux} ${FILE4.aux}
19 | FILE1.pl     = bnf2html.pl
20 | FILE2.pl     = bnf2yacc.pl
21 | FILES.pl     = ${FILE1.pl} ${FILE2.pl}
22 | FILE1.txt    = bnf2html.perl.txt
23 | FILE2.txt    = bnf2yacc.perl.txt
24 | FILES.txt    = ${FILE1.txt} ${FILE2.txt}
25 | FILES.mk     = sql-bnf.mk
26 | FILES.all    = ${FILES.bnf} ${FILES.html} ${FILES.mk} ${FILES.pl} ${FILES.txt} \
27 |                ${FILES.aux} ${WEBCODE.tgz}
28 | 
29 | # Dummy datestamp - just in case.
30 | VERNUM       = 00000000
31 | VRSNFILE.tgz = sql-bnf-${VERNUM}.tgz
32 | DISTFILE.tgz = sql-bnf.tgz
33 | RCSFILES.tgz = sql-bnf-rcs-${VERNUM}.tgz
34 | 
35 | APACHE_HOME  = /opt/apache/webserver
36 | APACHE_HTML  = htdocs/SQL
37 | APACHE_DIR   = ${APACHE_HOME}/${APACHE_HTML}
38 | 
39 | TAR          = tar
40 | TARFLAGS     = -cvzf
41 | COPY         = cp
42 | COPYFLAGS    = -fp
43 | PERL         = perl
44 | RM_F         = rm -f
45 | CHMOD        = chmod
46 | WEBPERMS     = 444
47 | LN           = ln
48 | MKPATH       = mkdir -p
49 | 
50 | all:
51 | 	${MAKE} VERNUM=`date +'%Y''%m''%d'` all-vrsn
52 | 
53 | all-vrsn: ${VRSNFILE.tgz} ${RCSFILES.tgz}
54 | 
55 | ${VRSNFILE.tgz}:  ${FILES.all}
56 | 	${TAR} ${TARFLAGS} ${VRSNFILE.tgz} ${FILES.all}
57 | 	${RM_F} ${DISTFILE.tgz}
58 | 	${LN} ${VRSNFILE.tgz} ${DISTFILE.tgz}
59 | 
60 | ${RCSFILES.tgz}: RCS
61 | 	${TAR} ${TARFLAGS} ${RCSFILES.tgz} RCS
62 | 
63 | ${FILES.html}: $${@:.html=} ${FILE1.pl}
64 | 	${RM_F} $@
65 | 	${PERL} ${FILE1.pl} ${@:.html=} > $@
66 | 
67 | ${FILES.txt}:	$${@:.perl.txt=.pl}
68 | 	${RM_F} $@
69 | 	${LN} $? $@
70 | 
71 | install: all
72 | 	${MKPATH} ${APACHE_DIR}
73 | 	${COPY} ${COPYFLAGS} ${DISTFILE.tgz} ${FILES.all} ${WEBCODE.tgz} ${APACHE_DIR}
74 | 	cd ${APACHE_DIR}; ${CHMOD} ${WEBPERMS} ${DISTFILE.tgz} ${WEBCODE.tgz} ${FILES.all}
75 | 


--------------------------------------------------------------------------------
/webcode-1.09.tgz:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/ronsavage/SQL/19215adc8639d031a44acad7873c209444b71f1f/webcode-1.09.tgz


--------------------------------------------------------------------------------