mirror of
https://github.com/python/cpython.git
synced 2026-04-25 21:30:45 +00:00
svn+ssh://pythondev@svn.python.org/python/branches/p3yk
................
r55636 | neal.norwitz | 2007-05-29 00:06:39 -0700 (Tue, 29 May 2007) | 149 lines
Merged revisions 55506-55635 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r55507 | georg.brandl | 2007-05-22 07:28:17 -0700 (Tue, 22 May 2007) | 2 lines
Remove the "panel" module doc file which has been ignored since 1994.
........
r55522 | mark.hammond | 2007-05-22 19:04:28 -0700 (Tue, 22 May 2007) | 4 lines
Remove definition of PY_UNICODE_TYPE from pyconfig.h, allowing the
definition in unicodeobject.h to be used, giving us the desired
wchar_t in place of 'unsigned short'. As discussed on python-dev.
........
r55525 | neal.norwitz | 2007-05-22 23:35:32 -0700 (Tue, 22 May 2007) | 6 lines
Add -3 option to the interpreter to warn about features that are
deprecated and will be changed/removed in Python 3.0.
This patch is mostly from Anthony. I tweaked some format and added
a little doc.
........
r55527 | neal.norwitz | 2007-05-22 23:57:35 -0700 (Tue, 22 May 2007) | 1 line
Whitespace cleanup
........
r55528 | neal.norwitz | 2007-05-22 23:58:36 -0700 (Tue, 22 May 2007) | 1 line
Add a bunch more deprecation warnings for builtins that are going away in 3.0
........
r55549 | georg.brandl | 2007-05-24 09:49:29 -0700 (Thu, 24 May 2007) | 2 lines
shlex.split() now has an optional "posix" parameter.
........
r55550 | georg.brandl | 2007-05-24 10:33:33 -0700 (Thu, 24 May 2007) | 2 lines
Fix parameter passing.
........
r55555 | facundo.batista | 2007-05-24 10:50:54 -0700 (Thu, 24 May 2007) | 6 lines
Added an optional timeout parameter to urllib.ftpwrapper, with tests
(for this and a basic one, because there weren't any). Changed also
NEWS, but didn't find documentation for this function, assumed it
wasn't public...
........
r55563 | facundo.batista | 2007-05-24 13:01:59 -0700 (Thu, 24 May 2007) | 4 lines
Removed the .recv() in the test, is not necessary, and was
causing problems that didn't have anything to do with was
actually being tested...
........
r55564 | facundo.batista | 2007-05-24 13:51:19 -0700 (Thu, 24 May 2007) | 5 lines
Let's see if reading exactly what is written allow this live
test to pass (now I know why there were so few tests in ftp,
http, etc, :( ).
........
r55567 | facundo.batista | 2007-05-24 20:10:28 -0700 (Thu, 24 May 2007) | 4 lines
Trying to make the tests work in Windows and Solaris, everywhere
else just works
........
r55568 | facundo.batista | 2007-05-24 20:47:19 -0700 (Thu, 24 May 2007) | 4 lines
Fixing stupid error, and introducing a sleep, to see if the
other thread is awakened and finish sending data.
........
r55569 | facundo.batista | 2007-05-24 21:20:22 -0700 (Thu, 24 May 2007) | 4 lines
Commenting out the tests until find out who can test them in
one of the problematic enviroments.
........
r55570 | neal.norwitz | 2007-05-24 22:13:40 -0700 (Thu, 24 May 2007) | 2 lines
Get test passing again by commenting out the reference to the test class.
........
r55575 | vinay.sajip | 2007-05-25 00:05:59 -0700 (Fri, 25 May 2007) | 1 line
Updated docstring for SysLogHandler (#1720726).
........
r55576 | vinay.sajip | 2007-05-25 00:06:55 -0700 (Fri, 25 May 2007) | 1 line
Updated documentation for SysLogHandler (#1720726).
........
r55592 | brett.cannon | 2007-05-25 13:17:15 -0700 (Fri, 25 May 2007) | 3 lines
Remove direct call's to file's constructor and replace them with calls to
open() as ths is considered best practice.
........
r55601 | kristjan.jonsson | 2007-05-26 12:19:50 -0700 (Sat, 26 May 2007) | 1 line
Remove the rgbimgmodule from PCBuild8
........
r55602 | kristjan.jonsson | 2007-05-26 12:31:39 -0700 (Sat, 26 May 2007) | 1 line
Include <windows.h> after python.h, so that WINNT is properly set before windows.h is included. Fixes warnings in PC builds.
........
r55603 | walter.doerwald | 2007-05-26 14:04:13 -0700 (Sat, 26 May 2007) | 2 lines
Fix typo.
........
r55604 | peter.astrand | 2007-05-26 15:18:20 -0700 (Sat, 26 May 2007) | 1 line
Applied patch 1669481, slightly modified: Support close_fds on Win32
........
r55606 | neal.norwitz | 2007-05-26 21:08:54 -0700 (Sat, 26 May 2007) | 2 lines
Add the new function object attribute names from py3k.
........
r55617 | lars.gustaebel | 2007-05-27 12:49:30 -0700 (Sun, 27 May 2007) | 20 lines
Added errors argument to TarFile class that allows the user to
specify an error handling scheme for character conversion. Additional
scheme "utf-8" in read mode. Unicode input filenames are now
supported by design. The values of the pax_headers dictionary are now
limited to unicode objects.
Fixed: The prefix field is no longer used in PAX_FORMAT (in
conformance with POSIX).
Fixed: In read mode use a possible pax header size field.
Fixed: Strip trailing slashes from pax header name values.
Fixed: Give values in user-specified pax_headers precedence when
writing.
Added unicode tests. Added pax/regtype4 member to testtar.tar all
possible number fields in a pax header.
Added two chapters to the documentation about the different formats
tarfile.py supports and how unicode issues are handled.
........
r55618 | raymond.hettinger | 2007-05-27 22:23:22 -0700 (Sun, 27 May 2007) | 1 line
Explain when groupby() issues a new group.
........
r55634 | martin.v.loewis | 2007-05-28 21:01:29 -0700 (Mon, 28 May 2007) | 2 lines
Test pre-commit hook for a link to a .py file.
........
r55635 | martin.v.loewis | 2007-05-28 21:02:03 -0700 (Mon, 28 May 2007) | 2 lines
Revert 55634.
........
................
r55639 | neal.norwitz | 2007-05-29 00:58:11 -0700 (Tue, 29 May 2007) | 1 line
Remove sys.exc_{type,exc_value,exc_traceback}
................
r55641 | neal.norwitz | 2007-05-29 01:03:50 -0700 (Tue, 29 May 2007) | 1 line
Missed one sys.exc_type. I wonder why exc_{value,traceback} were already gone
................
r55642 | neal.norwitz | 2007-05-29 01:08:33 -0700 (Tue, 29 May 2007) | 1 line
Missed more doc for sys.exc_* attrs.
................
r55643 | neal.norwitz | 2007-05-29 01:18:19 -0700 (Tue, 29 May 2007) | 1 line
Remove sys.exc_clear()
................
r55665 | guido.van.rossum | 2007-05-29 19:45:43 -0700 (Tue, 29 May 2007) | 4 lines
Make None, True, False keywords.
We can now also delete all the other places that explicitly forbid
assignment to None, but I'm not going to bother right now.
................
r55666 | guido.van.rossum | 2007-05-29 20:01:51 -0700 (Tue, 29 May 2007) | 3 lines
Found another place that needs check for forbidden names.
Fixed test_syntax.py accordingly (it helped me find that one).
................
r55668 | guido.van.rossum | 2007-05-29 20:41:48 -0700 (Tue, 29 May 2007) | 2 lines
Mark None, True, False as keywords.
................
r55673 | neal.norwitz | 2007-05-29 23:28:25 -0700 (Tue, 29 May 2007) | 3 lines
Get the dis module working on modules again after changing dicts
to not return lists and also new-style classes. Add a test.
................
r55674 | neal.norwitz | 2007-05-29 23:35:45 -0700 (Tue, 29 May 2007) | 1 line
Umm, it helps to add the module that the test uses
................
r55675 | neal.norwitz | 2007-05-29 23:53:05 -0700 (Tue, 29 May 2007) | 4 lines
Try to fix up all the other places that were assigning to True/False.
There's at least one more problem in test.test_xmlrpc. I have other
changes in that file and that should be fixed soon (I hope).
................
r55679 | neal.norwitz | 2007-05-30 00:31:55 -0700 (Wed, 30 May 2007) | 1 line
Fix up another place that was assigning to True/False.
................
r55688 | brett.cannon | 2007-05-30 14:19:47 -0700 (Wed, 30 May 2007) | 2 lines
Ditch MimeWriter.
................
r55692 | brett.cannon | 2007-05-30 14:52:00 -0700 (Wed, 30 May 2007) | 2 lines
Remove the mimify module.
................
r55707 | guido.van.rossum | 2007-05-31 05:08:45 -0700 (Thu, 31 May 2007) | 2 lines
Backport the addition of show_code() to dis.py -- it's too handy.
................
r55708 | guido.van.rossum | 2007-05-31 06:22:57 -0700 (Thu, 31 May 2007) | 7 lines
Fix a fairly long-standing bug in the check for assignment to None (and other
keywords, these days). In 2.5, you could write foo(None=1) without getting
a SyntaxError (although foo()'s definition would have to use **kwds to avoid
getting a runtime error complaining about an unknown keyword of course).
This ought to be backported to 2.5.2 or at least 2.6.
................
r55724 | brett.cannon | 2007-05-31 19:32:41 -0700 (Thu, 31 May 2007) | 2 lines
Remove the cfmfile.
................
r55727 | neal.norwitz | 2007-05-31 22:19:44 -0700 (Thu, 31 May 2007) | 1 line
Remove reload() builtin.
................
r55729 | neal.norwitz | 2007-05-31 22:51:30 -0700 (Thu, 31 May 2007) | 59 lines
Merged revisions 55636-55728 via svnmerge from
svn+ssh://pythondev@svn.python.org/python/trunk
........
r55637 | georg.brandl | 2007-05-29 00:16:47 -0700 (Tue, 29 May 2007) | 2 lines
Fix rst markup.
........
r55638 | neal.norwitz | 2007-05-29 00:51:39 -0700 (Tue, 29 May 2007) | 1 line
Fix typo in doc
........
r55671 | neal.norwitz | 2007-05-29 21:53:41 -0700 (Tue, 29 May 2007) | 1 line
Fix indentation (whitespace only).
........
r55676 | thomas.heller | 2007-05-29 23:58:30 -0700 (Tue, 29 May 2007) | 1 line
Fix compiler warnings.
........
r55677 | thomas.heller | 2007-05-30 00:01:25 -0700 (Wed, 30 May 2007) | 2 lines
Correct the name of a field in the WIN32_FIND_DATAA and WIN32_FIND_DATAW structures.
Closes bug #1726026.
........
r55686 | brett.cannon | 2007-05-30 13:46:26 -0700 (Wed, 30 May 2007) | 2 lines
Have MimeWriter raise a DeprecationWarning as per PEP 4 and its documentation.
........
r55690 | brett.cannon | 2007-05-30 14:48:58 -0700 (Wed, 30 May 2007) | 3 lines
Have mimify raise a DeprecationWarning. The docs and PEP 4 have listed the
module as deprecated for a while.
........
r55696 | brett.cannon | 2007-05-30 15:24:28 -0700 (Wed, 30 May 2007) | 2 lines
Have md5 raise a DeprecationWarning as per PEP 4.
........
r55705 | neal.norwitz | 2007-05-30 21:14:22 -0700 (Wed, 30 May 2007) | 1 line
Add some spaces in the example code.
........
r55716 | brett.cannon | 2007-05-31 12:20:00 -0700 (Thu, 31 May 2007) | 2 lines
Have the sha module raise a DeprecationWarning as specified in PEP 4.
........
r55719 | brett.cannon | 2007-05-31 12:40:42 -0700 (Thu, 31 May 2007) | 2 lines
Cause buildtools to raise a DeprecationWarning.
........
r55721 | brett.cannon | 2007-05-31 13:01:11 -0700 (Thu, 31 May 2007) | 2 lines
Have cfmfile raise a DeprecationWarning as per PEP 4.
........
r55726 | neal.norwitz | 2007-05-31 21:56:47 -0700 (Thu, 31 May 2007) | 1 line
Mail if there is an installation failure.
........
................
r55730 | neal.norwitz | 2007-05-31 23:22:07 -0700 (Thu, 31 May 2007) | 2 lines
Remove the code that was missed in rev 55303.
................
r55738 | neal.norwitz | 2007-06-01 19:10:43 -0700 (Fri, 01 Jun 2007) | 1 line
Fix doc breakage
................
r55741 | neal.norwitz | 2007-06-02 00:41:58 -0700 (Sat, 02 Jun 2007) | 1 line
Remove timing module (plus some remnants of other modules).
................
r55742 | neal.norwitz | 2007-06-02 00:51:44 -0700 (Sat, 02 Jun 2007) | 1 line
Remove posixfile module (plus some remnants of other modules).
................
r55744 | neal.norwitz | 2007-06-02 10:18:56 -0700 (Sat, 02 Jun 2007) | 1 line
Fix doc breakage.
................
r55745 | neal.norwitz | 2007-06-02 11:32:16 -0700 (Sat, 02 Jun 2007) | 1 line
Make a whatsnew 3.0 template.
................
r55754 | neal.norwitz | 2007-06-03 23:24:18 -0700 (Sun, 03 Jun 2007) | 1 line
SF #1730441, os._execvpe raises UnboundLocal due to new try/except semantics
................
r55755 | neal.norwitz | 2007-06-03 23:26:00 -0700 (Sun, 03 Jun 2007) | 1 line
Get rid of extra whitespace
................
r55794 | guido.van.rossum | 2007-06-06 15:29:22 -0700 (Wed, 06 Jun 2007) | 3 lines
Make this compile in GCC 2.96, which does not allow interspersing
declarations and code.
................
276 lines
11 KiB
TeX
276 lines
11 KiB
TeX
\section{\module{shlex} ---
|
|
Simple lexical analysis}
|
|
|
|
\declaremodule{standard}{shlex}
|
|
\modulesynopsis{Simple lexical analysis for \UNIX\ shell-like languages.}
|
|
\moduleauthor{Eric S. Raymond}{esr@snark.thyrsus.com}
|
|
\moduleauthor{Gustavo Niemeyer}{niemeyer@conectiva.com}
|
|
\sectionauthor{Eric S. Raymond}{esr@snark.thyrsus.com}
|
|
\sectionauthor{Gustavo Niemeyer}{niemeyer@conectiva.com}
|
|
|
|
\versionadded{1.5.2}
|
|
|
|
The \class{shlex} class makes it easy to write lexical analyzers for
|
|
simple syntaxes resembling that of the \UNIX{} shell. This will often
|
|
be useful for writing minilanguages, (for example, in run control
|
|
files for Python applications) or for parsing quoted strings.
|
|
|
|
\note{The \module{shlex} module currently does not support Unicode input.}
|
|
|
|
The \module{shlex} module defines the following functions:
|
|
|
|
\begin{funcdesc}{split}{s\optional{, comments\optional{, posix}}}
|
|
Split the string \var{s} using shell-like syntax. If \var{comments} is
|
|
\constant{False} (the default), the parsing of comments in the given
|
|
string will be disabled (setting the \member{commenters} member of the
|
|
\class{shlex} instance to the empty string). This function operates
|
|
in \POSIX{} mode by default, but uses non-\POSIX{} mode if the
|
|
\var{posix} argument is false.
|
|
\versionadded{2.3}
|
|
\versionchanged[Added the \var{posix} parameter]{2.6}
|
|
\end{funcdesc}
|
|
|
|
The \module{shlex} module defines the following class:
|
|
|
|
\begin{classdesc}{shlex}{\optional{instream\optional{,
|
|
infile\optional{, posix}}}}
|
|
A \class{shlex} instance or subclass instance is a lexical analyzer
|
|
object. The initialization argument, if present, specifies where to
|
|
read characters from. It must be a file-/stream-like object with
|
|
\method{read()} and \method{readline()} methods, or a string (strings
|
|
are accepted since Python 2.3). If no argument is given, input will be
|
|
taken from \code{sys.stdin}. The second optional argument is a filename
|
|
string, which sets the initial value of the \member{infile} member. If
|
|
the \var{instream} argument is omitted or equal to \code{sys.stdin},
|
|
this second argument defaults to ``stdin''. The \var{posix} argument
|
|
was introduced in Python 2.3, and defines the operational mode. When
|
|
\var{posix} is not true (default), the \class{shlex} instance will
|
|
operate in compatibility mode. When operating in \POSIX{} mode,
|
|
\class{shlex} will try to be as close as possible to the \POSIX{} shell
|
|
parsing rules. See section~\ref{shlex-objects}.
|
|
\end{classdesc}
|
|
|
|
\begin{seealso}
|
|
\seemodule{ConfigParser}{Parser for configuration files similar to the
|
|
Windows \file{.ini} files.}
|
|
\end{seealso}
|
|
|
|
|
|
\subsection{shlex Objects \label{shlex-objects}}
|
|
|
|
A \class{shlex} instance has the following methods:
|
|
|
|
\begin{methoddesc}[shlex]{get_token}{}
|
|
Return a token. If tokens have been stacked using
|
|
\method{push_token()}, pop a token off the stack. Otherwise, read one
|
|
from the input stream. If reading encounters an immediate
|
|
end-of-file, \member{self.eof} is returned (the empty string (\code{''})
|
|
in non-\POSIX{} mode, and \code{None} in \POSIX{} mode).
|
|
\end{methoddesc}
|
|
|
|
\begin{methoddesc}[shlex]{push_token}{str}
|
|
Push the argument onto the token stack.
|
|
\end{methoddesc}
|
|
|
|
\begin{methoddesc}[shlex]{read_token}{}
|
|
Read a raw token. Ignore the pushback stack, and do not interpret source
|
|
requests. (This is not ordinarily a useful entry point, and is
|
|
documented here only for the sake of completeness.)
|
|
\end{methoddesc}
|
|
|
|
\begin{methoddesc}[shlex]{sourcehook}{filename}
|
|
When \class{shlex} detects a source request (see
|
|
\member{source} below) this method is given the following token as
|
|
argument, and expected to return a tuple consisting of a filename and
|
|
an open file-like object.
|
|
|
|
Normally, this method first strips any quotes off the argument. If
|
|
the result is an absolute pathname, or there was no previous source
|
|
request in effect, or the previous source was a stream
|
|
(such as \code{sys.stdin}), the result is left alone. Otherwise, if the
|
|
result is a relative pathname, the directory part of the name of the
|
|
file immediately before it on the source inclusion stack is prepended
|
|
(this behavior is like the way the C preprocessor handles
|
|
\code{\#include "file.h"}).
|
|
|
|
The result of the manipulations is treated as a filename, and returned
|
|
as the first component of the tuple, with
|
|
\function{open()} called on it to yield the second component. (Note:
|
|
this is the reverse of the order of arguments in instance initialization!)
|
|
|
|
This hook is exposed so that you can use it to implement directory
|
|
search paths, addition of file extensions, and other namespace hacks.
|
|
There is no corresponding `close' hook, but a shlex instance will call
|
|
the \method{close()} method of the sourced input stream when it
|
|
returns \EOF.
|
|
|
|
For more explicit control of source stacking, use the
|
|
\method{push_source()} and \method{pop_source()} methods.
|
|
\end{methoddesc}
|
|
|
|
\begin{methoddesc}[shlex]{push_source}{stream\optional{, filename}}
|
|
Push an input source stream onto the input stack. If the filename
|
|
argument is specified it will later be available for use in error
|
|
messages. This is the same method used internally by the
|
|
\method{sourcehook} method.
|
|
\versionadded{2.1}
|
|
\end{methoddesc}
|
|
|
|
\begin{methoddesc}[shlex]{pop_source}{}
|
|
Pop the last-pushed input source from the input stack.
|
|
This is the same method used internally when the lexer reaches
|
|
\EOF{} on a stacked input stream.
|
|
\versionadded{2.1}
|
|
\end{methoddesc}
|
|
|
|
\begin{methoddesc}[shlex]{error_leader}{\optional{file\optional{, line}}}
|
|
This method generates an error message leader in the format of a
|
|
\UNIX{} C compiler error label; the format is \code{'"\%s", line \%d: '},
|
|
where the \samp{\%s} is replaced with the name of the current source
|
|
file and the \samp{\%d} with the current input line number (the
|
|
optional arguments can be used to override these).
|
|
|
|
This convenience is provided to encourage \module{shlex} users to
|
|
generate error messages in the standard, parseable format understood
|
|
by Emacs and other \UNIX{} tools.
|
|
\end{methoddesc}
|
|
|
|
Instances of \class{shlex} subclasses have some public instance
|
|
variables which either control lexical analysis or can be used for
|
|
debugging:
|
|
|
|
\begin{memberdesc}[shlex]{commenters}
|
|
The string of characters that are recognized as comment beginners.
|
|
All characters from the comment beginner to end of line are ignored.
|
|
Includes just \character{\#} by default.
|
|
\end{memberdesc}
|
|
|
|
\begin{memberdesc}[shlex]{wordchars}
|
|
The string of characters that will accumulate into multi-character
|
|
tokens. By default, includes all \ASCII{} alphanumerics and
|
|
underscore.
|
|
\end{memberdesc}
|
|
|
|
\begin{memberdesc}[shlex]{whitespace}
|
|
Characters that will be considered whitespace and skipped. Whitespace
|
|
bounds tokens. By default, includes space, tab, linefeed and
|
|
carriage-return.
|
|
\end{memberdesc}
|
|
|
|
\begin{memberdesc}[shlex]{escape}
|
|
Characters that will be considered as escape. This will be only used
|
|
in \POSIX{} mode, and includes just \character{\textbackslash} by default.
|
|
\versionadded{2.3}
|
|
\end{memberdesc}
|
|
|
|
\begin{memberdesc}[shlex]{quotes}
|
|
Characters that will be considered string quotes. The token
|
|
accumulates until the same quote is encountered again (thus, different
|
|
quote types protect each other as in the shell.) By default, includes
|
|
\ASCII{} single and double quotes.
|
|
\end{memberdesc}
|
|
|
|
\begin{memberdesc}[shlex]{escapedquotes}
|
|
Characters in \member{quotes} that will interpret escape characters
|
|
defined in \member{escape}. This is only used in \POSIX{} mode, and
|
|
includes just \character{"} by default.
|
|
\versionadded{2.3}
|
|
\end{memberdesc}
|
|
|
|
\begin{memberdesc}[shlex]{whitespace_split}
|
|
If \code{True}, tokens will only be split in whitespaces. This is useful, for
|
|
example, for parsing command lines with \class{shlex}, getting tokens
|
|
in a similar way to shell arguments.
|
|
\versionadded{2.3}
|
|
\end{memberdesc}
|
|
|
|
\begin{memberdesc}[shlex]{infile}
|
|
The name of the current input file, as initially set at class
|
|
instantiation time or stacked by later source requests. It may
|
|
be useful to examine this when constructing error messages.
|
|
\end{memberdesc}
|
|
|
|
\begin{memberdesc}[shlex]{instream}
|
|
The input stream from which this \class{shlex} instance is reading
|
|
characters.
|
|
\end{memberdesc}
|
|
|
|
\begin{memberdesc}[shlex]{source}
|
|
This member is \code{None} by default. If you assign a string to it,
|
|
that string will be recognized as a lexical-level inclusion request
|
|
similar to the \samp{source} keyword in various shells. That is, the
|
|
immediately following token will opened as a filename and input taken
|
|
from that stream until \EOF, at which point the \method{close()}
|
|
method of that stream will be called and the input source will again
|
|
become the original input stream. Source requests may be stacked any
|
|
number of levels deep.
|
|
\end{memberdesc}
|
|
|
|
\begin{memberdesc}[shlex]{debug}
|
|
If this member is numeric and \code{1} or more, a \class{shlex}
|
|
instance will print verbose progress output on its behavior. If you
|
|
need to use this, you can read the module source code to learn the
|
|
details.
|
|
\end{memberdesc}
|
|
|
|
\begin{memberdesc}[shlex]{lineno}
|
|
Source line number (count of newlines seen so far plus one).
|
|
\end{memberdesc}
|
|
|
|
\begin{memberdesc}[shlex]{token}
|
|
The token buffer. It may be useful to examine this when catching
|
|
exceptions.
|
|
\end{memberdesc}
|
|
|
|
\begin{memberdesc}[shlex]{eof}
|
|
Token used to determine end of file. This will be set to the empty
|
|
string (\code{''}), in non-\POSIX{} mode, and to \code{None} in
|
|
\POSIX{} mode.
|
|
\versionadded{2.3}
|
|
\end{memberdesc}
|
|
|
|
\subsection{Parsing Rules\label{shlex-parsing-rules}}
|
|
|
|
When operating in non-\POSIX{} mode, \class{shlex} will try to obey to
|
|
the following rules.
|
|
|
|
\begin{itemize}
|
|
\item Quote characters are not recognized within words
|
|
(\code{Do"Not"Separate} is parsed as the single word
|
|
\code{Do"Not"Separate});
|
|
\item Escape characters are not recognized;
|
|
\item Enclosing characters in quotes preserve the literal value of
|
|
all characters within the quotes;
|
|
\item Closing quotes separate words (\code{"Do"Separate} is parsed
|
|
as \code{"Do"} and \code{Separate});
|
|
\item If \member{whitespace_split} is \code{False}, any character not
|
|
declared to be a word character, whitespace, or a quote will be
|
|
returned as a single-character token. If it is \code{True},
|
|
\class{shlex} will only split words in whitespaces;
|
|
\item EOF is signaled with an empty string (\code{''});
|
|
\item It's not possible to parse empty strings, even if quoted.
|
|
\end{itemize}
|
|
|
|
When operating in \POSIX{} mode, \class{shlex} will try to obey to the
|
|
following parsing rules.
|
|
|
|
\begin{itemize}
|
|
\item Quotes are stripped out, and do not separate words
|
|
(\code{"Do"Not"Separate"} is parsed as the single word
|
|
\code{DoNotSeparate});
|
|
\item Non-quoted escape characters (e.g. \character{\textbackslash})
|
|
preserve the literal value of the next character that follows;
|
|
\item Enclosing characters in quotes which are not part of
|
|
\member{escapedquotes} (e.g. \character{'}) preserve the literal
|
|
value of all characters within the quotes;
|
|
\item Enclosing characters in quotes which are part of
|
|
\member{escapedquotes} (e.g. \character{"}) preserves the literal
|
|
value of all characters within the quotes, with the exception of
|
|
the characters mentioned in \member{escape}. The escape characters
|
|
retain its special meaning only when followed by the quote in use,
|
|
or the escape character itself. Otherwise the escape character
|
|
will be considered a normal character.
|
|
\item EOF is signaled with a \constant{None} value;
|
|
\item Quoted empty strings (\code{''}) are allowed;
|
|
\end{itemize}
|
|
|