:tocdepth: 2

===============
Programming FAQ
===============

.. only:: html

   .. contents::

General Questions
=================

Is there a source code level debugger with breakpoints, single-stepping, etc.?
------------------------------------------------------------------------------

Yes.

Several debuggers for Python are described below, and the built-in function
:func:`breakpoint` allows you to drop into any of them.
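
As a minimal sketch of how :func:`breakpoint` hands control to a debugger: it
calls :func:`sys.breakpointhook`, which starts pdb by default.  The example
below swaps in a recording hook so that it runs non-interactively; the names
``record_hook`` and ``divide`` are made up for illustration.

```python
import sys

calls = []


def record_hook(*args, **kwargs):
    """Hypothetical stand-in for a debugger entry point: it only records the call."""
    calls.append((args, kwargs))


def divide(a, b):
    if b == 0:
        breakpoint()  # calls sys.breakpointhook(); normally this would open pdb
        return None
    return a / b


original_hook = sys.breakpointhook
sys.breakpointhook = record_hook  # route breakpoint() to the stand-in
try:
    result = divide(1, 0)
finally:
    sys.breakpointhook = original_hook  # restore the previous hook

print(len(calls))
```

In normal use you would leave the hook alone and simply call ``breakpoint()``
where you want to stop.  Setting the ``PYTHONBREAKPOINT`` environment variable
to ``0`` turns such calls into no-ops.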

The pdb module is a simple but adequate console-mode debugger for Python. It is
part of the standard Python library, and is :mod:`documented in the Library
Reference Manual <pdb>`. You can also write your own debugger by using the code
for pdb as an example.

The IDLE interactive development environment, which is part of the standard
Python distribution (normally available as
`Tools/scripts/idle3 <https://github.com/python/cpython/blob/main/Tools/scripts/idle3>`_),
includes a graphical debugger.

PythonWin is a Python IDE that includes a GUI debugger based on pdb.  The
PythonWin debugger colors breakpoints and has quite a few cool features such as
debugging non-PythonWin programs.  PythonWin is available as part of the
`pywin32 <https://github.com/mhammond/pywin32>`_ project and
as a part of the
`ActivePython <https://www.activestate.com/products/python/>`_ distribution.

`Eric <https://eric-ide.python-projects.org/>`_ is an IDE built on PyQt
and the Scintilla editing component.

`trepan3k <https://github.com/rocky/python3-trepan/>`_ is a gdb-like debugger.

`Visual Studio Code <https://code.visualstudio.com/>`_ is an IDE with debugging
tools that integrates with version-control software.

There are a number of commercial Python IDEs that include graphical debuggers.
They include:

* `Wing IDE <https://wingware.com/>`_
* `Komodo IDE <https://www.activestate.com/products/komodo-ide/>`_
* `PyCharm <https://www.jetbrains.com/pycharm/>`_


Are there tools to help find bugs or perform static analysis?
-------------------------------------------------------------

Yes.

`Pylint <https://pylint.pycqa.org/en/latest/index.html>`_ and
`Pyflakes <https://github.com/PyCQA/pyflakes>`_ do basic checking that will
help you catch bugs sooner.
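
To illustrate the kind of problem such tools are meant to catch, here is a
small sketch with two deliberate mistakes: an unused import and a misspelled
name.  The function ``describe`` is hypothetical.  Python only notices the
misspelling when the faulty branch actually runs, whereas linters generally
report undefined names and unused imports without running the code.

```python
import os  # never used below: linters generally report unused imports


def describe(n):
    if n < 0:
        return mesage  # typo for "message": an undefined name
    return "non-negative"


print(describe(5))  # the buggy branch is not reached, so this call succeeds
```

Calling ``describe(-1)`` would raise :exc:`NameError` at run time, which is
exactly the kind of failure that static checking can surface earlier.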

Static type checkers such as `Mypy <https://mypy-lang.org/>`_,
`Pyre <https://pyre-check.org/>`_, and
`Pytype <https://github.com/google/pytype>`_ can check type hints in Python
source code.
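
As a rough illustration, the annotated function below (``mean`` is a made-up
example) gives a type checker enough information to compare call sites against
the declared parameter and return types.  The interpreter itself does not
enforce the hints.

```python
from __future__ import annotations


def mean(values: list[float]) -> float:
    """Return the arithmetic mean of a non-empty list of numbers."""
    return sum(values) / len(values)


print(mean([1.0, 2.0, 3.0]))

# A static type checker would be expected to reject the call below, because a
# str is not a list[float]; Python itself only fails once the code runs.
# mean("123")
```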


.. _faq-create-standalone-binary:

How can I create a stand-alone binary from a Python script?
-----------------------------------------------------------

You don't need the ability to compile Python to C code if all you want is a
stand-alone program that users can download and run without having to install
the Python distribution first.  There are a number of tools that determine the
set of modules required by a program and bind these modules together with a
Python binary to produce a single executable.

One such tool is freeze, which is included in the Python source tree as
`Tools/freeze <https://github.com/python/cpython/tree/main/Tools/freeze>`_.
It converts Python bytecode to C arrays; with a C compiler you can
embed all your modules into a new program, which is then linked with the
standard Python modules.

It works by scanning your source recursively for import statements (in both
forms) and looking for the modules in the standard Python path as well as in the
source directory (for built-in modules).  It then turns the bytecode for modules
written in Python into C code (array initializers that can be turned into code
objects using the marshal module) and creates a custom-made config file that
only contains those built-in modules which are actually used in the program.  It
then compiles the generated C code and links it with the rest of the Python
interpreter to form a self-contained binary which acts exactly like your script.
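
The import-scanning step can be approximated with the standard library's
:mod:`modulefinder` module.  The sketch below is not the freeze tool itself; it
only shows the idea of following both import forms from a script.  The script
content and the file name ``demo_script.py`` are invented for the example.

```python
import tempfile
from modulefinder import ModuleFinder
from pathlib import Path

# A throwaway script (hypothetical content) that uses both import forms.
SCRIPT = "import colorsys\nfrom keyword import iskeyword\n"

with tempfile.TemporaryDirectory() as tmp:
    script_path = Path(tmp, "demo_script.py")
    script_path.write_text(SCRIPT)

    finder = ModuleFinder()
    finder.run_script(str(script_path))  # follows imports recursively
    found = sorted(finder.modules)       # names of every module located

print(found)
```

The resulting list includes the script itself (under the name ``__main__``)
along with every module it imports, directly or indirectly.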

The following packages can help with the creation of console and GUI
executables:

* `Nuitka <https://nuitka.net/>`_ (Cross-platform)
* `PyInstaller <https://pyinstaller.org/>`_ (Cross-platform)
* `PyOxidizer <https://pyoxidizer.readthedocs.io/en/stable/>`_ (Cross-platform)
* `cx_Freeze <https://marcelotduarte.github.io/cx_Freeze/>`_ (Cross-platform)
* `py2app <https://github.com/ronaldoussoren/py2app>`_ (macOS only)
* `py2exe <https://www.py2exe.org/>`_ (Windows only)

Are there coding standards or a style guide for Python programs?
----------------------------------------------------------------

Yes.  The coding style required for standard library modules is documented as
:pep:`8`.


Core Language
=============

.. _faq-unboundlocalerror:

Why am I getting an UnboundLocalError when the variable has a value?
--------------------------------------------------------------------

It can be a surprise to get the :exc:`UnboundLocalError` in previously working
code when it is modified by adding an assignment statement somewhere in
the body of a function.

This code:

   >>> x = 10
   >>> def bar():
   ...     print(x)
   ...
   >>> bar()
   10

works, but this code:

   >>> x = 10
   >>> def foo():
   ...     print(x)
   ...     x += 1

results in an :exc:`!UnboundLocalError`:

   >>> foo()
   Traceback (most recent call last):
     ...
   UnboundLocalError: local variable 'x' referenced before assignment

This is because when you make an assignment to a variable in a scope, that
variable becomes local to that scope and shadows any similarly named variable
in the outer scope.  Since the last statement in ``foo`` assigns a new value to
``x``, the compiler recognizes it as a local variable.  Consequently, when the
earlier ``print(x)`` attempts to print the uninitialized local variable,
an error results.
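
One way to see that this decision is made at compile time, before any line of
the function runs, is to inspect the function's code object.  This is a small
sketch that reuses the ``foo`` example from above.

```python
x = 10


def foo():
    print(x)
    x += 1  # this assignment makes x local to the whole function body


# The compiler has already recorded x as a local name of foo.
print(foo.__code__.co_varnames)

try:
    foo()
except UnboundLocalError as exc:
    error = exc
    print(type(exc).__name__)
```

The tuple printed first contains ``'x'``, which confirms that the name is
treated as local throughout ``foo``, including on the line that comes before
the assignment.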

In the example above you can access the outer scope variable by declaring it
global:

   >>> x = 10
   >>> def foobar():
   ...     global x
   ...     print(x)
   ...     x += 1
   ...
   >>> foobar()
   10

This explicit declaration is required in order to remind you that (unlike the
superficially analogous situation with class and instance variables) you are
actually modifying the value of the variable in the outer scope:

   >>> print(x)
   11

You can do a similar thing in a nested scope using the :keyword:`nonlocal`
keyword:

   >>> def foo():
   ...    x = 10
   ...    def bar():
   ...        nonlocal x
   ...        print(x)
   ...        x += 1
   ...    bar()
   ...    print(x)
   ...
   >>> foo()
   10
   11
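
A common practical use of :keyword:`nonlocal` is a closure that keeps private
state between calls.  The following sketch uses a hypothetical
``make_counter`` factory to show that each closure rebinds its own enclosing
variable.

```python
def make_counter(start=0):
    count = start

    def increment():
        nonlocal count  # rebind the variable in make_counter's scope
        count += 1
        return count

    return increment


counter = make_counter()
first, second = counter(), counter()
other = make_counter(10)  # a separate closure with its own count
third = other()
print(first, second, third)
```

Without the ``nonlocal`` declaration, ``count += 1`` would make ``count`` local
to ``increment`` and raise :exc:`UnboundLocalError`, for the reason described
above.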


What are the rules for local and global variables in Python?
------------------------------------------------------------

In Python, variables that are only referenced inside a function are implicitly
global.  If a variable is assigned a value anywhere within the function's body,
it's assumed to be a local unless explicitly declared as global.

Though a bit surprising at first, a moment's consideration explains this.  On
one hand, requiring :keyword:`global` for assigned variables provides a bar
against unintended side-effects.  On the other hand, if ``global`` was required
for all global references, you'd be using ``global`` all the time.  You'd have
to declare as global every reference to a built-in function or to a component of
an imported module.  This clutter would defeat the usefulness of the ``global``
declaration for identifying side-effects.
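
The three cases can be put side by side in a short sketch.  The function names
are invented for illustration.

```python
counter = 0


def read_only():
    return counter  # only referenced, so the global is used implicitly


def shadow():
    counter = 100  # assigned, so this is a new local; the global is untouched
    return counter


def increment():
    global counter  # explicit declaration needed to rebind the global
    counter += 1
    return counter


print(read_only(), shadow(), increment(), read_only())
```

Only ``increment`` changes the module-level ``counter``.  The assignment in
``shadow`` affects nothing outside that function.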


Why do lambdas defined in a loop with different values all return the same result?
----------------------------------------------------------------------------------

Assume you use a for loop to define a few different lambdas (or even plain
functions), e.g.::

   >>> squares = []
   >>> for x in range(5):
   ...     squares.append(lambda: x**2)

This gives you a list that contains 5 lambdas that calculate ``x**2``.  You
might expect that, when called, they would return, respectively, ``0``, ``1``,
``4``, ``9``, and ``16``.  However, when you actually try you will see that
they all return ``16``::

   >>> squares[2]()
   16
   >>> squares[4]()
   16

This happens because ``x`` is not local to the lambdas, but is defined in
the outer scope, and it is accessed when the lambda is called --- not when it
is defined.  At the end of the loop, the value of ``x`` is ``4``, so all the
functions now return ``4**2``, i.e. ``16``.  You can also verify this by
changing the value of ``x`` and seeing how the results of the lambdas change::

   >>> x = 8
   >>> squares[2]()
   64

In order to avoid this, you need to save the values in variables local to the
lambdas, so that they don't rely on the value of the global ``x``::

   >>> squares = []
   >>> for x in range(5):
   ...     squares.append(lambda n=x: n**2)

Here, ``n=x`` creates a new variable ``n`` local to the lambda and computed
when the lambda is defined so that it has the same value that ``x`` had at
that point in the loop.  This means that the value of ``n`` will be ``0``
in the first lambda, ``1`` in the second, ``2`` in the third, and so on.
Therefore each lambda will now return the correct result::

   >>> squares[2]()
   4
   >>> squares[4]()
   16

Note that this behaviour is not peculiar to lambdas, but applies to regular
functions too.
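
The sketch below shows the same late-binding effect with a regular ``def``.
It also shows :func:`functools.partial` as another way to capture the current
value; this is an alternative to the default-argument technique described
above, not a replacement for it.  The helper ``power`` is hypothetical.

```python
from functools import partial


def power(base, exponent):
    return base ** exponent


# Late binding: every function reads the loop variable n when it is called.
late = []
for n in range(3):
    def late_power(base):
        return power(base, n)
    late.append(late_power)

# Early binding: partial() stores the current value of n in each callable.
early = [partial(power, exponent=n) for n in range(3)]

print([f(2) for f in late])
print([f(2) for f in early])
```

All functions in ``late`` use the final value of ``n``, while each entry in
``early`` keeps the exponent it was created with.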


How do I share global variables across modules?
------------------------------------------------

The canonical way to share information across modules within a single program is
to create a special module (often called config or cfg).  Just import the config
module in all modules of your application; the module then becomes available as
a global name.  Because there is only one instance of each module, any changes
made to the module object get reflected everywhere.  For example:

config.py::

   x = 0   # Default value of the 'x' configuration setting

mod.py::

   import config
   config.x = 1

main.py::

   import config
   import mod
   print(config.x)

Note that using a module is also the basis for implementing the singleton design
pattern, for the same reason.
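
The "only one instance of each module" point can be checked in a single file.
This sketch builds a stand-in for ``config.py`` in memory with
:class:`types.ModuleType` (an assumption made so the example is
self-contained; normally ``config`` would be a real file) and shows that a
later import returns the very same object.

```python
import importlib
import sys
import types

# Build a stand-in for config.py in memory (hypothetical; normally a file).
config = types.ModuleType("config")
config.x = 0
sys.modules["config"] = config


def update_setting():
    import config as cfg  # found in sys.modules, so no second copy is made
    cfg.x = 1


update_setting()
same_object = importlib.import_module("config") is config
print(config.x, same_object)
del sys.modules["config"]  # clean up the stand-in module
```

Note that ``from config import x`` would instead bind a separate name to the
current value, so a later rebinding of ``config.x`` would not be visible
through that name.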


What are the "best practices" for using import in a module?
-----------------------------------------------------------

In general, don't use ``from modulename import *``.  Doing so clutters the
importer's namespace, and makes it much harder for linters to detect undefined
names.

Import modules at the top of a file.  Doing so makes it clear what other modules
your code requires and avoids questions of whether the module name is in scope.
Using one import per line makes it easy to add and delete module imports, but
using multiple imports per line uses less screen space.

It's good practice if you import modules in the following order:

1. standard library modules -- e.g. :mod:`sys`, :mod:`os`, :mod:`argparse`, :mod:`re`
2. third-party library modules (anything installed in Python's site-packages
   directory) -- e.g. :mod:`!dateutil`, :mod:`!requests`, :mod:`!PIL.Image`
3. locally developed modules

It is sometimes necessary to move imports to a function or class to avoid
problems with circular imports.  Gordon McMillan says:

   Circular imports are fine where both modules use the "import <module>" form
   of import.  They fail when the 2nd module wants to grab a name out of the
   first ("from module import name") and the import is at the top level.  That's
   because names in the 1st are not yet available, because the first module is
   busy importing the 2nd.

In this case, if the second module is only used in one function, then the import
can easily be moved into that function.  By the time the import is called, the
first module will have finished initializing, and the second module can do its
import.
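
The following sketch reproduces the arrangement with two throwaway modules
written to a temporary directory.  The module names ``faq_first`` and
``faq_second`` and their contents are invented for the example.  The second
module defers its ``from ... import`` until its function is called, so the
circular dependency is harmless.

```python
import sys
import tempfile
import textwrap
from pathlib import Path

FIRST = textwrap.dedent("""\
    import faq_second        # triggers the import of faq_second immediately

    def helper():
        return "helper from faq_first"
""")

SECOND = textwrap.dedent("""\
    def use_helper():
        # Deferred import: runs at call time, after faq_first has finished
        # initializing, so the name "helper" is available.
        from faq_first import helper
        return helper()
""")

with tempfile.TemporaryDirectory() as tmp:
    Path(tmp, "faq_first.py").write_text(FIRST)
    Path(tmp, "faq_second.py").write_text(SECOND)
    sys.path.insert(0, tmp)
    try:
        import faq_first
        result = faq_first.faq_second.use_helper()
    finally:
        sys.path.remove(tmp)

print(result)
```

Had ``faq_second`` placed ``from faq_first import helper`` at its top level,
the import would have run while ``faq_first`` was still only partially
initialized, before ``helper`` had been defined.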

It may also be necessary to move imports out of the top level of code if some of
the modules are platform-specific.  In that case, it may not even be possible to
import all of the modules at the top of the file.  In this case, importing the
correct modules in the corresponding platform-specific code is a good option.

Only move imports into a local scope, such as inside a function definition, if
it's necessary to solve a problem such as avoiding a circular import or if you
are trying to reduce the initialization time of a module.  This technique is
especially helpful if many of the imports are unnecessary depending on how the
program executes.  You may also want to move imports into a function if the
modules are only ever used in that function.  Note that loading a module the
first time may be expensive because of the one-time initialization of the
module, but loading a module multiple times is virtually free, costing only a
couple of dictionary lookups.  Even if the module name has gone out of scope,
the module is probably available in :data:`sys.modules`.
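
A small sketch of a function-level import makes the caching behaviour visible.
The function ``to_json`` is a made-up example; :mod:`json` merely stands in for
any module that is only needed inside one function.

```python
import sys


def to_json(data):
    import json  # executed on every call; only the first one loads the module
    return json.dumps(data)


first = to_json({"a": 1})
cached = sys.modules["json"]  # the module object stored by the first import
second = to_json([1, 2])      # this import is just a lookup of that object

import json
print(first, second, json is cached)
```

The name ``json`` is local to ``to_json`` and disappears when the function
returns, yet the module object stays in ``sys.modules`` and is reused by every
later import.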


Why are default values shared between objects?
----------------------------------------------

This type of bug commonly bites neophyte programmers.  Consider this function::

   def foo(mydict={}):  # Danger: shared reference to one dict for all calls
       ... compute something ...
       mydict[key] = value
       return mydict

The first time you call this function, ``mydict`` contains a single item.  The
second time, ``mydict`` contains two items because when ``foo()`` begins
executing, ``mydict`` starts out with an item already in it.

It is often expected that a function call creates new objects for default
values. This is not what happens. Default values are created exactly once, when
the function is defined.  If that object is changed, like the dictionary in this
example, subsequent calls to the function will refer to this changed object.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | By definition, immutable objects such as numbers, strings, tuples, and ``None``
 | 
					
						
							|  |  |  | are safe from change. Changes to mutable objects such as dictionaries, lists,
 | 
					
						
							|  |  |  | and class instances can lead to confusion.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Because of this feature, it is good programming practice to not use mutable
 | 
					
						
							|  |  |  | objects as default values.  Instead, use ``None`` as the default value and
 | 
					
						
							|  |  |  | inside the function, check if the parameter is ``None`` and create a new
 | 
					
						
							|  |  |  | list/dictionary/whatever if it is.  For example, don't write::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    def foo(mydict={}):
 | 
					
						
							|  |  |  |        ...
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | but::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    def foo(mydict=None):
 | 
					
						
							|  |  |  |        if mydict is None:
 | 
					
						
							|  |  |  |            mydict = {}  # create a new dict for local namespace
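For comparison, here is a complete sketch of the ``None`` idiom.  The helper ``collect`` is an assumed name, not part of any library.

```python
def collect(key, value, mydict=None):  # hypothetical helper
    if mydict is None:
        mydict = {}  # a fresh dict on every call that omits the argument
    mydict[key] = value
    return mydict


first = collect("a", 1)
second = collect("b", 2)         # independent of the first call
shared = collect("c", 3, first)  # an explicitly passed dict is still updated
```

Calls that omit ``mydict`` no longer see each other's data, while a caller who passes a dictionary deliberately still gets it modified in place.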
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | This feature can be useful.  When you have a function that's time-consuming to
 | 
					
						
							|  |  |  | compute, a common technique is to cache the parameters and the resulting value
 | 
					
						
							|  |  |  | of each call to the function, and return the cached value if the same value is
 | 
					
						
							|  |  |  | requested again.  This is called "memoizing", and can be implemented like this::
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2018-06-16 05:29:11 +02:00
										 |  |  |    # Callers can only provide two parameters and optionally pass _cache by keyword
 | 
					
						
							|  |  |  |    def expensive(arg1, arg2, *, _cache={}):
 | 
					
						
							| 
									
										
										
										
											2014-07-06 20:53:27 +03:00
										 |  |  |        if (arg1, arg2) in _cache:
 | 
					
						
							|  |  |  |            return _cache[(arg1, arg2)]
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |        # Calculate the value
 | 
					
						
							|  |  |  |        result = ... expensive computation ...
 | 
					
						
							| 
									
										
										
										
											2014-09-28 11:01:11 -04:00
										 |  |  |        _cache[(arg1, arg2)] = result           # Store result in the cache
 | 
					
						
							| 
									
										
										
										
											2014-07-06 20:53:27 +03:00
										 |  |  |        return result
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | You could use a global variable containing a dictionary instead of the default
 | 
					
						
							|  |  |  | value; it's a matter of taste.
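A further option, not mentioned above, is the standard library's ``functools.lru_cache`` decorator, which keeps the cache out of the signature altogether.  The sketch below assumes hashable arguments and uses exponentiation as a stand-in for the expensive computation.

```python
from functools import lru_cache

calls = []  # records each time the function body actually runs


@lru_cache(maxsize=None)
def expensive(arg1, arg2):
    calls.append((arg1, arg2))
    return arg1 ** arg2  # stand-in for a slow computation


first = expensive(2, 10)
second = expensive(2, 10)  # answered from the cache; the body is not re-run
```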
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | How can I pass optional or keyword parameters from one function to another?
 | 
					
						
							|  |  |  | ---------------------------------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Collect the arguments using the ``*`` and ``**`` specifiers in the function's
 | 
					
						
							|  |  |  | parameter list; this gives you the positional arguments as a tuple and the
 | 
					
						
							|  |  |  | keyword arguments as a dictionary.  You can then pass these arguments when
 | 
					
						
							|  |  |  | calling another function by using ``*`` and ``**``::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    def f(x, *args, **kwargs):
 | 
					
						
							|  |  |  |        ...
 | 
					
						
							|  |  |  |        kwargs['width'] = '14.3c'
 | 
					
						
							|  |  |  |        ...
 | 
					
						
							|  |  |  |        g(x, *args, **kwargs)
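To see the forwarding end to end, here is a runnable version in which ``g`` simply reports what it received.  The body of ``g`` is an assumption made for the demonstration.

```python
def g(x, *args, **kwargs):
    # Hypothetical target function: return everything it was given.
    return x, args, kwargs


def f(x, *args, **kwargs):
    kwargs['width'] = '14.3c'         # add or override a keyword argument
    return g(x, *args, **kwargs)      # unpack again when calling g


result = f(1, 2, 3, color='red')
print(result)
```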
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2012-12-25 14:54:44 -08:00
										 |  |  | .. index::
 | 
					
						
							|  |  |  |    single: argument; difference from parameter
 | 
					
						
							|  |  |  |    single: parameter; difference from argument
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2012-11-28 02:29:33 -08:00
										 |  |  | .. _faq-argument-vs-parameter:
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | What is the difference between arguments and parameters?
 | 
					
						
							|  |  |  | --------------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | :term:`Parameters <parameter>` are defined by the names that appear in a
 | 
					
						
							|  |  |  | function definition, whereas :term:`arguments <argument>` are the values
 | 
					
						
							| 
									
										
										
										
											2022-06-27 03:27:41 +05:30
										 |  |  | actually passed to a function when calling it.  Parameters define what
 | 
					
						
							|  |  |  | :term:`kind of arguments <parameter>` a function can accept.  For
 | 
					
						
							|  |  |  | example, given the function definition::
 | 
					
						
							| 
									
										
										
										
											2012-11-28 02:29:33 -08:00
										 |  |  | 
 | 
					
						
							|  |  |  |    def func(foo, bar=None, **kwargs):
 | 
					
						
							|  |  |  |        pass
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | *foo*, *bar* and *kwargs* are parameters of ``func``.  However, when calling
 | 
					
						
							|  |  |  | ``func``, for example::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    func(42, bar=314, extra=somevar)
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | the values ``42``, ``314``, and ``somevar`` are arguments.
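One way to inspect the distinction at run time is ``inspect.signature``, which lists a function's parameters independently of any call.  In this sketch ``func`` returns what it was given so the arguments can be inspected too, and ``somevar`` is given an arbitrary value.

```python
import inspect


def func(foo, bar=None, **kwargs):
    return foo, bar, kwargs


# Parameters belong to the definition.
signature = inspect.signature(func)
param_names = list(signature.parameters)
kinds = {name: p.kind.name for name, p in signature.parameters.items()}

# Arguments belong to a particular call.
somevar = "spam"  # arbitrary value for the example
received = func(42, bar=314, extra=somevar)
```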
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2014-09-28 11:01:11 -04:00
										 |  |  | Why did changing list 'y' also change list 'x'?
 | 
					
						
							|  |  |  | ------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | If you wrote code like::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    >>> x = []
 | 
					
						
							|  |  |  |    >>> y = x
 | 
					
						
							|  |  |  |    >>> y.append(10)
 | 
					
						
							|  |  |  |    >>> y
 | 
					
						
							|  |  |  |    [10]
 | 
					
						
							|  |  |  |    >>> x
 | 
					
						
							|  |  |  |    [10]
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | you might be wondering why appending an element to ``y`` changed ``x`` too.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | There are two factors that produce this result:
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 1) Variables are simply names that refer to objects.  Doing ``y = x`` doesn't
 | 
					
						
							|  |  |  |    create a copy of the list -- it creates a new variable ``y`` that refers to
 | 
					
						
							|  |  |  |    the same object ``x`` refers to.  This means that there is only one object
 | 
					
						
							|  |  |  |    (the list), and both ``x`` and ``y`` refer to it.
 | 
					
						
							|  |  |  | 2) Lists are :term:`mutable`, which means that you can change their content.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | After the call to :meth:`~list.append`, the content of the mutable object has
 | 
					
						
							|  |  |  | changed from ``[]`` to ``[10]``.  Since both the variables refer to the same
 | 
					
						
							| 
									
										
										
										
											2014-09-29 10:17:28 -04:00
										 |  |  | object, using either name accesses the modified value ``[10]``.
 | 
					
						
							| 
									
										
										
										
											2014-09-28 11:01:11 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  | If we instead assign an immutable object to ``x``::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    >>> x = 5  # ints are immutable
 | 
					
						
							|  |  |  |    >>> y = x
 | 
					
						
							|  |  |  |    >>> x = x + 1  # 5 can't be mutated, we are creating a new object here
 | 
					
						
							|  |  |  |    >>> x
 | 
					
						
							|  |  |  |    6
 | 
					
						
							|  |  |  |    >>> y
 | 
					
						
							|  |  |  |    5
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | we can see that in this case ``x`` and ``y`` are not equal anymore.  This is
 | 
					
						
							|  |  |  | because integers are :term:`immutable`, and when we do ``x = x + 1`` we are not
 | 
					
						
							|  |  |  | mutating the int ``5`` by incrementing its value; instead, we are creating a
 | 
					
						
							|  |  |  | new object (the int ``6``) and assigning it to ``x`` (that is, changing which
 | 
					
						
							|  |  |  | object ``x`` refers to).  After this assignment we have two objects (the ints
 | 
					
						
							|  |  |  | ``6`` and ``5``) and two variables that refer to them (``x`` now refers to
 | 
					
						
							|  |  |  | ``6`` but ``y`` still refers to ``5``).
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Some operations (for example ``y.append(10)`` and ``y.sort()``) mutate the
 | 
					
						
							|  |  |  | object, whereas superficially similar operations (for example ``y = y + [10]``
 | 
					
						
							| 
									
										
										
										
											2022-11-01 21:58:44 -07:00
										 |  |  | and :func:`sorted(y) <sorted>`) create a new object.  In general in Python (and in all cases
 | 
					
						
							| 
									
										
										
										
											2014-09-28 11:01:11 -04:00
										 |  |  | in the standard library) a method that mutates an object will return ``None``
 | 
					
						
							|  |  |  | to help avoid getting the two types of operations confused.  So if you
 | 
					
						
							|  |  |  | mistakenly write ``y.sort()`` thinking it will give you a sorted copy of ``y``,
 | 
					
						
							|  |  |  | you'll instead end up with ``None``, which will likely cause your program to
 | 
					
						
							|  |  |  | generate an easily diagnosed error.
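The difference between the two kinds of operation can be checked directly.  The variable names below are chosen for the example.

```python
y = [3, 1, 2]

in_place = y.sort()            # mutates y and returns None
new_list = sorted([3, 1, 2])   # leaves its argument alone and returns a new list

print(in_place, y, new_list)
```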
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | However, there is one class of operations where the same operation sometimes
 | 
					
						
							|  |  |  | has different behaviors with different types:  the augmented assignment
 | 
					
						
							|  |  |  | operators.  For example, ``+=`` mutates lists but not tuples or ints (``a_list
 | 
					
						
							|  |  |  | += [1, 2, 3]`` is equivalent to ``a_list.extend([1, 2, 3])`` and mutates
 | 
					
						
							|  |  |  | ``a_list``, whereas ``some_tuple += (1, 2, 3)`` and ``some_int += 1`` create
 | 
					
						
							|  |  |  | new objects).
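A short sketch of that contrast, using a second name for each object to show whether it was mutated or replaced:

```python
a_list = [0]
list_alias = a_list
a_list += [1, 2, 3]          # extends the existing list in place

some_tuple = (0,)
tuple_alias = some_tuple
some_tuple += (1, 2, 3)      # builds a new tuple and rebinds the name

print(list_alias)            # sees the change made through a_list
print(tuple_alias)           # still the original one-element tuple
```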
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | In other words:
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | * If we have a mutable object (:class:`list`, :class:`dict`, :class:`set`,
 | 
					
						
							|  |  |  |   etc.), we can use some specific operations to mutate it and all the variables
 | 
					
						
							|  |  |  |   that refer to it will see the change.
 | 
					
						
							|  |  |  | * If we have an immutable object (:class:`str`, :class:`int`, :class:`tuple`,
 | 
					
						
							|  |  |  |   etc.), all the variables that refer to it will always see the same value,
 | 
					
						
							|  |  |  |   but operations that transform that value into a new value always return a new
 | 
					
						
							|  |  |  |   object.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | If you want to know if two variables refer to the same object or not, you can
 | 
					
						
							|  |  |  | use the :keyword:`is` operator, or the built-in function :func:`id`.
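For instance, assuming three names where only two share an object:

```python
x = []
y = x          # second name for the same list
z = list(x)    # equal contents, but a separate list

same_object = x is y
same_id = id(x) == id(y)
equal_but_distinct = (x == z) and (x is not z)
```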
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | How do I write a function with output parameters (call by reference)?
 | 
					
						
							|  |  |  | ---------------------------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Remember that arguments are passed by assignment in Python.  Since assignment
 | 
					
						
							|  |  |  | just creates references to objects, there's no alias between an argument name in
 | 
					
						
							|  |  |  | the caller and callee, and so no call-by-reference per se.  You can achieve the
 | 
					
						
							|  |  |  | desired effect in a number of ways.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 1) By returning a tuple of the results::
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2020-08-09 03:29:03 +08:00
										 |  |  |       >>> def func1(a, b):
 | 
					
						
							|  |  |  |       ...     a = 'new-value'        # a and b are local names
 | 
					
						
							|  |  |  |       ...     b = b + 1              # assigned to new objects
 | 
					
						
							|  |  |  |       ...     return a, b            # return new values
 | 
					
						
							|  |  |  |       ...
 | 
					
						
							|  |  |  |       >>> x, y = 'old-value', 99
 | 
					
						
							|  |  |  |       >>> func1(x, y)
 | 
					
						
							|  |  |  |       ('new-value', 100)
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  |    This is almost always the clearest solution.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 2) By using global variables.  This isn't thread-safe, and is not recommended.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 3) By passing a mutable (changeable in-place) object::
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2020-08-09 03:29:03 +08:00
										 |  |  |       >>> def func2(a):
 | 
					
						
							|  |  |  |       ...     a[0] = 'new-value'     # 'a' references a mutable list
 | 
					
						
							|  |  |  |       ...     a[1] = a[1] + 1        # changes a shared object
 | 
					
						
							|  |  |  |       ...
 | 
					
						
							|  |  |  |       >>> args = ['old-value', 99]
 | 
					
						
							|  |  |  |       >>> func2(args)
 | 
					
						
							|  |  |  |       >>> args
 | 
					
						
							|  |  |  |       ['new-value', 100]
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | 4) By passing in a dictionary that gets mutated::
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2020-08-09 03:29:03 +08:00
										 |  |  |       >>> def func3(args):
 | 
					
						
							|  |  |  |       ...     args['a'] = 'new-value'     # args is a mutable dictionary
 | 
					
						
							|  |  |  |       ...     args['b'] = args['b'] + 1   # change it in-place
 | 
					
						
							|  |  |  |       ...
 | 
					
						
							|  |  |  |       >>> args = {'a': 'old-value', 'b': 99}
 | 
					
						
							|  |  |  |       >>> func3(args)
 | 
					
						
							|  |  |  |       >>> args
 | 
					
						
							|  |  |  |       {'a': 'new-value', 'b': 100}
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | 5) Or bundle up values in a class instance::
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2020-08-09 03:29:03 +08:00
										 |  |  |       >>> class Namespace:
 | 
					
						
							|  |  |  |       ...     def __init__(self, /, **args):
 | 
					
						
							|  |  |  |       ...         for key, value in args.items():
 | 
					
						
							|  |  |  |       ...             setattr(self, key, value)
 | 
					
						
							|  |  |  |       ...
 | 
					
						
							|  |  |  |       >>> def func4(args):
 | 
					
						
							|  |  |  |       ...     args.a = 'new-value'        # args is a mutable Namespace
 | 
					
						
							|  |  |  |       ...     args.b = args.b + 1         # change object in-place
 | 
					
						
							|  |  |  |       ...
 | 
					
						
							|  |  |  |       >>> args = Namespace(a='old-value', b=99)
 | 
					
						
							|  |  |  |       >>> func4(args)
 | 
					
						
							|  |  |  |       >>> vars(args)
 | 
					
						
							|  |  |  |       {'a': 'new-value', 'b': 100}
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    There's almost never a good reason to get this complicated.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Your best choice is to return a tuple containing the multiple results.
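With the tuple approach, the caller usually rebinds its own names by unpacking the result, as in this sketch based on ``func1`` from option 1:

```python
def func1(a, b):
    a = 'new-value'        # rebinding local names only
    b = b + 1
    return a, b            # hand the new values back to the caller


x, y = 'old-value', 99
x, y = func1(x, y)         # the caller decides which names receive the results
print(x, y)
```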
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | How do you make a higher order function in Python?
 | 
					
						
							|  |  |  | --------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | You have two choices: you can use nested scopes or you can use callable objects.
 | 
					
						
							|  |  |  | For example, suppose you wanted to define ``linear(a,b)`` which returns a
 | 
					
						
							|  |  |  | function ``f(x)`` that computes the value ``a*x+b``.  Using nested scopes::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    def linear(a, b):
 | 
					
						
							|  |  |  |        def result(x):
 | 
					
						
							|  |  |  |            return a * x + b
 | 
					
						
							|  |  |  |        return result
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Or using a callable object::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    class linear:
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |        def __init__(self, a, b):
 | 
					
						
							|  |  |  |            self.a, self.b = a, b
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |        def __call__(self, x):
 | 
					
						
							|  |  |  |            return self.a * x + self.b
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | In both cases, ::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    taxes = linear(0.3, 2)
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | gives a callable object where ``taxes(10e6) == 0.3 * 10e6 + 2``.
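The two approaches can be placed side by side to confirm that they behave alike.  The class is renamed ``LinearCallable`` here purely so both definitions can coexist in one snippet.

```python
def linear(a, b):
    def result(x):
        return a * x + b   # a and b come from the enclosing scope
    return result


class LinearCallable:      # same idea as the class version, renamed for this sketch
    def __init__(self, a, b):
        self.a, self.b = a, b

    def __call__(self, x):
        return self.a * x + self.b


taxes = linear(0.3, 2)
taxes_obj = LinearCallable(0.3, 2)
print(taxes(10e6), taxes_obj(10e6))
```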
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The callable object approach has the disadvantage that it is a bit slower and
 | 
					
						
							|  |  |  | results in slightly longer code.  However, note that a collection of callables
 | 
					
						
							|  |  |  | can share their signature via inheritance::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    class exponential(linear):
 | 
					
						
							|  |  |  |        # __init__ inherited
 | 
					
						
							|  |  |  |        def __call__(self, x):
 | 
					
						
							|  |  |  |            return self.a * (x ** self.b)
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Objects can encapsulate state for several methods::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    class counter:
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |        value = 0
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |        def set(self, x):
 | 
					
						
							|  |  |  |            self.value = x
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |        def up(self):
 | 
					
						
							|  |  |  |            self.value = self.value + 1
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |        def down(self):
 | 
					
						
							|  |  |  |            self.value = self.value - 1
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    count = counter()
 | 
					
						
							|  |  |  |    inc, dec, reset = count.up, count.down, count.set
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Here ``inc()``, ``dec()`` and ``reset()`` act like functions which share the
 | 
					
						
							|  |  |  | same counting variable.
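A brief usage sketch of the bound methods, repeating the class so the snippet runs on its own:

```python
class counter:

    value = 0

    def set(self, x):
        self.value = x

    def up(self):
        self.value = self.value + 1

    def down(self):
        self.value = self.value - 1


count = counter()
inc, dec, reset = count.up, count.down, count.set

inc()
inc()
dec()
after_steps = count.value   # two increments and one decrement
reset(10)
after_reset = count.value
```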
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | How do I copy an object in Python?
 | 
					
						
							|  |  |  | ----------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Try :func:`copy.copy` or :func:`copy.deepcopy` for the general case.
 | 
					
						
							|  |  |  | Not all objects can be copied, but most can.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Some objects can be copied more easily.  Dictionaries have a :meth:`~dict.copy`
 | 
					
						
							|  |  |  | method::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    newdict = olddict.copy()
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Sequences can be copied by slicing::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    new_l = l[:]
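When the object contains other mutable objects, the choice between a shallow and a deep copy matters.  The following sketch uses a made-up dictionary to show the difference.

```python
import copy

original = {"nums": [1, 2]}

shallow = copy.copy(original)     # new dict, but the inner list is shared
deep = copy.deepcopy(original)    # new dict and a new inner list

original["nums"].append(3)

print(shallow["nums"])            # reflects the append
print(deep["nums"])               # unaffected
```

``dict.copy()`` and slicing are shallow copies as well, so they behave like ``copy.copy`` here.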
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | How can I find the methods or attributes of an object?
 | 
					
						
							|  |  |  | ------------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2022-11-01 21:58:44 -07:00
										 |  |  | For an instance ``x`` of a user-defined class, :func:`dir(x) <dir>` returns an alphabetized
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | list of names: the instance's attributes plus the methods and attributes
 | 
					
						
							|  |  |  | defined by its class.
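As a small illustration, with a hypothetical ``Point`` class and the underscore-prefixed names filtered out for brevity:

```python
class Point:               # hypothetical class for the example
    def __init__(self, x):
        self.x = x         # instance attribute

    def norm(self):        # method defined by the class
        return abs(self.x)


p = Point(-3)
public = [name for name in dir(p) if not name.startswith('_')]
print(public)
```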
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | How can my code discover the name of an object?
 | 
					
						
							|  |  |  | -----------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Generally speaking, it can't, because objects don't really have names.
 | 
					
						
							| 
									
										
										
										
											2019-08-29 11:10:50 +05:30
										 |  |  | Essentially, assignment always binds a name to a value; the same is true of
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | ``def`` and ``class`` statements, but in that case the value is a
 | 
					
						
							|  |  |  | callable. Consider the following code::
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-05-10 12:01:23 +03:00
										 |  |  |    >>> class A:
 | 
					
						
							|  |  |  |    ...     pass
 | 
					
						
							|  |  |  |    ...
 | 
					
						
							|  |  |  |    >>> B = A
 | 
					
						
							|  |  |  |    >>> a = B()
 | 
					
						
							|  |  |  |    >>> b = a
 | 
					
						
							|  |  |  |    >>> print(b)
 | 
					
						
							| 
									
										
										
										
											2009-12-19 17:51:41 +00:00
										 |  |  |    <__main__.A object at 0x16D07CC>
 | 
					
						
							| 
									
										
										
										
											2016-05-10 12:01:23 +03:00
										 |  |  |    >>> print(a)
 | 
					
						
							| 
									
										
										
										
											2009-12-19 17:51:41 +00:00
										 |  |  |    <__main__.A object at 0x16D07CC>
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | Arguably the class has a name: even though it is bound to two names and invoked
 | 
					
						
							| 
									
										
										
										
											2022-11-01 21:58:44 -07:00
										 |  |  | through the name ``B``, the created instance is still reported as an instance of
 | 
					
						
							|  |  |  | class ``A``.  However, it is impossible to say whether the instance's name is ``a`` or
 | 
					
						
							|  |  |  | ``b``, since both names are bound to the same value.
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | Generally speaking it should not be necessary for your code to "know the names"
 | 
					
						
							|  |  |  | of particular values. Unless you are deliberately writing introspective
 | 
					
						
							|  |  |  | programs, this is usually an indication that a change of approach might be
 | 
					
						
							|  |  |  | beneficial.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | In comp.lang.python, Fredrik Lundh once gave an excellent analogy in answer to
 | 
					
						
							|  |  |  | this question:
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    The same way as you get the name of that cat you found on your porch: the cat
 | 
					
						
							|  |  |  |    (object) itself cannot tell you its name, and it doesn't really care -- so
 | 
					
						
							|  |  |  |    the only way to find out what it's called is to ask all your neighbours
 | 
					
						
							|  |  |  |    (namespaces) if it's their cat (object)...
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    ....and don't be surprised if you'll find that it's known by many names, or
 | 
					
						
							|  |  |  |    no name at all!
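In code, "asking the neighbours" amounts to searching a namespace for names bound to the object.  The sketch below uses an explicit dictionary as the namespace; the variable names are assumptions made for the example.

```python
class A:
    pass


B = A
obj = B()

# An explicit namespace standing in for globals() or a module's __dict__.
namespace = {"a": obj, "b": obj, "c": A()}

names = sorted(name for name, value in namespace.items() if value is obj)
print(names)          # every name in this namespace bound to obj
print(B.__name__)     # the class remembers the name it was defined with
```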
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | What's up with the comma operator's precedence?
 | 
					
						
							|  |  |  | -----------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Comma is not an operator in Python.  Consider this session::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |     >>> "a" in "b", "a"
 | 
					
						
							| 
									
										
										
										
											2009-12-19 17:51:41 +00:00
										 |  |  |     (False, 'a')
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | Since the comma is not an operator, but a separator between expressions, the
 | 
					
						
							|  |  |  | above is evaluated as if you had entered::
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2013-06-19 16:58:26 -04:00
										 |  |  |     ("a" in "b"), "a"
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | not::
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2013-06-19 16:58:26 -04:00
										 |  |  |     "a" in ("b", "a")
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | The same is true of the various assignment operators (``=``, ``+=``, etc.).  They
 | 
					
						
							|  |  |  | are not truly operators but syntactic delimiters in assignment statements.
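The two groupings can be compared directly:

```python
as_written = "a" in "b", "a"      # parsed as ("a" in "b"), "a"
grouped = "a" in ("b", "a")       # membership test against a tuple

print(as_written)
print(grouped)
```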
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Is there an equivalent of C's "?:" ternary operator?
 | 
					
						
							|  |  |  | ----------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2011-12-03 22:11:11 +01:00
										 |  |  | Yes, there is. The syntax is as follows::
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  |    [on_true] if [expression] else [on_false]
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    x, y = 50, 25
 | 
					
						
							|  |  |  |    small = x if x < y else y
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2011-12-03 22:11:11 +01:00
										 |  |  | Before this syntax was introduced in Python 2.5, a common idiom was to use
 | 
					
						
							|  |  |  | logical operators::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    [expression] and [on_true] or [on_false]
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | However, this idiom is unsafe, as it can give wrong results when *on_true*
 | 
					
						
							|  |  |  | has a false boolean value.  Therefore, it is always better to use
 | 
					
						
							|  |  |  | the ``... if ... else ...`` form.
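A concrete case where the old idiom goes wrong, using ``0`` as the false-valued *on_true* (the variable names are illustrative):

```python
condition = True
on_true = 0          # a legitimate result that happens to be false
on_false = 99

old_idiom = condition and on_true or on_false     # falls through to on_false
conditional = on_true if condition else on_false  # picks on_true as intended

print(old_idiom, conditional)
```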
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
Is it possible to write obfuscated one-liners in Python?
--------------------------------------------------------

Yes.  Usually this is done by nesting :keyword:`lambda` within
:keyword:`!lambda`.  See the following three examples, slightly adapted from
Ulf Bartelt::

   from functools import reduce

   # Primes < 1000
   print(list(filter(None,map(lambda y:y*reduce(lambda x,y:x*y!=0,
   map(lambda x,y=y:y%x,range(2,int(pow(y,0.5)+1))),1),range(2,1000)))))

   # First 10 Fibonacci numbers
   print(list(map(lambda x,f=lambda x,f:(f(x-1,f)+f(x-2,f)) if x>1 else 1:
   f(x,f), range(10))))

   # Mandelbrot set
   print((lambda Ru,Ro,Iu,Io,IM,Sx,Sy:reduce(lambda x,y:x+'\n'+y,map(lambda y,
   Iu=Iu,Io=Io,Ru=Ru,Ro=Ro,Sy=Sy,L=lambda yc,Iu=Iu,Io=Io,Ru=Ru,Ro=Ro,i=IM,
   Sx=Sx,Sy=Sy:reduce(lambda x,y:x+y,map(lambda x,xc=Ru,yc=yc,Ru=Ru,Ro=Ro,
   i=i,Sx=Sx,F=lambda xc,yc,x,y,k,f=lambda xc,yc,x,y,k,f:(k<=0)or (x*x+y*y
   >=4.0) or 1+f(xc,yc,x*x-y*y+xc,2.0*x*y+yc,k-1,f):f(xc,yc,x,y,k,f):chr(
   64+F(Ru+x*(Ro-Ru)/Sx,yc,0,0,i)),range(Sx))):L(Iu+y*(Io-Iu)/Sy),range(Sy
   ))))(-2.1, 0.7, -1.2, 1.2, 30, 80, 24))
   #    \___ ___/  \___ ___/  |   |   |__ lines on screen
   #        V          V      |   |______ columns on screen
   #        |          |      |__________ maximum of "iterations"
   #        |          |_________________ range on y axis
   #        |____________________________ range on x axis

Don't try this at home, kids!
									
										
										
										
.. _faq-positional-only-arguments:

What does the slash (/) in the parameter list of a function mean?
-----------------------------------------------------------------

A slash in the parameter list of a function denotes that the parameters prior
to it are positional-only.  Positional-only parameters are the ones without an
externally usable name.  Upon calling a function that accepts positional-only
parameters, arguments are mapped to parameters based solely on their position.
For example, :func:`divmod` is a function that accepts positional-only
parameters. Its documentation looks like this::

   >>> help(divmod)
   Help on built-in function divmod in module builtins:

   divmod(x, y, /)
       Return the tuple (x//y, x%y).  Invariant: div*y + mod == x.

The slash at the end of the parameter list means that both parameters are
positional-only. Thus, calling :func:`divmod` with keyword arguments would lead
to an error::

   >>> divmod(x=3, y=4)
   Traceback (most recent call last):
     File "<stdin>", line 1, in <module>
   TypeError: divmod() takes no keyword arguments

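The same marker can be used in functions you write yourself, assuming
Python 3.8 or later.  The following is a minimal sketch; ``clamp`` is a
hypothetical function invented for this example.

```python
# Hypothetical function for illustration; requires Python 3.8 or later.
def clamp(value, low, high, /):
    """Return value limited to the inclusive range [low, high]."""
    return max(low, min(value, high))

print(clamp(15, 0, 10))  # positional call works: 10

try:
    clamp(value=15, low=0, high=10)
except TypeError as exc:
    # Keyword use of a positional-only parameter is rejected.
    print("TypeError:", exc)
```

Making the parameters positional-only leaves you free to rename them later
without breaking callers.
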
									
										
										
										
Numbers and strings
===================

How do I specify hexadecimal and octal integers?
------------------------------------------------

To specify an octal integer, precede the octal value with a zero, and then a
lower or uppercase "o".  For example, to set the variable "a" to the octal
value "10" (8 in decimal), type::

   >>> a = 0o10
   >>> a
   8

Hexadecimal is just as easy.  Simply precede the hexadecimal number with a zero,
and then a lower or uppercase "x".  Hexadecimal digits can be specified in lower
or uppercase.  For example, in the Python interpreter::

   >>> a = 0xa5
   >>> a
   165
   >>> b = 0XB2
   >>> b
   178

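As a quick cross-check, the sketch below writes one value in several
notations and converts back with built-in functions.  The binary ``0b``
prefix and :func:`bin` are not mentioned above; they are included here only
for comparison.

```python
# The same integer written in four notations.
values = [0o10, 0x8, 0b1000, 8]
print(values)                    # [8, 8, 8, 8]

# Case does not matter for the prefix or for hexadecimal digits.
print(0XB2 == 0xb2)              # True

# Going the other way: built-in functions return prefixed strings.
print(oct(8), hex(165), bin(8))  # 0o10 0xa5 0b1000
```
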
									
										
										
										
Why does -22 // 10 return -3?
-----------------------------

It's primarily driven by the desire that ``i % j`` have the same sign as ``j``.
If you want that, and also want::

    i == (i // j) * j + (i % j)

then integer division has to return the floor.  C also requires that identity to
hold, and then compilers that truncate ``i // j`` need to make ``i % j`` have
the same sign as ``i``.

There are few real use cases for ``i % j`` when ``j`` is negative.  When ``j``
is positive, there are many, and in virtually all of them it's more useful for
``i % j`` to be ``>= 0``.  If the clock says 10 now, what did it say 200 hours
ago?  ``-190 % 12 == 2`` is useful; ``-190 % 12 == -10`` is a bug waiting to
bite.

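The identity and the clock example can be checked directly.  The comparison
with :func:`math.fmod` is an addition for illustration: it follows the
truncating convention described above for C.

```python
import math

i, j = -22, 10

# Floor division rounds toward negative infinity.
print(i // j)                       # -3
print(i % j)                        # 8, same sign as j
print(i == (i // j) * j + (i % j))  # True

# Clock example: 10 o'clock now, 200 hours ago.
print((10 - 200) % 12)              # 2

# For comparison, math.fmod() follows C's truncating convention,
# so its result takes the sign of the first argument.
print(math.fmod(-190, 12))          # -10.0
```
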
									
										
										
										
How do I get int literal attribute instead of SyntaxError?
----------------------------------------------------------

Trying to look up an ``int`` literal attribute in the normal manner gives
a :exc:`SyntaxError` because the period is seen as a decimal point::

   >>> 1.__class__
     File "<stdin>", line 1
     1.__class__
      ^
   SyntaxError: invalid decimal literal

The solution is to separate the literal from the period
with either a space or parentheses::

   >>> 1 .__class__
   <class 'int'>
   >>> (1).__class__
   <class 'int'>
									
										
										
										
How do I convert a string to a number?
--------------------------------------

For integers, use the built-in :func:`int` type constructor, e.g.
``int('144') == 144``.  Similarly, :func:`float` converts to floating-point,
e.g. ``float('144') == 144.0``.

By default, these interpret the number as decimal, so that
``int('0144') == 144`` holds true, and ``int('0x144')`` raises
:exc:`ValueError`.  ``int(string, base)`` takes the base to convert from as an
optional second argument, so ``int('0x144', 16) == 324``.  If the base is
specified as 0, the number is interpreted using the rules for Python integer
literals: a leading '0b' indicates binary, '0o' indicates octal, and '0x'
indicates a hexadecimal number.

Do not use the built-in function :func:`eval` if all you need is to convert
strings to numbers.  :func:`eval` will be significantly slower and it presents a
security risk: someone could pass you a Python expression that might have
unwanted side effects.  For example, someone could pass
``__import__('os').system("rm -rf $HOME")`` which would erase your home
directory.

:func:`eval` also has the effect of interpreting numbers as Python expressions,
so that e.g. ``eval('09')`` gives a syntax error because Python does not allow
leading '0' in a decimal number (except '0').

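One way to combine these conversions without :func:`eval` is sketched below.
``parse_number`` is a hypothetical helper written for this example, not a
standard library function.

```python
def parse_number(text):
    """Convert text to int if possible, otherwise to float.

    Hypothetical helper for illustration; raises ValueError for
    anything that is neither an integer nor a float literal.
    """
    try:
        # Base 0 accepts the same prefixes as Python source code
        # ('0x', '0o', '0b') in addition to plain decimal digits.
        return int(text, 0)
    except ValueError:
        return float(text)

print(parse_number("144"))     # 144
print(parse_number("0x144"))   # 324
print(parse_number("0o144"))   # 100
print(parse_number("1.5e3"))   # 1500.0
```

Input that is not a number, such as the malicious expression shown earlier,
simply raises :exc:`ValueError` instead of being executed.
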
					
						
How do I convert a number to a string?
--------------------------------------

To convert, e.g., the number ``144`` to the string ``'144'``, use the built-in
type constructor :func:`str`.  If you want a hexadecimal or octal
representation, use the built-in functions :func:`hex` or :func:`oct`.  For
fancy formatting, see the :ref:`f-strings` and :ref:`formatstrings` sections,
e.g. ``"{:04d}".format(144)`` yields ``'0144'`` and
``"{:.3f}".format(1.0/3.0)`` yields ``'0.333'``.

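The conversions mentioned above, side by side.  The f-string and
:func:`format` lines are equivalents added for comparison; they use the same
format specifications as ``str.format()``.

```python
n = 144

print(str(n))                  # 144
print(hex(n), oct(n))          # 0x90 0o220

# Format specifications work the same way in f-strings,
# str.format() and the built-in format().
print(f"{n:04d}")              # 0144
print("{:.3f}".format(1 / 3))  # 0.333
print(format(n, "x"))          # 90
```
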
					
						
How do I modify a string in place?
----------------------------------

You can't, because strings are immutable.  In most situations, you should
simply construct a new string from the various parts you want to assemble
it from.  However, if you need an object that can modify Unicode data in
place, try using an :class:`io.StringIO` object or the :mod:`array`
module::

   >>> import io
   >>> s = "Hello, world"
   >>> sio = io.StringIO(s)
   >>> sio.getvalue()
   'Hello, world'
   >>> sio.seek(7)
   7
   >>> sio.write("there!")
   6
   >>> sio.getvalue()
   'Hello, there!'

   >>> import array
   >>> a = array.array('w', s)
   >>> print(a)
   array('w', 'Hello, world')
   >>> a[0] = 'y'
   >>> print(a)
   array('w', 'yello, world')
   >>> a.tounicode()
   'yello, world'

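For the common case of constructing a new string from parts, a minimal
sketch might look like this.  The list-and-join approach is one usual way to
batch several single-character edits; it is offered as an illustration, not
as the only option.

```python
s = "Hello, world"

# Build a new string from slices of the old one.
t = s[:7] + "there!"
print(t)   # Hello, there!

# For many single-character edits, work on a list and join once at the end.
chars = list(s)
chars[0] = "y"
u = "".join(chars)
print(u)   # yello, world
print(s)   # Hello, world (the original is unchanged)
```
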
					
						
How do I use strings to call functions/methods?
-----------------------------------------------

There are various techniques.

* The best is to use a dictionary that maps strings to functions.  The primary
  advantage of this technique is that the strings do not need to match the names
  of the functions.  This is also the primary technique used to emulate a case
  construct::

     def a():
         pass

     def b():
         pass

     dispatch = {'go': a, 'stop': b}  # Note lack of parens for funcs

     dispatch[get_input()]()  # Note trailing parens to call function

* Use the built-in function :func:`getattr`::

     import foo
     getattr(foo, 'bar')()

  Note that :func:`getattr` works on any object, including classes, class
  instances, modules, and so on.

  This is used in several places in the standard library, like this::

     class Foo:
         def do_foo(self):
             ...

         def do_bar(self):
             ...

     f = getattr(foo_instance, 'do_' + opname)
     f()


* Use :func:`locals` to resolve the function name::

     def myFunc():
         print("hello")

     fname = "myFunc"

     f = locals()[fname]
     f()

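The snippets above leave out input handling.  A self-contained sketch of the
first two techniques, with a fallback for unknown names, could look like the
following.  All function and class names here (``start``, ``stop``, ``run``,
``Robot``, ``run_method``) are hypothetical.

```python
# Hypothetical command handlers for illustration.
def start():
    return "starting"

def stop():
    return "stopping"

# Dictionary dispatch: the keys need not match the function names.
dispatch = {"go": start, "halt": stop}

def run(command):
    """Look up and call the handler for command, or report an unknown one."""
    handler = dispatch.get(command)
    if handler is None:
        return f"unknown command: {command!r}"
    return handler()

class Robot:
    def do_wave(self):
        return "waving"

def run_method(obj, opname):
    """Call obj.do_<opname>() if it exists, using getattr() with a default."""
    method = getattr(obj, "do_" + opname, None)
    if method is None:
        return f"unsupported operation: {opname!r}"
    return method()

print(run("go"))                    # starting
print(run("dance"))                 # unknown command: 'dance'
print(run_method(Robot(), "wave"))  # waving
print(run_method(Robot(), "jump"))  # unsupported operation: 'jump'
```

Restricting lookups to a dictionary or to a ``do_`` prefix also limits which
callables an untrusted string can reach.
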
					
						
Is there an equivalent to Perl's chomp() for removing trailing newlines from strings?
-------------------------------------------------------------------------------------

You can use ``S.rstrip("\r\n")`` to remove all occurrences of any line
terminator from the end of the string ``S`` without removing other trailing
whitespace.  If the string ``S`` represents more than one line, with several
empty lines at the end, the line terminators for all the blank lines will
be removed::

   >>> lines = ("line 1 \r\n"
   ...          "\r\n"
   ...          "\r\n")
   >>> lines.rstrip("\n\r")
   'line 1 '

Since this is typically only desired when reading text one line at a time, using
``S.rstrip()`` this way works well.

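A sketch of the line-at-a-time case follows.  The sample text is invented,
and ``str.removesuffix()`` (available from Python 3.9) is shown as an
assumption-labelled alternative for when at most one terminator should be
dropped.

```python
text = "alpha  \r\nbeta\n\ngamma"

# keepends=True preserves the terminators, as reading from a file would.
raw_lines = text.splitlines(keepends=True)
cleaned = [line.rstrip("\r\n") for line in raw_lines]
print(cleaned)  # ['alpha  ', 'beta', '', 'gamma']

# rstrip() removes every trailing terminator; to drop at most one "\n",
# str.removesuffix() (Python 3.9+) is closer to Perl's chomp.
print(repr("done\n\n".rstrip("\n")))        # 'done'
print(repr("done\n\n".removesuffix("\n")))  # 'done\n'
```
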
					
						
Is there a scanf() or sscanf() equivalent?
------------------------------------------

Not as such.

For simple input parsing, the easiest approach is usually to split the line into
whitespace-delimited words using the :meth:`~str.split` method of string objects
and then convert decimal strings to numeric values using :func:`int` or
:func:`float`.  :meth:`!split()` supports an optional "sep" parameter which is
useful if the line uses something other than whitespace as a separator.

For more complicated input parsing, regular expressions are more powerful
than C's ``sscanf`` and better suited for the task.

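Both approaches are sketched below on made-up input.  The field layout and
the ``key = value`` pattern are assumptions chosen for the example.

```python
import re

# Simple case: whitespace-delimited fields, similar to sscanf("%s %d %f").
line = "widget 12 3.75"
name, count, price = line.split()
count, price = int(count), float(price)
print(name, count, price)  # widget 12 3.75

# A different separator is passed as the first argument to split().
fields = "2024-01-15".split("-")
print([int(f) for f in fields])  # [2024, 1, 15]

# More complex case: a regular expression with named groups.
pattern = re.compile(r"(?P<key>\w+)\s*=\s*(?P<value>-?\d+)")
match = pattern.fullmatch("timeout = -30")
if match:
    print(match["key"], int(match["value"]))  # timeout -30
```
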
									
										
										
										
What does 'UnicodeDecodeError' or 'UnicodeEncodeError' error mean?
------------------------------------------------------------------

See the :ref:`unicode-howto`.

									
										
										
										
.. _faq-programming-raw-string-backslash:

Can I end a raw string with an odd number of backslashes?
---------------------------------------------------------

A raw string ending with an odd number of backslashes will escape the string's
quote::

   >>> r'C:\this\will\not\work\'
     File "<stdin>", line 1
       r'C:\this\will\not\work\'
            ^
   SyntaxError: unterminated string literal (detected at line 1)

There are several workarounds for this. One is to use regular strings and double
the backslashes::

   >>> 'C:\\this\\will\\work\\'
   'C:\\this\\will\\work\\'

Another is to concatenate a regular string containing an escaped backslash to
the raw string::

   >>> r'C:\this\will\work' '\\'
   'C:\\this\\will\\work\\'

It is also possible to use :func:`os.path.join` to append a backslash on
Windows::

   >>> import os
   >>> os.path.join(r'C:\this\will\work', '')
   'C:\\this\\will\\work\\'

Note that while a backslash will "escape" a quote for the purposes of
determining where the raw string ends, no escaping occurs when interpreting the
value of the raw string. That is, the backslash remains present in the value of
the raw string::

   >>> r'backslash\'preserved'
   "backslash\\'preserved"

Also see the specification in the :ref:`language reference <strings>`.
									
										
										
										
Performance
===========

My program is too slow. How do I speed it up?
---------------------------------------------

That's a tough one, in general.  First, here is a list of things to
remember before diving further:

* Performance characteristics vary across Python implementations.  This FAQ
  focuses on :term:`CPython`.
* Behaviour can vary across operating systems, especially when talking about
  I/O or multi-threading.
* You should always find the hot spots in your program *before* attempting to
  optimize any code (see the :mod:`profile` module).
* Writing benchmark scripts will allow you to iterate quickly when searching
  for improvements (see the :mod:`timeit` module).
* It is highly recommended to have good code coverage (through unit testing
  or any other technique) before potentially introducing regressions hidden
  in sophisticated optimizations.

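A minimal sketch of the measure-first workflow is shown below.  The two
functions are hypothetical workloads, and :mod:`cProfile` is used as the
profiler; timings are printed rather than asserted because they depend on
the machine.

```python
import cProfile
import io
import pstats
import timeit

def slow_join(n):
    """Hypothetical workload: build a string by repeated concatenation."""
    result = ""
    for i in range(n):
        result += str(i)
    return result

def fast_join(n):
    """Same result using str.join()."""
    return "".join(str(i) for i in range(n))

# timeit: compare two candidates on identical input.
for func in (slow_join, fast_join):
    seconds = timeit.timeit(lambda: func(10_000), number=20)
    print(f"{func.__name__}: {seconds:.4f} s for 20 runs")

# cProfile: find where the time goes inside one call.
profiler = cProfile.Profile()
profiler.enable()
slow_join(10_000)
profiler.disable()

report = io.StringIO()
pstats.Stats(profiler, stream=report).sort_stats("cumulative").print_stats(5)
print(report.getvalue())
```

Checking that both candidates return the same result before comparing their
speed guards against the regressions mentioned in the last point.
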
					
						
							|  |  |  | That being said, there are many tricks to speed up Python code.  Here are
 | 
					
						
							|  |  |  | some general principles which go a long way towards reaching acceptable
 | 
					
						
							|  |  |  | performance levels:
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | * Making your algorithms faster (or changing to faster ones) can yield
 | 
					
						
							|  |  |  |   much larger benefits than trying to sprinkle micro-optimization tricks
 | 
					
						
							|  |  |  |   all over your code.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | * Use the right data structures.  Study documentation for the :ref:`bltin-types`
 | 
					
						
							|  |  |  |   and the :mod:`collections` module.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | * When the standard library provides a primitive for doing something, it is
 | 
					
						
							|  |  |  |   likely (although not guaranteed) to be faster than any alternative you
 | 
					
						
							|  |  |  |   may come up with.  This is doubly true for primitives written in C, such
 | 
					
						
							|  |  |  |   as builtins and some extension types.  For example, be sure to use
 | 
					
						
							|  |  |  |   either the :meth:`list.sort` built-in method or the related :func:`sorted`
 | 
					
						
							| 
									
										
										
										
											2016-01-01 23:25:58 -08:00
										 |  |  |   function to do sorting (and see the :ref:`sortinghowto` for examples
 | 
					
						
							| 
									
										
										
										
											2011-12-09 23:10:31 +01:00
										 |  |  |   of moderately advanced usage).
 | 
					
						
							|  |  |  | 
 | 
					
						
* Abstractions tend to create indirections and force the interpreter to work
  more.  If the levels of indirection outweigh the amount of useful work
  done, your program will be slower.  You should avoid excessive abstraction,
  especially in the form of tiny functions or methods (which are also often
  detrimental to readability).
 | 
					
						
							|  |  |  | 
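
To make the "right data structures" principle above more concrete, here is a small sketch using :mod:`collections` and the built-in :class:`set`.  The sample data is made up for illustration.

```python
from collections import Counter, deque

words = ["spam", "egg", "spam", "ham", "spam", "egg"]

# Counter replaces a hand-written counting loop over a dict.
counts = Counter(words)
most_common_word, occurrences = counts.most_common(1)[0]

# A set gives average constant-time membership tests; a list scans linearly.
seen = set(words)
has_ham = "ham" in seen

# deque supports efficient appends and pops at both ends, unlike list.pop(0).
queue = deque(words)
first = queue.popleft()
```

Whether any of these is actually faster for your workload is something to confirm with a benchmark rather than assume.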
 | 
					
						
If you have reached the limit of what pure Python can allow, there are tools
to take you further.  For example, `Cython <https://cython.org>`_ can
 | 
					
						
							| 
									
										
										
										
											2011-12-09 23:10:31 +01:00
										 |  |  | compile a slightly modified version of Python code into a C extension, and
 | 
					
						
							|  |  |  | can be used on many different platforms.  Cython can take advantage of
 | 
					
						
							|  |  |  | compilation (and optional type annotations) to make your code significantly
 | 
					
						
							|  |  |  | faster than when interpreted.  If you are confident in your C programming
 | 
					
						
							|  |  |  | skills, you can also :ref:`write a C extension module <extending-index>`
 | 
					
						
							|  |  |  | yourself.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | .. seealso::
 | 
					
						
							|  |  |  |    The wiki page devoted to `performance tips
 | 
					
						
							| 
									
										
										
										
											2014-10-29 08:36:35 +01:00
										 |  |  |    <https://wiki.python.org/moin/PythonSpeed/PerformanceTips>`_.
 | 
					
						
							| 
									
										
										
										
											2011-12-09 23:10:31 +01:00
										 |  |  | 
 | 
					
						
							|  |  |  | .. _efficient_string_concatenation:
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2011-11-25 16:33:53 +01:00
										 |  |  | What is the most efficient way to concatenate many strings together?
 | 
					
						
							|  |  |  | --------------------------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | :class:`str` and :class:`bytes` objects are immutable, therefore concatenating
 | 
					
						
							|  |  |  | many strings together is inefficient as each concatenation creates a new
 | 
					
						
							|  |  |  | object.  In the general case, the total runtime cost is quadratic in the
 | 
					
						
							|  |  |  | total string length.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | To accumulate many :class:`str` objects, the recommended idiom is to place
 | 
					
						
							|  |  |  | them into a list and call :meth:`str.join` at the end::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    chunks = []
 | 
					
						
							|  |  |  |    for s in my_strings:
 | 
					
						
							|  |  |  |        chunks.append(s)
 | 
					
						
							|  |  |  |    result = ''.join(chunks)
 | 
					
						
							|  |  |  | 
 | 
					
						
Another reasonably efficient idiom is to write the pieces to an
:class:`io.StringIO` object.
 | 
					
						
							|  |  |  | 
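
The :class:`io.StringIO` idiom might look like the following sketch; ``my_strings`` is placeholder input, as in the example above.

```python
import io

my_strings = ["spam", " and ", "eggs"]  # placeholder input

buffer = io.StringIO()
for s in my_strings:
    buffer.write(s)            # appends to an in-memory text buffer
result = buffer.getvalue()     # retrieve everything written so far as one str
```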
 | 
					
						
							|  |  |  | To accumulate many :class:`bytes` objects, the recommended idiom is to extend
 | 
					
						
							|  |  |  | a :class:`bytearray` object using in-place concatenation (the ``+=`` operator)::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    result = bytearray()
 | 
					
						
							|  |  |  |    for b in my_bytes_objects:
 | 
					
						
							|  |  |  |        result += b
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | Sequences (Tuples/Lists)
 | 
					
						
							|  |  |  | ========================
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | How do I convert between tuples and lists?
 | 
					
						
							|  |  |  | ------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The type constructor ``tuple(seq)`` converts any sequence (actually, any
 | 
					
						
							|  |  |  | iterable) into a tuple with the same items in the same order.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | For example, ``tuple([1, 2, 3])`` yields ``(1, 2, 3)`` and ``tuple('abc')``
 | 
					
						
							|  |  |  | yields ``('a', 'b', 'c')``.  If the argument is a tuple, it does not make a copy
 | 
					
						
							|  |  |  | but returns the same object, so it is cheap to call :func:`tuple` when you
 | 
					
						
							|  |  |  | aren't sure that an object is already a tuple.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The type constructor ``list(seq)`` converts any sequence or iterable into a list
 | 
					
						
							|  |  |  | with the same items in the same order.  For example, ``list((1, 2, 3))`` yields
 | 
					
						
							|  |  |  | ``[1, 2, 3]`` and ``list('abc')`` yields ``['a', 'b', 'c']``.  If the argument
 | 
					
						
							|  |  |  | is a list, it makes a copy just like ``seq[:]`` would.
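
A quick sketch of the difference described above, using identity checks to show which constructor copies.  The variable names are arbitrary.

```python
t = (1, 2, 3)
same_tuple = tuple(t)      # no copy is made for a tuple argument

lst = [1, 2, 3]
new_list = list(lst)       # a new list with the same items (a shallow copy)

# Any iterable works, including a generator expression.
squares = tuple(x * x for x in range(4))

print(same_tuple is t)
print(new_list == lst, new_list is lst)
print(squares)
```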
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | What's a negative index?
 | 
					
						
							|  |  |  | ------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
Python sequences are indexed with positive numbers and negative numbers.  For
positive numbers, 0 is the first index, 1 is the second index, and so forth.  For
negative indices, -1 is the last index, -2 is the penultimate (next to last)
index, and so forth.  Think of ``seq[-n]`` as the same as ``seq[len(seq)-n]``.
 | 
					
						
							|  |  |  | 
 | 
					
						
Using negative indices can be very convenient.  For example, ``S[:-1]`` is all of
the string except for its last character, which is useful for removing the
trailing newline from a string that is known to end with one.
 | 
					
						
							|  |  |  | 
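
A short sketch of negative indexing in practice.  Note that the ``seq[len(seq)-n]`` equivalence is only meaningful for ``n`` from 1 to ``len(seq)``, since ``seq[-0]`` is simply ``seq[0]``.  The use of :meth:`str.rstrip` is an addition for comparison, not something the answer above prescribes.

```python
seq = ["a", "b", "c", "d"]

# For n from 1 to len(seq), seq[-n] and seq[len(seq) - n] name the same item.
for n in range(1, len(seq) + 1):
    assert seq[-n] == seq[len(seq) - n]

last = seq[-1]
all_but_last = seq[:-1]

line = "hello\n"
chopped = line[:-1]                    # drops the last character, whatever it is
stripped = "no newline".rstrip("\n")   # only removes newlines that are present
```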
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | How do I iterate over a sequence in reverse order?
 | 
					
						
							|  |  |  | --------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2020-10-21 05:25:07 -03:00
										 |  |  | Use the :func:`reversed` built-in function::
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  |    for x in reversed(sequence):
 | 
					
						
							| 
									
										
										
										
											2016-05-10 12:01:23 +03:00
										 |  |  |        ...  # do something with x ...
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
This won't touch your original sequence.  :func:`reversed` returns an iterator
that yields the items in reverse order; it does not build a reversed copy.
 | 
					
						
							|  |  |  | 
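
The following sketch shows that the original list is left untouched.  The object returned by :func:`reversed` can be consumed only once, so a slice with a step of ``-1`` is shown as an alternative when an actual reversed list is needed.

```python
sequence = [1, 2, 3]

rev = reversed(sequence)
collected = list(rev)          # consume the reverse iterator
leftover = list(rev)           # a second pass finds it exhausted

# If a real reversed list is required, slicing builds one.
reversed_copy = sequence[::-1]
```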
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | How do you remove duplicates from a list?
 | 
					
						
							|  |  |  | -----------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | See the Python Cookbook for a long discussion of many ways to do this:
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2020-09-26 21:47:25 -03:00
										 |  |  |    https://code.activestate.com/recipes/52560/
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | If you don't mind reordering the list, sort it and then scan from the end of the
 | 
					
						
							|  |  |  | list, deleting duplicates as you go::
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2009-12-19 17:51:41 +00:00
										 |  |  |    if mylist:
 | 
					
						
							|  |  |  |        mylist.sort()
 | 
					
						
							|  |  |  |        last = mylist[-1]
 | 
					
						
							|  |  |  |        for i in range(len(mylist)-2, -1, -1):
 | 
					
						
							|  |  |  |            if last == mylist[i]:
 | 
					
						
							|  |  |  |                del mylist[i]
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  |            else:
 | 
					
						
							| 
									
										
										
										
											2009-12-19 17:51:41 +00:00
										 |  |  |                last = mylist[i]
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2011-12-03 22:19:55 +01:00
If all elements of the list may be used as set keys (i.e. they are all
:term:`hashable`), this is often faster, although the original order is lost::
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2009-12-19 17:51:41 +00:00
										 |  |  |    mylist = list(set(mylist))
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | This converts the list into a set, thereby removing duplicates, and then back
 | 
					
						
							|  |  |  | into a list.
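
If the order of first occurrence matters, one alternative (not mentioned in the answer above, and assuming Python 3.7 or later, where dictionaries preserve insertion order) is to go through :meth:`dict.fromkeys` instead of a set.  The elements still need to be hashable.

```python
mylist = [3, 1, 3, 2, 1]

# Dictionary keys are unique and keep the order in which they were first seen.
deduped = list(dict.fromkeys(mylist))
```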
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2020-09-29 01:02:44 -04:00
How do you remove multiple items from a list?
---------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
As with removing duplicates, explicitly iterating in reverse with a
delete condition is one possibility.  However, it is easier and faster
to use slice replacement with an implicit or explicit forward iteration.
Here are three variations::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    mylist[:] = filter(keep_function, mylist)
 | 
					
						
							|  |  |  |    mylist[:] = (x for x in mylist if keep_condition)
 | 
					
						
							|  |  |  |    mylist[:] = [x for x in mylist if keep_condition]
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2020-10-05 10:31:44 -04:00
										 |  |  | The list comprehension may be fastest.
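
Here is a concrete sketch of the slice-replacement approach.  ``keep_function`` is a hypothetical predicate chosen for illustration.  Because the assignment targets ``mylist[:]``, the list object is modified in place, so other references to it see the change.

```python
mylist = [1, 2, 3, 4, 5, 6]
alias = mylist                 # a second reference to the same list object


def keep_function(x):
    """Hypothetical predicate: keep only the even numbers."""
    return x % 2 == 0


# Replace the list's contents in place with the items to keep.
mylist[:] = [x for x in mylist if keep_function(x)]
```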
 | 
					
						
							| 
									
										
										
										
											2020-09-29 01:02:44 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | How do you make an array in Python?
 | 
					
						
							|  |  |  | -----------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Use a list::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    ["this", 1, "is", "an", "array"]
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Lists are equivalent to C or Pascal arrays in their time complexity; the primary
 | 
					
						
							|  |  |  | difference is that a Python list can contain objects of many different types.
 | 
					
						
							|  |  |  | 
 | 
					
						
The :mod:`array` module also provides methods for creating arrays of fixed types
with compact representations, but they are slower to index than lists.  Also
note that `NumPy <https://numpy.org/>`_ and other third-party packages define
array-like structures with various characteristics as well.
 | 
					
						
							|  |  |  | 
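
As a small sketch of the fixed-type behaviour, an :class:`array.array` created with type code ``'i'`` (signed int) accepts only integers:

```python
from array import array

numbers = array("i", [1, 2, 3])    # "i" is the type code for signed int
numbers.append(4)

# Unlike a list, an array rejects items of the wrong type.
try:
    numbers.append("five")
except TypeError as exc:
    error = exc
```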
 | 
					
						
							| 
									
										
										
										
											2022-11-01 21:58:44 -07:00
										 |  |  | To get Lisp-style linked lists, you can emulate *cons cells* using tuples::
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  |    lisp_list = ("like",  ("this",  ("example", None) ) )
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | If mutability is desired, you could use lists instead of tuples.  Here the
 | 
					
						
							| 
									
										
										
										
											2022-11-01 21:58:44 -07:00
										 |  |  | analogue of a Lisp *car* is ``lisp_list[0]`` and the analogue of *cdr* is
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | ``lisp_list[1]``.  Only do this if you're sure you really need to, because it's
 | 
					
						
							|  |  |  | usually a lot slower than using Python lists.
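
Walking such a structure could look like the following sketch.  ``iter_cons`` is a hypothetical helper name, and ``None`` is assumed to mark the end of the list, as in the example above.

```python
lisp_list = ("like", ("this", ("example", None)))


def iter_cons(cell):
    """Yield the car of each cons cell until the terminating None."""
    while cell is not None:
        car, cdr = cell
        yield car
        cell = cdr


items = list(iter_cons(lisp_list))
```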
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2015-09-07 02:08:55 +00:00
										 |  |  | .. _faq-multidimensional-list:
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | How do I create a multidimensional list?
 | 
					
						
							|  |  |  | ----------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | You probably tried to make a multidimensional array like this::
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2013-06-19 16:58:26 -04:00
										 |  |  |    >>> A = [[None] * 2] * 3
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-06-04 20:07:34 -07:00
										 |  |  | This looks correct if you print it:
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | .. testsetup::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    A = [[None] * 2] * 3
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | .. doctest::
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  |    >>> A
 | 
					
						
							|  |  |  |    [[None, None], [None, None], [None, None]]
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | But when you assign a value, it shows up in multiple places:
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-06-04 20:07:34 -07:00
										 |  |  | .. testsetup::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    A = [[None] * 2] * 3
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | .. doctest::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    >>> A[0][0] = 5
 | 
					
						
							|  |  |  |    >>> A
 | 
					
						
							|  |  |  |    [[5, None], [5, None], [5, None]]
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
The reason is that replicating a list with ``*`` doesn't create copies; it only
creates references to the existing objects.  The ``*3`` creates a list
containing 3 references to the same list of length two.  Changes to one row will
show in all rows, which is almost certainly not what you want.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The suggested approach is to create a list of the desired length first and then
 | 
					
						
							|  |  |  | fill in each element with a newly created list::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    A = [None] * 3
 | 
					
						
							|  |  |  |    for i in range(3):
 | 
					
						
							|  |  |  |        A[i] = [None] * 2
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | This generates a list containing 3 different lists of length two.  You can also
 | 
					
						
							|  |  |  | use a list comprehension::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    w, h = 2, 3
 | 
					
						
							|  |  |  |    A = [[None] * w for i in range(h)]
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2016-05-26 22:51:32 -07:00
										 |  |  | Or, you can use an extension that provides a matrix datatype; `NumPy
 | 
					
						
							| 
									
										
										
										
											2022-09-24 14:38:53 +03:00
										 |  |  | <https://numpy.org/>`_ is the best known.
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | 
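
One way to check which construction you have is to compare row identities with ``is``.  A minimal sketch, with arbitrary variable names:

```python
w, h = 2, 3

shared = [[None] * w] * h                      # h references to one row
independent = [[None] * w for _ in range(h)]   # h separate rows

# Every row of the replicated list is the very same object.
shared_rows_identical = all(row is shared[0] for row in shared)

# Assigning into one independent row leaves the others alone.
independent[0][0] = 5
```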
 | 
					
						
							| 
									
										
										
										
											2022-11-10 20:16:16 -08:00
										 |  |  | How do I apply a method or function to a sequence of objects?
 | 
					
						
							|  |  |  | -------------------------------------------------------------
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2022-11-10 20:16:16 -08:00
To call a method or function and accumulate the return values in a list,
a :term:`list comprehension` is an elegant solution::
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2009-12-19 17:51:41 +00:00
										 |  |  |    result = [obj.method() for obj in mylist]
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2022-11-10 20:16:16 -08:00
										 |  |  |    result = [function(obj) for obj in mylist]
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | To just run the method or function without saving the return values,
 | 
					
						
							|  |  |  | a plain :keyword:`for` loop will suffice::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    for obj in mylist:
 | 
					
						
							|  |  |  |        obj.method()
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    for obj in mylist:
 | 
					
						
							|  |  |  |        function(obj)
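
For a concrete case, here is a sketch with a list of strings.  The built-in :func:`map` is shown as a further option that the answer above does not mention; it returns a lazy iterator, so it is wrapped in :class:`list` to collect the results.

```python
mylist = ["spam", "eggs"]

# Calling a method on each object.
upper = [s.upper() for s in mylist]

# Applying a function to each object.
lengths = [len(s) for s in mylist]
lengths_via_map = list(map(len, mylist))
```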
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2014-03-15 21:13:56 -07:00
										 |  |  | .. _faq-augmented-assignment-tuple-error:
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2013-05-20 10:32:46 -04:00
										 |  |  | Why does a_tuple[i] += ['item'] raise an exception when the addition works?
 | 
					
						
							|  |  |  | ---------------------------------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | This is because of a combination of the fact that augmented assignment
 | 
					
						
							|  |  |  | operators are *assignment* operators, and the difference between mutable and
 | 
					
						
							|  |  |  | immutable objects in Python.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | This discussion applies in general when augmented assignment operators are
 | 
					
						
							|  |  |  | applied to elements of a tuple that point to mutable objects, but we'll use
 | 
					
						
							|  |  |  | a ``list`` and ``+=`` as our exemplar.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | If you wrote::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    >>> a_tuple = (1, 2)
 | 
					
						
							|  |  |  |    >>> a_tuple[0] += 1
 | 
					
						
							|  |  |  |    Traceback (most recent call last):
 | 
					
						
							|  |  |  |       ...
 | 
					
						
							|  |  |  |    TypeError: 'tuple' object does not support item assignment
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The reason for the exception should be immediately clear: ``1`` is added to the
 | 
					
						
							|  |  |  | object ``a_tuple[0]`` points to (``1``), producing the result object, ``2``,
 | 
					
						
							|  |  |  | but when we attempt to assign the result of the computation, ``2``, to element
 | 
					
						
							|  |  |  | ``0`` of the tuple, we get an error because we can't change what an element of
 | 
					
						
							|  |  |  | a tuple points to.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Under the covers, what this augmented assignment statement is doing is
 | 
					
						
							|  |  |  | approximately this::
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2013-05-21 11:44:41 -04:00
										 |  |  |    >>> result = a_tuple[0] + 1
 | 
					
						
							| 
									
										
										
										
											2013-05-20 10:32:46 -04:00
										 |  |  |    >>> a_tuple[0] = result
 | 
					
						
							|  |  |  |    Traceback (most recent call last):
 | 
					
						
							|  |  |  |      ...
 | 
					
						
							|  |  |  |    TypeError: 'tuple' object does not support item assignment
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | It is the assignment part of the operation that produces the error, since a
 | 
					
						
							|  |  |  | tuple is immutable.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | When you write something like::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    >>> a_tuple = (['foo'], 'bar')
 | 
					
						
							|  |  |  |    >>> a_tuple[0] += ['item']
 | 
					
						
							|  |  |  |    Traceback (most recent call last):
 | 
					
						
							|  |  |  |      ...
 | 
					
						
							|  |  |  |    TypeError: 'tuple' object does not support item assignment
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The exception is a bit more surprising, and even more surprising is the fact
 | 
					
						
							|  |  |  | that even though there was an error, the append worked::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |     >>> a_tuple[0]
 | 
					
						
							|  |  |  |     ['foo', 'item']
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2013-05-21 11:44:41 -04:00
To see why this happens, you need to know that (a) if an object implements an
:meth:`~object.__iadd__` magic method, it gets called when the ``+=`` augmented
assignment is executed, and its return value is what gets used in the assignment
statement; and (b) for lists, :meth:`!__iadd__` is equivalent to calling
:meth:`~list.extend` on the list and returning the list.  That's why we say that
for lists, ``+=`` is a "shorthand" for :meth:`!list.extend`::
 | 
					
						
							| 
									
										
										
										
											2013-05-20 10:32:46 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  |     >>> a_list = []
 | 
					
						
							|  |  |  |     >>> a_list += [1]
 | 
					
						
							|  |  |  |     >>> a_list
 | 
					
						
							|  |  |  |     [1]
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2013-05-21 11:44:41 -04:00
										 |  |  | This is equivalent to::
 | 
					
						
							| 
									
										
										
										
											2013-05-20 10:32:46 -04:00
										 |  |  | 
 | 
					
						
							|  |  |  |     >>> result = a_list.__iadd__([1])
 | 
					
						
							|  |  |  |     >>> a_list = result
 | 
					
						
							|  |  |  | 
 | 
					
						
The object pointed to by ``a_list`` has been mutated, and the pointer to the
mutated object is assigned back to ``a_list``.  The end result of the
assignment is a no-op, since it is a pointer to the same object that ``a_list``
was previously pointing to, but the assignment still happens.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Thus, in our tuple example what is happening is equivalent to::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    >>> result = a_tuple[0].__iadd__(['item'])
 | 
					
						
							|  |  |  |    >>> a_tuple[0] = result
 | 
					
						
							|  |  |  |    Traceback (most recent call last):
 | 
					
						
							|  |  |  |      ...
 | 
					
						
							|  |  |  |    TypeError: 'tuple' object does not support item assignment
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2022-11-01 21:58:44 -07:00
										 |  |  | The :meth:`!__iadd__` succeeds, and thus the list is extended, but even though
 | 
					
						
							| 
									
										
										
										
											2013-05-20 10:32:46 -04:00
										 |  |  | ``result`` points to the same object that ``a_tuple[0]`` already points to,
 | 
					
						
							|  |  |  | that final assignment still results in an error, because tuples are immutable.
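
The whole sequence can be reproduced in a script, as in this sketch.  It also shows that calling :meth:`~list.extend` directly mutates the list without any assignment to the tuple, so no exception is raised.

```python
a_tuple = (["foo"], "bar")

try:
    a_tuple[0] += ["item"]       # __iadd__ succeeds, then the assignment fails
except TypeError as exc:
    message = str(exc)

after_failed_iadd = list(a_tuple[0])   # the list was extended despite the error

# Mutating the list directly involves no assignment to the tuple.
a_tuple[0].extend(["more"])
```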
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | I want to do a complicated sort: can you do a Schwartzian Transform in Python?
 | 
					
						
							|  |  |  | ------------------------------------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The technique, attributed to Randal Schwartz of the Perl community, sorts the
 | 
					
						
							|  |  |  | elements of a list by a metric which maps each element to its "sort value". In
 | 
					
						
							| 
									
										
										
										
											2016-06-01 13:54:33 -07:00
										 |  |  | Python, use the ``key`` argument for the :meth:`list.sort` method::
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  |    Isorted = L[:]
 | 
					
						
							|  |  |  |    Isorted.sort(key=lambda s: int(s[10:15]))
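
For comparison, here is a sketch that runs the ``key`` approach next to the classic decorate-sort-undecorate form of the transform.  The fixed-width records are made up so that characters 10 to 14 hold a number; the index in each decorated tuple is there only to break ties without comparing the records themselves.

```python
records = [("pear", 30), ("apple", 7), ("fig", 150)]
# Hypothetical fixed-width lines: a 10-character name, then a 5-digit number.
L = [f"{name:<10}{number:05d}" for name, number in records]

# The key function is called once per element.
Isorted = sorted(L, key=lambda s: int(s[10:15]))

# The same idea written out by hand: decorate, sort, undecorate.
decorated = [(int(s[10:15]), i, s) for i, s in enumerate(L)]
decorated.sort()
dsu_sorted = [s for _, _, s in decorated]
```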
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | How can I sort one list by values from another list?
 | 
					
						
							|  |  |  | ----------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2009-12-19 17:51:41 +00:00
Merge them into an iterator of tuples, sort the tuples into a list, and then
pick out the element you want::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    >>> list1 = ["what", "I'm", "sorting", "by"]
 | 
					
						
							|  |  |  |    >>> list2 = ["something", "else", "to", "sort"]
 | 
					
						
							|  |  |  |    >>> pairs = zip(list1, list2)
 | 
					
						
							| 
									
										
										
										
											2009-12-19 17:51:41 +00:00
										 |  |  |    >>> pairs = sorted(pairs)
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  |    >>> pairs
 | 
					
						
							| 
									
										
										
										
											2009-12-19 17:51:41 +00:00
										 |  |  |    [("I'm", 'else'), ('by', 'sort'), ('sorting', 'to'), ('what', 'something')]
 | 
					
						
							|  |  |  |    >>> result = [x[1] for x in pairs]
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  |    >>> result
 | 
					
						
							|  |  |  |    ['else', 'sort', 'to', 'something']
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2009-12-19 17:51:41 +00:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
Objects
=======

What is a class?
----------------

A class is the particular object type created by executing a class statement.
Class objects are used as templates to create instance objects, which embody
both the data (attributes) and code (methods) specific to a datatype.

A class can be based on one or more other classes, called its base class(es). It
then inherits the attributes and methods of its base classes. This allows an
object model to be successively refined by inheritance.  You might have a
generic ``Mailbox`` class that provides basic accessor methods for a mailbox,
and subclasses such as ``MboxMailbox``, ``MaildirMailbox``, ``OutlookMailbox``
that handle various specific mailbox formats.

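As a rough illustration of the ``Mailbox`` hierarchy described above, here is a
minimal sketch.  The ``count()`` and ``describe()`` methods and the in-memory
message list are hypothetical and are not taken from any real mailbox library.

```python
class Mailbox:
    """Generic mailbox: stores messages and offers basic accessors."""

    def __init__(self, messages):
        self.messages = list(messages)

    def count(self):
        return len(self.messages)

    def describe(self):
        return f"{type(self).__name__} with {self.count()} message(s)"


class MaildirMailbox(Mailbox):
    """Hypothetical subclass: inherits count() and refines describe()."""

    def describe(self):
        return "Maildir: " + super().describe()


box = MaildirMailbox(["hello", "world"])
print(box.count())      # count() is inherited from Mailbox
print(box.describe())   # describe() is refined by the subclass
```

The subclass only supplies what differs; everything else comes from the base
class.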

What is a method?
-----------------

A method is a function on some object ``x`` that you normally call as
``x.name(arguments...)``.  Methods are defined as functions inside the class
definition::

   class C:
       def meth(self, arg):
           return arg * 2 + self.attribute


What is self?
-------------

Self is merely a conventional name for the first argument of a method.  A method
defined as ``meth(self, a, b, c)`` should be called as ``x.meth(a, b, c)`` for
some instance ``x`` of the class in which the definition occurs; the called
method will think it is called as ``meth(x, a, b, c)``.

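The equivalence between the two call forms can be checked directly.  In this
sketch the class ``C`` and its ``meth`` are illustrative names only: calling the
method on the instance, and calling the plain function through the class with
the instance passed explicitly, give the same result.

```python
class C:
    def meth(self, a, b, c):
        # 'self' is just the first parameter; any name would work,
        # but 'self' is the universal convention.
        return (self, a, b, c)


x = C()
bound = x.meth(1, 2, 3)         # instance passed implicitly
explicit = C.meth(x, 1, 2, 3)   # instance passed explicitly
print(bound == explicit)
```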

See also :ref:`why-self`.


How do I check if an object is an instance of a given class or of a subclass of it?
-----------------------------------------------------------------------------------

Use the built-in function :func:`isinstance(obj, cls) <isinstance>`.  You can
check if an object is an instance of any of a number of classes by providing a
tuple instead of a single class, e.g. ``isinstance(obj, (class1, class2, ...))``,
and can also check whether an object is one of Python's built-in types, e.g.
``isinstance(obj, str)`` or ``isinstance(obj, (int, float, complex))``.

Note that :func:`isinstance` also checks for virtual inheritance from an
:term:`abstract base class`.  So, the test will return ``True`` for a
registered class even if it hasn't directly or indirectly inherited from it.  To
test for "true inheritance", scan the :term:`MRO` of the class:

.. testcode::

    from collections.abc import Mapping

    class P:
        pass

    class C(P):
        pass

    Mapping.register(P)

.. doctest::

    >>> c = C()
    >>> isinstance(c, C)        # direct
    True
    >>> isinstance(c, P)        # indirect
    True
    >>> isinstance(c, Mapping)  # virtual
    True

    # Actual inheritance chain
    >>> type(c).__mro__
    (<class 'C'>, <class 'P'>, <class 'object'>)

    # Test for "true inheritance"
    >>> Mapping in type(c).__mro__
    False

Note that most programs do not use :func:`isinstance` on user-defined classes
very often.  If you are developing the classes yourself, a more proper
object-oriented style is to define methods on the classes that encapsulate a
particular behaviour, instead of checking the object's class and doing a
different thing based on what class it is.  For example, if you have a function
that does something::

   def search(obj):
       if isinstance(obj, Mailbox):
           ...  # code to search a mailbox
       elif isinstance(obj, Document):
           ...  # code to search a document
       elif ...

A better approach is to define a ``search()`` method on all the classes and just
call it::

   class Mailbox:
       def search(self):
           ...  # code to search a mailbox

   class Document:
       def search(self):
           ...  # code to search a document

   obj.search()


What is delegation?
-------------------

Delegation is an object-oriented technique (also called a design pattern).
Let's say you have an object ``x`` and want to change the behaviour of just one
of its methods.  You can create a new class that provides a new implementation
of the method you're interested in changing and delegates all other methods to
the corresponding method of ``x``.

Python programmers can easily implement delegation.  For example, the following
class behaves like a file but converts all written data to uppercase::

   class UpperOut:

       def __init__(self, outfile):
           self._outfile = outfile

       def write(self, s):
           self._outfile.write(s.upper())

       def __getattr__(self, name):
           return getattr(self._outfile, name)

Here the ``UpperOut`` class redefines the ``write()`` method to convert the
argument string to uppercase before calling the underlying
``self._outfile.write()`` method.  All other methods are delegated to the
underlying ``self._outfile`` object.  The delegation is accomplished via the
:meth:`~object.__getattr__` method; consult :ref:`the language reference <attribute-access>`
for more information about controlling attribute access.

Note that for more general cases delegation can get trickier. When attributes
must be set as well as retrieved, the class must define a :meth:`~object.__setattr__`
method too, and it must do so carefully.  The basic implementation of
:meth:`!__setattr__` is roughly equivalent to the following::

   class X:
       ...
       def __setattr__(self, name, value):
           self.__dict__[name] = value
       ...

Most :meth:`!__setattr__` implementations must modify
:attr:`self.__dict__ <object.__dict__>` to store
local state for self without causing an infinite recursion.

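The following is one possible sketch of a wrapper that delegates attribute
assignment as well as lookup.  It reuses the ``UpperOut`` name from above and
wraps an :class:`io.StringIO` object purely for demonstration; forwarding *every*
assignment to the wrapped object is an assumption made for this example, not
the only reasonable policy.

```python
import io


class UpperOut:
    """Delegating wrapper that also forwards attribute assignment."""

    def __init__(self, outfile):
        # Write straight into __dict__: a plain 'self._outfile = outfile'
        # would go through our own __setattr__ before _outfile exists.
        self.__dict__["_outfile"] = outfile

    def write(self, s):
        self._outfile.write(s.upper())

    def __getattr__(self, name):
        # Only called when normal lookup fails, so '_outfile' itself is
        # found in __dict__ and never reaches this method.
        return getattr(self._outfile, name)

    def __setattr__(self, name, value):
        # Forward every assignment to the wrapped object.
        setattr(self._outfile, name, value)


out = UpperOut(io.StringIO())
out.write("hello")
out.tag = "demo"          # stored on the wrapped StringIO object
print(out.getvalue())     # getvalue() is delegated via __getattr__
print(out._outfile.tag)
```

Storing ``_outfile`` through ``self.__dict__`` is what keeps ``__init__`` from
recursing into ``__setattr__`` and ``__getattr__``.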

How do I call a method defined in a base class from a derived class that extends it?
------------------------------------------------------------------------------------

Use the built-in :func:`super` function::

   class Derived(Base):
       def meth(self):
           super().meth()  # calls Base.meth

In the example, :func:`super` will automatically determine the instance from
which it was called (the ``self`` value), look up the :term:`method resolution
order` (MRO) with ``type(self).__mro__``, and return the next in line after
``Derived`` in the MRO: ``Base``.

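A self-contained version of the same idea is sketched below.  The return values
are made up for illustration; they simply record which implementations ran, and
the last line prints the MRO that :func:`super` walks.

```python
class Base:
    def meth(self):
        return ["Base.meth"]


class Derived(Base):
    def meth(self):
        # super() finds the class after Derived in type(self).__mro__.
        return ["Derived.meth"] + super().meth()


d = Derived()
print(d.meth())
print([cls.__name__ for cls in type(d).__mro__])
```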

How can I organize my code to make it easier to change the base class?
----------------------------------------------------------------------

You could assign the base class to an alias and derive from the alias.  Then all
you have to change is the value assigned to the alias.  Incidentally, this trick
is also handy if you want to decide dynamically (e.g. depending on availability
of resources) which base class to use.  Example::

   class Base:
       ...

   BaseAlias = Base

   class Derived(BaseAlias):
       ...

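For the dynamic case, one possible sketch picks the base class according to
whether an optional module can be found.  The module name
``hypothetical_accel`` and the two base classes are invented for this example.

```python
import importlib.util


class PlainBase:
    backend = "plain"


class AcceleratedBase:
    backend = "accelerated"


# 'hypothetical_accel' is a made-up module name used only for illustration.
if importlib.util.find_spec("hypothetical_accel") is not None:
    BaseAlias = AcceleratedBase
else:
    BaseAlias = PlainBase


class Derived(BaseAlias):
    pass


print(Derived.backend)
```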

How do I create static class data and static class methods?
-----------------------------------------------------------

Both static data and static methods (in the sense of C++ or Java) are supported
in Python.

For static data, simply define a class attribute.  To assign a new value to the
attribute, you have to explicitly use the class name in the assignment::

   class C:
       count = 0   # number of times C.__init__ called

       def __init__(self):
           C.count = C.count + 1

       def getcount(self):
           return C.count  # or return self.count

``c.count`` also refers to ``C.count`` for any ``c`` such that ``isinstance(c,
C)`` holds, unless overridden by ``c`` itself or by some class on the base-class
search path from ``c.__class__`` back to ``C``.

Caution: within a method of C, an assignment like ``self.count = 42`` creates a
new and unrelated instance attribute named ``count`` in ``self``'s own dict.
Rebinding of a class-static data name must always specify the class, whether
inside a method or not::

   C.count = 314

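The difference between rebinding through the class and assigning through
``self`` can be seen in a short sketch.  The ``shadow()`` method is a
hypothetical name added only to trigger the behaviour described in the caution.

```python
class C:
    count = 0   # class attribute shared by all instances

    def __init__(self):
        C.count += 1        # rebinds the class attribute

    def shadow(self):
        self.count = 42     # creates an instance attribute instead


a, b = C(), C()
a.shadow()
print(C.count, a.count, b.count)
print(vars(a), vars(b))
```

Only ``a`` gains an entry in its own ``__dict__``; ``b`` still sees the class
attribute.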

Static methods are possible::

   class C:
       @staticmethod
       def static(arg1, arg2, arg3):
           # No 'self' parameter!
           ...

However, a far more straightforward way to get the effect of a static method is
via a simple module-level function::

   def getcount():
       return C.count

If your code is structured so as to define one class (or tightly related class
hierarchy) per module, this supplies the desired encapsulation.


How can I overload constructors (or methods) in Python?
-------------------------------------------------------

This answer actually applies to all methods, but the question usually comes up
first in the context of constructors.

In C++ you'd write

.. code-block:: c++

    class C {
    public:
        C() { cout << "No arguments\n"; }
        C(int i) { cout << "Argument is " << i << "\n"; }
    };

In Python you have to write a single constructor that catches all cases using
default arguments.  For example::

   class C:
       def __init__(self, i=None):
           if i is None:
               print("No arguments")
           else:
               print("Argument is", i)

This is not entirely equivalent, but close enough in practice.

You could also try a variable-length argument list, e.g. ::

   def __init__(self, *args):
       ...

The same approach works for all method definitions.

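A fuller sketch of the variable-length approach might dispatch on the number of
arguments received.  The ``description`` attribute and the choice to raise
:exc:`TypeError` for extra arguments are assumptions made for this example.

```python
class C:
    def __init__(self, *args):
        # Dispatch on the number of positional arguments.
        if len(args) == 0:
            self.description = "No arguments"
        elif len(args) == 1:
            self.description = f"Argument is {args[0]}"
        else:
            raise TypeError(f"expected at most 1 argument, got {len(args)}")


print(C().description)
print(C(5).description)
```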

I try to use __spam and I get an error about _SomeClassName__spam.
------------------------------------------------------------------

Variable names with double leading underscores are "mangled" to provide a simple
but effective way to define class private variables.  Any identifier of the form
``__spam`` (at least two leading underscores, at most one trailing underscore)
is textually replaced with ``_classname__spam``, where ``classname`` is the
current class name with any leading underscores stripped.

This doesn't guarantee privacy: an outside user can still deliberately access
the ``_classname__spam`` attribute, and private values are visible in the object's
``__dict__``.  Many Python programmers never bother to use private variable
names at all.

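The mangling can be observed directly.  In this sketch ``Account`` and
``peek()`` are illustrative names; the point is that ``__spam`` is rewritten
inside the class body but not in code outside it.

```python
class Account:
    def __init__(self):
        self.__spam = 1          # stored as _Account__spam

    def peek(self):
        return self.__spam       # mangled the same way inside the class


acct = Account()
print(acct.peek())
print(vars(acct))
print(acct._Account__spam)       # deliberate outside access still works
try:
    acct.__spam                  # not mangled outside a class body
except AttributeError as exc:
    print("AttributeError:", exc)
```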

My class defines __del__ but it is not called when I delete the object.
-----------------------------------------------------------------------

There are several possible reasons for this.

The :keyword:`del` statement does not necessarily call :meth:`~object.__del__` -- it simply
decrements the object's reference count, and if this reaches zero
:meth:`!__del__` is called.

If your data structures contain circular links (e.g. a tree where each child has
a parent reference and each parent has a list of children) the reference counts
will never go back to zero.  Once in a while Python runs an algorithm to detect
such cycles, but the garbage collector might run some time after the last
reference to your data structure vanishes, so your :meth:`!__del__` method may be
called at an inconvenient and unpredictable time. This is a nuisance if you're
trying to reproduce a problem. Worse, the order in which objects' :meth:`!__del__`
methods are executed is arbitrary.  You can run :func:`gc.collect` to force a
collection, but there *are* pathological cases where objects will never be
collected.

Despite the cycle collector, it's still a good idea to define an explicit
``close()`` method on objects to be called whenever you're done with them.  The
``close()`` method can then remove attributes that refer to subobjects.  Don't
call :meth:`!__del__` directly -- :meth:`!__del__` should call ``close()`` and
``close()`` should make sure that it can be called more than once for the same
object.

Another way to avoid cyclical references is to use the :mod:`weakref` module,
which allows you to point to objects without incrementing their reference count.
Tree data structures, for instance, should use weak references for their parent
and sibling references (if they need them!).

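Both pieces of advice can be combined in a small sketch: a tree node that holds
its parent through a weak reference and offers a ``close()`` method that is safe
to call repeatedly.  The ``Node`` class, its ``closed`` flag and the ``parent``
property are hypothetical names chosen for this example.

```python
import weakref


class Node:
    def __init__(self, name, parent=None):
        self.name = name
        self.closed = False
        self.children = []
        # Weak reference: the child does not keep its parent alive.
        self._parent = weakref.ref(parent) if parent is not None else None
        if parent is not None:
            parent.children.append(self)

    @property
    def parent(self):
        return self._parent() if self._parent is not None else None

    def close(self):
        # Safe to call more than once.
        if self.closed:
            return
        self.closed = True
        for child in self.children:
            child.close()
        self.children.clear()   # drop references to subobjects

    def __del__(self):
        self.close()


root = Node("root")
leaf = Node("leaf", parent=root)
print(leaf.parent is root)
root.close()
root.close()     # second call is a no-op
print(leaf.closed)
```

Because the only strong references run from parent to child, no reference cycle
is created, and ``close()`` remains the primary cleanup path with
``__del__`` as a fallback.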

.. XXX relevant for Python 3?

   If the object has ever been a local variable in a function that caught an
   exception in an except clause, chances are that a reference to the object
   still exists in that function's stack frame as contained in the stack trace.
   Normally, calling :func:`sys.exc_clear` will take care of this by clearing
   the last recorded exception.

Finally, if your :meth:`!__del__` method raises an exception, a warning message
is printed to :data:`sys.stderr`.


How do I get a list of all instances of a given class?
------------------------------------------------------

Python does not keep track of all instances of a class (or of a built-in type).
You can program the class's constructor to keep track of all instances by
keeping a list of weak references to each instance.

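One way to do this is sketched below.  The ``Tracked`` class, its ``_refs``
list and the ``instances()`` helper are hypothetical names; the immediate
reclamation after ``del`` relies on CPython's reference counting and may be
delayed on other implementations.

```python
import weakref


class Tracked:
    _refs = []  # weak references to every instance created so far

    def __init__(self, name):
        self.name = name
        Tracked._refs.append(weakref.ref(self))

    @classmethod
    def instances(cls):
        """Return the instances that are still alive."""
        alive = []
        for ref in Tracked._refs:
            obj = ref()          # None once the instance has been collected
            if obj is not None:
                alive.append(obj)
        return alive


a = Tracked("a")
b = Tracked("b")
print([obj.name for obj in Tracked.instances()])
del a   # in CPython the instance is reclaimed once its last reference goes
print([obj.name for obj in Tracked.instances()])
```

Because only weak references are stored, the registry itself never keeps an
instance alive.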

Why does the result of ``id()`` appear to be not unique?
--------------------------------------------------------

The :func:`id` builtin returns an integer that is guaranteed to be unique during
the lifetime of the object.  Since in CPython this is the object's memory
address, it frequently happens that after an object is deleted from memory, the
next freshly created object is allocated at the same position in memory.  This
is illustrated by this example:

>>> id(1000) # doctest: +SKIP
13901272
>>> id(2000) # doctest: +SKIP
13901272

The two ids belong to different integer objects that are created before, and
deleted immediately after execution of the ``id()`` call.  To be sure that
objects whose id you want to examine are still alive, create another reference
to the object:

>>> a = 1000; b = 2000
>>> id(a) # doctest: +SKIP
13901272
>>> id(b) # doctest: +SKIP
13891296


										 |  |  | When can I rely on identity tests with the *is* operator?
 | 
					
						
							|  |  |  | ---------------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The ``is`` operator tests for object identity.  The test ``a is b`` is
 | 
					
						
							|  |  |  | equivalent to ``id(a) == id(b)``.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The most important property of an identity test is that an object is always
 | 
					
						
							|  |  |  | identical to itself: ``a is a`` always returns ``True``.  Identity tests are
 | 
					
						
							|  |  |  | usually faster than equality tests.  And unlike equality tests, identity tests
 | 
					
						
							|  |  |  | are guaranteed to return a boolean ``True`` or ``False``.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | However, identity tests can *only* be substituted for equality tests when
 | 
					
						
							|  |  |  | object identity is assured.  Generally, there are three circumstances where
 | 
					
						
							|  |  |  | identity is guaranteed:
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 1) Assignments create new names but do not change object identity.  After the
 | 
					
						
							|  |  |  | assignment ``new = old``, it is guaranteed that ``new is old``.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 2) Putting an object in a container that stores object references does not
 | 
					
						
							|  |  |  | change object identity.  After the list assignment ``s[0] = x``, it is
 | 
					
						
							|  |  |  | guaranteed that ``s[0] is x``.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 3) If an object is a singleton, it means that only one instance of that object
 | 
					
						
							|  |  |  | can exist.  After the assignments ``a = None`` and ``b = None``, it is
 | 
					
						
							|  |  |  | guaranteed that ``a is b`` because ``None`` is a singleton.
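The three guarantees listed above can be checked directly. The variable names in this short sketch are illustrative only.

```python
# 1) Assignment binds a second name to the same object.
old = [1, 2, 3]
new = old
same_after_assignment = new is old

# 2) A list stores a reference to the object, not a copy of it.
s = [None]
x = object()
s[0] = x
same_in_container = s[0] is x

# 3) None is a singleton, so every reference to it is the same object.
a = None
b = None
same_singleton = a is b
```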
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | In most other circumstances, identity tests are inadvisable and equality tests
 | 
					
						
							|  |  |  | are preferred.  In particular, identity tests should not be used to check
 | 
					
						
							|  |  |  | constants such as :class:`int` and :class:`str` which aren't guaranteed to be
 | 
					
						
							|  |  |  | singletons::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |     >>> a = 1000
 | 
					
						
							|  |  |  |     >>> b = 500
 | 
					
						
							|  |  |  |     >>> c = b + 500
 | 
					
						
							|  |  |  |     >>> a is c
 | 
					
						
							|  |  |  |     False
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |     >>> a = 'Python'
 | 
					
						
							|  |  |  |     >>> b = 'Py'
 | 
					
						
							|  |  |  |     >>> c = b + 'thon'
 | 
					
						
							|  |  |  |     >>> a is c
 | 
					
						
							|  |  |  |     False
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Likewise, new instances of mutable containers are never identical::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |     >>> a = []
 | 
					
						
							|  |  |  |     >>> b = []
 | 
					
						
							|  |  |  |     >>> a is b
 | 
					
						
							|  |  |  |     False
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | In the standard library code, you will see several common patterns for
 | 
					
						
							|  |  |  | correctly using identity tests:
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 1) As recommended by :pep:`8`, an identity test is the preferred way to check
 | 
					
						
							|  |  |  | for ``None``.  This reads like plain English in code and avoids confusion with
 | 
					
						
							|  |  |  | other objects that may have boolean values that evaluate to false.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 2) Detecting optional arguments can be tricky when ``None`` is a valid input
 | 
					
						
							| 
									
										
										
										
											2022-02-14 12:45:48 -06:00
										 |  |  | value.  In those situations, you can create a singleton sentinel object
 | 
					
						
							| 
									
										
										
										
											2021-04-03 19:54:49 -07:00
										 |  |  | guaranteed to be distinct from other objects.  For example, here is how
 | 
					
						
							|  |  |  | to implement a method that behaves like :meth:`dict.pop`::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    _sentinel = object()
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    def pop(self, key, default=_sentinel):
 | 
					
						
							|  |  |  |        if key in self:
 | 
					
						
							|  |  |  |            value = self[key]
 | 
					
						
							|  |  |  |            del self[key]
 | 
					
						
							|  |  |  |            return value
 | 
					
						
							|  |  |  |        if default is _sentinel:
 | 
					
						
							|  |  |  |            raise KeyError(key)
 | 
					
						
							|  |  |  |        return default
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 3) Container implementations sometimes need to augment equality tests with
 | 
					
						
							|  |  |  | identity tests.  This prevents the code from being confused by objects such as
 | 
					
						
							|  |  |  | ``float('NaN')`` that are not equal to themselves.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | For example, here is the implementation of
 | 
					
						
							|  |  |  | :meth:`collections.abc.Sequence.__contains__`::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |     def __contains__(self, value):
 | 
					
						
							|  |  |  |         for v in self:
 | 
					
						
							|  |  |  |             if v is value or v == value:
 | 
					
						
							|  |  |  |                 return True
 | 
					
						
							|  |  |  |         return False
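The effect of the identity check can be seen with a small sequence class. This is a sketch under stated assumptions: ``Readings`` is a hypothetical class that inherits ``__contains__`` from :class:`collections.abc.Sequence` and only supplies the two required abstract methods.

```python
from collections.abc import Sequence


class Readings(Sequence):
    """Hypothetical read-only sequence that relies on the mixin __contains__."""

    def __init__(self, values):
        self._values = list(values)

    def __getitem__(self, index):
        return self._values[index]

    def __len__(self):
        return len(self._values)


nan = float("nan")
readings = Readings([1.5, nan])

equal_to_itself = (nan == nan)             # NaN never compares equal to itself
found_same_object = nan in readings        # found through the identity test
found_other_nan = float("nan") in readings # a different NaN object is not found
```

Without the ``v is value`` clause, the membership test for ``nan`` would fail even though the very same object is stored in the sequence.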
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2021-08-22 14:27:06 -05:00
										 |  |  | 
 | 
					
						
							|  |  |  | How can a subclass control what data is stored in an immutable instance?
 | 
					
						
							|  |  |  | ------------------------------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2022-11-01 21:58:44 -07:00
										 |  |  | When subclassing an immutable type, override the :meth:`~object.__new__` method
 | 
					
						
							|  |  |  | instead of the :meth:`~object.__init__` method.  The latter only runs *after* an
 | 
					
						
							| 
									
										
										
										
											2021-08-22 14:27:06 -05:00
										 |  |  | instance is created, which is too late to alter data in an immutable
 | 
					
						
							|  |  |  | instance.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | All of these immutable classes have a different signature than their
 | 
					
						
							|  |  |  | parent class:
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | .. testcode::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |     from datetime import date
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |     class FirstOfMonthDate(date):
 | 
					
						
							|  |  |  |         "Always choose the first day of the month"
 | 
					
						
							|  |  |  |         def __new__(cls, year, month, day):
 | 
					
						
							|  |  |  |             return super().__new__(cls, year, month, 1)
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |     class NamedInt(int):
 | 
					
						
							|  |  |  |         "Allow text names for some numbers"
 | 
					
						
							|  |  |  |         xlat = {'zero': 0, 'one': 1, 'ten': 10}
 | 
					
						
							|  |  |  |         def __new__(cls, value):
 | 
					
						
							|  |  |  |             value = cls.xlat.get(value, value)
 | 
					
						
							|  |  |  |             return super().__new__(cls, value)
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |     class TitleStr(str):
 | 
					
						
							|  |  |  |         "Convert str to name suitable for a URL path"
 | 
					
						
							|  |  |  |         def __new__(cls, s):
 | 
					
						
							|  |  |  |             s = s.lower().replace(' ', '-')
 | 
					
						
							|  |  |  |             s = ''.join([c for c in s if c.isalnum() or c == '-'])
 | 
					
						
							|  |  |  |             return super().__new__(cls, s)
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The classes can be used like this:
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | .. doctest::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |     >>> FirstOfMonthDate(2012, 2, 14)
 | 
					
						
							|  |  |  |     FirstOfMonthDate(2012, 2, 1)
 | 
					
						
							|  |  |  |     >>> NamedInt('ten')
 | 
					
						
							|  |  |  |     10
 | 
					
						
							|  |  |  |     >>> NamedInt(20)
 | 
					
						
							|  |  |  |     20
 | 
					
						
							|  |  |  |     >>> TitleStr('Blog: Why Python Rocks')
 | 
					
						
							|  |  |  |     'blog-why-python-rocks'
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2022-09-18 16:36:20 -05:00
										 |  |  | .. _faq-cache-method-calls:
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2021-06-17 15:39:42 -05:00
										 |  |  | How do I cache method calls?
 | 
					
						
							|  |  |  | ----------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The two principal tools for caching methods are
 | 
					
						
							|  |  |  | :func:`functools.cached_property` and :func:`functools.lru_cache`.  The
 | 
					
						
							|  |  |  | former stores results at the instance level and the latter at the class
 | 
					
						
							|  |  |  | level.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The *cached_property* approach only works with methods that do not take
 | 
					
						
							|  |  |  | any arguments.  It does not create a reference to the instance.  The
 | 
					
						
							|  |  |  | cached method result will be kept only as long as the instance is alive.
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2022-02-14 10:49:28 -08:00
										 |  |  | The advantage is that when an instance is no longer used, the cached
 | 
					
						
							| 
									
										
										
										
											2021-06-17 15:39:42 -05:00
										 |  |  | method result will be released right away.  The disadvantage is that if
 | 
					
						
							|  |  |  | instances accumulate, so too will the cached method results.  They
 | 
					
						
							|  |  |  | can grow without bound.
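The per-instance storage described above can be observed in a short sketch. The ``Station`` class and its ``lookups`` counter are hypothetical and exist only to show how often the method body runs; the returned tuple is a placeholder, not real data.

```python
from functools import cached_property


class Station:
    """Hypothetical class used to observe cached_property behaviour."""

    lookups = 0  # counts how many times the method body actually executes

    def __init__(self, station_id):
        self._station_id = station_id

    @cached_property
    def location(self):
        Station.lookups += 1
        return (self._station_id, "coordinates")  # placeholder result


s = Station("KSFO")
first = s.location                   # runs the method body
second = s.location                  # served from the instance dictionary
stored_on_instance = "location" in vars(s)
```

Because the result lives in the instance's own ``__dict__``, it disappears together with the instance, which matches the lifetime behaviour described above.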
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2023-02-14 10:20:11 +00:00
										 |  |  | The *lru_cache* approach works with methods that have :term:`hashable`
 | 
					
						
							| 
									
										
										
										
											2021-06-17 15:39:42 -05:00
										 |  |  | arguments.  It creates a reference to the instance unless special
 | 
					
						
							|  |  |  | efforts are made to pass in weak references.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The advantage of the least recently used algorithm is that the cache is
 | 
					
						
							|  |  |  | bounded by the specified *maxsize*.  The disadvantage is that instances
 | 
					
						
							|  |  |  | are kept alive until they age out of the cache or until the cache is
 | 
					
						
							|  |  |  | cleared.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | This example shows the various techniques::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |     from functools import cached_property, lru_cache
							|  |  |  | 
							|  |  |  |     class Weather:
 | 
					
						
							|  |  |  |         "Lookup weather information on a government website"
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |         def __init__(self, station_id):
 | 
					
						
							|  |  |  |             self._station_id = station_id
 | 
					
						
							|  |  |  |             # The _station_id is private and immutable
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |         def current_temperature(self):
 | 
					
						
							|  |  |  |             "Latest hourly observation"
 | 
					
						
							|  |  |  |             # Do not cache this because old results
 | 
					
						
							|  |  |  |             # can be out of date.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |         @cached_property
 | 
					
						
							|  |  |  |         def location(self):
 | 
					
						
							|  |  |  |             "Return the longitude/latitude coordinates of the station"
 | 
					
						
							|  |  |  |             # Result only depends on the station_id
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |         @lru_cache(maxsize=20)
 | 
					
						
							|  |  |  |         def historic_rainfall(self, date, units='mm'):
 | 
					
						
							|  |  |  |             "Rainfall on a given date"
 | 
					
						
							|  |  |  |             # Depends on the station_id, date, and units.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The above example assumes that the *station_id* never changes.  If the
 | 
					
						
							|  |  |  | relevant instance attributes are mutable, the *cached_property* approach
 | 
					
						
							|  |  |  | can't be made to work because it cannot detect changes to the
 | 
					
						
							|  |  |  | attributes.
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2022-04-07 21:10:00 -05:00
										 |  |  | To make the *lru_cache* approach work when the *station_id* is mutable,
 | 
					
						
							| 
									
										
										
										
											2022-11-01 21:58:44 -07:00
										 |  |  | the class needs to define the :meth:`~object.__eq__` and :meth:`~object.__hash__`
 | 
					
						
							|  |  |  | methods so that the cache can detect relevant attribute updates::
 | 
					
						
							| 
									
										
										
										
											2021-06-17 15:39:42 -05:00
										 |  |  | 
 | 
					
						
							|  |  |  |     class Weather:
 | 
					
						
							|  |  |  |         "Example with a mutable station identifier"
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |         def __init__(self, station_id):
 | 
					
						
							|  |  |  |             self.station_id = station_id
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |         def change_station(self, station_id):
 | 
					
						
							|  |  |  |             self.station_id = station_id
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |         def __eq__(self, other):
 | 
					
						
							|  |  |  |             return self.station_id == other.station_id
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |         def __hash__(self):
 | 
					
						
							|  |  |  |             return hash(self.station_id)
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |         @lru_cache(maxsize=20)
 | 
					
						
							|  |  |  |         def historic_rainfall(self, date, units='mm'):
 | 
					
						
							|  |  |  |             'Rainfall on a given date'
 | 
					
						
							|  |  |  |             # Depends on the station_id, date, and units.
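To see how the cache reacts to an attribute update, the class above can be filled in with a placeholder method body. This is a sketch only: the ``calls`` counter, the returned string, and the ``isinstance`` guard in ``__eq__`` are additions made for the illustration.

```python
from functools import lru_cache


class Weather:
    "Example with a mutable station identifier"

    calls = 0  # hypothetical counter: how often the method body really runs

    def __init__(self, station_id):
        self.station_id = station_id

    def change_station(self, station_id):
        self.station_id = station_id

    def __eq__(self, other):
        if not isinstance(other, Weather):
            return NotImplemented
        return self.station_id == other.station_id

    def __hash__(self):
        return hash(self.station_id)

    @lru_cache(maxsize=20)
    def historic_rainfall(self, date, units='mm'):
        'Rainfall on a given date'
        Weather.calls += 1
        return f'{self.station_id}:{date}:{units}'  # placeholder result


w = Weather('A')
r1 = w.historic_rainfall('2024-01-01')   # miss: computed
r2 = w.historic_rainfall('2024-01-01')   # hit: same hash and equality
w.change_station('B')
r3 = w.historic_rainfall('2024-01-01')   # miss: the instance now hashes differently
info = Weather.historic_rainfall.cache_info()
```

The cache key includes ``self``, so changing ``station_id`` changes the key's hash and the stale entry is no longer matched. The old entry stays in the cache until it ages out or the cache is cleared.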
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2021-04-03 19:54:49 -07:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | Modules
 | 
					
						
							|  |  |  | =======
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | How do I create a .pyc file?
 | 
					
						
							|  |  |  | ----------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2013-12-13 12:29:29 -05:00
										 |  |  | When a module is imported for the first time (or when the source file has
 | 
					
						
							|  |  |  | changed since the current compiled file was created) a ``.pyc`` file containing
 | 
					
						
							|  |  |  | the compiled code should be created in a ``__pycache__`` subdirectory of the
 | 
					
						
							|  |  |  | directory containing the ``.py`` file.  The ``.pyc`` file will have a
 | 
					
						
							|  |  |  | filename that starts with the same name as the ``.py`` file, and ends with
 | 
					
						
							|  |  |  | ``.pyc``, with a middle component that depends on the particular ``python``
 | 
					
						
							|  |  |  | binary that created it.  (See :pep:`3147` for details.)
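The expected location of the compiled file can be computed without importing anything, using :func:`importlib.util.cache_from_source`. The sketch below assumes a default configuration (no ``PYTHONPYCACHEPREFIX``) and an implementation that defines a cache tag, as CPython does; ``foo.py`` is just an example filename and does not need to exist.

```python
import importlib.util
import os
import sys

# The middle component of the filename comes from the interpreter's
# cache tag, for example 'cpython-312' (the value depends on the binary).
tag = sys.implementation.cache_tag

# Compute where the bytecode for foo.py would be written.
pyc_path = importlib.util.cache_from_source("foo.py")
pyc_name = os.path.basename(pyc_path)
```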
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | One reason that a ``.pyc`` file may not be created is a permissions problem
 | 
					
						
							|  |  |  | with the directory containing the source file, meaning that the ``__pycache__``
 | 
					
						
							|  |  |  | subdirectory cannot be created. This can happen, for example, if you develop as
 | 
					
						
							|  |  |  | one user but run as another, such as if you are testing with a web server.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Unless the :envvar:`PYTHONDONTWRITEBYTECODE` environment variable is set,
 | 
					
						
							|  |  |  | creation of a ``.pyc`` file is automatic if you're importing a module and Python
 | 
					
						
							|  |  |  | has the ability (permissions, free space, etc.) to create a ``__pycache__``
 | 
					
						
							|  |  |  | subdirectory and write the compiled module to that subdirectory.
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2013-06-19 16:58:26 -04:00
										 |  |  | Running Python on a top-level script is not considered an import and no
 | 
					
						
							|  |  |  | ``.pyc`` will be created.  For example, if you have a top-level module
 | 
					
						
							| 
									
										
										
										
											2013-12-13 12:29:29 -05:00
										 |  |  | ``foo.py`` that imports another module ``xyz.py``, when you run ``foo`` (by
 | 
					
						
							|  |  |  | typing ``python foo.py`` as a shell command), a ``.pyc`` will be created for
 | 
					
						
							|  |  |  | ``xyz`` because ``xyz`` is imported, but no ``.pyc`` file will be created for
 | 
					
						
							|  |  |  | ``foo`` since ``foo.py`` isn't being imported.
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2013-12-13 12:29:29 -05:00
										 |  |  | If you need to create a ``.pyc`` file for ``foo`` -- that is, to create a
 | 
					
						
							|  |  |  | ``.pyc`` file for a module that is not imported -- you can, using the
 | 
					
						
							|  |  |  | :mod:`py_compile` and :mod:`compileall` modules.
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | The :mod:`py_compile` module can manually compile any module.  One way is to use
 | 
					
						
							|  |  |  | the ``compile()`` function in that module interactively::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    >>> import py_compile
 | 
					
						
							| 
									
										
										
										
											2013-06-19 16:58:26 -04:00
										 |  |  |    >>> py_compile.compile('foo.py')                 # doctest: +SKIP
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2013-12-13 12:29:29 -05:00
										 |  |  | This will write the ``.pyc`` to a ``__pycache__`` subdirectory in the same
 | 
					
						
							|  |  |  | location as ``foo.py`` (or you can override that with the optional parameter
 | 
					
						
							|  |  |  | ``cfile``).
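Both forms of the call can be tried safely in a temporary directory. In this sketch the file names and the temporary directory are assumptions made so the example is self-contained; ``doraise=True`` is passed so that a compile error raises an exception instead of only being printed.

```python
import py_compile
import tempfile
from pathlib import Path

with tempfile.TemporaryDirectory() as tmp:
    source = Path(tmp, "foo.py")
    source.write_text("print('hello')\n")

    # Default location: a __pycache__ subdirectory next to foo.py.
    default_pyc = py_compile.compile(str(source), doraise=True)

    # Explicit location chosen with the optional cfile parameter.
    target = Path(tmp, "foo_custom.pyc")
    custom_pyc = py_compile.compile(str(source), cfile=str(target), doraise=True)

    default_exists = Path(default_pyc).exists()
    custom_exists = Path(custom_pyc).exists()
    custom_name = Path(custom_pyc).name
```

``py_compile.compile()`` returns the path of the file it wrote, which is convenient for checking where the bytecode ended up.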
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | You can also automatically compile all files in a directory or directories using
 | 
					
						
							|  |  |  | the :mod:`compileall` module.  You can do it from the shell prompt by running
 | 
					
						
							|  |  |  | ``python -m compileall`` and providing the path of a directory containing Python files
 | 
					
						
							|  |  |  | to compile::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |        python -m compileall .
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | How do I find the current module name?
 | 
					
						
							|  |  |  | --------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | A module can find out its own module name by looking at the predefined global
 | 
					
						
							|  |  |  | variable ``__name__``.  If this has the value ``'__main__'``, the program is
 | 
					
						
							|  |  |  | running as a script.  Many modules that are usually used by importing them also
 | 
					
						
							|  |  |  | provide a command-line interface or a self-test, and only execute this code
 | 
					
						
							|  |  |  | after checking ``__name__``::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    def main():
 | 
					
						
							| 
									
										
										
										
											2009-12-19 17:51:41 +00:00
										 |  |  |        print('Running test...')
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  |        ...
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    if __name__ == '__main__':
 | 
					
						
							|  |  |  |        main()
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | How can I have modules that mutually import each other?
 | 
					
						
							|  |  |  | -------------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Suppose you have the following modules:
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2021-04-13 18:03:22 +02:00
										 |  |  | :file:`foo.py`::
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  |    from bar import bar_var
 | 
					
						
							|  |  |  |    foo_var = 1
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2021-04-13 18:03:22 +02:00
										 |  |  | :file:`bar.py`::
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  |    from foo import foo_var
 | 
					
						
							|  |  |  |    bar_var = 2
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The problem is that the interpreter will perform the following steps:
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2021-04-13 18:03:22 +02:00
										 |  |  | * main imports ``foo``
 | 
					
						
							|  |  |  | * Empty globals for ``foo`` are created
 | 
					
						
							|  |  |  | * ``foo`` is compiled and starts executing
 | 
					
						
							|  |  |  | * ``foo`` imports ``bar``
 | 
					
						
							|  |  |  | * Empty globals for ``bar`` are created
 | 
					
						
							|  |  |  | * ``bar`` is compiled and starts executing
 | 
					
						
							|  |  |  | * ``bar`` imports ``foo`` (which is a no-op since there already is a module named ``foo``)
 | 
					
						
							|  |  |  | * The import mechanism tries to read ``foo_var`` from ``foo`` globals, to set ``bar.foo_var = foo.foo_var``
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | The last step fails, because Python isn't done with interpreting ``foo`` yet and
 | 
					
						
							|  |  |  | the global symbol dictionary for ``foo`` is still empty.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | The same thing happens when you use ``import foo``, and then try to access
 | 
					
						
							|  |  |  | ``foo.foo_var`` in global code.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | There are (at least) three possible workarounds for this problem.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Guido van Rossum recommends avoiding all uses of ``from <module> import ...``,
 | 
					
						
							|  |  |  | and placing all code inside functions.  Initializations of global variables and
 | 
					
						
							|  |  |  | class variables should use constants or built-in functions only.  This means
 | 
					
						
							|  |  |  | everything from an imported module is referenced as ``<module>.<name>``.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Jim Roskind suggests performing steps in the following order in each module:
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | * exports (globals, functions, and classes that don't need imported base
 | 
					
						
							|  |  |  |   classes)
 | 
					
						
							|  |  |  | * ``import`` statements
 | 
					
						
							|  |  |  | * active code (including globals that are initialized from imported values).
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2021-11-20 09:35:39 +00:00
										 |  |  | Van Rossum doesn't like this approach much because the imports appear in a
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | strange place, but it does work.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | Matthias Urlichs recommends restructuring your code so that the recursive import
 | 
					
						
							|  |  |  | is not necessary in the first place.
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | These solutions are not mutually exclusive.
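The first workaround (plain ``import`` statements, with all cross-module access inside functions) can be sketched as follows. The module names ``faq_foo`` and ``faq_bar`` are hypothetical, chosen to avoid clashing with real modules, and the temporary directory exists only so the sketch can run on its own.

```python
import importlib
import sys
import tempfile
import textwrap
from pathlib import Path

FOO_SRC = textwrap.dedent("""
    import faq_bar            # binds the module object only, reads no names yet

    foo_var = 1

    def total():
        # The attribute lookup happens at call time, after both
        # modules have finished executing.
        return foo_var + faq_bar.bar_var
""")

BAR_SRC = textwrap.dedent("""
    import faq_foo

    bar_var = 2

    def total():
        return bar_var + faq_foo.foo_var
""")

with tempfile.TemporaryDirectory() as tmp:
    Path(tmp, "faq_foo.py").write_text(FOO_SRC)
    Path(tmp, "faq_bar.py").write_text(BAR_SRC)
    sys.path.insert(0, tmp)
    try:
        faq_foo = importlib.import_module("faq_foo")
        faq_bar = importlib.import_module("faq_bar")
        result = (faq_foo.total(), faq_bar.total())
    finally:
        sys.path.remove(tmp)
```

When ``faq_bar`` runs ``import faq_foo``, the partially initialized module is already registered, so the import succeeds. Nothing is read from it until ``total()`` is called, by which time ``foo_var`` exists.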
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | __import__('x.y.z') returns <module 'x'>; how do I get z?
 | 
					
						
							|  |  |  | ---------------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2014-08-04 19:34:29 +03:00
										 |  |  | Consider using the convenience function :func:`~importlib.import_module` from
 | 
					
						
							|  |  |  | :mod:`importlib` instead::
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							| 
									
										
										
										
											2014-08-04 19:34:29 +03:00
										 |  |  |    import importlib
										 |  |  |    z = importlib.import_module('x.y.z')
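The difference between the two calls can be seen with a dotted name from the standard library. ``xml.etree.ElementTree`` is used here purely as a convenient example of a nested module.

```python
import importlib

# __import__ returns the top-level package for a dotted name.
top = __import__("xml.etree.ElementTree")

# import_module returns the module that the dotted name refers to.
leaf = importlib.import_module("xml.etree.ElementTree")

top_name = top.__name__
leaf_name = leaf.__name__
reachable_from_top = top.etree.ElementTree is leaf
```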
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | When I edit an imported module and reimport it, the changes don't show up.  Why does this happen?
 | 
					
						
							|  |  |  | -------------------------------------------------------------------------------------------------
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | For reasons of efficiency as well as consistency, Python only reads the module
 | 
					
						
							|  |  |  | file the first time a module is imported.  If it didn't, in a program
 | 
					
						
							|  |  |  | consisting of many modules where each one imports the same basic module, the
 | 
					
						
							| 
									
										
										
										
											2013-06-14 22:49:00 -04:00
										 |  |  | basic module would be parsed and re-parsed many times.  To force re-reading of a
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | changed module, do this::
 | 
					
						
							|  |  |  | 
 | 
					
						
							| 
									
										
										
										
											2013-06-14 22:49:00 -04:00
										 |  |  |    import importlib
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  |    import modname
 | 
					
						
							| 
									
										
										
										
											2013-06-14 22:49:00 -04:00
										 |  |  |    importlib.reload(modname)
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
 | 
					
						
							|  |  |  | Warning: this technique is not 100% foolproof.  In particular, modules
 | 
					
						
							|  |  |  | containing statements like ::
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  |    from modname import some_objects
 | 
					
						
							|  |  |  | 
 | 
					
						
							|  |  |  | will continue to work with the old version of the imported objects.  If the
 | 
					
						
							|  |  |  | module contains class definitions, existing class instances will *not* be
 | 
					
						
							|  |  |  | updated to use the new class definition.  This can result in the following
 | 
					
						
							| 
									
										
										
										
											2017-03-18 17:59:33 +01:00
										 |  |  | paradoxical behaviour::
 | 
					
						
							| 
									
										
										
										
											2009-10-11 21:25:26 +00:00
										 |  |  | 
   >>> import importlib
   >>> import cls
   >>> c = cls.C()                # Create an instance of C
   >>> importlib.reload(cls)
   <module 'cls' from 'cls.py'>
   >>> isinstance(c, cls.C)       # isinstance is false?!?
   False

The nature of the problem is made clear if you print out the "identity" of the
class objects::

   >>> hex(id(c.__class__))
   '0x7352a0'
   >>> hex(id(cls.C))
   '0x4198d0'

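Both pitfalls (names bound by ``from ... import`` and instances created before
the reload) can be reproduced in one self-contained script.  This is a sketch
under stated assumptions: the module ``demo_cls_mod`` and its ``version``
attribute are hypothetical, and bytecode writing is disabled so that the edit
is picked up immediately.

```python
import importlib
import pathlib
import sys
import tempfile

# Assumption for this sketch: skip .pyc files so a fast edit is always seen.
sys.dont_write_bytecode = True

# "demo_cls_mod" is a hypothetical module created just for this demo.
tmpdir = tempfile.mkdtemp()
path = pathlib.Path(tmpdir) / "demo_cls_mod.py"
path.write_text("class C:\n    version = 1\n")
sys.path.insert(0, tmpdir)
importlib.invalidate_caches()

import demo_cls_mod
from demo_cls_mod import C as OldC   # name bound before the reload
c = demo_cls_mod.C()                 # instance created before the reload

# Edit the class on disk and reload the module.
path.write_text("class C:\n    version = 2\n")
importlib.reload(demo_cls_mod)

# The name bound by "from ... import" still refers to the old class object.
stale_name = OldC is not demo_cls_mod.C
# The old instance belongs to the old class, not to the reloaded one.
old_instance = isinstance(c, demo_cls_mod.C)
still_old = isinstance(c, OldC)

# Running the from-import again rebinds the name to the new class.
from demo_cls_mod import C as NewC

print(stale_name, old_instance, still_old, c.version, NewC.version)
```

After a reload, names imported with ``from`` have to be imported again and
instances have to be created again before they reflect the edited source.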