mirror of
				https://github.com/python/cpython.git
				synced 2025-10-31 13:41:24 +00:00 
			
		
		
		
	
		
			
				
	
	
		
			336 lines
		
	
	
	
		
			12 KiB
		
	
	
	
		
			ReStructuredText
		
	
	
	
	
	
			
		
		
	
	
			336 lines
		
	
	
	
		
			12 KiB
		
	
	
	
		
			ReStructuredText
		
	
	
	
	
	
| .. highlight:: c
 | |
| 
 | |
| 
 | |
| .. _embedding:
 | |
| 
 | |
| ***************************************
 | |
| Embedding Python in Another Application
 | |
| ***************************************
 | |
| 
 | |
| The previous chapters discussed how to extend Python, that is, how to extend the
 | |
| functionality of Python by attaching a library of C functions to it.  It is also
 | |
| possible to do it the other way around: enrich your C/C++ application by
 | |
| embedding Python in it.  Embedding provides your application with the ability to
 | |
| implement some of the functionality of your application in Python rather than C
 | |
| or C++. This can be used for many purposes; one example would be to allow users
 | |
| to tailor the application to their needs by writing some scripts in Python.  You
 | |
| can also use it yourself if some of the functionality can be written in Python
 | |
| more easily.
 | |
| 
 | |
| Embedding Python is similar to extending it, but not quite.  The difference is
 | |
| that when you extend Python, the main program of the application is still the
 | |
| Python interpreter, while if you embed Python, the main program may have nothing
 | |
| to do with Python --- instead, some parts of the application occasionally call
 | |
| the Python interpreter to run some Python code.
 | |
| 
 | |
| So if you are embedding Python, you are providing your own main program.  One of
 | |
| the things this main program has to do is initialize the Python interpreter.  At
 | |
| the very least, you have to call the function :c:func:`Py_Initialize`.  There are
 | |
| optional calls to pass command line arguments to Python.  Then later you can
 | |
| call the interpreter from any part of the application.
 | |
| 
 | |
| There are several different ways to call the interpreter: you can pass a string
 | |
| containing Python statements to :c:func:`PyRun_SimpleString`, or you can pass a
 | |
| stdio file pointer and a file name (for identification in error messages only)
 | |
| to :c:func:`PyRun_SimpleFile`.  You can also call the lower-level operations
 | |
| described in the previous chapters to construct and use Python objects.
 | |
| 
 | |
| 
 | |
| .. seealso::
 | |
| 
 | |
|    :ref:`c-api-index`
 | |
|       The details of Python's C interface are given in this manual. A great deal of
 | |
|       necessary information can be found here.
 | |
| 
 | |
| 
 | |
| .. _high-level-embedding:
 | |
| 
 | |
| Very High Level Embedding
 | |
| =========================
 | |
| 
 | |
| The simplest form of embedding Python is the use of the very high level
 | |
| interface. This interface is intended to execute a Python script without needing
 | |
| to interact with the application directly. This can for example be used to
 | |
| perform some operation on a file. ::
 | |
| 
 | |
|    #define PY_SSIZE_T_CLEAN
 | |
|    #include <Python.h>
 | |
| 
 | |
|    int
 | |
|    main(int argc, char *argv[])
 | |
|    {
 | |
|        wchar_t *program = Py_DecodeLocale(argv[0], NULL);
 | |
|        if (program == NULL) {
 | |
|            fprintf(stderr, "Fatal error: cannot decode argv[0]\n");
 | |
|            exit(1);
 | |
|        }
 | |
|        Py_SetProgramName(program);  /* optional but recommended */
 | |
|        Py_Initialize();
 | |
|        PyRun_SimpleString("from time import time,ctime\n"
 | |
|                           "print('Today is', ctime(time()))\n");
 | |
|        if (Py_FinalizeEx() < 0) {
 | |
|            exit(120);
 | |
|        }
 | |
|        PyMem_RawFree(program);
 | |
|        return 0;
 | |
|    }
 | |
| 
 | |
| The :c:func:`Py_SetProgramName` function should be called before
 | |
| :c:func:`Py_Initialize` to inform the interpreter about paths to Python run-time
 | |
| libraries.  Next, the Python interpreter is initialized with
 | |
| :c:func:`Py_Initialize`, followed by the execution of a hard-coded Python script
 | |
| that prints the date and time.  Afterwards, the :c:func:`Py_FinalizeEx` call shuts
 | |
| the interpreter down, followed by the end of the program.  In a real program,
 | |
| you may want to get the Python script from another source, perhaps a text-editor
 | |
| routine, a file, or a database.  Getting the Python code from a file can better
 | |
| be done by using the :c:func:`PyRun_SimpleFile` function, which saves you the
 | |
| trouble of allocating memory space and loading the file contents.
 | |
| 
 | |
| 
 | |
| .. _lower-level-embedding:
 | |
| 
 | |
| Beyond Very High Level Embedding: An overview
 | |
| =============================================
 | |
| 
 | |
| The high level interface gives you the ability to execute arbitrary pieces of
 | |
| Python code from your application, but exchanging data values is quite
 | |
| cumbersome to say the least. If you want that, you should use lower level calls.
 | |
| At the cost of having to write more C code, you can achieve almost anything.
 | |
| 
 | |
| It should be noted that extending Python and embedding Python is quite the same
 | |
| activity, despite the different intent. Most topics discussed in the previous
 | |
| chapters are still valid. To show this, consider what the extension code from
 | |
| Python to C really does:
 | |
| 
 | |
| #. Convert data values from Python to C,
 | |
| 
 | |
| #. Perform a function call to a C routine using the converted values, and
 | |
| 
 | |
| #. Convert the data values from the call from C to Python.
 | |
| 
 | |
| When embedding Python, the interface code does:
 | |
| 
 | |
| #. Convert data values from C to Python,
 | |
| 
 | |
| #. Perform a function call to a Python interface routine using the converted
 | |
|    values, and
 | |
| 
 | |
| #. Convert the data values from the call from Python to C.
 | |
| 
 | |
| As you can see, the data conversion steps are simply swapped to accommodate the
 | |
| different direction of the cross-language transfer. The only difference is the
 | |
| routine that you call between both data conversions. When extending, you call a
 | |
| C routine, when embedding, you call a Python routine.
 | |
| 
 | |
| This chapter will not discuss how to convert data from Python to C and vice
 | |
| versa.  Also, proper use of references and dealing with errors is assumed to be
 | |
| understood.  Since these aspects do not differ from extending the interpreter,
 | |
| you can refer to earlier chapters for the required information.
 | |
| 
 | |
| 
 | |
| .. _pure-embedding:
 | |
| 
 | |
| Pure Embedding
 | |
| ==============
 | |
| 
 | |
| The first program aims to execute a function in a Python script. Like in the
 | |
| section about the very high level interface, the Python interpreter does not
 | |
| directly interact with the application (but that will change in the next
 | |
| section).
 | |
| 
 | |
| The code to run a function defined in a Python script is:
 | |
| 
 | |
| .. literalinclude:: ../includes/run-func.c
 | |
| 
 | |
| 
 | |
| This code loads a Python script using ``argv[1]``, and calls the function named
 | |
| in ``argv[2]``.  Its integer arguments are the other values of the ``argv``
 | |
| array.  If you :ref:`compile and link <compiling>` this program (let's call
 | |
| the finished executable :program:`call`), and use it to execute a Python
 | |
| script, such as:
 | |
| 
 | |
| .. code-block:: python
 | |
| 
 | |
|    def multiply(a,b):
 | |
|        print("Will compute", a, "times", b)
 | |
|        c = 0
 | |
|        for i in range(0, a):
 | |
|            c = c + b
 | |
|        return c
 | |
| 
 | |
| then the result should be:
 | |
| 
 | |
| .. code-block:: shell-session
 | |
| 
 | |
|    $ call multiply multiply 3 2
 | |
|    Will compute 3 times 2
 | |
|    Result of call: 6
 | |
| 
 | |
| Although the program is quite large for its functionality, most of the code is
 | |
| for data conversion between Python and C, and for error reporting.  The
 | |
| interesting part with respect to embedding Python starts with ::
 | |
| 
 | |
|    Py_Initialize();
 | |
|    pName = PyUnicode_DecodeFSDefault(argv[1]);
 | |
|    /* Error checking of pName left out */
 | |
|    pModule = PyImport_Import(pName);
 | |
| 
 | |
| After initializing the interpreter, the script is loaded using
 | |
| :c:func:`PyImport_Import`.  This routine needs a Python string as its argument,
 | |
| which is constructed using the :c:func:`PyUnicode_FromString` data conversion
 | |
| routine. ::
 | |
| 
 | |
|    pFunc = PyObject_GetAttrString(pModule, argv[2]);
 | |
|    /* pFunc is a new reference */
 | |
| 
 | |
|    if (pFunc && PyCallable_Check(pFunc)) {
 | |
|        ...
 | |
|    }
 | |
|    Py_XDECREF(pFunc);
 | |
| 
 | |
| Once the script is loaded, the name we're looking for is retrieved using
 | |
| :c:func:`PyObject_GetAttrString`.  If the name exists, and the object returned is
 | |
| callable, you can safely assume that it is a function.  The program then
 | |
| proceeds by constructing a tuple of arguments as normal.  The call to the Python
 | |
| function is then made with::
 | |
| 
 | |
|    pValue = PyObject_CallObject(pFunc, pArgs);
 | |
| 
 | |
| Upon return of the function, ``pValue`` is either ``NULL`` or it contains a
 | |
| reference to the return value of the function.  Be sure to release the reference
 | |
| after examining the value.
 | |
| 
 | |
| 
 | |
| .. _extending-with-embedding:
 | |
| 
 | |
| Extending Embedded Python
 | |
| =========================
 | |
| 
 | |
| Until now, the embedded Python interpreter had no access to functionality from
 | |
| the application itself.  The Python API allows this by extending the embedded
 | |
| interpreter.  That is, the embedded interpreter gets extended with routines
 | |
| provided by the application. While it sounds complex, it is not so bad.  Simply
 | |
| forget for a while that the application starts the Python interpreter.  Instead,
 | |
| consider the application to be a set of subroutines, and write some glue code
 | |
| that gives Python access to those routines, just like you would write a normal
 | |
| Python extension.  For example::
 | |
| 
 | |
|    static int numargs=0;
 | |
| 
 | |
|    /* Return the number of arguments of the application command line */
 | |
|    static PyObject*
 | |
|    emb_numargs(PyObject *self, PyObject *args)
 | |
|    {
 | |
|        if(!PyArg_ParseTuple(args, ":numargs"))
 | |
|            return NULL;
 | |
|        return PyLong_FromLong(numargs);
 | |
|    }
 | |
| 
 | |
|    static PyMethodDef EmbMethods[] = {
 | |
|        {"numargs", emb_numargs, METH_VARARGS,
 | |
|         "Return the number of arguments received by the process."},
 | |
|        {NULL, NULL, 0, NULL}
 | |
|    };
 | |
| 
 | |
|    static PyModuleDef EmbModule = {
 | |
|        PyModuleDef_HEAD_INIT, "emb", NULL, -1, EmbMethods,
 | |
|        NULL, NULL, NULL, NULL
 | |
|    };
 | |
| 
 | |
|    static PyObject*
 | |
|    PyInit_emb(void)
 | |
|    {
 | |
|        return PyModule_Create(&EmbModule);
 | |
|    }
 | |
| 
 | |
| Insert the above code just above the :c:func:`main` function. Also, insert the
 | |
| following two statements before the call to :c:func:`Py_Initialize`::
 | |
| 
 | |
|    numargs = argc;
 | |
|    PyImport_AppendInittab("emb", &PyInit_emb);
 | |
| 
 | |
| These two lines initialize the ``numargs`` variable, and make the
 | |
| :func:`emb.numargs` function accessible to the embedded Python interpreter.
 | |
| With these extensions, the Python script can do things like
 | |
| 
 | |
| .. code-block:: python
 | |
| 
 | |
|    import emb
 | |
|    print("Number of arguments", emb.numargs())
 | |
| 
 | |
| In a real application, the methods will expose an API of the application to
 | |
| Python.
 | |
| 
 | |
| .. TODO: threads, code examples do not really behave well if errors happen
 | |
|    (what to watch out for)
 | |
| 
 | |
| 
 | |
| .. _embeddingincplusplus:
 | |
| 
 | |
| Embedding Python in C++
 | |
| =======================
 | |
| 
 | |
| It is also possible to embed Python in a C++ program; precisely how this is done
 | |
| will depend on the details of the C++ system used; in general you will need to
 | |
| write the main program in C++, and use the C++ compiler to compile and link your
 | |
| program.  There is no need to recompile Python itself using C++.
 | |
| 
 | |
| 
 | |
| .. _compiling:
 | |
| 
 | |
| Compiling and Linking under Unix-like systems
 | |
| =============================================
 | |
| 
 | |
| It is not necessarily trivial to find the right flags to pass to your
 | |
| compiler (and linker) in order to embed the Python interpreter into your
 | |
| application, particularly because Python needs to load library modules
 | |
| implemented as C dynamic extensions (:file:`.so` files) linked against
 | |
| it.
 | |
| 
 | |
| To find out the required compiler and linker flags, you can execute the
 | |
| :file:`python{X.Y}-config` script which is generated as part of the
 | |
| installation process (a :file:`python3-config` script may also be
 | |
| available).  This script has several options, of which the following will
 | |
| be directly useful to you:
 | |
| 
 | |
| * ``pythonX.Y-config --cflags`` will give you the recommended flags when
 | |
|   compiling:
 | |
| 
 | |
|   .. code-block:: shell-session
 | |
| 
 | |
|      $ /opt/bin/python3.11-config --cflags
 | |
|      -I/opt/include/python3.11 -I/opt/include/python3.11 -Wsign-compare  -DNDEBUG -g -fwrapv -O3 -Wall
 | |
| 
 | |
| * ``pythonX.Y-config --ldflags --embed`` will give you the recommended flags
 | |
|   when linking:
 | |
| 
 | |
|   .. code-block:: shell-session
 | |
| 
 | |
|      $ /opt/bin/python3.11-config --ldflags --embed
 | |
|      -L/opt/lib/python3.11/config-3.11-x86_64-linux-gnu -L/opt/lib -lpython3.11 -lpthread -ldl  -lutil -lm
 | |
| 
 | |
| .. note::
 | |
|    To avoid confusion between several Python installations (and especially
 | |
|    between the system Python and your own compiled Python), it is recommended
 | |
|    that you use the absolute path to :file:`python{X.Y}-config`, as in the above
 | |
|    example.
 | |
| 
 | |
| If this procedure doesn't work for you (it is not guaranteed to work for
 | |
| all Unix-like platforms; however, we welcome :ref:`bug reports <reporting-bugs>`)
 | |
| you will have to read your system's documentation about dynamic linking and/or
 | |
| examine Python's :file:`Makefile` (use :func:`sysconfig.get_makefile_filename`
 | |
| to find its location) and compilation
 | |
| options.  In this case, the :mod:`sysconfig` module is a useful tool to
 | |
| programmatically extract the configuration values that you will want to
 | |
| combine together.  For example:
 | |
| 
 | |
| .. code-block:: pycon
 | |
| 
 | |
|    >>> import sysconfig
 | |
|    >>> sysconfig.get_config_var('LIBS')
 | |
|    '-lpthread -ldl  -lutil'
 | |
|    >>> sysconfig.get_config_var('LINKFORSHARED')
 | |
|    '-Xlinker -export-dynamic'
 | |
| 
 | |
| 
 | |
| .. XXX similar documentation for Windows missing
 | 
