mirror of
				https://github.com/python/cpython.git
				synced 2025-11-03 23:21:29 +00:00 
			
		
		
		
	
		
			
				
	
	
		
			316 lines
		
	
	
	
		
			12 KiB
		
	
	
	
		
			TeX
		
	
	
	
	
	
			
		
		
	
	
			316 lines
		
	
	
	
		
			12 KiB
		
	
	
	
		
			TeX
		
	
	
	
	
	
\chapter{Embedding Python in Another Application
 | 
						|
     \label{embedding}}
 | 
						|
 | 
						|
The previous chapters discussed how to extend Python, that is, how to
 | 
						|
extend the functionality of Python by attaching a library of C
 | 
						|
functions to it.  It is also possible to do it the other way around:
 | 
						|
enrich your C/\Cpp{} application by embedding Python in it.  Embedding
 | 
						|
provides your application with the ability to implement some of the
 | 
						|
functionality of your application in Python rather than C or \Cpp.
 | 
						|
This can be used for many purposes; one example would be to allow
 | 
						|
users to tailor the application to their needs by writing some scripts
 | 
						|
in Python.  You can also use it yourself if some of the functionality
 | 
						|
can be written in Python more easily.
 | 
						|
 | 
						|
Embedding Python is similar to extending it, but not quite.  The
 | 
						|
difference is that when you extend Python, the main program of the
 | 
						|
application is still the Python interpreter, while if you embed
 | 
						|
Python, the main program may have nothing to do with Python ---
 | 
						|
instead, some parts of the application occasionally call the Python
 | 
						|
interpreter to run some Python code.
 | 
						|
 | 
						|
So if you are embedding Python, you are providing your own main
 | 
						|
program.  One of the things this main program has to do is initialize
 | 
						|
the Python interpreter.  At the very least, you have to call the
 | 
						|
function \cfunction{Py_Initialize()} (on Mac OS, call
 | 
						|
\cfunction{PyMac_Initialize()} instead).  There are optional calls to
 | 
						|
pass command line arguments to Python.  Then later you can call the
 | 
						|
interpreter from any part of the application.
 | 
						|
 | 
						|
There are several different ways to call the interpreter: you can pass
 | 
						|
a string containing Python statements to
 | 
						|
\cfunction{PyRun_SimpleString()}, or you can pass a stdio file pointer
 | 
						|
and a file name (for identification in error messages only) to
 | 
						|
\cfunction{PyRun_SimpleFile()}.  You can also call the lower-level
 | 
						|
operations described in the previous chapters to construct and use
 | 
						|
Python objects.
 | 
						|
 | 
						|
A simple demo of embedding Python can be found in the directory
 | 
						|
\file{Demo/embed/} of the source distribution.
 | 
						|
 | 
						|
 | 
						|
\begin{seealso}
 | 
						|
  \seetitle[../api/api.html]{Python/C API Reference Manual}{The
 | 
						|
            details of Python's C interface are given in this manual.
 | 
						|
            A great deal of necessary information can be found here.}
 | 
						|
\end{seealso}
 | 
						|
 | 
						|
 | 
						|
\section{Very High Level Embedding
 | 
						|
         \label{high-level-embedding}}
 | 
						|
 | 
						|
The simplest form of embedding Python is the use of the very
 | 
						|
high level interface. This interface is intended to execute a
 | 
						|
Python script without needing to interact with the application
 | 
						|
directly. This can for example be used to perform some operation
 | 
						|
on a file.
 | 
						|
 | 
						|
\begin{verbatim}
 | 
						|
#include <Python.h>
 | 
						|
 | 
						|
int
 | 
						|
main(int argc, char *argv[])
 | 
						|
{
 | 
						|
  Py_Initialize();
 | 
						|
  PyRun_SimpleString("from time import time,ctime\n"
 | 
						|
                     "print 'Today is',ctime(time())\n");
 | 
						|
  Py_Finalize();
 | 
						|
  return 0;
 | 
						|
}
 | 
						|
\end{verbatim}
 | 
						|
 | 
						|
The above code first initializes the Python interpreter with
 | 
						|
\cfunction{Py_Initialize()}, followed by the execution of a hard-coded
 | 
						|
Python script that print the date and time.  Afterwards, the
 | 
						|
\cfunction{Py_Finalize()} call shuts the interpreter down, followed by
 | 
						|
the end of the program.  In a real program, you may want to get the
 | 
						|
Python script from another source, perhaps a text-editor routine, a
 | 
						|
file, or a database.  Getting the Python code from a file can better
 | 
						|
be done by using the \cfunction{PyRun_SimpleFile()} function, which
 | 
						|
saves you the trouble of allocating memory space and loading the file
 | 
						|
contents.
 | 
						|
 | 
						|
 | 
						|
\section{Beyond Very High Level Embedding: An overview
 | 
						|
         \label{lower-level-embedding}}
 | 
						|
 | 
						|
The high level interface gives you the ability to execute
 | 
						|
arbitrary pieces of Python code from your application, but
 | 
						|
exchanging data values is quite cumbersome to say the least. If
 | 
						|
you want that, you should use lower level calls. At the cost of
 | 
						|
having to write more C code, you can achieve almost anything.
 | 
						|
 | 
						|
It should be noted that extending Python and embedding Python
 | 
						|
is quite the same activity, despite the different intent. Most
 | 
						|
topics discussed in the previous chapters are still valid. To
 | 
						|
show this, consider what the extension code from Python to C
 | 
						|
really does:
 | 
						|
 | 
						|
\begin{enumerate}
 | 
						|
    \item Convert data values from Python to C,
 | 
						|
    \item Perform a function call to a C routine using the
 | 
						|
        converted values, and
 | 
						|
    \item Convert the data values from the call from C to Python.
 | 
						|
\end{enumerate}
 | 
						|
 | 
						|
When embedding Python, the interface code does:
 | 
						|
 | 
						|
\begin{enumerate}
 | 
						|
    \item Convert data values from C to Python,
 | 
						|
    \item Perform a function call to a Python interface routine
 | 
						|
        using the converted values, and
 | 
						|
    \item Convert the data values from the call from Python to C.
 | 
						|
\end{enumerate}
 | 
						|
 | 
						|
As you can see, the data conversion steps are simply swapped to
 | 
						|
accomodate the different direction of the cross-language transfer.
 | 
						|
The only difference is the routine that you call between both
 | 
						|
data conversions. When extending, you call a C routine, when
 | 
						|
embedding, you call a Python routine.
 | 
						|
 | 
						|
This chapter will not discuss how to convert data from Python
 | 
						|
to C and vice versa.  Also, proper use of references and dealing
 | 
						|
with errors is assumed to be understood.  Since these aspects do not
 | 
						|
differ from extending the interpreter, you can refer to earlier
 | 
						|
chapters for the required information.
 | 
						|
 | 
						|
 | 
						|
\section{Pure Embedding
 | 
						|
         \label{pure-embedding}}
 | 
						|
 | 
						|
The first program aims to execute a function in a Python
 | 
						|
script. Like in the section about the very high level interface,
 | 
						|
the Python interpreter does not directly interact with the
 | 
						|
application (but that will change in th next section).
 | 
						|
 | 
						|
The code to run a function defined in a Python script is:
 | 
						|
 | 
						|
\verbatiminput{run-func.c}
 | 
						|
 | 
						|
This code loads a Python script using \code{argv[1]}, and calls the
 | 
						|
function named in \code{argv[2]}.  Its integer arguments are the other
 | 
						|
values of the \code{argv} array.  If you compile and link this
 | 
						|
program (let's call the finished executable \program{call}), and use
 | 
						|
it to execute a Python script, such as:
 | 
						|
 | 
						|
\begin{verbatim}
 | 
						|
def multiply(a,b):
 | 
						|
    print "Will compute", a, "times", b
 | 
						|
    c = 0
 | 
						|
    for i in range(0, a):
 | 
						|
        c = c + b
 | 
						|
    return c
 | 
						|
\end{verbatim}
 | 
						|
 | 
						|
then the result should be:
 | 
						|
 | 
						|
\begin{verbatim}
 | 
						|
$ call multiply multiply 3 2
 | 
						|
Will compute 3 times 2
 | 
						|
Result of call: 6
 | 
						|
\end{verbatim} % $
 | 
						|
 | 
						|
Although the program is quite large for its functionality, most of the
 | 
						|
code is for data conversion between Python and C, and for error
 | 
						|
reporting.  The interesting part with respect to embedding Python
 | 
						|
starts with
 | 
						|
 | 
						|
\begin{verbatim}
 | 
						|
    Py_Initialize();
 | 
						|
    pName = PyString_FromString(argv[1]);
 | 
						|
    /* Error checking of pName left out */
 | 
						|
    pModule = PyImport_Import(pName);
 | 
						|
\end{verbatim}
 | 
						|
 | 
						|
After initializing the interpreter, the script is loaded using
 | 
						|
\cfunction{PyImport_Import()}.  This routine needs a Python string
 | 
						|
as its argument, which is constructed using the
 | 
						|
\cfunction{PyString_FromString()} data conversion routine.
 | 
						|
 | 
						|
\begin{verbatim}
 | 
						|
    pFunc = PyObject_GetAttrString(pModule, argv[2]);
 | 
						|
    /* pFunc is a new reference */
 | 
						|
 | 
						|
    if (pFunc && PyCallable_Check(pFunc)) {
 | 
						|
        ...
 | 
						|
    }
 | 
						|
    Py_XDECREF(pFunc);
 | 
						|
\end{verbatim}
 | 
						|
 | 
						|
Once the script is loaded, the name we're looking for is retrieved
 | 
						|
using \cfunction{PyObject_GetAttrString()}.  If the name exists, and
 | 
						|
the object returned is callable, you can safely assume that it is a
 | 
						|
function.  The program then proceeds by constructing a tuple of
 | 
						|
arguments as normal.  The call to the Python function is then made
 | 
						|
with:
 | 
						|
 | 
						|
\begin{verbatim}
 | 
						|
    pValue = PyObject_CallObject(pFunc, pArgs);
 | 
						|
\end{verbatim}
 | 
						|
 | 
						|
Upon return of the function, \code{pValue} is either \NULL{} or it
 | 
						|
contains a reference to the return value of the function.  Be sure to
 | 
						|
release the reference after examining the value.
 | 
						|
 | 
						|
 | 
						|
\section{Extending Embedded Python
 | 
						|
         \label{extending-with-embedding}}
 | 
						|
 | 
						|
Until now, the embedded Python interpreter had no access to
 | 
						|
functionality from the application itself.  The Python API allows this
 | 
						|
by extending the embedded interpreter.  That is, the embedded
 | 
						|
interpreter gets extended with routines provided by the application.
 | 
						|
While it sounds complex, it is not so bad.  Simply forget for a while
 | 
						|
that the application starts the Python interpreter.  Instead, consider
 | 
						|
the application to be a set of subroutines, and write some glue code
 | 
						|
that gives Python access to those routines, just like you would write
 | 
						|
a normal Python extension.  For example:
 | 
						|
 | 
						|
\begin{verbatim}
 | 
						|
static int numargs=0;
 | 
						|
 | 
						|
/* Return the number of arguments of the application command line */
 | 
						|
static PyObject*
 | 
						|
emb_numargs(PyObject *self, PyObject *args)
 | 
						|
{
 | 
						|
    if(!PyArg_ParseTuple(args, ":numargs"))
 | 
						|
        return NULL;
 | 
						|
    return Py_BuildValue("i", numargs);
 | 
						|
}
 | 
						|
 | 
						|
static PyMethodDef EmbMethods[] = {
 | 
						|
    {"numargs", emb_numargs, METH_VARARGS,
 | 
						|
     "Return the number of arguments received by the process."},
 | 
						|
    {NULL, NULL, 0, NULL}
 | 
						|
};
 | 
						|
\end{verbatim}
 | 
						|
 | 
						|
Insert the above code just above the \cfunction{main()} function.
 | 
						|
Also, insert the following two statements directly after
 | 
						|
\cfunction{Py_Initialize()}:
 | 
						|
 | 
						|
\begin{verbatim}
 | 
						|
    numargs = argc;
 | 
						|
    Py_InitModule("emb", EmbMethods);
 | 
						|
\end{verbatim}
 | 
						|
 | 
						|
These two lines initialize the \code{numargs} variable, and make the
 | 
						|
\function{emb.numargs()} function accessible to the embedded Python
 | 
						|
interpreter.  With these extensions, the Python script can do things
 | 
						|
like
 | 
						|
 | 
						|
\begin{verbatim}
 | 
						|
import emb
 | 
						|
print "Number of arguments", emb.numargs()
 | 
						|
\end{verbatim}
 | 
						|
 | 
						|
In a real application, the methods will expose an API of the
 | 
						|
application to Python.
 | 
						|
 | 
						|
 | 
						|
%\section{For the future}
 | 
						|
%
 | 
						|
%You don't happen to have a nice library to get textual
 | 
						|
%equivalents of numeric values do you :-) ?
 | 
						|
%Callbacks here ? (I may be using information from that section
 | 
						|
%?!)
 | 
						|
%threads
 | 
						|
%code examples do not really behave well if errors happen
 | 
						|
% (what to watch out for)
 | 
						|
 | 
						|
 | 
						|
\section{Embedding Python in \Cpp
 | 
						|
     \label{embeddingInCplusplus}}
 | 
						|
 | 
						|
It is also possible to embed Python in a \Cpp{} program; precisely how this
 | 
						|
is done will depend on the details of the \Cpp{} system used; in general you
 | 
						|
will need to write the main program in \Cpp, and use the \Cpp{} compiler
 | 
						|
to compile and link your program.  There is no need to recompile Python
 | 
						|
itself using \Cpp.
 | 
						|
 | 
						|
 | 
						|
\section{Linking Requirements
 | 
						|
         \label{link-reqs}}
 | 
						|
 | 
						|
While the \program{configure} script shipped with the Python sources
 | 
						|
will correctly build Python to export the symbols needed by
 | 
						|
dynamically linked extensions, this is not automatically inherited by
 | 
						|
applications which embed the Python library statically, at least on
 | 
						|
\UNIX.  This is an issue when the application is linked to the static
 | 
						|
runtime library (\file{libpython.a}) and needs to load dynamic
 | 
						|
extensions (implemented as \file{.so} files).
 | 
						|
 | 
						|
The problem is that some entry points are defined by the Python
 | 
						|
runtime solely for extension modules to use.  If the embedding
 | 
						|
application does not use any of these entry points, some linkers will
 | 
						|
not include those entries in the symbol table of the finished
 | 
						|
executable.  Some additional options are needed to inform the linker
 | 
						|
not to remove these symbols.
 | 
						|
 | 
						|
Determining the right options to use for any given platform can be
 | 
						|
quite difficult, but fortunately the Python configuration already has
 | 
						|
those values.  To retrieve them from an installed Python interpreter,
 | 
						|
start an interactive interpreter and have a short session like this:
 | 
						|
 | 
						|
\begin{verbatim}
 | 
						|
>>> import distutils.sysconfig
 | 
						|
>>> distutils.sysconfig.get_config_var('LINKFORSHARED')
 | 
						|
'-Xlinker -export-dynamic'
 | 
						|
\end{verbatim}
 | 
						|
\refstmodindex{distutils.sysconfig}
 | 
						|
 | 
						|
The contents of the string presented will be the options that should
 | 
						|
be used.  If the string is empty, there's no need to add any
 | 
						|
additional options.  The \constant{LINKFORSHARED} definition
 | 
						|
corresponds to the variable of the same name in Python's top-level
 | 
						|
\file{Makefile}.
 |