\section{\module{weakref} --- Weak references} \declaremodule{extension}{weakref} \modulesynopsis{Support for weak references and weak dictionaries.} \moduleauthor{Fred L. Drake, Jr.}{fdrake@acm.org} \moduleauthor{Neil Schemenauer}{nas@arctrix.com} \moduleauthor{Martin von L\"owis}{martin@loewis.home.cs.tu-berlin.de} \sectionauthor{Fred L. Drake, Jr.}{fdrake@acm.org} \versionadded{2.1} % When making changes to the examples in this file, be sure to update % Lib/test/test_weakref.py::libreftest too! The \module{weakref} module allows the Python programmer to create \dfn{weak references} to objects. In the following, the term \dfn{referent} means the object which is referred to by a weak reference. A weak reference to an object is not enough to keep the object alive: when the only remaining references to a referent are weak references, garbage collection is free to destroy the referent and reuse its memory for something else. A primary use for weak references is to implement caches or mappings holding large objects, where it's desired that a large object not be kept alive solely because it appears in a cache or mapping. For example, if you have a number of large binary image objects, you may wish to associate a name with each. If you used a Python dictionary to map names to images, or images to names, the image objects would remain alive just because they appeared as values or keys in the dictionaries. The \class{WeakKeyDictionary} and \class{WeakValueDictionary} classes supplied by the \module{weakref} module are an alternative, using weak references to construct mappings that don't keep objects alive solely because they appear in the mapping objects. If, for example, an image object is a value in a \class{WeakValueDictionary}, then when the last remaining references to that image object are the weak references held by weak mappings, garbage collection can reclaim the object, and its corresponding entries in weak mappings are simply deleted. \class{WeakKeyDictionary} and \class{WeakValueDictionary} use weak references in their implementation, setting up callback functions on the weak references that notify the weak dictionaries when a key or value has been reclaimed by garbage collection. Most programs should find that using one of these weak dictionary types is all they need -- it's not usually necessary to create your own weak references directly. The low-level machinery used by the weak dictionary implementations is exposed by the \module{weakref} module for the benefit of advanced uses. Not all objects can be weakly referenced; those objects which can include class instances, functions written in Python (but not in C), methods (both bound and unbound), sets, frozensets, file objects, generators, type objects, DBcursor objects from the \module{bsddb} module, sockets, arrays, deques, and regular expression pattern objects. \versionchanged[Added support for files, sockets, arrays, and patterns]{2.4} Several builtin types such as \class{list} and \class{dict} do not directly support weak references but can add support through subclassing: \begin{verbatim} class Dict(dict): pass obj = Dict(red=1, green=2, blue=3) # this object is weak referencable \end{verbatim} Extension types can easily be made to support weak references; see section \ref{weakref-extension}, ``Weak References in Extension Types,'' for more information. \begin{classdesc}{ref}{object\optional{, callback}} Return a weak reference to \var{object}. The original object can be retrieved by calling the reference object if the referent is still alive; if the referent is no longer alive, calling the reference object will cause \constant{None} to be returned. If \var{callback} is provided and not \constant{None}, and the returned weakref object is still alive, the callback will be called when the object is about to be finalized; the weak reference object will be passed as the only parameter to the callback; the referent will no longer be available. It is allowable for many weak references to be constructed for the same object. Callbacks registered for each weak reference will be called from the most recently registered callback to the oldest registered callback. Exceptions raised by the callback will be noted on the standard error output, but cannot be propagated; they are handled in exactly the same way as exceptions raised from an object's \method{__del__()} method. Weak references are hashable if the \var{object} is hashable. They will maintain their hash value even after the \var{object} was deleted. If \function{hash()} is called the first time only after the \var{object} was deleted, the call will raise \exception{TypeError}. Weak references support tests for equality, but not ordering. If the referents are still alive, two references have the same equality relationship as their referents (regardless of the \var{callback}). If either referent has been deleted, the references are equal only if the reference objects are the same object. \versionchanged[This is now a subclassable type rather than a factory function; it derives from \class{object}] {2.4} \end{classdesc} \begin{funcdesc}{proxy}{object\optional{, callback}} Return a proxy to \var{object} which uses a weak reference. This supports use of the proxy in most contexts instead of requiring the explicit dereferencing used with weak reference objects. The returned object will have a type of either \code{ProxyType} or \code{CallableProxyType}, depending on whether \var{object} is callable. Proxy objects are not hashable regardless of the referent; this avoids a number of problems related to their fundamentally mutable nature, and prevent their use as dictionary keys. \var{callback} is the same as the parameter of the same name to the \function{ref()} function. \end{funcdesc} \begin{funcdesc}{getweakrefcount}{object} Return the number of weak references and proxies which refer to \var{object}. \end{funcdesc} \begin{funcdesc}{getweakrefs}{object} Return a list of all weak reference and proxy objects which refer to \var{object}. \end{funcdesc} \begin{classdesc}{WeakKeyDictionary}{\optional{dict}} Mapping class that references keys weakly. Entries in the dictionary will be discarded when there is no longer a strong reference to the key. This can be used to associate additional data with an object owned by other parts of an application without adding attributes to those objects. This can be especially useful with objects that override attribute accesses. \note{Caution: Because a \class{WeakKeyDictionary} is built on top of a Python dictionary, it must not change size when iterating over it. This can be difficult to ensure for a \class{WeakKeyDictionary} because actions performed by the program during iteration may cause items in the dictionary to vanish "by magic" (as a side effect of garbage collection).} \end{classdesc} \begin{classdesc}{WeakValueDictionary}{\optional{dict}} Mapping class that references values weakly. Entries in the dictionary will be discarded when no strong reference to the value exists any more. \note{Caution: Because a \class{WeakValueDictionary} is built on top of a Python dictionary, it must not change size when iterating over it. This can be difficult to ensure for a \class{WeakValueDictionary} because actions performed by the program during iteration may cause items in the dictionary to vanish "by magic" (as a side effect of garbage collection).} \end{classdesc} \begin{datadesc}{ReferenceType} The type object for weak references objects. \end{datadesc} \begin{datadesc}{ProxyType} The type object for proxies of objects which are not callable. \end{datadesc} \begin{datadesc}{CallableProxyType} The type object for proxies of callable objects. \end{datadesc} \begin{datadesc}{ProxyTypes} Sequence containing all the type objects for proxies. This can make it simpler to test if an object is a proxy without being dependent on naming both proxy types. \end{datadesc} \begin{excdesc}{ReferenceError} Exception raised when a proxy object is used but the underlying object has been collected. This is the same as the standard \exception{ReferenceError} exception. \end{excdesc} \begin{seealso} \seepep{0205}{Weak References}{The proposal and rationale for this feature, including links to earlier implementations and information about similar features in other languages.} \end{seealso} \subsection{Weak Reference Objects \label{weakref-objects}} Weak reference objects have no attributes or methods, but do allow the referent to be obtained, if it still exists, by calling it: \begin{verbatim} >>> import weakref >>> class Object: ... pass ... >>> o = Object() >>> r = weakref.ref(o) >>> o2 = r() >>> o is o2 True \end{verbatim} If the referent no longer exists, calling the reference object returns \constant{None}: \begin{verbatim} >>> del o, o2 >>> print r() None \end{verbatim} Testing that a weak reference object is still live should be done using the expression \code{\var{ref}() is not None}. Normally, application code that needs to use a reference object should follow this pattern: \begin{verbatim} # r is a weak reference object o = r() if o is None: # referent has been garbage collected print "Object has been deallocated; can't frobnicate." else: print "Object is still live!" o.do_something_useful() \end{verbatim} Using a separate test for ``liveness'' creates race conditions in threaded applications; another thread can cause a weak reference to become invalidated before the weak reference is called; the idiom shown above is safe in threaded applications as well as single-threaded applications. Specialized versions of \class{ref} objects can be created through subclassing. This is used in the implementation of the \class{WeakValueDictionary} to reduce the memory overhead for each entry in the mapping. This may be most useful to associate additional information with a reference, but could also be used to insert additional processing on calls to retrieve the referent. This example shows how a subclass of \class{ref} can be used to store additional information about an object and affect the value that's returned when the referent is accessed: \begin{verbatim} import weakref class ExtendedRef(weakref.ref): def __init__(self, ob, callback=None, **annotations): super(ExtendedRef, self).__init__(ob, callback) self.__counter = 0 for k, v in annotations.iteritems(): setattr(self, k, v) def __call__(self): """Return a pair containing the referent and the number of times the reference has been called. """ ob = super(ExtendedRef, self).__call__() if ob is not None: self.__counter += 1 ob = (ob, self.__counter) return ob \end{verbatim} \subsection{Example \label{weakref-example}} This simple example shows how an application can use objects IDs to retrieve objects that it has seen before. The IDs of the objects can then be used in other data structures without forcing the objects to remain alive, but the objects can still be retrieved by ID if they do. % Example contributed by Tim Peters. \begin{verbatim} import weakref _id2obj_dict = weakref.WeakValueDictionary() def remember(obj): oid = id(obj) _id2obj_dict[oid] = obj return oid def id2obj(oid): return _id2obj_dict[oid] \end{verbatim} \subsection{Weak References in Extension Types \label{weakref-extension}} One of the goals of the implementation is to allow any type to participate in the weak reference mechanism without incurring the overhead on those objects which do not benefit by weak referencing (such as numbers). For an object to be weakly referencable, the extension must include a \ctype{PyObject*} field in the instance structure for the use of the weak reference mechanism; it must be initialized to \NULL{} by the object's constructor. It must also set the \member{tp_weaklistoffset} field of the corresponding type object to the offset of the field. Also, it needs to add \constant{Py_TPFLAGS_HAVE_WEAKREFS} to the tp_flags slot. For example, the instance type is defined with the following structure: \begin{verbatim} typedef struct { PyObject_HEAD PyClassObject *in_class; /* The class object */ PyObject *in_dict; /* A dictionary */ PyObject *in_weakreflist; /* List of weak references */ } PyInstanceObject; \end{verbatim} The statically-declared type object for instances is defined this way: \begin{verbatim} PyTypeObject PyInstance_Type = { PyObject_HEAD_INIT(&PyType_Type) 0, "module.instance", /* Lots of stuff omitted for brevity... */ Py_TPFLAGS_DEFAULT | Py_TPFLAGS_HAVE_WEAKREFS /* tp_flags */ 0, /* tp_doc */ 0, /* tp_traverse */ 0, /* tp_clear */ 0, /* tp_richcompare */ offsetof(PyInstanceObject, in_weakreflist), /* tp_weaklistoffset */ }; \end{verbatim} The type constructor is responsible for initializing the weak reference list to \NULL: \begin{verbatim} static PyObject * instance_new() { /* Other initialization stuff omitted for brevity */ self->in_weakreflist = NULL; return (PyObject *) self; } \end{verbatim} The only further addition is that the destructor needs to call the weak reference manager to clear any weak references. This should be done before any other parts of the destruction have occurred, but is only required if the weak reference list is non-\NULL: \begin{verbatim} static void instance_dealloc(PyInstanceObject *inst) { /* Allocate temporaries if needed, but do not begin destruction just yet. */ if (inst->in_weakreflist != NULL) PyObject_ClearWeakRefs((PyObject *) inst); /* Proceed with object destruction normally. */ } \end{verbatim}