__slots__
Python中的目的是什么- 特别是关于我何时想要使用它,何时不想使用它?
在Python中,目的是__slots__
什么以及应该避免这种情况的案例是什么?
特殊属性__slots__
允许您显式声明您希望对象实例具有哪些实例属性,并具有预期结果:
更快的属性访问.
内存节省空间.
节省的空间来自
在槽中存储值引用而不是__dict__
.
如果父类拒绝它们并且您声明,则拒绝__dict__
并__weakref__
创建__slots__
.
小警告,您应该只在继承树中声明一次特定的插槽.例如:
class Base: __slots__ = 'foo', 'bar' class Right(Base): __slots__ = 'baz', class Wrong(Base): __slots__ = 'foo', 'bar', 'baz' # redundant foo and bar
Python doesn't object when you get this wrong (it probably should), problems might not otherwise manifest, but your objects will take up more space than they otherwise should.
>>> from sys import getsizeof >>> getsizeof(Right()), getsizeof(Wrong()) (64, 80)
The biggest caveat is for multiple inheritance - multiple "parent classes with nonempty slots" cannot be combined.
To accommodate this restriction, follow best practices: Factor out all but one or all parents' abstraction which their concrete class respectively and your new concrete class collectively will inherit from - giving the abstraction(s) empty slots (just like abstract base classes in the standard library).
See section on multiple inheritance below for an example.
To have attributes named in __slots__
to actually be stored in slots instead of a __dict__
, a class must inherit from object
.
要防止创建a __dict__
,必须继承,object
并且继承中的所有类都必须声明,__slots__
并且它们都不能有'__dict__'
条目.
如果你想继续阅读,有很多细节.
__slots__
:更快的属性访问.Python的创建者Guido van Rossum 声称他实际上是__slots__
为了更快的属性访问而创建的.
显示可衡量的显着更快访问是微不足道的:
import timeit class Foo(object): __slots__ = 'foo', class Bar(object): pass slotted = Foo() not_slotted = Bar() def get_set_delete_fn(obj): def get_set_delete(): obj.foo = 'foo' obj.foo del obj.foo return get_set_delete
和
>>> min(timeit.repeat(get_set_delete_fn(slotted))) 0.2846834529991611 >>> min(timeit.repeat(get_set_delete_fn(not_slotted))) 0.3664822799983085
在Ubuntu上,Python 3.5的插槽访问速度提高了近30%.
>>> 0.3664822799983085 / 0.2846834529991611 1.2873325658284342
在Windows上的Python 2中,我测得它的速度提高了约15%.
__slots__
:内存节省另一个目的__slots__
是减少每个对象实例占用的内存空间.
My own contribution to the documentation clearly states the reasons behind this:
The space saved over using
__dict__
can be significant.
SQLAlchemy attributes a lot of memory savings to __slots__
.
To verify this, using the Anaconda distribution of Python 2.7 on Ubuntu Linux, with guppy.hpy
(aka heapy) and sys.getsizeof
, the size of a class instance without __slots__
declared, and nothing else, is 64 bytes. That does not include the __dict__
. Thank you Python for lazy evaluation again, the __dict__
is apparently not called into existence until it is referenced, but classes without data are usually useless. When called into existence, the __dict__
attribute is a minimum of 280 bytes additionally.
In contrast, a class instance with __slots__
declared to be ()
(no data) is only 16 bytes, and 56 total bytes with one item in slots, 64 with two.
For 64 bit Python, I illustrate the memory consumption in bytes in Python 2.7 and 3.6, for __slots__
and __dict__
(no slots defined) for each point where the dict grows in 3.6 (except for 0, 1, and 2 attributes):
Python 2.7 Python 3.6 attrs __slots__ __dict__* __slots__ __dict__* | *(no slots defined) none 16 56 + 272† 16 56 + 112† | †if __dict__ referenced one 48 56 + 272 48 56 + 112 two 56 56 + 272 56 56 + 112 six 88 56 + 1040 88 56 + 152 11 128 56 + 1040 128 56 + 240 22 216 56 + 3344 216 56 + 408 43 384 56 + 3344 384 56 + 752
So, in spite of smaller dicts in Python 3, we see how nicely __slots__
scale for instances to save us memory, and that is a major reason you would want to use __slots__
.
Just for completeness of my notes, note that there is a one-time cost per slot in the class's namespace of 64 bytes in Python 2, and 72 bytes in Python 3, because slots use data descriptors like properties, called "members".
>>> Foo.foo>>> type(Foo.foo) >>> getsizeof(Foo.foo) 72
__slots__
:To deny the creation of a __dict__
, you must subclass object
:
class Base(object): __slots__ = ()
now:
>>> b = Base() >>> b.a = 'a' Traceback (most recent call last): File "", line 1, in b.a = 'a' AttributeError: 'Base' object has no attribute 'a'
Or subclass another class that defines __slots__
class Child(Base): __slots__ = ('a',)
and now:
c = Child() c.a = 'a'
but:
>>> c.b = 'b' Traceback (most recent call last): File "", line 1, in c.b = 'b' AttributeError: 'Child' object has no attribute 'b'
To allow __dict__
creation while subclassing slotted objects, just add '__dict__'
to the __slots__
(note that slots are ordered, and you shouldn't repeat slots that are already in parent classes):
class SlottedWithDict(Child): __slots__ = ('__dict__', 'b') swd = SlottedWithDict() swd.a = 'a' swd.b = 'b' swd.c = 'c'
and
>>> swd.__dict__ {'c': 'c'}
Or you don't even need to declare __slots__
in your subclass, and you will still use slots from the parents, but not restrict the creation of a __dict__
:
class NoSlots(Child): pass ns = NoSlots() ns.a = 'a' ns.b = 'b'
And:
>>> ns.__dict__ {'b': 'b'}
However, __slots__
may cause problems for multiple inheritance:
class BaseA(object): __slots__ = ('a',) class BaseB(object): __slots__ = ('b',)
Because creating a child class from parents with both non-empty slots fails:
>>> class Child(BaseA, BaseB): __slots__ = () Traceback (most recent call last): File "", line 1, in class Child(BaseA, BaseB): __slots__ = () TypeError: Error when calling the metaclass bases multiple bases have instance lay-out conflict
If you run into this problem, You could just remove __slots__
from the parents, or if you have control of the parents, give them empty slots, or refactor to abstractions:
from abc import ABC class AbstractA(ABC): __slots__ = () class BaseA(AbstractA): __slots__ = ('a',) class AbstractB(ABC): __slots__ = () class BaseB(AbstractB): __slots__ = ('b',) class Child(AbstractA, AbstractB): __slots__ = ('a', 'b') c = Child() # no problem!
'__dict__'
to __slots__
to get dynamic assignment:class Foo(object): __slots__ = 'bar', 'baz', '__dict__'
and now:
>>> foo = Foo() >>> foo.boink = 'boink'
So with '__dict__'
in slots we lose some of the size benefits with the upside of having dynamic assignment and still having slots for the names we do expect.
When you inherit from an object that isn't slotted, you get the same sort of semantics when you use __slots__
- names that are in __slots__
point to slotted values, while any other values are put in the instance's __dict__
.
Avoiding __slots__
because you want to be able to add attributes on the fly is actually not a good reason - just add "__dict__"
to your __slots__
if this is required.
You can similarly add __weakref__
to __slots__
explicitly if you need that feature.
The namedtuple builtin make immutable instances that are very lightweight (essentially, the size of tuples) but to get the benefits, you need to do it yourself if you subclass them:
from collections import namedtuple class MyNT(namedtuple('MyNT', 'bar baz')): """MyNT is an immutable and lightweight object""" __slots__ = ()
usage:
>>> nt = MyNT('bar', 'baz') >>> nt.bar 'bar' >>> nt.baz 'baz'
And trying to assign an unexpected attribute raises an AttributeError
because we have prevented the creation of __dict__
:
>>> nt.quux = 'quux' Traceback (most recent call last): File "", line 1, in AttributeError: 'MyNT' object has no attribute 'quux'
You can allow __dict__
creation by leaving off __slots__ = ()
, but you can't use non-empty __slots__
with subtypes of tuple.
Even when non-empty slots are the same for multiple parents, they cannot be used together:
class Foo(object): __slots__ = 'foo', 'bar' class Bar(object): __slots__ = 'foo', 'bar' # alas, would work if empty, i.e. () >>> class Baz(Foo, Bar): pass Traceback (most recent call last): File "", line 1, in TypeError: Error when calling the metaclass bases multiple bases have instance lay-out conflict
Using an empty __slots__
in the parent seems to provide the most flexibility, allowing the child to choose to prevent or allow (by adding '__dict__'
to get dynamic assignment, see section above) the creation of a __dict__
:
class Foo(object): __slots__ = () class Bar(object): __slots__ = () class Baz(Foo, Bar): __slots__ = ('foo', 'bar') b = Baz() b.foo, b.bar = 'foo', 'bar'
You don't have to have slots - so if you add them, and remove them later, it shouldn't cause any problems.
Going out on a limb here: If you're composing mixins or using abstract base classes, which aren't intended to be instantiated, an empty __slots__
in those parents seems to be the best way to go in terms of flexibility for subclassers.
To demonstrate, first, let's create a class with code we'd like to use under multiple inheritance
class AbstractBase: __slots__ = () def __init__(self, a, b): self.a = a self.b = b def __repr__(self): return f'{type(self).__name__}({repr(self.a)}, {repr(self.b)})'
We could use the above directly by inheriting and declaring the expected slots:
class Foo(AbstractBase): __slots__ = 'a', 'b'
But we don't care about that, that's trivial single inheritance, we need another class we might also inherit from, maybe with a noisy attribute:
class AbstractBaseC: __slots__ = () @property def c(self): print('getting c!') return self._c @c.setter def c(self, arg): print('setting c!') self._c = arg
Now if both bases had nonempty slots, we couldn't do the below. (In fact, if we wanted, we could have given AbstractBase
nonempty slots a and b, and left them out of the below declaration - leaving them in would be wrong):
class Concretion(AbstractBase, AbstractBaseC): __slots__ = 'a b _c'.split()
And now we have functionality from both via multiple inheritance, and can still deny __dict__
and __weakref__
instantiation:
>>> c = Concretion('a', 'b') >>> c.c = c setting c! >>> c.c getting c! Concretion('a', 'b') >>> c.d = 'd' Traceback (most recent call last): File "", line 1, in AttributeError: 'Concretion' object has no attribute 'd'
Avoid them when you want to perform __class__
assignment with another class that doesn't have them (and you can't add them) unless the slot layouts are identical. (I am very interested in learning who is doing this and why.)
Avoid them if you want to subclass variable length builtins like long, tuple, or str, and you want to add attributes to them.
Avoid them if you insist on providing default values via class attributes for instance variables.
You may be able to tease out further caveats from the rest of the __slots__
documentation (the 3.7 dev docs are the most current), which I have made significant recent contributions to.
The current top answers cite outdated information and are quite hand-wavy and miss the mark in some important ways.
__slots__
when instantiating lots of objects"I quote:
"You would want to use
__slots__
if you are going to instantiate a lot (hundreds, thousands) of objects of the same class."
Abstract Base Classes, for example, from the collections
module, are not instantiated, yet __slots__
are declared for them.
Why?
If a user wishes to deny __dict__
or __weakref__
creation, those things must not be available in the parent classes.
__slots__
contributes to reusability when creating interfaces or mixins.
It is true that many Python users aren't writing for reusability, but when you are, having the option to deny unnecessary space usage is valuable.
__slots__
doesn't break picklingWhen pickling a slotted object, you may find it complains with a misleading TypeError
:
>>> pickle.loads(pickle.dumps(f)) TypeError: a class that defines __slots__ without defining __getstate__ cannot be pickled
This is actually incorrect. This message comes from the oldest protocol, which is the default. You can select the latest protocol with the -1
argument. In Python 2.7 this would be 2
(which was introduced in 2.3), and in 3.6 it is 4
.
>>> pickle.loads(pickle.dumps(f, -1)) <__main__.Foo object at 0x1129C770>
in Python 2.7:
>>> pickle.loads(pickle.dumps(f, 2)) <__main__.Foo object at 0x1129C770>
in Python 3.6
>>> pickle.loads(pickle.dumps(f, 4)) <__main__.Foo object at 0x1129C770>
So I would keep this in mind, as it is a solved problem.
The first paragraph is half short explanation, half predictive. Here's the only part that actually answers the question
The proper use of
__slots__
is to save space in objects. Instead of having a dynamic dict that allows adding attributes to objects at anytime, there is a static structure which does not allow additions after creation. This saves the overhead of one dict for every object that uses slots
The second half is wishful thinking, and off the mark:
While this is sometimes a useful optimization, it would be completely unnecessary if the Python interpreter was dynamic enough so that it would only require the dict when there actually were additions to the object.
Python actually does something similar to this, only creating the __dict__
when it is accessed, but creating lots of objects with no data is fairly ridiculous.
The second paragraph oversimplifies and misses actual reasons to avoid __slots__
. The below is not a real reason to avoid slots (for actual reasons, see the rest of my answer above.):
They change the behavior of the objects that have slots in a way that can be abused by control freaks and static typing weenies.
It then goes on to discuss other ways of accomplishing that perverse goal with Python, not discussing anything to do with __slots__
.
The third paragraph is more wishful thinking. Together it is mostly off-the-mark content that the answerer didn't even author and contributes to ammunition for critics of the site.
Memory usage evidenceCreate some normal objects and slotted objects:
>>> class Foo(object): pass >>> class Bar(object): __slots__ = ()
Instantiate a million of them:
>>> foos = [Foo() for f in xrange(1000000)] >>> bars = [Bar() for b in xrange(1000000)]
Inspect with guppy.hpy().heap()
:
>>> guppy.hpy().heap() Partition of a set of 2028259 objects. Total size = 99763360 bytes. Index Count % Size % Cumulative % Kind (class / dict of class) 0 1000000 49 64000000 64 64000000 64 __main__.Foo 1 169 0 16281480 16 80281480 80 list 2 1000000 49 16000000 16 96281480 97 __main__.Bar 3 12284 1 987472 1 97268952 97 str ...
Access the regular objects and their __dict__
and inspect again:
>>> for f in foos: ... f.__dict__ >>> guppy.hpy().heap() Partition of a set of 3028258 objects. Total size = 379763480 bytes. Index Count % Size % Cumulative % Kind (class / dict of class) 0 1000000 33 280000000 74 280000000 74 dict of __main__.Foo 1 1000000 33 64000000 17 344000000 91 __main__.Foo 2 169 0 16281480 4 360281480 95 list 3 1000000 33 16000000 4 376281480 99 __main__.Bar 4 12284 0 987472 0 377268952 99 str ...
This is consistent with the history of Python, from Unifying types and classes in Python 2.2
If you subclass a built-in type, extra space is automatically added to the instances to accomodate
__dict__
and__weakrefs__
. (The__dict__
is not initialized until you use it though, so you shouldn't worry about the space occupied by an empty dictionary for each instance you create.) If you don't need this extra space, you can add the phrase "__slots__ = []
" to your class.
引用雅各布哈伦:
正确使用
__slots__
是为了节省物体的空间.除了拥有允许随时向对象添加属性的动态dict之外,还有一个静态结构,它不允许在创建后添加.[这种用法__slots__
消除了每个对象的一个dict的开销.]虽然这有时是一个有用的优化,但如果Python解释器足够动态,那么它只需要dict时就完全没有必要了.宾语.不幸的是,插槽有副作用.它们改变了具有插槽的对象的行为,这种方式可能被控制怪物和静态类型中断所滥用.这很糟糕,因为控制狂应该滥用元类,而静态类型weenies应该滥用装饰器,因为在Python中,应该只有一种明显的做法.
使CPython足够智能以处理节省空间而
__slots__
不是一项重大任务,这可能就是为什么它不在P3k(尚未)的更改列表中.
你会想使用__slots__
,如果你要实例化同一个类的对象很多(几百,几千).__slots__
仅作为内存优化工具存在.
非常不鼓励使用__slots__
约束属性创建,并且通常你想避免它,因为它打破了pickle,以及python的一些其他内省功能.
每个python对象都有一个__dict__
atttribute,它是一个包含所有其他属性的字典.例如,当你键入self.attr
python实际上正在做self.__dict__['attr']
.您可以想象使用字典存储属性需要一些额外的空间和时间来访问它.
但是,在使用时__slots__
,为该类创建的任何对象都不具有__dict__
属性.相反,所有属性访问都是通过指针直接完成的.
因此,如果想要一个C风格的结构而不是一个完整的类,您可以使用它__slots__
来压缩对象的大小并减少属性访问时间.一个很好的例子是包含属性x和y的Point类.如果您要获得很多积分,可以尝试使用__slots__
以节省一些内存.
除了其他答案,这里有一个使用示例__slots__
:
>>> class Test(object): #Must be new-style class! ... __slots__ = ['x', 'y'] ... >>> pt = Test() >>> dir(pt) ['__class__', '__delattr__', '__doc__', '__getattribute__', '__hash__', '__init__', '__module__', '__new__', '__reduce__', '__reduce_ex__', '__repr__', '__setattr__', '__slots__', '__str__', 'x', 'y'] >>> pt.x Traceback (most recent call last): File "", line 1, in AttributeError: x >>> pt.x = 1 >>> pt.x 1 >>> pt.z = 2 Traceback (most recent call last): File " ", line 1, in AttributeError: 'Test' object has no attribute 'z' >>> pt.__dict__ Traceback (most recent call last): File " ", line 1, in AttributeError: 'Test' object has no attribute '__dict__' >>> pt.__slots__ ['x', 'y']
因此,要实现__slots__
它,它只需要一个额外的行(并使你的类成为一个新式的类,如果它还没有).这样,您可以将这些类的内存占用量减少5倍,代价是必须编写自定义pickle代码,如果有必要的话.
插槽对于库调用非常有用,可以在进行函数调用时消除"命名方法调度".这在SWIG 文档中提到.对于希望减少使用插槽的常用函数的函数开销的高性能库,要快得多.
现在这可能与OPs问题没有直接关系.它与构建扩展相关,而不是在对象上使用slot语法.但它确实有助于完成插槽使用的图片以及它们背后的一些推理.
类实例的属性有3个属性:实例,属性名称和属性值.
在常规属性访问中,实例充当字典,属性的名称充当该字典查找值的键.
实例(属性) - >值
在__slots__访问中,属性的名称充当字典,实例充当字典查找值的键.
attribute(instance) - > value
在flyweight模式中,属性的名称充当字典,并且值充当查找实例的字典中的键.
attribute(value) - > instance