whatsnew-3.1.rst 43 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516517518519520521522523524525526527528529530531532533534535536537538539540541542543544545546547548549550551552553554555556557558559560561562563564565566567568569570571572573574575576577578579580581582583584585586587588589590591592593594595596597598599600601602603604605606607608609610611612613614615616617618619620621622623624625626627628629630631632633634635636637638639640641642643644645646647648649650651652653654655656657658659660661662663664665666667668669670671672673674675676677678679680681682683684685686687688689690691692693694695696697698699700701702703704705706707708709710711712713714715716717718719720721722723724725726727728729730731732733734735736737738739740741742743744745746747748749750751752753754755756757758759760761762763764765766767768769770771772773774775776777778779780781782783784785786787788789790791792793794795796797798799800801802803804805806807808809810811812813814815816817818819820821822823824825826827828829830831832833834835836837838839840841842843844845846847848849850851852853854855856857858859860861862863864865866867868869870871872873874875876877878879880881882883884885886887888889890891892893894895896897898899900901902903904905906907908909910911912913914915916917918919920921922923924925926927928929930931932933934935936937938939940941942943944945946947948949950951952953954955956957958959960961962963964965966967968969970971972973974975976977978979980981982983984985986987988989990991992993994995996997998999100010011002100310041005100610071008100910101011101210131014101510161017101810191020102110221023102410251026102710281029103010311032103310341035103610371038103910401041104210431044104510461047104810491050105110521053105410551056105710581059106010611062106310641065106610671068106910701071107210731074107510761077107810791080108110821083108410851086108710881089109010911092109310941095109610971098109911001101110211031104110511061107110811091110111111121113111411151116111711181119112011211122112311241125112611271128112911301131113211331134113511361137113811391140114111421143114411451146114711481149115011511152115311541155115611571158115911601161116211631164116511661167116811691170117111721173117411751176117711781179118011811182118311841185118611871188118911901191119211931194119511961197119811991200120112021203120412051206120712081209121012111212121312141215121612171218121912201221122212231224122512261227122812291230123112321233123412351236123712381239124012411242124312441245124612471248124912501251125212531254125512561257125812591260126112621263126412651266126712681269
  1. .. _whatsnew-3.1:
  2. ===========================================
  3. What's new in Celery 3.1 (Cipater)
  4. ===========================================
  5. :Author: Ask Solem (ask at celeryproject.org)
  6. .. sidebar:: Change history
  7. What's new documents describe the changes in major versions,
  8. we also have a :ref:`changelog` that lists the changes in bugfix
  9. releases (0.0.x), while older series are archived under the :ref:`history`
  10. section.
  11. Celery is a simple, flexible and reliable distributed system to
  12. process vast amounts of messages, while providing operations with
  13. the tools required to maintain such a system.
  14. It's a task queue with focus on real-time processing, while also
  15. supporting task scheduling.
  16. Celery has a large and diverse community of users and contributors,
  17. you should come join us :ref:`on IRC <irc-channel>`
  18. or :ref:`our mailing-list <mailing-list>`.
  19. To read more about Celery you should go read the :ref:`introduction <intro>`.
  20. While this version is backward compatible with previous versions
  21. it's important that you read the following section.
  22. This version is officially supported on CPython 2.6, 2.7 and 3.3,
  23. and also supported on PyPy.
  24. .. _`website`: http://celeryproject.org/
  25. .. topic:: Table of Contents
  26. Make sure you read the important notes before upgrading to this version.
  27. .. contents::
  28. :local:
  29. :depth: 2
  30. Preface
  31. =======
  32. Deadlocks have long plagued our workers, and while uncommon they are
  33. not acceptable. They are also infamous for being extremely hard to diagnose
  34. and reproduce, so to make this job easier I wrote a stress test suite that
  35. bombards the worker with different tasks in an attempt to break it.
  36. What happens if thousands of worker child processes are killed every
  37. second? what if we also kill the broker connection every 10
  38. seconds? These are examples of what the stress test suite will do to the
  39. worker, and it reruns these tests using different configuration combinations
  40. to find edge case bugs.
  41. The end result was that I had to rewrite the prefork pool to avoid the use
  42. of the POSIX semaphore. This was extremely challenging, but after
  43. months of hard work the worker now finally passes the stress test suite.
  44. There's probably more bugs to find, but the good news is
  45. that we now have a tool to reproduce them, so should you be so unlucky to
  46. experience a bug then we'll write a test for it and squash it!
  47. Note that I have also moved many broker transports into experimental status:
  48. the only transports recommended for production use today is RabbitMQ and
  49. Redis.
  50. I don't have the resources to maintain all of them, so bugs are left
  51. unresolved. I wish that someone will step up and take responsibility for
  52. these transports or donate resources to improve them, but as the situation
  53. is now I don't think the quality is up to date with the rest of the code-base
  54. so I cannot recommend them for production use.
  55. The next version of Celery 3.2 will focus on performance and removing
  56. rarely used parts of the library. Work has also started on a new message
  57. protocol, supporting multiple languages and more. The initial draft can
  58. be found :ref:`here <protov2draft>`.
  59. This has probably been the hardest release I've worked on, so no
  60. introduction to this changelog would be complete without a massive
  61. thank you to everyone who contributed and helped me test it!
  62. Thank you for your support!
  63. *— Ask Solem*
  64. .. _v310-important:
  65. Important Notes
  66. ===============
  67. Dropped support for Python 2.5
  68. ------------------------------
  69. Celery now requires Python 2.6 or later.
  70. The new dual code base runs on both Python 2 and 3, without
  71. requiring the ``2to3`` porting tool.
  72. .. note::
  73. This is also the last version to support Python 2.6! From Celery 3.2 and
  74. onwards Python 2.7 or later will be required.
  75. Last version to enable Pickle by default
  76. ----------------------------------------
  77. Starting from Celery 3.2 the default serializer will be json.
  78. If you depend on pickle being accepted you should be prepared
  79. for this change by explicitly allowing your worker
  80. to consume pickled messages using the :setting:`CELERY_ACCEPT_CONTENT`
  81. setting:
  82. .. code-block:: python
  83. CELERY_ACCEPT_CONTENT = ['pickle', 'json', 'msgpack', 'yaml']
  84. Make sure you only select the serialization formats you'll actually be using,
  85. and make sure you have properly secured your broker from unwanted access
  86. (see the :ref:`Security Guide <guide-security>`).
  87. The worker will emit a deprecation warning if you don't define this setting.
  88. .. topic:: for Kombu users
  89. Kombu 3.0 no longer accepts pickled messages by default, so if you
  90. use Kombu directly then you have to configure your consumers:
  91. see the :ref:`Kombu 3.0 Changelog <kombu:version-3.0.0>` for more
  92. information.
  93. Old command-line programs removed and deprecated
  94. ------------------------------------------------
  95. Everyone should move to the new :program:`celery` umbrella
  96. command, so we are incrementally deprecating the old command names.
  97. In this version we've removed all commands that are not used
  98. in init scripts. The rest will be removed in 3.2.
  99. +-------------------+--------------+-------------------------------------+
  100. | Program | New Status | Replacement |
  101. +===================+==============+=====================================+
  102. | ``celeryd`` | *DEPRECATED* | :program:`celery worker` |
  103. +-------------------+--------------+-------------------------------------+
  104. | ``celerybeat`` | *DEPRECATED* | :program:`celery beat` |
  105. +-------------------+--------------+-------------------------------------+
  106. | ``celeryd-multi`` | *DEPRECATED* | :program:`celery multi` |
  107. +-------------------+--------------+-------------------------------------+
  108. | ``celeryctl`` | **REMOVED** | :program:`celery inspect|control` |
  109. +-------------------+--------------+-------------------------------------+
  110. | ``celeryev`` | **REMOVED** | :program:`celery events` |
  111. +-------------------+--------------+-------------------------------------+
  112. | ``camqadm`` | **REMOVED** | :program:`celery amqp` |
  113. +-------------------+--------------+-------------------------------------+
  114. If this is not a new installation then you may want to remove the old
  115. commands:
  116. .. code-block:: bash
  117. $ pip uninstall celery
  118. $ # repeat until it fails
  119. # ...
  120. $ pip uninstall celery
  121. $ pip install celery
  122. Please run :program:`celery --help` for help using the umbrella command.
  123. .. _v310-news:
  124. News
  125. ====
  126. Prefork Pool Improvements
  127. -------------------------
  128. These improvements are only active if you use an async capable
  129. transport. This means only RabbitMQ (AMQP) and Redis are supported
  130. at this point and other transports will still use the thread-based fallback
  131. implementation.
  132. - Pool is now using one IPC queue per child process.
  133. Previously the pool shared one queue between all child processes,
  134. using a POSIX semaphore as a mutex to achieve exclusive read and write
  135. access.
  136. The POSIX semaphore has now been removed and each child process
  137. gets a dedicated queue. This means that the worker will require more
  138. file descriptors (two descriptors per process), but it also means
  139. that performance is improved and we can send work to individual child
  140. processes.
  141. POSIX semaphores are not released when a process is killed, so killing
  142. processes could lead to a deadlock if it happened while the semaphore was
  143. acquired. There is no good solution to fix this, so the best option
  144. was to remove the semaphore.
  145. - Asynchronous write operations
  146. The pool now uses async I/O to send work to the child processes.
  147. - Lost process detection is now immediate.
  148. If a child process is killed or exits mysteriously the pool previously
  149. had to wait for 30 seconds before marking the task with a
  150. :exc:`~celery.exceptions.WorkerLostError`. It had to do this because
  151. the outqueue was shared between all processes, and the pool could not
  152. be certain whether the process completed the task or not. So an arbitrary
  153. timeout of 30 seconds was chosen, as it was believed that the outqueue
  154. would have been drained by this point.
  155. This timeout is no longer necessary, and so the task can be marked as
  156. failed as soon as the pool gets the notification that the process exited.
  157. - Rare race conditions fixed
  158. Most of these bugs were never reported to us, but were discovered while
  159. running the new stress test suite.
  160. Caveats
  161. ~~~~~~~
  162. .. topic:: Long running tasks
  163. The new pool will send tasks to a child process as long as the process
  164. inqueue is writable, and since the socket is buffered this means
  165. that the processes are, in effect, prefetching tasks.
  166. This benefits performance but it also means that other tasks may be stuck
  167. waiting for a long running task to complete::
  168. -> send T1 to Process A
  169. # A executes T1
  170. -> send T2 to Process B
  171. # B executes T2
  172. <- T2 complete
  173. -> send T3 to Process A
  174. # A still executing T1, T3 stuck in local buffer and
  175. # will not start until T1 returns
  176. The buffer size varies based on the operating system: some may
  177. have a buffer as small as 64kb but on recent Linux versions the buffer
  178. size is 1MB (can only be changed system wide).
  179. You can disable this prefetching behavior by enabling the :option:`-Ofair`
  180. worker option:
  181. .. code-block:: bash
  182. $ celery -A proj worker -l info -Ofair
  183. With this option enabled the worker will only write to workers that are
  184. available for work, disabling the prefetch behavior.
  185. .. topic:: Max tasks per child
  186. If a process exits and pool prefetch is enabled the worker may have
  187. already written many tasks to the process inqueue, and these tasks
  188. must then be moved back and rewritten to a new process.
  189. This is very expensive if you have ``--maxtasksperchild`` set to a low
  190. value (e.g. less than 10), so if you need to enable this option
  191. you should also enable ``-Ofair`` to turn off the prefetching behavior.
  192. Django supported out of the box
  193. -------------------------------
  194. Celery 3.0 introduced a shiny new API, but unfortunately did not
  195. have a solution for Django users.
  196. The situation changes with this version as Django is now supported
  197. in core and new Django users coming to Celery are now expected
  198. to use the new API directly.
  199. The Django community has a convention where there's a separate
  200. django-x package for every library, acting like a bridge between
  201. Django and the library.
  202. Having a separate project for Django users has been a pain for Celery,
  203. with multiple issue trackers and multiple documentation
  204. sources, and then lastly since 3.0 we even had different APIs.
  205. With this version we challenge that convention and Django users will
  206. use the same library, the same API and the same documentation as
  207. everyone else.
  208. There is no rush to port your existing code to use the new API,
  209. but if you would like to experiment with it you should know that:
  210. - You need to use a Celery application instance.
  211. The new Celery API introduced in 3.0 requires users to instantiate the
  212. library by creating an application:
  213. .. code-block:: python
  214. from celery import Celery
  215. app = Celery()
  216. - You need to explicitly integrate Celery with Django
  217. Celery will not automatically use the Django settings, so you can
  218. either configure Celery separately or you can tell it to use the Django
  219. settings with:
  220. .. code-block:: python
  221. app.config_from_object('django.conf:settings')
  222. Neither will it automatically traverse your installed apps to find task
  223. modules. If you want this behavior, you must explictly pass a list of Django instances to the Celery app:
  224. .. code-block:: python
  225. from django.conf import settings
  226. app.autodiscover_tasks(settings.INSTALLED_APPS)
  227. - You no longer use ``manage.py``
  228. Instead you use the :program:`celery` command directly:
  229. .. code-block:: bash
  230. celery -A proj worker -l info
  231. For this to work your app module must store the :envvar:`DJANGO_SETTINGS_MODULE`
  232. environment variable, see the example in the :ref:`Django
  233. guide <django-first-steps>`.
  234. To get started with the new API you should first read the :ref:`first-steps`
  235. tutorial, and then you should read the Django-specific instructions in
  236. :ref:`django-first-steps`.
  237. The fixes and improvements applied by the django-celery library are now
  238. automatically applied by core Celery when it detects that
  239. the :envvar:`DJANGO_SETTINGS_MODULE` environment variable is set.
  240. The distribution ships with a new example project using Django
  241. in :file:`examples/django`:
  242. http://github.com/celery/celery/tree/3.1/examples/django
  243. Some features still require the :mod:`django-celery` library:
  244. - Celery does not implement the Django database or cache result backends.
  245. - Celery does not ship with the database-based periodic task
  246. scheduler.
  247. .. note::
  248. If you're still using the old API when you upgrade to Celery 3.1
  249. then you must make sure that your settings module contains
  250. the ``djcelery.setup_loader()`` line, since this will
  251. no longer happen as a side-effect of importing the :mod:`djcelery`
  252. module.
  253. New users (or if you have ported to the new API) don't need the ``setup_loader``
  254. line anymore, and must make sure to remove it.
  255. Events are now ordered using logical time
  256. -----------------------------------------
  257. Keeping physical clocks in perfect sync is impossible, so using
  258. timestamps to order events in a distributed system is not reliable.
  259. Celery event messages have included a logical clock value for some time,
  260. but starting with this version that field is also used to order them.
  261. Also, events now record timezone information
  262. by including a new ``utcoffset`` field in the event message.
  263. This is a signed integer telling the difference from UTC time in hours,
  264. so e.g. an event sent from the Europe/London timezone in daylight savings
  265. time will have an offset of 1.
  266. :class:`@events.Receiver` will automatically convert the timestamps
  267. to the local timezone.
  268. .. note::
  269. The logical clock is synchronized with other nodes
  270. in the same cluster (neighbors), so this means that the logical
  271. epoch will start at the point when the first worker in the cluster
  272. starts.
  273. If all of the workers are shutdown the clock value will be lost
  274. and reset to 0. To protect against this, you should specify
  275. :option:`--statedb` so that the worker can persist the clock
  276. value at shutdown.
  277. You may notice that the logical clock is an integer value and
  278. increases very rapidly. Do not worry about the value overflowing
  279. though, as even in the most busy clusters it may take several
  280. millennia before the clock exceeds a 64 bits value.
  281. New worker node name format (``name@host``)
  282. -------------------------------------------
  283. Node names are now constructed by two elements: name and hostname separated by '@'.
  284. This change was made to more easily identify multiple instances running
  285. on the same machine.
  286. If a custom name is not specified then the
  287. worker will use the name 'celery' by default, resulting in a
  288. fully qualified node name of 'celery@hostname':
  289. .. code-block:: bash
  290. $ celery worker -n example.com
  291. celery@example.com
  292. To also set the name you must include the @:
  293. .. code-block:: bash
  294. $ celery worker -n worker1@example.com
  295. worker1@example.com
  296. The worker will identify itself using the fully qualified
  297. node name in events and broadcast messages, so where before
  298. a worker would identify itself as 'worker1.example.com', it will now
  299. use 'celery@worker1.example.com'.
  300. Remember that the ``-n`` argument also supports simple variable
  301. substitutions, so if the current hostname is *george.example.com*
  302. then the ``%h`` macro will expand into that:
  303. .. code-block:: bash
  304. $ celery worker -n worker1@%h
  305. worker1@george.example.com
  306. The available substitutions are as follows:
  307. +---------------+---------------------------------------+
  308. | Variable | Substitution |
  309. +===============+=======================================+
  310. | ``%h`` | Full hostname (including domain name) |
  311. +---------------+---------------------------------------+
  312. | ``%d`` | Domain name only |
  313. +---------------+---------------------------------------+
  314. | ``%n`` | Hostname only (without domain name) |
  315. +---------------+---------------------------------------+
  316. | ``%%`` | The character ``%`` |
  317. +---------------+---------------------------------------+
  318. Bound tasks
  319. -----------
  320. The task decorator can now create "bound tasks", which means that the
  321. task will receive the ``self`` argument.
  322. .. code-block:: python
  323. @app.task(bind=True)
  324. def send_twitter_status(self, oauth, tweet):
  325. try:
  326. twitter = Twitter(oauth)
  327. twitter.update_status(tweet)
  328. except (Twitter.FailWhaleError, Twitter.LoginError) as exc:
  329. raise self.retry(exc=exc)
  330. Using *bound tasks* is now the recommended approach whenever
  331. you need access to the task instance or request context.
  332. Previously one would have to refer to the name of the task
  333. instead (``send_twitter_status.retry``), but this could lead to problems
  334. in some configurations.
  335. Mingle: Worker synchronization
  336. ------------------------------
  337. The worker will now attempt to synchronize with other workers in
  338. the same cluster.
  339. Synchronized data currently includes revoked tasks and logical clock.
  340. This only happens at startup and causes a one second startup delay
  341. to collect broadcast responses from other workers.
  342. You can disable this bootstep using the ``--without-mingle`` argument.
  343. Gossip: Worker <-> Worker communication
  344. ---------------------------------------
  345. Workers are now passively subscribing to worker related events like
  346. heartbeats.
  347. This means that a worker knows what other workers are doing and
  348. can detect if they go offline. Currently this is only used for clock
  349. synchronization, but there are many possibilities for future additions
  350. and you can write extensions that take advantage of this already.
  351. Some ideas include consensus protocols, reroute task to best worker (based on
  352. resource usage or data locality) or restarting workers when they crash.
  353. We believe that although this is a small addition, it opens
  354. amazing possibilities.
  355. You can disable this bootstep using the ``--without-gossip`` argument.
  356. Bootsteps: Extending the worker
  357. -------------------------------
  358. By writing bootsteps you can now easily extend the consumer part
  359. of the worker to add additional features, like custom message consumers.
  360. The worker has been using bootsteps for some time, but these were never
  361. documented. In this version the consumer part of the worker
  362. has also been rewritten to use bootsteps and the new :ref:`guide-extending`
  363. guide documents examples extending the worker, including adding
  364. custom message consumers.
  365. See the :ref:`guide-extending` guide for more information.
  366. .. note::
  367. Bootsteps written for older versions will not be compatible
  368. with this version, as the API has changed significantly.
  369. The old API was experimental and internal but should you be so unlucky
  370. to use it then please contact the mailing-list and we will help you port
  371. the bootstep to the new API.
  372. New RPC result backend
  373. ----------------------
  374. This new experimental version of the ``amqp`` result backend is a good
  375. alternative to use in classical RPC scenarios, where the process that initiates
  376. the task is always the process to retrieve the result.
  377. It uses Kombu to send and retrieve results, and each client
  378. uses a unique queue for replies to be sent to. This avoids
  379. the significant overhead of the original amqp result backend which creates
  380. one queue per task.
  381. By default results sent using this backend will not persist, so they won't
  382. survive a broker restart. You can enable
  383. the :setting:`CELERY_RESULT_PERSISTENT` setting to change that.
  384. .. code-block:: python
  385. CELERY_RESULT_BACKEND = 'rpc'
  386. CELERY_RESULT_PERSISTENT = True
  387. Note that chords are currently not supported by the RPC backend.
  388. Time limits can now be set by the client
  389. ----------------------------------------
  390. Two new options have been added to the Calling API: ``time_limit`` and
  391. ``soft_time_limit``:
  392. .. code-block:: python
  393. >>> res = add.apply_async((2, 2), time_limit=10, soft_time_limit=8)
  394. >>> res = add.subtask((2, 2), time_limit=10, soft_time_limit=8).delay()
  395. >>> res = add.s(2, 2).set(time_limit=10, soft_time_limit=8).delay()
  396. Contributed by Mher Movsisyan.
  397. Redis: Broadcast messages and virtual hosts
  398. -------------------------------------------
  399. Broadcast messages are currently seen by all virtual hosts when
  400. using the Redis transport. You can now fix this by enabling a prefix to all channels
  401. so that the messages are separated:
  402. .. code-block:: python
  403. BROKER_TRANSPORT_OPTIONS = {'fanout_prefix': True}
  404. Note that you'll not be able to communicate with workers running older
  405. versions or workers that does not have this setting enabled.
  406. This setting will be the default in a future version.
  407. Related to Issue #1490.
  408. :mod:`pytz` replaces ``python-dateutil`` dependency
  409. ---------------------------------------------------
  410. Celery no longer depends on the ``python-dateutil`` library,
  411. but instead a new dependency on the :mod:`pytz` library was added.
  412. The :mod:`pytz` library was already recommended for accurate timezone support.
  413. This also means that dependencies are the same for both Python 2 and
  414. Python 3, and that the :file:`requirements/default-py3k.txt` file has
  415. been removed.
  416. Support for Setuptools extra requirements
  417. -----------------------------------------
  418. Pip now supports the :mod:`setuptools` extra requirements format,
  419. so we have removed the old bundles concept, and instead specify
  420. setuptools extras.
  421. You install extras by specifying them inside brackets:
  422. .. code-block:: bash
  423. $ pip install celery[redis,mongodb]
  424. The above will install the dependencies for Redis and MongoDB. You can list
  425. as many extras as you want.
  426. .. warning::
  427. You can't use the ``celery-with-*`` packages anymore, as these will not be
  428. updated to use Celery 3.1.
  429. +-------------+-------------------------+---------------------------+
  430. | Extension | Requirement entry | Type |
  431. +=============+=========================+===========================+
  432. | Redis | ``celery[redis]`` | transport, result backend |
  433. +-------------+-------------------------+---------------------------+
  434. | MongoDB | ``celery[mongodb]`` | transport, result backend |
  435. +-------------+-------------------------+---------------------------+
  436. | CouchDB | ``celery[couchdb]`` | transport |
  437. +-------------+-------------------------+---------------------------+
  438. | Beanstalk | ``celery[beanstalk]`` | transport |
  439. +-------------+-------------------------+---------------------------+
  440. | ZeroMQ | ``celery[zeromq]`` | transport |
  441. +-------------+-------------------------+---------------------------+
  442. | Zookeeper | ``celery[zookeeper]`` | transport |
  443. +-------------+-------------------------+---------------------------+
  444. | SQLAlchemy | ``celery[sqlalchemy]`` | transport, result backend |
  445. +-------------+-------------------------+---------------------------+
  446. | librabbitmq | ``celery[librabbitmq]`` | transport (C amqp client) |
  447. +-------------+-------------------------+---------------------------+
  448. The complete list with examples is found in the :ref:`bundles` section.
  449. ``subtask.__call__()`` now executes the task directly
  450. -----------------------------------------------------
  451. A misunderstanding led to ``Signature.__call__`` being an alias of
  452. ``.delay`` but this does not conform to the calling API of ``Task`` which
  453. calls the underlying task method.
  454. This means that:
  455. .. code-block:: python
  456. @app.task
  457. def add(x, y):
  458. return x + y
  459. add.s(2, 2)()
  460. now does the same as calling the task directly:
  461. .. code-block:: python
  462. add(2, 2)
  463. In Other News
  464. -------------
  465. - Now depends on :ref:`Kombu 3.0 <kombu:version-3.0.0>`.
  466. - Now depends on :mod:`billiard` version 3.3.
  467. - Worker will now crash if running as the root user with pickle enabled.
  468. - Canvas: ``group.apply_async`` and ``chain.apply_async`` no longer starts
  469. separate task.
  470. That the group and chord primitives supported the "calling API" like other
  471. subtasks was a nice idea, but it was useless in practice and often
  472. confused users. If you still want this behavior you can define a
  473. task to do it for you.
  474. - New method ``Signature.freeze()`` can be used to "finalize"
  475. signatures/subtask.
  476. Regular signature:
  477. .. code-block:: python
  478. >>> s = add.s(2, 2)
  479. >>> result = s.freeze()
  480. >>> result
  481. <AsyncResult: ffacf44b-f8a1-44e9-80a3-703150151ef2>
  482. >>> s.delay()
  483. <AsyncResult: ffacf44b-f8a1-44e9-80a3-703150151ef2>
  484. Group:
  485. .. code-block:: python
  486. >>> g = group(add.s(2, 2), add.s(4, 4))
  487. >>> result = g.freeze()
  488. <GroupResult: e1094b1d-08fc-4e14-838e-6d601b99da6d [
  489. 70c0fb3d-b60e-4b22-8df7-aa25b9abc86d,
  490. 58fcd260-2e32-4308-a2ea-f5be4a24f7f4]>
  491. >>> g()
  492. <GroupResult: e1094b1d-08fc-4e14-838e-6d601b99da6d [70c0fb3d-b60e-4b22-8df7-aa25b9abc86d, 58fcd260-2e32-4308-a2ea-f5be4a24f7f4]>
  493. - Chord exception behavior defined (Issue #1172).
  494. From this version the chord callback will change state to FAILURE
  495. when a task part of a chord raises an exception.
  496. See more at :ref:`chord-errors`.
  497. - New ability to specify additional command line options
  498. to the worker and beat programs.
  499. The :attr:`@user_options` attribute can be used
  500. to add additional command-line arguments, and expects
  501. optparse-style options:
  502. .. code-block:: python
  503. from celery import Celery
  504. from celery.bin import Option
  505. app = Celery()
  506. app.user_options['worker'].add(
  507. Option('--my-argument'),
  508. )
  509. See the :ref:`guide-extending` guide for more information.
  510. - All events now include a ``pid`` field, which is the process id of the
  511. process that sent the event.
  512. - Event heartbeats are now calculated based on the time when the event
  513. was received by the monitor, and not the time reported by the worker.
  514. This means that a worker with an out-of-sync clock will no longer
  515. show as 'Offline' in monitors.
  516. A warning is now emitted if the difference between the senders
  517. time and the internal time is greater than 15 seconds, suggesting
  518. that the clocks are out of sync.
  519. - Monotonic clock support.
  520. A monotonic clock is now used for timeouts and scheduling.
  521. The monotonic clock function is built-in starting from Python 3.4,
  522. but we also have fallback implementations for Linux and OS X.
  523. - :program:`celery worker` now supports a ``--detach`` argument to start
  524. the worker as a daemon in the background.
  525. - :class:`@events.Receiver` now sets a ``local_received`` field for incoming
  526. events, which is set to the time of when the event was received.
  527. - :class:`@events.Dispatcher` now accepts a ``groups`` argument
  528. which decides a white-list of event groups that will be sent.
  529. The type of an event is a string separated by '-', where the part
  530. before the first '-' is the group. Currently there are only
  531. two groups: ``worker`` and ``task``.
  532. A dispatcher instantiated as follows:
  533. .. code-block:: python
  534. app.events.Dispatcher(connection, groups=['worker'])
  535. will only send worker related events and silently drop any attempts
  536. to send events related to any other group.
  537. - New :setting:`BROKER_FAILOVER_STRATEGY` setting.
  538. This setting can be used to change the transport failover strategy,
  539. can either be a callable returning an iterable or the name of a
  540. Kombu built-in failover strategy. Default is "round-robin".
  541. Contributed by Matt Wise.
  542. - ``Result.revoke`` will no longer wait for replies.
  543. You can add the ``reply=True`` argument if you really want to wait for
  544. responses from the workers.
  545. - Better support for link and link_error tasks for chords.
  546. Contributed by Steeve Morin.
  547. - Worker: Now emits warning if the :setting:`CELERYD_POOL` setting is set
  548. to enable the eventlet/gevent pools.
  549. The `-P` option should always be used to select the eventlet/gevent pool
  550. to ensure that the patches are applied as early as possible.
  551. If you start the worker in a wrapper (like Django's manage.py)
  552. then you must apply the patches manually, e.g. by creating an alternative
  553. wrapper that monkey patches at the start of the program before importing
  554. any other modules.
  555. - There's a now an 'inspect clock' command which will collect the current
  556. logical clock value from workers.
  557. - `celery inspect stats` now contains the process id of the worker's main
  558. process.
  559. Contributed by Mher Movsisyan.
  560. - New remote control command to dump a workers configuration.
  561. Example:
  562. .. code-block:: bash
  563. $ celery inspect conf
  564. Configuration values will be converted to values supported by JSON
  565. where possible.
  566. Contributed by Mher Movisyan.
  567. - New settings :setting:`CELERY_EVENT_QUEUE_TTL` and
  568. :setting:`CELERY_EVENT_QUEUE_EXPIRES`.
  569. These control when a monitors event queue is deleted, and for how long
  570. events published to that queue will be visible. Only supported on
  571. RabbitMQ.
  572. - New Couchbase result backend.
  573. This result backend enables you to store and retrieve task results
  574. using `Couchbase`_.
  575. See :ref:`conf-couchbase-result-backend` for more information
  576. about configuring this result backend.
  577. Contributed by Alain Masiero.
  578. .. _`Couchbase`: http://www.couchbase.com
  579. - CentOS init script now supports starting multiple worker instances.
  580. See the script header for details.
  581. Contributed by Jonathan Jordan.
  582. - ``AsyncResult.iter_native`` now sets default interval parameter to 0.5
  583. Fix contributed by Idan Kamara
  584. - New setting :setting:`BROKER_LOGIN_METHOD`.
  585. This setting can be used to specify an alternate login method
  586. for the AMQP transports.
  587. Contributed by Adrien Guinet
  588. - The ``dump_conf`` remote control command will now give the string
  589. representation for types that are not JSON compatible.
  590. - Function `celery.security.setup_security` is now :func:`@setup_security`.
  591. - Task retry now propagates the message expiry value (Issue #980).
  592. The value is forwarded at is, so the expiry time will not change.
  593. To update the expiry time you would have to pass a new expires
  594. argument to ``retry()``.
  595. - Worker now crashes if a channel error occurs.
  596. Channel errors are transport specific and is the list of exceptions
  597. returned by ``Connection.channel_errors``.
  598. For RabbitMQ this means that Celery will crash if the equivalence
  599. checks for one of the queues in :setting:`CELERY_QUEUES` mismatches, which
  600. makes sense since this is a scenario where manual intervention is
  601. required.
  602. - Calling ``AsyncResult.get()`` on a chain now propagates errors for previous
  603. tasks (Issue #1014).
  604. - The parent attribute of ``AsyncResult`` is now reconstructed when using JSON
  605. serialization (Issue #1014).
  606. - Worker disconnection logs are now logged with severity warning instead of
  607. error.
  608. Contributed by Chris Adams.
  609. - ``events.State`` no longer crashes when it receives unknown event types.
  610. - SQLAlchemy Result Backend: New :setting:`CELERY_RESULT_DB_TABLENAMES`
  611. setting can be used to change the name of the database tables used.
  612. Contributed by Ryan Petrello.
  613. - SQLAlchemy Result Backend: Now calls ``enginge.dispose`` after fork
  614. (Issue #1564).
  615. If you create your own sqlalchemy engines then you must also
  616. make sure that these are closed after fork in the worker:
  617. .. code-block:: python
  618. from multiprocessing.util import register_after_fork
  619. engine = create_engine(…)
  620. register_after_fork(engine, engine.dispose)
  621. - A stress test suite for the Celery worker has been written.
  622. This is located in the ``funtests/stress`` directory in the git
  623. repository. There's a README file there to get you started.
  624. - The logger named ``celery.concurrency`` has been renamed to ``celery.pool``.
  625. - New command line utility ``celery graph``.
  626. This utility creates graphs in GraphViz dot format.
  627. You can create graphs from the currently installed bootsteps:
  628. .. code-block:: bash
  629. # Create graph of currently installed bootsteps in both the worker
  630. # and consumer namespaces.
  631. $ celery graph bootsteps | dot -T png -o steps.png
  632. # Graph of the consumer namespace only.
  633. $ celery graph bootsteps consumer | dot -T png -o consumer_only.png
  634. # Graph of the worker namespace only.
  635. $ celery graph bootsteps worker | dot -T png -o worker_only.png
  636. Or graphs of workers in a cluster:
  637. .. code-block:: bash
  638. # Create graph from the current cluster
  639. $ celery graph workers | dot -T png -o workers.png
  640. # Create graph from a specified list of workers
  641. $ celery graph workers nodes:w1,w2,w3 | dot -T png workers.png
  642. # also specify the number of threads in each worker
  643. $ celery graph workers nodes:w1,w2,w3 threads:2,4,6
  644. # …also specify the broker and backend URLs shown in the graph
  645. $ celery graph workers broker:amqp:// backend:redis://
  646. # …also specify the max number of workers/threads shown (wmax/tmax),
  647. # enumerating anything that exceeds that number.
  648. $ celery graph workers wmax:10 tmax:3
  649. - Changed the way that app instances are pickled.
  650. Apps can now define a ``__reduce_keys__`` method that is used instead
  651. of the old ``AppPickler`` attribute. E.g. if your app defines a custom
  652. 'foo' attribute that needs to be preserved when pickling you can define
  653. a ``__reduce_keys__`` as such:
  654. .. code-block:: python
  655. import celery
  656. class Celery(celery.Celery):
  657. def __init__(self, *args, **kwargs):
  658. super(Celery, self).__init__(*args, **kwargs)
  659. self.foo = kwargs.get('foo')
  660. def __reduce_keys__(self):
  661. return super(Celery, self).__reduce_keys__().update(
  662. foo=self.foo,
  663. )
  664. This is a much more convenient way to add support for pickling custom
  665. attributes. The old ``AppPickler`` is still supported but its use is
  666. discouraged and we would like to remove it in a future version.
  667. - Ability to trace imports for debugging purposes.
  668. The :envvar:`C_IMPDEBUG` can be set to trace imports as they
  669. occur:
  670. .. code-block:: bash
  671. $ C_IMDEBUG=1 celery worker -l info
  672. .. code-block:: bash
  673. $ C_IMPDEBUG=1 celery shell
  674. - Message headers now available as part of the task request.
  675. Example adding and retrieving a header value:
  676. .. code-block:: python
  677. @app.task(bind=True)
  678. def t(self):
  679. return self.request.headers.get('sender')
  680. >>> t.apply_async(headers={'sender': 'George Costanza'})
  681. - New :signal:`before_task_publish` signal dispatched before a task message
  682. is sent and can be used to modify the final message fields (Issue #1281).
  683. - New :signal:`after_task_publish` signal replaces the old :signal:`task_sent`
  684. signal.
  685. The :signal:`task_sent` signal is now deprecated and should not be used.
  686. - New :signal:`worker_process_shutdown` signal is dispatched in the
  687. prefork pool child processes as they exit.
  688. Contributed by Daniel M Taub.
  689. - ``celery.platforms.PIDFile`` renamed to :class:`celery.platforms.Pidfile`.
  690. - MongoDB Backend: Can now be configured using an URL:
  691. See :ref:`example-mongodb-result-config`.
  692. - MongoDB Backend: No longer using deprecated ``pymongo.Connection``.
  693. - MongoDB Backend: Now disables ``auto_start_request``.
  694. - MongoDB Backend: Now enables ``use_greenlets`` when eventlet/gevent is used.
  695. - ``subtask()`` / ``maybe_subtask()`` renamed to
  696. ``signature()``/``maybe_signature()``.
  697. Aliases still available for backwards compatibility.
  698. - The ``correlation_id`` message property is now automatically set to the
  699. id of the task.
  700. - The task message ``eta`` and ``expires`` fields now includes timezone
  701. information.
  702. - All result backends ``store_result``/``mark_as_*`` methods must now accept
  703. a ``request`` keyword argument.
  704. - Events now emit warning if the broken ``yajl`` library is used.
  705. - The :signal:`celeryd_init` signal now takes an extra keyword argument:
  706. ``option``.
  707. This is the mapping of parsed command line arguments, and can be used to
  708. prepare new preload arguments (``app.user_options['preload']``).
  709. - New callback: :meth:`@on_configure`.
  710. This callback is called when an app is about to be configured (a
  711. configuration key is required).
  712. - Worker: No longer forks on :sig:`HUP`.
  713. This means that the worker will reuse the same pid for better
  714. support with external process supervisors.
  715. Contributed by Jameel Al-Aziz.
  716. - Worker: The log message ``Got task from broker …`` was changed to
  717. ``Received task …``.
  718. - Worker: The log message ``Skipping revoked task …`` was changed
  719. to ``Discarding revoked task …``.
  720. - Optimization: Improved performance of ``ResultSet.join_native()``.
  721. Contributed by Stas Rudakou.
  722. - The :signal:`task_revoked` signal now accepts new ``request`` argument
  723. (Issue #1555).
  724. The revoked signal is dispatched after the task request is removed from
  725. the stack, so it must instead use the :class:`~celery.worker.job.Request`
  726. object to get information about the task.
  727. - Worker: New :option:`-X` command line argument to exclude queues
  728. (Issue #1399).
  729. The :option:`-X` argument is the inverse of the :option:`-Q` argument
  730. and accepts a list of queues to exclude (not consume from):
  731. .. code-block:: bash
  732. # Consume from all queues in CELERY_QUEUES, but not the 'foo' queue.
  733. $ celery worker -A proj -l info -X foo
  734. - Adds :envvar:`C_FAKEFORK` envvar for simple init script/multi debugging.
  735. This means that you can now do:
  736. .. code-block:: bash
  737. $ C_FAKEFORK=1 celery multi start 10
  738. or:
  739. .. code-block:: bash
  740. $ C_FAKEFORK=1 /etc/init.d/celeryd start
  741. to avoid the daemonization step to see errors that are not visible
  742. due to missing stdout/stderr.
  743. A ``dryrun`` command has been added to the generic init script that
  744. enables this option.
  745. - New public API to push and pop from the current task stack:
  746. :func:`celery.app.push_current_task` and
  747. :func:`celery.app.pop_current_task``.
  748. - ``RetryTaskError`` has been renamed to :exc:`~celery.exceptions.Retry`.
  749. The old name is still available for backwards compatibility.
  750. - New semi-predicate exception :exc:`~celery.exceptions.Reject`.
  751. This exception can be raised to ``reject``/``requeue`` the task message,
  752. see :ref:`task-semipred-reject` for examples.
  753. - :ref:`Semipredicates <task-semipredicates>` documented: (Retry/Ignore/Reject).
  754. .. _v310-removals:
  755. Scheduled Removals
  756. ==================
  757. - The ``BROKER_INSIST`` setting and the ``insist`` argument
  758. to ``~@connection`` is no longer supported.
  759. - The ``CELERY_AMQP_TASK_RESULT_CONNECTION_MAX`` setting is no longer
  760. supported.
  761. Use :setting:`BROKER_POOL_LIMIT` instead.
  762. - The ``CELERY_TASK_ERROR_WHITELIST`` setting is no longer supported.
  763. You should set the :class:`~celery.utils.mail.ErrorMail` attribute
  764. of the task class instead. You can also do this using
  765. :setting:`CELERY_ANNOTATIONS`:
  766. .. code-block:: python
  767. from celery import Celery
  768. from celery.utils.mail import ErrorMail
  769. class MyErrorMail(ErrorMail):
  770. whitelist = (KeyError, ImportError)
  771. def should_send(self, context, exc):
  772. return isinstance(exc, self.whitelist)
  773. app = Celery()
  774. app.conf.CELERY_ANNOTATIONS = {
  775. '*': {
  776. 'ErrorMail': MyErrorMails,
  777. }
  778. }
  779. - Functions that creates a broker connections no longer
  780. supports the ``connect_timeout`` argument.
  781. This can now only be set using the :setting:`BROKER_CONNECTION_TIMEOUT`
  782. setting. This is because functions no longer create connections
  783. directly, but instead get them from the connection pool.
  784. - The ``CELERY_AMQP_TASK_RESULT_EXPIRES`` setting is no longer supported.
  785. Use :setting:`CELERY_TASK_RESULT_EXPIRES` instead.
  786. .. _v310-deprecations:
  787. Deprecations
  788. ============
  789. See the :ref:`deprecation-timeline`.
  790. .. _v310-fixes:
  791. Fixes
  792. =====
  793. - AMQP Backend: join did not convert exceptions when using the json
  794. serializer.
  795. - Non-abstract task classes are now shared between apps (Issue #1150).
  796. Note that non-abstract task classes should not be used in the
  797. new API. You should only create custom task classes when you
  798. use them as a base class in the ``@task`` decorator.
  799. This fix ensure backwards compatibility with older Celery versions
  800. so that non-abstract task classes works even if a module is imported
  801. multiple times so that the app is also instantiated multiple times.
  802. - Worker: Workaround for Unicode errors in logs (Issue #427).
  803. - Task methods: ``.apply_async`` now works properly if args list is None
  804. (Issue #1459).
  805. - Eventlet/gevent/solo/threads pools now properly handles :exc:`BaseException`
  806. errors raised by tasks.
  807. - :control:`autoscale` and :control:`pool_grow`/:control:`pool_shrink` remote
  808. control commands will now also automatically increase and decrease the
  809. consumer prefetch count.
  810. Fix contributed by Daniel M. Taub.
  811. - ``celery control pool_`` commands did not coerce string arguments to int.
  812. - Redis/Cache chords: Callback result is now set to failure if the group
  813. disappeared from the database (Issue #1094).
  814. - Worker: Now makes sure that the shutdown process is not initiated multiple
  815. times.
  816. - Multi: Now properly handles both ``-f`` and ``--logfile`` options
  817. (Issue #1541).
  818. .. _v310-internal:
  819. Internal changes
  820. ================
  821. - Module ``celery.task.trace`` has been renamed to :mod:`celery.app.trace`.
  822. - Module ``celery.concurrency.processes`` has been renamed to
  823. :mod:`celery.concurrency.prefork`.
  824. - Classes that no longer fall back to using the default app:
  825. - Result backends (:class:`celery.backends.base.BaseBackend`)
  826. - :class:`celery.worker.WorkController`
  827. - :class:`celery.worker.Consumer`
  828. - :class:`celery.worker.job.Request`
  829. This means that you have to pass a specific app when instantiating
  830. these classes.
  831. - ``EventDispatcher.copy_buffer`` renamed to
  832. :meth:`@events.Dispatcher.extend_buffer`.
  833. - Removed unused and never documented global instance
  834. ``celery.events.state.state``.
  835. - :class:`@events.Receiver` is now a :class:`kombu.mixins.ConsumerMixin`
  836. subclass.
  837. - :class:`celery.apps.worker.Worker` has been refactored as a subclass of
  838. :class:`celery.worker.WorkController`.
  839. This removes a lot of duplicate functionality.
  840. - The ``Celery.with_default_connection`` method has been removed in favor
  841. of ``with app.connection_or_acquire`` (:meth:`@connection_or_acquire`)
  842. - The ``celery.results.BaseDictBackend`` class has been removed and is replaced by
  843. :class:`celery.results.BaseBackend`.