introduction.txt 13 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309
  1. :Version: 2.6.0rc4
  2. :Web: http://celeryproject.org/
  3. :Download: http://pypi.python.org/pypi/celery/
  4. :Source: http://github.com/celery/celery/
  5. :Keywords: task queue, job queue, asynchronous, rabbitmq, amqp, redis,
  6. python, webhooks, queue, distributed
  7. --
  8. .. contents::
  9. :local:
  10. .. _celery-synopsis:
  11. Synopsis
  12. ========
  13. Celery is an open source asynchronous task queue/job queue based on
  14. distributed message passing. Focused on real-time operation,
  15. but supports scheduling as well.
  16. The execution units, called tasks, are executed concurrently on one or
  17. more worker nodes using multiprocessing, `Eventlet`_ or `gevent`_. Tasks can
  18. execute asynchronously (in the background) or synchronously
  19. (wait until ready).
  20. Celery is used in production systems to process millions of tasks every hour.
  21. Celery is written in Python, but the protocol can be implemented in any
  22. language. It can also `operate with other languages using webhooks`_.
  23. There's also `RCelery` for the Ruby programming language, and a `PHP client`.
  24. The recommended message broker is `RabbitMQ`_, but support for
  25. `Redis`_, `MongoDB`_, `Beanstalk`_, `Amazon SQS`_, `CouchDB`_ and
  26. databases (using `SQLAlchemy`_ or the `Django ORM`_) is also available.
  27. Celery is easy to integrate with web frameworks, some of which even have
  28. integration packages:
  29. +--------------------+------------------------+
  30. | `Django`_ | `django-celery`_ |
  31. +--------------------+------------------------+
  32. | `Pyramid`_ | `pyramid_celery`_ |
  33. +--------------------+------------------------+
  34. | `Pylons`_ | `celery-pylons`_ |
  35. +--------------------+------------------------+
  36. | `Flask`_ | `flask-celery`_ |
  37. +--------------------+------------------------+
  38. | `web2py`_ | `web2py-celery`_ |
  39. +--------------------+------------------------+
  40. | `Tornado`_ | `tornado-celery`_ |
  41. +--------------------+------------------------+
  42. .. _`RCelery`: http://leapfrogdevelopment.github.com/rcelery/
  43. .. _`PHP client`: https://github.com/gjedeer/celery-php
  44. .. _`RabbitMQ`: http://www.rabbitmq.com/
  45. .. _`Redis`: http://code.google.com/p/redis/
  46. .. _`SQLAlchemy`: http://www.sqlalchemy.org/
  47. .. _`Django`: http://djangoproject.com/
  48. .. _`Django ORM`: http://djangoproject.com/
  49. .. _`Memcached`: http://memcached.org/
  50. .. _`Eventlet`: http://eventlet.net/
  51. .. _`gevent`: http://gevent.org/
  52. .. _`Beanstalk`: http://kr.github.com/beanstalkd/
  53. .. _`MongoDB`: http://mongodb.org/
  54. .. _`CouchDB`: http://couchdb.apache.org/
  55. .. _`Amazon SQS`: http://aws.amazon.com/sqs/
  56. .. _`Pylons`: http://pylonshq.com/
  57. .. _`Flask`: http://flask.pocoo.org/
  58. .. _`web2py`: http://web2py.com/
  59. .. _`Bottle`: http://bottlepy.org/
  60. .. _`Pyramid`: http://docs.pylonsproject.org/en/latest/docs/pyramid.html
  61. .. _`pyramid_celery`: http://pypi.python.org/pypi/pyramid_celery/
  62. .. _`django-celery`: http://pypi.python.org/pypi/django-celery
  63. .. _`celery-pylons`: http://pypi.python.org/pypi/celery-pylons
  64. .. _`flask-celery`: http://github.com/ask/flask-celery/
  65. .. _`web2py-celery`: http://code.google.com/p/web2py-celery/
  66. .. _`Tornado`: http://www.tornadoweb.org/
  67. .. _`tornado-celery`: http://github.com/mher/tornado-celery/
  68. .. _`operate with other languages using webhooks`:
  69. http://celery.github.com/celery/userguide/remote-tasks.html
  70. .. _`limited support`:
  71. http://kombu.readthedocs.org/en/latest/introduction.html#transport-comparison
  72. .. _celery-overview:
  73. Overview
  74. ========
  75. This is a high level overview of the architecture.
  76. .. image:: http://cloud.github.com/downloads/ask/celery/Celery-Overview-v4.jpg
  77. The broker delivers tasks to the worker nodes.
  78. A worker node is a networked machine running `celeryd`. This can be one or
  79. more machines depending on the workload.
  80. The result of the task can be stored for later retrieval (called its
  81. "tombstone").
  82. .. _celery-example:
  83. Example
  84. =======
  85. You probably want to see some code by now, so here's an example task
  86. adding two numbers:
  87. .. code-block:: python
  88. from celery import task
  89. @task()
  90. def add(x, y):
  91. return x + y
  92. You can execute the task in the background, or wait for it to finish::
  93. >>> result = add.delay(4, 4)
  94. >>> result.wait() # wait for and return the result
  95. 8
  96. Simple!
  97. .. _celery-features:
  98. Features
  99. ========
  100. +-----------------+----------------------------------------------------+
  101. | Messaging | Supported brokers include `RabbitMQ`_, `Redis`_, |
  102. | | `MongoDB`_, `Beanstalk`_, SQL databases, |
  103. | | Amazon SQS and more. |
  104. +-----------------+----------------------------------------------------+
  105. | Fault-tolerant | Excellent configurable error recovery when using |
  106. | | `RabbitMQ`, ensures your tasks are never lost. |
  107. +-----------------+----------------------------------------------------+
  108. | Distributed | Runs on one or more machines. Supports |
  109. | | broker `clustering`_ and `HA`_ when used in |
  110. | | combination with `RabbitMQ`_. You can set up new |
  111. | | workers without central configuration (e.g. use |
  112. | | your grandma's laptop to help if the queue is |
  113. | | temporarily congested). |
  114. +-----------------+----------------------------------------------------+
  115. | Concurrency | Concurrency is achieved by using multiprocessing, |
  116. | | `Eventlet`_, `gevent` or a mix of these. |
  117. +-----------------+----------------------------------------------------+
  118. | Scheduling | Supports recurring tasks like cron, or specifying |
  119. | | an exact date or countdown for when after the task |
  120. | | should be executed. |
  121. +-----------------+----------------------------------------------------+
  122. | Latency | Low latency means you are able to execute tasks |
  123. | | *while the user is waiting*. |
  124. +-----------------+----------------------------------------------------+
  125. | Return Values | Task return values can be saved to the selected |
  126. | | result store backend. You can wait for the result, |
  127. | | retrieve it later, or ignore it. |
  128. +-----------------+----------------------------------------------------+
  129. | Result Stores | Database, `MongoDB`_, `Redis`_, `Memcached`_, |
  130. | | `Cassandra`, or `AMQP`_ (message notification). |
  131. +-----------------+----------------------------------------------------+
  132. | Webhooks | Your tasks can also be HTTP callbacks, enabling |
  133. | | cross-language communication. |
  134. +-----------------+----------------------------------------------------+
  135. | Rate limiting | Supports rate limiting by using the token bucket |
  136. | | algorithm, which accounts for bursts of traffic. |
  137. | | Rate limits can be set for each task type, or |
  138. | | globally for all. |
  139. +-----------------+----------------------------------------------------+
  140. | Routing | Using AMQP's flexible routing model you can route |
  141. | | tasks to different workers, or select different |
  142. | | message topologies, by configuration or even at |
  143. | | runtime. |
  144. +-----------------+----------------------------------------------------+
  145. | Remote-control | Worker nodes can be controlled from remote by |
  146. | | using broadcast messaging. A range of built-in |
  147. | | commands exist in addition to the ability to |
  148. | | easily define your own. (AMQP/Redis only) |
  149. +-----------------+----------------------------------------------------+
  150. | Monitoring | You can capture everything happening with the |
  151. | | workers in real-time by subscribing to events. |
  152. | | A real-time web monitor is in development. |
  153. +-----------------+----------------------------------------------------+
  154. | Serialization | Supports Pickle, JSON, YAML, or easily defined |
  155. | | custom schemes. One task invocation can have a |
  156. | | different scheme than another. |
  157. +-----------------+----------------------------------------------------+
  158. | Tracebacks | Errors and tracebacks are stored and can be |
  159. | | investigated after the fact. |
  160. +-----------------+----------------------------------------------------+
  161. | UUID | Every task has an UUID (Universally Unique |
  162. | | Identifier), which is the task id used to query |
  163. | | task status and return value. |
  164. +-----------------+----------------------------------------------------+
  165. | Retries | Tasks can be retried if they fail, with |
  166. | | configurable maximum number of retries, and delays |
  167. | | between each retry. |
  168. +-----------------+----------------------------------------------------+
  169. | Task Sets | A Task set is a task consisting of several |
  170. | | sub-tasks. You can find out how many, or if all |
  171. | | of the sub-tasks has been executed, and even |
  172. | | retrieve the results in order. Progress bars, |
  173. | | anyone? |
  174. +-----------------+----------------------------------------------------+
  175. | Made for Web | You can query status and results via URLs, |
  176. | | enabling the ability to poll task status using |
  177. | | Ajax. |
  178. +-----------------+----------------------------------------------------+
  179. | Error Emails | Can be configured to send emails to the |
  180. | | administrators when tasks fails. |
  181. +-----------------+----------------------------------------------------+
  182. .. _`clustering`: http://www.rabbitmq.com/clustering.html
  183. .. _`HA`: http://www.rabbitmq.com/pacemaker.html
  184. .. _`AMQP`: http://www.amqp.org/
  185. .. _`Stomp`: http://stomp.codehaus.org/
  186. .. _`Tokyo Tyrant`: http://tokyocabinet.sourceforge.net/
  187. .. _celery-documentation:
  188. Documentation
  189. =============
  190. The `latest documentation`_ with user guides, tutorials and API reference
  191. is hosted at Github.
  192. .. _`latest documentation`: http://celery.github.com/celery/
  193. .. _celery-installation:
  194. Installation
  195. ============
  196. You can install Celery either via the Python Package Index (PyPI)
  197. or from source.
  198. To install using `pip`,::
  199. $ pip install -U Celery
  200. To install using `easy_install`,::
  201. $ easy_install -U Celery
  202. Bundles
  203. -------
  204. Celery also defines a group of bundles that can be used
  205. to install Celery and the dependencies for a given feature.
  206. The following bundles are available:
  207. :`celery-with-redis`_:
  208. for using Redis as a broker.
  209. :`celery-with-mongodb`_:
  210. for using MongoDB as a broker.
  211. :`django-celery-with-redis`_:
  212. for Django, and using Redis as a broker.
  213. :`django-celery-with-mongodb`_:
  214. for Django, and using MongoDB as a broker.
  215. .. _`celery-with-redis`:
  216. http://pypi.python.org/pypi/celery-with-redis/
  217. .. _`celery-with-mongodb`:
  218. http://pypi.python.org/pypi/celery-with-mongdb/
  219. .. _`django-celery-with-redis`:
  220. http://pypi.python.org/pypi/django-celery-with-redis/
  221. .. _`django-celery-with-mongodb`:
  222. http://pypi.python.org/pypi/django-celery-with-mongdb/
  223. .. _celery-installing-from-source:
  224. Downloading and installing from source
  225. --------------------------------------
  226. Download the latest version of Celery from
  227. http://pypi.python.org/pypi/celery/
  228. You can install it by doing the following,::
  229. $ tar xvfz celery-0.0.0.tar.gz
  230. $ cd celery-0.0.0
  231. $ python setup.py build
  232. # python setup.py install # as root
  233. .. _celery-installing-from-git:
  234. Using the development version
  235. -----------------------------
  236. You can clone the repository by doing the following::
  237. $ git clone https://github.com/celery/celery
  238. $ cd celery
  239. $ python setup.py develop
  240. The development version will usually also depend on the development
  241. version of `kombu`_, the messaging framework Celery uses
  242. to send and receive messages, so you should also install that from git::
  243. $ git clone https://github.com/celery/kombu
  244. $ cd kombu
  245. $ python setup.py develop
  246. .. _`kombu`: http://kombu.readthedocs.org/en/latest/