routing.rst 16 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516
  1. ===============
  2. Routing Tasks
  3. ===============
  4. **NOTE** This document refers to functionality only available in brokers
  5. using AMQP. Other brokers may implement some functionality, see their
  6. respective documenation for more information, or contact the `mailinglist`_.
  7. .. _`mailinglist`: http://groups.google.com/group/celery-users
  8. .. contents::
  9. :local:
  10. Basics
  11. ======
  12. Automatic routing
  13. -----------------
  14. The simplest way to do routing is to use the ``CELERY_CREATE_MISSING_QUEUES``
  15. setting (on by default).
  16. With this setting on, a named queue that is not already defined in
  17. ``CELERY_QUEUES`` will be created automatically. This makes it easy to perform
  18. simple routing tasks.
  19. Say you have two servers, ``x``, and ``y`` that handles regular tasks,
  20. and one server ``z``, that only handles feed related tasks. You can use this
  21. configuration::
  22. CELERY_ROUTES = {"feed.tasks.import_feed": {"queue": "feeds"}}
  23. With this route enabled import feed tasks will be routed to the
  24. ``"feeds"`` queue, while all other tasks will be routed to the default queue
  25. (named ``"celery"`` for historic reasons).
  26. Now you can start server ``z`` to only process the feeds queue like this::
  27. (z)$ celeryd -Q feeds
  28. You can specify as many queues as you want, so you can make this server
  29. process the default queue as well::
  30. (z)$ celeryd -Q feeds,celery
  31. Changing the name of the default queue
  32. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  33. You can change the name of the default queue by using the following
  34. configuration:
  35. .. code-block:: python
  36. CELERY_QUEUES = {"default": {"exchange": "default",
  37. "binding_key": "default"}}
  38. CELERY_DEFAULT_QUEUE = "default"
  39. How the queues are defined
  40. ~~~~~~~~~~~~~~~~~~~~~~~~~~
  41. The point with this feature is to hide the complex AMQP protocol for users
  42. with only basic needs. However — you may still be interested in how these queues
  43. are defined.
  44. A queue named ``"video"`` will be created with the following settings:
  45. .. code-block:: python
  46. {"exchange": "video",
  47. "exchange_type": "direct",
  48. "routing_key": "video"}
  49. The non-AMQP backends like ``ghettoq`` does not support exchanges, so they
  50. require the exchange to have the same name as the queue. Using this design
  51. ensures it will work for them as well.
  52. Manual routing
  53. --------------
  54. Say you have two servers, ``x``, and ``y`` that handles regular tasks,
  55. and one server ``z``, that only handles feed related tasks, you can use this
  56. configuration:
  57. .. code-block:: python
  58. CELERY_DEFAULT_QUEUE = "default"
  59. CELERY_QUEUES = {
  60. "default": {
  61. "binding_key": "task.#",
  62. },
  63. "feed_tasks": {
  64. "binding_key": "feed.#",
  65. },
  66. }
  67. CELERY_DEFAULT_EXCHANGE = "tasks"
  68. CELERY_DEFAULT_EXCHANGE_TYPE = "topic"
  69. CELERY_DEFAULT_ROUTING_KEY = "task.default"
  70. ``CELERY_QUEUES`` is a map of queue names and their exchange/type/binding_key,
  71. if you don't set exchange or exchange type, they will be taken from the
  72. ``CELERY_DEFAULT_EXCHANGE``/``CELERY_DEFAULT_EXCHANGE_TYPE`` settings.
  73. To route a task to the ``feed_tasks`` queue, you can add an entry in the
  74. ``CELERY_ROUTES`` setting:
  75. .. code-block:: python
  76. CELERY_ROUTES = {
  77. "feeds.tasks.import_feed": {
  78. "queue": "feed_tasks",
  79. "routing_key": "feed.import",
  80. },
  81. }
  82. You can also override this using the ``routing_key`` argument to
  83. :func:`~celery.execute.apply_async`, or :func:`~celery.execute.send_task`:
  84. >>> from feeds.tasks import import_feed
  85. >>> import_feed.apply_async(args=["http://cnn.com/rss"],
  86. ... queue="feed_tasks",
  87. ... routing_key="feed.import")
  88. To make server ``z`` consume from the feed queue exclusively you can
  89. start it with the ``-Q`` option::
  90. (z)$ celeryd -Q feed_tasks --hostname=z.example.com
  91. Servers ``x`` and ``y`` must be configured to consume from the default queue::
  92. (x)$ celeryd -Q default --hostname=x.example.com
  93. (y)$ celeryd -Q default --hostname=y.example.com
  94. If you want, you can even have your feed processing worker handle regular
  95. tasks as well, maybe in times when there's a lot of work to do::
  96. (z)$ celeryd -Q feed_tasks,default --hostname=z.example.com
  97. If you have another queue but on another exchange you want to add,
  98. just specify a custom exchange and exchange type:
  99. .. code-block:: python
  100. CELERY_QUEUES = {
  101. "feed_tasks": {
  102. "binding_key": "feed.#",
  103. },
  104. "regular_tasks": {
  105. "binding_key": "task.#",
  106. },
  107. "image_tasks": {
  108. "binding_key": "image.compress",
  109. "exchange": "mediatasks",
  110. "exchange_type": "direct",
  111. },
  112. }
  113. If you're confused about these terms, you should read up on AMQP concepts.
  114. In addition to the :ref:`AMQP Primer` below, there's
  115. `Rabbits and Warrens`_, an excellent blog post describing queues and
  116. exchanges. There's also AMQP in 10 minutes*: `Flexible Routing Model`_,
  117. and `Standard Exchange Types`_. For users of RabbitMQ the `RabbitMQ FAQ`_
  118. could be useful as a source of information.
  119. .. _`Rabbits and Warrens`: http://blogs.digitar.com/jjww/2009/01/rabbits-and-warrens/
  120. .. _`Flexible Routing Model`: http://bit.ly/95XFO1
  121. .. _`Standard Exchange Types`: http://bit.ly/EEWca
  122. .. _`RabbitMQ FAQ`: http://www.rabbitmq.com/faq.html
  123. .. _`AMQP Primer`:
  124. AMQP Primer
  125. ===========
  126. Messages
  127. --------
  128. A message consists of headers and a body. Celery uses headers to store
  129. the content type of the message and its content encoding. In Celery the
  130. content type is usually the serialization format used to serialize the
  131. message, and the body contains the name of the task to execute, the
  132. task id (UUID), the arguments to execute it with and some additional
  133. metadata - like the number of retries and its ETA (if any).
  134. This is an example task message represented as a Python dictionary:
  135. .. code-block:: python
  136. {"task": "myapp.tasks.add",
  137. "id": "54086c5e-6193-4575-8308-dbab76798756",
  138. "args": [4, 4],
  139. "kwargs": {}}
  140. Producers, consumers and brokers
  141. --------------------------------
  142. The client sending messages is typically called a *publisher*, or
  143. a *producer*, while the entity receiving messages is called
  144. a *consumer*.
  145. The *broker* is the message server, routing messages from producers
  146. to consumers.
  147. You are likely to see these terms used a lot in AMQP related material.
  148. Exchanges, queues and routing keys.
  149. -----------------------------------
  150. 1. Messages are sent to exchanges.
  151. 2. An exchange routes messages to one or more queues. Several exchange types
  152. exists, providing different ways to do routing.
  153. 3. The message waits in the queue until someone consumes from it.
  154. 4. The message is deleted from the queue when it has been acknowledged.
  155. The steps required to send and receive messages are:
  156. 1. Create an exchange
  157. 2. Create a queue
  158. 3. Bind the queue to the exchange.
  159. Celery automatically creates the entities necessary for the queues in
  160. ``CELERY_QUEUES`` to work (except if the queue's ``auto_declare`` setting
  161. is set to :const:`False`).
  162. Here's an example queue configuration with three queues;
  163. One for video, one for images and finally, one default queue for everything else:
  164. .. code-block:: python
  165. CELERY_QUEUES = {
  166. "default": {
  167. "exchange": "default",
  168. "binding_key": "default"},
  169. "videos": {
  170. "exchange": "media",
  171. "binding_key": "media.video",
  172. },
  173. "images": {
  174. "exchange": "media",
  175. "binding_key": "media.image",
  176. }
  177. }
  178. CELERY_DEFAULT_QUEUE = "default"
  179. CELERY_DEFAULT_EXCHANGE_TYPE = "direct"
  180. CELERY_DEFAULT_ROUTING_KEY = "default"
  181. **NOTE**: In Celery the ``routing_key`` is the key used to send the message,
  182. while ``binding_key`` is the key the queue is bound with. In the AMQP API
  183. they are both referred to as the routing key.
  184. Exchange types
  185. --------------
  186. The exchange type defines how the messages are routed through the exchange.
  187. The exchange types defined in the standard are ``direct``, ``topic``,
  188. ``fanout`` and ``headers``. Also non-standard exchange types are available
  189. as plugins to RabbitMQ, like the `last-value-cache plug-in`_ by Michael
  190. Bridgen.
  191. .. _`last-value-cache plug-in`:
  192. http://github.com/squaremo/rabbitmq-lvc-plugin
  193. Direct exchanges
  194. ~~~~~~~~~~~~~~~~
  195. Direct exchanges match by exact routing keys, so a queue bound with
  196. the routing key ``video`` only receives messages with the same routing key.
  197. Topic exchanges
  198. ~~~~~~~~~~~~~~~
  199. Topic exchanges matches routing keys using dot-separated words, and can
  200. include wildcard characters: ``*`` matches a single word, ``#`` matches
  201. zero or more words.
  202. With routing keys like ``usa.news``, ``usa.weather``, ``norway.news`` and
  203. ``norway.weather``, bindings could be ``*.news`` (all news), ``usa.#`` (all
  204. items in the USA) or ``usa.weather`` (all USA weather items).
  205. Related API commands
  206. --------------------
  207. exchange.declare(exchange_name, type, passive, durable, auto_delete, internal)
  208. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  209. Declares an exchange by name.
  210. * ``passive`` means the exchange won't be created, but you can use this to
  211. check if the exchange already exists.
  212. * Durable exchanges are persistent. That is - they survive a broker restart.
  213. * ``auto_delete`` means the queue will be deleted by the broker when there
  214. are no more queues using it.
  215. queue.declare(queue_name, passive, durable, exclusive, auto_delete)
  216. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  217. Declares a queue by name.
  218. * exclusive queues can only be consumed from by the current connection.
  219. implies ``auto_delete``.
  220. queue.bind(queue_name, exchange_name, routing_key)
  221. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  222. Binds a queue to an exchange with a routing key.
  223. Unbound queues will not receive messages, so this is necessary.
  224. queue.delete(name, if_unused, if_empty)
  225. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  226. Deletes a queue and its binding.
  227. exchange.delete(name, if_unused)
  228. ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
  229. Deletes an exchange.
  230. **NOTE**: Declaring does not necessarily mean "create". When you declare you
  231. *assert* that the entity exists and that it's operable. There is no rule as to
  232. whom should initially create the exchange/queue/binding, whether consumer
  233. or producer. Usually the first one to need it will be the one to create it.
  234. Hands-on with the API
  235. ---------------------
  236. Celery comes with a tool called ``camqadm`` (short for celery AMQP admin).
  237. It's used for simple admnistration tasks like creating/deleting queues and
  238. exchanges, purging queues and sending messages. In short it's for simple
  239. command-line access to the AMQP API.
  240. You can write commands directly in the arguments to ``camqadm``, or just start
  241. with no arguments to start it in shell-mode::
  242. $ camqadm
  243. -> connecting to amqp://guest@localhost:5672/.
  244. -> connected.
  245. 1>
  246. Here ``1>`` is the prompt. The number is counting the number of commands you
  247. have executed. Type ``help`` for a list of commands. It also has
  248. autocompletion, so you can start typing a command and then hit the
  249. ``tab`` key to show a list of possible matches.
  250. Now let's create a queue we can send messages to::
  251. 1> exchange.declare testexchange direct
  252. ok.
  253. 2> queue.declare testqueue
  254. ok. queue:testqueue messages:0 consumers:0.
  255. 3> queue.bind testqueue testexchange testkey
  256. ok.
  257. This created the direct exchange ``testexchange``, and a queue
  258. named ``testqueue``. The queue is bound to the exchange using
  259. the routing key ``testkey``.
  260. From now on all messages sent to the exchange ``testexchange`` with routing
  261. key ``testkey`` will be moved to this queue. We can send a message by
  262. using the ``basic.publish`` command::
  263. 4> basic.publish "This is a message!" testexchange testkey
  264. ok.
  265. Now that the message is sent we can retrieve it again. We use the
  266. ``basic.get`` command here, which pops a single message off the queue,
  267. this command is not recommended for production as it implies polling, any
  268. real application would declare consumers instead.
  269. Pop a message off the queue::
  270. 5> basic.get testqueue
  271. {'body': 'This is a message!',
  272. 'delivery_info': {'delivery_tag': 1,
  273. 'exchange': u'testexchange',
  274. 'message_count': 0,
  275. 'redelivered': False,
  276. 'routing_key': u'testkey'},
  277. 'properties': {}}
  278. AMQP uses acknowledgment to signify that a message has been received
  279. and processed successfully. The message is sent to the next receiver
  280. if it has not been acknowledged before the client connection is closed.
  281. Note the delivery tag listed in the structure above; Within a connection channel,
  282. every received message has a unique delivery tag,
  283. This tag is used to acknowledge the message. Note that
  284. delivery tags are not unique across connections, so in another client
  285. the delivery tag ``1`` might point to a different message than in this channel.
  286. You can acknowledge the message we received using ``basic.ack``::
  287. 6> basic.ack 1
  288. ok.
  289. To clean up after our test session we should delete the entities we created::
  290. 7> queue.delete testqueue
  291. ok. 0 messages deleted.
  292. 8> exchange.delete testexchange
  293. ok.
  294. Routing Tasks
  295. =============
  296. Defining queues
  297. ---------------
  298. In Celery the queues are defined by the ``CELERY_QUEUES`` setting.
  299. Here's an example queue configuration with three queues;
  300. One for video, one for images and finally, one default queue for everything else:
  301. .. code-block:: python
  302. CELERY_QUEUES = {
  303. "default": {
  304. "exchange": "default",
  305. "binding_key": "default"},
  306. "videos": {
  307. "exchange": "media",
  308. "exchange_type": "topic",
  309. "binding_key": "media.video",
  310. },
  311. "images": {
  312. "exchange": "media",
  313. "exchange_type": "topic",
  314. "binding_key": "media.image",
  315. }
  316. }
  317. CELERY_DEFAULT_QUEUE = "default"
  318. CELERY_DEFAULT_EXCHANGE = "default"
  319. CELERY_DEFAULT_EXCHANGE_TYPE = "direct"
  320. CELERY_DEFAULT_ROUTING_KEY = "default"
  321. Here, the ``CELERY_DEFAULT_QUEUE`` will be used to route tasks that doesn't
  322. have an explicit route.
  323. The default exchange, exchange type and routing key will be used as the
  324. default routing values for tasks, and as the default values for entries
  325. in ``CELERY_QUEUES``.
  326. Specifying task destination
  327. ---------------------------
  328. The destination for a task is decided by the following (in order):
  329. 1. The :ref:`routers` defined in ``CELERY_ROUTES``.
  330. 2. The routing arguments to :func:`~celery.execute.apply_async`.
  331. 3. Routing related attributes defined on the :class:`~celery.task.base.Task` itself.
  332. It is considered best practice to not hard-code these settings, but rather
  333. leave that as configuration options by using :ref:`routers`;
  334. This is the most flexible approach, but sensible defaults can still be set
  335. as task attributes.
  336. .. _routers:
  337. Routers
  338. -------
  339. A router is a class that decides the routing options for a task.
  340. All you need to define a new router is to create a class with a
  341. ``route_for_task`` method:
  342. .. code-block:: python
  343. class MyRouter(object):
  344. def route_for_task(self, task, args=None, kwargs=None):
  345. if task == "myapp.tasks.compress_video":
  346. return {"exchange": "video",
  347. "exchange_type": "topic",
  348. "routing_key": "video.compress"}
  349. return None
  350. If you return the ``queue`` key, it will expand with the defined settings of
  351. that queue in ``CELERY_QUEUES``::
  352. {"queue": "video", "routing_key": "video.compress"}
  353. becomes -->
  354. {"queue": "video",
  355. "exchange": "video",
  356. "exchange_type": "topic",
  357. "routing_key": "video.compress"}
  358. You install router classes by adding it to the ``CELERY_ROUTES`` setting::
  359. CELERY_ROUTES = (MyRouter, )
  360. Router classes can also be added by name::
  361. CELERY_ROUTES = ("myapp.routers.MyRouter", )
  362. For simple task name -> route mappings like the router example above, you can simply
  363. drop a dict into ``CELERY_ROUTES`` to get the same result::
  364. CELERY_ROUTES = ({"myapp.tasks.compress_video": {
  365. "queue": "video",
  366. "routing_key": "video.compress"}}, )
  367. The routers will then be traversed in order, it will stop at the first router
  368. returning a value and use that as the final route for the task.