FAQ 22 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405406407408409410411412413414415416417418419420421422423424425426427428429430431432433434435436437438439440441442443444445446447448449450451452453454455456457458459460461462463464465466467468469470471472473474475476477478479480481482483484485486487488489490491492493494495496497498499500501502503504505506507508509510511512513514515516517518519520521522523524525526527528529530531532533534535536537538539540541542543544545546547548549550551552553554555556557558559560561562563564565566567568569570571572573574575576577578579580581582583584585586587588589590591592593594595596597598599600601602603604605606607608609610611612613614615616617618619620621622623624625626627628629630631632633634635636637638639640641642643644645646647648649650651652653654655656657658659660661662663664665666667668669670671672673674675676677678679680681682683684685686687688689690691692693694695696697698699700701702703704705706707708709710711712713714715716717718719720721722723724
  1. .. _faq:
  2. ============================
  3. Frequently Asked Questions
  4. ============================
  5. .. contents::
  6. :local:
  7. .. _faq-general:
  8. General
  9. =======
  10. .. _faq-when-to-use:
  11. What kinds of things should I use Celery for?
  12. ---------------------------------------------
  13. **Answer:** `Queue everything and delight everyone`_ is a good article
  14. describing why you would use a queue in a web context.
  15. .. _`Queue everything and delight everyone`:
  16. http://decafbad.com/blog/2008/07/04/queue-everything-and-delight-everyone
  17. These are some common use cases:
  18. * Running something in the background. For example, to finish the web request
  19. as soon as possible, then update the users page incrementally.
  20. This gives the user the impression of good performance and "snappiness", even
  21. though the real work might actually take some time.
  22. * Running something after the web request has finished.
  23. * Making sure something is done, by executing it asynchronously and using
  24. retries.
  25. * Scheduling periodic work.
  26. And to some degree:
  27. * Distributed computing.
  28. * Parallel execution.
  29. .. _faq-misconceptions:
  30. Misconceptions
  31. ==============
  32. .. _faq-serializion-is-a-choice:
  33. Is Celery dependent on pickle?
  34. ------------------------------
  35. **Answer:** No.
  36. Celery can support any serialization scheme and has support for JSON/YAML and
  37. Pickle by default. You can even send one task using pickle, and another one
  38. with JSON seamlessly, this is because every task is associated with a
  39. content-type. The default serialization scheme is pickle because it's the most
  40. used, and it has support for sending complex objects as task arguments.
  41. You can set a global default serializer, the default serializer for a
  42. particular Task, or even what serializer to use when sending a single task
  43. instance.
  44. .. _faq-is-celery-for-django-only:
  45. Is Celery for Django only?
  46. --------------------------
  47. **Answer:** No.
  48. Celery does not depend on Django anymore. To use Celery with Django you have
  49. to use the `django-celery`_ package.
  50. .. _`django-celery`: http://pypi.python.org/pypi/django-celery
  51. .. _faq-is-celery-for-rabbitmq-only:
  52. Do I have to use AMQP/RabbitMQ?
  53. -------------------------------
  54. **Answer**: No.
  55. You can also use Redis or an SQL database, see `Using other
  56. queues`_.
  57. .. _`Using other queues`:
  58. http://ask.github.com/celery/tutorials/otherqueues.html
  59. Redis or a database won't perform as well as
  60. an AMQP broker. If you have strict reliability requirements you are
  61. encouraged to use RabbitMQ or another AMQP broker. Redis/database also use
  62. polling, so they are likely to consume more resources. However, if you for
  63. some reason are not able to use AMQP, feel free to use these alternatives.
  64. They will probably work fine for most use cases, and note that the above
  65. points are not specific to Celery; If using Redis/database as a queue worked
  66. fine for you before, it probably will now. You can always upgrade later
  67. if you need to.
  68. .. _faq-is-celery-multilingual:
  69. Is Celery multilingual?
  70. ------------------------
  71. **Answer:** Yes.
  72. :mod:`~celery.bin.celeryd` is an implementation of Celery in python. If the
  73. language has an AMQP client, there shouldn't be much work to create a worker
  74. in your language. A Celery worker is just a program connecting to the broker
  75. to process messages.
  76. Also, there's another way to be language independent, and that is to use REST
  77. tasks, instead of your tasks being functions, they're URLs. With this
  78. information you can even create simple web servers that enable preloading of
  79. code. See: `User Guide: Remote Tasks`_.
  80. .. _`User Guide: Remote Tasks`:
  81. http://ask.github.com/celery/userguide/remote-tasks.html
  82. .. _faq-troubleshooting:
  83. Troubleshooting
  84. ===============
  85. .. _faq-mysql-deadlocks:
  86. MySQL is throwing deadlock errors, what can I do?
  87. -------------------------------------------------
  88. **Answer:** MySQL has default isolation level set to `REPEATABLE-READ`,
  89. if you don't really need that, set it to `READ-COMMITTED`.
  90. You can do that by adding the following to your :file:`my.cnf`::
  91. [mysqld]
  92. transaction-isolation = READ-COMMITTED
  93. For more information about InnoDB`s transaction model see `MySQL - The InnoDB
  94. Transaction Model and Locking`_ in the MySQL user manual.
  95. (Thanks to Honza Kral and Anton Tsigularov for this solution)
  96. .. _`MySQL - The InnoDB Transaction Model and Locking`: http://dev.mysql.com/doc/refman/5.1/en/innodb-transaction-model.html
  97. .. _faq-worker-hanging:
  98. celeryd is not doing anything, just hanging
  99. --------------------------------------------
  100. **Answer:** See `MySQL is throwing deadlock errors, what can I do?`_.
  101. or `Why is Task.delay/apply\* just hanging?`.
  102. .. _faq-publish-hanging:
  103. Why is Task.delay/apply\*/celeryd just hanging?
  104. -----------------------------------------------
  105. **Answer:** There is a bug in some AMQP clients that will make it hang if
  106. it's not able to authenticate the current user, the password doesn't match or
  107. the user does not have access to the virtual host specified. Be sure to check
  108. your broker logs (for RabbitMQ that is :file:`/var/log/rabbitmq/rabbit.log` on
  109. most systems), it usually contains a message describing the reason.
  110. .. _faq-celeryd-on-freebsd:
  111. Why won't celeryd run on FreeBSD?
  112. ---------------------------------
  113. **Answer:** multiprocessing. Pool requires a working POSIX semaphore
  114. implementation which isn't enabled in FreeBSD by default. You have to enable
  115. POSIX semaphores in the kernel and manually recompile multiprocessing.
  116. Luckily, Viktor Petersson has written a tutorial to get you started with
  117. Celery on FreeBSD here:
  118. http://www.playingwithwire.com/2009/10/how-to-get-celeryd-to-work-on-freebsd/
  119. .. _faq-duplicate-key-errors:
  120. I'm having `IntegrityError: Duplicate Key` errors. Why?
  121. ---------------------------------------------------------
  122. **Answer:** See `MySQL is throwing deadlock errors, what can I do?`_.
  123. Thanks to howsthedotcom.
  124. .. _faq-worker-stops-processing:
  125. Why aren't my tasks processed?
  126. ------------------------------
  127. **Answer:** With RabbitMQ you can see how many consumers are currently
  128. receiving tasks by running the following command::
  129. $ rabbitmqctl list_queues -p <myvhost> name messages consumers
  130. Listing queues ...
  131. celery 2891 2
  132. This shows that there's 2891 messages waiting to be processed in the task
  133. queue, and there are two consumers processing them.
  134. One reason that the queue is never emptied could be that you have a stale
  135. worker process taking the messages hostage. This could happen if celeryd
  136. wasn't properly shut down.
  137. When a message is received by a worker the broker waits for it to be
  138. acknowledged before marking the message as processed. The broker will not
  139. re-send that message to another consumer until the consumer is shut down
  140. properly.
  141. If you hit this problem you have to kill all workers manually and restart
  142. them::
  143. ps auxww | grep celeryd | awk '{print $2}' | xargs kill
  144. You might have to wait a while until all workers have finished the work they're
  145. doing. If it's still hanging after a long time you can kill them by force
  146. with::
  147. ps auxww | grep celeryd | awk '{print $2}' | xargs kill -9
  148. .. _faq-task-does-not-run:
  149. Why won't my Task run?
  150. ----------------------
  151. **Answer:** There might be syntax errors preventing the tasks module being imported.
  152. You can find out if Celery is able to run the task by executing the
  153. task manually:
  154. >>> from myapp.tasks import MyPeriodicTask
  155. >>> MyPeriodicTask.delay()
  156. Watch celeryd`s log file to see if it's able to find the task, or if some
  157. other error is happening.
  158. .. _faq-periodic-task-does-not-run:
  159. Why won't my Periodic Task run?
  160. -------------------------------
  161. **Answer:** See `Why won't my Task run?`_.
  162. .. _faq-purge-the-queue:
  163. How do I discard all waiting tasks?
  164. ------------------------------------
  165. **Answer:** Use :func:`~celery.task.control.discard_all`, like this:
  166. >>> from celery.task.control import discard_all
  167. >>> discard_all()
  168. 1753
  169. The number 1753 is the number of messages deleted.
  170. You can also start :mod:`~celery.bin.celeryd` with the
  171. :option:`--discard` argument which will accomplish the same thing.
  172. .. _faq-messages-left-after-purge:
  173. I've discarded messages, but there are still messages left in the queue?
  174. ------------------------------------------------------------------------
  175. **Answer:** Tasks are acknowledged (removed from the queue) as soon
  176. as they are actually executed. After the worker has received a task, it will
  177. take some time until it is actually executed, especially if there are a lot
  178. of tasks already waiting for execution. Messages that are not acknowledged are
  179. held on to by the worker until it closes the connection to the broker (AMQP
  180. server). When that connection is closed (e.g. because the worker was stopped)
  181. the tasks will be re-sent by the broker to the next available worker (or the
  182. same worker when it has been restarted), so to properly purge the queue of
  183. waiting tasks you have to stop all the workers, and then discard the tasks
  184. using :func:`~celery.task.control.discard_all`.
  185. .. _faq-results:
  186. Results
  187. =======
  188. .. _faq-get-result-by-task-id:
  189. How do I get the result of a task if I have the ID that points there?
  190. ----------------------------------------------------------------------
  191. **Answer**: Use `Task.AsyncResult`::
  192. >>> result = MyTask.AsyncResult(task_id)
  193. >>> result.get()
  194. This will give you a :class:`~celery.result.BaseAsyncResult` instance
  195. using the tasks current result backend.
  196. If you need to specify a custom result backend you should use
  197. :class:`celery.result.BaseAsyncResult` directly::
  198. >>> from celery.result import BaseAsyncResult
  199. >>> result = BaseAsyncResult(task_id, backend=...)
  200. >>> result.get()
  201. .. _faq-security:
  202. Security
  203. ========
  204. Isn't using `pickle` a security concern?
  205. ----------------------------------------
  206. **Answer**: Yes, indeed it is.
  207. You are right to have a security concern, as this can indeed be a real issue.
  208. It is essential that you protect against unauthorized
  209. access to your broker, databases and other services transmitting pickled
  210. data.
  211. For the task messages you can set the :setting:`CELERY_TASK_SERIALIZER`
  212. setting to "json" or "yaml" instead of pickle. There is
  213. currently no alternative solution for task results (but writing a
  214. custom result backend using JSON is a simple task)
  215. Note that this is not just something you should be aware of with Celery, for
  216. example also Django uses pickle for its cache client.
  217. Can messages be encrypted?
  218. --------------------------
  219. **Answer**: Some AMQP brokers supports using SSL (including RabbitMQ).
  220. You can enable this using the :setting:`BROKER_USE_SSL` setting.
  221. It is also possible to add additional encryption and security to messages,
  222. if you have a need for this then you should contact the :ref:`mailing-list`.
  223. Is it safe to run :program:`celeryd` as root?
  224. ---------------------------------------------
  225. **Answer**: No!
  226. We're not currently aware of any security issues, but it would
  227. be incredibly naive to assume that they don't exist, so running
  228. the Celery services (:program:`celeryd`, :program:`celerybeat`,
  229. :program:`celeryev`, etc) as an unprivileged user is recommended.
  230. .. _faq-brokers:
  231. Brokers
  232. =======
  233. Why is RabbitMQ crashing?
  234. -------------------------
  235. **Answer:** RabbitMQ will crash if it runs out of memory. This will be fixed in a
  236. future release of RabbitMQ. please refer to the RabbitMQ FAQ:
  237. http://www.rabbitmq.com/faq.html#node-runs-out-of-memory
  238. .. note::
  239. This is no longer the case, RabbitMQ versions 2.0 and above
  240. includes a new persister, that is tolerant to out of memory
  241. errors. RabbitMQ 2.1 or higher is recommended for Celery.
  242. If you're still running an older version of RabbitMQ and experience
  243. crashes, then please upgrade!
  244. Misconfiguration of Celery can eventually lead to a crash
  245. on older version of RabbitMQ. Even if it doesn't crash, this
  246. can still consume a lot of resources, so it is very
  247. important that you are aware of the common pitfalls.
  248. * Events.
  249. Running :mod:`~celery.bin.celeryd` with the :option:`-E`/:option:`--events`
  250. option will send messages for events happening inside of the worker.
  251. Events should only be enabled if you have an active monitor consuming them,
  252. or if you purge the event queue periodically.
  253. * AMQP backend results.
  254. When running with the AMQP result backend, every task result will be sent
  255. as a message. If you don't collect these results, they will build up and
  256. RabbitMQ will eventually run out of memory.
  257. If you don't use the results for a task, make sure you set the
  258. `ignore_result` option:
  259. .. code-block python
  260. @task(ignore_result=True)
  261. def mytask():
  262. ...
  263. class MyTask(Task):
  264. ignore_result = True
  265. Results can also be disabled globally using the
  266. :setting:`CELERY_IGNORE_RESULT` setting.
  267. .. note::
  268. Celery version 2.1 added support for automatic expiration of
  269. AMQP result backend results.
  270. To use this you need to run RabbitMQ 2.1 or higher and enable
  271. the :setting:`CELERY_AMQP_TASK_RESULT_EXPIRES` setting.
  272. .. _faq-use-celery-with-stomp:
  273. Can I use Celery with ActiveMQ/STOMP?
  274. -------------------------------------
  275. **Answer**: No. It used to be supported by Carrot,
  276. but is not currently supported in Kombu.
  277. .. _faq-non-amqp-missing-features:
  278. What features are not supported when not using an AMQP broker?
  279. --------------------------------------------------------------
  280. This is an incomplete list of features not available when
  281. using the virtual transports:
  282. * The `header` exchange type.
  283. * immediate
  284. * mandatory
  285. .. _faq-tasks:
  286. Tasks
  287. =====
  288. .. _faq-tasks-connection-reuse:
  289. How can I reuse the same connection when applying tasks?
  290. --------------------------------------------------------
  291. **Answer**: See :ref:`executing-connections`.
  292. .. _faq-execute-task-by-name:
  293. Can I execute a task by name?
  294. -----------------------------
  295. **Answer**: Yes. Use :func:`celery.execute.send_task`.
  296. You can also execute a task by name from any language
  297. that has an AMQP client.
  298. >>> from celery.execute import send_task
  299. >>> send_task("tasks.add", args=[2, 2], kwargs={})
  300. <AsyncResult: 373550e8-b9a0-4666-bc61-ace01fa4f91d>
  301. .. _faq-get-current-task-id:
  302. How can I get the task id of the current task?
  303. ----------------------------------------------
  304. **Answer**: The current id and more is available in the task request::
  305. @task
  306. def mytask():
  307. cache.set(mytask.request.id, "Running")
  308. For more information see :ref:`task-request-info`.
  309. .. _faq-custom-task-ids:
  310. Can I specify a custom task_id?
  311. -------------------------------
  312. **Answer**: Yes. Use the `task_id` argument to
  313. :meth:`~celery.execute.apply_async`::
  314. >>> task.apply_async(args, kwargs, task_id="...")
  315. Can I use decorators with tasks?
  316. --------------------------------
  317. **Answer**: Yes. But please see note at :ref:`tasks-decorating`.
  318. .. _faq-natural-task-ids:
  319. Can I use natural task ids?
  320. ---------------------------
  321. **Answer**: Yes, but make sure it is unique, as the behavior
  322. for two tasks existing with the same id is undefined.
  323. The world will probably not explode, but at the worst
  324. they can overwrite each others results.
  325. .. _faq-task-callbacks:
  326. How can I run a task once another task has finished?
  327. ----------------------------------------------------
  328. **Answer**: You can safely launch a task inside a task.
  329. Also, a common pattern is to use callback tasks:
  330. .. code-block:: python
  331. @task()
  332. def add(x, y, callback=None):
  333. result = x + y
  334. if callback:
  335. subtask(callback).delay(result)
  336. return result
  337. @task(ignore_result=True)
  338. def log_result(result, **kwargs):
  339. logger = log_result.get_logger(**kwargs)
  340. logger.info("log_result got: %s" % (result, ))
  341. Invocation::
  342. >>> add.delay(2, 2, callback=log_result.subtask())
  343. See :doc:`userguide/tasksets` for more information.
  344. .. _faq-cancel-task:
  345. Can I cancel the execution of a task?
  346. -------------------------------------
  347. **Answer**: Yes. Use `result.revoke`::
  348. >>> result = add.apply_async(args=[2, 2], countdown=120)
  349. >>> result.revoke()
  350. or if you only have the task id::
  351. >>> from celery.task.control import revoke
  352. >>> revoke(task_id)
  353. .. _faq-node-not-receiving-broadcast-commands:
  354. Why aren't my remote control commands received by all workers?
  355. --------------------------------------------------------------
  356. **Answer**: To receive broadcast remote control commands, every worker node
  357. uses its host name to create a unique queue name to listen to,
  358. so if you have more than one worker with the same host name, the
  359. control commands will be received in round-robin between them.
  360. To work around this you can explicitly set the host name for every worker
  361. using the :option:`--hostname` argument to :mod:`~celery.bin.celeryd`::
  362. $ celeryd --hostname=$(hostname).1
  363. $ celeryd --hostname=$(hostname).2
  364. etc., etc...
  365. .. _faq-task-routing:
  366. Can I send some tasks to only some servers?
  367. --------------------------------------------
  368. **Answer:** Yes. You can route tasks to an arbitrary server using AMQP,
  369. and a worker can bind to as many queues as it wants.
  370. See :doc:`userguide/routing` for more information.
  371. .. _faq-change-periodic-task-interval-at-runtime:
  372. Can I change the interval of a periodic task at runtime?
  373. --------------------------------------------------------
  374. **Answer**: Yes. You can override `PeriodicTask.is_due` or turn
  375. `PeriodicTask.run_every` into a property:
  376. .. code-block:: python
  377. class MyPeriodic(PeriodicTask):
  378. def run(self):
  379. # ...
  380. @property
  381. def run_every(self):
  382. return get_interval_from_database(...)
  383. .. _faq-task-priorities:
  384. Does celery support task priorities?
  385. ------------------------------------
  386. **Answer**: No. In theory, yes, as AMQP supports priorities. However
  387. RabbitMQ doesn't implement them yet.
  388. The usual way to prioritize work in Celery, is to route high priority tasks
  389. to different servers. In the real world this may actually work better than per message
  390. priorities. You can use this in combination with rate limiting to achieve a
  391. highly responsive system.
  392. .. _faq-acks_late-vs-retry:
  393. Should I use retry or acks_late?
  394. --------------------------------
  395. **Answer**: Depends. It's not necessarily one or the other, you may want
  396. to use both.
  397. `Task.retry` is used to retry tasks, notably for expected errors that
  398. is catchable with the `try:` block. The AMQP transaction is not used
  399. for these errors: **if the task raises an exception it is still acknowledged!**.
  400. The `acks_late` setting would be used when you need the task to be
  401. executed again if the worker (for some reason) crashes mid-execution.
  402. It's important to note that the worker is not known to crash, and if
  403. it does it is usually an unrecoverable error that requires human
  404. intervention (bug in the worker, or task code).
  405. In an ideal world you could safely retry any task that has failed, but
  406. this is rarely the case. Imagine the following task:
  407. .. code-block:: python
  408. @task()
  409. def process_upload(filename, tmpfile):
  410. # Increment a file count stored in a database
  411. increment_file_counter()
  412. add_file_metadata_to_db(filename, tmpfile)
  413. copy_file_to_destination(filename, tmpfile)
  414. If this crashed in the middle of copying the file to its destination
  415. the world would contain incomplete state. This is not a critical
  416. scenario of course, but you can probably imagine something far more
  417. sinister. So for ease of programming we have less reliability;
  418. It's a good default, users who require it and know what they
  419. are doing can still enable acks_late (and in the future hopefully
  420. use manual acknowledgement)
  421. In addition `Task.retry` has features not available in AMQP
  422. transactions: delay between retries, max retries, etc.
  423. So use retry for Python errors, and if your task is idempotent
  424. combine that with `acks_late` if that level of reliability
  425. is required.
  426. .. _faq-schedule-at-specific-time:
  427. Can I schedule tasks to execute at a specific time?
  428. ---------------------------------------------------
  429. .. module:: celery.task.base
  430. **Answer**: Yes. You can use the `eta` argument of :meth:`Task.apply_async`.
  431. Or to schedule a periodic task at a specific time, use the
  432. :class:`celery.task.schedules.crontab` schedule behavior:
  433. .. code-block:: python
  434. from celery.task.schedules import crontab
  435. from celery.task import periodic_task
  436. @periodic_task(run_every=crontab(hours=7, minute=30, day_of_week="mon"))
  437. def every_monday_morning():
  438. print("This is run every Monday morning at 7:30")
  439. .. _faq-safe-worker-shutdown:
  440. How do I shut down `celeryd` safely?
  441. --------------------------------------
  442. **Answer**: Use the :sig:`TERM` signal, and the worker will finish all currently
  443. executing jobs and shut down as soon as possible. No tasks should be lost.
  444. You should never stop :mod:`~celery.bin.celeryd` with the :sig:`KILL` signal
  445. (:option:`-9`), unless you've tried :sig:`TERM` a few times and waited a few
  446. minutes to let it get a chance to shut down. As if you do tasks may be
  447. terminated mid-execution, and they will not be re-run unless you have the
  448. `acks_late` option set (`Task.acks_late` / :setting:`CELERY_ACKS_LATE`).
  449. .. seealso::
  450. :ref:`worker-stopping`
  451. .. _faq-daemonizing:
  452. How do I run celeryd in the background on [platform]?
  453. -----------------------------------------------------
  454. **Answer**: Please see :ref:`daemonizing`.
  455. .. _faq-windows:
  456. Windows
  457. =======
  458. .. _faq-windows-worker-spawn-loop:
  459. celeryd keeps spawning processes at startup
  460. -------------------------------------------
  461. **Answer**: This is a known issue on Windows.
  462. You have to start celeryd with the command::
  463. $ python -m celeryd.bin.celeryd
  464. Any additional arguments can be appended to this command.
  465. See http://bit.ly/bo9RSw
  466. .. _faq-windows-worker-embedded-beat:
  467. The `-B` / `--beat` option to celeryd doesn't work?
  468. ----------------------------------------------------------------
  469. **Answer**: That's right. Run `celerybeat` and `celeryd` as separate
  470. services instead.
  471. .. _faq-windows-django-settings:
  472. `django-celery` can't find settings?
  473. --------------------------------------
  474. **Answer**: You need to specify the :option:`--settings` argument to
  475. :program:`manage.py`::
  476. $ python manage.py celeryd start --settings=settings
  477. See http://bit.ly/bo9RSw