FAQ 13 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290291292293294295296297298299300301302303304305306307308309310311312313314315316317318319320321322323324325326327328329330331332333334335336337338339340341342343344345346347348349350351352353354355356357358359360361362363364365366367368369370371372373374375376377378379380381382383384385386387388389390391392393394395396397398399400401402403404405
  1. ============================
  2. Frequently Asked Questions
  3. ============================
  4. Troubleshooting
  5. ===============
  6. MySQL is throwing deadlock errors, what can I do?
  7. -------------------------------------------------
  8. **Answer:** MySQL has default isolation level set to ``REPEATABLE-READ``,
  9. if you don't really need that, set it to ``READ-COMMITTED``.
  10. You can do that by adding the following to your ``my.cnf``::
  11. [mysqld]
  12. transaction-isolation = READ-COMMITTED
  13. For more information about InnoDBs transaction model see `MySQL - The InnoDB
  14. Transaction Model and Locking`_ in the MySQL user manual.
  15. (Thanks to Honza Kral and Anton Tsigularov for this solution)
  16. .. _`MySQL - The InnoDB Transaction Model and Locking`: http://dev.mysql.com/doc/refman/5.1/en/innodb-transaction-model.html
  17. celeryd is not doing anything, just hanging
  18. --------------------------------------------
  19. **Answer:** See `MySQL is throwing deadlock errors, what can I do?`_.
  20. or `Why is Task.delay/apply\* just hanging?`.
  21. Why is Task.delay/apply\*/celeryd just hanging?
  22. -----------------------------------------------
  23. **Answer:** There is a bug in some AMQP clients that will make it hang if
  24. it's not able to authenticate the current user, the password doesn't match or
  25. the user does not have access to the virtual host specified. Be sure to check
  26. your broker logs (for RabbitMQ that is ``/var/log/rabbitmq/rabbit.log`` on
  27. most systems), it usually contains a message describing the reason.
  28. Why won't celeryd run on FreeBSD?
  29. ---------------------------------
  30. **Answer:** multiprocessing.Pool requires a working POSIX semaphore
  31. implementation which isn't enabled in FreeBSD by default. You have to enable
  32. POSIX semaphores in the kernel and manually recompile multiprocessing.
  33. I'm having ``IntegrityError: Duplicate Key`` errors. Why?
  34. ----------------------------------------------------------
  35. **Answer:** See `MySQL is throwing deadlock errors, what can I do?`_.
  36. Thanks to howsthedotcom.
  37. Why isn't my tasks processed?
  38. -----------------------------
  39. **Answer:** With RabbitMQ you can see how many consumers are currently
  40. receiving tasks by running the following command::
  41. $ rabbitmqctl list_queues -p <myvhost> name messages consumers
  42. Listing queues ...
  43. celery 2891 2
  44. This shows that there's 2891 messages waiting to be processed in the task
  45. queue, and there are two consumers processing them.
  46. One reason that the queue is never emptied could be that you have a stale
  47. celery process taking the messages hostage. This could happen if celeryd
  48. wasn't properly shut down.
  49. When a message is recieved by a worker the broker waits for it to be
  50. acknowledged before marking the message as processed. The broker will not
  51. re-send that message to another consumer until the consumer is shutdown
  52. properly.
  53. If you hit this problem you have to kill all workers manually and restart
  54. them::
  55. ps auxww | grep celeryd | awk '{print $2}' | xargs kill
  56. You might have to wait a while until all workers has finished the work they're
  57. doing, if it's still hanging after a long time you can kill them by force
  58. with::
  59. ps auxww | grep celeryd | awk '{print $2}' | xargs kill -9
  60. Why won't my Task run?
  61. ----------------------
  62. **Answer:** Did you register the task in the applications ``tasks.py`` module?
  63. (or in some other module Django loads by default, like ``models.py``?).
  64. Also there might be syntax errors preventing the tasks module being imported.
  65. You can find out if the celery daemon is able to run the task by executing the
  66. task manually:
  67. >>> from myapp.tasks import MyPeriodicTask
  68. >>> MyPeriodicTask.delay()
  69. Watch celery daemons logfile (or output if not running as a daemon), to see
  70. if it's able to find the task, or if some other error is happening.
  71. Why won't my Periodic Task run?
  72. -------------------------------
  73. **Answer:** See `Why won't my Task run?`_.
  74. How do I discard all waiting tasks?
  75. ------------------------------------
  76. **Answer:** Use ``celery.task.discard_all()``, like this:
  77. >>> from celery.task import discard_all
  78. >>> discard_all()
  79. 1753
  80. The number ``1753`` is the number of messages deleted.
  81. You can also start celeryd with the ``--discard`` argument which will
  82. accomplish the same thing.
  83. I've discarded messages, but there are still messages left in the queue?
  84. ------------------------------------------------------------------------
  85. **Answer:** Tasks are acknowledged (removed from the queue) as soon
  86. as they are actually executed. After the worker has received a task, it will
  87. take some time until it is actually executed, especially if there are a lot
  88. of tasks already waiting for execution. Messages that are not acknowledged are
  89. hold on to by the worker until it closes the connection to the broker (AMQP
  90. server). When that connection is closed (e.g because the worker was stopped)
  91. the tasks will be re-sent by the broker to the next available worker (or the
  92. same worker when it has been restarted), so to properly purge the queue of
  93. waiting tasks you have to stop all the workers, and then discard the tasks
  94. using ``discard_all``.
  95. Brokers
  96. =======
  97. Can I use celery with ActiveMQ/STOMP?
  98. -------------------------------------
  99. **Answer**: Yes. But this is somewhat experimental for now.
  100. It is certainly working ok for me in a test configuration, but it has not
  101. been tested in production like RabbitMQ. If you have any problems with
  102. using STOMP and celery, please report the bugs to the issue tracker:
  103. http://github.com/ask/celery/issues/
  104. First you have to use the ``master`` branch of ``celery``::
  105. $ git clone git://github.com/ask/celery.git
  106. $ cd celery
  107. $ sudo python setup.py install
  108. $ cd ..
  109. Then you need to install the ``stompbackend`` branch of ``carrot``::
  110. $ git clone git://github.com/ask/carrot.git
  111. $ cd carrot
  112. $ git checkout stompbackend
  113. $ sudo python setup.py install
  114. $ cd ..
  115. And my fork of ``python-stomp`` which adds non-blocking support::
  116. $ hg clone http://bitbucket.org/asksol/python-stomp/
  117. $ cd python-stomp
  118. $ sudo python setup.py install
  119. $ cd ..
  120. In this example we will use a queue called ``celery`` which we created in
  121. the ActiveMQ web admin interface.
  122. **Note**: For ActiveMQ the queue name has to have ``"/queue/"`` prepended to
  123. it. i.e. the queue ``celery`` becomes ``/queue/celery``.
  124. Since a STOMP queue is a single named entity and it doesn't have the
  125. routing capabilities of AMQP you need to set both the ``queue``, and
  126. ``exchange`` settings to your queue name. This is a minor inconvenience since
  127. carrot needs to maintain the same interface for both AMQP and STOMP (obviously
  128. the one with the most capabilities won).
  129. Use the following specific settings in your ``settings.py``:
  130. .. code-block:: python
  131. # Makes python-stomp the default backend for carrot.
  132. CARROT_BACKEND = "stomp"
  133. # STOMP hostname and port settings.
  134. AMQP_HOST = "localhost"
  135. AMQP_PORT = 61613
  136. # The queue name to use (both queue and exchange must be set to the
  137. # same queue name when using STOMP)
  138. CELERY_AMQP_CONSUMER_QUEUE = "/queue/celery"
  139. CELERY_AMQP_EXCHANGE = "/queue/celery"
  140. Now you can go on reading the tutorial in the README, ignoring any AMQP
  141. specific options.
  142. Which features are not supported when using STOMP?
  143. --------------------------------------------------
  144. This is a (possible incomplete) list of features not available when
  145. using the STOMP backend:
  146. * routing keys
  147. * exchange types (direct, topic, headers, etc)
  148. * immediate
  149. * mandatory
  150. Features
  151. ========
  152. Can I send some tasks to only some servers?
  153. --------------------------------------------
  154. **Answer:** As of now there is only one use-case that works like this,
  155. and that is tasks of type ``A`` can be sent to servers ``x`` and ``y``,
  156. while tasks of type ``B`` can be sent to server ``z``. One server can't
  157. handle more than one routing_key, but this is coming in a later release.
  158. Say you have two servers, ``x``, and ``y`` that handles regular tasks,
  159. and one server ``z``, that only handles feed related tasks, you can use this
  160. configuration:
  161. * Servers ``x`` and ``y``: settings.py:
  162. .. code-block:: python
  163. AMQP_SERVER = "rabbit"
  164. AMQP_PORT = 5678
  165. AMQP_USER = "myapp"
  166. AMQP_PASSWORD = "secret"
  167. AMQP_VHOST = "myapp"
  168. CELERY_AMQP_CONSUMER_QUEUE = "regular_tasks"
  169. CELERY_AMQP_EXCHANGE = "tasks"
  170. CELERY_AMQP_PUBLISHER_ROUTING_KEY = "task.regular"
  171. CELERY_AMQP_CONSUMER_ROUTING_KEY = "task.#"
  172. CELERY_AMQP_EXCHANGE_TYPE = "topic"
  173. * Server ``z``: settings.py:
  174. .. code-block:: python
  175. AMQP_SERVER = "rabbit"
  176. AMQP_PORT = 5678
  177. AMQP_USER = "myapp"
  178. AMQP_PASSWORD = "secret"
  179. AMQP_VHOST = "myapp"
  180. CELERY_AMQP_EXCHANGE = "tasks"
  181. CELERY_AMQP_PUBLISHER_ROUTING_KEY = "task.regular"
  182. CELERY_AMQP_EXCHANGE_TYPE = "topic"
  183. # This is the settings different for this server:
  184. CELERY_AMQP_CONSUMER_QUEUE = "feed_tasks"
  185. CELERY_AMQP_CONSUMER_ROUTING_KEY = "feed.#"
  186. Now to make a Task run on the ``z`` server you need to set its
  187. ``routing_key`` attribute so it starts with the words ``"task.feed."``:
  188. .. code-block:: python
  189. from feedaggregator.models import Feed
  190. from celery.task import Task
  191. class FeedImportTask(Task):
  192. routing_key = "feed.importer"
  193. def run(self, feed_url):
  194. # something importing the feed
  195. Feed.objects.import_feed(feed_url)
  196. You can also override this using the ``routing_key`` argument to
  197. :func:`celery.task.apply_async`:
  198. >>> from celery.task import apply_async
  199. >>> from myapp.tasks import RefreshFeedTask
  200. >>> apply_async(RefreshFeedTask, args=["http://cnn.com/rss"],
  201. ... routing_key="feed.importer")
  202. Can I use celery without Django?
  203. --------------------------------
  204. **Answer:** Yes.
  205. Celery uses something called loaders to read/setup configuration, import
  206. modules that registers tasks and to decide what happens when a task is
  207. executed. Currently there are two loaders, the default loader and the Django
  208. loader. If you want to use celery without a Django project, you either have to
  209. use the default loader, or write a loader of your own.
  210. The rest of this answer describes how to use the default loader.
  211. First of all, installation. You need to get the development version of
  212. celery from github::
  213. $ git clone git://github.com/ask/celery.git
  214. $ cd celery
  215. # python setup.py install # as root
  216. While it is possible to use celery from outside of Django, we still need
  217. Django itself to run, this is to use the ORM and cache-framework, etc.
  218. Duplicating these features would be time consuming and mostly pointless, so
  219. we decided that having a dependency on Django itself was a good thing.
  220. Install Django using your favorite install tool, ``easy_install``, ``pip``, or
  221. whatever::
  222. # easy_install django # as root
  223. You need a configuration file named ``celeryconfig.py``, either in the
  224. directory you run ``celeryd`` in, or in a Python library path where it is
  225. able to find it. The configuration file can contain any of the settings
  226. described in :mod:`celery.conf`, and in additional if you're using the
  227. database backend you have to configure the database. Here is an example
  228. configuration using the database backend with MySQL:
  229. .. code-block:: python
  230. # Broker configuration
  231. AMQP_SERVER = "localhost"
  232. AMQP_PORT = "5672"
  233. AMQP_VHOST = "celery"
  234. AMQP_USER = "celery"
  235. AMQP_PASSWORD = "celerysecret"
  236. CARROT_BACKEND="amqp"
  237. # Using the database backend.
  238. CELERY_BACKEND = "database"
  239. DATABASE_ENGINE = "mysql" # see Django docs for a description of these.
  240. DATABASE_NAME = "mydb"
  241. DATABASE_HOST = "mydb.example.org"
  242. DATABASE_USER = "myuser"
  243. DATABASE_PASSWORD = "mysecret"
  244. # Number of processes that processes tasks simultaneously.
  245. CELERYD_CONCURRENCY = 8
  246. # Modules to import when celeryd starts.
  247. # This must import every module where you register tasks so celeryd
  248. # is able to find and run them.
  249. CELERY_IMPORTS = ("mytaskmodule1", "mytaskmodule2")
  250. Now with this configuration file in the current directory you have to
  251. run ``celeryinit`` to create the database tables::
  252. $ celeryinit
  253. Then you should be able to successfully run ``celeryd``::
  254. $ celeryd --loglevel=INFO
  255. and send a task from a python shell (note that it must be able to import
  256. ``celeryconfig.py``):
  257. >>> from celery.task.builtins import PingTask
  258. >>> result = PingTask.apply_async()
  259. >>> result.get()
  260. 'pong'
  261. The celery test-suite is failing
  262. --------------------------------
  263. **Answer**: You're running tests from your own Django applicaiton, and celerys
  264. tests are failing and celerys tests are failing in that context?
  265. If so, read on for a trick, if not please report the test failure to our issue
  266. tracker at GitHub.
  267. http://github.com/ask/celery/issues/
  268. That Django is running tests for all applications in ``INSTALLED_APPS``
  269. is a pet peeve of mine. You should use a test runner that either
  270. 1) Explicitly lists the apps you want to run tests for, or
  271. 2) make a test runner that skips tests for apps you don't want to run.
  272. For example this test runner that celery is using:
  273. http://bit.ly/NVKep
  274. To use this add the following to your settings.py:
  275. .. code-block:: python
  276. TEST_RUNNER = "celery.tests.runners.run_tests"
  277. TEST_APPS = (
  278. "app1",
  279. "app2",
  280. "app3",
  281. "app4",
  282. )
  283. If you just want to skip celery you could use:
  284. .. code-block:: python
  285. INSTALLED_APPS = (.....)
  286. TEST_RUNNER = "celery.tests.runners.run_tests"
  287. TEST_APPS = filter(lambda k: k != "celery", INSTALLED_APPS)