periodic-tasks.rst 11 KB

123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247248249250251252253254255256257258259260261262263264265266267268269270271272273274275276277278279280281282283284285286287288289290
  1. .. _guide-beat:
  2. ================
  3. Periodic Tasks
  4. ================
  5. .. contents::
  6. :local:
  7. Introduction
  8. ============
  9. :program:`celery beat` is a scheduler. It kicks off tasks at regular intervals,
  10. which are then executed by the worker nodes available in the cluster.
  11. By default the entries are taken from the :setting:`CELERYBEAT_SCHEDULE` setting,
  12. but custom stores can also be used, like storing the entries
  13. in an SQL database.
  14. You have to ensure only a single scheduler is running for a schedule
  15. at a time, otherwise you would end up with duplicate tasks. Using
  16. a centralized approach means the schedule does not have to be synchronized,
  17. and the service can operate without using locks.
  18. .. _beat-timezones:
  19. Time Zones
  20. ==========
  21. The periodic task schedules uses the UTC time zone by default,
  22. but you can change the time zone used using the :setting:`CELERY_TIMEZONE`
  23. setting.
  24. The `pytz`_ library is recommended when setting a default timezone.
  25. If :mod:`pytz` is not installed it will fallback to the mod:`dateutil`
  26. library, which depends on a system timezone file being available for
  27. the timezone selected.
  28. Timezone definitions change frequently, so for the best results
  29. an up to date :mod:`pytz` installation should be used.
  30. .. code-block:: bash
  31. $ pip install -U pytz
  32. An example time zone could be `Europe/London`:
  33. .. code-block:: python
  34. CELERY_TIMEZONE = 'Europe/London'
  35. The default scheduler (storing the schedule in the :file:`celerybeat-schedule`
  36. file) will automatically detect that the time zone has changed, and so will
  37. reset the schedule itself, but other schedulers may not be so smart (e.g. the
  38. Django database scheduler, see below) and in that case you will have to reset the
  39. schedule manually.
  40. .. admonition:: Django Users
  41. Celery recommends and is compatible with the new ``USE_TZ`` setting introduced
  42. in Django 1.4.
  43. For Django users the time zone specified in the ``TIME_ZONE`` setting
  44. will be used, or you can specify a custom time zone for Celery alone
  45. by using the :setting:`CELERY_TIMEZONE` setting.
  46. The database scheduler will not reset when timezone related settings
  47. change, so you must do this manually:
  48. .. code-block:: bash
  49. $ python manage.py shell
  50. >>> from djcelery.models import PeriodicTask
  51. >>> PeriodicTask.objects.update(last_run_at=None)
  52. .. _`pytz`: http://pypi.python.org/pypi/pytz/
  53. .. _beat-entries:
  54. Entries
  55. =======
  56. To schedule a task periodically you have to add an entry to the
  57. :setting:`CELERYBEAT_SCHEDULE` setting.
  58. Example: Run the `tasks.add` task every 30 seconds.
  59. .. code-block:: python
  60. from datetime import timedelta
  61. CELERYBEAT_SCHEDULE = {
  62. 'add-every-30-seconds': {
  63. 'task': 'tasks.add',
  64. 'schedule': timedelta(seconds=30),
  65. 'args': (16, 16)
  66. },
  67. }
  68. CELERY_TIMEZONE = 'UTC'
  69. Using a :class:`~datetime.timedelta` for the schedule means the task will
  70. be sent in 30 second intervals (the first task will be sent 30 seconds
  71. after `celery beat` starts, and then every 30 seconds
  72. after the last run).
  73. A crontab like schedule also exists, see the section on `Crontab schedules`_.
  74. Like with ``cron``, the tasks may overlap if the first task does not complete
  75. before the next. If that is a concern you should use a locking
  76. strategy to ensure only one instance can run at a time (see for example
  77. :ref:`cookbook-task-serial`).
  78. .. _beat-entry-fields:
  79. Available Fields
  80. ----------------
  81. * `task`
  82. The name of the task to execute.
  83. * `schedule`
  84. The frequency of execution.
  85. This can be the number of seconds as an integer, a
  86. :class:`~datetime.timedelta`, or a :class:`~celery.schedules.crontab`.
  87. You can also define your own custom schedule types, by extending the
  88. interface of :class:`~celery.schedules.schedule`.
  89. * `args`
  90. Positional arguments (:class:`list` or :class:`tuple`).
  91. * `kwargs`
  92. Keyword arguments (:class:`dict`).
  93. * `options`
  94. Execution options (:class:`dict`).
  95. This can be any argument supported by
  96. :meth:`~celery.task.base.Task.apply_async`,
  97. e.g. `exchange`, `routing_key`, `expires`, and so on.
  98. * `relative`
  99. By default :class:`~datetime.timedelta` schedules are scheduled
  100. "by the clock". This means the frequency is rounded to the nearest
  101. second, minute, hour or day depending on the period of the timedelta.
  102. If `relative` is true the frequency is not rounded and will be
  103. relative to the time when :program:`celery beat` was started.
  104. .. _beat-crontab:
  105. Crontab schedules
  106. =================
  107. If you want more control over when the task is executed, for
  108. example, a particular time of day or day of the week, you can use
  109. the :class:`~celery.schedules.crontab` schedule type:
  110. .. code-block:: python
  111. from celery.schedules import crontab
  112. CELERYBEAT_SCHEDULE = {
  113. # Executes every Monday morning at 7:30 A.M
  114. 'add-every-monday-morning': {
  115. 'task': 'tasks.add',
  116. 'schedule': crontab(hour=7, minute=30, day_of_week=1),
  117. 'args': (16, 16),
  118. },
  119. }
  120. The syntax of these crontab expressions are very flexible. Some examples:
  121. +-----------------------------------------+--------------------------------------------+
  122. | **Example** | **Meaning** |
  123. +-----------------------------------------+--------------------------------------------+
  124. | ``crontab()`` | Execute every minute. |
  125. +-----------------------------------------+--------------------------------------------+
  126. | ``crontab(minute=0, hour=0)`` | Execute daily at midnight. |
  127. +-----------------------------------------+--------------------------------------------+
  128. | ``crontab(minute=0, hour='*/3')`` | Execute every three hours: |
  129. | | 3am, 6am, 9am, noon, 3pm, 6pm, 9pm. |
  130. +-----------------------------------------+--------------------------------------------+
  131. | ``crontab(minute=0,`` | Same as previous. |
  132. | ``hour='0,3,6,9,12,15,18,21')`` | |
  133. +-----------------------------------------+--------------------------------------------+
  134. | ``crontab(minute='*/15')`` | Execute every 15 minutes. |
  135. +-----------------------------------------+--------------------------------------------+
  136. | ``crontab(day_of_week='sunday')`` | Execute every minute (!) at Sundays. |
  137. +-----------------------------------------+--------------------------------------------+
  138. | ``crontab(minute='*',`` | Same as previous. |
  139. | ``hour='*',`` | |
  140. | ``day_of_week='sun')`` | |
  141. +-----------------------------------------+--------------------------------------------+
  142. | ``crontab(minute='*/10',`` | Execute every ten minutes, but only |
  143. | ``hour='3,17,22',`` | between 3-4 am, 5-6 pm and 10-11 pm on |
  144. | ``day_of_week='thu,fri')`` | Thursdays or Fridays. |
  145. +-----------------------------------------+--------------------------------------------+
  146. | ``crontab(minute=0, hour='*/2,*/3')`` | Execute every even hour, and every hour |
  147. | | divisible by three. This means: |
  148. | | at every hour *except*: 1am, |
  149. | | 5am, 7am, 11am, 1pm, 5pm, 7pm, |
  150. | | 11pm |
  151. +-----------------------------------------+--------------------------------------------+
  152. | ``crontab(minute=0, hour='*/5')`` | Execute hour divisible by 5. This means |
  153. | | that it is triggered at 3pm, not 5pm |
  154. | | (since 3pm equals the 24-hour clock |
  155. | | value of "15", which is divisible by 5). |
  156. +-----------------------------------------+--------------------------------------------+
  157. | ``crontab(minute=0, hour='*/3,8-17')`` | Execute every hour divisible by 3, and |
  158. | | every hour during office hours (8am-5pm). |
  159. +-----------------------------------------+--------------------------------------------+
  160. | ``crontab(day_of_month='2')`` | Execute on the second day of every month. |
  161. | | |
  162. +-----------------------------------------+--------------------------------------------+
  163. | ``crontab(day_of_month='2-30/3')`` | Execute on every even numbered day. |
  164. | | |
  165. +-----------------------------------------+--------------------------------------------+
  166. | ``crontab(day_of_month='1-7,15-21')`` | Execute on the first and third weeks of |
  167. | | the month. |
  168. +-----------------------------------------+--------------------------------------------+
  169. | ``crontab(day_of_month='11',`` | Execute on 11th of May every year. |
  170. | ``month_of_year='5')`` | |
  171. +-----------------------------------------+--------------------------------------------+
  172. | ``crontab(month_of_year='*/3')`` | Execute on the first month of every |
  173. | | quarter. |
  174. +-----------------------------------------+--------------------------------------------+
  175. See :class:`celery.schedules.crontab` for more documentation.
  176. .. _beat-starting:
  177. Starting the Scheduler
  178. ======================
  179. To start the :program:`celery beat` service:
  180. .. code-block:: bash
  181. $ celery beat
  182. You can also start embed `beat` inside the worker by enabling
  183. workers `-B` option, this is convenient if you only intend to
  184. use one worker node:
  185. .. code-block:: bash
  186. $ celery worker -B
  187. Beat needs to store the last run times of the tasks in a local database
  188. file (named `celerybeat-schedule` by default), so it needs access to
  189. write in the current directory, or alternatively you can specify a custom
  190. location for this file:
  191. .. code-block:: bash
  192. $ celery beat -s /home/celery/var/run/celerybeat-schedule
  193. .. note::
  194. To daemonize beat see :ref:`daemonizing`.
  195. .. _beat-custom-schedulers:
  196. Using custom scheduler classes
  197. ------------------------------
  198. Custom scheduler classes can be specified on the command line (the `-S`
  199. argument). The default scheduler is :class:`celery.beat.PersistentScheduler`,
  200. which is simply keeping track of the last run times in a local database file
  201. (a :mod:`shelve`).
  202. `django-celery` also ships with a scheduler that stores the schedule in the
  203. Django database:
  204. .. code-block:: bash
  205. $ celery beat -S djcelery.schedulers.DatabaseScheduler
  206. Using `django-celery`'s scheduler you can add, modify and remove periodic
  207. tasks from the Django Admin.