@@ -6,17 +6,14 @@
Introduction
============

+The default configuration makes a lot of compromises. It's not optimal for
+any single case, but works well enough for most situations.

-The default configuration, like any good default, is full of compromises.
-It is not tweaked to be optimal for any single use case, but tries to
-find middle ground that works *well enough* for most situations.

+There are optimizations that can be applied based on specific use cases.

-There are key optimizations to be done if your application is mainly
-processing lots of short tasks, and also if you have fewer but very
-long tasks.
-
-Optimization here does not necessarily mean optimizing for runtime, but also
-optimizing resource usage and ensuring responsiveness at times of high load.
+Optimizations can apply to different properties of the running environment,
+be it the time tasks take to execute, the amount of memory used, or
+responsiveness at times of high load.

Ensuring Operations
===================
@@ -26,22 +23,23 @@ back-of-the-envelope calculations by asking the question;

❝ How much water flows out of the Mississippi River in a day? ❞

-The point of this exercise[*] is to demonstrate that there is a limit
-to how much data a system can process in a timely manner, and teaches
-back of the envelope calculations as a means to plan for this ahead of time.
+The point of this exercise[*] is to show that there is a limit
+to how much data a system can process in a timely manner.
+Back of the envelope calculations can be used as a means to plan for this
+ahead of time.

-This is very relevant to Celery; If a task takes 10 minutes to complete,
-and there are 10 new tasks coming in every minute, then this means
-the queue will *never be processed*. This is why it's very important
+For Celery this means that if a task takes 10 minutes to complete,
+and there are 10 new tasks coming in every minute, the queue will never
+be empty. This is why it's very important
that you monitor queue lengths!

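+To make the arithmetic above concrete, here is the same calculation in
+code (the numbers are illustrative):
+
+.. code-block:: python
+
+    arrival_rate = 10.0     # new tasks per minute
+    task_duration = 10.0    # minutes per task
+    concurrency = 8         # child worker processes (an assumption)
+
+    # Each process completes 1/task_duration tasks per minute.
+    completion_rate = concurrency / task_duration
+
+    # If tasks arrive faster than they complete, the backlog grows forever.
+    backlog_growth = arrival_rate - completion_rate    # tasks per minute
+    print("queue grows by %.1f tasks/minute" % backlog_growth)
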
-One way to do this is by :ref:`using Munin <monitoring-munin>`.
-You should set up alerts, so you are notified as soon as any queue has
-reached an unacceptable size, this way you can take appropriate action like
-adding new worker nodes, or revoking unnecessary tasks.
+One way to monitor queue lengths is by :ref:`using Munin <monitoring-munin>`.
+You should set up alerts that will notify you as soon as any queue has
+reached an unacceptable size. This way you can take appropriate action,
+like adding new worker nodes, or revoking unnecessary tasks.

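+If you want to check a queue's length ad hoc, something like the
+following sketch may work with a RabbitMQ broker (it uses the kombu
+library; the broker URL and the default "celery" queue name are
+assumptions to adapt):
+
+.. code-block:: python
+
+    from kombu import Connection
+
+    # Passively declare the queue: this reads its current state
+    # (including the message count) without changing anything.
+    with Connection("amqp://guest:guest@localhost//") as connection:
+        _, message_count, _ = connection.default_channel.queue_declare(
+            queue="celery", passive=True)
+        print("messages waiting: %d" % message_count)
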
.. [*] The chapter is available to read for free here:
-       `The back of the envelope`_. This book is a classic text, highly
+       `The back of the envelope`_. The book is a classic text. Highly
       recommended.

.. _`Programming Pearls`: http://www.cs.bell-labs.com/cm/cs/pearls/
@@ -62,36 +60,36 @@ Prefetch Limits

*Prefetch* is a term inherited from AMQP that is often misunderstood
by users.

-The prefetch limit is a **limit** for the number of messages (tasks) a worker
-can reserve in advance. If this is set to zero, the worker will keep
-consuming messages *ad infinitum*, not respecting that there may be other
+The prefetch limit is a **limit** for the number of tasks (messages) a worker
+can reserve for itself. If the limit is zero, the worker will keep
+consuming messages, not respecting that there may be other
available worker nodes that may be able to process them sooner[#],
or that the messages may not even fit in memory.

-The workers initial prefetch count is set by multiplying
-the :setting:`CELERYD_PREFETCH_MULTIPLIER` setting by the number
-of child worker processes[#]. The default is 4 messages per child process.
+The worker's default prefetch count is the
+:setting:`CELERYD_PREFETCH_MULTIPLIER` setting multiplied by the number
+of child worker processes[#]. The multiplier defaults to 4.

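+As a worked example (illustrative numbers):
+
+.. code-block:: python
+
+    CELERYD_PREFETCH_MULTIPLIER = 4   # the default
+    CELERYD_CONCURRENCY = 10          # child worker processes
+
+    # The worker will initially reserve 4 * 10 = 40 messages.
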
-If you have many expensive tasks with a long duration you would want
+If you have many tasks with a long duration, you want
the multiplier value to be 1, which means it will only reserve one
-unacknowledged task per worker process at a time.
+task per worker process at a time.

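+In configuration terms that is simply:
+
+.. code-block:: python
+
+    CELERYD_PREFETCH_MULTIPLIER = 1
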
-However -- If you have lots of short tasks, and throughput/roundtrip latency
-is important to you, then you want this number to be large. Say 64, or 128
-for example, as the worker is able to process a lot more *tasks/s* if the
-messages have already been prefetched in memory. You may have to experiment
-to find the best value that works for you.
+However -- if you have many short-running tasks, and throughput/roundtrip
+latency[#] is important to you, this number should be large. The worker is
+able to process more tasks per second if the messages have already been
+prefetched and are available in memory. You may have to experiment to find
+the best value that works for you. Values like 64 or 128 might make sense
+in these circumstances.

-If you have a combination of both very long and short tasks, then the best
-option is to use two worker nodes that is configured individually, and route
-the tasks accordingly (see :ref:`guide-routing`).
+If you have a combination of long- and short-running tasks, the best option
+is to use two worker nodes that are configured separately, and route
+the tasks according to their run-time (see :ref:`guide-routing`), for
+example as shown below.

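+A minimal sketch of such a setup (the task and queue names are made up
+for illustration):
+
+.. code-block:: python
+
+    CELERY_ROUTES = {
+        # send the long-running task to its own queue
+        "tasks.generate_report": {"queue": "long"},
+    }
+
+Then start one worker for each queue, tuned for its workload::
+
+    $ celeryd -Q celery -c 8
+    $ celeryd -Q long -c 2
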
-.. [*] RabbitMQ and other brokers will deliver the messages in round-robin,
-       so this doesn't apply to an active system. But if there is no prefetch
+.. [*] RabbitMQ and other brokers deliver messages round-robin,
+       so this doesn't apply to an active system. If there is no prefetch
       limit and you restart the cluster, there will be timing delays between
-       nodes starting, so if there are 3 offline nodes and one active node,
-       then all messages will be delivered to the active node while the others
-       are offline.
+       nodes starting. If there are 3 offline nodes and one active node,
+       all messages will be delivered to the active node.

.. [*] This is the concurrency setting; :setting:`CELERYD_CONCURRENCY` or the
       :option:`-c` option to :program:`celeryd`.

@@ -104,18 +102,18 @@ When using early acknowledgement (default), a prefetch multiplier of 1
means the worker will reserve at most one extra task for every active
worker process.

-Often when users ask if it's possible to disable "prefetching of tasks",
-what they really want is to have a worker only reserve as many tasks
-as there are child processes at a time.
+When users ask if it's possible to disable "prefetching of tasks", what
+they often really want is to have a worker only reserve as many tasks as
+there are child processes.

-Sadly, this requirement is not possible without enabling late
+But this is not possible without enabling late
acknowledgements; A task that has been started, will be
retried if the worker crashes mid execution so the task must be `reentrant`_
(see also notes at :ref:`faq-acks_late-vs-retry`).

.. _`reentrant`: http://en.wikipedia.org/wiki/Reentrant_(subroutine)

-You can enable this behavior by using the following configuration:
+You can enable this behavior by using the following configuration options:

.. code-block:: python

@@ -127,9 +125,10 @@ You can enable this behavior by using the following configuration:

Rate Limits
-----------

-The subsystem responsible for enforcing rate limits introduces extra
-complexity, so if you're not using rate limits it may be a good idea to
-disable them completely.
+The system responsible for enforcing rate limits introduces some overhead,
+so if you're not using rate limits it may be a good idea to
+disable them completely. Disabling them removes one thread, and the worker
+won't spend as many CPU cycles when the queue is inactive.

Set the :setting:`CELERY_DISABLE_RATE_LIMITS` setting to disable
the rate limit subsystem: