123456789101112131415161718192021222324252627282930313233343536373839404142434445464748495051525354555657585960616263646566676869707172737475767778798081828384858687888990919293949596979899100101102103104105106107108109110111112113114115116117118119120121122123124125126127128129130131132133134135136137138139140141142143144145146147148149150151152153154155156157158159160161162163164165166167168169170171172173174175176177178179180181182183184185186187188189190191192193194195196197198199200201202203204205206207208209210211212213214215216217218219220221222223224225226227228229230231232233234235236237238239240241242243244245246247 |
- .. _guide-security:
- ==========
- Security
- ==========
- .. contents::
- :local:
- Introduction
- ============
- While Celery is written with security in mind, it should be treated as an
- unsafe component.
- Depending on your `Security Policy`_, there are
- various steps you can take to make your Celery installation more secure.
- .. _`Security Policy`: https://en.wikipedia.org/wiki/Security_policy
- Areas of Concern
- ================
- Broker
- ------
- It is imperative that the broker is guarded from unwanted access, especially
- if accessible to the public.
- By default, workers trust that the data they get from the broker has not
- been tampered with. See `Message Signing`_ for information on how to make
- the broker connection more trustworthy.
- The first line of defense should be to put a firewall in front of the broker,
- allowing only white-listed machines to access it.
- Keep in mind that both firewall misconfiguration, and temporarily disabling
- the firewall, is common in the real world. Solid security policy includes
- monitoring of firewall equipment to detect if they have been disabled, be it
- accidentally or on purpose.
- In other words, one should not blindly trust the firewall either.
- If your broker supports fine-grained access control, like RabbitMQ,
- this is something you should look at enabling. See for example
- http://www.rabbitmq.com/access-control.html.
- If supported by your broker backend, you can enable end-to-end SSL encryption
- and authentication using :setting:`broker_use_ssl`.
- Client
- ------
- In Celery, "client" refers to anything that sends messages to the
- broker, e.g. web-servers that apply tasks.
- Having the broker properly secured doesn't matter if arbitrary messages
- can be sent through a client.
- *[Need more text here]*
- Worker
- ------
- The default permissions of tasks running inside a worker are the same ones as
- the privileges of the worker itself. This applies to resources such as
- memory, file-systems and devices.
- An exception to this rule is when using the multiprocessing based task pool,
- which is currently the default. In this case, the task will have access to
- any memory copied as a result of the :func:`fork` call,
- and access to memory contents written by parent tasks in the same worker
- child process.
- Limiting access to memory contents can be done by launching every task
- in a subprocess (:func:`fork` + :func:`execve`).
- Limiting file-system and device access can be accomplished by using
- `chroot`_, `jail`_, `sandboxing`_, virtual machines or other
- mechanisms as enabled by the platform or additional software.
- Note also that any task executed in the worker will have the
- same network access as the machine on which it's running. If the worker
- is located on an internal network it's recommended to add firewall rules for
- outbound traffic.
- .. _`chroot`: https://en.wikipedia.org/wiki/Chroot
- .. _`jail`: https://en.wikipedia.org/wiki/FreeBSD_jail
- .. _`sandboxing`:
- https://en.wikipedia.org/wiki/Sandbox_(computer_security)
- Serializers
- ===========
- The default `pickle` serializer is convenient because it supports
- arbitrary Python objects, whereas other serializers only
- work with a restricted set of types.
- But for the same reasons the `pickle` serializer is inherently insecure [*]_,
- and should be avoided whenever clients are untrusted or
- unauthenticated.
- You can disable untrusted content by specifying
- a white-list of accepted content-types in the :setting:`accept_content`
- setting:
- .. versionadded:: 3.0.18
- .. note::
- This setting was first supported in version 3.0.18. If you're
- running an earlier version it will simply be ignored, so make
- sure you're running a version that supports it.
- .. code-block:: python
- accept_content = ['json']
- This accepts a list of serializer names and content-types, so you could
- also specify the content type for json:
- .. code-block:: python
- accept_content = ['application/json']
- Celery also comes with a special `auth` serializer that validates
- communication between Celery clients and workers, making sure
- that messages originates from trusted sources.
- Using `Public-key cryptography` the `auth` serializer can verify the
- authenticity of senders, to enable this read :ref:`message-signing`
- for more information.
- .. _`pickle`: http://docs.python.org/library/pickle.html
- .. _`Public-key cryptography`:
- https://en.wikipedia.org/wiki/Public-key_cryptography
- .. _message-signing:
- Message Signing
- ===============
- Celery can use the `pyOpenSSL`_ library to sign message using
- `Public-key cryptography`, where
- messages sent by clients are signed using a private key
- and then later verified by the worker using a public certificate.
- Optimally certificates should be signed by an official
- `Certificate Authority`_, but they can also be self-signed.
- To enable this you should configure the :setting:`task_serializer`
- setting to use the `auth` serializer.
- Also required is configuring the
- paths used to locate private keys and certificates on the file-system:
- the :setting:`security_key`,
- :setting:`security_certificate` and :setting:`security_cert_store`
- settings respectively.
- With these configured it is also necessary to call the
- :func:`celery.setup_security` function. Note that this will also
- disable all insecure serializers so that the worker won't accept
- messages with untrusted content types.
- This is an example configuration using the `auth` serializer,
- with the private key and certificate files located in `/etc/ssl`.
- .. code-block:: python
- app = Celery()
- app.conf.update(
- security_key='/etc/ssl/private/worker.key'
- security_certificate='/etc/ssl/certs/worker.pem'
- security_cert_store='/etc/ssl/certs/*.pem',
- )
- app.setup_security()
- .. note::
- While relative paths are not disallowed, using absolute paths
- is recommended for these files.
- Also note that the `auth` serializer won't encrypt the contents of
- a message, so if needed this will have to be enabled separately.
- .. _`pyOpenSSL`: http://pypi.python.org/pypi/pyOpenSSL
- .. _`X.509`: https://en.wikipedia.org/wiki/X.509
- .. _`Certificate Authority`:
- https://en.wikipedia.org/wiki/Certificate_authority
- Intrusion Detection
- ===================
- The most important part when defending your systems against
- intruders is being able to detect if the system has been compromised.
- Logs
- ----
- Logs are usually the first place to look for evidence
- of security breaches, but they are useless if they can be tampered with.
- A good solution is to set up centralized logging with a dedicated logging
- server. Access to it should be restricted.
- In addition to having all of the logs in a single place, if configured
- correctly, it can make it harder for intruders to tamper with your logs.
- This should be fairly easy to setup using syslog (see also `syslog-ng`_ and
- `rsyslog`_.). Celery uses the :mod:`logging` library, and already has
- support for using syslog.
- A tip for the paranoid is to send logs using UDP and cut the
- transmit part of the logging server's network cable :-)
- .. _`syslog-ng`: https://en.wikipedia.org/wiki/Syslog-ng
- .. _`rsyslog`: http://www.rsyslog.com/
- Tripwire
- --------
- `Tripwire`_ is a (now commercial) data integrity tool, with several
- open source implementations, used to keep
- cryptographic hashes of files in the file-system, so that administrators
- can be alerted when they change. This way when the damage is done and your
- system has been compromised you can tell exactly what files intruders
- have changed (password files, logs, back-doors, root-kits and so on).
- Often this is the only way you will be able to detect an intrusion.
- Some open source implementations include:
- * `OSSEC`_
- * `Samhain`_
- * `Open Source Tripwire`_
- * `AIDE`_
- Also, the `ZFS`_ file-system comes with built-in integrity checks
- that can be used.
- .. _`Tripwire`: http://tripwire.com/
- .. _`OSSEC`: http://www.ossec.net/
- .. _`Samhain`: http://la-samhna.de/samhain/index.html
- .. _`AIDE`: http://aide.sourceforge.net/
- .. _`Open Source Tripwire`: http://sourceforge.net/projects/tripwire/
- .. _`ZFS`: https://en.wikipedia.org/wiki/ZFS
- .. rubric:: Footnotes
- .. [*] http://nadiana.com/python-pickle-insecure
|