Missing a repository? Have a look at https://code.cacert.org/.

summaryrefslogtreecommitdiff
blob: f5963ad47a5e081001ed10f10ee4c3baceeb4f71 (plain)
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
====================
Non-Critical Systems
====================

Non-critical systems are those that are managed by the infrastructure
administrator team.

.. toctree::
   :maxdepth: 1

   systems/infra02
   systems/arbitration
   systems/blog
   systems/board
   systems/coaudit
   systems/bugs
   systems/cats
   systems/email
   systems/emailout
   systems/monitor
   systems/webmail


General
=======

.. todo:: consider whether a central MySQL service should be setup

   Many containers contain their own instance of MySQL. It might be a better
   idea to centralize the MySQL setups in a single container.

.. todo:: consider whether a central PostgreSQL service should be setup

.. todo::

   setup a central syslog service and install syslog clients in each container

.. _setup_apt_checking:

.. topic:: Setup package update monitoring for a new container

   For Icinga to be able to check the update status of packages on you server
   you need to install NRPE, a helper service. Install the necessary packages::

      sudo aptitude install nagios-plugins-basic nagios-nrpe-server

   Put :doc:`systems/monitor` on the list of allowed hosts to access the NRPE
   service by adding the following line to :file:`/etc/nagios/nrpe_local.cfg`::

      allowed_hosts=172.16.2.18

   Tell the NRPE service that there is such a thing as the check_apt command by
   creating the file :file:`/etc/nagios/nrpe.d/apt.cfg` with the following
   contents::

      # 'check_apt' command definition
      command[check_apt]=/usr/lib/nagios/plugins/check_apt

      # 'check_apt_distupgrade' command definition
      command[check_apt_distupgrade]=/usr/lib/nagios/plugins/check_apt -d

   Restart the NRPE service::

      sudo service nagios-nrpe-server restart

   Check that everything went well by going to https://monitor.cacert.org/,
   going to the APT service on the host and clicking :guilabel:`"Re-schedule
   the next check of this service"`. Make sure that :guilabel:`"Force Check"`
   is checked and click :guilabel:`"Commit"`. Now you should see a page with a
   green background. If not something went wrong, please contact the
   :doc:`systems/monitor` administrators with the details.

   That's it, now the package update status should be properly displayed in
   Icinga.

.. todo:: think about replacing nrpe with Icinga2 satellites

Checklist
=========

.. index::
   single: etckeeper
   single: nrpe

* All containers should be monitored by :doc:`systems/monitor` and should
  therefore have :program:`nagios-nrpe-server` installed
* All containers should use :program:`etckeeper` to put their local setup into
  version control. All local setup should use :file:`/etc` to make sure it is
  handled by :program:`etckeeper`
* All infrastructure systems must send their mail via :doc:`systems/emailout`
* All infrastructure systems should have an system-admin@cacert.org alias to
  reach their admins
* The installation of :index:`systemd-sysv` in containers can be blocked by
  putting the following lines in :file:`/etc/apt/preferences.d/systemd-sysv`::

    Package: systemd-sysv
    Pin: release a=stable
    Pin-Priority: -1

.. todo:: document how to setup the system-admin alias on the email system