Pacemaker

Contenu du pack

Modèles

Le connecteur de supervision Pacemaker apporte un modèle d'hôte :

App-Pacemaker-SSH-custom

Le connecteur apporte les modèles de service suivants (classés selon le modèle d'hôte auquel ils sont rattachés) :

App-Pacemaker-SSH-custom

Alias	Modèle de service	Description
CRM	App-Pacemaker-CRM-SSH-custom	Contrôle l'état du cluster via la commande crm_mon
Clustat	App-Pacemaker-Clustat-SSH-custom	[Déprécié] Contrôle l'état du cluster via la commande clustat
Constraints	App-Pacemaker-Constraints-SSH-custom	Contrôle si une contrainte est en place sur une ressource du cluster

Les services listés ci-dessus sont créés automatiquement lorsque le modèle d'hôte App-Pacemaker-SSH-custom est utilisé.

Métriques & statuts collectés

Voici le tableau des services pour ce connecteur, détaillant les métriques rattachées à chaque service.

CRM
Clustat
Constraints

Nom	Unité
cluster~connection-status	N/A
cluster~quorum-status	N/A
cluster~cluster.nodes.online.count	count
cluster~cluster.nodes.offline.count	count
cluster~cluster.nodes.standby.count	count
cluster~cluster.actions.failed.count	count
resources#resource-status	N/A
resources#resource.actions.failed.count	count
resources#resource.migration.failed.count	count
clone_resources#clone-resource-status	N/A
clone_resources#clone_resource.actions.failed.count	count
clone_resources#clone_resource.migration.failed.count	count

Ce modèle de service est déprécié et n'est plus fourni dans les dernières versions du connecteur car l'utilitaire clustat n'est plus livré avec Pacemaker depuis RHEL7.

Nom	Unité
nodes#node	N/A
groups#group	N/A

Nom	Unité
status	N/A

Prérequis

Configuration SSH

L'utilisation de ce connecteur requiert la création d'un utilisateur sur la ressource supervisée, lequel sera utilisé par le collecteur Centreon pour s'authentifier et exécuter les requêtes SSH. Les privilèges sudo ou root ne sont pas nécessaires, un utilisateur 'simple' est suffisant.

Deux méthodes de connexion SSH sont possibles :

soit en échangeant la clé SSH publique de l'utilisateur centreon-engine du collecteur Centreon
soit en définissant votre utilisateur et votre mot de passe directement dans les macros d'hôtes.

Privilèges de l'utilisateur local

L'utilisateur local de l'hôte supervisé doit être ajouté au groupe haclient s'il n'en fait pas déjà partie (c'est le cas si on utilise un autre utilisateur que hacluster). Par exemple, si l'utilisateur local est centreon-engine, lancez :

usermod -a -G haclient centreon-engine

Il vous faudra alors vous authentifier auprès du cluster :

pcs client local-auth

Entrez alors hacluster comme login et le mot de passe correspondant.

Installer le connecteur de supervision

Pack

La procédure d'installation des connecteurs de supervision diffère légèrement suivant que votre licence est offline ou online.

Si la plateforme est configurée avec une licence online, l'installation d'un paquet n'est pas requise pour voir apparaître le connecteur dans le menu Configuration > Connecteurs > Connecteurs de supervision. Au contraire, si la plateforme utilise une licence offline, installez le paquet sur le serveur central via la commande correspondant au gestionnaire de paquets associé à sa distribution :

Alma / RHEL / Oracle Linux 8
Alma / RHEL / Oracle Linux 9
Debian 11 & 12
CentOS 7

dnf install centreon-pack-applications-pacemaker-ssh

dnf install centreon-pack-applications-pacemaker-ssh

apt install centreon-pack-applications-pacemaker-ssh

yum install centreon-pack-applications-pacemaker-ssh

Quel que soit le type de la licence (online ou offline), installez le connecteur Pacemaker depuis l'interface web et le menu Configuration > Connecteurs > Connecteurs de supervision.

Plugin

À partir de Centreon 22.04, il est possible de demander le déploiement automatique du plugin lors de l'utilisation d'un connecteur. Si cette fonctionnalité est activée, et que vous ne souhaitez pas découvrir des éléments pour la première fois, alors cette étape n'est pas requise.

Plus d'informations dans la section Installer le plugin.

Utilisez les commandes ci-dessous en fonction du gestionnaire de paquets de votre système d'exploitation :

Alma / RHEL / Oracle Linux 8
Alma / RHEL / Oracle Linux 9
Debian 11 & 12
CentOS 7

dnf install centreon-plugin-Applications-Pacemaker-Ssh

dnf install centreon-plugin-Applications-Pacemaker-Ssh

apt install centreon-plugin-applications-pacemaker-ssh

yum install centreon-plugin-Applications-Pacemaker-Ssh

Utiliser le connecteur de supervision

Utiliser un modèle d'hôte issu du connecteur

Ajoutez un hôte à Centreon depuis la page Configuration > Hôtes.
Complétez les champs Nom, Alias & IP Address/DNS correspondant à votre ressource.
Appliquez le modèle d'hôte App-Pacemaker-SSH-custom. Une liste de macros apparaît. Les macros vous permettent de définir comment le connecteur se connectera à la ressource, ainsi que de personnaliser le comportement du connecteur.
Renseignez les macros désirées. Attention, certaines macros sont obligatoires.

Macro	Description	Valeur par défaut
SSHUSERNAME	Define the user name to log in to the host
SSHPASSWORD	Define the password associated with the user name. Cannot be used with the sshcli backend. Warning: using a password is not recommended. Use --ssh-priv-key instead
SSHPRIVKEY	Define the path to the private key file for user authentication.	`/var/lib/centreon-engine/.ssh/id_ed25519`
SSHPORT	Define the TCP port on which SSH is listening
SSHBACKEND	Define the backend you want to use. It can be: sshcli, plink and libssh	libssh
SSHEXTRAOPTIONS	Any extra option you may want to add to every command (a --verbose flag for example). Toutes les options sont listées ici.

Déployez la configuration. L'hôte apparaît dans la liste des hôtes supervisés, et dans la page Statut des ressources. La commande envoyée par le connecteur est indiquée dans le panneau de détails de l'hôte : celle-ci montre les valeurs des macros.

Utiliser un modèle de service issu du connecteur

Si vous avez utilisé un modèle d'hôte et coché la case Créer aussi les services liés aux modèles, les services associés au modèle ont été créés automatiquement, avec les modèles de services correspondants. Sinon, créez les services désirés manuellement et appliquez-leur un modèle de service.
Renseignez les macros désirées (par exemple, ajustez les seuils d'alerte). Les macros indiquées ci-dessous comme requises (Obligatoire) doivent être renseignées.

CRM
Clustat
Constraints

Macro	Description	Valeur par défaut
FILTERRESOURCENAME	Filter resource (also clone resource) by name (can be a regexp)
WARNINGCLONERESOURCEACTIONSFAILED	Thresholds
CRITICALCLONERESOURCEACTIONSFAILED	Thresholds
WARNINGCLONERESOURCEMIGRATIONFAILED	Thresholds
CRITICALCLONERESOURCEMIGRATIONFAILED	Thresholds
CRITICALCLONERESOURCESTATUS	Define the conditions to match for the status to be CRITICAL. You can use the following variables: %{name}, %{status}, %{masters_nodes_name}, %{slaves_nodes_name}, %{is_unmanaged}	%{status} =~ /failed/i
WARNINGCLONERESOURCESTATUS	Define the conditions to match for the status to be WARNING. You can use the following variables: %{name}, %{status}, %{masters_nodes_name}, %{slaves_nodes_name}, %{is_unmanaged}
CRITICALCLUSTERACTIONSFAILED	Thresholds	0
WARNINGCLUSTERACTIONSFAILED	Thresholds
CRITICALCONNECTIONSTATUS	Define the conditions to match for the status to be CRITICAL. You can use the following variables: %{connection_status}, %{connection_error}	%{connection_status} =~ /failed/i
WARNINGCONNECTIONSTATUS	Define the conditions to match for the status to be WARNING. You can use the following variables: %{connection_status}, %{connection_error}
CRITICALNODESOFFLINE	Thresholds	0
WARNINGNODESOFFLINE	Thresholds
WARNINGNODESONLINE	Thresholds
CRITICALNODESONLINE	Thresholds
WARNINGNODESSTANDBY	Thresholds
CRITICALNODESSTANDBY	Thresholds
CRITICALQUORUMSTATUS	Define the conditions to match for the status to be CRITICAL. You can use the following variables: %{quorum_status}	%{quorum_status} =~ /noQuorum/i
WARNINGQUORUMSTATUS	Define the conditions to match for the status to be WARNING. You can use the following variables: %{quorum_status}
WARNINGRESOURCEACTIONSFAILED	Thresholds
CRITICALRESOURCEACTIONSFAILED	Thresholds
CRITICALRESOURCEMIGRATIONFAILED	Thresholds	0
WARNINGRESOURCEMIGRATIONFAILED	Thresholds
CRITICALRESOURCESTATUS	Define the conditions to match for the status to be CRITICAL. You can use the following variables: %{name}, %{status}, %{node}, %{is_unmanaged}	%{status} =~ /stopped\|failed/i
WARNINGRESOURCESTATUS	Define the conditions to match for the status to be WARNING. You can use the following variables: %{name}, %{status}, %{node}, %{is_unmanaged}
EXTRAOPTIONS	Any extra option you may want to add to the command (a --verbose flag for example). Toutes les options sont listées ici.	--verbose

Ce modèle de service est déprécié et n'est plus fourni dans les dernières versions du connecteur car l'utilitaire clustat n'est plus livré avec pacemaker depuis RHEL7.

Macro	Description	Valeur par défaut	Obligatoire
EXTRAOPTIONS	Any extra option you may want to add to the command (a --verbose flag for example). Toutes les options sont listées ici.	--verbose

Macro	Description	Valeur par défaut	Obligatoire
RESOURCENAME	Set the resource name you want to check	RESOURCENAME	X
EXTRAOPTIONS	Any extra option you may want to add to the command (a --verbose flag for example). Toutes les options sont listées ici.

Déployez la configuration. Le service apparaît dans la liste des services supervisés, et dans la page Statut des ressources. La commande envoyée par le connecteur est indiquée dans le panneau de détails du service : celle-ci montre les valeurs des macros.

Comment puis-je tester le plugin et que signifient les options des commandes ?

Une fois le plugin installé, vous pouvez tester celui-ci directement en ligne de commande depuis votre collecteur Centreon en vous connectant avec l'utilisateur centreon-engine (su - centreon-engine). Vous pouvez tester que le connecteur arrive bien à superviser une ressource en utilisant une commande telle que celle-ci (remplacez les valeurs d'exemple par les vôtres) :

/usr/lib/centreon/plugins/centreon_pacemaker_ssh.pl \
	--plugin=apps::pacemaker::local::plugin \
	--mode=crm \
	--hostname='10.0.0.1' \
	--ssh-backend='libssh' \
	--ssh-username='' \
	--ssh-password='' \
	--ssh-port=''  \
	--filter-resource-name='' \
	--warning-nodes-online='' \
	--critical-nodes-online='' \
	--warning-nodes-offline='' \
	--critical-nodes-offline='0' \
	--warning-nodes-standby='' \
	--critical-nodes-standby='' \
	--warning-clone-resource-status='' \
	--critical-clone-resource-status='%\{status\} =~ /failed/i' \
	--warning-clone-resource-actions-failed='' \
	--critical-clone-resource-actions-failed='' \
	--warning-clone-resource-migration-failed='' \
	--critical-clone-resource-migration-failed='' \
	--warning-connection-status='' \
	--critical-connection-status='%\{connection_status\} =~ /failed/i' \
	--warning-cluster-actions-failed='' \
	--critical-cluster-actions-failed='0' \
	--warning-resource-status='' \
	--critical-resource-status='%\{status\} =~ /stopped|failed/i' \
	--warning-resource-actions-failed='' \
	--critical-resource-actions-failed='' \
	--warning-resource-migration-failed='' \
	--critical-resource-migration-failed='0' \
	--warning-quorum-status='' \
	--critical-quorum-status='%\{quorum_status\} =~ /noQuorum/i' \
	--verbose

La commande devrait retourner un message de sortie similaire à :

OK:    actions failed: 31 actions failed: 63 migration failed: 93 actions failed: 67 migration failed: 5 | '*cluster*~cluster.nodes.online.count'=94;;;0;'*cluster*~cluster.nodes.offline.count'=25;;;0;'*cluster*~cluster.nodes.standby.count'=69;;;0;'*cluster*~cluster.actions.failed.count'=31;;;0;'*resources*#resource.actions.failed.count'=63;;;0;'*resources*#resource.migration.failed.count'=93;;;0;'*clone_resources*#clone_resource.actions.failed.count'=67;;;0;'*clone_resources*#clone_resource.migration.failed.count'=5;;;0;

Diagnostic des erreurs communes

Rendez-vous sur la documentation dédiée pour le diagnostic des erreurs communes des plugins Centreon.

Modes disponibles

Dans la plupart des cas, un mode correspond à un modèle de service. Le mode est renseigné dans la commande d'exécution du connecteur. Dans l'interface de Centreon, il n'est pas nécessaire de les spécifier explicitement, leur utilisation est implicite dès lors que vous utilisez un modèle de service. En revanche, vous devrez spécifier le mode correspondant à ce modèle si vous voulez tester la commande d'exécution du connecteur dans votre terminal.

Tous les modes disponibles peuvent être affichés en ajoutant le paramètre --list-mode à la commande :

/usr/lib/centreon/plugins/centreon_pacemaker_ssh.pl \
	--plugin=apps::pacemaker::local::plugin \
	--list-mode

Le plugin apporte les modes suivants :

Mode	Modèle de service associé
clustat [code]	App-Pacemaker-Clustat-SSH-custom
constraints [code]	App-Pacemaker-Constraints-SSH-custom
crm [code]	App-Pacemaker-CRM-SSH-custom

Options disponibles

Options génériques

Les options génériques sont listées ci-dessous :

Option	Description
--mode	Define the mode in which you want the plugin to be executed (see--list-mode).
--dyn-mode	Specify a mode with the module's path (advanced).
--list-mode	List all available modes.
--mode-version	Check minimal version of mode. If not, unknown error.
--version	Return the version of the plugin.
--custommode	When a plugin offers several ways (CLI, library, etc.) to get information the desired one must be defined with this option.
--list-custommode	List all available custom modes.
--multiple	Multiple custom mode objects. This may be required by some specific modes (advanced).
--pass-manager	Define the password manager you want to use. Supported managers are: environment, file, keepass, hashicorpvault and teampass.
--verbose	Display extended status information (long output).
--debug	Display debug messages.
--filter-perfdata	Filter perfdata that match the regexp. Example: adding --filter-perfdata='avg' will remove all metrics that do not contain 'avg' from performance data.
--filter-perfdata-adv	Filter perfdata based on a "if" condition using the following variables: label, value, unit, warning, critical, min, max. Variables must be written either %{variable} or %(variable). Example: adding --filter-perfdata-adv='not (%(value) == 0 and %(max) eq "")' will remove all metrics whose value equals 0 and that don't have a maximum value.
--explode-perfdata-max	Create a new metric for each metric that comes with a maximum limit. The new metric will be named identically with a '_max' suffix). Example: it will split 'used_prct'=26.93%;0:80;0:90;0;100 into 'used_prct'=26.93%;0:80;0:90;0;100 'used_prct_max'=100%;;;;
--change-perfdata --extend-perfdata	Change or extend perfdata. Syntax: --extend-perfdata=searchlabel,newlabel,target[,[newuom],[min],[m ax]] Common examples: Convert storage free perfdata into used: --change-perfdata='free,used,invert()' Convert storage free perfdata into used: --change-perfdata='used,free,invert()' Scale traffic values automatically: --change-perfdata='traffic,,scale(auto)' Scale traffic values in Mbps: --change-perfdata='traffic_in,,scale(Mbps),mbps' Change traffic values in percent: --change-perfdata='traffic_in,,percent()'
--extend-perfdata-group	Add new aggregated metrics (min, max, average or sum) for groups of metrics defined by a regex match on the metrics' names. Syntax: --extend-perfdata-group=regex,namesofnewmetrics,calculation[,[ne wuom],[min],[max]] regex: regular expression namesofnewmetrics: how the new metrics' names are composed (can use $1, $2... for groups defined by () in regex). calculation: how the values of the new metrics should be calculated newuom (optional): unit of measure for the new metrics min (optional): lowest value the metrics can reach max (optional): highest value the metrics can reach Common examples: Sum wrong packets from all interfaces (with interface need --units-errors=absolute): --extend-perfdata-group=',packets_wrong,sum(packets_(discard \|error)_(in\|out))' Sum traffic by interface: --extend-perfdata-group='traffic_in_(.*),traffic_$1,sum(traf fic_(in\|out)_$1)'
--change-short-output --change-long-output	Modify the short/long output that is returned by the plugin. Syntax: --change-short-output=pattern~~replacement~~modifier Most commonly used modifiers are i (case insensitive) and g (replace all occurrences). Example: adding --change-short-output='OKUpgi' will replace all occurrences of 'OK', 'ok', 'Ok' or 'oK' with 'Up'
--change-exit	Replace an exit code with one of your choice. Example: adding --change-exit=unknown=critical will result in a CRITICAL state instead of an UNKNOWN state.
--range-perfdata	Rewrite the ranges displayed in the perfdata. Accepted values: 0: nothing is changed. 1: if the lower value of the range is equal to 0, it is removed. 2: remove the thresholds from the perfdata.
--filter-uom	Mask the units when they don't match the given regular expression.
--opt-exit	Replace the exit code in case of an execution error (i.e. wrong option provided, SSH connection refused, timeout, etc). Default: unknown.
--output-ignore-perfdata	Remove all the metrics from the service. The service will still have a status and an output.
--output-ignore-label	Remove the status label ("OK:", "WARNING:", "UNKNOWN:", CRITICAL:") from the beginning of the output. Example: 'OK: Ram Total:...' will become 'Ram Total:...'
--output-xml	Return the output in XML format (to send to an XML API).
--output-json	Return the output in JSON format (to send to a JSON API).
--output-openmetrics	Return the output in OpenMetrics format (to send to a tool expecting this format).
--output-file	Write output in file (can be combined with json, xml and openmetrics options). E.g.: --output-file=/tmp/output.txt will write the output in /tmp/output.txt.
--disco-format	Applies only to modes beginning with 'list-'. Returns the list of available macros to configure a service discovery rule (formatted in XML).
--disco-show	Applies only to modes beginning with 'list-'. Returns the list of discovered objects (formatted in XML) for service discovery.
--float-precision	Define the float precision for thresholds (default: 8).
--source-encoding	Define the character encoding of the response sent by the monitored resource Default: 'UTF-8'.
--hostname	Hostname to query in ssh.
--timeout	Timeout in seconds for the command (default: 45). Default value can be override by the mode.
--command	Command to get information. Used it you have output in a file.
--command-path	Command path.
--command-options	Command options.
--sudo	Use 'sudo' to execute the command.
--ssh-backend	Define the backend you want to use. It can be: sshcli (default), plink and libssh.
--ssh-username	Define the user name to log in to the host.
--ssh-password	Define the password associated with the user name. Cannot be used with the sshcli backend. Warning: using a password is not recommended. Use --ssh-priv-key instead.
--ssh-port	Define the TCP port on which SSH is listening.
--ssh-priv-key	Define the private key file to use for user authentication.
--sshcli-command	ssh command (default: 'ssh').
--sshcli-path	ssh command path (default: none)
--sshcli-option	Specify ssh cli options (example: --sshcli-option='-o=StrictHostKeyChecking=no'). The default option is --sshcli-options='-o=LogLevel=ERROR' which hides the SSH banner. If you override this parameter, make sure to append '-o=LogLevel=ERROR' to your new value to maintain this behavior. This parameter can be used multiple times and multiple options can be specified in the same parameter.
--plink-command	plink command (default: 'plink').
--plink-path	plink command path (default: none)
--plink-option	Specify plink options (example: --plink-option='-T').
--libssh-strict-connect	Connection won't be OK even if there is a problem (server known changed or server found other) with the ssh server.

Options des modes

Les options disponibles pour chaque modèle de services sont listées ci-dessous :

CRM
Clustat
Constraints

Option	Description
--filter-resource-name	Filter resource (also clone resource) by name (can be a regexp).
--warning-connection-status	Define the conditions to match for the status to be WARNING. You can use the following variables: %{connection_status}, %{connection_error}
--critical-connection-status	Define the conditions to match for the status to be CRITICAL (default: '%{connection_status} =~ /failed/i'). You can use the following variables: %{connection_status}, %{connection_error}
--warning-quorum-status	Define the conditions to match for the status to be WARNING. You can use the following variables: %{quorum_status}
--critical-quorum-status	Define the conditions to match for the status to be CRITICAL (default: '%{quorum_status} =~ /noQuorum/i'). You can use the following variables: %{quorum_status}
--warning-resource-status	Define the conditions to match for the status to be WARNING. You can use the following variables: %{name}, %{status}, %{node}, %{is_unmanaged}
--critical-resource-status	Define the conditions to match for the status to be CRITICAL (default: '%{status} =~ /stopped\|failed/i'). You can use the following variables: %{name}, %{status}, %{node}, %{is_unmanaged}
--warning-clone-resource-status	Define the conditions to match for the status to be WARNING. You can use the following variables: %{name}, %{status}, %{masters_nodes_name}, %{slaves_nodes_name}, %{is_unmanaged}
--critical-clone-resource-status	Define the conditions to match for the status to be CRITICAL (default: '%{status} =~ /failed/i'). You can use the following variables: %{name}, %{status}, %{masters_nodes_name}, %{slaves_nodes_name}, %{is_unmanaged}
--ignore-failed-actions	Failed actions errors (that match) are skipped.
--resources	If resources not started on the node specified, send a warning message: (format: <rsc_name>:<node>,<rsc_name>:<node>,...)
--warning-* --critical-*	Thresholds. Can be: 'cluster-actions-failed', 'clone-resource-actions-failed', 'clone-resource-migration-failed', 'nodes-online', 'nodes-offline', 'nodes-standby', 'resource-actions-failed', 'resource-migration-failed'.

Option	Description
--warning-*	Can be ('group','node') Define the conditions to match for the status to be WARNING.
--critical-*	Can be ('group','node') Define the conditions to match for the status to be CRITICAL. (default: --critical-node '%{state} !~ /up\|clean/' --critical-group '%{state} !~ /started\|starting/')

Option	Description
--resource	Set the resource name you want to check
--warning	Return a warning instead of a critical

Pour un mode, la liste de toutes les options disponibles et leur signification peut être affichée en ajoutant le paramètre --help à la commande :

/usr/lib/centreon/plugins/centreon_pacemaker_ssh.pl \
	--plugin=apps::pacemaker::local::plugin \
	--mode=crm \
	--help

Contenu du pack​

Modèles​

Métriques & statuts collectés​

Prérequis​

Configuration SSH​

Privilèges de l'utilisateur local​

Installer le connecteur de supervision​

Pack​

Plugin​

Utiliser le connecteur de supervision​

Utiliser un modèle d'hôte issu du connecteur​

Utiliser un modèle de service issu du connecteur​

Comment puis-je tester le plugin et que signifient les options des commandes ?​

Diagnostic des erreurs communes​

Modes disponibles​

Options disponibles​

Options génériques​

Options des modes​