Advanced Kadeploy: Difference between revisions

From Grid5000
Jump to navigation Jump to search
(27 intermediate revisions by 7 users not shown)
Line 3: Line 3:
{{TutorialHeader}}
{{TutorialHeader}}


{{Warning|text=Please mind also reading the [[Environment creation]] guide, which gives automated mechanisms to build kadeploy environnements}}
 
{{Warning|text=Please see the [[Environment creation]] guide, which gives automated mechanisms to build kadeploy environnements}}


= What you need to know before starting =
= What you need to know before starting =
The first thing to understand is that by using kadeploy3, you will be running a command that attempts to remotely reboot many nodes at the same time, and boot them using configuration files hosted on a server. On some clusters, there is a failure rate associated with this operation that is not null. You might therefore experience failures on some operations during this tutorial. In this case, retry. The system doesn't retry for you as this implies waiting for long timeouts in all cases, even those where a 90% success rate is sufficient.
The first thing to understand is that by using kadeploy3, you will be running a command that attempts to remotely reboot possibly many nodes at the same time, and boot them over the network using configuration files hosted on a server.


== What is an Environment? ==
== What is an Environment? ==
Line 16: Line 17:
needs at least 4 files in the most common cases
needs at least 4 files in the most common cases
# An '''image'''
# An '''image'''
#* An image is a file containing all the Operating System files. It can be a compressed archive (ie tgz file) or a dump of a device (ie dd file). In this tutorial, you will learn to build new images for Kadeploy3
#* An image is a file containing all the Operating System files. It is a compressed archive (compressed tarball).
# A '''kernel file'''
# A '''kernel file'''
#* Specifies the kernel file to boot. The full file path in the target system is expected.
#* Specifies the kernel file to boot. The full file path in the target system is expected.
# An '''initrd file''' (optional)
# An '''initrd file''' (optional)
#* For the Linux systems, the <code class="file">initrd</code> or <code class="file">initramfs</code> file it the initial ramdisk used by the kernel as a preliminary system before the root filesytem is mounted for instance. More information: [http://en.wikipedia.org/wiki/Initrd Initrd on Wikipedia]. The full file path in the target system is expected.
#* For the Linux systems, the <code class="file">initrd</code> or <code class="file">initramfs</code> file is the initial ramdisk used by the kernel as a preliminary system before the root filesytem is mounted. More information: [http://en.wikipedia.org/wiki/Initrd Initrd on Wikipedia]. The full file path in the target system is expected.
# A '''postinstall''' file (optional)
# A '''postinstall''' file (optional)
#*The postinstall file allows for adapting the deployed environment to specificities of each cluster, as well as setting up some platform-specific services.
#*The postinstall file allows for adapting the deployed environment to specificities of each site and cluster, as well as setting up some platform-specific services.


Once you have this set of files, you can describe your environment to <code class="command">kadeploy3</code>. This description represents an <code class="env">environment</code> in the <code class="command">kadeploy3</code> sense.
Once you have this set of files, you can describe your environment to <code class="command">kadeploy3</code>. This description represents an <code class="env">environment</code> in the <code class="command">kadeploy3</code> sense.
Note: the kadeploy software also supports deploying environments made of binary disk images (''dd'' format) but this is not supported on Grid'5000.


== How can I make my own environment? ==
== How can I make my own environment? ==
To create our own environment, there are two main ways:
To create our own environment, there are two main ways:
* One way is to deploy an existing environment, customize it and save it with <code class="command">tgz-g5k</code>
* One way is to deploy an existing environment, customize it and save it with <code class="command">tgz-g5k</code>
* The other one is to built the environment from a recipe, just like the Grid'5000 supported reference environments. See [[Environment creation]].
* The other one is to build the environment from a recipe, using <code class="command">kameleon</code>, just like the Grid'5000 supported reference environments. See [[Environment creation]].


== Disk partitioning ==
== Disk partitioning ==


Environments are deployed on the (first) local disk of each node (each Grid'5000 node is equipped with at least one hard drive). The following partitioning scheme is used:
Environments are deployed on the (first) local disk of each node (each Grid'5000 node is equipped with at least one hard drive, HDD or SSD). The following partitioning scheme is used:
 
; Legacy / DOS


{|
{|
|-
|-
!style="background: lightgrey;"| Device
!style="background: lightgrey;"| Label
!style="background: lightgrey;"| Role
!style="background: lightgrey;"| Role
|-
|-
| <code>sda1</code>
| <code>SWAP</code>
| Linux swap
| Linux swap
|-
|-
| <code>sda2</code>
| <code>PROD</code>
| Standard environment (default environment installed on nodes)
| Standard environment (default environment installed on nodes)
|-
|-
| <code>sda3</code>
| <code>DEPLOY</code>
| User's environment (when Kadeploy is used)
| User's environment (when Kadeploy is used)
|-
|-
| <code>sda4</code>
| <code>EFI</code>
| MS-DOS extended partition (container for the logical partitions, cannot be used)
|-
| <code>sda5</code>
| Remaining disk space made available in <code class="file">/tmp</code>
|}
 
; GPT
 
This new partition scheme is applied particularly to clusters that boot using UEFI ("gemini" and "pyxis" in Lyon) but also in other cases such as for Power systems ("drac" in Grenoble)
 
{|
|-
!style="background: lightgrey;"| Device
!style="background: lightgrey;"| Role
|-
| <code>sda1</code>
| Linux swap
|-
| <code>sda2</code>
| Standard environment (default environment installed on nodes)
|-
| <code>sda3</code>
| User's environment (when Kadeploy is used)
|-
| <code>sda4</code>
| For UEFI systems: UEFI system partition (store boot loaders)
| For UEFI systems: UEFI system partition (store boot loaders)
|-
|-
| <code>sda5</code>
| <code>TMP</code>
| Remaining disk space made available in <code class="file">/tmp</code>
| Remaining disk space made available in <code class="file">/tmp</code>
|}
|}
Line 85: Line 61:
; Notes
; Notes


* Some clusters have more than one local disk. On some clusters, those disks can be reserved apart from the node itself. See [[Disk_reservation]] to find out how to use them.
* Some clusters have more than one local disk. On some clusters, those disks can be reserved apart from the node itself. See [[Disk reservation]] to find out how to use them.
* Kadeploy allows deployments on other partitions than <code>sda3</code> or even creating a custom partitioning scheme. See below for details.
* Kadeploy allows deployments on other partitions than <code>DEPLOY</code> or even creating a custom partitioning scheme. See below for details.


= Search and deploy an existing environment =
= Search and deploy an existing environment =
Line 92: Line 68:
== Search an environment ==
== Search an environment ==


Grid'5000 maintains several reference environments directly available in all sites. These environments are based on Debian, Ubuntu and Centos.  
Grid'5000 maintains several reference environments directly available for deployment in all sites. These environments are based on the Debian, Ubuntu, or Centos Linux distribution.  


For Debian, different variants of reference environments are offered. For Ubuntu and Centos, only environment with a minimal system are offered.
For Debian, different variants of reference environments are offered. For Ubuntu and Centos, only environments with a minimal system are offered.


They are called ''reference'' environments because they can be used to generate customized environments.
They are called ''reference'' environments because they can be used to generate customized environments and because they are provided and supported by the Grid'5000 technical team.
The full list of reference environments is:
{{:Generated/Environments}}


; The description of the reference environments can be found here: {{Link|[[Getting_Started#Deploying_nodes_with_Kadeploy]]}}
Kadeploy provides a registry of environments in each site, where reference environments are registered along with environments of users.


An environment registry is maintained in each site (see <code class="command">kaenv3</code>), with the associated filesystem images stored in the <code class="dir">/grid5000</code> directory of the <code class="host">frontend</code>.
For reference environments, associated filesystem images are stored in the <code class="dir">/grid5000</code> directory of the <code class="host">frontend</code>.


To deploy a registered environment, you must know its name as registered in the Kadeploy database. It is the first information on the environment description page. This tutorial uses the <code class="env">debian10-x64-base</code> environment.
To deploy a registered environment, you must know its name as registered in the Kadeploy registry. This tutorial uses the <code class="env">debian11-base</code> environment.


You can also list all available environment in a site by using the <code class="command">kaenv3</code> command:
You can also list all available environment in a site by using the <code class="command">kaenv3</code> command:
Line 110: Line 88:
We distinguish three levels of visibility for an environment:
We distinguish three levels of visibility for an environment:


* ''public'': All users can see those environments. Only administrators can tag them this way.
* ''public'': Only administrators can register public environments. They are shown and used by default by the <code class=command>kaenv3</code> and <code class=command>kadeploy3</code> commands unless a user is specified (with <code class=command>-u</code> <code class=replace>user</code>).


* ''shared'': Every users can see the environment provided they use the -u option to specify the user the environment belongs to.
* ''shared'': Shared user environments. They are shown and used by the <code class=command>kaenv3</code> and <code class=command>kadeploy3</code> commands when a user is specified with <code class=command>-u</code> <code class=replace>user</code>.


* ''private'': The environment is only visible by the user the environment belongs to.
* ''private'': The environment is only shown and usable by the user the environment belongs to.


For example, a shared environment added by user <code class="replace">user</code> is listed this way:
For example, a shared environment added by user <code class="replace">user</code> is listed this way:
{{Term|location=frontend|cmd=<code class="command">kaenv3</code> -l -u <code class="replace">user</code>}}
{{Term|location=frontend|cmd=<code class="command">kaenv3</code> -l -u <code class="replace">user</code>}}


You can also look for a specific version with the <code>--env-version</code> option. All the versions of the environments can be found in <code>/grid5000/images</code>. The version number is the last part of the tgz file.
You can also look for a specific version with the <code class=command>--env-version</code> </code class=replace>version</code> option. Most of the versions of the reference environments images files are available in <code>/grid5000/images</code>. The version number is the last part of the image file.


For example: <code>debian10-x64-min-2019100414.tgz</code> => it's the min debian10-x64 reference environment version <code>2019100414</code>.
For instance: <code class=file>debian11-min-2021092316.tar.zst</code> is the image file of the <code class=replace>debian11-min</code> reference environment version <code class=replace>2021092316</code>.


So if you want to deploy this specific image with kadeploy you will have to procede like this:
Whenever you want to deploy this specific image on some nodes, use:


{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -f $OAR_NODEFILE -k -e debian10-x64-min --env-version <code classe="replace">2019100414</code>}}
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> <code class=replace>debian11-min</code> --env-version <code class=replace>2021092316</code>}}


Being able to reproduce the experiments that are done is a desirable feature. Therefore, you should always try to control as much as possible the environment the experiment is done in. Therefore, we will attempt to check that the environment that was chosen in the environment directory is the one available on a given cluster. On the cluster you would like to deploy, type the following command to print information about an environment:
Being able to reproduce a past experiment is a desirable feature. Therefore, you should always try to control as much as possible the environment the experiment is done in.  
{{Term|location=frontend|cmd=<code class="command">kaenv3</code> <code>-p</code> <code class="env">debian10-x64-base</code> <code>-u deploy</code>}}
Therefore, we will attempt to check that the environment that was chosen in the environment directory is the one available on a given cluster. On the cluster you would like to deploy, type the following command to print information about an environment:
{{Term|location=frontend|cmd=<code class="command">kaenv3</code> <code class="env">debian11-base</code> <code>-u deploy</code>}}


You must specify the user option. In our case, all public environments belong to user <code>deploy</code>.
Adding <code>-u deploy</code> makes sure we get the description of the environment supported by the Grid'5000 staff, as ''deploy'' is the user owning them (one might have registred his own environment with the same, that would show up if no user is specified).


In theory, you should also check the post-install script. A post-install script adapts an environment to the site it is deployed on.  
In theory, you should also check the post-install script. A post-install script adapts an environment to the site it is deployed on.  
Line 137: Line 116:


== Make a job on a deployable node ==
== Make a job on a deployable node ==
By default, Grid'5000 nodes are running on the ''production'' environment, which already contains most of the important features and can be used to run experiments. But you will not have administrative privileges (root privileges) on these nodes. So you will not be able to customize these environments at will. In fact, only reference environments can be customized at will. But to have the right to deploy a reference environment on a node, you must supply the option <code>-t deploy</code> when submitting your job.
By default, Grid'5000 nodes are running on the ''production'' environment, which already contains most of the important features and can be used to run experiments. This environment however imposes choices and has some limitations due to its general-purpose target. While you can gain the root privileges on it thanks to the <code class=command>sudo-g5k</code> command, you cannot reboot for instance, as this ends your reservation. Deploying on the contrary allows to have a dedicated environment and to have full control over the machine (reboot, serial console, ...).


For this part of the tutorial, job made will be interactive (<code>-I</code>), of the deploy type (<code>-t deploy</code>), on only one machine (<code>-l nodes=1</code>) to do environment customization (we will give ourselves 3 hours with <code>-l walltime=3</code>), which gives us the following command, that will open a new shell session on the frontend node:
For this part of the tutorial, jobs made will be interactive (<code>-I</code>), of the deploy type (<code>-t deploy</code>), on only one machine (<code>-l nodes=1</code>) to do environment customization (we will give ourselves 3 hours with <code>-l walltime=3</code>), which gives us the following command, that will open a new shell session on the frontend node:
{{Term|location=frontend|cmd=<code class="command">oarsub</code> -I -t deploy -l nodes=1,walltime=3}}
{{Term|location=frontend|cmd=<code class="command">oarsub</code> -I -t deploy -l nodes=1,walltime=3}}


Since all Grid'5000 nodes do not necessary have console access, it is recommended in the context of this tutorial to add the option <code>rconsole="YES"</code> to your reservation command.
Indeed, when you submit a job of the deploy type, a new shell is opened on the frontend node and not on the first machine of the job as for standard jobs. When you exit from this shell, the job ends. The shell is populated with <code class="env">OAR_*</code> environment variables. You should look at the list of available variables to get an idea of the information you can use to script deployment later. As usual, if the job is successful, you will get the name of the machine allocated to your job with:
{{Term|location=frontend|cmd=<code class="command">oarsub</code> -I -t deploy -l {"rconsole='YES'"}/nodes=1,walltime=3}}
 
Indeed, when you submit a job of the deploy type, a new shell is opened on the frontend node and not on the first machine of the job as for standard jobs. When you exit from this shell, the job ends. The shell is populated with <code class="env">OAR_*</code> environment variables. You should look at the list of available variables to get an idea of the information you can use to script deployment later. As usual, if the job is successfull, you will get the name of the machine allocated to your job with:
{{Term|location=frontend|cmd=<code class="command">cat</code> <code class="env">$OAR_FILE_NODES</code>}}
{{Term|location=frontend|cmd=<code class="command">cat</code> <code class="env">$OAR_FILE_NODES</code>}}


{{Warning|text=At the end of a reservation with the options <code>-t deploy</code>, the reserved nodes will be restarted to boot on the standard environment and thus be available to any other user.
{{Warning|text=At the end of a reservation with the <code>-t deploy</code> option, the reserved nodes will be reboot on the standard environment and thus make them available for another job possibly of another user. Rebooting may take up to 15 minutes, so please mind your commands as they have an implicit cost.}}
So you should only use this option <code>-t deploy</code> when you actually intend to deploy a reference environment on the reserved nodes.}}


== Deploy a reference environment ==
== Deploy a reference environment ==
To deploy your environment, you must discover the nodes you were allocated by OAR. The simplest way of doing this is to look at the content of the file whose name is stored in <code class="env">$OAR_FILE_NODES</code> (this variable is labelled <code class="env">$OAR_NODE_FILE</code> too) or the messages displayed when the job was made. This variable <code class="env">$OAR_NODE_FILE</code> simply stores the url of the file containing the FQDN of all your reserved nodes. Deployment happens when you run the following command:
By default, Kadeploy use all the nodes of the reservation. It is done by using the file whose name is given by the <code class="env">$OAR_FILE_NODES</code> environment variable (or <code class="env">$OAR_NODE_FILE</code>, or <code class="env">$OAR_NODEFILE</code>)
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -e debian10-x64-base -m <code class="replace">node.site</code>.grid5000.fr}}
So, to start the deployment on all the nodes, run the following command:
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> debian11-base}}


You can automate this to deploy on all nodes of your job with the -f option:
You can also just provide the nodes to deploy on, with the -m option:
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -e debian10-x64-base -f <code class="env">$OAR_FILE_NODES</code>}}
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> debian11-base -m <code class="replace">node.site</code>.grid5000.fr}}




In order to be able to connect to the node (as <code>root</code>), you must use the <code>-k</code> option and proceed by two ways:
By default, Kadeploy will copy your <code class=file>~/.ssh/authorized_keys</code> and replace the <code class=file>/root/.ssh/authorized_keys</code> file on the deployed nodes.
Alternatively, you can use the <code>-k</code> option in two ways:
* You can either specify the public key that will be copied in <code class=file>/root/.ssh/authorized_keys</code> on the deployed nodes:
* You can either specify the public key that will be copied in <code class=file>/root/.ssh/authorized_keys</code> on the deployed nodes:
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -e debian10-x64-base -f <code class="env">$OAR_FILE_NODES</code> -k ~/.ssh/my_special_key.pub}}
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> debian11-base -k ~/.ssh/my_special_key.pub}}
* Or you can supply the <code>-k</code> option without argument. This will automatically copy your <code class=file>~/.ssh/authorized_keys</code> and replace the <code class=file>/root/.ssh/authorized_keys</code> file on the deployed nodes.
* Or to not copy any key on the deployed node with <code>-k none</code>. You will need to provide a password to connect. However, SSH is often configured to disallow root login using password. The root password for all reference environments (i.e. provided by the Grid'5000 technical team) is <code>grid5000</code>.  
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -e <code>debian10-x64-base</code> -f <code class="env">$OAR_FILE_NODES</code> -k}}
The second case is actually the simplest way. One of its advantages is that after deployments, you will be able to connect directly from your local computer to the deployed nodes, the same way you connect to the frontend of the site were those nodes are.<br>
Once kadeploy has run successfully, the allocated node is deployed under <code>debian10-x64-base</code> environment. It will then be possible to tune this environment according to your needs.


{{Note|text=It is not necessary here, but you can specify destination partition with the -p option}}
 
In our case, the node file contains only 1 node.
 
Once the kadeploy command was executed successfully, the deployed node runs the <code>debian11-base</code> environment as their operating system. It will then be possible to tune this environment according to your needs.


== Connect to the deployed environment and customize it ==
== Connect to the deployed environment and customize it ==
;1. Connection
;1. Connection


On reference environments managed by the staff, you can use the <code>root</code> account for login through <code>ssh</code> (kadeploy checks that sshd is running before declaring a deployment successful). To connect to the node type:
On reference environments managed by the Grid'5000 technical team, you can use the <code>root</code> account for log in with <code>ssh</code> (kadeploy checks that sshd is running before declaring a deployment successful). To connect to the node type:
{{Term|location=frontend|cmd=<code class="command">ssh</code> root@<code class="replace">node.site</code>.grid5000.fr}}
{{Term|location=frontend|cmd=<code class="command">ssh</code> root@<code class="replace">node.site</code>.grid5000.fr}}


Line 179: Line 156:
;2. Adding software to an environment
;2. Adding software to an environment


''Where you learn to install software using the package repository of your distribution on Grid'5000''
You can alter your environment (to add missing libraries that you need, or remove packages that you don't need ; to reduce the size of the image and speed up the deployment process ; etc.) using commands such as:
 
You can therefore update your environment (to add missing libraries that you need, or remove packages that you don't need, to reduce the size of the image and speed up the deployment process, etc.) using:
{{Term|location=node|cmd=<code class="command">apt-get</code> update <br> <code class="command">apt-get</code> upgrade <br> <code class="command">apt-get</code> install <code class="replace">list of desired packages and libraries</code> <br> <code class="command">apt-get</code> --purge remove <code class="replace">list of unwanted packages</code> <br> <code class="command">apt-get</code> clean}}
{{Term|location=node|cmd=<code class="command">apt-get</code> update <br> <code class="command">apt-get</code> upgrade <br> <code class="command">apt-get</code> install <code class="replace">list of desired packages and libraries</code> <br> <code class="command">apt-get</code> --purge remove <code class="replace">list of unwanted packages</code> <br> <code class="command">apt-get</code> clean}}


== Create a new environment from a customized environment ==
== Create a new environment from a node's customized operating system ==
We now need to save this customized environment, where you have a user account, to be able to use this account again each time you deploy it.<br>
We now need to save this customized environment, where you have a user account, to be able to use this account again each time you deploy it.<br>
The first step to create an environment is to create an archive of the node you just customized. Because of the various implementations of the <code class="dir">/dev</code> filesystem tree, this can be a more or less complex operation.  
The first step to create an environment is to create an archive of the node's operating system you just customized. You can use <code class=command>tgz-g5k</code> to extract a Grid'5000 environment tarball from a running node. It's usage is describe in the '''[[Environment creation]]''' tutorial page.


;1. Use the provided tools
= Advanced deployment options =
== Multisite deployment ==
In order to achieve a deployment on nodes from different sites, you can use the multiserver option of kadeploy, using the <code class="command">-M</code> option.
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> '''-M''' -f <code class="file">file_with_all_nodes</code> <code class="replace">debian11-big</code>}}


You can use <code class=command>tgz-g5k</code> to extract a Grid'5000 environment tarball from a running node. From the frontend, run:
== Deploy on other partition of disk(s) ==
Kadeploy default handling of partitions may be too limited for some usages. One may need to use disks differently. Kadeploy offers several options to deploy on another existing partition of the primary disk, or if required to repartition disks entirely and/or use several disks (on nodes with many disks).


{{Term|location=frontend|cmd=<code class=command>tgz-g5k -m </code><code class=replace>node</code><code class=command> -z -f </code><code class=replace>~/path_to_myimage.tar.zst</code>}}
=== Deploy on partition labeled PROD or TMP ===
As seen earlier, kadeploy manages to label portions to identify them more easily, e.g. PROD, DEPLOY, or TMP. Kadeploy can be instructed to deploy on the PROD or TMP partition instead of the DEPLOY partition.


This will create a file <code class=replace>path_to_image.tar.zst</code> into your home directory on <code class=host>frontend</code>.
Because this kind of deployment will break some node standard operations after the end of your job, you must add to your <code class=command>oarsub</code> command for the job creation the <code class="command">-t destructive</code> option. This will cause the node to be completely reinstalled after your job.
{{Term|location=frontend|cmd=<code class="command">oarsub</code> -t deploy -t destructive -l nodes=1,walltime=1 -p hercule -I}}
Then you can deploy on PROD or TMP with the <code class="command">-p PROD</code> or <code class="command">-p TMP</code> option:
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> debian11-nfs -p <code class="replace">PROD</code>}}


{{Note|text=Please consider the following:
=== Deploy on secondary disks ===
*If you want to extract a tarball from the Grid'5000 standard environment (i.e., a non-deployed job), you will need to add the option <code class=command>-o</code> to use oarsh/oarcp instead of ssh/scp
Kadeploy identifies disks by their id (disk0, disk1...). You can find them in the Reference API or in the Hardware pages:
*If you want <code class="command">tgz-g5k</code> to access the node with a custom SSH user, you can use the option <code class=command>-u myUsername</code> (default is root)
{{Site link|Hardware}}
*The <code class="command">-z</code> option compresses the tarball with zstd, which is much faster than gzip. This requires the <code class="command">zstd</code> program to be installed on the node. To produce a more classical gzip-compressed archive, simply omit the <code class="command">-z</code> option.
*You can find more information on <code class=command>tgz-g5k</code> (e.g., available options, command line examples) by executing <code class=command>tgz-g5k -h</code>. Some implementation details are also available on the man page (<code class=command>man tgz-g5k</code>).}}
 
;2. Describe the newly created environment for deployments
 
Kadeploy3 works using an environment description. The easiest way to create a description for your new environment is to change the description of the environment it is based on. We have based this tutorial on the <code>debian10-x64-base</code> environment of user <code>deploy</code>. We therefore print its description to a file that will be used as a good basis:
{{Term|location=frontend|cmd=<code class=command>kaenv3</code> -p debian10-x64-base -u deploy > <code class=replace>mydebian10-x64-base.env</code> }}
 
It should be edited to change the <code class='replace'>name</code>, <code class='replace'>description</code>, <code class='replace'>author</code> lines, as well as the <code class='replace'>image/file</code> line. Check that the <code class='replace'>image/compression</code> is consistent with your tarball, it should be <code>zstd</code> or <code>gzip</code>. Since the tarball is local, the path should be a simple absolute path (without a leading <code>server://</code>). The visibility line should be removed, or changed to <code>shared</code> or <code>private</code>. Once this is done, the newly created environment can be deployed using:
{{Term|location=frontend|cmd=<code class=command>kadeploy3</code> -f <code class=env>$OAR_NODEFILE</code> -a <code class=replace>mydebian10-x64-base.env</code> }}
 
{{Note|text=A local path for the tarball (no leading <code>server://</code>) will not work with the API. If you want to use the API, you may want to put your tarball in the <code>public</code> directory of your home and specify the path with HTTP (eg: <code>http://public.</code><code class=replace>site</code><code>.grid5000.fr/~</code><code class=replace>username</code>/<code class=replace>mydebian10-x64-base.env</code>)}}
 
This kind of deployment is called ''anonymous deployment'' because the description is not recorded into the Kadeploy3 database. It is particularly useful when you perform the tuning of your environment if you have to update the environment tarball several times.
 
Once your customized environment is successfully tuned, you can save it to Kadeploy3 database so that you can directly deploy it with <code class=command>kadeploy3</code>, by specifying its name:
 
{{Term|location=frontend|cmd=<code class=command>kaenv3</code> -a <code class=replace>mydebian10-x64-base.env</code> }}
and then (if your environment is named "mydebian10-base"):
{{Term|location=frontend|cmd=<code class=command>kadeploy3</code> -f <code class=env>$OAR_NODEFILE</code> -e <code class=replace>mydebian10-base</code>}}
 
With the <code class=command>kaenv3</code> command, you can easily manage your environments. Please refer to its documentation for an overview of its features.
 
= Deployment options =
== Multisite deployment ==
In order to achieve a deployment on nodes from differents sites, you can use the multiserver option of kadeploy, using the <code class="command">-M</code>
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> '''-M''' -f <code class="file">file_with_all_nodes</code> -e <code class="replace">debian10-x64-std</code>}}
 
== Deploy on other partition of disk(s) ==
Kadeploy default handling of partitions may be too limited for some usage. One may need to use disks differently (e.g. to deploy our environment in an optimal way). Kadeploy offers several options to simply deploy on another existing partition of the primary disk (e.g. sda2 or sda5), or if required to repartition disks entirely and/or use several disks (on nodes with many disks).


=== Deploy on sda2 or sda5 ===
Because this kind of deployment will break some node standard operations after the end of your job, you must add to your <code class=command>oarsub</code> command for the job creation the <code class="command">-t destructive</code> option. This will cause the node to be completely reinstalled after your job.
First, as this kind of deployment will break node standard operation, you must tell to OAR that it should be redeployed entirely after the reservation with the <code class="command">-t destructive</code> option:
{{Term|location=frontend|cmd=<code class="command">oarsub</code> -t deploy -t destructive -l nodes=1,walltime=1 -p hercule -I}}
{{Term|location=frontend|cmd=<code class="command">oarsub</code> -t deploy -t destructive -l nodes=1,walltime=1 -p "cluster='hercule'" -I}}
Then you can deploy on sda2 or sda5 with the <code class="command">-p 2</code> or <code class="command">-p 5</code> option:
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -e debian10-x64-nfs -f $OAR_NODEFILE -p <code class="replace">2</code> -k}}


=== Deploy on secondary disks ===
Then you can deploy on an secondary disk such as <code class="file">disk1</code> with the <code class="command">-b disk1</code> option:
First, as this kind of deployment will break node standard operation, you must tell to OAR that it should be redeployed entirely after the reservation with the <code class="command">-t destructive</code> option:
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> debian11-min -b <code class="replace">disk1</code>}}
{{Term|location=frontend|cmd=<code class="command">oarsub</code> -t deploy -t destructive -l nodes=1,walltime=1 -p "cluster='hercule'" -I}}
{{Warning|text=It will not work for reservable disks}}
Then you can deploy on an secondary disk such as <code class="file">sdb</code> with the <code class="command">-b /dev/sdb</code> option:
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -e debian10-x64-min -f $OAR_NODEFILE -b <code class="replace">/dev/sdb</code> -r ext4 -k}}
{{Note|text= Mind that in the case of nodes with reservable disks, a special care must be taken to find out the device file name.}}
{{Warning|text=Disk with size ≥ 4TB are currently not supported ({{Bug|11020}})}}
Disks can also be handled differently by modifying the deployment automata, see below.
Disks can also be handled differently by modifying the deployment automata, see below.


== About the kernel and bootloader ==
== About the kernel and bootloader ==
As a reminder, a deployed environment may be booted either with '''kexec''' (e.g. at the end of the deployment) or by '''grub''' (e.g. after a <code class=command>kareboot</code>, or a call of the <code class=command>reboot</code> command from the node itself, or if kexec is deactivated for the cluster because not supported)
As a reminder, a deployed environment may be booted either with '''kexec''' (e.g. at the end of the deployment) or by '''grub''' (e.g. after the deployment with a <code class=command>kareboot</code>, or a call of the <code class=command>reboot</code> command from the node itself, or if kexec is deactivated for the cluster because not supported).


By default, the bootloader installation step of kadeploy will install and setup grub using the grub commands provided by your environment. The grub configuration file will be generated with <code class=command>grub-mkconfig</code>, which will boot by default the most recent kernel version available in the <code class=file>/boot</code> of your environment. In case this version does not match the kernel version provided in the environment description, an error will be reported (''Kernel file mismatch between grub and the environment description''). This is a sanity check.
By default, the bootloader installation step of kadeploy will install and configure grub using the grub commands provided by your environment. The grub configuration file will be generated with <code class=command>grub-mkconfig</code>, which will boot by default the most recent kernel version available in the <code class=file>/boot</code> directory of your environment. In case that version does not match the kernel version provided in the environment description, an error will be reported (''Kernel file mismatch between grub and the environment description''). This is a sanity check.


If the grub commands are not available in the deployed environment, as a fallback the grub commands provided in the deploy kernel will be used for the installation, and a very simple grub configuration will be generated (unlike when <code class=command>grub-mkconfig</code> is used) that just use the ''kernel'', ''initrd'' and ''kernel params'' from your environment description.
If the grub commands are not available in the deployed environment, as a fallback the grub commands provided in the deploy kernel will be used for the installation, and a very simple grub configuration will be generated (unlike when <code class=command>grub-mkconfig</code> is used) that just use the ''kernel'', ''initrd'' and ''kernel params'' from your environment description.
Line 254: Line 200:
Note that having grub installed and configured with the commands that are provided in the deployed environment is the preferred way, because it will be isofunctional whenever the deployed system later has upgrades that impact the kernel or grub (or if the bootloader configuration command <code class=command>grub-mkconfig</code> or <code class=command>update-grub</code> is called).  
Note that having grub installed and configured with the commands that are provided in the deployed environment is the preferred way, because it will be isofunctional whenever the deployed system later has upgrades that impact the kernel or grub (or if the bootloader configuration command <code class=command>grub-mkconfig</code> or <code class=command>update-grub</code> is called).  


Whenever relevant, by letting the postinstall create a ''empty file'' in the '/var/lib/kadeploy/bootloader/' directory of the deploy kernel (hence before kadeploy runs the bootloader installation step), it allows to trigger a change of the behavior of the bootloader installation step, as such:
Whenever relevant, by using the ''--custom-variable'' option of kadeploy, you can trigger a change of the behavior of the bootloader installation step, as such:
* if file exists: '/var/lib/kadeploy/bootloader/skip', do nothing (grub not installed nor configured)  
* <code class=replace>BOOTLOADER_SKIP=1</code>, do nothing (grub not installed nor configured)  
* if file exists: '/var/lib/kadeploy/bootloader/no-config', do not configure grub  
* <code class=replace>BOOTLOADER_NO_CONFIG=1</code>, do not configure grub  
* if file exists: '/var/lib/kadeploy/bootloader/no-install', do not install grub  
* <code class=replace>BOOTLOADER_NO_INSTALL=1</code>, do not install grub  
* if file exists: '/var/lib/kadeploy/bootloader/no-grub-from-deployed-env', do not configure nor install using grub from your deployed env but from the deploy kernel  
* <code class=replace>BOOTLOADER_NO_GRUB_FROM_DEST=1</code>, do not configure nor install using grub from your deployed env but from the deploy kernel  
* if file exists: '/var/lib/kadeploy/bootloader/no-grub-mkconfig-from-deployed-env', do not configure using grub from your deployed env but from the deploy kernel  
* <code class=replace>BOOTLOADER_NO_GRUB_MKCONFIG_FROM_DEST</code>, do not configure using grub from your deployed env but from the deploy kernel  
* if file exists: '/var/lib/kadeploy/bootloader/no-grub-install-from-deployed-env', do not install using grub from your deployed env but from the deploy kernel  
* <code class=replace>BOOTLOADER_NO_GRUB_INSTALL_FROM_DEST=1</code> do not install using grub from your deployed env but from the deploy kernel
See below the <code class=command>g5k-postinstall --bootloader</code> option, which handles that, so that you just have to change the postinstall script in the environment description to change the behavior, and let it handle the creation of the before mentioned ''empty files''.
* <code class=replace>BOOTLOADER_NO_UUID=1</code> do not use filesystem UUID to identify the root partition
* <code class=replace>BOOTLOADER_SHOW_MENU=1</code> enable the grub menu on the console
For example, use <code class="command">kadeploy3  my_env --custom-variable </code><code class=replace>BOOTLOADER_NO_INSTALL=1</code> to not install grub.


= Customizing the postinstalls =
= Customizing the postinstalls =
Line 268: Line 216:
== Using g5k-postinstall ==
== Using g5k-postinstall ==
The source code for g5k-postinstall is [https://gitlab.inria.fr/grid5000/g5k-postinstall/tree/master/g5k-postinstall available on gitlab.inria.fr]. Its parameters at the time of writing are:
The source code for g5k-postinstall is [https://gitlab.inria.fr/grid5000/g5k-postinstall/tree/master/g5k-postinstall available on gitlab.inria.fr]. Its parameters at the time of writing are:
{{Term|location=frontend|cmd=<code class=command>g5k-postinstall</code> --help}}
<syntaxhighlight lang="text">
<syntaxhighlight lang="text">
Usage: g5k-postinstall [options]
Usage: g5k-postinstall [options]
Line 279: Line 226:
     -r, --restrict-user MODE        User restriction mode
     -r, --restrict-user MODE        User restriction mode
         --inittab PATTERN            Configure console in inittab
         --inittab PATTERN            Configure console in inittab
         --bootloader b1,b2,b3        Set the kadeploy bootloader step setup options
         --bootloader b1,b2,b3        Set the kadeploy bootloader step setup options (deprecated)
         --no-ref-api                Do not use the Reference API
         --no-ref-api                Do not use the Reference API
         --disable-hacks h1,h2,h3    Hacks to disable
         --disable-hacks h1,h2,h3    Hacks to disable
         --no-guix                    Disable guix
         --no-guix                    Disable guix
        --disk-aliases              Enable disk aliases (e.g. /dev/disk0p1)


Valid tags for network specification:
Valid tags for network specification:
Line 302: Line 250:
   nfs            include generic NFS mounts (/home with autofs, /grid5000)
   nfs            include generic NFS mounts (/home with autofs, /grid5000)
   no-autofs      do not use autofs, just mount the user's NFS directory
   no-autofs      do not use autofs, just mount the user's NFS directory
  site-specific  include site-specific NFS mounts (e.g. /softs)
   no-uuid        do not use filesystem UUID to identify the partitions, use the
   no-uuid        do not use filesystem UUID to identify the partitions, use the
                 block device filenames
                 block device filenames
   custom        include custom mounts for custom partitionning. Need fstab
   custom        include custom mounts for custom partitionning. Need fstab
                 file included in postinstall archive.
                 file included in postinstall archive.
Example: --fstab nfs,site-specific
Example: --fstab nfs


Valid modes for user restriction:
Valid modes for user restriction:
Line 321: Line 268:
   Where SPEED and TTYSX are replaced by g5k-postinstall using information
   Where SPEED and TTYSX are replaced by g5k-postinstall using information
   retieved from the parameters which are passed in /proc/cmdline.
   retieved from the parameters which are passed in /proc/cmdline.
Valid tags for the kadeploy bootloader step:
  skip        request the bootloader step to do nothing
  no-config  request the bootloader step to skip the bootloader configuration
  no-install  request the bootloader step to skip the bootloader installation
  no-grub-from-deployed-env
              request the bootloader step to not use the grub commands from the
              deployed env to configure and install the bootloader (but write a
              basic grub.cfg and the grub-install from the deploy kernel), same
              as using both next 2 options
  no-grub-mkconfig-from-deployed-env
              request the bootloader step to not use grub-mkconfig from the
              deployed env to configure the bootloader (but write a basic
              grub.cfg)
  no-grub-install-from-deployed-env
              request the bootloader step to not use grub-install from the
              deployed env to install the bootloader (but use grub-install from
              the deploy kernel)
  no-uuid    do not use filesystem UUID to identify the root partition, use the
              block device filename
  show-menu
              enable the grub menu on the console


No reference API option:
No reference API option:
Line 351: Line 276:
   g5k-postinstall includes hacks that can optionally be disabled.
   g5k-postinstall includes hacks that can optionally be disabled.
   Current hacks are:
   Current hacks are:
   - oot-i40e     install the i40e driver on chifflot and chiclet (lille) and
   - oot-i40e       install the i40e driver on chifflot and chiclet (lille) and
                  grappe (nancy)
                    grappe (nancy) on debian9 environment.
   - beegfs-gr520 configure beegfs shares on grcinq and grvingt (nancy)
  - force-net-name  on several clusters, the predictable network interface name
Example: --disable-hacks oot-i40e
                    can change depending on udev version. This hack forces the
                    predictable network interface name in such cases.
   - beegfs-gr520   configure beegfs shares on grcinq and grvingt (nancy).
Example: --disable-hacks oot-i40e,force-net-name
 
Disk aliases option:
  g5k-postinstall can optionally install udev rules to provide predictable disk aliases.
  Example of disk aliases automatically created by these udev rules:
    /dev/disk0  -> /dev/sdb
    /dev/disk0p2 -> /dev/sdb2
    /dev/disk1  -> /dev/nvme0n1
    /dev/disk1p4 -> /dev/nvme0n1p4
  This is useful because, starting from Linux 5.3, traditional block device names
  are non-deterministic: /dev/sda might not refer to the same disk on every boot.
 
</syntaxhighlight>
</syntaxhighlight>


An example environment description using g5k-postinstall is:
An example environment description using g5k-postinstall is:
<syntaxhighlight lang="yaml" highlight='13-16'>
<syntaxhighlight lang="yaml" highlight='14-17'>
---
---
name: debian10-x64-min
name: debian11-min
version: 2019100414
version: 2021092316
description: debian 10 (buster) - min
arch: x86_64
author: support-staff@list.grid5000.fr
description: debian 11 (bullseye) for x64 - min
visibility: public
author: pierre.neyron@imag.fr
visibility: private
destructive: false
destructive: false
os: linux
os: linux
image:
image:
   file: server:///grid5000/images/debian10-x64-min-2019100414.tgz
   file: server:///grid5000/images/debian11-x64-min-2021092316.tar.zst
   kind: tar
   kind: tar
   compression: gzip
   compression: zstd
postinstalls:
postinstalls:
- archive: server:///grid5000/postinstalls/g5k-postinstall.tgz
- archive: server:///grid5000/postinstalls/g5k-postinstall.tgz
   compression: gzip
   compression: gzip
   script: g5k-postinstall --net debian
   script: g5k-postinstall --net debian --disk-aliases
boot:
boot:
   kernel: "/vmlinuz"
   kernel: "/vmlinuz"
   initrd: "/initrd.img"
   initrd: "/initrd.img"
   kernel_params: foo=bar
   kernel_params: ''
filesystem: ext4
filesystem: ext4
partition_type: 131
partition_type: 131
multipart: false
multipart: false
</syntaxhighlight>
</syntaxhighlight>


Line 388: Line 329:
** Debian min environment with traditional NIC naming: <tt><code class=command>g5k-postinstall</code> --net debian --net traditional-names</tt>
** Debian min environment with traditional NIC naming: <tt><code class=command>g5k-postinstall</code> --net debian --net traditional-names</tt>
** Debian min environment with predictable NIC naming: <tt><code class=command>g5k-postinstall</code> --net debian</tt>
** Debian min environment with predictable NIC naming: <tt><code class=command>g5k-postinstall</code> --net debian</tt>
** Debian min environment with predictable disk aliases: <tt><code class=command>g5k-postinstall</code> --net debian --disk-aliases</tt>
** Debian NFS environment (mount /home, setup LDAP, restrict login to user who reserved the node): <tt><code class=command>g5k-postinstall</code> --net debian --fstab nfs --restrict-user current</tt>
** Debian NFS environment (mount /home, setup LDAP, restrict login to user who reserved the node): <tt><code class=command>g5k-postinstall</code> --net debian --fstab nfs --restrict-user current</tt>
** Debian big environment (NFS + setup HPC networks and mount site-specific directories): <tt><code class=command>g5k-postinstall</code> --net debian --net traditional-names --net hpc --fstab nfs --fstab site-specific</tt>
** Debian big environment (NFS + setup HPC networks and mount site-specific directories): <tt><code class=command>g5k-postinstall</code> --net debian --net traditional-names --net hpc --fstab nfs --fstab site-specific</tt>
** Use block device filenames (e.g. /dev/sda3) instead of UUID to identify the root partition in the kernel command line: <tt><code class=command>g5k-postinstall</code> --bootloader no-uuid</tt>
** Use GPT label (e.g. /dev/disk/by-partlabel/KDPL_DEPLOY_disk0) instead of UUID to identify the root partition in the kernel command line: <tt><code class=command>g5k-postinstall</code> --bootloader no-uuid</tt>
** Use block device filenames (e.g. /dev/sda3) instead of UUID to identify the partitions in the fstab: <tt><code class=command>g5k-postinstall</code> --fstab no-uuid</tt>
** Use GPT label (e.g. /dev/disk/by-partlabel/KDPL_DEPLOY_disk0) instead of UUID to identify the partitions in the fstab: <tt><code class=command>g5k-postinstall</code> --fstab no-uuid</tt>
** RHEL/Centos style for network configuration: <tt><code class=command>g5k-postinstall</code> --net redhat --net traditional-names</tt>
** RHEL/Centos style for network configuration: <tt><code class=command>g5k-postinstall</code> --net redhat --net traditional-names</tt>
** Ubuntu 1710 or later: NetPlan for network configuration: <tt><code class=command>g5k-postinstall</code> --net netplan</tt>
** Ubuntu 1710 or later: NetPlan for network configuration: <tt><code class=command>g5k-postinstall</code> --net netplan</tt>
** Do not do any network configuration (useful for Gentoo), but force serial console settings: <tt><code class=command>g5k-postinstall</code> --inittab='s0:12345:respawn:/sbin/agetty -L SPEED TTYSX vt100'</tt>
** Do not do any network configuration (useful for Gentoo), but force serial console settings: <tt><code class=command>g5k-postinstall</code> --inittab='s0:12345:respawn:/sbin/agetty -L SPEED TTYSX vt100'</tt>
* Use a customized version of g5k-postinstall: after building a modified g5k-postinstall, just point the postinstalls/archive/ field to the new tar archive. See [https://gitlab.inria.fr/grid5000/g5k-postinstall/blob/master/g5k-postinstall/README.md README.md] and [[TechTeam:Postinstalls]] for details on g5k-postinstall internals.
* Use a customized version of g5k-postinstall: after building a modified g5k-postinstall, just point the postinstalls/archive/ field to the new tar archive. See [https://gitlab.inria.fr/grid5000/g5k-postinstall/blob/master/g5k-postinstall/README.md README.md] and [[TechTeam:Postinstalls]] for details on g5k-postinstall internals.
* Add additional postinstall to execute after g5k-postinstall. That way, the additionnal postinstall can be written in any language (e.g. just a shell script) and complement what is already done by g5k-postinstall, see below.
* Add an additional postinstall to execute after g5k-postinstall. That way, the additionnal postinstall can be written in any language (e.g. just a shell script) and complete what is already done by g5k-postinstall. See below.


== Adding an extra postinstall ==
== Adding an extra postinstall ==
Line 417: Line 359:
   kind: tar
   kind: tar
multipart: false
multipart: false
name: debiantesting-x64-custom
name: debiantesting-custom
arch: x86_64
os: linux
os: linux
partition_type: 131
partition_type: 131
Line 450: Line 393:
Each one of these implementations is divided in micro steps. You can can see the name of those micro-steps if you use the kadeploy3 option <code>--verbose-level 4</code>. And to see what is actually executed during those micro-steps you can add the debug option of kadeploy3 <code>-d</code>
Each one of these implementations is divided in micro steps. You can can see the name of those micro-steps if you use the kadeploy3 option <code>--verbose-level 4</code>. And to see what is actually executed during those micro-steps you can add the debug option of kadeploy3 <code>-d</code>


{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -f <code class=file>$OAR_FILE_NODES</code> -k -e debian10-x64-base --verbose-level 4 -d  &#62; <code class=file>~/kadeploy3_steps</code>}}
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> debian11-base --verbose-level 4 -d  &#62; <code class=file>~/kadeploy3_steps</code>}}


This command will store the kadeploy3 standard output in the file <code class=file>~/kadeploy3_steps</code>. Lets analyse its content:  
This command will store the kadeploy3 standard output in the file <code class=file>~/kadeploy3_steps</code>. Lets analyse its content:  
Line 456: Line 399:
{{Term|location=frontend|cmd=<code class="command">grep</code> "Time in" <code class=file>~/kadeploy3_steps</code>}}
{{Term|location=frontend|cmd=<code class="command">grep</code> "Time in" <code class=file>~/kadeploy3_steps</code>}}


This command will print on the terminal all the micro-steps executed during the deployment process, and the time spent for each execution. Here are the micro-steps that you should see:
This command will print on the terminal all the micro-steps executed during the deployment process, and the time spent for each execution. Here are the micro-steps that you should see in a typical deployment:
# <code class=file>SetDeploymentEnvUntrusted</code>-<code class="replace">switch_pxe</code>: Configures the PXE server so that this node will boot on an environment that contains all the required tools to perform the deployment,
# <code class=file>SetDeploymentMiniOSTrusted</code>-<code class="replace">switch_pxe</code>: Configures the PXE server so that this node will boot on an environment that contains all the required tools to perform the deployment,
# <code class=file>SetDeploymentEnvUntrusted</code>-<code class="replace">reboot</code>: Sends a reboot signal to the node
# <code class=file>SetDeploymentMiniOSTrusted</code>-<code class="replace">kexec</code>: Reboot the environnement using kexec
# <code class=file>SetDeploymentEnvUntrusted</code>-<code class="replace">wait_reboot</code>: Waits for the node to restart.
# <code class=file>SetDeploymentMiniOSTrusted</code>-<code class="replace">wait_reboot</code>: Waits for the node to restart.
# <code class=file>SetDeploymentEnvUntrusted</code>-<code class="replace">send_key_in_deploy_env</code>: Sends kadeploy's user's ssh public key into the node's authorized_keys to ease the following ssh connections,
# <code class=file>SetDeploymentMiniOSTrusted</code>-<code class="replace">send_key_in_deploy_env</code>: Sends kadeploy's user's ssh public key into the node's authorized_keys to ease the following ssh connections,
# <code class=file>SetDeploymentEnvUntrusted</code>-<code class="replace">create_partition_table</code>: Creates the partition table
# <code class=file>SetDeploymentMiniOSTrusted</code>-<code class="replace">create_partition_table</code>: Creates the partition table
# <code class=file>SetDeploymentEnvUntrusted</code>-<code class="replace">format_deploy_part</code>: Format the partition where your environment will be installed. This partition is by default /dev/sda3
# <code class=file>SetDeploymentMiniOSTrusted</code>-<code class="replace">format_deploy_part</code>: Format the partition where your environment will be installed. This partition is by default /dev/sda3
# <code class=file>SetDeploymentEnvUntrusted</code>-<code class="replace">mount_deploy_part</code>: Mounts the deployment partition in a local directory.
# <code class=file>SetDeploymentMiniOSTrusted</code>-<code class="replace">mount_deploy_part</code>: Mounts the deployment partition in a local directory.
# <code class=file>SetDeploymentEnvUntrusted</code>-<code class="replace">format_tmp_part</code>: Format the partition defined as tmp (by default, /dev/sda5)
# <code class=file>SetDeploymentMiniOSTrusted</code>-<code class="replace">format_tmp_part</code>: Format the partition defined as tmp (by default, /dev/sda5)
# <code class=file>SetDeploymentEnvUntrusted</code>-<code class="replace">format_swap_part</code>: Format the swap partition
# <code class=file>SetDeploymentMiniOSTrusted</code>-<code class="replace">format_swap_part</code>: Format the swap partition
# <code class=file>BroadcastEnvKascade</code>-<code class="replace">send_environment</code>: Sends your environments into the node and untar it into the deployment partition.
# <code class=file>BroadcastEnvKascade</code>-<code class="replace">send_environment</code>: Sends your environments into the node and untar it into the deployment partition.
# <code class=file>BroadcastEnvKascade</code>-<code class="replace">manage_admin_post_install</code>: Execute post installation instructions defined by the site admins, in general to adapt to the specificities of the cluster: console baud rate, Infiniband,...
# <code class=file>BroadcastEnvKascade</code>-<code class="replace">manage_admin_post_install</code>: Execute post installation instructions defined by the site admins, in general to adapt to the specificities of the cluster: console baud rate, Infiniband,...
# <code class=file>BroadcastEnvKascade</code>-<code class="replace">manage_user_post_install</code>: Execute user defined post installation instructions to automatically configure its node depending on its cluster, site, network capabilities, disk capabilities,...
# <code class=file>BroadcastEnvKascade</code>-<code class="replace">manage_user_post_install</code>: Execute user defined post installation instructions to automatically configure its node depending on its cluster, site, network capabilities, disk capabilities,...
# <code class=file>BroadcastEnvKascade</code>-<code class="replace">send_key</code>: Sends the user public ssh key(s) to the node (if the user specified it with the option <code>-k</code>).
# <code class=file>BroadcastEnvKascade</code>-<code class="replace">send_key</code>: Sends the user public ssh key(s) to the node
# <code class=file>BroadcastEnvKascade</code>-<code class="replace">install_bootloader</code>: Properly configures the bootloader
# <code class=file>BroadcastEnvKascade</code>-<code class="replace">install_bootloader</code>: Properly configures the bootloader
# <code class=file>BootNewEnvKexec</code>-<code class="replace">switch_pxe</code>: Configure the PXE server so that this node will boot on the partition where your environment has been installed
# <code class=file>BootNewEnvKexec</code>-<code class="replace">switch_pxe</code>: Configure the PXE server so that this node will boot on the partition where your environment has been installed
Line 517: Line 460:


You can increase the timeout of the second step to 1200 seconds with the following command:  
You can increase the timeout of the second step to 1200 seconds with the following command:  
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -e my_big_env -f <code class="env">$OAR_FILE_NODES</code> -k --force-steps "SetDeploymentEnv&#124;SetDeploymentEnvUntrusted:1:450&BroadcastEnv&#124;BroadcastEnvKascade:1:1200&BootNewEnv&#124;BootNewEnvClassical:1:400"}}
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> my_big_env --force-steps "SetDeploymentEnv&#124;SetDeploymentEnvUntrusted:1:450&BroadcastEnv&#124;BroadcastEnvKascade:1:1200&BootNewEnv&#124;BootNewEnvClassical:1:400"}}


== Set Break-Point during deployment ==
== Set Break-Point during deployment ==
Line 524: Line 467:
; Examples:
; Examples:
* Breakpoint before the postinstall is run:
* Breakpoint before the postinstall is run:
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -f <code class="env">$OAR_FILE_NODES</code> -k -e debian10-x64-base --verbose-level 4 -d --breakpoint <code class=file>BroadcastEnvKascade</code>:<code class="replace">manage_user_post_install</code>}}
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> debian11-base --verbose-level 4 -d --breakpoint <code class=file>BroadcastEnvKascade</code>:<code class="replace">manage_user_post_install</code>}}
This command can be used for debugging purpose. It performs a deployment with the maximum verbose level and it asks to stop the deployment workflow just '''''before''''' executing the ''manage_user_post_install''  micro-step of the ''BroadcastEnvKascade''  macro-step. Thus you will be able to connect in the deployment environment and to manually run the user post install script to debug it.
This command can be used for debugging purpose. It performs a deployment with the maximum verbose level and it asks to stop the deployment workflow just '''''before''''' executing the ''manage_user_post_install''  micro-step of the ''BroadcastEnvKascade''  macro-step. Thus you will be able to connect in the deployment environment and to manually run the user post install script to debug it.
* Breakpoint before rebooting to the deployed environment:
* Breakpoint before rebooting to the deployed environment:
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -f <code class="env">$OAR_FILE_NODES</code> -k -e debian10-x64-base --verbose-level 4  -d --breakpoint <code class=file>BootNewEnv</code>}}
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> debian11-base --verbose-level 4  -d --breakpoint <code class=file>BootNewEnv</code>}}
Stops before rebooting to the deployed env. Mind that rebooting the machine will boot again the kadeploy kernel, not the deployed system.
Stops before rebooting to the deployed env. Mind that rebooting the machine will boot again the kadeploy kernel, not the deployed system.
{{Warning|text=At the current state of <code class="command">kadeploy3</code>, it is not possible to resume the deployment from the break-point step. Thus you will have to redeploy you environment from the first step. This feature may be implemented in future version of <code class="command">kadeploy3</code>.}}
{{Warning|text=At the current state of <code class="command">kadeploy3</code>, it is not possible to resume the deployment from the break-point step. Thus you will have to redeploy you environment from the first step. This feature may be implemented in future version of <code class="command">kadeploy3</code>.}}
Line 553: Line 496:


First of all, when you do your reservation, you must tell to OAR that it should redeploy the node entirely after the reservation with the <code class="command">-t destructive</code> parameter:
First of all, when you do your reservation, you must tell to OAR that it should redeploy the node entirely after the reservation with the <code class="command">-t destructive</code> parameter:
{{Term|location=frontend|cmd=<code class="command">oarsub</code> -t deploy -t destructive -l nodes=1,walltime=2 -p "cluster='hercule'" -I}}
{{Term|location=frontend|cmd=<code class="command">oarsub</code> -t deploy -t destructive -l nodes=1,walltime=2 -p hercule -I}}


;2. Describe the custom operations
;2. Describe the custom operations
Line 620: Line 563:


Now you can deploy you environment with this custom operation:
Now you can deploy you environment with this custom operation:
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -e debian10-x64-min -f $OAR_NODE_FILE -k --custom-steps ./custom-partitioning.yml}}
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> debian11-min --custom-steps ./custom-partitioning.yml}}
{{Warning|text=In some cases you should increase the step timeout (for some long formatting for example) see [[Advanced_Kadeploy#Adjusting timeout for some environments]] for details.}}
{{Warning|text=In some cases you should increase the step timeout (for some long formatting for example) see [[Advanced_Kadeploy#Adjusting timeout for some environments]] for details.}}


Line 674: Line 617:
         command: /bin/true
         command: /bin/true
</syntaxhighlight>
</syntaxhighlight>


* The file '''map.parted''', which will be passed to '''parted''', will look like this:
* The file '''map.parted''', which will be passed to '''parted''', will look like this:
<syntaxhighlight lang="bash">
<syntaxhighlight lang="bash">
mklabel gpt
mklabel gpt
mkpart partition-system ext4 0% 100%
mkpart KDPL_SYSTEM_disk0 ext4 0% 100%
toggle 1 boot
toggle 1 boot
align-check optimal 1
align-check optimal 1
</syntaxhighlight>
</syntaxhighlight>
{{Note|text=Kadeploy identify the partitions using GPT label. The form of the label should be <code>KDPL_</code><code class="replace">name</code><code>_</code><code class="replace">disk</code>. So the label <code>KDPL_SYSTEM_disk1</code> will be use on a deployment {{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -a custom-env.yml -b disk1 -p SYSTEM -k --custom-steps custom-partitioning.yml}} The default disk on Grid'5000 is <code>disk1</code> and the partition is <code>DEPLOY</code>, so by default, kadeploy look for the label <code>KDPL_DEPLOY_disk0</code>}}


;3. Customize the environment's postinstall
;3. Customize the environment's postinstall
Line 692: Line 639:
* Add your custom /etc/fstab file in this directory, named '''fstab''':
* Add your custom /etc/fstab file in this directory, named '''fstab''':
<syntaxhighlight lang="bash">
<syntaxhighlight lang="bash">
/dev/sda1      /          ext4    defaults 1      2
PARTLABEL=KDPL_SYSTEM_disk0    /          ext4    defaults 1      2
</syntaxhighlight>
</syntaxhighlight>
When you will pass "--fstab custom" option to the postinstall, it will copy this file in /etc/fstab
When you will pass "--fstab custom" option to the postinstall, it will copy this file in /etc/fstab
Line 699: Line 646:
* Make some cleanup:
* Make some cleanup:
* Create the environment's description file (let's say '''custom-env.dsc''') based on the reference one:
* Create the environment's description file (let's say '''custom-env.dsc''') based on the reference one:
** use kaenv3 -p debian10-x64-base to have an example of environment description.
** use kaenv3 -p debian10-base to have an example of environment description.
Your '''custom-env.dsc''' should look like this:
Your '''custom-env.dsc''' should look like this:
<syntaxhighlight lang="yaml">
<syntaxhighlight lang="yaml">
Line 705: Line 652:
name: custom-env
name: custom-env
version: 1
version: 1
arch: x86_64
description: Custom env based on Debian 10
description: Custom env based on Debian 10
author: me@domain.tld
author: me@domain.tld
Line 729: Line 677:


Finally, we deploy our custom environment with your custom operations:
Finally, we deploy our custom environment with your custom operations:
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -a custom-env.dsc -f $OAR_NODE_FILE -p 1 -k --custom-steps custom-partitioning.yml}}
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -a custom-env.dsc -p SYSTEM  --custom-steps custom-partitioning.yml}}
{{Note|text=In some case you should increase the step timeout (for some long formatting for example) see [[Advanced_Kadeploy#Adjusting timeout for some environments]] for details.}}
{{Note|text=In some case you should increase the step timeout (for some long formatting for example) see [[Advanced_Kadeploy#Adjusting timeout for some environments]] for details.}}


Line 739: Line 687:
! Mount point !! Partition !! Disk space !! File System
! Mount point !! Partition !! Disk space !! File System
|-
|-
| swap || /dev/sda1 || 2G || linux-swap
| swap || SWAP || 2G || linux-swap
|-
|-
| / || /dev/sda2 || 18G || ext4
| / || SYSTEM || 18G || ext4
|-
|-
| /var || /dev/sda3 || 30G || ext4
| /var || VAR || 30G || ext4
|-
|-
| /opt || /dev/sda4 || 20G || ext4
| /opt || OPT || 20G || ext4
|-
|-
| /tmp || /dev/sda5 || ''everything else'' || ext4
| /tmp || TMP || ''everything else'' || ext4
|}
|}


Line 795: Line 743:
         name: mount_partitions
         name: mount_partitions
         file: mount.sh
         file: mount.sh
# Hack to disable useless steps
  format_tmp_part:
    substitute:
      - action: exec
        name: remove_format_tmp_part_step
        command: /bin/true
  format_swap_part:
    substitute:
      - action: exec
        name: remove_format_swap_part_step
        command: /bin/true
</syntaxhighlight>
</syntaxhighlight>
{{Note|text=In order for Kadeploy to be able to perform the installation correctly, every partitions have to be mounted before the installation process which is done in the macro-step BroadcastEnv}}
{{Note|text=In order for Kadeploy to be able to perform the installation correctly, every partitions have to be mounted before the installation process which is done in the macro-step BroadcastEnv}}
Line 811: Line 748:
<syntaxhighlight lang="bash">
<syntaxhighlight lang="bash">
mklabel gpt
mklabel gpt
u GB mkpart partition-swap linux-swap 0% 2
u GB mkpart KDPL_SWAP_disk0 linux-swap 0% 2
u GB mkpart partition-system ext4 2 20
u GB mkpart KDPL_SYSTEM_disk0 ext4 2 20
u GB mkpart partition-var ext4 20 50
u GB mkpart KDPL_VAR_disk0 ext4 20 50
u GB mkpart partition-opt ext4 50 70
u GB mkpart KDPL_OPT_disk0 ext4 50 70
u GB mkpart partition-tmp ext4 70 100%
u GB mkpart KDPL_TMP_disk0 ext4 70 100%
toggle 2 boot
toggle 2 boot
align-check optimal 1
align-check optimal 1
Line 823: Line 760:
align-check optimal 5
align-check optimal 5
</syntaxhighlight>
</syntaxhighlight>
{{Note|text=Kadeploy identify the partitions using GPT label. The form of the label should be <code>KDPL_</code><code class="replace">name</code><code>_</code><code class="replace">disk</code>. So the label <code>KDPL_SYSTEM_disk1</code> will be use on a deployment {{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -a custom-env.yml -b disk1 -p SYSTEM --custom-steps custom-partitioning.yml}} The default disk on Grid'5000 is <code>disk1</code> and the partition is <code>DEPLOY</code>, so by default, kadeploy look for the label <code>KDPL_DEPLOY_disk0</code>. The <code>KDPL_SWAP_</code><code class="replace">disk</code> is also detected as the swap partition and will be formated by kadeploy when detected.}}


* The file '''format.sh''' will look like something like this:
* The file '''format.sh''' will look like something like this:
Line 832: Line 772:
ext4_blocksize="4096"
ext4_blocksize="4096"


# create swap
# / will be formated by Kadeploy since we will precise the -p SYSTEM option
mkswap ${KADEPLOY_BLOCK_DEVICE}1
# / will be formated by Kadeploy since we will precise the -p 2 option
# formating /var
# formating /var
mkfs -t ext4 -b ${ext4_blocksize} -O ${mkfs_opts} -q ${KADEPLOY_BLOCK_DEVICE}3
mkfs -t ext4 -b ${ext4_blocksize} -O ${mkfs_opts} -q /dev/disk/by-partlabel/KDPL_VAR_disk0
# formating /opt
# formating /opt
mkfs -t ext4 -b ${ext4_blocksize} -O ${mkfs_opts} -q ${KADEPLOY_BLOCK_DEVICE}4
mkfs -t ext4 -b ${ext4_blocksize} -O ${mkfs_opts} -q /dev/disk/by-partlabel/KDPL_OPT_disk0
# formating /tmp
# formating /tmp
mkfs -t ext4 -b ${ext4_blocksize} -O ${mkfs_opts} -q ${KADEPLOY_BLOCK_DEVICE}5
mkfs -t ext4 -b ${ext4_blocksize} -O ${mkfs_opts} -q /dev/disk/by-partlabel/KDPL_TMP_disk0
</syntaxhighlight>
</syntaxhighlight>
{{Note|text=When running a custom script, Kadeploy will export different variables, you can get a list of them by running "kadeploy -i".}}
{{Note|text=When running a custom script, Kadeploy will export different variables, you can get a list of them by running "kadeploy -I".}}
* The file '''mount.sh''' will look like something like this:
* The file '''mount.sh''' will look like something like this:
<syntaxhighlight lang="bash">
<syntaxhighlight lang="bash">
Line 851: Line 789:
# mount /var
# mount /var
mkdir ${KADEPLOY_ENV_EXTRACTION_DIR}/var
mkdir ${KADEPLOY_ENV_EXTRACTION_DIR}/var
mount ${KADEPLOY_BLOCK_DEVICE}3 ${KADEPLOY_ENV_EXTRACTION_DIR}/var/
mount /dev/disk/by-partlabel/KDPL_VAR_disk0 ${KADEPLOY_ENV_EXTRACTION_DIR}/var/
# mount /opt
# mount /opt
mkdir ${KADEPLOY_ENV_EXTRACTION_DIR}/opt
mkdir ${KADEPLOY_ENV_EXTRACTION_DIR}/opt
mount ${KADEPLOY_BLOCK_DEVICE}4 ${KADEPLOY_ENV_EXTRACTION_DIR}/opt/
mount /dev/disk/by-partlabel/KDPL_OPT_disk0 ${KADEPLOY_ENV_EXTRACTION_DIR}/opt/
# mount /tmp
# mount /tmp
mkdir ${KADEPLOY_ENV_EXTRACTION_DIR}/tmp
mkdir ${KADEPLOY_ENV_EXTRACTION_DIR}/tmp
mount ${KADEPLOY_BLOCK_DEVICE}5 ${KADEPLOY_ENV_EXTRACTION_DIR}/tmp/
mount /dev/disk/by-partlabel/KDPL_TMP_disk0 ${KADEPLOY_ENV_EXTRACTION_DIR}/tmp/
</syntaxhighlight>
</syntaxhighlight>


;3. Customize the environment's postinstall
;3. Customize the environment's postinstall
Line 871: Line 810:
* Add your custom /etc/fstab file in this temporary directory, named '''fstab''':
* Add your custom /etc/fstab file in this temporary directory, named '''fstab''':
<syntaxhighlight lang="bash">
<syntaxhighlight lang="bash">
/dev/sda1       none          swap    sw      0      0
PARTLABEL=KDPL_SWAP_disk0       none          swap    sw      0      0
/dev/sda3      /var          ext4    defaults 1      2
PARTLABEL=KDPL_VAR_disk0      /var          ext4    defaults 1      2
/dev/sda4       /opt          ext4    defaults 1      2
PARTLABEL=KDPL_OPT_disk0       /opt          ext4    defaults 1      2
/dev/sda5       /tmp          ext4    defaults 1      2
PARTLABEL=KDPL_TMP_disk0       /tmp          ext4    defaults 1      2
</syntaxhighlight>
</syntaxhighlight>
/ will be added by Kadeploy since we will precise the <code class="command">-p 2</code> option
/ will be added by Kadeploy since we will precise the <code class="command">-p SYSTEM</code> option
* Regenerate the postinstall archive:
* Regenerate the postinstall archive:
{{Term|location=frontend|cmd=<code class="command">tar</code> -czvf <code class="replace">~/g5k-postinstall-custom.tgz</code> *}}
{{Term|location=frontend|cmd=<code class="command">tar</code> -czvf <code class="replace">~/g5k-postinstall-custom.tgz</code> *}}
Line 882: Line 821:
{{Term|location=frontend|cmd=<code class="command">popd</code> && <code class="command">rm</code> -R $tmpdir}}
{{Term|location=frontend|cmd=<code class="command">popd</code> && <code class="command">rm</code> -R $tmpdir}}
* Create the environment's description file (let's say '''custom-env.yml''') based on the reference one:
* Create the environment's description file (let's say '''custom-env.yml''') based on the reference one:
{{Term|location=frontend|cmd=<code class="command">kaenv3</code> -p debian10-x64-base -u deploy <nowiki>|</nowiki> sed -e "s/archive:.*$/archive: <code class="replace">\/home\/${USER}\/g5k-postinstall-custom.tgz</code>/" -e 's/public/shared/' > custom-env.yml}}
{{Term|location=frontend|cmd=<code class="command">kaenv3</code> -p debian11-base -u deploy <nowiki>|</nowiki> sed -e "s/archive:.*$/archive: <code class="replace">\/home\/${USER}\/g5k-postinstall-custom.tgz</code>/" -e 's/public/shared/' > custom-env.yml}}
and customize the '''custom-env.yml''' file to suit your needs (especially your archive path):
and customize the '''custom-env.yml''' file to suit your needs (especially your archive path):


Line 895: Line 834:
os: linux
os: linux
image:
image:
   file: server:///grid5000/images/debian10-x64-base-2019100414.tgz
   file: server:///grid5000/images/debian11-base-2021092316.tar.zst
   kind: tar
   kind: tar
   compression: gzip
   compression: zstd
postinstalls:
postinstalls:
- archive: /home/me/g5k-postinstall-custom.tgz
- archive: /home/me/g5k-postinstall-custom.tgz
Line 915: Line 854:


Finally, we deploy our custom environment with your custom operations:
Finally, we deploy our custom environment with your custom operations:
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -a custom-env.yml -f $OAR_NODE_FILE -p 2 -k --custom-steps custom-partitioning.yml}}
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> -a custom-env.yml -p SYSTEM --custom-steps custom-partitioning.yml}}
{{Note|text=In some case you should increase the step timeout (for some long formatting for example) see [[Advanced_Kadeploy#Adjusting timeout for some environments]] for details.}}
{{Note|text=In some case you should increase the step timeout (for some long formatting for example) see [[Advanced_Kadeploy#Adjusting timeout for some environments]] for details.}}


=== Boot a custom kernel with kexec  ===
=== Boot a custom kernel with kexec  ===


In this example, we change some custom operations of the '''reboot''' workflow: our nodes will use kexec to start a custom kernel that we upload beforehand, instead of rebooting. This allows us to boot a custom kernel without deploying a whole system (just use the <code class=command>kareboot3</code> command, not not the <code class=command>kadeploy3</code> one).
In this example, we change some custom operations of the '''reboot''' workflow: our nodes will use kexec to start a custom kernel that we upload beforehand, instead of rebooting. This allows us to boot a custom kernel without deploying a whole system (just use the <code class=command>kareboot3</code> command, not the <code class=command>kadeploy3</code> one).


;1. Make the reservation in destructive mode
;1. Make the reservation in destructive mode


First of all, when you do your reservation, you must tell OAR that it must redeploy the node entirely after your reservation. For this, use the <code class="command">-t destructive</code> parameter:
First of all, when we do our reservation, we have to tell OAR that it must redeploy the node entirely after our reservation. For this, we use the <code class="command">-t destructive</code> parameter:
{{Term|location=frontend|cmd=<code class="command">oarsub</code> -t deploy -t destructive -l nodes=1,walltime=2 -p "cluster='hercule'" -I}}
{{Term|location=frontend|cmd=<code class="command">oarsub</code> -t deploy -t destructive -l nodes=1,walltime=2 -p hercule -I}}


;2. Describe the custom operations
;2. Describe the custom operations


Next, you must create a file that describes custom operations to perform during the reboot step.
Next, we must create a file that describes custom operations to perform during the reboot step.
In our example, we will first send kernel and initrd files to the nodes, then run kexec using with them.
 
A operation description file (here '''custom-kexec-reboot.yml''') to realise those actions looks like this:
In our example, we first send kernel and initrd files to the nodes, then run kexec using them.
A operation description file (here '''custom-kexec-reboot.yml''') to perform those actions looks like this:


<syntaxhighlight lang="yaml">
<syntaxhighlight lang="yaml">
Line 955: Line 895:
;3. Run the reboot
;3. Run the reboot


Finally, we reboot with your custom operations:
Finally, we call <code class=command>kareboot3</code> with our custom operations:
{{Term|location=frontend|cmd=<code class="command">kareboot3</code> -f $OAR_NODE_FILE --custom-steps custom-kexec-reboot.yml}}
{{Term|location=frontend|cmd=<code class="command">kareboot3</code> simple --custom-steps custom-kexec-reboot.yml}}
 
=== Skip the first reboot/kexec  ===
If you are already on the deployment kernel (by using as breakpoint or rebooting with  <code class="command">kareboot3 -r deploy_env</code>) you may want to skip the first reboot/kexec. This can be done by masking the reboot/kexec via custom operations. You need to be in a destructive job to be able to use custom operations (<code class="command">oarsub -t deploy -t destructive</code>)).Write a file with your custom operation, eg '''custom-kexec-reboot.yml''':
<syntaxhighlight lang="yaml">
---
SetDeploymentEnv:
    reboot:
        substitute:
            - action: exec
              name: nothing
              command: 'true'
    kexec:
        substitute:
            - action: exec
              name: nothing
              command: 'true'
</syntaxhighlight>
 
Start the deployment with our custom operations:
{{Term|location=frontend|cmd=<code class="command">kadeploy3</code> debian11-min --custom-steps custom-kexec-reboot.yml }}

Revision as of 10:12, 22 June 2022

Note.png Note

This page is actively maintained by the Grid'5000 team. If you encounter problems, please report them (see the Support page). Additionally, as it is a wiki page, you are free to make minor corrections yourself if needed. If you would like to suggest a more fundamental change, please contact the Grid'5000 team.


Warning.png Warning

Please see the Environment creation guide, which gives automated mechanisms to build kadeploy environnements

What you need to know before starting

The first thing to understand is that by using kadeploy3, you will be running a command that attempts to remotely reboot possibly many nodes at the same time, and boot them over the network using configuration files hosted on a server.

What is an Environment?

Where we describe what exactly is image, kernel, initrd and postinstall

An environment in kadeploy3 is a set of file describing a fully functional Operating System. To be able to setup a Operating System, kadeploy3 needs at least 4 files in the most common cases

  1. An image
    • An image is a file containing all the Operating System files. It is a compressed archive (compressed tarball).
  2. A kernel file
    • Specifies the kernel file to boot. The full file path in the target system is expected.
  3. An initrd file (optional)
    • For the Linux systems, the initrd or initramfs file is the initial ramdisk used by the kernel as a preliminary system before the root filesytem is mounted. More information: Initrd on Wikipedia. The full file path in the target system is expected.
  4. A postinstall file (optional)
    • The postinstall file allows for adapting the deployed environment to specificities of each site and cluster, as well as setting up some platform-specific services.

Once you have this set of files, you can describe your environment to kadeploy3. This description represents an environment in the kadeploy3 sense.

Note: the kadeploy software also supports deploying environments made of binary disk images (dd format) but this is not supported on Grid'5000.

How can I make my own environment?

To create our own environment, there are two main ways:

  • One way is to deploy an existing environment, customize it and save it with tgz-g5k
  • The other one is to build the environment from a recipe, using kameleon, just like the Grid'5000 supported reference environments. See Environment creation.

Disk partitioning

Environments are deployed on the (first) local disk of each node (each Grid'5000 node is equipped with at least one hard drive, HDD or SSD). The following partitioning scheme is used:

Label Role
SWAP Linux swap
PROD Standard environment (default environment installed on nodes)
DEPLOY User's environment (when Kadeploy is used)
EFI For UEFI systems: UEFI system partition (store boot loaders)
TMP Remaining disk space made available in /tmp
Notes
  • Some clusters have more than one local disk. On some clusters, those disks can be reserved apart from the node itself. See Disk reservation to find out how to use them.
  • Kadeploy allows deployments on other partitions than DEPLOY or even creating a custom partitioning scheme. See below for details.

Search and deploy an existing environment

Search an environment

Grid'5000 maintains several reference environments directly available for deployment in all sites. These environments are based on the Debian, Ubuntu, or Centos Linux distribution.

For Debian, different variants of reference environments are offered. For Ubuntu and Centos, only environments with a minimal system are offered.

They are called reference environments because they can be used to generate customized environments and because they are provided and supported by the Grid'5000 technical team. The full list of reference environments is:

Name x86_64 ppc64le aarch64 Description
debian11-min Check.png Check.png Check.png debian 11 (bullseye) minimalistic installation
debian11-base Check.png Check.png Check.png debian 11 (bullseye) with various Grid'5000-specific tuning for performance
debian11-nfs Check.png Check.png Check.png debian 11 (bullseye) with support for mounting NFS home
debian11-big Check.png Check.png Check.png debian 11 (bullseye) with packages for development, system tools, editors, shells.
debiantesting-min Check.png Check.png Check.png debian testing minimalistic installation
debiantesting-nfs Check.png Check.png Check.png debian testing with support for mounting NFS home
centos7-min Check.png Check.png Check.png centos 7 minimalistic installation
centos7-nfs Check.png Check.png Check.png centos 7 with support for mounting NFS home
centos8-min Check.png Check.png Check.png centos 8 minimalistic installation
centos8-nfs Check.png Check.png Check.png centos 8 with support for mounting NFS home
rocky8-min Check.png NoStarted.png Check.png rocky 8 minimalistic installation
rocky8-nfs Check.png NoStarted.png Check.png rocky 8 with support for mounting NFS home
rocky9-min Check.png NoStarted.png Check.png rocky 9 minimalistic installation
rocky9-nfs Check.png NoStarted.png Check.png rocky 9 with support for mounting NFS home
centosstream8-min Check.png NoStarted.png Check.png centos-stream 8 minimalistic installation
centosstream8-nfs Check.png NoStarted.png Check.png centos-stream 8 with support for mounting NFS home
centosstream9-min Check.png NoStarted.png Check.png centos-stream 9 minimalistic installation
centosstream9-nfs Check.png NoStarted.png Check.png centos-stream 9 with support for mounting NFS home
ubuntu2004-min Check.png Check.png Check.png ubuntu 20.04 (focal) minimalistic installation
ubuntu2004-nfs Check.png Check.png Check.png ubuntu 20.04 (focal) with support for mounting NFS home
ubuntu2204-min Check.png NoStarted.png Check.png ubuntu 22.04 (jellyfish) minimalistic installation
ubuntu2204-nfs Check.png NoStarted.png Check.png ubuntu 22.04 (jellyfish) with support for mounting NFS home
almalinux9-min Check.png NoStarted.png Check.png almalinux 9 minimalistic installation
debian12-min Check.png Check.png Check.png debian 12 (bookworm) for ppc64le - min
ubuntu2404-min Check.png NoStarted.png NoStarted.png ubuntu 22.04 (noble) minimalistic installation
almalinux9-nfs Check.png NoStarted.png Check.png almalinux 9 with support for mounting NFS home
debian12-nfs Check.png Check.png Check.png debian 12 (bookworm) with support for mounting NFS home
ubuntu2404-nfs Check.png NoStarted.png NoStarted.png ubuntu 24.04 (noble) with support for mounting NFS home
debian12-big Check.png Check.png Check.png debian 12 (bookworm) with packages for development, system tools, editors, shells.
ubuntugh2404-big NoStarted.png NoStarted.png Check.png ubuntu 24.04 (noble) for NVIDIA Grace Hopper
ubuntul4t200435-big NoStarted.png NoStarted.png Check.png ubuntu 20.04 (focal) with L4T r35.4.1-cti001 for aarch64 - big
ubuntu2404-rocm Check.png NoStarted.png NoStarted.png ubuntu 24.04 (noble) for x86_64 - rocm

Last generated from the Grid'5000 API on 2025-10-01

Kadeploy provides a registry of environments in each site, where reference environments are registered along with environments of users.

For reference environments, associated filesystem images are stored in the /grid5000 directory of the frontend.

To deploy a registered environment, you must know its name as registered in the Kadeploy registry. This tutorial uses the debian11-base environment.

You can also list all available environment in a site by using the kaenv3 command:

Terminal.png frontend:
kaenv3 -l

This command lists all public as well as your private environments.

We distinguish three levels of visibility for an environment:

  • public: Only administrators can register public environments. They are shown and used by default by the kaenv3 and kadeploy3 commands unless a user is specified (with -u user).
  • shared: Shared user environments. They are shown and used by the kaenv3 and kadeploy3 commands when a user is specified with -u user.
  • private: The environment is only shown and usable by the user the environment belongs to.

For example, a shared environment added by user user is listed this way:

Terminal.png frontend:
kaenv3 -l -u user

You can also look for a specific version with the --env-version version option. Most of the versions of the reference environments images files are available in /grid5000/images. The version number is the last part of the image file.

For instance: debian11-min-2021092316.tar.zst is the image file of the debian11-min reference environment version 2021092316.

Whenever you want to deploy this specific image on some nodes, use:

Terminal.png frontend:
kadeploy3 debian11-min --env-version 2021092316

Being able to reproduce a past experiment is a desirable feature. Therefore, you should always try to control as much as possible the environment the experiment is done in. Therefore, we will attempt to check that the environment that was chosen in the environment directory is the one available on a given cluster. On the cluster you would like to deploy, type the following command to print information about an environment:

Terminal.png frontend:
kaenv3 debian11-base -u deploy

Adding -u deploy makes sure we get the description of the environment supported by the Grid'5000 staff, as deploy is the user owning them (one might have registred his own environment with the same, that would show up if no user is specified).

In theory, you should also check the post-install script. A post-install script adapts an environment to the site it is deployed on.

If everything seems ok, please proceed to the next step.

Make a job on a deployable node

By default, Grid'5000 nodes are running on the production environment, which already contains most of the important features and can be used to run experiments. This environment however imposes choices and has some limitations due to its general-purpose target. While you can gain the root privileges on it thanks to the sudo-g5k command, you cannot reboot for instance, as this ends your reservation. Deploying on the contrary allows to have a dedicated environment and to have full control over the machine (reboot, serial console, ...).

For this part of the tutorial, jobs made will be interactive (-I), of the deploy type (-t deploy), on only one machine (-l nodes=1) to do environment customization (we will give ourselves 3 hours with -l walltime=3), which gives us the following command, that will open a new shell session on the frontend node:

Terminal.png frontend:
oarsub -I -t deploy -l nodes=1,walltime=3

Indeed, when you submit a job of the deploy type, a new shell is opened on the frontend node and not on the first machine of the job as for standard jobs. When you exit from this shell, the job ends. The shell is populated with OAR_* environment variables. You should look at the list of available variables to get an idea of the information you can use to script deployment later. As usual, if the job is successful, you will get the name of the machine allocated to your job with:

Terminal.png frontend:
cat $OAR_FILE_NODES
Warning.png Warning

At the end of a reservation with the -t deploy option, the reserved nodes will be reboot on the standard environment and thus make them available for another job possibly of another user. Rebooting may take up to 15 minutes, so please mind your commands as they have an implicit cost.

Deploy a reference environment

By default, Kadeploy use all the nodes of the reservation. It is done by using the file whose name is given by the $OAR_FILE_NODES environment variable (or $OAR_NODE_FILE, or $OAR_NODEFILE) So, to start the deployment on all the nodes, run the following command:

Terminal.png frontend:
kadeploy3 debian11-base

You can also just provide the nodes to deploy on, with the -m option:

Terminal.png frontend:
kadeploy3 debian11-base -m node.site.grid5000.fr


By default, Kadeploy will copy your ~/.ssh/authorized_keys and replace the /root/.ssh/authorized_keys file on the deployed nodes. Alternatively, you can use the -k option in two ways:

  • You can either specify the public key that will be copied in /root/.ssh/authorized_keys on the deployed nodes:
Terminal.png frontend:
kadeploy3 debian11-base -k ~/.ssh/my_special_key.pub
  • Or to not copy any key on the deployed node with -k none. You will need to provide a password to connect. However, SSH is often configured to disallow root login using password. The root password for all reference environments (i.e. provided by the Grid'5000 technical team) is grid5000.


In our case, the node file contains only 1 node.

Once the kadeploy command was executed successfully, the deployed node runs the debian11-base environment as their operating system. It will then be possible to tune this environment according to your needs.

Connect to the deployed environment and customize it

1. Connection

On reference environments managed by the Grid'5000 technical team, you can use the root account for log in with ssh (kadeploy checks that sshd is running before declaring a deployment successful). To connect to the node type:

Terminal.png frontend:
ssh root@node.site.grid5000.fr

In case this doesn't work, please take a look at the kadeploy section of the Sidebar > FAQ

2. Adding software to an environment

You can alter your environment (to add missing libraries that you need, or remove packages that you don't need ; to reduce the size of the image and speed up the deployment process ; etc.) using commands such as:

Terminal.png node:
apt-get update
apt-get upgrade
apt-get install list of desired packages and libraries
apt-get --purge remove list of unwanted packages
apt-get clean

Create a new environment from a node's customized operating system

We now need to save this customized environment, where you have a user account, to be able to use this account again each time you deploy it.
The first step to create an environment is to create an archive of the node's operating system you just customized. You can use tgz-g5k to extract a Grid'5000 environment tarball from a running node. It's usage is describe in the Environment creation tutorial page.

Advanced deployment options

Multisite deployment

In order to achieve a deployment on nodes from different sites, you can use the multiserver option of kadeploy, using the -M option.

Terminal.png frontend:
kadeploy3 -M -f file_with_all_nodes debian11-big

Deploy on other partition of disk(s)

Kadeploy default handling of partitions may be too limited for some usages. One may need to use disks differently. Kadeploy offers several options to deploy on another existing partition of the primary disk, or if required to repartition disks entirely and/or use several disks (on nodes with many disks).

Deploy on partition labeled PROD or TMP

As seen earlier, kadeploy manages to label portions to identify them more easily, e.g. PROD, DEPLOY, or TMP. Kadeploy can be instructed to deploy on the PROD or TMP partition instead of the DEPLOY partition.

Because this kind of deployment will break some node standard operations after the end of your job, you must add to your oarsub command for the job creation the -t destructive option. This will cause the node to be completely reinstalled after your job.

Terminal.png frontend:
oarsub -t deploy -t destructive -l nodes=1,walltime=1 -p hercule -I

Then you can deploy on PROD or TMP with the -p PROD or -p TMP option:

Terminal.png frontend:
kadeploy3 debian11-nfs -p PROD

Deploy on secondary disks

Kadeploy identifies disks by their id (disk0, disk1...). You can find them in the Reference API or in the Hardware pages:

Because this kind of deployment will break some node standard operations after the end of your job, you must add to your oarsub command for the job creation the -t destructive option. This will cause the node to be completely reinstalled after your job.

Terminal.png frontend:
oarsub -t deploy -t destructive -l nodes=1,walltime=1 -p hercule -I

Then you can deploy on an secondary disk such as disk1 with the -b disk1 option:

Terminal.png frontend:
kadeploy3 debian11-min -b disk1
Warning.png Warning

It will not work for reservable disks

Disks can also be handled differently by modifying the deployment automata, see below.

About the kernel and bootloader

As a reminder, a deployed environment may be booted either with kexec (e.g. at the end of the deployment) or by grub (e.g. after the deployment with a kareboot, or a call of the reboot command from the node itself, or if kexec is deactivated for the cluster because not supported).

By default, the bootloader installation step of kadeploy will install and configure grub using the grub commands provided by your environment. The grub configuration file will be generated with grub-mkconfig, which will boot by default the most recent kernel version available in the /boot directory of your environment. In case that version does not match the kernel version provided in the environment description, an error will be reported (Kernel file mismatch between grub and the environment description). This is a sanity check.

If the grub commands are not available in the deployed environment, as a fallback the grub commands provided in the deploy kernel will be used for the installation, and a very simple grub configuration will be generated (unlike when grub-mkconfig is used) that just use the kernel, initrd and kernel params from your environment description.

Note that having grub installed and configured with the commands that are provided in the deployed environment is the preferred way, because it will be isofunctional whenever the deployed system later has upgrades that impact the kernel or grub (or if the bootloader configuration command grub-mkconfig or update-grub is called).

Whenever relevant, by using the --custom-variable option of kadeploy, you can trigger a change of the behavior of the bootloader installation step, as such:

  • BOOTLOADER_SKIP=1, do nothing (grub not installed nor configured)
  • BOOTLOADER_NO_CONFIG=1, do not configure grub
  • BOOTLOADER_NO_INSTALL=1, do not install grub
  • BOOTLOADER_NO_GRUB_FROM_DEST=1, do not configure nor install using grub from your deployed env but from the deploy kernel
  • BOOTLOADER_NO_GRUB_MKCONFIG_FROM_DEST, do not configure using grub from your deployed env but from the deploy kernel
  • BOOTLOADER_NO_GRUB_INSTALL_FROM_DEST=1 do not install using grub from your deployed env but from the deploy kernel
  • BOOTLOADER_NO_UUID=1 do not use filesystem UUID to identify the root partition
  • BOOTLOADER_SHOW_MENU=1 enable the grub menu on the console

For example, use kadeploy3 my_env --custom-variable BOOTLOADER_NO_INSTALL=1 to not install grub.

Customizing the postinstalls

In Kadeploy3, postinstalls are scripts that are executed after the copy of the image file in order to customize site-specific or cluster-specific aspects. Since the beginning on 2018, on Grid'5000 the same postinstall script (called g5k-postinstall) is used for all reference environments (and is thus compatible with all supported Debian versions and distributions). That script takes parameters in order to define its behaviour (for example, to choose the style of network configuration to use).

Using g5k-postinstall

The source code for g5k-postinstall is available on gitlab.inria.fr. Its parameters at the time of writing are:

Usage: g5k-postinstall [options]

Options:
    -d, --debug                      Run in debug mode, with output to terminal
    -v, --version                    Print g5k-postinstall version
    -n, --net n1,n2,n3               Network configuration specification
    -f, --fstab f1,f2,f3             Filesystems configuration specification
    -r, --restrict-user MODE         User restriction mode
        --inittab PATTERN            Configure console in inittab
        --bootloader b1,b2,b3        Set the kadeploy bootloader step setup options (deprecated)
        --no-ref-api                 Do not use the Reference API
        --disable-hacks h1,h2,h3     Hacks to disable
        --no-guix                    Disable guix
        --disk-aliases               Enable disk aliases (e.g. /dev/disk0p1)

Valid tags for network specification:
  debian               write config in /etc/network/interfaces
  debian-bridged       write config in /etc/network/interfaces, with a bridge
                       setup (for the std env)
  netplan              write config in /etc/netplan/01-netcfg.yaml
                       (https://wiki.ubuntu.com/Netplan)
  redhat               write config in /etc/sysconfig/network-scripts/*
  traditional-names    use traditional NIC naming (e.g. eth0) instead of
                       predictable
  force-ref-api-names  force the use of the name provided in the reference API
                       (by default, the predictable name determined by the
                       kernel is used)
  hpc                  add support for HPC (eg InfiniBand) interfaces
Example: --net debian-bridged,traditional-names,hpc

Valid tags for filesystems configuration in fstab:
  nfs            include generic NFS mounts (/home with autofs, /grid5000)
  no-autofs      do not use autofs, just mount the user's NFS directory
  no-uuid        do not use filesystem UUID to identify the partitions, use the
                 block device filenames
  custom         include custom mounts for custom partitionning. Need fstab
                 file included in postinstall archive.
Example: --fstab nfs

Valid modes for user restriction:
  std         if deployed on production partition, restrict to root,oar. else,
              restrict to the current user (see below)
  current     restrict to root and the user having currently reserved the node
  login:jdoe  restrict to a specific login (e.g. jdoe)
  none        no restriction (DEFAULT)

Inittab option:
  Needed for non-systemd systems.
  Example of pattern: s0:12345:respawn:/sbin/agetty -L SPEED TTYSX vt100
  Where SPEED and TTYSX are replaced by g5k-postinstall using information
  retieved from the parameters which are passed in /proc/cmdline.

No reference API option:
  Do not use the Reference API. This is useful during initial configuration of
  new clusters.

Disable hacks option:
  g5k-postinstall includes hacks that can optionally be disabled.
  Current hacks are:
  - oot-i40e        install the i40e driver on chifflot and chiclet (lille) and
                    grappe (nancy) on debian9 environment.
  - force-net-name  on several clusters, the predictable network interface name
                    can change depending on udev version. This hack forces the
                    predictable network interface name in such cases.
  - beegfs-gr520    configure beegfs shares on grcinq and grvingt (nancy).
Example: --disable-hacks oot-i40e,force-net-name

Disk aliases option:
  g5k-postinstall can optionally install udev rules to provide predictable disk aliases.
  Example of disk aliases automatically created by these udev rules:
    /dev/disk0   -> /dev/sdb
    /dev/disk0p2 -> /dev/sdb2
    /dev/disk1   -> /dev/nvme0n1
    /dev/disk1p4 -> /dev/nvme0n1p4
  This is useful because, starting from Linux 5.3, traditional block device names
  are non-deterministic: /dev/sda might not refer to the same disk on every boot.

An example environment description using g5k-postinstall is:

---
name: debian11-min
version: 2021092316
arch: x86_64
description: debian 11 (bullseye) for x64 - min
author: pierre.neyron@imag.fr
visibility: private
destructive: false
os: linux
image:
  file: server:///grid5000/images/debian11-x64-min-2021092316.tar.zst
  kind: tar
  compression: zstd
postinstalls:
- archive: server:///grid5000/postinstalls/g5k-postinstall.tgz
  compression: gzip
  script: g5k-postinstall --net debian --disk-aliases
boot:
  kernel: "/vmlinuz"
  initrd: "/initrd.img"
  kernel_params: ''
filesystem: ext4
partition_type: 131
multipart: false

Things that you can do from there:

  • Use different parameters to change the behaviour of the postinstall. Example parameters for various situations are:
    • Debian min environment with traditional NIC naming: g5k-postinstall --net debian --net traditional-names
    • Debian min environment with predictable NIC naming: g5k-postinstall --net debian
    • Debian min environment with predictable disk aliases: g5k-postinstall --net debian --disk-aliases
    • Debian NFS environment (mount /home, setup LDAP, restrict login to user who reserved the node): g5k-postinstall --net debian --fstab nfs --restrict-user current
    • Debian big environment (NFS + setup HPC networks and mount site-specific directories): g5k-postinstall --net debian --net traditional-names --net hpc --fstab nfs --fstab site-specific
    • Use GPT label (e.g. /dev/disk/by-partlabel/KDPL_DEPLOY_disk0) instead of UUID to identify the root partition in the kernel command line: g5k-postinstall --bootloader no-uuid
    • Use GPT label (e.g. /dev/disk/by-partlabel/KDPL_DEPLOY_disk0) instead of UUID to identify the partitions in the fstab: g5k-postinstall --fstab no-uuid
    • RHEL/Centos style for network configuration: g5k-postinstall --net redhat --net traditional-names
    • Ubuntu 1710 or later: NetPlan for network configuration: g5k-postinstall --net netplan
    • Do not do any network configuration (useful for Gentoo), but force serial console settings: g5k-postinstall --inittab='s0:12345:respawn:/sbin/agetty -L SPEED TTYSX vt100'
  • Use a customized version of g5k-postinstall: after building a modified g5k-postinstall, just point the postinstalls/archive/ field to the new tar archive. See README.md and TechTeam:Postinstalls for details on g5k-postinstall internals.
  • Add an additional postinstall to execute after g5k-postinstall. That way, the additionnal postinstall can be written in any language (e.g. just a shell script) and complete what is already done by g5k-postinstall. See below.

Adding an extra postinstall

Modifying g5k-postinstall may for some purpose not be relevant (or overkill), for instance if actions are independent and can be executed after g5k-postinstall is run. In such a case, it is very easy to provide an additional postinstall that for instance may just be a shell script:

Assuming additional-postinstall.sh is a script located at the root of the /home/jdoe/public/debiantesting-x64-additional-postinstall.tar.gz archive, we just have to declare the additional postinstall in the environment description as follows:

---
author: John Doe
boot:
  initrd: /initrd.img
  kernel: /vmlinuz
description: debian testing with some customizations
destructive: false
filesystem: ext4
image:
  compression: gzip
  file: local:///home/jdoe/public/debiantesting-x64-custom.tar.gz
  kind: tar
multipart: false
name: debiantesting-custom
arch: x86_64
os: linux
partition_type: 131
postinstalls:
- archive: server:///grid5000/postinstalls/g5k-postinstall.tgz
  compression: gzip
  script: g5k-postinstall --net debian --fstab nfs --restrict-user current
- archive: local:///home/jdoe/public//debiantesting-x64-additional-postinstall.tar.gz
  compression: gzip
  script: additional-postinstall.sh
version: 2020071009
visibility: shared

Both postinstalls will execute during the deployment, the one after the other.

Tuning the Kadeploy3 deployment workflow

kadeploy3 allows to fully modify the deployment workflow.

First of all you have to understand the different steps of a deployment. There are 3 macro-steps:

  1. SetDeploymentEnv: this step aims at setting up the deployment environment that contains all the required tools to perform a deployment ;
  2. BroadcastEnv: this step aims at broadcasting the new environment to the nodes and writing it to disk;
  3. BootNewEnv: this step aims at rebooting the nodes on their new environment.

kadeploy3 provides several implementations for each of those 3 macro-steps. You can consult that list in the kadeploy3 page. In Grid'5000, we use the following steps by default in all our clusters:

  • SetDeploymentEnv -> SetDeploymentEnvUntrusted: use an embedded deployment environment
  • BroadcastEnv -> BroadcastEnvKascade: use the Kascade tool to broadcast the environment
  • BootNewEnv -> BootNewEnvKexec: the nodes use kexec to reboot (if it fails, a BootNewEnvClassical, classical reboot, will be performed)

Each one of these implementations is divided in micro steps. You can can see the name of those micro-steps if you use the kadeploy3 option --verbose-level 4. And to see what is actually executed during those micro-steps you can add the debug option of kadeploy3 -d

Terminal.png frontend:
kadeploy3 debian11-base --verbose-level 4 -d > ~/kadeploy3_steps

This command will store the kadeploy3 standard output in the file ~/kadeploy3_steps. Lets analyse its content:

Terminal.png frontend:
grep "Time in" ~/kadeploy3_steps

This command will print on the terminal all the micro-steps executed during the deployment process, and the time spent for each execution. Here are the micro-steps that you should see in a typical deployment:

  1. SetDeploymentMiniOSTrusted-switch_pxe: Configures the PXE server so that this node will boot on an environment that contains all the required tools to perform the deployment,
  2. SetDeploymentMiniOSTrusted-kexec: Reboot the environnement using kexec
  3. SetDeploymentMiniOSTrusted-wait_reboot: Waits for the node to restart.
  4. SetDeploymentMiniOSTrusted-send_key_in_deploy_env: Sends kadeploy's user's ssh public key into the node's authorized_keys to ease the following ssh connections,
  5. SetDeploymentMiniOSTrusted-create_partition_table: Creates the partition table
  6. SetDeploymentMiniOSTrusted-format_deploy_part: Format the partition where your environment will be installed. This partition is by default /dev/sda3
  7. SetDeploymentMiniOSTrusted-mount_deploy_part: Mounts the deployment partition in a local directory.
  8. SetDeploymentMiniOSTrusted-format_tmp_part: Format the partition defined as tmp (by default, /dev/sda5)
  9. SetDeploymentMiniOSTrusted-format_swap_part: Format the swap partition
  10. BroadcastEnvKascade-send_environment: Sends your environments into the node and untar it into the deployment partition.
  11. BroadcastEnvKascade-manage_admin_post_install: Execute post installation instructions defined by the site admins, in general to adapt to the specificities of the cluster: console baud rate, Infiniband,...
  12. BroadcastEnvKascade-manage_user_post_install: Execute user defined post installation instructions to automatically configure its node depending on its cluster, site, network capabilities, disk capabilities,...
  13. BroadcastEnvKascade-send_key: Sends the user public ssh key(s) to the node
  14. BroadcastEnvKascade-install_bootloader: Properly configures the bootloader
  15. BootNewEnvKexec-switch_pxe: Configure the PXE server so that this node will boot on the partition where your environment has been installed
  16. BootNewEnvKexec-umount_deploy_part: Umount the deployment partition from the directory where it has been mounted during the step 7.
  17. BootNewEnvKexec-mount_deploy_part: ReMount the deployment partition
  18. BootNewEnvKexec-kexec: Perform a kexec reboot on the node
  19. BootNewEnvKexec-set_vlan: Properly configure the node's VLAN
  20. BootNewEnvKexec-wait_reboot: Wait for the node to be up.

That is it. You now know all the default micro-steps used to deploy your environments.

Environment boot: grub vs. kexec

On most clusters, kadeploy boots the user environment using kexec from the deploy kernel, after the image deployment and the postinstall steps. With kexec, Linux serves as a bootloader for itself. This means that the node is not "cold" rebooted, thus the classical bootloader (grub) is not used in that "warm" reboot process.

Whenever the node will be rebooted after the deployment, it will however rather use the classical bootloader (grub) than kexec (unless kexec is also configured to do so in the user environment).

You may look at the kadeploy3 command output to see if kexec is indeed used at the end of the deployment.

If you think the use of kexec may introduce a bias and prefer a classical cold reboot at the end of the deployment, you can either change the kadeploy workflow (see below) or just use the --no-kexec option of the kadeploy3 command.

Adjusting timeout for some environments

Since kadeploy3 provides multiple macro-steps and micro-steps, its is important to detect when a step in failing its execution. This error detection is done by using timeout on each step. When a timeout is reached, the nodes that have not completed the given step are discarded from the deployment process.
The value of those timeouts varies from one cluster to another since they depend on the hardware configuration (network speed, hard disk speed, reboot speed, ...). All defaults timeouts are entered in the configurations files on the kadeploy3 server. But you can consult the default timeouts of each macro-steps by using the command kastat3

Terminal.png frontend:
kastat3 -I
 Kadeploy server configuration:
 Custom PXE boot method: PXElinux
 Automata configuration:
   hercule:
     SetDeploymentEnv: SetDeploymentEnvUntrusted,1,600
     BroadcastEnv: BroadcastEnvKascade,0,1000
     BootNewEnv: BootNewEnvKexec,0,180; BootNewEnvHardReboot,0,900
   nova:
     SetDeploymentEnv: SetDeploymentEnvUntrusted,1,600
     BroadcastEnv: BroadcastEnvKascade,0,1000
     BootNewEnv: BootNewEnvKexec,0,150; BootNewEnvHardReboot,0,600
 ...


kadeploy3 allow users to change timeouts in the command line. In some cases, when you try to deploy an environment with a large tarball or with a post-install that lasts too long, you may get discarded nodes. This false positive behavior can be avoided by manually modifying the timeouts for each step at the deployment time.

For instance, in our previous example, the timeout of each steps are:

  • SetDeploymentEnvUntrusted: 143
  • BroadcastEnvKascade: 111
  • BootNewEnvKexec: 33

You can increase the timeout of the second step to 1200 seconds with the following command:

Terminal.png frontend:
kadeploy3 my_big_env --force-steps "SetDeploymentEnv|SetDeploymentEnvUntrusted:1:450&BroadcastEnv|BroadcastEnvKascade:1:1200&BootNewEnv|BootNewEnvClassical:1:400"

Set Break-Point during deployment

As mentioned in the section above, a deployment is a succession of micro steps that can be consulted and modified.
Moreover, kadeploy3 allows user to set a break-point during deployment.

Examples
  • Breakpoint before the postinstall is run:
Terminal.png frontend:
kadeploy3 debian11-base --verbose-level 4 -d --breakpoint BroadcastEnvKascade:manage_user_post_install

This command can be used for debugging purpose. It performs a deployment with the maximum verbose level and it asks to stop the deployment workflow just before executing the manage_user_post_install micro-step of the BroadcastEnvKascade macro-step. Thus you will be able to connect in the deployment environment and to manually run the user post install script to debug it.

  • Breakpoint before rebooting to the deployed environment:
Terminal.png frontend:
kadeploy3 debian11-base --verbose-level 4 -d --breakpoint BootNewEnv

Stops before rebooting to the deployed env. Mind that rebooting the machine will boot again the kadeploy kernel, not the deployed system.

Warning.png Warning

At the current state of kadeploy3, it is not possible to resume the deployment from the break-point step. Thus you will have to redeploy you environment from the first step. This feature may be implemented in future version of kadeploy3.

Modify the deployment workflow with custom operations

In Kadeploy3, we can easily customize the deployment's automata. It's possible to add custom pre, post or substitute operations to each steps. In a custom operation it's possible to: send a file, execute a command or run a script.

This feature in explained in Kadeploy3's documentation (available on Kadeploy3's website) in the section 4.2.2, Use Case 10 and 4.7.

This is illustrated in the following sub-sections.

Note.png Note

When running a custom script, Kadeploy will export different variables, you can get a list of them by running kadeploy3 -I.
A description of each of this variables is available in Kadeploy3's documentation (on Kadeploy3 website) in the section 4.4

Format additional disks

In this example, we will add some custom operations to the deployment workflow: our nodes have two additional hard disks and we want them to be formated during the deployment process.

We want to a new partition scheme such as:

  • classical grid5000 partitioning on sda
  • data1 ext4 on sdb1
  • data2 ext2 on sdc1

The three following sections describe how to perform such an operation.

1. Make the reservation in destructive mode

First of all, when you do your reservation, you must tell to OAR that it should redeploy the node entirely after the reservation with the -t destructive parameter:

Terminal.png frontend:
oarsub -t deploy -t destructive -l nodes=1,walltime=2 -p hercule -I
2. Describe the custom operations

After that you have to create a file that describe the custom operations you want to be performed during the deployment. In our example we will first repartition the additional disks (using parted) and then format them (using the script format.sh).

  • The operation description file (let's say custom-partitioning.yml) should look like something like this:
---
# Our custom steps should be performed during the SetDeploymentEnv macro-step
SetDeploymentEnvUntrusted:
  # Custom partitioning step, done after the create_partition_table micro-step
  # In the sample this step is exploded in 4 steps but it can be done in 1 using a single parted command
  create_partition_table:
      post-ops:
        # We send a file on the node
        - action: send
          file: sdb.parted
          # The variable $KADEPLOY_TMP_DIR will be substitued by kadeploy
          destination: $KADEPLOY_TMP_DIR 
          name: send_partition_map_sdb
        # Then we execute the parted command using the previously sent file
        - action: exec
          name: partitioning_sdb
          # The variable $KADEPLOY_TMP_DIR will be substitued by kadeploy
          command: parted -a optimal /dev/sdb --script $(cat $KADEPLOY_TMP_DIR/sdb.parted)
        # Same operation for the second disk
        - action: send
          file: sdc.parted
          destination: $KADEPLOY_TMP_DIR 
          name: send_partition_map_sdc
        - action: exec
          name: partitioning_sdc
          command: parted -a optimal /dev/sdc --script $(cat $KADEPLOY_TMP_DIR/sdc.parted)
  # Custom format step, done after the format_deploy_part micro-step
  format_deploy_part:
      post-ops:
        # We run the script contained in the file 'format.sh'
        - action: run 
          name: format_disks
          file: format.sh
  • The file sdb.parted will look like something like this:
mklabel msdos
u GB mkpart primary ext4 0% 100%
align-check optimal 1
  • The file sdc.parted will look like something like this:
mklabel msdos
u GB mkpart primary ext2 0% 100%
align-check optimal 1
  • The file format.sh will look like something like this:
#!/bin/sh
set -e
# formating /dev/sdb
mkfs -t ext4 -b 4096 -O sparse_super,filetype,resize_inode,dir_index -q /dev/sdb1
# formating /dev/sdc
mkfs -t ext2 -b 4096 -O sparse_super,filetype,resize_inode,dir_index -q /dev/sdc1
3. Run the deployment

Now you can deploy you environment with this custom operation:

Terminal.png frontend:
kadeploy3 debian11-min --custom-steps ./custom-partitioning.yml
Warning.png Warning

In some cases you should increase the step timeout (for some long formatting for example) see Advanced_Kadeploy#Adjusting timeout for some environments for details.

Note: Both partitions are not mounted on boot. To mount those partitions you should do:

Terminal.png NODE:
mkdir -p /media/data1
Terminal.png NODE:
mkdir /media/data2
Terminal.png NODE:
mount /dev/sdb1 /media/data1
Terminal.png NODE:
mount /dev/sdc1 /media/data2

Use a custom partitioning scheme

Example 1: Deploy on the whole disk

In this example, we will modify the deployment workflow to deploy the system on a unique disk partition ( '/' on sda1 )

1. Make the reservation in destructive mode

As you will change partitioning of the disk, you must tell to OAR that it should redeploy the node entirely after the reservation with the -t destructive parameter:

Terminal.png frontend:
oarsub -t deploy -t destructive -l nodes=1,walltime=2 -I
2. Describe the custom operations

After that you have to create a file that describe the custom operations you want to be performed during the deployment. In this example we will create our custom partitioning scheme and bypass some steps that are not necessary to deploy the system on a unique partition.

  • The operation description file (let's say custom-partitioning.yml) should look like something like this:
---
# Our custom steps should be performed during the SetDeploymentEnv macro-step
SetDeploymentEnv:
  # Custom partitioning step that is substituted to the create_partition_table micro-step
  create_partition_table:
    substitute:
      # We send a file on the node
      - action: send
        file: map.parted
        # The variable $KADEPLOY_TMP_DIR will be substituted by Kadeploy
        destination: $KADEPLOY_TMP_DIR
        name: send_partition_map
      # Then we execute the parted command using the previously sent file
      - action: exec
        name: partitioning
        # The variable $KADEPLOY_TMP_DIR will be substituted by Kadeploy
        command: parted -a optimal /dev/sda --script $(cat $KADEPLOY_TMP_DIR/map.parted)
# Hack to disable useless steps
  format_tmp_part:
    substitute:
      - action: exec
        name: remove_format_tmp_part_step
        command: /bin/true
  format_swap_part:
    substitute:
      - action: exec
        name: remove_format_swap_part_step
        command: /bin/true


  • The file map.parted, which will be passed to parted, will look like this:
mklabel gpt
mkpart KDPL_SYSTEM_disk0 ext4 0% 100%
toggle 1 boot
align-check optimal 1
Note.png Note

Kadeploy identify the partitions using GPT label. The form of the label should be KDPL_name_disk. So the label KDPL_SYSTEM_disk1 will be use on a deployment

Terminal.png frontend:
kadeploy3 -a custom-env.yml -b disk1 -p SYSTEM -k --custom-steps custom-partitioning.yml
The default disk on Grid'5000 is disk1 and the partition is DEPLOY, so by default, kadeploy look for the label KDPL_DEPLOY_disk0

3. Customize the environment's postinstall

In order for our new partitions to be mounted at boot time we will modify the Grid'5000 postinstall files.

  • Create and go in your public directory:
Terminal.png frontend:
mkdir public/custom-postinstall && cd public/custom-postinstall
  • Then decompress the postinstall archive:
Terminal.png frontend:
tar xzf /grid5000/postinstalls/g5k-postinstall.tgz
  • Add your custom /etc/fstab file in this directory, named fstab:
PARTLABEL=KDPL_SYSTEM_disk0     /          ext4    defaults 1      2

When you will pass "--fstab custom" option to the postinstall, it will copy this file in /etc/fstab

  • Regenerate the postinstall archive:
Terminal.png frontend:
tar -czvf ~/public/g5k-postinstall-custom.tgz *
  • Make some cleanup:
  • Create the environment's description file (let's say custom-env.dsc) based on the reference one:
    • use kaenv3 -p debian10-base to have an example of environment description.

Your custom-env.dsc should look like this:

--- 
name: custom-env
version: 1
arch: x86_64
description: Custom env based on Debian 10
author: me@domain.tld
visibility: shared
destructive: true
os: linux
image:
  file: server:///grid5000/images/debian10-x64-base-2020012812.tgz
  kind: tar
  compression: gzip
postinstalls:
- archive: http://public/~<login>/g5k-postinstall-custom.tgz
  compression: gzip
  script: g5k-postinstall --net debian --fstab custom
boot:
  kernel: "/vmlinuz"
  initrd: "/initrd.img"
filesystem: ext4
partition_type: 131
multipart: false
4. Run the deployment

Finally, we deploy our custom environment with your custom operations:

Terminal.png frontend:
kadeploy3 -a custom-env.dsc -p SYSTEM --custom-steps custom-partitioning.yml
Note.png Note

In some case you should increase the step timeout (for some long formatting for example) see Advanced_Kadeploy#Adjusting timeout for some environments for details.

Example 2: Deploy on multiple partitions

In this example, we will modify the deployment workflow: a different partition will be used for each of the /, /home, /opt and /tmp directories. Imagine that you want to make your own partitioning scheme like that:

Mount point Partition Disk space File System
swap SWAP 2G linux-swap
/ SYSTEM 18G ext4
/var VAR 30G ext4
/opt OPT 20G ext4
/tmp TMP everything else ext4

The four following sections describe how to perform such an operation.

1. Make the reservation in destructive mode

First of all, when you do your reservation, you must tell to OAR that it should redeploy the node entirely after the reservation with the -t destructive parameter:

Terminal.png frontend:
oarsub -t deploy -t destructive -l nodes=1,walltime=2 -I
2. Describe the custom operations

After that you have to create a file that describe the custom operations you want to be performed during the deployment. In our example we will first create apply our custom partitioning scheme, format the partition and the mount them.

  • The operation description file (let's say custom-partitioning.yml) should look like something like this:
---
# Our custom steps should be performed during the SetDeploymentEnv macro-step
SetDeploymentEnvUntrusted:
  # Custom partitioning step that is substituted to the create_partition_table micro-step
  create_partition_table:
    substitute:
      # We send a file on the node
      - action: send
        file: map.parted
        # The variable $KADEPLOY_TMP_DIR will be substituted by Kadeploy
        destination: $KADEPLOY_TMP_DIR 
        name: send_partition_map
      # Then we execute the parted command using the previously sent file
      - action: exec
        name: partitioning
        # The variable $KADEPLOY_TMP_DIR will be substituted by Kadeploy
        command: parted -a optimal /dev/sda --script $(cat $KADEPLOY_TMP_DIR/map.parted)
  # Custom format step, done after the format_deploy_part micro-step
  format_deploy_part:
    post-ops:
      # We run the script contained in the file 'format.sh'
      - action: run 
        name: format_partitions
        file: format.sh
  # Custom mount step, done after the mount_deploy_part micro-step
  mount_deploy_part:
    post-ops:
      # We run the script contained in the file 'format.sh'
      - action: run 
        name: mount_partitions
        file: mount.sh
Note.png Note

In order for Kadeploy to be able to perform the installation correctly, every partitions have to be mounted before the installation process which is done in the macro-step BroadcastEnv

  • The file map.parted will look like something like this:
mklabel gpt
u GB mkpart KDPL_SWAP_disk0 linux-swap 0% 2
u GB mkpart KDPL_SYSTEM_disk0 ext4 2 20
u GB mkpart KDPL_VAR_disk0 ext4 20 50
u GB mkpart KDPL_OPT_disk0 ext4 50 70
u GB mkpart KDPL_TMP_disk0 ext4 70 100%
toggle 2 boot
align-check optimal 1
align-check optimal 2
align-check optimal 3
align-check optimal 4
align-check optimal 5


Note.png Note

Kadeploy identify the partitions using GPT label. The form of the label should be KDPL_name_disk. So the label KDPL_SYSTEM_disk1 will be use on a deployment

Terminal.png frontend:
kadeploy3 -a custom-env.yml -b disk1 -p SYSTEM --custom-steps custom-partitioning.yml
The default disk on Grid'5000 is disk1 and the partition is DEPLOY, so by default, kadeploy look for the label KDPL_DEPLOY_disk0. The KDPL_SWAP_disk is also detected as the swap partition and will be formated by kadeploy when detected.

  • The file format.sh will look like something like this:
#!/bin/sh
set -e

mkfs_opts="sparse_super,filetype,resize_inode,dir_index"
ext4_blocksize="4096"

# / will be formated by Kadeploy since we will precise the -p SYSTEM option
# formating /var
mkfs -t ext4 -b ${ext4_blocksize} -O ${mkfs_opts} -q /dev/disk/by-partlabel/KDPL_VAR_disk0
# formating /opt
mkfs -t ext4 -b ${ext4_blocksize} -O ${mkfs_opts} -q /dev/disk/by-partlabel/KDPL_OPT_disk0
# formating /tmp
mkfs -t ext4 -b ${ext4_blocksize} -O ${mkfs_opts} -q /dev/disk/by-partlabel/KDPL_TMP_disk0
Note.png Note

When running a custom script, Kadeploy will export different variables, you can get a list of them by running "kadeploy -I".

  • The file mount.sh will look like something like this:
#!/bin/sh
set -e

# / will be mounted in ${KADEPLOY_ENV_EXTRACTION_DIR} by Kadeploy
# mount /var
mkdir ${KADEPLOY_ENV_EXTRACTION_DIR}/var
mount /dev/disk/by-partlabel/KDPL_VAR_disk0 ${KADEPLOY_ENV_EXTRACTION_DIR}/var/
# mount /opt
mkdir ${KADEPLOY_ENV_EXTRACTION_DIR}/opt
mount /dev/disk/by-partlabel/KDPL_OPT_disk0 ${KADEPLOY_ENV_EXTRACTION_DIR}/opt/
# mount /tmp
mkdir ${KADEPLOY_ENV_EXTRACTION_DIR}/tmp
mount /dev/disk/by-partlabel/KDPL_TMP_disk0 ${KADEPLOY_ENV_EXTRACTION_DIR}/tmp/


3. Customize the environment's postinstall

In order for our new partitions to be mounted at boot time we can modify the Grid'5000 postinstall files (this customization can also be done by adding another custom operation).

  • Create and go in a temporary directory:
Terminal.png frontend:
tmpdir=$(mktemp -d) && export tmpdir && pushd $tmpdir
  • Then decompress the postinstall archive:
Terminal.png frontend:
tar xzf /grid5000/postinstalls/g5k-postinstall.tgz
Note.png Note

We assume that the current shell is BASH, if not please replace the "export" instruction

  • Add your custom /etc/fstab file in this temporary directory, named fstab:
PARTLABEL=KDPL_SWAP_disk0       none          swap    sw       0      0
PARTLABEL=KDPL_VAR_disk0      /var          ext4    defaults 1      2
PARTLABEL=KDPL_OPT_disk0       /opt          ext4    defaults 1      2
PARTLABEL=KDPL_TMP_disk0       /tmp          ext4    defaults 1      2

/ will be added by Kadeploy since we will precise the -p SYSTEM option

  • Regenerate the postinstall archive:
Terminal.png frontend:
tar -czvf ~/g5k-postinstall-custom.tgz *
  • Make some cleanup:
Terminal.png frontend:
popd && rm -R $tmpdir
  • Create the environment's description file (let's say custom-env.yml) based on the reference one:
Terminal.png frontend:
kaenv3 -p debian11-base -u deploy | sed -e "s/archive:.*$/archive: \/home\/${USER}\/g5k-postinstall-custom.tgz/" -e 's/public/shared/' > custom-env.yml

and customize the custom-env.yml file to suit your needs (especially your archive path):

--- 
name: custom-env
version: 1
description: Custom env based on Debian 10
author: me@domain.tld
visibility: shared
destructive: true
os: linux
image:
  file: server:///grid5000/images/debian11-base-2021092316.tar.zst
  kind: tar
  compression: zstd
postinstalls:
- archive: /home/me/g5k-postinstall-custom.tgz
  compression: gzip
  script: g5k-postinstall --net debian --fstab custom
boot:
  kernel: "/vmlinuz"
  initrd: "/initrd.img"
filesystem: ext4
partition_type: 131
multipart: false
Warning.png Warning

Do not forget the --fstab custom option to g5k-postinstall.

4. Run the deployment

Finally, we deploy our custom environment with your custom operations:

Terminal.png frontend:
kadeploy3 -a custom-env.yml -p SYSTEM --custom-steps custom-partitioning.yml
Note.png Note

In some case you should increase the step timeout (for some long formatting for example) see Advanced_Kadeploy#Adjusting timeout for some environments for details.

Boot a custom kernel with kexec

In this example, we change some custom operations of the reboot workflow: our nodes will use kexec to start a custom kernel that we upload beforehand, instead of rebooting. This allows us to boot a custom kernel without deploying a whole system (just use the kareboot3 command, not the kadeploy3 one).

1. Make the reservation in destructive mode

First of all, when we do our reservation, we have to tell OAR that it must redeploy the node entirely after our reservation. For this, we use the -t destructive parameter:

Terminal.png frontend:
oarsub -t deploy -t destructive -l nodes=1,walltime=2 -p hercule -I
2. Describe the custom operations

Next, we must create a file that describes custom operations to perform during the reboot step.

In our example, we first send kernel and initrd files to the nodes, then run kexec using them. A operation description file (here custom-kexec-reboot.yml) to perform those actions looks like this:

---
Simple:
    reboot:
        pre-ops:
            - action: send
              file: /home/me/custom_kernel.vmlinuz
              destination: $KADEPLOY_TMP_DIR
              name: send_custom_kernel
            - action: send
              file: /home/me/custom_kernel.initrd.img
              destination: $KADEPLOY_TMP_DIR
              name: send_custom_initrd
        substitute:
            - action: exec
              name: kexec
              # nohup to be sure the script exit correctly before the kexec start
              command: kexec -l $KADEPLOY_TMP_DIR/custom_krenel.vmlinuz --initrd=$KADEPLOY_TMP_DIR/custom_kernel.initrd.img && nohup /bin/sh -c 'sleep 1; systemctl kexec' 1>/dev/null 2>/dev/null </dev/null &
3. Run the reboot

Finally, we call kareboot3 with our custom operations:

Terminal.png frontend:
kareboot3 simple --custom-steps custom-kexec-reboot.yml

Skip the first reboot/kexec

If you are already on the deployment kernel (by using as breakpoint or rebooting with kareboot3 -r deploy_env) you may want to skip the first reboot/kexec. This can be done by masking the reboot/kexec via custom operations. You need to be in a destructive job to be able to use custom operations (oarsub -t deploy -t destructive)).Write a file with your custom operation, eg custom-kexec-reboot.yml:

---
SetDeploymentEnv:
    reboot:
        substitute:
            - action: exec
              name: nothing
              command: 'true'
    kexec:
        substitute:
            - action: exec
              name: nothing
              command: 'true'

Start the deployment with our custom operations:

Terminal.png frontend:
kadeploy3 debian11-min --custom-steps custom-kexec-reboot.yml