Grid5000:Home
Grid'5000 is a precursor infrastructure of SLICES-RI, the Scientific Large-Scale Infrastructure for Computing/Communication Experimental Studies.
Grid'5000 is a large-scale and flexible testbed for experiment-driven research in all areas of computer science, with a focus on parallel and distributed computing, including Cloud, HPC, Big Data and AI.
Random pick of publications
Five random publications that benefited from Grid'5000 (at least 2758 overall):
- Daniel Rosendo, Marta Mattoso, Alexandru Costan, Renan Souza, Débora Pina, et al. ProvLight: Efficient Workflow Provenance Capture on the Edge-to-Cloud Continuum. Cluster 2023 - IEEE International Conference on Cluster Computing, Oct 2023, Santa Fe, New Mexico, United States. pp.1-13. hal-04161546
- Vladimir Ostapenco, Laurent Lefèvre, Anne-Cécile Orgerie, Benjamin Fichel. Exploring RAPL as a Power Capping Leverage for Power-Constrained Infrastructures. ICA3PP 2024 - 24th International Conference on Algorithms and Architectures for Parallel Processing, Oct 2024, Macau SAR, China. pp.1-10. hal-04742418
- Alan Lira Nunes, Cristina Boeres, Lúcia Maria de A. Drummond, Laércio Lima Pilla. Optimal Time and Energy-Aware Client Selection Algorithms for Federated Learning on Heterogeneous Resources. 2024 IEEE 36th International Symposium on Computer Architecture and High Performance Computing (SBAC-PAD), Nov 2024, Hilo, United States. pp.148-158, 10.1109/SBAC-PAD63648.2024.00021. hal-04690494v2
- Emile Cadorel, Dimitri Saingre. A Protocol to Assess the Accuracy of Process-Level Power Models. Cluster 2024, IEEE, Sep 2024, Kobe, Japan. hal-04720926
- Jolan Philippe, Antoine Omond, Hélène Coullon, Charles Prud'Homme, Issam Raïs. Fast Choreography of Cross-DevOps Reconfiguration with Ballet: A Multi-Site OpenStack Case Study. SANER 2024: IEEE International Conference on Software Analysis, Evolution and Reengineering, Mar 2024, Rovaniemi, Finland. pp.1-11, 10.1109/SANER60148.2024.00007. hal-04457484
Latest news
Cluster "estats" (Jetson nodes in Toulouse) is now kavlan capable
The network topology of the estats Jetson nodes can now be configured, just like for other clusters.
More info in the Network reconfiguration tutorial.
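As an illustration only, a minimal reservation sketch assuming the usual oarsub/kavlan workflow described in that tutorial (the kavlan type, cluster name and walltime below are example values):

oarsub -I -t deploy -l "{type='kavlan-local'}/vlan=1+{cluster='estats'}/nodes=1,walltime=1"
kavlan -V   # once inside the job, prints the VLAN id assigned to the reservation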
-- Grid'5000 Team 18:25, 21 May 2025 (CEST)
Cluster "chirop" is now in the default queue of Lille with energy monitoring.
Dear users,
We are pleased to announce that the Chirop[1] cluster of Lille is now available in the default queue.
This cluster consists of 5 HPE DL360 Gen10+ nodes.
Energy monitoring[2] is also available for this cluster[3], provided by newly installed Wattmetres (similar to those already available at Lyon).
This cluster was funded by CPER CornelIA.
[1] https://www.grid5000.fr/w/Lille:Hardware#chirop
[2] https://www.grid5000.fr/w/Energy_consumption_monitoring_tutorial
[3] https://www.grid5000.fr/w/Monitoring_Using_Kwollect#Metrics_available_in_Grid.275000
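As an example of retrieving these energy measurements through the Kwollect metrics API described in [3] (the node name, metric name and time range below are illustrative; run from a Grid'5000 frontend, or with API authentication from outside):

curl 'https://api.grid5000.fr/stable/sites/lille/metrics?nodes=chirop-1&metrics=wattmetre_power_watt&start_time=2025-05-05T10:00&end_time=2025-05-05T11:00'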
-- Grid'5000 Team 16:25, 05 May 2025 (CEST)
Change of default queue based on platform
Until now, Abaca (production) users had to specify `-q production` when reserving Abaca resources with OAR.
This is no longer necessary, as your default queue is now automatically selected based on the platform your default group is associated with, as shown at https://api.grid5000.fr/explorer/selector/ and in the message displayed when connecting to a frontend.
For SLICES-FR users, there is no change since the correct queue was already selected by default.
Additionally, the "production" queue has been renamed to "abaca", although "production" will continue to work for the foreseeable future.
Please note one case where this change may affect your workflow:
When an Abaca user reserves a resource from SLICES-FR (a non-production resource), they must explicitly specify the SLICES-FR queue, which is called "default", by adding `-q default` to the OAR command.
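For example (illustrative oarsub commands; resource requests and walltimes are placeholders):

# Abaca resources: the appropriate queue is now selected automatically, no -q option needed
oarsub -I -l host=1,walltime=1
# SLICES-FR resources reserved by an Abaca user: the queue must be given explicitly
oarsub -I -q default -l host=1,walltime=1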
-- Abaca Grid'5000 Team 10:10, 31 March 2025 (CEST)
Cluster "musa" with Nvidia H100 GPUs is available in production queue
We are pleased to announce that a new cluster named "musa" is available in the production queue¹ of Abaca.
This cluster has been funded by Inria DSI as a shared computing resource.
It is accessible to all Abaca users. Users affiliated with Inria have access with the same level of priority, regardless of the research center to which they are attached.
This cluster is composed of six HPE ProLiant DL385 Gen11 nodes², each with 2 AMD EPYC 9254 24-core processors, 512 GiB of RAM, 2 x Nvidia H100 NVL GPUs (94 GiB) with NVLink, one 6 TB NVMe SSD and a 25 Gbps Ethernet connection.
Please note that, in order to share it efficiently, walltime is limited.
The cluster "musa" is located at Sophia, hosted in the datacenter of Inria Centre at Université Côte d’Azur.
¹: https://api.grid5000.fr/explorer/hardware/sophia/#musa
²: the nodes are named musa-1, musa-2, ..., musa-6
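As an illustration, a possible interactive reservation of a single musa GPU in the production queue (the walltime value is an example and remains subject to the limits mentioned above):

oarsub -I -q production -p "cluster='musa'" -l gpu=1,walltime=2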
-- Grid'5000 Team 13:30, 19 March 2025 (CEST)
Grid'5000 sites
Current funding
- INRIA
- CNRS
- Universities: IMT Atlantique
- Regional councils: Aquitaine