Grid5000:Home: Difference between revisions
No edit summary |
Lpouilloux (talk | contribs) No edit summary |
||
(5 intermediate revisions by 4 users not shown) | |||
Line 3: | Line 3: | ||
|- valign="top" | |- valign="top" | ||
|bgcolor="#f5fff5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | |bgcolor="#f5fff5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | ||
[[Image: | [[Image:g5k-backbone.png|thumbnail|260px|right|Grid'5000]] | ||
'''Grid'5000 is a large-scale and flexible testbed for experiment-driven research in all areas of computer science, with a focus on parallel and distributed computing including Cloud, HPC and Big Data and AI.''' | '''Grid'5000 is a large-scale and flexible testbed for experiment-driven research in all areas of computer science, with a focus on parallel and distributed computing including Cloud, HPC and Big Data and AI.''' | ||
Key features: | Key features: | ||
* provides '''access to a large amount of resources''': | * provides '''access to a large amount of resources''': 15000 cores, 800 compute-nodes grouped in homogeneous clusters, and featuring various technologies: PMEM, GPU, SSD, NVMe, 10G and 25G Ethernet, Infiniband, Omni-Path | ||
* '''highly reconfigurable and controllable''': researchers can experiment with a fully customized software stack thanks to bare-metal deployment features, and can isolate their experiment at the networking layer | * '''highly reconfigurable and controllable''': researchers can experiment with a fully customized software stack thanks to bare-metal deployment features, and can isolate their experiment at the networking layer | ||
* '''advanced monitoring and measurement features for traces collection of networking and power consumption''', providing a deep understanding of experiments | * '''advanced monitoring and measurement features for traces collection of networking and power consumption''', providing a deep understanding of experiments | ||
Line 39: | Line 39: | ||
==Latest news== | ==Latest news== | ||
<rss max=4 item-max-length="2000">https://www.grid5000.fr/ | <rss max=4 item-max-length="2000">https://www.grid5000.fr/rss/G5KNews.php</rss> | ||
---- | ---- | ||
[[News|Read more news]] | [[News|Read more news]] | ||
Line 74: | Line 74: | ||
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | | width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"| | ||
===Universities=== | ===Universities=== | ||
IMT Atlantique<br/> | |||
Université Grenoble Alpes, Grenoble INP<br/> | Université Grenoble Alpes, Grenoble INP<br/> | ||
Université Rennes 1, Rennes<br/> | Université Rennes 1, Rennes<br/> |
Latest revision as of 10:29, 26 October 2023
Grid'5000 is a large-scale and flexible testbed for experiment-driven research in all areas of computer science, with a focus on parallel and distributed computing including Cloud, HPC and Big Data and AI. Key features:
Grid'5000 is merging with FIT to build the SILECS Infrastructure for Large-scale Experimental Computer Science. Read an Introduction to SILECS (April 2018)
Older documents:
|
Random pick of publications
Five random publications that benefited from Grid'5000 (at least 2758 overall):
- Miguel Couceiro, Esteban Marquer, Pierre-Alexandre Murena, Pierre Monnin. Workshop Analogies: from Theory to Applications (ATA@ICCBR 2022). Workshop Analogies: from Theory to Applications (ATA@ICCBR 2022), CEUR Workshop Proceedings, 3389, CEUR-WS.org, pp.3-103, 2023, Workshop Proceedings of the 30th International Conference on Case-Based Reasoning co-located with the 30th International Conference on Case-Based Reasoning (ICCBR 2022). hal-04392022 view on HAL pdf
- Quentin Guilloteau, Sophie Cerf, Raphaël Bleuse, Bogdan Robu, Eric Rutten. Under Control: A Control Theory Introduction for Computer Scientists. ACSOS 2024 - 5th IEEE International Conference on Autonomic Computing and Self-Organizing Systems (ACSOS 2024), Sep 2024, Aahrus, Denmark. pp.1-10. hal-04666859 view on HAL pdf
- Félix Gaschi, Xavier Fontaine, Parisa Rastin, Yannick Toussaint. Multilingual Clinical NER: Translation or Cross-lingual Transfer?. 5th Clinical Natural Language Processing Workshop, Jul 2023, Toronto, Canada. pp.289-311, 10.18653/v1/2023.clinicalnlp-1.34. hal-04193182 view on HAL pdf
- Lucian Trestioreanu, Flaviene Scheidt, Wazen M. Shbair, Jerome Francois, Damien Magoni, et al.. To Squelch or not to Squelch: Enabling Improved Message Dissemination on the XRP Ledger. 37th IEEE/IFIP Network Operations and Management Symposium (NOMS 2024), May 2024, Séoul, South Korea. hal-04621124 view on HAL pdf
- Pierre-François Gimenez, Jérôme Mengin. Learning Conditional Preference Networks: an Approach Based on the Minimum Description Length Principle. IJCAI 2024 - 33rd International Joint Conference on Artificial Intelligence, Aug 2024, Jeju, South Korea. pp.3395-3403, 10.24963/ijcai.2024/376. hal-04572196 view on HAL pdf
Latest news
Cluster chirop is now in the default queue of Lille with energy monitoring.
Dear users,
We are pleased to announce that the Chirop[1] cluster of Lille is now available in the default queue.
This cluster consists of 5 HPE DL360 Gen10+ nodes with:
Energy monitoring[2] is also available for this cluster[3], provided by newly installed Wattmetres (similar to those already available at Lyon).
This cluster was funded by CPER CornelIA.
[1] https://www.grid5000.fr/w/Lille:Hardware#chirop
[2] https://www.grid5000.fr/w/Energy_consumption_monitoring_tutorial [3] https://www.grid5000.fr/w/Monitoring_Using_Kwollect#Metrics_available_in_Grid.275000
-- Grid'5000 Team 16:25, 05 May 2025 (CEST)
Change of default queue based on platform
Until now, Abaca (production) users had to specify `-q production` when reserving Abaca resources with OAR.
This is no longer necessary as your default queue is now automatically selected based on the platform your default group is associated to, as shown at https://api.grid5000.fr/explorer/selector/ and in the message displayed when connecting to a frontend.
For SLICES-FR users, there is no change since the correct queue was already selected by default.
Additionally, the "production" queue has been renamed to "abaca", although "production" will continue to work for the foreseeable future.
Please note one case where this change may affect your workflow:
When an Abaca user reserves a resource from SLICES-FR (a non-production resource), they must explicitly specify they want to use the SLICES-FR queue, which is called "default", by adding `-q default` the OAR command.
-- Abaca Grid'5000 Team 10:10, 31 March 2025 (CEST)
Cluster "musa" with Nvidia H100 GPUs is available in production queue
We are pleased to announce that a new cluster named "musa" is available in the production queue¹ of Abaca.
This cluster has been funded by Inria DSI as a shared computing resource.
It is accessible to all Abaca users. Users affiliated with Inria have access with the same level of priority, regardless of the research center to which they are attached.
This cluster is composed of six HPE Proliant DL385 Gen11 nodes² with 2 AMD EPYC 9254 24-Core Processor, 512 GiB of RAM, 2 x Nvidia H100 NVL (94 GiB) with NVLink, one 6 TB SSD NVME and 25 Gbps Ethernet Connexion
Please note that in order to share it efficiently, walltime is limited:
The cluster "musa" is located at Sophia, hosted in the datacenter of Inria Centre at Université Côte d’Azur.
¹: https://api.grid5000.fr/explorer/hardware/sophia/#musa
²: the nodes are named musa-1, musa-2,.., musa-6
-- Grid'5000 Team 13:30, 19 March 2025 (CEST)
Cluster "Hydra" is now in the testing queue in Lyon
We are pleased to announce that the hydra[1] cluster of Lyon is now available in the testing queue.
Hydra is a cluster composed of 4 NVIDIA Grace-Hopper servers[2].
Each node features:
Due to its bleeding edge hardware, usual Grid'5000 environments are not supported by default for this cluster.
(Hydra requires system environments featuring a Linux kernel >= 6.6). The default system on the hydra nodes is based on Debian11, but **does not provide functional GPU**. However, users may deploy the ubuntugh2404-arm64-big environment, which is similar to official Nvidia image provided for this machine and provides GPU support.
To submit a job on this cluster, the following command may be used:
oarsub -q testing -t exotic -p hydra
This cluster is funded by INRIA and by Laboratoire de l'Informatique du Parallélisme with ENS Lyon support.
[1] Hydra is the largest of the modern constellations according to Wikipedia: https://en.wikipedia.org/wiki/Hydra_(constellation)
[2] https://developer.nvidia.com/blog/nvidia-grace-hopper-superchip-architecture-in-depth/
-- Grid'5000 Team 16:10, 11 March 2025 (CEST)
Grid'5000 sites
Current funding
As from June 2008, Inria is the main contributor to Grid'5000 funding.
INRIA |
CNRS |
UniversitiesIMT Atlantique |
Regional councilsAquitaine |