Grid5000:Home

From Grid5000
Revision as of 09:29, 26 October 2023 by Lpouilloux (talk | contribs)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigation Jump to search
Grid'5000

Grid'5000 is a large-scale and flexible testbed for experiment-driven research in all areas of computer science, with a focus on parallel and distributed computing including Cloud, HPC and Big Data and AI.

Key features:

  • provides access to a large amount of resources: 15000 cores, 800 compute-nodes grouped in homogeneous clusters, and featuring various technologies: PMEM, GPU, SSD, NVMe, 10G and 25G Ethernet, Infiniband, Omni-Path
  • highly reconfigurable and controllable: researchers can experiment with a fully customized software stack thanks to bare-metal deployment features, and can isolate their experiment at the networking layer
  • advanced monitoring and measurement features for traces collection of networking and power consumption, providing a deep understanding of experiments
  • designed to support Open Science and reproducible research, with full traceability of infrastructure and software changes on the testbed
  • a vibrant community of 500+ users supported by a solid technical team


Read more about our teams, our publications, and the usage policy of the testbed. Then get an account, and learn how to use the testbed with our Getting Started tutorial and the rest of our Users portal.

Grid'5000 is merging with FIT to build the SILECS Infrastructure for Large-scale Experimental Computer Science. Read an Introduction to SILECS (April 2018)


Recently published documents and presentations:

Older documents:


Grid'5000 is supported by a scientific interest group (GIS) hosted by Inria and including CNRS, RENATER and several Universities as well as other organizations. Inria has been supporting Grid'5000 through ADT ALADDIN-G5K (2007-2013), ADT LAPLACE (2014-2016), and IPL HEMERA (2010-2014).


Current status (at 2023-11-30 21:56): 1 current events, None planned (details)


Random pick of publications

Five random publications that benefited from Grid'5000 (at least 2486 overall):

  • Tomer Laor, Naif Mehanna, Antonin Durey, Vitaly Dyadyuk, Pierre Laperdrix, et al.. DRAWNAPART: A Device Identification Technique based on Remote GPU Fingerprinting. Network and Distributed System Security Symposium, Feb 2022, San Diego, United States. 10.14722/ndss.2022.24093. hal-03526240 view on HAL pdf
  • Ilias Benjelloun, Efoevi Angelo Koudou, Bart Lamiroy. Convolutional network fabric pruning with label noise. 2022. hal-03569057 view on HAL pdf
  • Alexandre Bettinger, Armelle Brun, Anne Boyer. Influence indépendante de l'exploration et de l'exploitation : le cas des systèmes de recommandation par métaheuristiques. CNIA 2022 - Conférence Nationale en Intelligence Artificielle, Jun 2022, Saint Etienne, France. hal-03659318 view on HAL pdf
  • Kadir Korkmaz, Joachim Bruneau-Queyreix, Sonia Ben Mokhtar, Laurent Réveillère. ALDER: Unlocking blockchain performance by multiplexing consensus protocols. 2022 IEEE 21st International Symposium on Network Computing and Applications (NCA), Dec 2022, Boston, United States. pp.9-18, 10.1109/NCA57778.2022.10013556. hal-03966159 view on HAL pdf
  • Ali Tehranijamsaz, Mihail Popov, Akash Dutta, Emmanuelle Saillard, Ali Jannesari. Learning Intermediate Representations using Graph Neural Networks for NUMA and Prefetchers Optimization. IPDPS 2022 - 36th IEEE International Parallel & Distributed Processing Symposium, May 2022, Lyon / Virtual, France. hal-03603118 view on HAL pdf


Latest news

Rss.svgNew "edge-computing"-class nodes in Toulouse's testing queue: cluster Estats with 12 Nvidia AGX Xavier SoCs

A new cluster named "estats" is available in the testing queue of the Toulouse site, composed of 12 "Edge computing"-class nodes.

Estats is composed of 12 Nvidia AGX Xavier SoCs¹. Each SoC features:

  • a ARM64 CPU (Nvidia Carmel micro-arch) with 8 cores
  • a Nvidia GPU (Nvidia Volta micro-arch)
  • 32 GB RAM shared between CPU and GPU
  • a 2TB NVMe
  • 1 Gbps NIC
  • The 12 modules are packaged in a chassis manufactured by Connecttech³.

    Since it is not a cluster of server-class machines (unlike all current other Grid'5000 nodes), estats runs a different default system environment. This environment includes Nvidia's Linux for Tegra² overlay on top of the Grid'5000 standard environment. This means:

  • a Debian 11 system like other clusters, but
  • a special Linux kernel,
  • several specific tools and services,
  • and several incompatible tools (e.g. Cuda).
  • This default environment does not include the required Tegra-specific version of Cuda.

    In order to benefit from the whole Nvidia stack with e.g. the specific Cuda version and DL accelerators support for Nvidia Tegra, it is advised to deploy on the node the Nvidia-supported Ubuntu 20.04 OS with the full L4T support, using kadeploy. You can use the ubuntu2004-nfsl4t environment. E.g.:

    ftoulouse$ oarsub -q testing -t exotic -p estats -t deploy -l nodes=1 -I

    ftoulouse$ kadeploy3 ubuntu2004-nfsl4t

    This tutorial page explains how this ubuntu2004-nfsl4t environment is built and how...

    Rss.svgDebian 12 "Bookworm" environments - min and nfs variants are ready for deployments

    We are pleased to announce that Debian 12 (Bookworm) environments' min and nfs variants are now supported for deployments in Grid'5000. See `kaenv3 -l debian12%` for details.

    The default environment available on nodes will remain the same (debian11-std) for some time to come.

    New features and changes in Debian 12 are described in: https://www.debian.org/releases/bookworm/amd64/release-notes/ch-whats-new.en.html.

    The wiki documentation¹ has been updated to take into account Debian 12-min|nfs usage.

    Debian12-bookworm big will be available later.

    ¹: https://www.grid5000.fr/w/Getting_Started#On_Grid.275000_reference_environments

    -- Grid'5000 Team 16:11, Oct 16th 2023 (CEST)

    Rss.svgNew cluster “roazhon4” available in Rennes

    We have the pleasure to announce a new cluster named “roazhon4” available in the production queue.

    It features 1 Dell PowerEdge R7525 with 2 AMD EPYC 7H12 (Zen 2), x86_64, 64 cores/CPU 512 GiB of RAM, one 480 GB HDD SATA and one 2.0TB HDD SAS

    This node comes from the migration of igrida00-04 node from Igrida cluster at the Inria Rennes center.

    See our hardware page for details about this cluster: https://www.grid5000.fr/w/Rennes:Hardware#roazhon4

    About the name: Roazhon[1] is the name of Rennes city in Breton

    ¹: https://fr.wiktionary.org/wiki/Roazhon

    -- Grid'5000 Team 14:00, Sep 25th 2023 (CET)

    Rss.svgEnd of support of Xen environments

    Deployment statistics showed that Xen Kadeploy environments were rarely used, so we decided to end support for them, as they add a significant maintenance load.

    The environments that were published up to now remain available for deployment (see /grid5000/README.unmaintained-envs), and the support remains in Kadeploy, but no future update to other environments will be made.

    Documentation has also been archived in https://www.grid5000.fr/w/Unmaintained:Virtualization_in_Grid%275000_with_Xen

    -- Grid'5000 Team 14:00, Aug 28th 2023 (CEST)


    Read more news

    Grid'5000 sites

    Current funding

    As from June 2008, Inria is the main contributor to Grid'5000 funding.

    INRIA

    Logo INRIA.gif

    CNRS

    CNRS-filaire-Quadri.png

    Universities

    IMT Atlantique
    Université Grenoble Alpes, Grenoble INP
    Université Rennes 1, Rennes
    Institut National Polytechnique de Toulouse / INSA / FERIA / Université Paul Sabatier, Toulouse
    Université Bordeaux 1, Bordeaux
    Université Lille 1, Lille
    École Normale Supérieure, Lyon

    Regional councils

    Aquitaine
    Auvergne-Rhône-Alpes
    Bretagne
    Champagne-Ardenne
    Provence Alpes Côte d'Azur
    Hauts de France
    Lorraine