Grid5000:Home: Difference between revisions

From Grid5000
No edit summary
 
(222 intermediate revisions by 13 users not shown)
Line 1: Line 1:
__NOTOC__ __NOEDITSECTION__
__NOTOC__ __NOEDITSECTION__
Welcome to '''Grid'5000 official web site'''
''5000 CPUs distributed in 9 sites for research in Grid Computing, eScience and Cyber-infrastructures''
{|width="95%"
{|width="95%"
|- valign="top"
|- valign="top"
|bgcolor="#f5fff5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#f5fff5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
==Latest news==
[[Image:g5k-backbone.png|thumbnail|260px|right|Grid'5000]]
'''Grid'5000 is a large-scale and flexible testbed for experiment-driven research in all areas of computer science, with a focus on parallel and distributed computing, including Cloud, HPC, Big Data and AI.'''


[[Image:Gdx.png|120px|right|GdX]]
Key features:
===[http://www.lri.fr/~gdx/index.php/GdX:News '''Grid eXplorer days''']===
* provides '''access to a large amount of resources''': 15000 cores, 800 compute-nodes grouped in homogeneous clusters, and featuring various technologies: PMEM, GPU, SSD, NVMe, 10G and 25G Ethernet, Infiniband, Omni-Path
October 12th and 13th, IDRIS, Orsay. Grid eXplorer users, take this opportunity to present your latest research results obtained using GdX.
* '''highly reconfigurable and controllable''': researchers can experiment with a fully customized software stack thanks to bare-metal deployment features, and can isolate their experiment at the networking layer
* '''advanced monitoring and measurement features for collecting network and power-consumption traces''', providing a deep understanding of experiments
* '''designed to support Open Science and reproducible research''', with full traceability of infrastructure and software changes on the testbed
* '''a vibrant community''' of 500+ users supported by a solid technical team


===[http://www2.lifl.fr/~melab/RECH/JourneesGrid5000Lille/journeesGrid5000LIlle.htm '''Journées Grid'5000 à Lille''']===
<br>
On October 30th and 31st, the Grid'5000 Lille site's days will take place on the USTL campus, Villeneuve d'Ascq. Project and platform presentations and tutorials will be offered.
Read more about our [[Team|teams]], our [[Publications|publications]], and the [[Grid5000:UsagePolicy|usage policy]] of the testbed. Then [[Grid5000:Get_an_account|get an account]], and learn how to use the testbed with our [[Getting_Started|Getting Started tutorial]] and the rest of our [[:Category:Portal:User|Users portal]].


[[Image:Logo sc06.gif|100px|left|SuperComputing 2006]]
<b>Grid'5000 is merging with [https://fit-equipex.fr FIT] to build the [http://www.silecs.net/ SILECS Infrastructure for Large-scale Experimental Computer Science]. Read [http://www.silecs.net/wp-content/uploads/2018/04/Desprez-SILECS.pdf an Introduction to SILECS] (April 2018)</b>
===Grid'5000@[http://sc06.supercomputing.org/ SC'06]===
''SuperComputing 2006'' will take place in Tampa, Florida (USA) from November 11th to 17th.


===Grid'5000@[http://paristic.loria.fr/ Paristic 2006]===
<br>
The 2006 edition of ''Paristic'' will be hosted by the [http://www.loria.fr LORIA] in Nancy, from November 22nd to 24th.
Recently published documents and presentations:
* [[Media:Grid5000.pdf|Presentation of Grid'5000]] (April 2019)
* [https://www.grid5000.fr/mediawiki/images/Grid5000_science-advisory-board_report_2018.pdf Report from the Grid'5000 Science Advisory Board (2018)]


[[Image:Plugtest2006.jpg|80px|right|PlugTest 2006]]
Older documents:
===[http://www.etsi.org/Plugtests/Upcoming/GRID2006/GRID2006.htm PlugTest 2006]===
* [https://www.grid5000.fr/slides/2014-09-24-Cluster2014-KeynoteFD-v2.pdf Slides from Frederic Desprez's keynote at IEEE CLUSTER 2014]
''GRIDS@work: CoreGRID Conference, Grid Plugtests and Contest'' will take place at ETSI Headquarters, Sophia-Antipolis, from November 27th to December 1st.
* [https://www.grid5000.fr/ScientificCommittee/SAB%20report%20final%20short.pdf Report from the Grid'5000 Science Advisory Board (2014)]


[[Image:LogoRENATERtd-s.jpg|100px|left]]
<br>
===[[Grid5000:Network|Grid'5000 network interlink migration toward Renater4]]===
Grid'5000 is supported by a scientific interest group (GIS) hosted by Inria and including CNRS, RENATER and several Universities as well as other organizations. Inria has been supporting Grid'5000 through ADT ALADDIN-G5K (2007-2013), ADT LAPLACE (2014-2016), and IPL [[Hemera|HEMERA]] (2010-2014).
The migration of [[Grid5000:Network|our network interlink]] is in progress. Rennes, Nancy, Toulouse, Lille, Grenoble and Sophia are now interconnected using the new infrastructure (''dark fiber''), and a '''10Gb link between Sophia, Rennes and Nancy''' is already in place.
 
[[Grid5000:News|read more news]]
|}
|}


<br>
<br>
==Grid'5000 at a glance==
{{#status:0|0|0|http://bugzilla.grid5000.fr/status/upcoming.json}}
[[Image:CarteSites.JPG|thumbnail|128px|right|Grid'5000 sites]]
<br>
* The '''Grid'5000''' project aims at building a '''highly reconfigurable, controllable and monitorable experimental Grid platform''' gathering '''9 sites''' geographically distributed in France, featuring a total of 5000 CPUs:
 
===Sites:===
== Random pick of publications ==
{|width="75%" cellspacing="3"  
{{#publications:}}
 
==Latest news==
<rss max=4 item-max-length="2000">https://www.grid5000.fr/rss/G5KNews.php</rss>
----
[[News|Read more news]]
 
=== Grid'5000 sites===
{|width="100%" cellspacing="3"  
|- valign="top"
|- valign="top"
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
* [[Bordeaux:Home|Bordeaux]]
* [[Grenoble:Home|Grenoble]]
* [[Grenoble:Home|Grenoble]]
* [[Lille:Home|Lille]]
* [[Lille:Home|Lille]]
* [[Luxembourg:Home|Luxembourg]]
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
* [[Lyon:Home|Lyon]]
* [[Lyon:Home|Lyon]]
* [[Nancy:Home|Nancy]]
* [[Nancy:Home|Nancy]]
* [[Orsay:Home|Orsay]]
* [[Nantes:Home|Nantes]]
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|width="33%" bgcolor="#f5f5f5" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
* [[Rennes:Home|Rennes]]
* [[Rennes:Home|Rennes]]
Line 56: Line 61:
|}
|}


* The main purpose of this platform is to serve as an experimental testbed for research in Grid Computing.
== Current funding ==
* This project is one initiative of the [http://www.recherche.gouv.fr/recherche/fns/grid.htm French ACI Grid] incentive (see below: Funding Institutions) which provides a large part of Grid'5000 funding on behalf of the French Ministry of Research & Education.
As from June 2008, Inria is the main contributor to [[Grid5000:Funding|Grid'5000 funding]].  
 
 
 
[[Image:Grid5000 software layers.png|thumb|left|Grid'5000 will allow Grid experiments France wide in all these software layers]]
* '''Grid'5000''' is a research effort developing a '''large-scale nationwide infrastructure for Grid research'''.
 
* '''17 [[Grid5000:Laboratories|laboratories]]''' are involved, nation wide, in the objective of providing the community of Grid researchers a testbed allowing experiments in all the software layers between the network protocols up to the applications.
 
 
 
The current plans are to assemble a physical platform featuring 9 local platforms (at least one cluster per site), each with 100 to a thousand PCs, connected by the [http://www.renater.fr RENATER] Education and Research Network.
 
All clusters will be connected to Renater with a 10Gb/s link (or at least 1 Gb/s, when 10Gb/s is not available yet).
 
 
 
This highly collaborative research effort is funded by the French Ministry of Education and Research, INRIA, CNRS, the universities of all sites and some regional councils.
 
 
 
==Rationale==
'''The foundations of Grid'5000''' have emerged from a thorough analysis and numerous discussions about methodologies used for scientific research in the Grid domain. A report presents the [http://www-sop.inria.fr/aci/grid/public/Library/rapport-grid5000-V3.pdf rationale for Grid'5000].
 
In addition to theory, simulators and emulators, there is a strong need for '''large scale testbeds''' where real life experimental conditions hold. '''The size of Grid'5000''', in terms of number of sites and number of CPUs per site, was established according to the scale of the experiments and the number of researchers involved in the project.
 
 
 
==Funding Institutions==
{|width="100%" cellspacing="3"
{|width="100%" cellspacing="3"
|-
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
===Ministère de l'Education, de la Jeunesse et de la Recherche===
[[Image:Logo-Ministere-Recherche.jpg]]
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
===ACI Grid===
[[Image:LogoACIGRID.jpg|350px]]
|-
|-
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
===INRIA===
===INRIA===
[[Image:Logo-inria.gif]]
[[Image:Logo_INRIA.gif|300px]]
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
===CNRS===
===CNRS===
[[Image:Logo-cnrs.gif]]
[[Image:CNRS-filaire-Quadri.png|125px]]
|-
|-
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
===Universities===
===Universities===
University of Paris Sud, Orsay<br/>
IMT Atlantique<br/>
University Joseph Fourier, Grenoble<br/>
Université Grenoble Alpes, Grenoble INP<br/>
University of Nice-Sophia Antipolis, Sophia Antipolis<br/>
Université Rennes 1, Rennes<br/>
University of Rennes 1, Rennes<br/>
Institut National Polytechnique de Toulouse / INSA / FERIA / Université Paul Sabatier, Toulouse<br/>
Institut National Polytechnique de Toulouse / INSA / FERIA / Université Paul Sabatier, Toulouse<br/>
University Bordeaux 1, Bordeaux<br/>
Université Bordeaux 1, Bordeaux<br/>
University Lille 1 / GENOPOLE, Lille<br/>
Université Lille 1, Lille<br/>
Ecole Normale Supérieure / MYRICOM, Lyon<br/>
École Normale Supérieure, Lyon<br/>
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
| width="50%" bgcolor="#f5f5f5" valign="top" align="center" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
===Regional councils===
===Regional councils===
Aquitaine<br/>
Auvergne-Rhône-Alpes<br/>
Bretagne<br/>
Bretagne<br/>
Champagne-Ardenne<br/>
Provence Alpes Côte d'Azur<br/>
Provence Alpes Côte d'Azur<br/>
Aquitaine<br/>
Hauts de France<br/>
Ile de France<br/>
Lorraine<br/>
Lorraine<br/>
===General Councils===
Alpes Maritimes
|}
|}

Latest revision as of 09:29, 26 October 2023

Grid'5000

Grid'5000 is a large-scale and flexible testbed for experiment-driven research in all areas of computer science, with a focus on parallel and distributed computing, including Cloud, HPC, Big Data and AI.

Key features:

  • provides access to a large amount of resources: 15000 cores, 800 compute-nodes grouped in homogeneous clusters, and featuring various technologies: PMEM, GPU, SSD, NVMe, 10G and 25G Ethernet, Infiniband, Omni-Path
  • highly reconfigurable and controllable: researchers can experiment with a fully customized software stack thanks to bare-metal deployment features, and can isolate their experiment at the networking layer
  • advanced monitoring and measurement features for collecting network and power-consumption traces, providing a deep understanding of experiments
  • designed to support Open Science and reproducible research, with full traceability of infrastructure and software changes on the testbed
  • a vibrant community of 500+ users supported by a solid technical team


Read more about our teams, our publications, and the usage policy of the testbed. Then get an account, and learn how to use the testbed with our Getting Started tutorial and the rest of our Users portal.

Grid'5000 is merging with FIT to build the SILECS Infrastructure for Large-scale Experimental Computer Science. Read an Introduction to SILECS (April 2018)


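Recently published documents and presentations:

  • Presentation of Grid'5000 (April 2019)
  • Report from the Grid'5000 Science Advisory Board (2018)

Older documents:

  • Slides from Frederic Desprez's keynote at IEEE CLUSTER 2014
  • Report from the Grid'5000 Science Advisory Board (2014)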


Grid'5000 is supported by a scientific interest group (GIS) hosted by Inria and including CNRS, RENATER and several Universities as well as other organizations. Inria has been supporting Grid'5000 through ADT ALADDIN-G5K (2007-2013), ADT LAPLACE (2014-2016), and IPL HEMERA (2010-2014).


Current status (at 2024-03-28 21:45): 1 current event, none planned


Random pick of publications

Five random publications that benefited from Grid'5000 (at least 2503 overall):

  • Tom Cornebize, Arnaud Legrand. Simulation-based Optimization and Sensibility Analysis of MPI Applications: Variability Matters. Journal of Parallel and Distributed Computing, 2022, 10.1016/j.jpdc.2022.04.002. hal-03141988v2
  • A Kishore Kumar, Shefali Waldekar, Md Sahidullah, Goutam Saha. Robust acoustic domain identification with its application to speaker diarization. International Journal of Speech Technology, 2022, 25 (December), pp.933-945. 10.1007/s10772-022-09990-9. hal-03719697
  • Igor Fontana de Nardin, Patricia Stolf, Stéphane Caux. Mixing Offline and Online Electrical Decisions in Data Centers Powered by Renewable Sources. 48th Annual Conference of the IEEE Industrial Electronics Society (IECON 2022), IEEE Industrial Electronics Society (IES), Oct 2022, Brussels, Belgium. pp.1-6, 10.1109/IECON49645.2022.9968999. hal-03841611
  • Safa Alsaidi, Miguel Couceiro, Esteban Marquer, Sophie Quennelle, Anita Burgun, et al. An analogy based framework for patient-stay identification in healthcare. ATA@ICCBR 2022 - Workshop Analogies: from Theory to Applications, Sep 2022, Nancy, France. hal-03763772
  • David Sierra González, Anshul Paigwar, Özgür Erkent, Christian Laugier. MultiLane: Lane Intention Prediction and Sensible Lane-Oriented Trajectory Forecasting on Centerline Graphs. ITSC 2022 - 25th IEEE International Conference on Intelligent Transportation Systems, Sep 2022, Macao, China. pp.1-8. hal-03790450


Latest news

Cluster "estats" is now in the default queue in Toulouse

We are pleased to announce that the estats cluster of Toulouse (the name refers to Pica d'Estats) is now available in the default queue.

As a reminder, estats is composed of 12 edge-class nodes powered by Nvidia AGX Xavier SoCs. Each node features:

  • 1 ARM64 CPU (Nvidia Carmel micro-arch) with 8 cores
  • 1 Nvidia GPU (Nvidia Volta micro-arch)
  • 32 GB RAM shared between CPU and GPU
  • 1 NVMe of 2TB
  • 1 Gbps NIC

Since it is not a cluster of server-class machines (unlike all other current Grid'5000 nodes), estats runs a different default system environment, but other common functionalities are the same (kadeploy etc., except kavlan, which is not supported yet).

For experiments, it is recommended to deploy Ubuntu L4T.

    More information in the Jetson page.

    The cluster was funded by a CNRS grant.

    -- Grid'5000 Team 9:51, March 6th 2024 (CEST)

    The big variant of Debian 12 "Bookworm" environments is ready for deployments

    We are pleased to inform you that the big variant of Debian 12 (Bookworm) environments is now supported for deployments in Grid'5000. Check `kaenv3 -l debian12%` for detailed information.

    Notably, the NVIDIA driver has been updated to version 535.129.03, and CUDA has been upgraded to version 12.2.2_535.104.05_linux for the amd64 architecture.

    The default environment available on nodes will continue to be debian11-std for the foreseeable future.

    Please refer to the updated wiki documentation¹ for guidance on Debian 12-min|nfs|big usage.

    ¹: https://www.grid5000.fr/w/Getting_Started#On_Grid.275000_reference_environments

    -- Grid'5000 Team 14:21, Jan 22nd 2024 (CEST)

    Cluster "montcalm" is now in the default queue in Toulouse

    We are pleased to announce that the "montcalm" cluster is now available in the default queue of the Toulouse site, which makes the site full-fledged again!

    This cluster consists of 10 HPE ProLiant DL360 Gen10+ nodes, each with 2 Intel Xeon Silver 4314 CPUs (16 cores per CPU), 256 GB of DDR4 RAM, and an 894 GB SSD.
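
    As a rough illustration of the scale these figures imply, the cluster-wide totals can be worked out as below. This is only a sketch: it assumes the CPU count and RAM size quoted above are per node, and the variable names are made up for the example.

```shell
#!/bin/sh
# Per-node figures quoted in the announcement (assumed to be per node).
nodes=10
cpus_per_node=2
cores_per_cpu=16
ram_gb_per_node=256

# Cluster-wide totals derived from the per-node figures.
total_cores=$((nodes * cpus_per_node * cores_per_cpu))
total_ram_gb=$((nodes * ram_gb_per_node))

echo "montcalm: $total_cores cores, $total_ram_gb GB RAM"
```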

    Jobs submitted on the Toulouse site will run by default on this cluster.

    Besides the "montcalm" cluster, the "edge-class" cluster "estats" is still available in the testing queue for now.

    In order to support the SLICES-FR project, the site infrastructure has been funded by CNRS/INS2I and the "montcalm" cluster has been funded by University Paul Sabatier (UT3).

    -- Grid'5000 Team 10:30, 18 Jan 2024 (CET)

    New "edge-computing"-class nodes in Toulouse's testing queue: cluster Estats with 12 Nvidia AGX Xavier SoCs

    A new cluster named "estats" is available in the testing queue of the Toulouse site, composed of 12 "Edge computing"-class nodes.

    Estats is composed of 12 Nvidia AGX Xavier SoCs¹. Each SoC features:

  • an ARM64 CPU (Nvidia Carmel micro-arch) with 8 cores
  • an Nvidia GPU (Nvidia Volta micro-arch)
  • 32 GB RAM shared between CPU and GPU
  • a 2 TB NVMe drive
  • a 1 Gbps NIC

    The 12 modules are packaged in a chassis manufactured by Connecttech⁴.

    Since it is not a cluster of server-class machines (unlike all current other Grid'5000 nodes), estats runs a different default system environment. This environment includes Nvidia's Linux for Tegra²³ overlay on top of the Grid'5000 standard environment. This means:

  • a Debian 11 system like other clusters, but
  • a special Linux kernel,
  • several specific tools and services,
  • and several incompatible tools (e.g. Cuda).

    This default environment does not include the required Tegra-specific version of Cuda.

    To benefit from the whole Nvidia stack, e.g. the Tegra-specific Cuda version and DL accelerator support, it is advised to use kadeploy to deploy the Nvidia-supported Ubuntu 20.04 OS with full L4T support on the node. You can use the ubuntul4t200435-big environment. E.g.:

    ftoulouse$ oarsub -q testing -t exotic -p estats -t deploy -l nodes=1 -I

    ftoulouse$ kadeploy3 ubuntul4t200435-big
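
    The two commands above can be wrapped in a small dry-run helper, sketched below, that only prints what would be run. The function names are hypothetical, and dropping `-q` for the default queue (while keeping `-t exotic`) is an assumption for the case where the cluster is no longer in the testing queue; check the site documentation before relying on it.

```shell
#!/bin/sh
# Hypothetical helper: prints (does not run) the oarsub command that reserves
# one estats node for deployment. $1 is the OAR queue; "testing" matches the
# announcement, "default" is an assumption for later use.
estats_oarsub_cmd() {
    queue="${1:-testing}"
    if [ "$queue" = "default" ]; then
        # No -q flag: the job goes to the site's default queue.
        echo "oarsub -t exotic -p estats -t deploy -l nodes=1 -I"
    else
        echo "oarsub -q $queue -t exotic -p estats -t deploy -l nodes=1 -I"
    fi
}

# Hypothetical helper: prints the kadeploy3 command with the environment
# name quoted in the announcement.
estats_kadeploy_cmd() {
    echo "kadeploy3 ubuntul4t200435-big"
}

# Dry run: show both steps in order, to be typed on the Toulouse frontend.
estats_oarsub_cmd testing
estats_kadeploy_cmd
```

    Printing the commands instead of executing them keeps the sketch safe to run anywhere; on the frontend, the printed lines would be entered by hand as in the example above.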

    This tutorial page explains how this ubuntul4t200435-big environment is built and how to...


    Read more news

    Grid'5000 sites

    Current funding

    As from June 2008, Inria is the main contributor to Grid'5000 funding.

    INRIA

    CNRS

    Universities

    IMT Atlantique
    Université Grenoble Alpes, Grenoble INP
    Université Rennes 1, Rennes
    Institut National Polytechnique de Toulouse / INSA / FERIA / Université Paul Sabatier, Toulouse
    Université Bordeaux 1, Bordeaux
    Université Lille 1, Lille
    École Normale Supérieure, Lyon

    Regional councils

    Aquitaine
    Auvergne-Rhône-Alpes
    Bretagne
    Champagne-Ardenne
    Provence Alpes Côte d'Azur
    Hauts de France
    Lorraine