Difference between revisions of "Grid5000:Home"
Line 7: | Line 7: | ||
Key features: | Key features: | ||
− | * provides '''access to a large amount of resources''': 12000 cores, 800 compute-nodes grouped in homogeneous clusters, and featuring various technologies: GPU, SSD, NVMe, 10G Ethernet, Infiniband, | + | * provides '''access to a large amount of resources''': 12000 cores, 800 compute-nodes grouped in homogeneous clusters, and featuring various technologies: GPU, SSD, NVMe, 10G and 25G Ethernet, Infiniband, Omni-Path |
* '''highly reconfigurable and controllable''': researchers can experiment with a fully customized software stack thanks to bare-metal deployment features, and can isolate their experiment at the networking layer | * '''highly reconfigurable and controllable''': researchers can experiment with a fully customized software stack thanks to bare-metal deployment features, and can isolate their experiment at the networking layer | ||
* '''advanced monitoring and measurement features for traces collection of networking and power consumption''', providing a deep understanding of experiments | * '''advanced monitoring and measurement features for traces collection of networking and power consumption''', providing a deep understanding of experiments |
Revision as of 09:05, 9 November 2018
Grid'5000 is a large-scale and versatile testbed for experiment-driven research in all areas of computer science, with a focus on parallel and distributed computing including Cloud, HPC and Big Data. Key features:
Older documents:
|
Random pick of publications
Five random publications that benefited from Grid'5000 (at least 2144 overall):
- Ahmed Amamou, Martin Camey, Christophe Cérin, Jonathan Rivalan, Julien Sopena. Resources management for controlling dynamic loads in clouds environments. The Wolphin project experience. Research Report Université Sorbonne Paris Nord; Sorbonne Université. 2020. hal-02481264 view on HAL pdf
- Baptiste Jonglez, Sinan Birbalta, Martin Heusse. Persistent DNS connections for improved performance. NETWORKING 2019 - IFIP Networking 2019, May 2019, Warsaw, Poland. pp.1-2. hal-02149978 view on HAL pdf
- Jad Darrous, Shadi Ibrahim. Enabling Data Processing under Erasure Coding in the Fog. ICPP 2019 - 48th International Conference on Parallel Processing, Aug 2019, Kyoto, Japan. pp.1. hal-02388835 view on HAL pdf
- Nathalie Bertrand, Igor Konnov, Marijana Lazic, Josef Widder. Verification of Randomized Consensus Algorithms under Round-Rigid Adversaries. CONCUR 2019 - 30th International Conference on Concurrency Theory, Aug 2019, Amsterdam, Netherlands. pp.1-16, 10.4230/LIPIcs.CONCUR.2019.33. hal-02191348 view on HAL pdf
- Nicolas Turpault, Romain Serizel, Emmanuel Vincent. Semi-supervised triplet loss based learning of ambient audio embeddings. ICASSP 2019, May 2019, Brighton, United Kingdom. hal-02025824 view on HAL pdf
Latest news
New Grid'5000 API's documentation and specification
Grid'5000 API's documentation has been updated. Before this update, the documentation contained both the specification and tutorials of the API (with some parts also present in the wiki).
To be more consistent, https://api.grid5000.fr/doc/ provides now only the specification (HTTP paths, parameters, payload, …). All tutorials were moved (along with being updated) to the Grid'5000's wiki.
The new API specification can be viewed with two tools: The first one allows to read the specification and find information ; the second one allows to discover the API thanks to a playground.
Please note that the specification may contain errors. Please report any of such errors to Support Staff.
-- Grid'5000 Team 14:30, January 11th 2021 (CET)
Important changes in the privileges levels of users
Each Grid'5000 user is a member of at least one granting access group, which depends on their situation (location, laboratory, ...).
Each group is given a privilege level (bronze, silver, gold), depending on how the related organization is involved in Grid'5000's development and support.
Until now, however, these levels had no impact on how Grid'5000 could be used.
Starting from December 10th, 2020, each user will be granted different usages on the testbed depending on their privileges level. In particular:
- While every level continues to give access to the Grid'5000 default queue (most of Grid'5000 resources) ;
- Access to the production and besteffort queues will only be granted to silver and gold levels.
The complete description of each level of privileges is available here.
The privilege level of the groups a user is a member of is shown in the "group" tab of the management interface.
Note that if a user is a member of several groups, one is set as default and is implicitly used when submitting jobs.
But the "--project" OAR option can also set explicitly which group the job should use. For instance:
oarsub
-I -q production --project=myothergroup
Do not hesitate to contact the Support Staff for any questions related to the privilege levels.
-- Grid'5000 Team 15:30, December 8st 2020 (CET)
Reminder: Testing phase of the new monitoring service named Kwollect
As a reminder, the testing phase of Kwollect, the new monitoring solution for Grid'5000, is still ongoing.
Some new features are available since the last announcement :
- Support for Prometheus metrics
- Basic visualization dashboard
- Fine-tuning of on-demand metrics
- Ability to push your own metrics
See: Monitoring Using Kwollect
Do not hesitate to give us some feedback!
Kwollect is intended to replace the legacy monitoring systems, Kwapi and Ganglia, in the (hopefully) near future.
-- Grid'5000 Team 09:00, December 1st 2020 (CET)
New IBM POWER8 cluster "drac" available for beta testing in Grenoble
We are happy to announce that a new cluster "drac" is available in the testing queue in Grenoble.
The cluster has 12 "Minsky" nodes from IBM, more precisely "Power Systems S822LC for HPC".
Each node has 2x10 POWER8 cores, 4 Tesla P100 GPU, and 128 GB of RAM. The GPUs are directly interconnected with a NVLINK fabric. Support for Infiniband 100G and kavlan is planned to be added soon.
This is the first cluster in Grid'5000 using the POWER architecture, and we are also aware of some stability issues related to GPUs, hence the "beta testing" status for now.
Feedback is highly welcome on performance and stability, as well as on our software environment for this new architecture.
Note that, since the CPU architecture is not x86, you need to explicitly ask for an "exotic" job type when reserving the nodes. For instance, to get a single GPU:
grenoble
$oarsub
-I -qtesting
-texotic
-l gpu=1
More information on the hardware is available at Grenoble:Hardware
Acknowledgment: This cluster was donated by GENCI, many thanks to them. It was formerly known as the Ouessant platform of Genci's Cellule de Veille Technologique.
-- Grid'5000 Team 15:50, November 26th 2020 (CET)
Grid'5000 sites
Current funding
As from June 2008, Inria is the main contributor to Grid'5000 funding.
INRIA |
CNRS |
UniversitiesUniversité Grenoble Alpes, Grenoble INP |
Regional councilsAquitaine |