Current platform events (maintenance, outages, issues...)
If you experience problems, please check the platform's operation schedule (Past, present and future incidents (planned or not...) are notified for all sites).
For other long running minor issue that may affect your experiment, you can check the list of known artifacts : Grid5000 Artifacts (this list is also displayed when you connect on frontends).
Resources reservations (OAR) status
Drawgantt (past, current and future OAR jobs scheduling)
Forecast view for 1 week:
Monika (current placement and queued jobs status)
Backbone network status and load
Grid'5000 Weathermap (courtesy of Renater)
Shows the actual state of the opticals links between the Grid'5000 10Gb-ready sites. A link painted in black on the weathermap means that you won't be able to access this site nodes from the Grid'5000 internal network.
Sites network traffic
A dashboard combining links and real-time data is available on the Grid'5000 Backbone Network Monitoring page.
Ganglia provides resources usage metrics (memory, cpu, jobs...) for individual sites or the whole platform.
Stats5k gathers a lot of statistics about the testbed.
Jenkins tests most of Grid'5000 services. The web interface provides a summary of results, indicating platform health. Detailed logs are not normally available to users, but access can be requested if needed.
Last generated from the Grid'5000 Reference API on 2021-04-12 (commit 7be20839fc)