Difference between revisions of "Status"

From Grid5000
(Jenkins)
(30 intermediate revisions by 9 users not shown)
Line 1: Line 1:
{{Maintainer|Sebastien Badia}}
 
{{Author|Florian Le Goff}}
 
 
{{Status|In production}}
 
{{Status|In production}}
 
{{Portal|User}}
 
{{Portal|User}}
 
{{Portal|Platform}}
 
{{Portal|Platform}}
  
= [https://www.grid5000.fr/status/ Current events] (maintenance, issues...) =
+
= [https://www.grid5000.fr/status/ Current platform events] (maintenance, outages, issues...) =
If you experience problems, please [https://www.grid5000.fr/status/ check the platform's administration schedule], where past, present, and future incidents (planned or not) are listed for all sites.
+
If you experience problems, please check '''[https://www.grid5000.fr/status/ the platform's operation schedule]''' ''(past, present, and future incidents, planned or not, are listed for all sites).''
  
= Monika =
 
[http://oar.imag.fr/ Monika] displays current and scheduled OAR jobs.
 
  
You can select an individual site or cluster:
+
For other long-running minor issues that may affect your experiment, check the list of known artifacts: '''[https://intranet.grid5000.fr/status/artifact/ Grid5000 Artifacts]''' ''(this list is also displayed when you connect to the frontends).''
 +
 
 +
= Resources reservations (OAR jobs) status =
 +
 
 
{|
 
{|
 +
|bgcolor="#aaaaaa" colspan="8"|
 +
'''Monika''' ''(current placement and queued jobs status)''
 
|-
 
|-
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Grenoble/monika.cgi Grenoble]
+
[https://intranet.grid5000.fr/oar/Grenoble/monika.cgi '''Grenoble''']
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Lille/monika.cgi Lille]
+
[https://intranet.grid5000.fr/oar/Lille/monika.cgi '''Lille''']
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Luxembourg/monika.cgi Luxembourg]
+
[https://intranet.grid5000.fr/oar/Luxembourg/monika.cgi '''Luxembourg''']
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Lyon/monika.cgi Lyon]
+
[https://intranet.grid5000.fr/oar/Lyon/monika.cgi '''Lyon''']
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Nancy/monika.cgi Nancy]
+
[https://intranet.grid5000.fr/oar/Nancy/monika.cgi '''Nancy''']<br>
 +
[https://intranet.grid5000.fr/oar/Nancy/monika-prod.cgi '''Nancy (production)''']
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Nancy/monika-prod.cgi Nancy (production)]
+
[https://intranet.grid5000.fr/oar/Nantes/monika.cgi '''Nantes''']
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Nantes/monika.cgi Nantes]
+
[https://intranet.grid5000.fr/oar/Rennes/monika.cgi '''Rennes''']
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Rennes/monika.cgi Rennes]
+
[https://intranet.grid5000.fr/oar/Sophia/monika.cgi '''Sophia''']
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
+
|-
[https://intranet.grid5000.fr/oar/Sophia/monika.cgi Sophia]
+
|bgcolor="#aaaaaa" colspan="8"|
|}
+
'''Drawgantt''' ''(past, current and future OAR jobs scheduling)''
 
+
|-
Or view the [https://www.grid5000.fr/gridstatus/oargridmonika.cgi global snapshot of the platform].
+
|bgcolor="#eeeeee" colspan="8"|
 
 
= Drawgantt =
 
[http://oar.imag.fr/ Drawgantt] displays past, current and future OAR jobs.
 
 
 
 
Default view:
 
Default view:
{|
 
 
|-
 
|-
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg/ Grenoble]
+
<big>'''Grenoble'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg/ '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-disks/ disks]<br>
 +
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-subnets/ subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-vlans/ vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg/ Lille]
+
<big>'''Lille'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg/ '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-disks/ disks]<br>
 +
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-subnets/ subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-vlans/ vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg/ Luxembourg]
+
<big>'''Luxembourg'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg/ '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg-subnets/ subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg-vlans/ vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg/ Lyon]
+
<big>'''Lyon'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg/ '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg-subnets/ subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg-vlans/ vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg/ Nancy]
+
<big>'''Nancy'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg/ '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-prod/ '''nodes (production)''']<br>
 +
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-disks/ disks]<br>
 +
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-subnets/ subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-vlans/ vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-prod/ Nancy (production)]
+
<big>'''Nantes'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg/ '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg-subnets/ subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg-vlans/ vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg/ Nantes]
+
<big>'''Rennes'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg/ '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-disks/ disks]<br>
 +
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-subnets/ subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-vlans/ vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg/ Rennes]
+
<big>'''Sophia'''</big><br>
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
+
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg/ Sophia]
+
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg-subnets/ subnets]<br>
|}
+
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg-vlans/ vlans]
 
+
|-
 
+
|bgcolor="#eeeeee" colspan="8"|
Forecast over 1 week:
+
Forecast view for 1 week:
{|
 
 
|-
 
|-
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg/?relative_start=-28800&relative_stop=604800&filter=all%20clusters&timezone=Europe/Paris&resource_base=host&scale=10 Grenoble]
+
<big>'''Grenoble'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-disks/?relative_start=-28800&relative_stop=604800 disks]<br>
 +
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg/?relative_start=-28800&relative_stop=604800&filter=all%20clusters&timezone=Europe/Paris&resource_base=host&scale=10 Lille]
+
<big>'''Lille'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-disks/?relative_start=-28800&relative_stop=604800 disks]<br>
 +
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg/?relative_start=-28800&relative_stop=604800&filter=all%20clusters&timezone=Europe/Paris&resource_base=host&scale=10 Luxembourg]
+
<big>'''Luxembourg'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg/?relative_start=-28800&relative_stop=604800&filter=all%20clusters&timezone=Europe/Paris&resource_base=host&scale=10 Lyon]
+
<big>'''Lyon'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg/?relative_start=-28800&relative_stop=604800&filter=all%20clusters&timezone=Europe/Paris&resource_base=host&scale=10 Nancy]
+
<big>'''Nancy'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-prod/?relative_start=-28800&relative_stop=604800 '''nodes (production)''']<br>
 +
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-disks/?relative_start=-28800&relative_stop=604800 disks]<br>
 +
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-prod/?relative_start=-28800&relative_stop=604800&filter=all%20clusters&timezone=Europe/Paris&resource_base=host&scale=10 Nancy (production)]
+
<big>'''Nantes'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg/?relative_start=-28800&relative_stop=604800&filter=all%20clusters&timezone=Europe/Paris&resource_base=host&scale=10 Nantes]
+
<big>'''Rennes'''</big><br>
 +
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
 +
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-disks/?relative_start=-28800&relative_stop=604800 disks]<br>
 +
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
 
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg/?relative_start=-28800&relative_stop=604800&filter=all%20clusters&timezone=Europe/Paris&resource_base=host&scale=10 Rennes]
+
<big>'''Sophia'''</big><br>
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
+
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg/?relative_start=-28800&relative_stop=604800&filter=all%20clusters&timezone=Europe/Paris&resource_base=host&scale=10 Sophia]
+
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
 +
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
 
|}
 
|}
 
 
Or view the [https://www.grid5000.fr/gridstatus/oargridgantt.cgi global grid Gantt diagram].
 
  
 
= Network Monitoring =
 
= Network Monitoring =
Line 94: Line 140:
  
 
Shows the current state of the optical links between the Grid'5000 10Gb-ready sites. A link painted black on the weathermap means that you won't be able to reach that site's nodes from the Grid'5000 internal network.
 
Shows the current state of the optical links between the Grid'5000 10Gb-ready sites. A link painted black on the weathermap means that you won't be able to reach that site's nodes from the Grid'5000 internal network.
 
== Historical network load ==
 
 
[http://pasillo.renater.fr/metrologie/GRID5000/ Grid'5000 Network monitoring] User:GRID PASS:5000 (courtesy of Renater)
 
 
This page gives you some nice graphs built from the SNMP counters of Renater switches.
 
 
These graphs show whether one or several experiments are creating congestion on a switch interface. This is useful if you experience odd behavior such as packet loss or abnormal delay. The EoMPLS graphs are outdated since Orsay and Bordeaux were migrated to Renater-5.
 
  
 
== Sites network traffic ==
 
== Sites network traffic ==
[https://intranet.grid5000.fr/supervision/grenoble/monitoring/network/last/minute/ Grenoble]
 
[https://intranet.grid5000.fr/supervision/lille/monitoring/network/last/minute/ Lille]
 
[https://intranet.grid5000.fr/supervision/lyon/monitoring/network/last/minute/ Lyon]
 
[https://intranet.grid5000.fr/supervision/luxembourg/monitoring/network/last/minute/ Luxembourg]
 
[https://intranet.grid5000.fr/supervision/nancy/monitoring/network/last/minute/ Nancy]
 
[https://intranet.grid5000.fr/supervision/nantes/monitoring/network/last/minute/ Nantes]
 
[https://intranet.grid5000.fr/supervision/rennes/monitoring/network/last/minute/ Rennes]
 
[https://intranet.grid5000.fr/supervision/sophia/monitoring/network/last/minute/ Sophia]
 
  
== Latency monitoring ==
+
A dashboard combining links and real-time data is available on the [https://intranet.grid5000.fr/net/Lille/ Grid'5000 Backbone Network Monitoring] page.
 
 
[https://intranet.grid5000.fr/smokeping/Lille/?target=G5KCore Grid'5000 Interlink Latency] (please check  [http://oss.oetiker.ch/smokeping/doc/reading.en.html Reading the Graphs] if you are not used to Smokeping graphs).
 
 
 
We use ping (ICMP) probes to monitor the backbone's latency. Every 300 seconds, Smokeping forks Fping on each site to ping the adminfront of every site (including its own) 20 times. Each site pings all the others, which gives us a full view of the network from each site.
 
 
 
Each host is pinged 20 times (similar to ping -c 20) in order to study:
 
 
 
* '''Packet Loss''' (PL) occurring on the link. Packet loss can be caused by saturation of a link: when the buffers of a router or switch cannot store any more packets, the packets are dropped. It may also be caused by faulty transmission equipment somewhere, or by link flapping (up/down/up/down...) due to a faulty component.
 
 
 
* '''The variance between pings'''. If the first reply comes back in 10ms, the second in 50ms, and the third in 5ms, something unusual is going on (network overload or routing issues). The graph plots the median value, then draws smoke below and above that point. If the median is 20ms, the minimum 10ms, and the maximum 80ms, you will see a colored point at 20ms with smoke extending from 10ms to 80ms.
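
The two quantities described above can be illustrated with a small Python sketch. It is not Smokeping's own code: the function name <code>summarize_round</code> and the use of <code>None</code> to mark a lost ping are assumptions made for this example. It only shows how packet loss, median, minimum, and maximum could be derived from one round of 20 pings.

```python
from statistics import median
from typing import Optional, Sequence


def summarize_round(rtts_ms: Sequence[Optional[float]]) -> dict:
    """Summarise one probing round; None marks a lost ping (assumption)."""
    received = [r for r in rtts_ms if r is not None]
    sent = len(rtts_ms)
    # Packet loss: share of pings that never came back.
    loss_pct = 100.0 * (sent - len(received)) / sent if sent else 0.0
    if not received:
        return {"loss_pct": loss_pct, "median_ms": None,
                "min_ms": None, "max_ms": None}
    return {
        "loss_pct": loss_pct,
        "median_ms": median(received),  # the colored point on the graph
        "min_ms": min(received),        # lower edge of the smoke
        "max_ms": max(received),        # upper edge of the smoke
    }


# Hypothetical round of 20 pings: 18 replies and 2 losses.
sample = [10.0, 80.0] + [20.0] * 16 + [None, None]
summary = summarize_round(sample)
print(summary)
```

With this made-up sample, the median is 20ms and the smoke would span 10ms to 80ms, matching the example in the text, with 10% packet loss.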
 
  
 +
= Power Monitoring =
  
A dashboard combining links and real-time data is also available on the [https://intranet.grid5000.fr/net/Lille/ Grid'5000 Backbone Network Monitoring] page.
 
  
= Power Monitoring =
+
* [https://intranet.grid5000.fr/supervision/grenoble/monitoring/energy/last/minute/ Grenoble]
* [https://intranet.grid5000.fr/supervision/lille/monitoring/energy/last/minute/ Lille]
 
 
* [https://intranet.grid5000.fr/supervision/lyon/monitoring/energy/last/minute/ Lyon]
 
* [https://intranet.grid5000.fr/supervision/lyon/monitoring/energy/last/minute/ Lyon]
 
* [https://intranet.grid5000.fr/supervision/nancy/monitoring/energy/last/minute/ Nancy]
 
* [https://intranet.grid5000.fr/supervision/nancy/monitoring/energy/last/minute/ Nancy]
 
* [https://intranet.grid5000.fr/supervision/rennes/monitoring/energy/last/minute/ Rennes]
 
* [https://intranet.grid5000.fr/supervision/rennes/monitoring/energy/last/minute/ Rennes]
 +
 +
Clusters where kwapi is available are listed on this page: https://intranet.grid5000.fr/jenkins-status/?job=test_kwapi
  
 
= Usage statistics =
 
= Usage statistics =
[https://intranet.grid5000.fr/stats/ Site availability] over time gathers a lot of statistics about raw usage of the platform
+
[https://intranet.grid5000.fr/stats/ Stats5k] gathers a lot of statistics about the testbed.
 
 
= Kaspied =
 
Kaspied is a statistics tool that shows who is using the platform.
 
 
 
https://www.grid5000.fr/kaspied/
 
  
 
= Ganglia =
 
= Ganglia =
[http://ganglia.sourceforge.net/ Ganglia] provides resource usage metrics (memory, CPU, jobs...) for individual sites or the whole grid.
+
[http://ganglia.sourceforge.net/ Ganglia] provides resource usage metrics (memory, CPU, jobs...) for individual sites or the whole platform.
  
 
https://intranet.grid5000.fr/ganglia/
 
https://intranet.grid5000.fr/ganglia/
  
= Nagios =
+
= Jenkins =
[http://www.nagios.org/ Nagios] monitors critical grid servers and services and automatically reports incidents and failures.
+
Most Grid'5000 services are tested using Jenkins. A summary of the results, indicating platform health, is available at:
  
[https://intranet.grid5000.fr/nagios/ Grid'5000 Nagios monitoring page.]
+
https://intranet.grid5000.fr/jenkins-status/

Revision as of 17:46, 18 January 2019


Current platform events (maintenance, outages, issues...)

If you experience problems, please check the platform's operation schedule (past, present, and future incidents, planned or not, are listed for all sites).


For other long-running minor issues that may affect your experiment, check the list of known artifacts: Grid5000 Artifacts (this list is also displayed when you connect to the frontends).

Resources reservations (OAR jobs) status

Monika (current placement and queued jobs status)

Grenoble

Lille

Luxembourg

Lyon

Nancy
Nancy (production)

Nantes

Rennes

Sophia

Drawgantt (past, current and future OAR jobs scheduling)

Default view:

Grenoble
nodes
disks
subnets
vlans

Lille
nodes
disks
subnets
vlans

Luxembourg
nodes
subnets
vlans

Lyon
nodes
subnets
vlans

Nancy
nodes
nodes (production)
disks
subnets
vlans

Nantes
nodes
subnets
vlans

Rennes
nodes
disks
subnets
vlans

Sophia
nodes
subnets
vlans

Forecast view for 1 week:

Grenoble
nodes
disks
subnets
vlans

Lille
nodes
disks
subnets
vlans

Luxembourg
nodes
subnets
vlans

Lyon
nodes
subnets
vlans

Nancy
nodes
nodes (production)
disks
subnets
vlans

Nantes
nodes
subnets
vlans

Rennes
nodes
disks
subnets
vlans

Sophia
nodes
subnets
vlans
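
The forecast links on this page point to the Drawgantt pages with the query parameters relative_start=-28800 and relative_stop=604800. The following Python sketch shows how such a link could be built for another site or view. It assumes that both values are offsets in seconds from the current time (8 hours back and 7 days ahead), which is inferred from the numbers rather than stated on this page. The helper name <code>drawgantt_url</code> is hypothetical.

```python
from urllib.parse import urlencode

BASE = "https://intranet.grid5000.fr/oar"
HOUR = 3600
DAY = 24 * HOUR


def drawgantt_url(site: str, view: str = "drawgantt-svg",
                  hours_back: int = 8, days_ahead: int = 7) -> str:
    """Build a Drawgantt link; offsets are assumed to be seconds from now."""
    query = urlencode({
        "relative_start": -hours_back * HOUR,  # -28800 for the default 8 hours
        "relative_stop": days_ahead * DAY,     # 604800 for the default 7 days
    })
    return f"{BASE}/{site}/{view}/?{query}"


url = drawgantt_url("Lille", "drawgantt-svg-disks")
print(url)
```

With the defaults, this reproduces the one-week forecast link used for the Lille disks view. Other windows can be tried by changing hours_back and days_ahead, provided the assumption about the units holds.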

Network Monitoring

Backbone network status and load

Grid'5000 Weathermap (courtesy of Renater)

Shows the current state of the optical links between the Grid'5000 10Gb-ready sites. A link painted black on the weathermap means that you won't be able to reach that site's nodes from the Grid'5000 internal network.

Sites network traffic

A dashboard combining links and real-time data is available on the Grid'5000 Backbone Network Monitoring page.

Power Monitoring

Clusters where kwapi is available are listed on this page: https://intranet.grid5000.fr/jenkins-status/?job=test_kwapi

Usage statistics

Stats5k gathers a lot of statistics about the testbed.

Ganglia

Ganglia provides resource usage metrics (memory, CPU, jobs...) for individual sites or the whole platform.

https://intranet.grid5000.fr/ganglia/

Jenkins

Most Grid'5000 services are tested using Jenkins. A summary of the results, indicating platform health, is available at:

https://intranet.grid5000.fr/jenkins-status/