Status: Difference between revisions

From Grid5000
(114 intermediate revisions by 25 users not shown)
{{Maintainer|Pierre Neyron}}
{{Status|In production}}
{{Portal|User}}
{{Portal|Platform}}


If you experience problems, please [[Current_events|check the grid administration schedule]], which lists past, present and future incidents (planned or not) for all sites:
= [https://www.grid5000.fr/status/ Current platform events] (maintenance, outages, issues...) =
If you experience problems, please check '''[https://www.grid5000.fr/status/ the platform's operation schedule]''' ''(past, present and future incidents, planned or not, are listed for all sites).''


= Monika =
[http://oar.imag.fr/ Monika] displays current and scheduled [[OAR]] jobs.


You can select an individual site or cluster:
For other long-running minor issues that may affect your experiment, check the list of known artifacts: '''[https://intranet.grid5000.fr/status/artifact/ Grid5000 Artifacts]''' ''(this list is also displayed when you connect to the frontends).''
 
= Resource reservations (OAR jobs) status =
 
{|
{|
|bgcolor="#aaaaaa" colspan="8"|
'''Monika''' ''(current placement and queued jobs status)''
|-
|-
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://helpdesk.grid5000.fr/oar/Bordeaux/monika.cgi Bordeaux]
[https://intranet.grid5000.fr/oar/Grenoble/monika.cgi '''Grenoble''']
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Lille/monika.cgi '''Lille''']
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
Grenoble:
[https://intranet.grid5000.fr/oar/Luxembourg/monika.cgi '''Luxembourg''']
* [https://helpdesk.grid5000.fr/oar/Grenoble/monika.cgi Idpot]
* [http://ita.imag.fr/cgi-bin/monika.cgi ICluster2]
* Icare
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://helpdesk.grid5000.fr/oar/Lille/monika.cgi Lille]
[https://intranet.grid5000.fr/oar/Lyon/monika.cgi '''Lyon''']
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
Lyon:
[https://intranet.grid5000.fr/oar/Nancy/monika.cgi '''Nancy''']<br>
* [https://helpdesk.grid5000.fr/oar/Lyon/monika-capricorne.cgi capricorne]
[https://intranet.grid5000.fr/oar/Nancy/monika-prod.cgi '''Nancy (production)''']
* [https://helpdesk.grid5000.fr/oar/Lyon/monika-sagittaire.cgi sagittaire (testing)]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://helpdesk.grid5000.fr/oar/Nancy/monika.cgi Nancy]
[https://intranet.grid5000.fr/oar/Nantes/monika.cgi '''Nantes''']
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://helpdesk.grid5000.fr/oar/Orsay/monika.cgi Orsay]
[https://intranet.grid5000.fr/oar/Rennes/monika.cgi '''Rennes''']
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
Rennes:
[https://intranet.grid5000.fr/oar/Sophia/monika.cgi '''Sophia''']
* [https://helpdesk.grid5000.fr/oar/Rennes/monika-paraci.cgi paraci]
|-
* [https://helpdesk.grid5000.fr/oar/Rennes/monika-parasol.cgi parasol]
|bgcolor="#aaaaaa" colspan="8"|
* [https://helpdesk.grid5000.fr/oar/Rennes/monika-paravent.cgi paravent]
'''Drawgantt''' ''(past, current and future OAR jobs scheduling)''
* [https://helpdesk.grid5000.fr/oar/Rennes/monika-tartopom.cgi tartopom]
|-
|bgcolor="#eeeeee" colspan="8"|
Default view:
|-
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
Sophia:
<big>'''Grenoble'''</big><br>
* [https://helpdesk.grid5000.fr/oar/Sophia/monika-azur.cgi Azur]
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg/ '''nodes''']<br>
* [https://helpdesk.grid5000.fr/oar/Sophia/monika-helios.cgi Helios]
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-disks/ disks]<br>
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-vlans/ vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://helpdesk.grid5000.fr/oar/Toulouse/monika.cgi Toulouse]
<big>'''Lille'''</big><br>
|}
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg/ '''nodes''']<br>
 
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-disks/ disks]<br>
Or view the [https://www.grid5000.fr/gridstatus/oargridmonika.cgi global snapshot of the grid].
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-subnets/ subnets]<br>
 
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-vlans/ vlans]
= DrawOARGantt =
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[http://oar.imag.fr/ DrawOARGantt] displays past, current and scheduled [[OAR]] jobs.
<big>'''Luxembourg'''</big><br>
 
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg/ '''nodes''']<br>
You can select an individual site or cluster:
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg-subnets/ subnets]<br>
{|
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg-vlans/ vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Lyon'''</big><br>
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg-vlans/ vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Nancy'''</big><br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-prod/ '''nodes (production)''']<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-disks/ disks]<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-vlans/ vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Nantes'''</big><br>
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg-vlans/ vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Rennes'''</big><br>
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-disks/ disks]<br>
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-vlans/ vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Sophia'''</big><br>
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg-vlans/ vlans]
|-
|bgcolor="#eeeeee" colspan="8"|
Forecast view for 1 week:
|-
|-
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://helpdesk.grid5000.fr/oar/Bordeaux/DrawOARGantt.pl Bordeaux]
<big>'''Grenoble'''</big><br>
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-disks//?relative_start=-28800&relative_stop=604800 disks]<br>
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
Grenoble:
<big>'''Lille'''</big><br>
* [https://helpdesk.grid5000.fr/oar/Grenoble/DrawOARGantt.pl Grenoble]
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
* [http://ita101.imag.fr/cgi-bin/DrawOARGantt.pl ICluster2]
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-disks/?relative_start=-28800&relative_stop=604800 disks]<br>
* Icare
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://helpdesk.grid5000.fr/oar/Lille/DrawOARGantt.pl Lille]
<big>'''Luxembourg'''</big><br>
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://helpdesk.grid5000.fr/oar/Lyon/DrawOARGantt.pl Lyon]
<big>'''Lyon'''</big><br>
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://helpdesk.grid5000.fr/oar/Nancy/DrawOARGantt.pl Nancy]
<big>'''Nancy'''</big><br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-prod/?relative_start=-28800&relative_stop=604800 '''nodes (production)''']<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-disks/?relative_start=-28800&relative_stop=604800 disks]<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://helpdesk.grid5000.fr/oar/Orsay/DrawOARGantt.pl Orsay]
<big>'''Nantes'''</big><br>
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
Rennes:
<big>'''Rennes'''</big><br>
*[https://helpdesk.grid5000.fr/oar/Rennes/DrawOARGantt-paraci.pl paraci]
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
* [https://helpdesk.grid5000.fr/oar/Rennes/DrawOARGantt-parasol.pl parasol]
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-disks/?relative_start=-28800&relative_stop=604800 disks]<br>
* [https://helpdesk.grid5000.fr/oar/Rennes/DrawOARGantt-paravent.pl paravent]
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
* [https://helpdesk.grid5000.fr/oar/Rennes/DrawOARGantt-tartopom.pl tartopom]
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
Sophia:
<big>'''Sophia'''</big><br>
* [https://helpdesk.grid5000.fr/oar/Sophia/DrawOARGantt-azur.pl Azur]
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
* [https://helpdesk.grid5000.fr/oar/Sophia/DrawOARGantt-helios.pl Helios]
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
[https://helpdesk.grid5000.fr/oar/Toulouse/DrawOARGantt.pl Toulouse]
|}
|}


Or view the [https://www.grid5000.fr/gridstatus/DrawGridGantt.cgi global grid Gantt diagram].
= Network Monitoring =
== Backbone network status and load ==
[http://pasillo.renater.fr/weathermap/weathermap_g5k.html Grid'5000 Weathermap] (courtesy of Renater)


= Kaspied =
Shows the current state of the optical links between the Grid'5000 10Gb-ready sites. A link painted in black on the weathermap means that you won't be able to access that site's nodes from the Grid'5000 internal network.
Ka-spied is a statistics tool that shows who is using the platform.


https://www.grid5000.fr/kaspied
== Sites network traffic ==


= GridPrems =
A dashboard combining links and real-time data is available on the [https://intranet.grid5000.fr/net/Lille/ Grid'5000 Backbone Network Monitoring] page.
[http://gforge.inria.fr/projects/gridprems GridPrems] is an alternate grid scheduler running at the Rennes site.


https://helpdesk.grid5000.fr/gridprems/
= Power Monitoring =


= Ganglia =
[http://ganglia.sourceforge.net/ Ganglia] provides resource usage metrics (memory, CPU, jobs...) for individual sites or the whole grid.


https://helpdesk.grid5000.fr/ganglia/
* [https://intranet.grid5000.fr/supervision/grenoble/monitoring/energy/last/minute/ Grenoble]
* [https://intranet.grid5000.fr/supervision/lyon/monitoring/energy/last/minute/ Lyon]
* [https://intranet.grid5000.fr/supervision/nancy/monitoring/energy/last/minute/ Nancy]
* [https://intranet.grid5000.fr/supervision/rennes/monitoring/energy/last/minute/ Rennes]


= Nagios =
Clusters where kwapi is available are listed on this page: https://intranet.grid5000.fr/jenkins-status/?job=test_kwapi
[http://www.nagios.org/ Nagios] monitors critical grid servers and services and automatically reports incidents and failures.


https://helpdesk.grid5000.fr/nagios/
= Usage statistics =
[https://intranet.grid5000.fr/stats/ Stats5k] gathers a lot of statistics about the testbed.


= Global Grid'5000 status geographical map =
= Ganglia =
[[Image:G5K_geographical_map.jpg|thumbnail|250px|right|Geographical map screenshot]]
[http://ganglia.sourceforge.net/ Ganglia] provides resource usage metrics (memory, CPU, jobs...) for individual sites or the whole platform.
{{Warning|text=This is still an experimental tool (i.e. unstable).}}
This tool places all Grid'5000 sites geographically, and displays their current status.


http://www.lri.fr/~herault/G5K/action.html
https://intranet.grid5000.fr/ganglia/


= Renater4 monitoring map =
= Jenkins =
Most Grid'5000 services are tested using Jenkins. A summary of the results, indicating platform health, is available at:


Renater provides this monitoring page for Renater4 network:
https://intranet.grid5000.fr/jenkins-status/
[http://www.renater.fr/Metrologie/map-Renater4 Renater4 monitoring]

Revision as of 16:46, 18 January 2019


Current platform events (maintenance, outages, issues...)

If you experience problems, please check the platform's operation schedule (past, present and future incidents, planned or not, are listed for all sites).


For other long-running minor issues that may affect your experiment, check the list of known artifacts: Grid5000 Artifacts (this list is also displayed when you connect to the frontends).

Resource reservations (OAR jobs) status

Monika (current placement and queued jobs status)

Grenoble

Lille

Luxembourg

Lyon

Nancy
Nancy (production)

Nantes

Rennes

Sophia

Drawgantt (past, current and future OAR jobs scheduling)

Default view:

Grenoble: nodes, disks, subnets, vlans
Lille: nodes, disks, subnets, vlans
Luxembourg: nodes, subnets, vlans
Lyon: nodes, subnets, vlans
Nancy: nodes, nodes (production), disks, subnets, vlans
Nantes: nodes, subnets, vlans
Rennes: nodes, disks, subnets, vlans
Sophia: nodes, subnets, vlans

Forecast view for 1 week:

Grenoble: nodes, disks, subnets, vlans
Lille: nodes, disks, subnets, vlans
Luxembourg: nodes, subnets, vlans
Lyon: nodes, subnets, vlans
Nancy: nodes, nodes (production), disks, subnets, vlans
Nantes: nodes, subnets, vlans
Rennes: nodes, disks, subnets, vlans
Sophia: nodes, subnets, vlans
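
The forecast links differ from the default-view links only by their query string, `relative_start=-28800&relative_stop=604800`. Assuming these values are seconds relative to the current time (an interpretation consistent with the "1 week" label, not something this page states), the window runs from 8 hours in the past to 7 days ahead. The sketch below rebuilds such a link; the helper name `drawgantt_url` and its parameters are hypothetical, and only the URL pattern is taken from the links on this page.

```python
from urllib.parse import urlencode

# Base path copied from the Drawgantt links listed on this page.
BASE = "https://intranet.grid5000.fr/oar"


def drawgantt_url(site, resource="nodes", hours_back=8, days_ahead=7):
    """Build a Drawgantt link for `site` with a relative time window.

    Hypothetical helper: `resource` is "nodes", "disks", "subnets",
    "vlans" or "prod", matching the path suffixes used on this page.
    """
    # The "nodes" view has no suffix; other views append "-<resource>".
    suffix = "" if resource == "nodes" else f"-{resource}"
    # Assumption: both offsets are expressed in seconds from now.
    query = urlencode({
        "relative_start": -hours_back * 3600,
        "relative_stop": days_ahead * 86400,
    })
    return f"{BASE}/{site}/drawgantt-svg{suffix}/?{query}"


print(drawgantt_url("Lille", "disks"))
```

With the default arguments this reproduces the one-week forecast links; other window sizes may work by changing `hours_back` and `days_ahead`, but that has not been checked against the service.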

Network Monitoring

Backbone network status and load

Grid'5000 Weathermap (courtesy of Renater)

Shows the current state of the optical links between the Grid'5000 10Gb-ready sites. A link painted in black on the weathermap means that you won't be able to access that site's nodes from the Grid'5000 internal network.

Sites network traffic

A dashboard combining links and real-time data is available on the Grid'5000 Backbone Network Monitoring page.

Power Monitoring

Clusters where kwapi is available are listed on this page: https://intranet.grid5000.fr/jenkins-status/?job=test_kwapi

Usage statistics

Stats5k gathers a lot of statistics about the testbed.

Ganglia

Ganglia provides resource usage metrics (memory, CPU, jobs...) for individual sites or the whole platform.

https://intranet.grid5000.fr/ganglia/

Jenkins

Most Grid'5000 services are tested using Jenkins. A summary of the results, indicating platform health, is available at:

https://intranet.grid5000.fr/jenkins-status/