Status: Difference between revisions

From Grid5000
(162 intermediate revisions by 27 users not shown)
__NOTOC__
{{Status|In production}}
= Grid maintenance work =
{{Portal|User}}
First of all, check the [[Current_events|schedules and reports of incidents or other sources of unavailability]] of nodes, listed for all sites.
{{Portal|Platform}}
 
= [https://www.grid5000.fr/status/ Current platform events] (maintenance, outages, issues...) =
If you experience problems, please check '''[https://www.grid5000.fr/status/ the platform's operation schedule]''' ''(past, present and future incidents, planned or not, are reported for all sites).''


= Ganglia =
Ganglia provides information on resource usage (memory, cpu, jobs, ...)


https://helpdesk.grid5000.fr/ganglia/
For other long-running minor issues that may affect your experiment, check the list of known artifacts: '''[https://intranet.grid5000.fr/status/artifact/ Grid5000 Artifacts]''' ''(this list is also displayed when you connect to the frontends).''


= OAR =
= Resources reservations (OAR jobs) status =
OAR shows the reservation of the various nodes for running jobs.


== Viewing current reservations/jobs ==
{|
A [https://frontal38.imag.fr/cgi-bin/oargridmonika.cgi global] view is available, as well as per-site views:
|bgcolor="#aaaaaa" colspan="8"|
{| width="100%"
'''Monika''' ''(current placement and queued jobs status)''
|-
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Grenoble/monika.cgi '''Grenoble''']
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Lille/monika.cgi '''Lille''']
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Luxembourg/monika.cgi '''Luxembourg''']
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Lyon/monika.cgi '''Lyon''']
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Nancy/monika.cgi '''Nancy''']<br>
[https://intranet.grid5000.fr/oar/Nancy/monika-prod.cgi '''Nancy (production)''']
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Nantes/monika.cgi '''Nantes''']
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Rennes/monika.cgi '''Rennes''']
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
[https://intranet.grid5000.fr/oar/Sophia/monika.cgi '''Sophia''']
|-
|bgcolor="#aaaaaa" colspan="8"|
'''Drawgantt''' ''(past, current and future OAR jobs scheduling)''
|-
|bgcolor="#eeeeee" colspan="8"|
Default view:
|-
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Grenoble'''</big><br>
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-disks/ disks]<br>
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-vlans/ vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Lille'''</big><br>
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-disks/ disks]<br>
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-vlans/ vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Luxembourg'''</big><br>
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg-vlans/ vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Lyon'''</big><br>
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg-vlans/ vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Nancy'''</big><br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-prod/ '''nodes (production)''']<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-disks/ disks]<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-vlans/ vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Nantes'''</big><br>
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg-vlans/ vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Rennes'''</big><br>
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-disks/ disks]<br>
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-vlans/ vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Sophia'''</big><br>
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg/ '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg-subnets/ subnets]<br>
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg-vlans/ vlans]
|-
|bgcolor="#eeeeee" colspan="8"|
Forecast view for 1 week:
|-
|-
| valign="top" width="25%"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
* [https://helpdesk.grid5000.fr/oar/Bordeaux/monika.cgi Bordeaux]
<big>'''Grenoble'''</big><br>
* [https://helpdesk.grid5000.fr/oar/Grenoble/monika.cgi Grenoble]
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
| valign="top" width="25%"|
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-disks//?relative_start=-28800&relative_stop=604800 disks]<br>
* [https://helpdesk.grid5000.fr/oar/Lille/monika.cgi Lille]
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
* [https://helpdesk.grid5000.fr/oar/Lyon/monika.cgi Lyon]
[https://intranet.grid5000.fr/oar/Grenoble/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
| valign="top" width="25%"|
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
* [https://helpdesk.grid5000.fr/oar/Orsay/monika.cgi Orsay]
<big>'''Lille'''</big><br>
* [https://helpdesk.grid5000.fr/oar/Rennes/monika.cgi Rennes]
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
| valign="top" width="25%"|
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-disks/?relative_start=-28800&relative_stop=604800 disks]<br>
* [https://helpdesk.grid5000.fr/oar/Sophia/monika.cgi Sophia]
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
* [https://helpdesk.grid5000.fr/oar/Toulouse/monika.cgi Toulouse]
[https://intranet.grid5000.fr/oar/Lille/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Luxembourg'''</big><br>
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
[https://intranet.grid5000.fr/oar/Luxembourg/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Lyon'''</big><br>
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
[https://intranet.grid5000.fr/oar/Lyon/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Nancy'''</big><br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-prod/?relative_start=-28800&relative_stop=604800 '''nodes (production)''']<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-disks/?relative_start=-28800&relative_stop=604800 disks]<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
[https://intranet.grid5000.fr/oar/Nancy/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Nantes'''</big><br>
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
[https://intranet.grid5000.fr/oar/Nantes/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Rennes'''</big><br>
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-disks/?relative_start=-28800&relative_stop=604800 disks]<br>
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
[https://intranet.grid5000.fr/oar/Rennes/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|bgcolor="#ffffff" valign="top" style="border:1px solid #cccccc;padding:1em;padding-top:0.5em;"|
<big>'''Sophia'''</big><br>
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg/?relative_start=-28800&relative_stop=604800 '''nodes''']<br>
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg-subnets/?relative_start=-28800&relative_stop=604800 subnets]<br>
[https://intranet.grid5000.fr/oar/Sophia/drawgantt-svg-vlans/?relative_start=-28800&relative_stop=604800 vlans]
|}
|}
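
The forecast links above differ from the default Drawgantt links only by their query string. As a rough sketch, and assuming that <code>relative_start</code> and <code>relative_stop</code> are offsets in seconds relative to the current time (an interpretation inferred from the values -28800, i.e. 8 hours back, and 604800, i.e. 7 days ahead, not from Drawgantt documentation), such a link could be built as follows. The helper name <code>drawgantt_url</code> and its parameters are hypothetical and only serve the illustration.

```python
from urllib.parse import urlencode

# Base path shared by the Drawgantt links listed on this page.
BASE = "https://intranet.grid5000.fr/oar"


def drawgantt_url(site, resource="nodes", hours_back=8, days_ahead=7):
    """Build a Drawgantt link for a site (hypothetical helper).

    resource is "nodes", "prod", "disks", "subnets" or "vlans"; not every
    site offers every view (see the table above).
    Assumption: relative_start / relative_stop are second offsets from now.
    """
    # The "nodes" view has no suffix; other views append "-<resource>".
    suffix = "" if resource == "nodes" else f"-{resource}"
    query = urlencode({
        "relative_start": -hours_back * 3600,   # 8 h back -> -28800
        "relative_stop": days_ahead * 86400,    # 7 days ahead -> 604800
    })
    return f"{BASE}/{site}/drawgantt-svg{suffix}/?{query}"


if __name__ == "__main__":
    print(drawgantt_url("Lille"))
    print(drawgantt_url("Nancy", "disks"))
```

With the default arguments this reproduces the one-week forecast links of the table; other windows can be tried by changing <code>hours_back</code> and <code>days_ahead</code>, provided the assumption about the parameters holds.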


== Past and upcoming reservations ==
= Network Monitoring =
A [https://frontal38.imag.fr/cgi-bin/DrawGridGantt.cgi global] view is available, as well as per-site views:
== Backbone network status and load ==
{| width="100%"
[http://pasillo.renater.fr/weathermap/weathermap_g5k.html Grid'5000 Weathermap] (courtesy of Renater)
|-
 
| valign="top" width="25%"|
Shows the current state of the optical links between the Grid'5000 10Gb-ready sites. A link drawn in black on the weathermap means that you won't be able to access that site's nodes from the Grid'5000 internal network.
* [https://helpdesk.grid5000.fr/oar/Bordeaux/DrawOARGantt.pl Bordeaux]
 
* [https://helpdesk.grid5000.fr/oar/Grenoble/DrawOARGantt.pl Grenoble]
== Sites network traffic ==
| valign="top" width="25%"|
 
* [https://helpdesk.grid5000.fr/oar/Lille/DrawOARGantt.pl Lille]
A dashboard combining links and real-time data is available on the [https://intranet.grid5000.fr/net/Lille/ Grid'5000 Backbone Network Monitoring] page.
* [https://helpdesk.grid5000.fr/oar/Lyon/DrawOARGantt.pl Lyon]
 
| valign="top" width="25%"|
= Power Monitoring =
* [https://helpdesk.grid5000.fr/oar/Orsay/DrawOARGantt.pl Orsay]
 
* [https://helpdesk.grid5000.fr/oar/Rennes/DrawOARGantt.pl Rennes]
| valign="top" width="25%"|
* [https://helpdesk.grid5000.fr/oar/Sophia/DrawOARGantt.pl Sophia]
* [https://helpdesk.grid5000.fr/oar/Toulouse/DrawOARGantt.pl Toulouse]
|}


= GridPrems =
* [https://intranet.grid5000.fr/supervision/grenoble/monitoring/energy/last/minute/ Grenoble]
https://helpdesk.grid5000.fr/gridprems/
* [https://intranet.grid5000.fr/supervision/lyon/monitoring/energy/last/minute/ Lyon]
* [https://intranet.grid5000.fr/supervision/nancy/monitoring/energy/last/minute/ Nancy]
* [https://intranet.grid5000.fr/supervision/rennes/monitoring/energy/last/minute/ Rennes]


= Nagios =
Clusters where kwapi is available are listed on this page: https://intranet.grid5000.fr/jenkins-status/?job=test_kwapi
Nagios tracks incidents on the network and on the various machines of the grid.


https://helpdesk.grid5000.fr/nagios/
= Usage statistics =
[https://intranet.grid5000.fr/stats/ Stats5k] gathers a lot of statistics about the testbed.


= Experimental visualization tools =
= Ganglia =
[http://ganglia.sourceforge.net/ Ganglia] provides resource usage metrics (memory, cpu, jobs...) for individual sites or the whole platform.


Warning! Experimental = unstable ... (in particular, the URL has already changed several times!)
https://intranet.grid5000.fr/ganglia/


https://helpdesk.grid5000.fr/map/map.html
= Jenkins =
(open https://helpdesk.grid5000.fr/map/datafeed.php to update the data)
Most Grid'5000 services are tested using Jenkins. A summary of the results, indicating platform health, is available at:


[[Category:Tech]] [[Category:User]]
https://intranet.grid5000.fr/jenkins-status/

Revision as of 16:46, 18 January 2019


Current platform events (maintenance, outages, issues...)

If you experience problems, please check the platform's operation schedule (past, present and future incidents, planned or not, are reported for all sites).


For other long-running minor issues that may affect your experiment, check the list of known artifacts: Grid5000 Artifacts (this list is also displayed when you connect to the frontends).

Resources reservations (OAR jobs) status

Monika (current placement and queued jobs status)

Grenoble

Lille

Luxembourg

Lyon

Nancy
Nancy (production)

Nantes

Rennes

Sophia

Drawgantt (past, current and future OAR jobs scheduling)

Default view:

Grenoble
nodes
disks
subnets
vlans

Lille
nodes
disks
subnets
vlans

Luxembourg
nodes
subnets
vlans

Lyon
nodes
subnets
vlans

Nancy
nodes
nodes (production)
disks
subnets
vlans

Nantes
nodes
subnets
vlans

Rennes
nodes
disks
subnets
vlans

Sophia
nodes
subnets
vlans

Forecast view for 1 week:

Grenoble
nodes
disks
subnets
vlans

Lille
nodes
disks
subnets
vlans

Luxembourg
nodes
subnets
vlans

Lyon
nodes
subnets
vlans

Nancy
nodes
nodes (production)
disks
subnets
vlans

Nantes
nodes
subnets
vlans

Rennes
nodes
disks
subnets
vlans

Sophia
nodes
subnets
vlans

Network Monitoring

Backbone network status and load

Grid'5000 Weathermap (courtesy of Renater)

Shows the current state of the optical links between the Grid'5000 10Gb-ready sites. A link drawn in black on the weathermap means that you won't be able to access that site's nodes from the Grid'5000 internal network.

Sites network traffic

A dashboard combining links and real-time data is available on the Grid'5000 Backbone Network Monitoring page.

Power Monitoring

Clusters where kwapi is available are listed on this page: https://intranet.grid5000.fr/jenkins-status/?job=test_kwapi

Usage statistics

Stats5k gathers a lot of statistics about the testbed.

Ganglia

Ganglia provides resource usage metrics (memory, cpu, jobs...) for individual sites or the whole platform.

https://intranet.grid5000.fr/ganglia/

Jenkins

Most Grid'5000 services are tested using Jenkins. A summary of the results, indicating platform health, is available at:

https://intranet.grid5000.fr/jenkins-status/