Status: Difference between revisions

From Grid5000
Jump to navigation Jump to search
(re-organize info)
Line 1: Line 1:
= Grid administration schedule =
= Grid administration schedule =
If you experience problems using Grid'5000, please [[Current_events|check the grid administration schedule]], where past, present and future incidents (planned or not...) are notified for all sites.
If you experience problems, please [[Current_events|check the grid administration schedule]], where past, present and future incidents (planned or not...) are notified for all sites:
{{Link|text=[[Current_events|Sidebar # Users Portal > Platform events]]}}


Note that as far as possible most of the administration operations are planned on thursday mornings.
{{Note|text=As far as possible most of the administration operations are planned on thursday mornings.}}


Other monitoring tools are also available (see below) to help you trace your experiment. If you still have questions, please [https://helpdesk.grid5000.fr/bugzilla/ report your problem] or [mailto:staff@site.grid5000.fr contact the technical staff] from the most relevant site (or [mailto:ct@grid5000.fr the whole technical committee] for global cases).
= Problems =
[[#Monitoring_tools|Monitoring tools]] are available to help you trace your experiment.


* [https://helpdesk.grid5000.fr/phpldapadmin/ phpLDAPadmin]
If you still have questions, please:
* [https://www.grid5000.fr/cgi-bin/bugzilla/index.cgi BugZilla]
* send email to the [mailto:users@grid5000.fr grid5000 community]
* or report your problem on [https://helpdesk.grid5000.fr/bugzilla/ BugZilla]


= Ganglia =
= Monitoring tools =
Ganglia provides information about resources usage (memory, cpu, jobs...) for individual sites or the whole grid.
== OAR and OARgrid ==
 
OAR is the grid scheduler, which may also be queried for current and planned jobs and nodes reservations, either from command lines (see [[OAR|OAR documentation]]) or as web services.
https://helpdesk.grid5000.fr/ganglia/


= OAR and OARgrid =
=== Monika ===
OAR is the grid scheduler, which may also be queried for current and planned jobs and nodes reservations, either from command lines (see [[OAR|OAR documentation]]) or as web services.
[http://oar.imag.fr/ Monika] displays current and scheduled jobs.


== Monika ==
You can select an individual site:
[http://oar.imag.fr/ Monika] displays current and scheduled jobs. It is possible to view a [https://frontal38.imag.fr/cgi-bin/oargridmonika.cgi global snapshot] or select an individual site:
{| width="100%"
{| width="100%"
|-
|-
Line 35: Line 36:
|}
|}


== DrawOARGantt ==
Or view the [https://frontal38.imag.fr/cgi-bin/oargridmonika.cgi entire grid snapshot].
[http://oar.imag.fr/ DrawOARGantt] displays past, current and scheduled jobs. It is possible to view a [https://frontal38.imag.fr/cgi-bin/DrawGridGantt.cgi global Gantt Diagram] or select on an individual site:
 
=== DrawOARGantt ===
[http://oar.imag.fr/ DrawOARGantt] displays past, current and scheduled jobs.
 
You can select an individual site:
{| width="100%"
{| width="100%"
|-
|-
Line 53: Line 58:
|}
|}


= GridPrems =
Or view the [https://frontal38.imag.fr/cgi-bin/DrawGridGantt.cgi entire grid Gantt Diagram].
GridPrems is an alternate grid scheduler running on certain sites.
 
== GridPrems ==
[http://gforge.inria.fr/projects/gridprems GridPrems] is an alternate grid scheduler running on certain sites.


https://helpdesk.grid5000.fr/gridprems/
https://helpdesk.grid5000.fr/gridprems/


= Nagios =
== Ganglia ==
Nagios allows to monitor critical grid servers and services and automatically reports incidents and failures.
[http://ganglia.sourceforge.net/ Ganglia] provides resources usage metrics (memory, cpu, jobs...) for individual sites or the whole grid.
 
https://helpdesk.grid5000.fr/ganglia/
 
== Nagios ==
[http://www.nagios.org/ Nagios] monitors critical grid servers and services and automatically reports incidents and failures.


https://helpdesk.grid5000.fr/nagios/
https://helpdesk.grid5000.fr/nagios/


= Experimental network monitoring tools =
== phpLDAPadmin ==
[http://phpldapadmin.sourceforge.net/ phpLDAPadmin] provides easy administration for your LDAP account entries.


'''Beware:''' experimental = unstable...
https://helpdesk.grid5000.fr/phpldapadmin/
 
== Current status map ==
This tool positions geographically sites and displays their current status.


http://www.lri.fr/~herault/G5K/action.html
http://www.lri.fr/~herault/G5K/action.html
{{Warning|text=This tool is experimental for the moment (eg. unstable).}}

Revision as of 16:07, 2 March 2006

Grid administration schedule

If you experience problems, please check the grid administration schedule, where past, present and future incidents (planned or not...) are notified for all sites:

Link.png {{{1}}}
Note.png Note

As far as possible most of the administration operations are planned on thursday mornings.

Problems

Monitoring tools are available to help you trace your experiment.

If you still have questions, please:

Monitoring tools

OAR and OARgrid

OAR is the grid scheduler, which may also be queried for current and planned jobs and nodes reservations, either from command lines (see OAR documentation) or as web services.

Monika

Monika displays current and scheduled jobs.

You can select an individual site:

Or view the entire grid snapshot.

DrawOARGantt

DrawOARGantt displays past, current and scheduled jobs.

You can select an individual site:

Or view the entire grid Gantt Diagram.

GridPrems

GridPrems is an alternate grid scheduler running on certain sites.

https://helpdesk.grid5000.fr/gridprems/

Ganglia

Ganglia provides resources usage metrics (memory, cpu, jobs...) for individual sites or the whole grid.

https://helpdesk.grid5000.fr/ganglia/

Nagios

Nagios monitors critical grid servers and services and automatically reports incidents and failures.

https://helpdesk.grid5000.fr/nagios/

phpLDAPadmin

phpLDAPadmin provides easy administration for your LDAP account entries.

https://helpdesk.grid5000.fr/phpldapadmin/

Current status map

This tool positions geographically sites and displays their current status.

http://www.lri.fr/~herault/G5K/action.html

Warning.png Warning

This tool is experimental for the moment (eg. unstable).