Nancy:Network
{{Template:Site link|Network}}
{{Portal|Network}}
{{Portal|User}}
{{Status|In production}}
{{Maintainer|Philippe Robert}}
= Overview of Ethernet network topology =

[[File:NancyNetwork.png|1200px]]

{{:Nancy:GeneratedNetwork}}

Network device models:
* gw: Cisco Nexus 9508
* sgraoullyib: Infiniband
* sgrappe: Dell S5224F-ON
* sgrele-opf: Omni-Path
* sgros1: Dell Z9264F-ON
* sgros2: Dell Z9264F-ON
* sgruss: Dell S5224F-ON
* sgrvingt: Dell S4048

More details (including address ranges) are available from the [[Grid5000:Network]] page.

= IP networks in use =

To run an experiment between several Grid5000 sites, you have to use a public network range.


== Public Networks ==

* computing: '''172.28.52.0/22'''
* virtual: '''10.144.0.0/14'''

== Local Networks ==

* ipmi: '''172.28.152.0/22'''
* ib: '''192.168.210.0/24''', see [[Nancy:Network#HPC_Networks|HPC Networks]]


= Ethernet network =

== Production Network ==

[[Image:NetworkNancy2.png|center|Production network]]


== Physical link details ==

The following diagram shows a view of the central router, an HP ProCurve 5406zl named <code class="host">sgravillon1</code>:
* All internal links are 10Gb/s CX4.
* The Renater link is a 10Gb/s optical fiber.

[[Image:Sgravillon1-graphene.png|center|600px]]


Devices connected to ports A1 to A24 of <code class="host">sgravillon1</code>:

<table class="program">
<tr>
    <th>A1</th>
    <th>A3</th>
    <th>A5</th>
    <th>A7</th>
    <th>A9</th>
    <th>A11</th>
    <th>A13</th>
    <th>A15</th>
    <th>A17</th>
    <th>A19</th>
    <th>A21</th>
    <th>A23</th>
</tr>
<tr>
    <td> - </td>
    <td>fgriffon4-eth1</td>
    <td>fgriffon3-ipmi</td>
    <td>fgriffon4-ipmi</td>
    <td>grog-ipmi</td>
    <td>fgriffon1-eth0</td>
    <td>sgraphene1-ipmi</td>
    <td>sgraphene3-ipmi</td>
    <td> - </td>
    <td>sgriffon2-ipmi</td>
    <td>sgriffon3-ipmi</td>
    <td>fgriffon1-eth1</td>
</tr>
<tr>
    <th>A2</th>
    <th>A4</th>
    <th>A6</th>
    <th>A8</th>
    <th>A10</th>
    <th>A12</th>
    <th>A14</th>
    <th>A16</th>
    <th>A18</th>
    <th>A20</th>
    <th>A22</th>
    <th>A24</th>
</tr>
<tr>
    <td>fgriffon2-ipmi</td>
    <td>fgriffon4-eth0</td>
    <td> - </td>
    <td>grog-eth0</td>
    <td>fgriffon1-ipmi</td>
    <td> - </td>
    <td>sgraphene2-ipmi</td>
    <td>sgraphene4-ipmi</td>
    <td>LORIA</td>
    <td>sgriffon1-ipmi</td>
    <td>fgriffon3-eth0</td>
    <td>fgriffon3-eth1</td>
</tr>
</table>
 
= HPC Networks =

Several HPC networks are available.

== Omni-Path 100G on grele and grimani nodes ==

* <code class="host">grele-1</code> to <code class="host">grele-14</code> have one 100Gb/s Omni-Path card.
* <code class="host">grimani-1</code> to <code class="host">grimani-6</code> have one 100Gb/s Omni-Path card.
* Card model: Intel Omni-Path Host Fabric Adapter 100 Series, single port, PCIe x8 (a quick way to verify the card is sketched below).
 
== Omni-Path 100G on grvingt nodes ==

There is a second, separate Omni-Path network connecting the 64 grvingt nodes and some servers.

Topology, as generated by <code>opareports -o topology</code>:

[[File:Topology-grvingt.png|400px]]

More information about using Omni-Path with MPI is available from the [[Run_MPI_On_Grid%275000]] tutorial.
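As a minimal sketch of the workflow (not a substitute for the tutorial), assuming OpenMPI built with PSM2 support and a compiled MPI binary <code>./my_app</code> (hypothetical name):

 # Reserve two grvingt nodes, then launch an MPI job over Omni-Path
 oarsub -I -p "cluster='grvingt'" -l nodes=2
 mpirun -machinefile $OAR_NODEFILE --mca mtl psm2 ./my_app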


== Infiniband 20G on griffon nodes ==

=== Nodes ===

* <code class="host">griffon-1</code> to <code class="host">griffon-92</code> have one 20Gb/s Infiniband card.
* Card model: Mellanox Technologies MT26418 [ConnectX IB DDR, PCIe 2.0 5GT/s] ([http://www.mellanox.com/content/pages.php?pg=products_dyn&product_family=4&menu_section=41 ConnectX]).
* Driver: <code class="dir">mlx4_ib</code>
* OAR property: ib20g=YES ([https://helpdesk.grid5000.fr/oar/Nancy/monika.cgi?props=ib20g%3DYES&Action=Display+nodes+for+these+properties&.cgifields=props Monika]); see the reservation sketch below.
* IP over IB addressing: <code class="host">griffon-[1..92]-ib</code>.nancy.grid5000.fr (192.168.210.[1..92])
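A minimal reservation sketch using this OAR property, assuming the standard interactive workflow on the Nancy frontend:

 # Reserve two nodes that have the 20G Infiniband card
 oarsub -I -p "ib20g='YES'" -l nodes=2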
=== Switch ===

* Infiniband Switch 4X DDR
* Model based on [http://www.mellanox.com/content/pages.php?pg=products_dyn&product_family=16&menu_section=33 Infiniscale_III]
* 1 commutation card Flextronics F-X43M204
* 12 line cards 4X 12 ports DDR Flextronics F-X43M203

=== Interconnection ===

The Infiniband network is physically isolated from the Ethernet networks. Therefore, the Ethernet network emulated over Infiniband is isolated as well: there is no interconnection at either the data link layer (L2) or the network layer (L3).
 
== Myrinet 10G on griffon nodes ==

=== Nodes ===

* <code class="host">griffon-11</code> to <code class="host">griffon-14</code> have one 10Gb/s Myrinet card.
* Card model: [http://www.myri.com/Myri-10G/10gbe_solutions.html Myri-10G] 10G-PCIE-8B-C NIC
* Driver: <code class="dir">myri10ge</code>
* OAR property: eth10g=YES ([https://helpdesk.grid5000.fr/oar/Nancy/monika.cgi?props=eth10g%3DYES&Action=Display+nodes+for+these+properties&.cgifields=props Monika]); see the sketch below.
* Ethernet IP addresses: <code class="host">griffon-[11..14]-myri0</code>.nancy.grid5000.fr (172.28.55.[221..224])
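A minimal sketch for selecting these nodes and inspecting the Myrinet interface; it assumes the <code>eth10g</code> property and that the interface carries the <code>myri0</code> name used in the addressing above:

 # Reserve a node with the 10G Myrinet card, then inspect the interface
 oarsub -I -p "eth10g='YES'" -l nodes=1
 ip addr show myri0   # should hold one of the 172.28.55.[221..224] addresses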


=== Interconnection ===

The nodes with Myrinet configured are connected to the 10G Ethernet interfaces of the switches sgriffon[1-3] as follows:
* griffon-11 to sgriffon1
* griffon-12 to sgriffon1 (this link is not working, probably due to the switch configuration)
* griffon-13 to sgriffon2
* griffon-14 to sgriffon3


== Infiniband 56G on graphite/graoully/grimoire/grisou nodes ==

=== Nodes ===

* <code class="host">graoully-[1-16]</code> have one 56Gb/s Infiniband card.
* <code class="host">grimoire-[1-8]</code> have one 56Gb/s Infiniband card.
* <code class="host">graphite-[1-4]</code> have one 56Gb/s Infiniband card.
* <code class="host">grisou-[50-51]</code> have one 56Gb/s Infiniband card.
* Card model: Mellanox Technologies MT27500 Family [ConnectX-3] ([http://www.mellanox.com/related-docs/user_manuals/ConnectX-3_VPI_Single_and_Dual_QSFP_Port_Adapter_Card_User_Manual.pdf ConnectX-3]).
* Driver: <code class="dir">mlx4_core</code>
* OAR property: ib_rate='56'; see the connectivity sketch after this list.
* IP over IB addressing: <code class="host">graoully-[1-16]-ib0</code>.nancy.grid5000.fr (172.18.70.[1-16])
* IP over IB addressing: <code class="host">grimoire-[1-8]-ib0</code>.nancy.grid5000.fr (172.18.71.[1-8])
* IP over IB addressing: <code class="host">graphite-[1-4]-ib0</code>.nancy.grid5000.fr (172.16.68.[9-12])
* IP over IB addressing: <code class="host">grisou-[50-51]-ib0</code>.nancy.grid5000.fr (172.16.72.[50-51])
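A minimal connectivity sketch over IP-over-IB, assuming two reserved nodes (<code class="host">graoully-1</code> and <code class="host">graoully-2</code> as arbitrary examples) and the usual <code>ib0</code> interface naming:

 # Reserve two 56G Infiniband nodes
 oarsub -I -p "ib_rate='56'" -l nodes=2
 # On graoully-1: check the ib0 address and ping the neighbour over IB
 ip addr show ib0
 ping -c 3 graoully-2-ib0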


=== Switch ===

* 36-port Mellanox InfiniBand SX6036 ([http://www.mellanox.com/page/products_dyn?product_family=132 documentation])
* 36 FDR (56Gb/s) ports in a 1U switch
* 4.032Tb/s switching capacity
* FDR/FDR10 support for Forward Error Correction (FEC)

=== Interconnection ===

The Infiniband network is physically isolated from the Ethernet networks. Therefore, the Ethernet network emulated over Infiniband is isolated as well: there is no interconnection at either the data link layer (L2) or the network layer (L3).


= Grid5000 interconnect =

== Interconnect type ==

[[Image:nancy_g5k_interconnect.png|Nancy's Grid'5000 interconnect]]

== Link details ==

[[Image:nancy_g5k_linking.png|Nancy's Grid'5000 linking]]

'''Note''': All the fiber cables used are dedicated to our Grid'5000 interconnect. Where the figure above says ''trunk'', it denotes a more rigid multi-fiber cable, as opposed to an individual patch cord.

= Loria interconnect =

== Interconnect type ==

[[Image:nancy_loria_interconnect.png|Nancy's Loria interconnect]]

== Link details ==

[[Image:nancy_loria_linking.png|Nancy's Loria linking]]
