Nodes are not started in the simulation after 50 node are up
Posted: Thu Jan 07, 2021 11:03 pm
I took advice from the thread and first would describe my setup.
I have quite powerful server which I'm using for my eve-ng.
Hardware : 40 CPUs x Intel(R) Xeon(R) Gold 6138 CPU @ 2.00GHz, 512GB RAM, 2TB HDD Raid 1, 2 physical NICs
EVE-NG Running on - Inside ESXI (v6.7) (VM has allocated 32 vCPUs, 128Gb RAM, 500Gb HDD)
VT-X - Enabled
EVE-NG is community version.
In most of the labs I'm able to start nodes normally. I do have different images and all of them are verified and booting properly. When I tested IOL images I was able to successfully run 64 Nodes in simulation which is the upper limit of community version. I use EVE-NG for my preparation for CCIE Enterprise preparation. I have found a topology provided by Data Knox. It sims to be quite good to play and I wanted to take advantage of it. It heavily leverages IOL images, CSR and Viptela SDWAN images.
The issue that I'm facing is that when I'm trying to run all nodes after 51 of them are up, the other nodes doesn't start. It is always 51 node. Might be IOL or mix of IOL and QEMU but it is 51 node according to the status.
I thought that it might be related to some limit so I tried to shutdown different node and start the one that didn't start. This doesn't help however. New nodes doesn't start after this.
I checked the unl_wraper log and the only thing I can see there is that node has started:
INFO: starting /opt/unetlab/wrappers/iol_wrapper -T 0 -D 64 -t "Edge1" -F /opt/unetlab/addons/iol/bin/i86bi_LinuxL2-AdvEnterpriseK9-M_152_May_2018.bin -d 0 -e 1 -s 0 -- -n 1024 -q -m 1024 > /opt/unetlab/tmp/0/a507c74a-55bf-4f66-96b4-25fa3833704a/64/wrapper.txt 2>&1 &
I don't see anything suspicious in other log files.
If I create new lab and simply add multiple nodes 64 nodes are running without any issue.
My question is how can I further troubleshoot this? Is there any sort of debug which might help me to understand why nodes doesn't run? Could it be some sort of limitation within the topology itself?
I'm attaching the topology as it is free to use.
I have quite powerful server which I'm using for my eve-ng.
Hardware : 40 CPUs x Intel(R) Xeon(R) Gold 6138 CPU @ 2.00GHz, 512GB RAM, 2TB HDD Raid 1, 2 physical NICs
EVE-NG Running on - Inside ESXI (v6.7) (VM has allocated 32 vCPUs, 128Gb RAM, 500Gb HDD)
VT-X - Enabled
EVE-NG is community version.
In most of the labs I'm able to start nodes normally. I do have different images and all of them are verified and booting properly. When I tested IOL images I was able to successfully run 64 Nodes in simulation which is the upper limit of community version. I use EVE-NG for my preparation for CCIE Enterprise preparation. I have found a topology provided by Data Knox. It sims to be quite good to play and I wanted to take advantage of it. It heavily leverages IOL images, CSR and Viptela SDWAN images.
The issue that I'm facing is that when I'm trying to run all nodes after 51 of them are up, the other nodes doesn't start. It is always 51 node. Might be IOL or mix of IOL and QEMU but it is 51 node according to the status.
I thought that it might be related to some limit so I tried to shutdown different node and start the one that didn't start. This doesn't help however. New nodes doesn't start after this.
I checked the unl_wraper log and the only thing I can see there is that node has started:
INFO: starting /opt/unetlab/wrappers/iol_wrapper -T 0 -D 64 -t "Edge1" -F /opt/unetlab/addons/iol/bin/i86bi_LinuxL2-AdvEnterpriseK9-M_152_May_2018.bin -d 0 -e 1 -s 0 -- -n 1024 -q -m 1024 > /opt/unetlab/tmp/0/a507c74a-55bf-4f66-96b4-25fa3833704a/64/wrapper.txt 2>&1 &
I don't see anything suspicious in other log files.
If I create new lab and simply add multiple nodes 64 nodes are running without any issue.
My question is how can I further troubleshoot this? Is there any sort of debug which might help me to understand why nodes doesn't run? Could it be some sort of limitation within the topology itself?
I'm attaching the topology as it is free to use.