V77 "Freeze" Problem

Chris929
Posts: 83
Joined: Tue Jun 27, 2017 8:51 am

V77 "Freeze" Problem

Post by Chris929 » Thu Aug 31, 2017 9:57 am

Hi all,

I have a problem with the new V77:

My vSRX nodes randomly freeze completely.
Even after clicking "stop", the web UI still shows the node as running (I verified with top that the process had stopped).
When this happens I have to reboot the whole EVE host to get everything "offline" before I can start my lab again. This is really annoying. Has anyone else experienced this problem?
I'm using vSRX 15.1D100 and vSRX 17.3R1 with EVE V77 and virtioa.qcow2 disks.
I labbed a lot with v71 and never had problems like this.

Regards
Chris
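
For anyone who wants to repeat the "verified with top" check without watching top, here is a minimal sketch in Python. It assumes a Linux host with /proc mounted and that the node processes have names starting with "qemu" (an assumption; check the actual process names on your EVE host). The function name list_processes is made up for this example.

```python
import os


def list_processes(name_prefix, proc_root="/proc"):
    """Return sorted (pid, name) pairs for processes whose name starts with name_prefix."""
    matches = []
    try:
        entries = os.listdir(proc_root)
    except OSError:
        return matches  # no /proc here (not Linux, or path is wrong)
    for entry in entries:
        if not entry.isdigit():
            continue  # only numeric directories are processes
        try:
            with open(os.path.join(proc_root, entry, "comm")) as handle:
                name = handle.read().strip()
        except OSError:
            continue  # the process exited while we were scanning
        if name.startswith(name_prefix):
            matches.append((int(entry), name))
    return sorted(matches)


if __name__ == "__main__":
    # "qemu" is an assumed prefix, not something confirmed in this thread.
    for pid, name in list_processes("qemu"):
        print(pid, name)
```

If this prints nothing while the web UI still shows nodes as running, the UI state is stale rather than the processes being alive.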

breakintheweb
Posts: 10
Joined: Fri Sep 01, 2017 12:25 pm

Re: V77 "Freeze" Problem

Post by breakintheweb » Fri Sep 01, 2017 12:39 pm

This could be related to the new per-node CPU limiting that was added in v77. You can try disabling it by setting the value to 1 in the node config.

Uldis (UD)
Posts: 5067
Joined: Wed Mar 15, 2017 4:44 pm
Location: London

Re: V77 "Freeze" Problem

Post by Uldis (UD) » Sat Sep 02, 2017 11:08 pm

The CPU limit can be disabled per node from the web UI: edit the node and uncheck CPU limit.

Chris929
Posts: 83
Joined: Tue Jun 27, 2017 8:51 am

Re: V77 "Freeze" Problem

Post by Chris929 » Mon Sep 04, 2017 8:02 pm

I disabled it globally and enabled it only on a per-node basis, and that seems to work now.
But as soon as I enable it globally AND on the node (like double limiting), the fun begins...

ecze
Posts: 533
Joined: Wed Mar 15, 2017 1:54 pm

Re: V77 "Freeze" Problem

Post by ecze » Mon Sep 04, 2017 9:29 pm

With the CPU limit enabled globally, all QEMU processes are monitored except those of nodes where the option is unchecked.

For all Juniper nodes, it seems mandatory to uncheck cpulimit.
Indeed, Juniper doesn't support the pause mechanism: the packet-processing engine needs to poll the CPU without interruption...
I don't much like this model of event handling, but that's Juniper's choice...

E.
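
To make the "pause mechanism" concrete: the Linux cpulimit utility throttles a process by alternately stopping and resuming it with SIGSTOP and SIGCONT. The sketch below illustrates that duty-cycle idea in Python. It assumes EVE's CPU limit behaves like cpulimit, which this thread does not confirm, and the names duty_cycle_ms and throttle are invented for the example.

```python
import os
import signal
import time


def duty_cycle_ms(limit_percent, period_ms=100):
    """Split one period into (run_ms, pause_ms) for a given CPU limit in percent."""
    if not 1 <= limit_percent <= 100:
        raise ValueError("limit_percent must be between 1 and 100")
    run_ms = period_ms * limit_percent // 100
    return run_ms, period_ms - run_ms


def throttle(pid, limit_percent, cycles, period_ms=100):
    """Let pid run for run_ms, then freeze it for pause_ms, repeated `cycles` times."""
    run_ms, pause_ms = duty_cycle_ms(limit_percent, period_ms)
    try:
        for _ in range(cycles):
            os.kill(pid, signal.SIGCONT)      # resume the process
            time.sleep(run_ms / 1000)
            if pause_ms:
                os.kill(pid, signal.SIGSTOP)  # freeze it for the rest of the period
                time.sleep(pause_ms / 1000)
    finally:
        try:
            os.kill(pid, signal.SIGCONT)      # never leave the target frozen
        except ProcessLookupError:
            pass                              # target already exited
```

During each pause the guest gets no CPU time at all, which is why a guest that busy-polls for packets can misbehave under this kind of limit.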

kmanthezu
Posts: 54
Joined: Mon Mar 20, 2017 3:52 pm

Re: V77 "Freeze" Problem

Post by kmanthezu » Fri Sep 08, 2017 8:09 pm

I'm seeing this same issue with IOL instances as well.

dasi3907
Posts: 2
Joined: Mon Sep 11, 2017 10:12 pm

Re: V77 "Freeze" Problem

Post by dasi3907 » Tue Sep 12, 2017 11:04 pm

I'm also seeing the same issue with IOL nodes, and I'm able to recreate it. When the terminal session times out and you restart it, then type a command like show eigrp tech support, the node freezes and then unfreezes after about 20 minutes.

After it unfreezes, it comes up with these errors, but the nodes work fine.

*Sep 12 22:42:08.505: %AMDP2_FE-6-EXCESSCOLL: Ethernet0/2 TDR=0, TRC=0
*Sep 12 22:42:38.517: %AMDP2_FE-6-EXCESSCOLL: Ethernet0/2 TDR=0, TRC=0
*Sep 12 22:43:08.525: %AMDP2_FE-6-EXCESSCOLL: Ethernet0/2 TDR=0, TRC=0
*Sep 12 22:43:38.529: %AMDP2_FE-6-EXCESSCOLL: Ethernet0/2 TDR=0, TRC=0


Is there an option or command to manually force nodes to restart? (Shutdown and start do not work when the node is frozen.)
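
Nobody in this thread names an EVE command for this, so the following is only a generic Linux sketch, not an EVE feature: ask a process to exit with SIGTERM and fall back to SIGKILL if it is still there after a grace period. The function name force_stop and its parameters are hypothetical, you need the node's PID from the host, and the web UI may still show the node as running afterwards.

```python
import os
import signal
import time


def force_stop(pid, grace_seconds=5.0, poll_interval=0.2):
    """Send SIGTERM, wait up to grace_seconds, then send SIGKILL if pid still exists."""
    try:
        os.kill(pid, signal.SIGTERM)
    except ProcessLookupError:
        return  # nothing to stop
    deadline = time.monotonic() + grace_seconds
    while time.monotonic() < deadline:
        try:
            os.kill(pid, 0)  # signal 0 delivers nothing, it only checks the PID exists
        except ProcessLookupError:
            return  # exited after SIGTERM
        time.sleep(poll_interval)
    try:
        os.kill(pid, signal.SIGKILL)  # cannot be caught or ignored by the target
    except ProcessLookupError:
        pass  # exited just before the fallback
```

A process that is currently stopped does not act on SIGTERM until it is resumed, whereas SIGKILL ends it either way, which is why the fallback matters for a frozen node.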

Chris929
Posts: 83
Joined: Tue Jun 27, 2017 8:51 am

Re: V77 "Freeze" Problem

Post by Chris929 » Wed Sep 13, 2017 9:55 am

I had the same problem: either wait (30 minutes in my case) or reboot the EVE host. The web interface does not recognize that QEMU isn't running anymore. It still shows my device as "up" even when all QEMU instances are down. Really weird...

Today I found a workaround that works for me:

More Actions > Console to all nodes (native mode) unfreezes everything for me. Strange, right?

kmanthezu
Posts: 54
Joined: Mon Mar 20, 2017 3:52 pm

Re: V77 "Freeze" Problem

Post by kmanthezu » Wed Sep 13, 2017 12:55 pm

My connection is always native, and it has been doing this since the upgrade to 77.

zanswer
Posts: 6
Joined: Tue Aug 29, 2017 4:12 am
Location: Nyagan

Re: V77 "Freeze" Problem

Post by zanswer » Thu Sep 21, 2017 5:52 am

Same here, I'm seeing IOL and VPC nodes freezing.
