Hi everyone,
We have a critical issue with Exchange 2010 running on Server 2008 R2.
At random intervals we will receive alerts that All Domain Controller Servers in use are not responding (Event ID: 2102). One domain controller is virtual, one physical, and Exchange is virtual. All three servers are running 2008 R2. We are using VMWare ESXi
4.0 Update 1.
Process MAD.EXE (PID=6080). All Domain Controller Servers in use are not responding:
DC01.ourcompany.corp
DC02.ourcompany.corp
When this alert is triggered clients cannot access the Exchange server. The task manager reports W3WP.exe as using near 100% CPU and large amounts of memory. IMAP is enabled for a few users and all users connect to the Exchange server via BIS using OWA.
Both DCs are GC and this is a normal DC contact event:
Process MAD.EXE (PID=3712). Exchange Active Directory Provider has discovered the following servers with the following characteristics:
(Server name | Roles | Enabled | Reachability | Synchronized | GC capable | PDC | SACL right | Critical Data | Netlogon | OS Version)
In-site:
DC01.ourcompany.corp CDG 1 7 7 1 0 1 1 7 1
DC02.ourcompany.corp CDG 1 7 7 1 0 1 1 7 1
Steps taken so far:
Reenabled IPV6
Disabled TCP Checksum Offload
Ran DCDiag (All Domain Controllers come back fine)
Pinged Domain Controllers
Enabled IIS Recycling for the W3WP.exe process
Apply ALL updates and roll-ups
We are at a loss at this point and the issue needs to be resolved ASAP. Any suggestions would be appreciated.