OS: Windows Server 2012 R2
Domain Environment: 2012 R2 Functional Level
Test Server: Virtual Machine running on Hyper-V
For the last 6 months we have been struggling with a problem that a third-party tool causes in many of our domain environments, and I am hoping there are some Group Policy experts here who can help us improve our debugging and flush out the root cause.
Overview: We are a software company, and we use another company's application for our reporting module. This third-party tool (Pentaho) uses PostgreSQL and Apache Tomcat, both of which are managed via a Windows service we created. The application works well, but when it is installed on domain-joined machines with GPOs applied, there is a conflict with the Group Policy Client that causes major delays during reboots and problems running gpupdate and rsop.msc while the PostgreSQL and Tomcat services are running.
Behavior: While the PostgreSQL/Tomcat services are running, a reboot is delayed by upwards of one hour, and gpupdate /force hangs indefinitely. Running gpupdate /force in that state also puts gpclient into a bad state that only a reboot resolves. Simply disabling the services and rebooting brings the system back to a healthy state, and Group Policy operates normally until we re-enable the PostgreSQL/Tomcat services.
We have enabled all available debug logging in Group Policy, PostgreSQL, and Tomcat, and we have performed xbootmgr traces, Process Monitor analysis, and packet captures, but we have been unable to pinpoint the cause of the conflict with Group Policy. We have also opened tickets with all other involved vendors to see if the problem can be solved from their side, but I would like a Group Policy expert to review our gpsvc logs to see if anything is obvious, or to suggest anything else we can enable to get more detail about what is causing this.
I have collected a series of logs and network captures - descriptions and links below:
1. Normal login with PostgreSQL/Tomcat fully stopped/disabled: Group Policy processes normally with no extended delays - https://www.dropbox.com/s/0yrkcky34pdnljb/normal_gp.txt?dl=0
2. Normal login with PostgreSQL/Tomcat running: Group Policy completely hangs for multiple minutes at various points with no explanation, and eventually completes - https://www.dropbox.com/s/kzzjewmuj5ga9y5/essence_gp.log?dl=0
3. 'gpupdate /force' run after login with PostgreSQL/Tomcat services enabled: https://www.dropbox.com/s/n4sobuaabs1f1li/gpupdate_fail.log?dl=0
4. Packet capture while the gpupdate /force from above was running - https://www.dropbox.com/s/xa1032bcgq9bmib/gpupdate_fail_trace.pcapng?dl=0
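In case it helps anyone reviewing the gpsvc logs above, here is a minimal Python sketch for locating the points where logging stalls. It assumes each line has the usual gpsvc.log shape, `GPSVC(pid.tid) HH:MM:SS:mmm message`; the helper names `parse_line` and `find_gaps` and the 30-second threshold are illustrative choices of mine, not part of any Microsoft tooling.

```python
import re
from datetime import timedelta

# Assumed line shape: "GPSVC(pid.tid) HH:MM:SS:mmm message"
LINE_RE = re.compile(
    r"^GPSVC\(([0-9a-fA-F]+)\.([0-9a-fA-F]+)\)\s+"
    r"(\d{1,2}):(\d{2}):(\d{2}):(\d{1,3})\s+(.*)$"
)


def parse_line(line):
    """Return (time_of_day, message), or None if the line does not match."""
    m = LINE_RE.match(line.strip())
    if not m:
        return None
    hours, minutes, seconds, millis = (int(m.group(i)) for i in range(3, 7))
    stamp = timedelta(hours=hours, minutes=minutes,
                      seconds=seconds, milliseconds=millis)
    return stamp, m.group(7)


def find_gaps(lines, threshold_seconds=30):
    """List (gap_seconds, message_before, message_after) for each stall."""
    gaps = []
    prev = None
    for line in lines:
        parsed = parse_line(line)
        if parsed is None:
            continue  # skip continuation lines and anything unrecognised
        stamp, message = parsed
        if prev is not None:
            delta = stamp - prev[0]
            if delta < timedelta(0):
                # The log carries no date, so treat a negative step
                # as a rollover past midnight.
                delta += timedelta(days=1)
            if delta.total_seconds() >= threshold_seconds:
                gaps.append((delta.total_seconds(), prev[1], message))
        prev = (stamp, message)
    return gaps
```

To use it, read one of the log files into a list of lines and pass that to `find_gaps`; each tuple returned gives the length of the stall in seconds and the messages logged immediately before and after it. The original gpsvc.log on the server is typically UTF-16 encoded, so the file may need to be opened with `encoding="utf-16"`, but that is an assumption to check against the actual files.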
Is there anything obvious in these logs/captures that I am missing? Is there any additional debugging/tracing that we can enable to get further details about what is causing gpsvc to fail while PostgreSQL/Tomcat services are running?
Please let me know if there is any additional information that I can provide.
Nick