Content replication timeouts causing high utilization on database

  • 7005887
  • 03-May-2010
  • 18-Jun-2012

Environment

Novell ZENworks 10 Configuration Management

Situation

The database server shows periodic high utilization caused by excessive retries of Satellite content synchronization requests.
 
ERROR (from zmd-messages.log on satellite server):
 
[Debug] [04/24/2010 14:34:59.073] [] [2355] [ZenworksAgent] [37,366] [CDP] [No work to do in the cleanup thread - cannot contact content service] [] []
[Debug] [04/24/2010 14:35:29.511] [] [2355] [ZenworksAgent] [37,365] [SoapUtility] [{0} Exception: {1}] [] []
[Debug] [04/24/2010 14:35:29.511] [] [2355] [ZenworksAgent] [37,365] [ZMD] [Current URI is https://server/zenworks-contentservice/ is bad...trying for another one] [] []
[Debug] [04/24/2010 14:35:29.511] [] [2355] [ZenworksAgent] [37,365] [ZMD] [New URI is null] [] []
[Debug] [04/24/2010 14:35:29.512] [] [2355] [ZenworksAgent] [37,365] [CDP] [No response from server.] [] []
[Debug] [04/24/2010 14:35:29.512] [] [2355] [ZenworksAgent] [37,365] [CDP] [No replication work - cannot contact content service] [] []
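
To judge how often a Satellite is failing to reach the content service, the failure messages in the excerpt above can be counted per minute. The following is a minimal Python sketch, not a Novell-supplied tool. The timestamp layout and the search text are inferred only from the sample lines above, and the function name failures_per_minute is hypothetical.

```python
import re
from collections import Counter
from datetime import datetime

# Assumption: timestamps look like [MM/DD/YYYY HH:MM:SS.mmm], as in the excerpt above.
TS_RE = re.compile(r"\[(\d{2}/\d{2}/\d{4} \d{2}:\d{2}:\d{2}\.\d{3})\]")

# Assumption: this phrase marks a failed attempt to reach the content service.
FAILURE_TEXT = "cannot contact content service"


def failures_per_minute(lines):
    """Count log lines containing FAILURE_TEXT, grouped by minute."""
    counts = Counter()
    for line in lines:
        if FAILURE_TEXT not in line:
            continue
        match = TS_RE.search(line)
        if not match:
            # Skip wrapped or malformed lines that carry no timestamp.
            continue
        stamp = datetime.strptime(match.group(1), "%m/%d/%Y %H:%M:%S.%f")
        counts[stamp.strftime("%Y-%m-%d %H:%M")] += 1
    return counts
```

Passing the lines of zmd-messages.log (for example, from an open file object) returns a Counter keyed by minute. A steady stream of non-zero counts suggests the Satellite is retrying continuously.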

Resolution

If the CDP (Content Distribution Point) service times out on its request to the parent Primary Server, it may keep retrying and place load on the database.  To adjust the timeout, see the instructions for setting CDPWebCallWaitTimeout in the 10.3 documentation at https://www.novell.com/documentation/zcm10/zcm10_system_admin/data/bafdqz7.html
 
NOTE:  In 10.3.1 and later, the default setting for CDPWebCallWaitTimeout is changed to 240000 milliseconds (4 minutes).
 
Additionally, there is a known issue in versions prior to 10.3.1:  If the agent fails to download content, it does not fail over to download from other servers.  This failover behavior is fixed in version 10.3.1.  See KB 7006265 "ZENworks Configuration Management 10.3.1 - update information and list of fixes", which can be found at https://www.novell.com/support

Additional Information

The "time:" tags in services-messages.log can be used to estimate how long requests take to fulfill.  For example:
 
[DEBUG] [5/20/10 6:06:43 AM] [] [Content Web Service] [] [getCDPSuperfluousContent complete, time: 40ms, device: 5dad4884401fa54fac0935ebb0fdaca6, thread: 60] [] []
[DEBUG] [5/20/10 6:08:45 AM] [] [Content Web Service] [] [getCDPMissingContent(null) complete, time: 2712ms, device: 5dad4884401fa54fac0935ebb0fdaca6, thread: 73] [] []
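
The timings in entries like these can be pulled out with a short script to find slow content requests. This is a minimal Python sketch under stated assumptions: the pattern is inferred only from the two sample lines above, other entries in services-messages.log may use a different layout, and the names parse_timings and slow_calls are hypothetical.

```python
import re

# Assumption: entries follow the form shown in the two sample lines above:
# [<operation>[(args)] complete, time: <N>ms, device: <hex id>, thread: <N>]
LINE_RE = re.compile(
    r"\[(?P<op>\w+)(?:\([^)]*\))? complete, "
    r"time: (?P<ms>\d+)ms, device: (?P<device>[0-9a-f]+), thread: (?P<thread>\d+)\]"
)


def parse_timings(lines):
    """Return (operation, milliseconds, device, thread) for each matching line."""
    results = []
    for line in lines:
        match = LINE_RE.search(line)
        if match:
            results.append(
                (
                    match.group("op"),
                    int(match.group("ms")),
                    match.group("device"),
                    int(match.group("thread")),
                )
            )
    return results


def slow_calls(lines, threshold_ms):
    """Return only the parsed entries that took at least threshold_ms."""
    return [entry for entry in parse_timings(lines) if entry[1] >= threshold_ms]
```

Comparing the slowest entries against the configured CDPWebCallWaitTimeout value gives a rough indication of whether requests are approaching the timeout.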