on 10-15-2014 8:41 PM
Dear All,
In our CPS system, one of the platform agents is not running: the Platform Agent Service stays in the Connecting state, and the operator message is "JCS-116012: Warning: Transient communication failure: HTTP response code: -1". Can anyone help me solve this issue? Let me know if you need more information. Thanks in advance.
KR
Muhammad Asif.
Hi All,
The user, group, and permissions of the executable files inside the */latest/bin directory had been modified by someone using the root user. After reverting the bin folder's file permissions to the originals, the platform agent started. The correct file permissions are as follows:
$ ls -ltr
total 72576
-rwxr-xr-x 1 redwood redwood 759 Aug 6 2013 chown.sh
-rwxr-xr-x 1 redwood redwood 31430 Dec 11 2013 rca
-r-s--x--x 1 root redwood 2064664 Apr 2 2014 job-processor
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jmail
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jmessage
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jsecret
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jmonitor
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jregister
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jscript
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jlink
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jgetfile
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jgetpar
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jftp
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jevent
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jdescription
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jtool
-rwxr-xr-x 14 redwood redwood 2336824 Apr 2 2014 jsleep
-rwxr-xr-x 1 redwood redwood 2321704 Apr 2 2014 network-processor
-rwxr-xr-x 1 redwood redwood 8387 Oct 9 22:07 platform-agent
$ pwd
/usr/sap/redwood/agent_CPS08/latest/bin
$
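For anyone who wants to check this on other agents, the comparison can be scripted. This is only a rough sketch: the expected modes and owners are copied from the listing above, while the function names and the BIN_DIR variable are made up for illustration and are not part of the product.

```shell
#!/bin/sh
# Sketch: verify mode and owner:group of a few agent binaries.
# Expected values are copied from the listing above; BIN_DIR can be overridden.

mode_of()  { ls -ld "$1" | cut -c1-10; }               # e.g. -rwxr-xr-x
owner_of() { ls -ld "$1" | awk '{print $3 ":" $4}'; }  # e.g. redwood:redwood

check_file() {
    # usage: check_file FILE EXPECTED_MODE EXPECTED_OWNER:GROUP
    if [ ! -e "$1" ]; then
        echo "MISSING  $1"
        return 2
    fi
    actual="$(mode_of "$1") $(owner_of "$1")"
    if [ "$actual" = "$2 $3" ]; then
        echo "OK       $1 ($actual)"
    else
        echo "MISMATCH $1 ($actual, expected $2 $3)"
        return 1
    fi
}

# Only runs where the agent directory actually exists.
BIN_DIR=${BIN_DIR:-/usr/sap/redwood/agent_CPS08/latest/bin}
if [ -d "$BIN_DIR" ]; then
    check_file "$BIN_DIR/job-processor"     "-r-s--x--x" "root:redwood"    || true
    check_file "$BIN_DIR/network-processor" "-rwxr-xr-x" "redwood:redwood" || true
    check_file "$BIN_DIR/platform-agent"    "-rwxr-xr-x" "redwood:redwood" || true
fi
```

The setuid bit and root ownership on job-processor are the entries most likely to be lost by a recursive chown or chmod, so that line is worth checking first.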
KR
Muhammad Asif.
Please try the following:
thanks
Nanda
Hi Nanda,
Thanks for the reply. Here are the results:
1. Telnet is working fine, no issues:
$ telnet avggssbh 3388
Trying...
Connected to avggssbh.
Escape character is '^]'
2. setuid was selected during installation, so I believe there is no need to maintain OS user credentials in CPS. We have around 20 servers, and all PAs are running fine except this one.
3. The scheduler is in the Running state on the satellite system, whereas in the CPS GUI the Platform Agent Service is in the Connecting state:
$ ./scheduler status
Instance ZD2_AVGGSPPH_PA: Running
$
4. Auto restart on host reboot is selected.
More analysis: in the network-processor log I found one fatal error (fsqueue.c(385)). Any idea which memory this refers to, and how to check its current status?
fatal 2014-10-11 02:11:42,858 METDST [9432-http-request #c tid=d00000000] common.memory - Out of memory allocating 293811038 bytes at fsqueue.c(385)
One more error:
opsys.socket - Connecting socket to localhost port xxxxxx failed: Error 0
http.http - Unable to connect to localhost:xxxxx
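To see how often the out-of-memory error occurs and how large the failed allocations are, the log can be filtered with a small awk wrapper. This is a sketch that assumes the log lines have the same layout as the fatal line quoted above; the function name oom_summary and the log file name in the usage comment are placeholders.

```shell
#!/bin/sh
# Sketch: summarise "Out of memory allocating N bytes" entries from a log.
# Assumes the field layout of the fatal line quoted above.

oom_summary() {
    # Reads log lines on stdin and prints, per matching entry:
    #   DATE TIME BYTES bytes (~MiB MiB) at SOURCE-LOCATION
    awk '/Out of memory allocating/ {
        for (i = 1; i < NF; i++) {
            if ($i == "allocating") {
                bytes = $(i + 1)
                printf "%s %s %d bytes (~%d MiB) at %s\n", $2, $3, bytes, int(bytes / 1048576), $NF
            }
        }
    }'
}

# Usage (log file name is a placeholder):
#   oom_summary < network-processor.log
```

For the line above this reports a single failed request of about 280 MiB. If the platform enforces per-process limits, running `ulimit -a` as the agent user shows them; whether such a limit is the cause here is not established by the log.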
Any other clues?
KR
Muhammad Asif.
I hope you ran the telnet from the CPS server, and not from your desktop.
Did you also telnet from the AVGGSSPPH server back to CPS?
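If you want to repeat the check in both directions without an interactive telnet session, something like the following may help. It is a sketch that assumes nc (netcat) is installed on both hosts; the function name port_open is made up, and the CPS host and port in the second usage line are placeholders because they are not given in this thread.

```shell
#!/bin/sh
# Sketch: non-interactive TCP port check, assuming nc (netcat) is available.

port_open() {
    # usage: port_open HOST PORT
    # Returns 0 if a TCP connection succeeds within 5 seconds, non-zero otherwise.
    if ! command -v nc >/dev/null 2>&1; then
        echo "nc not found; use: telnet $1 $2" >&2
        return 2
    fi
    nc -z -w 5 "$1" "$2" >/dev/null 2>&1
}

# Usage, on the CPS server (host and port as in the telnet test in this thread):
#   port_open avggssbh 3388 && echo reachable || echo "not reachable"
# Usage, on the agent host back to CPS (placeholders, not known from this thread):
#   port_open CPS_HOST CPS_PORT && echo reachable || echo "not reachable"
```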
The user you are talking about is the OS user that the platform agent runs as.
What about the user you used to register the platform agent with CPS?
Is that user locked? Is login enabled?
Have you tried stopping the process server in CPS, restarting the platform agent on the remote system, and then starting the process server in CPS?
Is there any other information available in the operator messages?
Also, what was the last job that ran on this process server? Was it successful?
Or did it fail at the same time the memory error occurred?
Thanks
Nanda