process exited with code 247 (0xf7, -9)

Saenger
Joined: 15 Feb 05
Posts: 24
Credit: 600993
RAC: 0
Topic 84775

All of my CUDA workunits (WUs) fail with this error.

stderr:

[18:18:15][31139][INFO ] Starting data processing...
[18:18:15][31139][INFO ] CUDA global memory status (initial GPU state, including context):
------> Used in total: 116 MB (396 MB free / 512 MB total) -> Used by this application (assuming a single GPU task): 0 MB
[18:18:15][31139][INFO ] Using CUDA device #0 "GeForce GT 240" (96 CUDA cores / 385.92 GFLOPS)
[18:18:15][31139][INFO ] Version of installed CUDA driver: 4010
[18:18:15][31139][INFO ] Version of CUDA driver API used: 3020
[18:18:15][31139][INFO ] Checkpoint file unavailable: status.cpt (No such file or directory).
------> Starting from scratch...
[18:18:15][31139][INFO ] Header contents:
------> Original WAPP file: ./p2030.20100913.G48.73+01.03.S.b6s0g0.00000_DM637.60
------> Sample time in microseconds: 65.4762
------> Observation time in seconds: 274.62705
------> Time stamp (MJD): 55453.031573597393
------> Number of samples/record: 0
------> Center freq in MHz: 1214.289551
------> Channel band in MHz: 0.33605957
------> Number of channels/record: 960
------> Nifs: 1
------> RA (J2000): 191718.8153
------> DEC (J2000): 143053.3507
------> Galactic l: 0
------> Galactic b: 0
------> Name: G48.73+01.03.S
------> Lagformat: 0
------> Sum: 1
------> Level: 3
------> AZ at start: 0
------> ZA at start: 0
------> AST at start: 0
------> LST at start: 0
------> Project ID: --
------> Observers: --
------> File size (bytes): 0
------> Data size (bytes): 0
------> Number of samples: 4194304
------> Trial dispersion measure: 637.6 cm^-3 pc
------> Scale factor: 0.117303
[18:18:19][31139][INFO ] Seed for random number generator is 1112316178.
[18:18:21][31139][INFO ] Derived global search parameters:
------> f_A probability = 0.08
------> single bin prob(P_noise > P_thr) = 9.93986e-09
------> thr1 = 18.4267
------> thr2 = 21.5421
------> thr4 = 26.5915
------> thr8 = 35.0049
------> thr16 = 49.3672
[18:18:21][31139][INFO ] CUDA global memory status (GPU setup complete):
------> Used in total: 375 MB (137 MB free / 512 MB total) -> Used by this application (assuming a single GPU task): 259 MB
[18:18:22][31139][ERROR] Error launching CUDA TSP kernel (error: 1)
[18:18:22][31139][ERROR] Demodulation failed (error: 1015)!
18:18:22 (31139): called boinc_finish
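
A note on reading the numbers in the log above: CUDA reports versions as a single integer, 1000 * major + 10 * minor, so 4010 and 3020 correspond to 4.1 and 3.2. The "used by this application" figure appears to be the difference between the total used after setup and the total used initially. The sketch below only redoes that arithmetic; the function name `decode_cuda_version` is made up for illustration and is not part of the application.

```python
def decode_cuda_version(value: int) -> str:
    """Decode CUDA's integer version encoding (1000*major + 10*minor)."""
    major, rest = divmod(value, 1000)
    minor = rest // 10
    return f"{major}.{minor}"


# Values copied from the stderr output above.
installed_driver = decode_cuda_version(4010)   # "Version of installed CUDA driver"
driver_api_used = decode_cuda_version(3020)    # "Version of CUDA driver API used"

# Memory figures (MB) from the two "CUDA global memory status" lines.
used_initial_mb = 116
used_after_setup_mb = 375
used_by_app_mb = used_after_setup_mb - used_initial_mb  # assumes a single GPU task

print(installed_driver, driver_api_used, used_by_app_mb)
```

This gives 4.1, 3.2 and 259 MB, which matches the 259 MB the log reports for the application.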

Messages in BOINC:

Di 15 Nov 2011 19:53:56 CET | Albert@Home | Started download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384.binary
Di 15 Nov 2011 19:53:56 CET | Albert@Home | Started download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1385.binary
Di 15 Nov 2011 19:53:56 CET | Albert@Home | Started download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1386.binary
Di 15 Nov 2011 19:54:23 CET | Albert@Home | Finished download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1386.binary
Di 15 Nov 2011 19:54:23 CET | Albert@Home | Started download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1387.binary
Di 15 Nov 2011 19:54:26 CET | Albert@Home | Finished download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1385.binary
Di 15 Nov 2011 19:54:26 CET | Albert@Home | Started download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1388.binary
Di 15 Nov 2011 19:54:31 CET | Albert@Home | Finished download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384.binary
Di 15 Nov 2011 19:54:31 CET | Albert@Home | Started download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1389.binary
Di 15 Nov 2011 19:54:54 CET | Albert@Home | Finished download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1387.binary
Di 15 Nov 2011 19:54:54 CET | Albert@Home | Started download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1390.binary
Di 15 Nov 2011 19:54:59 CET | Albert@Home | Finished download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1388.binary
Di 15 Nov 2011 19:54:59 CET | Albert@Home | Started download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1391.binary
Di 15 Nov 2011 19:55:01 CET | Albert@Home | Finished download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1389.binary
Di 15 Nov 2011 19:55:17 CET | Albert@Home | Finished download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1391.binary
Di 15 Nov 2011 19:55:19 CET | Albert@Home | Finished download of p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1390.binary
Di 15 Nov 2011 19:55:19 CET | Albert@Home | [task] result state=FILES_DOWNLOADED for p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 from CS::update_results
Di 15 Nov 2011 20:00:19 CET | Albert@Home | [task] ACTIVE_TASK::start(): forked process: pid 25662
Di 15 Nov 2011 20:00:19 CET | Albert@Home | [task] task_state=EXECUTING for p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 from start
Di 15 Nov 2011 20:00:19 CET | Albert@Home | Starting task p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 using einsteinbinary_BRP4 version 108
Di 15 Nov 2011 20:00:27 CET | Albert@Home | [task] Process for p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 exited
Di 15 Nov 2011 20:00:27 CET | Albert@Home | [task] task_state=EXITED for p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 from handle_exited_app
Di 15 Nov 2011 20:00:27 CET | Albert@Home | [sched_op] Deferring communication for 1 min 34 sec
Di 15 Nov 2011 20:00:27 CET | Albert@Home | [sched_op] Reason: Unrecoverable error for task p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 (process exited with code 247 (0xf7, -9))
Di 15 Nov 2011 20:00:27 CET | Albert@Home | [task] result state=COMPUTE_ERROR for p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 from CS::report_result_error
Di 15 Nov 2011 20:00:27 CET | Albert@Home | [task] process exited with status 247
Di 15 Nov 2011 20:00:27 CET | Albert@Home | Computation for task p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 finished
Di 15 Nov 2011 20:00:27 CET | Albert@Home | Output file p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10_0 for task p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 absent
Di 15 Nov 2011 20:00:27 CET | Albert@Home | Output file p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10_1 for task p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 absent
Di 15 Nov 2011 20:00:27 CET | Albert@Home | Output file p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10_2 for task p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 absent
Di 15 Nov 2011 20:00:27 CET | Albert@Home | Output file p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10_3 for task p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 absent
Di 15 Nov 2011 20:00:27 CET | Albert@Home | Output file p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10_4 for task p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 absent
Di 15 Nov 2011 20:00:27 CET | Albert@Home | Output file p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10_5 for task p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 absent
Di 15 Nov 2011 20:00:27 CET | Albert@Home | Output file p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10_6 for task p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 absent
Di 15 Nov 2011 20:00:27 CET | Albert@Home | Output file p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10_7 for task p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 absent
Di 15 Nov 2011 20:00:27 CET | Albert@Home | [task] result state=COMPUTE_ERROR for p2030.20100913.G48.73+01.03.S.b6s0g0.00000_1384_10 from CS::app_finished
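
For reference, the three numbers in "code 247 (0xf7, -9)" are the same 8-bit exit status written as unsigned decimal, as hex, and as a signed byte. Since stderr shows the application calling boinc_finish itself, the -9 is probably the application's own exit value and not a kill signal, but that is an assumption. A minimal sketch of the conversion, with a made-up helper name:

```python
def exit_code_views(code: int) -> tuple[int, str, int]:
    """Return an exit status as (unsigned byte, hex string, signed byte)."""
    unsigned = code & 0xFF                      # exit statuses are 8 bits wide
    signed = unsigned - 256 if unsigned >= 128 else unsigned
    return unsigned, hex(unsigned), signed


print(exit_code_views(247))   # (247, '0xf7', -9)
print(exit_code_views(-9))    # same status when starting from the signed value
```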

Greetings from Sänger

Saenger
Joined: 15 Feb 05
Posts: 24
Credit: 600993
RAC: 0

process exited with code 247 (0xf7, -9)

The same problem has come up here before, as I just found out via Google:
http://www.renderfarm.fi/forum/post/4122

Greetings from Sänger

Bernd Machenschalk
Administrator
Joined: 15 Oct 04
Posts: 155
Credit: 6218130
RAC: 0

Thanks for reporting!

I disabled the CUDA apps again. Apparently they choke on the modified BRP workunits we are sending out here.

BM
