DAQ/Online Quick Reference: Difference between revisions

From CLONWiki

Revision as of 13:45, 19 July 2010

DAQ Recovery Procedure

In most cases the DAQ crashes because of a ROC problem. Do the following:

  • click 'End Run' or 'Abort', whichever button is active
  • click 'Cancel' and then 'Reset'
  • go to the 'EB_TERM' window, press Ctrl-C, and rerun the previous command using the up arrow (the command is 'coda_eb ...')
  • run the 'roc_status' command from the clon10 window; it will show troubled ROC(s) as 'UNDEFINED' or 'active'
  • from the clon10 window, reboot the troubled ROC(s) using the command 'roc_reboot <roc>'
  • wait 2 minutes and run 'roc_status' again; the rebooted ROC(s) must be in the 'booted' state
  • start a new run from 'Download'

If this does not help, page the DAQ pager.
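
The status check and reboot steps above can be sketched as a small shell helper. This is only a sketch under assumptions, not an official tool: the page does not show the output of 'roc_status', so the code assumes one ROC per line in the form '<roc> <state>'. The function names 'troubled_rocs' and 'reboot_troubled' and the variable 'ROC_WAIT' are hypothetical.

```shell
#!/bin/sh
# ASSUMPTION: roc_status prints "<roc> <state>" on each line.
# Adjust the awk fields if the real output looks different.

# Read roc_status output on stdin and print the names of ROCs
# whose state is 'UNDEFINED' or 'active'.
troubled_rocs() {
    awk '$2 == "UNDEFINED" || $2 == "active" { print $1 }'
}

# Reboot every troubled ROC, wait (default 2 minutes), then show status again.
reboot_troubled() {
    rocs=$(roc_status | troubled_rocs)
    if [ -z "$rocs" ]; then
        echo "no troubled ROCs found"
        return 0
    fi
    for roc in $rocs; do
        echo "rebooting $roc"
        roc_reboot "$roc"
    done
    sleep "${ROC_WAIT:-120}"
    roc_status
}
```

Run 'reboot_troubled' from the clon10 window and confirm by eye that every rebooted ROC is reported as 'booted' before starting a new run from 'Download'.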


THE FOLLOWING INFORMATION IS FOR DAQ EXPERTS ONLY (DO NOT USE IT IF YOU ARE NOT SURE !!!)

DAQ procedures

First make sure you can open an X window from the terminal on clon03:0.1: type 'xterm' and check that an xterm window opens. If it does not, the message 'X11 connection rejected because of wrong authentication.' will show up. In that case kill the X session, start a new one (you will be on clon03) and 'ssh clon10'. Then make sure the 'xterm' command can open a window.
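
The xterm check above can also be scripted. The sketch below assumes that 'xterm' exits with a non-zero status when it cannot connect to the display; 'check_xterm' is a hypothetical helper name.

```shell
#!/bin/sh
# Return 0 if a throwaway xterm can be opened on the current display.
check_xterm() {
    if [ -z "${DISPLAY:-}" ]; then
        echo "DISPLAY is not set" >&2
        return 1
    fi
    # 'xterm -e true' opens a window, runs 'true' in it, and closes again.
    if xterm -e true; then
        echo "xterm OK on $DISPLAY"
    else
        echo "xterm failed on $DISPLAY; restart the X session and 'ssh clon10' again" >&2
        return 1
    fi
}
```

Call 'check_xterm' on clon10 before running any of the start commands; a failure means the DAQ windows would not open either.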

To start all DAQ processes, log in to clon10 as clasrun and type the following three commands, one in each of three different workspaces:

daq_start this
roc_xterm_this
pmc_xterms_this

These commands open all windows on the current workspace. If daq_start is executed without a parameter, it opens everything on the bottom screen of clon03, on the currently active workspace.

To kill all DAQ processes, log in to clon10 as clasrun and type:

daq_exit
roc_xterms_kill
pmc_xterms_kill


Online procedures

To start the online monitoring processes, use the following commands:

mon0_start (starts everything on clonmon0 - fast online reconstruction;
     NOTE: the root presenter must be started separately: ssh to clonmon0 (usually from clon03:0.2) and type 'cmroot clasprod hist_clasprod')
mon1_start (starts everything on clonmon1 - event_monitor)
mon2_start (starts everything on clonmon2 - online recsis, ced; if ced does not start, run it manually after ssh to clonmon2 from clon04:0.2)


To kill the online monitoring processes, use the following commands:

mon0_exit (kills everything on clonmon0)
mon1_exit (kills everything on clonmon1)
mon2_exit (kills everything on clonmon2)
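
A full restart of the monitoring processes can be sketched as a loop over the exit and start commands listed above. 'mon_restart' is a hypothetical helper name, and the sketch assumes the monN_exit and monN_start commands are on the PATH of the current account.

```shell
#!/bin/sh
# Stop and then start the monitoring processes for the given hosts.
# Arguments are the digits 0, 1, 2 (default: all three).
mon_restart() {
    [ "$#" -gt 0 ] || set -- 0 1 2
    for n in "$@"; do
        "mon${n}_exit"      # e.g. mon0_exit kills everything on clonmon0
    done
    for n in "$@"; do
        "mon${n}_start"     # e.g. mon0_start starts everything on clonmon0
    done
}
```

For example, 'mon_restart 1' restarts only clonmon1. The sketch does not start the root presenter on clonmon0 and does not start ced manually on clonmon2; both still need the separate steps noted in the start list.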

Occupancy plots must be started on clonmon1. Log in as clasrun and type

monb

Stadis must be started on clon05:0.1.

DCHV must be started on clon06:0.1.

Diman must be started on clon03:0.0.


Background information

  • CRON JOBS: on clon10, clon00, clon01 and clondb1, make sure the cron jobs are running (they must start automatically after a reboot). If you have to start the cron jobs, log in as clascron to the machine in question, go to the directory /home/clascron/backup and follow the instructions in the README file. Normally the steps are as follows:
log in as 'clascron' to clon10
go to /home/clascron/backup
type 'crontab -l' to see what is running
type 'crontab -r' to erase all existing cron jobs
type 'crontab clon10'
repeat everything for clon00 and clon01, using the files 'clon00' and 'clon01' respectively
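
The cron reload steps above can be sketched as one remote command per host. This is only a sketch: it assumes ssh access as clascron from the machine you are on, and the helper names 'cron_reload_cmd' and 'cron_reload_all' are hypothetical. The README in /home/clascron/backup remains the authority on the actual procedure.

```shell
#!/bin/sh
# Print the remote command that reloads the crontab for one host.
# The crontab file is named after the host, as in the steps above.
# 'crontab -l' and 'crontab -r' are separated by ';' so that an empty
# crontab (which makes them fail) does not stop the reload.
cron_reload_cmd() {
    printf 'cd /home/clascron/backup && { crontab -l; crontab -r; crontab %s; }\n' "$1"
}

# Run the reload on each host as clascron (ASSUMPTION: ssh works without
# further setup).
cron_reload_all() {
    for host in clon10 clon00 clon01; do
        ssh "clascron@$host" "$(cron_reload_cmd "$host")"
    done
}
```

'cron_reload_cmd clon10' only prints the command, so it can be reviewed before anything is erased with 'crontab -r'.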

Sometimes you may want to turn some cron jobs off to avoid alarms during system reconfiguration. Do not forget to turn them on again after you are done!

Process management

  • Process management: the SmartSockets-based process management procedures must be started on clon10 as clasrun using the following commands:
control_process_manager start clasprod clasprod
control_procmgr_manager start clasprod clasprod
control_ipc_process_manager start clasprod clasprod

and stopped with the following:

control_process_manager stop clasprod clasprod
control_procmgr_manager stop clasprod clasprod
control_ipc_process_manager stop clasprod clasprod
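
Because the three control commands take the same arguments, starting or stopping them together can be sketched as a single loop. 'procmgr_all' is a hypothetical helper name; the command names and the 'clasprod clasprod' arguments are taken from the lists above.

```shell
#!/bin/sh
# Run all three control commands with the same action ('start' or 'stop').
procmgr_all() {
    action=$1
    case $action in
        start|stop) ;;
        *) echo "usage: procmgr_all start|stop" >&2; return 2 ;;
    esac
    for cmd in control_process_manager control_procmgr_manager control_ipc_process_manager; do
        "$cmd" "$action" clasprod clasprod
    done
}
```

Run 'procmgr_all start' or 'procmgr_all stop' on clon10 as clasrun; any other argument is rejected before a command is run.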


Services on clondb1

Clondb1 runs several important servers, which are supposed to start automatically on reboot:

SmartSockets
Mysql

Logbook and/or run database does not work

Contact Marty Wise.


Sergey Boyarinov (from JLAB local phone):

Moscow cell phone: 9-011-7-916-729-8203
Moscow home phones: 9-011-7-495-392-3569 (otec), 9-011-7-499-189-0189 (teshya)
SKYPE: boiarino