TO DO Lists
This is Sergey Boyarinov's TODO list and the list of completed jobs.
Sergey Boyarinov's TODO list
- add /usr/clas/archive automount to the Linux procedures (see clon04)
- found a solution for et_2_et_10_00 etc problem: we are running 3 et_2_et's on clon10, question is how our monitoring will distinguish them ? ps command cuts parameters, and process name et_2_et is the same for all of them; temporary et_2_et included in to 2 others (see $CODA/src/et/main).
- look at UEP15 NIM power supply from WIENER (plug into old NIM bin)
- decide on electronics inventory database and form of presentation - with Sergey P.
- CODA: use et_2_et everywhere (check byte swap!)
- respond on property (see email Oct-9-2007)
- there is an incompartibility on Solaris 10 update 2 with new Studio 12 (bosio does not compile for example), need to fix, maybe install Solaris 10 update 3 from scratch ? if so, clon10 and clon00 need extra work installing dns, nis, bootp, realport etc
- modify CMON and CED for new TOF counters (ask Heddle for CED)
- make datafile for DVCS Trigger simulation using BOS file from Rustam
Oct-9-2007: done, Rustam sent me pedestal file, I generated 10000 events ascii file for Ben, it is on DVCS Trigger wiki page, sent mail to Ben; program is $CODA/src/bosio/main/bosdvcstrig.c
- talk to Hovanes about adding new names 'lac' and 'dc' to the dictionary generation procedure (Makefile modifications, save/restore procedure etc); update wiki page accordingly
Oct-5-2007: Hovanes shown me how to do that, we did it in new EPICS, must write it to wiki and repeat in old epics
- buy 2 more A1520 500V CAEN boards on Stepan's request for hodoscope
Oct-8-2007: got quotation for A1520's, v288's and v895's
- arrange AC socket replacement on sy527's
- preshower_exit did not kill rcServer (probably daq_exit too) - need check
- old sy527 driver for VME-based caenet board
Sep-2007: need to pass to Nerses what I did; Oct-1-2007: doing it myself with Nerses's help Oct-4-2007: driver is finished, last puzzle was switching on floating point in vxWorks; everything works fine; remaining changes: make status bits consistent with sy1527s, check if 'input V' field must be filled in; merge with sy1527 driver; tune timing inside v288.c and sy527.c; check power/enable handling (currently 'dis'=Pw and PrON, 'Ena'=PrOn); check ID's (where it should be 0, where 4 etc) Oct-9-2007: finish another round of tuning; add extra checks, particularly in v288Get calls, call v288Reset in case of transmission problems, fix some sleeps, set lower priority for main thread to let EPICS processes breeze; still have error messages, but ioc seems stable thanks to v288Reset; TODO: make sure enable/disable and on/off logic works correctly (probably in v288.c)
- cleanup/replug/label BigIron switch, update MRTG names
- shutting down clasonl1 with command
shutdown -y -i 5 -g 120
got following
Shutdown started. Fri Sep 28 16:20:49 EDT 2007 Broadcast Message from root (pts/28) on clasonl1 Fri Sep 28 16:20:49... The system clasonl1 will be shut down in 2 minutes showmount: clasonl1: RPC: Program not registered Broadcast Message from root (pts/28) on clasonl1 Fri Sep 28 16:21:49... The system clasonl1 will be shut down in 1 minute showmount: clasonl1: RPC: Program not registered Broadcast Message from root (pts/28) on clasonl1 Fri Sep 28 16:22:19... The system clasonl1 will be shut down in 30 seconds showmount: clasonl1: RPC: Program not registered Broadcast Message from root (pts/28) on clasonl1 Fri Sep 28 16:22:39... THE SYSTEM clasonl1 IS BEING SHUT DOWN NOW ! ! ! Log off now or risk your files being damaged showmount: clasonl1: RPC: Program not registered Changing to init state 5 - please wait clasonl1:/root>
Need to understand.
- v1190/v1290 testing: CBLT does not work, wrong slot number
- runcontrol does not compiles on clon10, but compiles on clon03 - new studio12 etc - need to install Sol10 U3 everywhere !; runcontrol hungs in 'go' for TDC_CALIB - working on it
- write notes on standalone DAQ operations (enable signal necessity etc)
- equipment list with DB, lebles (with Sergey P.)
- JLAB discriminators: ask Volker to push it
28-set-2007: sent email to Volker, he replied asking for extra info, sent email with requested info
- need 1881M ADCs, at least few modules
- test sy527 which arrived from repair (with George Jacobs)
28-sep-2007: unit tested, boots fine, serial works; George will test alarm output, Sergey - remote control 28-sep-2007: AC connectors arrived, George will arrange replacement (or me ?); box with connectors in counting room on the table
- buy labels for both labeling machines
17-sep-2007 requisitions 266507 and 266519 have been submitted 26-sep-2007 labels for panel labler received
- learn, start and test auto-shutdown software on clons
29-sep-2007 soft installed on 18 machines, partially configured, need to send signal from UPS_CHB1 and make sure it reaches all machines
- on Nerses's request: install S99caRepeater and S99logServer scripts, update corresponding procedure for Solaris (ask Paul Letta if necessary)
14-sep-2007: installed to /etc/rc3.d on clon01, but actial reboot was not tested
- fix colors for clasrun accounts (and others ?) on clons
- order 2 more discriminators for DVCSCAL
- on clon10 msql and rtserver must restart automatically on reboot
- nrpe did not restarted on clon06, it was in maintenance state; as clasrun I did disable and enable - it works now; probably network was down when it was trying to restart ?
- make sure emergency generator is by-passed if in service
26-sep-2007: sent mail to Bob Rice 26-sep-2007: I decided to submit request to inforce better power routing during emergency generator panel repair: normal power must be supplied during that time
Sergey Boyarinov's COMPLETED list
- replug AC power to emergency generators
17-sep-2007: done
- NTP servers on Solaris, update Solaris post-install page (and Linux on clon04)
24-sep-2007: done
- get TIBCO license:
17-sep-2007 requisition 266520 has been submitted temporary password received, changed to permanent; access to tibco web site works
25-sep-2007: done
- PCAL test setup (with Sergey P.)
14-sep-2007: everything seems working, except missing delay cables and wierd problem: FASTBUS hung on first event 26-sep-2007: replaced signal distr. card (TDC START did not work) 27-sep-2007: add 1 sec delay in 'Go' trabsition of fbrol1.c for standalone only, it seems fixed first-event-hung problem; everything seems Ok
27-sep-2007: done
- make paper 'DVCSCAL trigger system' and pass it to Chris and Ben
25-sep-2005 create new page DVCS Trigger System in clonweb wiki, document is there
25-sep-2007: done
- reply to Motorola about prpmc880
with Sergey P.: we asked Motorola for replacement, they offered PrPMC280-like replacement, Sergey on top of that, I'm done for now 28-sep-2007: done
- on clonweb wrong apache is starting on reboot (check nagios and mrtg as well)
30-sep-2007: done
- on upcoming run request ran 3 ethernet cables and 1 serial cable to the target cartrige; new name 'ioctstarg' was added to the tsconnect.conf; all connections are tested
1-oct-2007: done
- ET system debugging (with Carl)
clonpc7:/etc> ifconfig -a lo0: flags=8049<UP,LOOPBACK,RUNNING,MULTICAST> mtu 16384 inet 127.0.0.1 netmask 0xff000000 inet6 ::1 prefixlen 128 inet6 fe80::1%lo0 prefixlen 64 scopeid 0x1 gif0: flags=8010<POINTOPOINT,MULTICAST> mtu 1280 stf0: flags=0<> mtu 1280 en0: flags=8863<UP,BROADCAST,SMART,RUNNING,SIMPLEX,MULTICAST> mtu 1500 inet6 fe80::20d:93ff:fe65:ba7e%en0 prefixlen 64 scopeid 0x4 inet 129.57.68.7 netmask 0xffffff00 broadcast 129.57.68.255 inet 192.168.2.1 netmask 0xffffff00 broadcast 192.168.2.255 ether 00:0d:93:65:ba:7e media: autoselect (100baseTX <full-duplex>) status: active supported media: none autoselect 10baseT/UTP <half-duplex> 10baseT/UTP <full-duplex> 10baseT/UTP <full-duplex,hw-loopback> 100baseTX <half-duplex> 100baseTX <full-duplex> 100baseTX <full-duplex,hw-loopback> en1: flags=8863<UP,BROADCAST,SMART,RUNNING,SIMPLEX,MULTICAST> mtu 1500 ether 00:11:24:a1:d0:47 media: autoselect (<unknown type>) status: inactive supported media: autoselect fw0: flags=8863<UP,BROADCAST,SMART,RUNNING,SIMPLEX,MULTICAST> mtu 2030 lladdr 00:0d:93:ff:fe:65:ba:7e media: autoselect <full-duplex> status: inactive supported media: autoselect <full-duplex>
ET system was confused:
clonpc7:et> et_start -n 100 -s 10000 -f /tmp/test3 et_netinfo reached et_netinfo: fully qualified default hostname >clonpc7.jlab.org<
ifi->ifi_flags = 0xffff8049 LOOPBACK INTERFACE INTERFACE IS UP ifi->ifi_addr = 0x00300300 hptr = 0x00300190 addr_in->sin_addr = 127.0.0.1,
ifi->ifi_flags = 0xffff8863 INTERFACE IS UP ifi->ifi_addr = 0x00300350 hptr = 0x00300190 addr_in->sin_addr = 129.57.68.7,
ifi->ifi_flags = 0xffff8863 INTERFACE IS UP ifi->ifi_addr = 0x003003a0 hptr = 0x00000000 addr_in->sin_addr = 192.168.2.1,
et_netinfo: address = 129.57.68.7 et_netinfo: error in gethostbyaddr we've got 192.168.2.1, do not believe it is true et_netinfo: address = 129.57.68.7 removing file >/tmp/test3< file >/tmp/test3< removed et_udpreceive: port=11111 et_udpreceive: port=11112 et_udpreceive: port=11112 et_udpreceive: port=11112 ET user library >/usr/local/clas/devel/coda/Darwin_powerpc/lib/libet_user.so< will be used
to eliminate alias 192.168.2.1 following command was used:
ifconfig en0 -alias 192.168.2.1
Now it looks better:
clonpc7:/etc> et_start -n 100 -s 10000 -f /tmp/test3 et_netinfo reached et_netinfo: fully qualified default hostname >clonpc7.jlab.org<
ifi->ifi_flags = 0xffff8049 LOOPBACK INTERFACE INTERFACE IS UP ifi->ifi_addr = 0x00300300 hptr = 0x00300190 addr_in->sin_addr = 127.0.0.1,
ifi->ifi_flags = 0xffff8863 INTERFACE IS UP ifi->ifi_addr = 0x00300350 hptr = 0x00300190 addr_in->sin_addr = 129.57.68.7,
et_netinfo: address = 129.57.68.7 removing file >/tmp/test3< file >/tmp/test3< removed et_udpreceive: port=11111 et_udpreceive: port=11112 ET user library >/usr/local/clas/devel/coda/Darwin_powerpc/lib/libet_user.so< will be used
Oct-3-2007: got new version from Carl, test et_2_et: can connect through multiple ports to multiple ETs, BUT cannot connect between machines on different subnets. Tried DIRECT - does not work, ask Carl to check.
Oct-4-2007: pcal EB and ER are not working with new ET !!! Switch to old one, need check ...
Oct-9-2007: final check, everything looks good; add new option 'direct' to 'et_2_et', will be used for communication between machines which are not sharing any subnets, in that case only one ET system on remote machine can be connected as before; if machines are sharing at least one subnet, there is no limitations for the number of ET systems we can communicate with - good enough for CLON cluster
Oct-9-2007: done
- order extended maintenance kit for clonhp2
Oct-9-2007: done