TO DO Lists

From CLONWiki
Revision as of 09:34, 11 October 2007 by Boiarino (talk | contribs)
Jump to navigation Jump to search

This is Sergey Boyarinov's TODO list and the list of completed jobs.

Sergey Boyarinov's TODO list

- look at UEP15 NIM power supply from WIENER (plug into old NIM bin)

- decide on electronics inventory database and form of presentation - with Sergey P.

- CODA: use et_2_et everywhere (check byte swap!)

- respond on property (see email Oct-9-2007)

- there is an incompartibility on Solaris 10 update 2 with new Studio 12 (bosio does not compile for example), need to fix, maybe install Solaris 10 update 3 from scratch ? if so, clon10 and clon00 need extra work installing dns, nis, bootp, realport etc

- modify CMON and CED for new TOF counters (ask Heddle for CED)

- make datafile for DVCS Trigger simulation using BOS file from Rustam

Oct-9-2007: done, Rustam sent me pedestal file, I generated 10000 events ascii
  file for Ben, it is on DVCS Trigger wiki page, sent mail to Ben; program is
  $CODA/src/bosio/main/bosdvcstrig.c

- talk to Hovanes about adding new names 'lac' and 'dc' to the dictionary generation procedure (Makefile modifications, save/restore procedure etc); update wiki page accordingly

Oct-5-2007: Hovanes shown me how to do that, we did it in new EPICS, must write it
  to wiki and repeat in old epics

- buy 2 more A1520 500V CAEN boards on Stepan's request for hodoscope

Oct-8-2007: got quotation for A1520's, v288's and v895's

- arrange AC socket replacement on sy527's

- preshower_exit did not kill rcServer (probably daq_exit too) - need check

- old sy527 driver for VME-based caenet board

Sep-2007: need to pass to Nerses what I did;
Oct-1-2007: doing it myself with Nerses's help
Oct-4-2007: driver is finished, last puzzle was switching on floating point in vxWorks;
   everything works fine; remaining changes: make status bits consistent with sy1527s,
   check if 'input V' field must be filled in; merge with sy1527 driver; tune timing
   inside v288.c and sy527.c; check power/enable handling (currently 'dis'=Pw and PrON, 'Ena'=PrOn);
   check ID's (where it should be 0, where 4 etc)
Oct-9-2007: finish another round of tuning; add extra checks, particularly in v288Get calls, call v288Reset
   in case of transmission problems, fix some sleeps, set lower priority for main thread to let EPICS
   processes breeze; still have error messages, but ioc seems stable thanks to v288Reset;
TODO: make sure enable/disable and on/off logic works correctly (probably in v288.c)

- cleanup/replug/label BigIron switch, update MRTG names

- shutting down clasonl1 with command

shutdown -y -i 5 -g 120

got following

Shutdown started.    Fri Sep 28 16:20:49 EDT 2007
Broadcast Message from root (pts/28) on clasonl1 Fri Sep 28 16:20:49...
The system clasonl1 will be shut down in 2 minutes
showmount: clasonl1: RPC: Program not registered
Broadcast Message from root (pts/28) on clasonl1 Fri Sep 28 16:21:49...
The system clasonl1 will be shut down in 1 minute
showmount: clasonl1: RPC: Program not registered
Broadcast Message from root (pts/28) on clasonl1 Fri Sep 28 16:22:19...
The system clasonl1 will be shut down in 30 seconds
showmount: clasonl1: RPC: Program not registered
Broadcast Message from root (pts/28) on clasonl1 Fri Sep 28 16:22:39...
THE SYSTEM clasonl1 IS BEING SHUT DOWN NOW ! ! !
Log off now or risk your files being damaged
showmount: clasonl1: RPC: Program not registered
Changing to init state 5 - please wait
clasonl1:/root>

Need to understand.

- v1190/v1290 testing: CBLT does not work, wrong slot number

- runcontrol does not compiles on clon10, but compiles on clon03 - new studio12 etc - need to install Sol10 U3 everywhere !; runcontrol hungs in 'go' for TDC_CALIB - working on it

- write notes on standalone DAQ operations (enable signal necessity etc)

- equipment list with DB, lebles (with Sergey P.)

- JLAB discriminators: ask Volker to push it

28-set-2007: sent email to Volker, he replied asking for extra info, sent email with requested info

- need 1881M ADCs, at least few modules

- test sy527 which arrived from repair (with George Jacobs)

28-sep-2007: unit tested, boots fine, serial works; George will test alarm output, Sergey - remote control
28-sep-2007: AC connectors arrived, George will arrange replacement (or me ?); box with connectors in
  counting room on the table

- buy labels for both labeling machines

17-sep-2007 requisitions 266507 and 266519 have been submitted
26-sep-2007 labels for panel labler received

- learn, start and test auto-shutdown software on clons

29-sep-2007 soft installed on 18 machines, partially configured, need to send signal from UPS_CHB1 and make sure
it reaches all machines

- on Nerses's request: install S99caRepeater and S99logServer scripts, update corresponding procedure for Solaris (ask Paul Letta if necessary)

14-sep-2007: installed to /etc/rc3.d on clon01, but actial reboot was not tested

- fix colors for clasrun accounts (and others ?) on clons

- order 2 more discriminators for DVCSCAL

- on clon10 msql and rtserver must restart automatically on reboot

- nrpe did not restarted on clon06, it was in maintenance state; as clasrun I did disable and enable - it works now; probably network was down when it was trying to restart ?

- make sure emergency generator is by-passed if in service

26-sep-2007: sent mail to Bob Rice
26-sep-2007: I decided to submit request to inforce better power routing during emergency generator panel repair: normal
power must be supplied during that time

Sergey Boyarinov's COMPLETED list

- replug AC power to emergency generators

17-sep-2007: done

- NTP servers on Solaris, update Solaris post-install page (and Linux on clon04)

24-sep-2007: done

- get TIBCO license:

17-sep-2007 requisition 266520 has been submitted
temporary password received, changed to permanent; access to tibco web site works
25-sep-2007: done

- PCAL test setup (with Sergey P.)

14-sep-2007: everything seems working, except missing delay cables and wierd
problem: FASTBUS hung on first event
26-sep-2007: replaced signal distr. card (TDC START did not work)
27-sep-2007: add 1 sec delay in 'Go' trabsition of fbrol1.c for standalone only, it seems
fixed first-event-hung problem; everything seems Ok
27-sep-2007: done

- make paper 'DVCSCAL trigger system' and pass it to Chris and Ben

25-sep-2005 create new page DVCS Trigger System in clonweb wiki, document is there
25-sep-2007: done

- reply to Motorola about prpmc880

with Sergey P.: we asked Motorola for replacement, they offered PrPMC280-like replacement, Sergey on top of that, I'm done
for now

28-sep-2007: done

- on clonweb wrong apache is starting on reboot (check nagios and mrtg as well)

30-sep-2007: done

- on upcoming run request ran 3 ethernet cables and 1 serial cable to the target cartrige; new name 'ioctstarg' was added to the tsconnect.conf; all connections are tested

1-oct-2007: done


- ET system debugging (with Carl)

clonpc7:/etc> ifconfig -a
lo0: flags=8049<UP,LOOPBACK,RUNNING,MULTICAST> mtu 16384
       inet 127.0.0.1 netmask 0xff000000 
       inet6 ::1 prefixlen 128 
       inet6 fe80::1%lo0 prefixlen 64 scopeid 0x1 
gif0: flags=8010<POINTOPOINT,MULTICAST> mtu 1280
stf0: flags=0<> mtu 1280
en0: flags=8863<UP,BROADCAST,SMART,RUNNING,SIMPLEX,MULTICAST> mtu 1500
       inet6 fe80::20d:93ff:fe65:ba7e%en0 prefixlen 64 scopeid 0x4 
       inet 129.57.68.7 netmask 0xffffff00 broadcast 129.57.68.255
       inet 192.168.2.1 netmask 0xffffff00 broadcast 192.168.2.255
       ether 00:0d:93:65:ba:7e 
       media: autoselect (100baseTX <full-duplex>) status: active
       supported media: none autoselect 10baseT/UTP <half-duplex> 10baseT/UTP <full-duplex> 10baseT/UTP 
   <full-duplex,hw-loopback> 100baseTX <half-duplex> 100baseTX <full-duplex> 100baseTX <full-duplex,hw-loopback>
en1: flags=8863<UP,BROADCAST,SMART,RUNNING,SIMPLEX,MULTICAST> mtu 1500
       ether 00:11:24:a1:d0:47 
       media: autoselect (<unknown type>) status: inactive
       supported media: autoselect
fw0: flags=8863<UP,BROADCAST,SMART,RUNNING,SIMPLEX,MULTICAST> mtu 2030
       lladdr 00:0d:93:ff:fe:65:ba:7e 
       media: autoselect <full-duplex> status: inactive
       supported media: autoselect <full-duplex>

ET system was confused:

clonpc7:et> et_start -n 100 -s 10000 -f /tmp/test3
et_netinfo reached
et_netinfo: fully qualified default hostname >clonpc7.jlab.org<
ifi->ifi_flags = 0xffff8049
LOOPBACK INTERFACE
INTERFACE IS UP
ifi->ifi_addr = 0x00300300
hptr = 0x00300190
addr_in->sin_addr = 127.0.0.1,
ifi->ifi_flags = 0xffff8863
INTERFACE IS UP
ifi->ifi_addr = 0x00300350
hptr = 0x00300190
addr_in->sin_addr = 129.57.68.7,
ifi->ifi_flags = 0xffff8863
INTERFACE IS UP
ifi->ifi_addr = 0x003003a0
hptr = 0x00000000
addr_in->sin_addr = 192.168.2.1,
et_netinfo: address = 129.57.68.7
et_netinfo: error in gethostbyaddr
we've got 192.168.2.1, do not believe it is true
et_netinfo: address = 129.57.68.7
removing file >/tmp/test3<
file >/tmp/test3< removed
et_udpreceive: port=11111
et_udpreceive: port=11112
et_udpreceive: port=11112
et_udpreceive: port=11112
ET user library >/usr/local/clas/devel/coda/Darwin_powerpc/lib/libet_user.so< will be used


to eliminate alias 192.168.2.1 following command was used:

ifconfig en0 -alias 192.168.2.1

Now it looks better:

clonpc7:/etc> et_start -n 100 -s 10000 -f /tmp/test3
et_netinfo reached
et_netinfo: fully qualified default hostname >clonpc7.jlab.org<
ifi->ifi_flags = 0xffff8049
LOOPBACK INTERFACE
INTERFACE IS UP
ifi->ifi_addr = 0x00300300
hptr = 0x00300190
addr_in->sin_addr = 127.0.0.1,
ifi->ifi_flags = 0xffff8863
INTERFACE IS UP
ifi->ifi_addr = 0x00300350
hptr = 0x00300190
addr_in->sin_addr = 129.57.68.7,
et_netinfo: address = 129.57.68.7
removing file >/tmp/test3<
file >/tmp/test3< removed
et_udpreceive: port=11111
et_udpreceive: port=11112
ET user library >/usr/local/clas/devel/coda/Darwin_powerpc/lib/libet_user.so< will be used

Oct-3-2007: got new version from Carl, test et_2_et: can connect through multiple ports to multiple ETs, BUT cannot connect between machines on different subnets. Tried DIRECT - does not work, ask Carl to check.

Oct-4-2007: pcal EB and ER are not working with new ET !!! Switch to old one, need check ...

Oct-9-2007: final check, everything looks good; add new option 'direct' to 'et_2_et', will be used for communication between machines which are not sharing any subnets, in that case only one ET system on remote machine can be connected as before; if machines are sharing at least one subnet, there is no limitations for the number of ET systems we can communicate with - good enough for CLON cluster

Oct-9-2007: done

- order extended maintenance kit for clonhp2

Oct-9-2007: done


=