CS 404/504 ST:
Reliability & Performability Modeling
in addition to main course page: http://husky.if.uidaho.edu/cs504r05/
Fall 2005


Class notes

  • Session 01 (08.24.05) Introduction, syllabus, grading policy, references (ref:Shooman Ch.1, Pham Ch1.;)
  • Session 02 (08.31.05) (ref. mat.) Basic terms (system, reliability, dependability, performability, etc.); Probability theory (ref: ref:Shooman Ch.1,App.A, URLs;)
  • Session 03 (09.07.05) (ref. mat.) Markov models (derivation from probability theory); Modeling Techniques for software evaluation (ref: Shooman App.A, Tai: Ch.4;)
    • Reading material & discussion (Seba slides): Early work on software reliability, 70-ies: Excerpts from conference proceedings: Software Reliability, Infotech 1977 England slides
  • Session 04 (09.14.05) (ref. mat.) Redundancy, combinatorial reliability (series, parallel, r-out-of-n), tie-sets; reward structures (reward rates & reward impulses); (ref. Shooman Ch.3; Tai Ch.1-4; )
    • Reading material & discussion (Don slides& Jeff slides): Ellison's papers "Survivable Network systems" & "Survivable Network Analysis Method".
  • Session 05 (09.21.05) (ref. mat.) MTTF, hazard rates, parallel/standby reliability, r-out-of-n, repairable structures, reliability & MTTF; Modeling concepts; Analytic/simulation models; (ref: Shooman Ch.3, Tai Ch.1-4;)
    • Reading material & discussion (Dann slides1 2 & Kimball slides ): IEEE Comp.Soc. Series, Fault-Tolerant Software systems, ed. H.Pham, 1992.
  • Session 06 (09.28.05) (ref. mat.) coverage & system reliability & availability (ref: Shooman Ch.3, App.A,B;)
    • Reading material (Seba slides, class ref. page): Reliability in Space Shuttle.
    • Note: first draft paper due.
  • Session 07 (10.05.05) (ref. mat.) Petri Nets, Generalized Stochastic Petri Nets, modeling constructs, GSPNs, Reachability Graphs, examples, examples.
    • Reading material & discussion (Dann slides, instructor slides, class ref. page): Avionics reliability Boeing 777.
    • Reading material & discussion (Kurt slides, url1, url2): SRE - Handbook of Software Reliability, Chapter 6 and Appendix A.
  • Session 08 (10.12.05) (ref. mat.) Component redundancies, voter redundancies, passive, active, hybrid redundancies; Triple Modular Redundancy (TMR), reliability, error rate (majority voting), TMR 3-2, 3-2-1, MTTF;
    N-Modular Redundancy, voter coverage, majority voting, perfect, imperfect, and redundant voters
    • Reading material & discussion: (Don slides): Byzantine generals problem.
  • Session 09 (10.19.05) (ref. mat.) NMR, component redundancies, voter redundancies, passive, active, hybrid redundancies
    NMR, TMR, 5MR with repair, reliability & MTTF; (ref: Shooman Ch.4; Lyu Ch.14;)
    • Reading material & discussion: no RM. Exam #1
  • Session 10 (10.26.05) (ref. mat.) (SHARPE ref. mat.) Higher level redundancy, imperfect voters, NMR availability with repair & imperfect voters, majority voting in digital circuits, Voting strategies, lockout (standoff) (ref: Sahner et.al SHARPE book Ch.4,5,6; Lyu Ch.14; )
    • Reading material & discussion (Kurt slides, class ref. page): FlexRay Communications System Specifications.
    • Note: second draft paper due.
  • Session 11 (11.02.05) (ref. mat.) (SHARPE, SPNP, Möbius ref.mat.) FT software and voting strategies comparison (FTS adjudicator, pairwise, adaptive voting, choice of comparison tolerance value); FT software techniques (NVP, Consensus RB, Acceptance Voting, N-self checking programming), Bus-Guardian Unit (concept and FTU & RBD), Fault-Tolerant Trees in SHARPE;
    • Reading material & discussion (Dann slides, Don slides): Using Fault Injection to Increase Software Test Coverage, FTSW, Bieman's Fault injection; M.Vouk's papers: Excerpts from conference proceedings: 7th Int. Symposium on Software Reliability Engineering, ISSRE'96.
  • Session 12 (11.09.05) (ref. mat.) (SHARPE, SPNP, Möbius ref.mat.) More on software FT techniques (distributed recovery blocks, comparison); data diverse FT software (retry blocks, n-copy programming, data re-expression algorithms, two-pass adjudicators (RtB, NCP, DRA, TPA), examples;
    • Reading material & discussion (Kurt slides, class ref. page): Software Development Life Cycle & Reliability; (SDLC & Henderson’s book on OOD).
  • Session 13 (11.16.05) (ref. mat.) (SHARPE, SPNP, Möbius ref.mat.) Reliability optimization; optimizatio techniques (optimum vs. good solutions, improving reliability), hierarchical decomposition (graphs, sapn of control), apporionment (equal weighting, relative difficulty, relative failure rates, Albert's method, stratified optimization , availability apportionment), subsystem optimization (exhaustive enumeration), bounded enumeration;
    • Reading material & discussion (Jeff slides): IEEE 7th Symposium on Reliable Distributed Systems.
  • Week of 11.23.05 - Fall recess (Thanksgiving)
  • Session 14 (ref. mat.) (SHARPE, SPNP, Möbius ref.mat.) (11.30.05) Reliability optimization, parallel subsystem optimization, standby system optimization (min, optimum design), parallel/standby optimization comparison, greedy algorithm; (ref: Shooman Ch.7;)
    Software reliability&recovery; software error models (error-removal, error-generation); software reliability engineering (data collection & model estimation); software reliability models (deterministic - Halstead; probabilistic - error seeding (Mills), failure rate (Jelinski&Moranda), NHPP-Musa, Goel&Okumoto, hyperexponential, S-shaped) (ref: Musa Ch13; Lyu Ch.3; Pham Ch.4; Shooman Ch.5; 11.29.04)
    • Reading material & discussion (Dann slides, class ref. page, Kurt slides 1 ,2,): Life Cycle Management For Dependability, F. Redmill 1997; Computational Intelligence and Reliability;
    • Note: final paper submission due.
  • Session 15 (ref. mat.) (SHARPE, SPNP, Möbius ref.mat.) (12.07.05) Computational technologies in fault-tolerance and reliability: Artificial neural networks (ANNs) in reliability; fuzzy logic (FL) in reliability (ref: Lyu Ch.17; UI EE578 F03 lectures; 12.06.04)
    • Paper presentation.


  • Reminder on UI academic calendar:
    • Nov. 24-25 Thanksgiving week, UI closed
    • Dec. 5-9 dead week; Dec. 12-16 final examinations, Dec. 16 close of Fall semester.

 

Reading material not assigned yet:

Along with the provided material, feel free to bring papers you can find or want to discuss. Your activity constitutes a portion of your grade. We will talk about details in class.

Already assigned presentations (choose a session, please):

  • Reading material & discussion (Jeff slides): Excerpts from conference proceedings: Computer Safety, Reliability, and Security, 22nd Int. Conf., SAFECOMP 2003, UK.class ref. page

 

Not taken yet:

  • Reading material & discussion (presenter not assigned yet, slides): Trivedi's sharpe papers from course site (ideas.pdf, paperSharpe.pdf, Katerina&Trivedi_00.pdf, Smith&Trivedi_88.pdf). You are welcome to discuss other Trivedi's papers as well.
  • Reading material & discussion (presenter not assigned yet, slides): Excerpts from conference proceedings: Software Reliability, State of the Art Report, Infotech 1986 England.
  • Reading material & discussion (presenter not assigned yet, slides): Excerpts from conference proceedings: Dependable Network Computing, ed. by Dimiter Avresky, Kluwer, 2000.
  • Reading material & discussion (presenter not assigned yet, slides): Excerpts from conference proceedings: DISCEX II, DARPA Information Survivability Conference & Exposition, June 2001.
  • Reading material & discussion (presenter not assigned yet, slides): Excerpts from conference proceedings: Dependable Computing for Critical Appications 7, ed. by A.Avizienis, H.Kopetz, J.C.Laprie, Vol. 12, 1999.
  • Reading material & discussion (presenter not assigned yet, slides): Reliability & quality Control, Paris, France, June 24-26, 1987, ed. M.H.Hamza, IASTED, ISBN 0-88986-128-5.
  • Reading material & discussion (presenter not assigned yet, slides): Debra Herrman's book: non-industry (IEC, IEE, ANSI/IEEE) and key industry (RTCA/DO-178B, ESA, NASA, ANSI/AIAA, MIL-STD, DEF STAN, NATO COTS, IEC, CE) standards in software safety & reliability.


 

back to main page