As new technologies develop and mature, it becomes critical to provide both formative and summative assessments on their performance. Performance assessment events range in form from a few simple tests of key elements of the technology to highly complex and extensive evaluation exercises targeting specific levels and capabilities of the system under scrutiny. Typically the more advanced the system, the more often performance evaluations are warranted, and the more complex the evaluation planning becomes. Numerous evaluation frameworks have been developed to generate evaluation designs intent on characterizing the performance of intelligent systems. Many of these frameworks enable the design of extensive evaluations, but each has its own focused objectives within an inherent set of known boundaries. This paper introduces the Multi-Relationship Evaluation Design (MRED) framework whose ultimate goal is to automatically generate an evaluation design based upon multiple inputs. The MRED framework takes input goal data and outputs an evaluation blueprint complete with specific evaluation elements including level of technology to be tested, metric type, user type, and, evaluation environment. Some of MRED’s unique features are that it characterizes these relationships and manages their uncertainties along with those associated with evaluation input. The authors will introduce MRED by first presenting relationships between four main evaluation design elements. These evaluation elements are defined and the relationships between them are established including the connections between evaluation personnel (not just the users), their level of knowledge, and decision-making authority. This will be further supported through the definition of key terms. An example will be presented in which these terms and relationships are applied to the evaluation design of an automobile technology. An initial validation step follows where MRED is applied to the speech translation technology whose evaluation design was inspired by the successful use of a pre-existing evaluation framework. It is important to note that MRED is still in its early stages of development where this paper presents numerous MRED outputs. Future publications will present the remaining outputs, the uncertain inputs, and MRED’s implementation steps that produce the detailed evaluation blueprints.
Skip Nav Destination
ASME 2010 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference
August 15–18, 2010
Montreal, Quebec, Canada
Conference Sponsors:
- Design Engineering Division and Computers in Engineering Division
ISBN:
978-0-7918-4413-7
PROCEEDINGS PAPER
The Multi-Relationship Evaluation Design Framework: Designing Testing Plans to Comprehensively Assess Advanced and Intelligent Technologies
Brian A. Weiss,
Brian A. Weiss
National Institute of Standards and Technology (NIST), Gaithersburg, MD
Search for other works by this author on:
Linda C. Schmidt,
Linda C. Schmidt
University of Maryland-College Park, College Park, MD
Search for other works by this author on:
Harry A. Scott,
Harry A. Scott
National Institute of Standards and Technology (NIST), Gaithersburg, MD
Search for other works by this author on:
Craig I. Schlenoff
Craig I. Schlenoff
National Institute of Standards and Technology (NIST), Gaithersburg, MD
Search for other works by this author on:
Brian A. Weiss
National Institute of Standards and Technology (NIST), Gaithersburg, MD
Linda C. Schmidt
University of Maryland-College Park, College Park, MD
Harry A. Scott
National Institute of Standards and Technology (NIST), Gaithersburg, MD
Craig I. Schlenoff
National Institute of Standards and Technology (NIST), Gaithersburg, MD
Paper No:
DETC2010-28928, pp. 603-616; 14 pages
Published Online:
March 8, 2011
Citation
Weiss, BA, Schmidt, LC, Scott, HA, & Schlenoff, CI. "The Multi-Relationship Evaluation Design Framework: Designing Testing Plans to Comprehensively Assess Advanced and Intelligent Technologies." Proceedings of the ASME 2010 International Design Engineering Technical Conferences and Computers and Information in Engineering Conference. Volume 5: 22nd International Conference on Design Theory and Methodology; Special Conference on Mechanical Vibration and Noise. Montreal, Quebec, Canada. August 15–18, 2010. pp. 603-616. ASME. https://doi.org/10.1115/DETC2010-28928
Download citation file:
3
Views
0
Citations
Related Proceedings Papers
Related Articles
An Approach for Testing Methods for Modeling Uncertainty
J. Mech. Des (September,2006)
Lagrangian Relaxation Approach for Decentralized Decision Making in Engineering Design
J. Comput. Inf. Sci. Eng (March,2010)
Demonstration of Solar-Generated Electricity on Demand: The Solar Two Project
J. Sol. Energy Eng (February,2001)
Related Chapters
Predicting the Learning Performance of Artificial Intelligent Systems Using Non-Homogeneous Poisson Process Models
Intelligent Engineering Systems Through Artificial Neural Networks, Volume 17
Utility Function Fundamentals
Decision Making in Engineering Design
Decision Making in Two-Dimensional Warranty Planning (PSAM-0186)
Proceedings of the Eighth International Conference on Probabilistic Safety Assessment & Management (PSAM)