Abstract
Speedup learning systems are typically evaluated by comparing their impact on a problem solver's performance. The impact is measured by running the problem solver, before and after learning, on a sample of problems randomly drawn from some distribution. Often, the experimenter imposes a bound on the CPU time the problem solver is allowed to spend on any individual problem. Segre et al. (1991) argue that the experimenter's choice of time bound can bias the results of the experiment. To address this problem, we present statistical hypothesis tests specifically designed to analyze speedup data and eliminate this bias. We apply the tests to the data reported by Etzioni (1990a) and show that most (but not all) of the speedups observed are statistically significant.
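The paper's own tests are more involved, but the core idea of comparing paired before/after CPU times under a time bound can be illustrated with a simple sign test. The sketch below is a hypothetical illustration, not the authors' actual method: pairs where both runs hit the bound are censored in both directions and dropped, while a pair still counts when at least one run finished, since the sign of the difference is then known. The `TIME_BOUND` value and function name are assumptions for the example.

```python
import math

TIME_BOUND = 100.0  # hypothetical CPU-time cap per problem (seconds)

def sign_test_speedup(before, after, bound=TIME_BOUND):
    """Paired sign test on before/after CPU times with censoring.

    Illustrative sketch (not the paper's exact procedure): pairs
    where both runs hit the time bound are uninformative and are
    dropped; a pair is kept if at least one run finished, because
    the direction of the difference is then determined.
    """
    wins = losses = 0
    for b, a in zip(before, after):
        if b >= bound and a >= bound:
            continue  # both censored: direction of difference unknown
        if a < b:
            wins += 1    # learning sped this problem up
        elif a > b:
            losses += 1  # learning slowed this problem down
    n = wins + losses
    if n == 0:
        return 1.0  # no informative pairs
    # Two-sided binomial p-value under H0: P(speedup) = 0.5
    k = max(wins, losses)
    tail = sum(math.comb(n, i) for i in range(k, n + 1)) / 2 ** n
    return min(1.0, 2 * tail)
```

Because the test uses only the sign of each paired difference, it sidesteps the bias the abstract describes: a run that exceeds the time bound contributes no fabricated magnitude, only (at most) a direction.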
| Original language | English (US) |
|---|---|
| Pages (from-to) | 333-347 |
| Number of pages | 15 |
| Journal | Machine Learning |
| Volume | 14 |
| Issue number | 3 |
| DOIs | |
| State | Published - Mar 1994 |
| Externally published | Yes |
Keywords
- experimental methodology
- explanation-based learning
- speedup learning
- statistics
ASJC Scopus subject areas
- Software
- Artificial Intelligence