Evaluation in artificial intelligence: from task-oriented to ability-oriented measurement
Published 2016 View Full Article
- Home
- Publications
- Publication Search
- Publication Details
Title
Evaluation in artificial intelligence: from task-oriented to ability-oriented measurement
Authors
Keywords
AI evaluation, AI competitions, Machine intelligence, Cognitive abilities, Universal psychometrics, Turing test
Journal
ARTIFICIAL INTELLIGENCE REVIEW
Volume 48, Issue 3, Pages 397-447
Publisher
Springer Nature
Online
2016-08-20
DOI
10.1007/s10462-016-9505-7
References
Ask authors/readers for more resources
Related references
Note: Only part of the references are listed.- Report on the 2008 Reinforcement Learning Competition
- (2017) Shimon Whiteson et al. AI MAGAZINE
- Building Watson: An Overview of the DeepQA Project
- (2017) David Ferrucci et al. AI MAGAZINE
- Competitive Benchmarking: Lessons Learned from the Trading Agent Competition
- (2017) Wolfgang Ketter et al. AI MAGAZINE
- Mapping the Landscape of Human-Level Artificial General Intelligence
- (2017) Sam Adams et al. AI MAGAZINE
- The Reinforcement Learning Competition 2014
- (2017) Christos Dimitrakakis et al. AI MAGAZINE
- Planning, Executing, and Evaluating the Winograd Schema Challenge
- (2016) Leora Morgenstern et al. AI MAGAZINE
- I-athlon: Towards A Multidimensional Turing Test
- (2016) Sam S. Adams et al. AI MAGAZINE
- Principles for Designing an AI Competition, or Why the Turing Test Fails as an Inducement Prize
- (2016) Stuart M. Shieber AI MAGAZINE
- Beyond the Turing Test
- (2016) Gary Marcus et al. AI MAGAZINE
- Computer models solving intelligence test problems: Progress and implications
- (2016) José Hernández-Orallo et al. ARTIFICIAL INTELLIGENCE
- Mastering the game of Go with deep neural networks and tree search
- (2016) David Silver et al. NATURE
- The 2014 International Planning Competition: Progress and Trends
- (2015) Mauro Vallati et al. AI MAGAZINE
- Inductive programming meets the real world
- (2015) Sumit Gulwani et al. COMMUNICATIONS OF THE ACM
- Competitions for Benchmarking: Task and Functionality Scoring Complete Performance Assessment
- (2015) Francesco Amigoni et al. IEEE ROBOTICS & AUTOMATION MAGAZINE
- Human-level control through deep reinforcement learning
- (2015) Volodymyr Mnih et al. NATURE
- Beyond the Turing Test
- (2015) J. You SCIENCE
- On our best behaviour
- (2014) Hector J. Levesque ARTIFICIAL INTELLIGENCE
- On environment difficulty and discriminating power
- (2014) José Hernández-Orallo AUTONOMOUS AGENTS AND MULTI-AGENT SYSTEMS
- Artificial Intelligence approaches for the generation and assessment of believable human-like behaviour in virtual characters
- (2014) Joan Marc Llargues Asensio et al. EXPERT SYSTEMS WITH APPLICATIONS
- An Extensible Description Language for Video Games
- (2014) Tom Schaul IEEE Transactions on Computational Intelligence and AI in Games
- How universal can an intelligence test be?
- (2013) David L Dowe et al. ADAPTIVE BEHAVIOR
- Universal psychometrics: Measuring cognitive abilities in the machine kingdom
- (2013) José Hernández-Orallo et al. Cognitive Systems Research
- Virtual and Real World Adaptation for Pedestrian Detection
- (2013) David Vazquez et al. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
- Towards UCI+: A mindful repository design
- (2013) Núria Macià et al. INFORMATION SCIENCES
- On Potential Cognitive Abilities in the Machine Kingdom
- (2013) José Hernández-Orallo et al. MINDS AND MACHINES
- The intelligence in ETI—What can we know?
- (2012) William Edmondson ACTA ASTRONAUTICA
- An anthropomorphic method for number sequence problems
- (2012) Claes Strannegård et al. Cognitive Systems Research
- Better GP benchmarks: community survey results and proposals
- (2012) David R. White et al. Genetic Programming and Evolvable Machines
- A survey of techniques for incremental learning of HMM parameters
- (2012) Wael Khreich et al. INFORMATION SCIENCES
- Experiment databases
- (2012) Joaquin Vanschoren et al. MACHINE LEARNING
- Anthropomorphism and AI: Turingʼs much misunderstood imitation game
- (2011) Diane Proudfoot ARTIFICIAL INTELLIGENCE
- Psychometric artificial intelligence
- (2011) Selmer Bringsjord JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE
- Robotics competitions as benchmarks for AI research
- (2011) John Anderson et al. KNOWLEDGE ENGINEERING REVIEW
- The changing science of machine learning
- (2011) Pat Langley MACHINE LEARNING
- Ultimate IQ: one test to rule them all
- (2011) Celeste Biever NEW SCIENTIST
- Measuring universal intelligence: Towards an anytime intelligence test
- (2010) José Hernández-Orallo et al. ARTIFICIAL INTELLIGENCE
- Human-competitive results produced by genetic programming
- (2010) John R. Koza Genetic Programming and Evolvable Machines
- Deep Machine Learning - A New Frontier in Artificial Intelligence Research [Research Frontier]
- (2010) I Arel et al. IEEE Computational Intelligence Magazine
- Learning Mazes with Aliasing States: An LCS Algorithm with Associative Perception
- (2009) Zhanna V. Zatuchna et al. ADAPTIVE BEHAVIOR
- A Survey on Transfer Learning
- (2009) Sinno Jialin Pan et al. IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING
- The TPTP Problem Library and Associated Infrastructure
- (2009) Geoff Sutcliffe JOURNAL OF AUTOMATED REASONING
- Warning: statistical benchmarking is addictive. Kicking the habit in machine learning
- (2009) Chris Drummond et al. JOURNAL OF EXPERIMENTAL & THEORETICAL ARTIFICIAL INTELLIGENCE
- Cognitive Developmental Robotics: A Survey
- (2009) M. Asada et al. IEEE Transactions on Autonomous Mental Development
- An experimental comparison of performance measures for classification
- (2008) C. Ferri et al. PATTERN RECOGNITION LETTERS
- reCAPTCHA: Human-Based Character Recognition via Web Security Measures
- (2008) Luis von Ahn et al. SCIENCE
Find Funding. Review Successful Grants.
Explore over 25,000 new funding opportunities and over 6,000,000 successful grants.
ExploreAdd your recorded webinar
Do you already have a recorded webinar? Grow your audience and get more views by easily listing your recording on Peeref.
Upload Now