Search 5,000,000+ questions and answers.

Frequently Asked Questions

Who invented RL?

RL FAQ
There is plenty of blame here to go around. Minsky, Bellman, Howard, Andreae, Werbos, Barto, Sutton, Watkins... All played important or early roles. See the history from the Sutton and Barto text.
Related Questions

Who is R.L.?

The Unofficial Kevin and Kell FAQ
R.L. was Kell's boss at HerdThinners, Inc. He may be a wolf; he's only seen as fangs and hands. He is 'fifty-ish'. While he is an adept manager and savage predator, he is not particularly computer-literate. He is another secret sufferer from Domestication, a malady and secret that he shares with Kell and Rudy. His predecessor at HerdThinners was named 'L.D.'. R.L. is married to Angelique, Kevin's ex.
Related Questions

How does RL relate to Neuroscience?

RL FAQ
Ideally, the ideas of reinforcement learning could constitute part of a computational theory of what the brain is doing and why. A number of links have been drawn between reinforcement learning and neuroscience, beginning with early models of classical conditioning based on temporal-difference learning (see Barto and Sutton, 1982; Sutton and Barto, 1981, 1990; Moore et al., 1986), and continuing through work on foraging and prediction learning (see Montague et al.
Related Questions

How does RL relate to behaviorism?

RL FAQ
Formally, RL is unrelated to behaviorism, or at least to the aspects of behaviorism that are widely viewed as undesireable. Behaviorism has been disparaged for focusing exclusively on behavior, refusing to consider what was going on inside the head of the subject. RL of course is all about the algorithms and processes going on inside the agent. For example, we often consider the construction of internal models of the environment within the agent, which is far outside the scope of behaviorism.
Related Questions

How does RL relate to Neuro-Dynamic Programming?

RL FAQ
To a first approximation, Reinforcement Learning and Neuro-Dynamic Programming are synonomous. The name "reinforcement learning" came from psychology (although psychologists rarely use exactly this term) and dates back to the eary days of cybernetics. For example, Marvin Minsky used this term in his 1954 thesis, and Barto and Sutton revived it in the early 1980's.
Related Questions

How does RL relate to the psychology of animal behavior?

RL FAQ
Broadly speaking, RL works as a pretty good model of instrumental learning, though a detailed argument for this has never been publically made (the closest to this is probably Barto, Sutton and Watkins, 1990). On the other hand, the links between classical (or Pavlovian) conditioning and temporal-difference (TD) learning (one of the central elements of RL) are close and widely acknowledged (see Sutton and Barto, 1990).
Related Questions

How does RL relate to genetic algorithms?

RL FAQ
Most work with genetic algorithms simulates evolution, not learning during an individual's life, and because of this is very different from work in RL. That having been said, there are two provisos. First, there is a large body of work on classifier systems that uses or is closely related to genetic algorithms. This work is concerned with learning during a single agent's lifetime (using GAs to organize the components of the agent's mind) and is in fact RL research.
Related Questions

My state and/or action space is huge! Can I still apply RL?

RL FAQ
Function approximation" refers to the use of a parameterized functional form to represent the value function (and/or the policy), as opposed to a simple table. A table is able to represent the value of each state separately, without confusion, interaction, or generalization with the value of any other state. In typical problems, however, there are far too many states to learn or represent their values individually; instead we have to generalize from observed to states to new, unobserved ones.
Related Questions

Is RL just trial-and-error learning, or does it include planning?

RL FAQ
Modern reinforcement learning concerns both trial-and-error learning without a model of the environment, and deliberative planning with a model. By "a model" here we mean a model of the dynamics of the environment. In the simplest case, this means just an estimate of the state-transition probabilities and expected immediate rewards of the environment. In general it means any predictions about the environment's future behavior conditional on the agent's behavior.
Related Questions

What advantages does RL offer in Operations Research problems?

RL FAQ
Using function approximation, RL can apply to much larger state spaces than classical sequential optimization techniques such as dynamic programming. In addition, using simulations (sampling), RL can apply to systems that are too large or complicated to explicitly enumerate the next-state transition probabilities.
Related Questions

Most RL work assumes the action space is discrete; what about continuous actions?

RL FAQ
It is true that most RL work has considered discrete action spaces, but this was usually done for convenience, not as an essential limitation of the ideas; and there are exceptions. Nevertheless, it is often not obvious how to extend RL methods to continuous, or even large discrete, action spaces. The key problem is that RL methods typically involve a max or sum over elements of the action space, which is not feasible if the space is large or infinite.
Related Questions

I am doing RL with a backpropagation neural network and it doesn't work; what should I do?

RL FAQ
It is a common error to use a backpropagation neural network as the function approximator in one's first experiments with reinforcement learning, which almost always leads to an unsatisfying failure. The primary reason for the failure is that backpropation is fairly tricky to use effectively, doubly so in an online application like reinforcement learning.
Related Questions

What is the range of the RL units?

ICESPY
There is no simple answer to this as the range is influenced by so many factors. In open level ground a range of 100-200 metres can be expected. Where there are obstacles such as walls of buildings and cold room enclosures, the range will be substantially reduced. The attainable range can be increased by the addition of our Repeater units. If in doubt please contact us: a plan of your site will help us assess your needs.
Related Questions

Q - What is a Raphael Frame (RL)?

A - Raphael mats are a top mat with a fillet, a back mat of the same color as the top mat, and centered in the opening is a piece of self adhesive foam board. The opening of the top mat is slightly oversized to allow you to mount your print onto the foamboard to give the appearance of 'float and rise'. You can dekeledge your print for an even more unique look. Goto the Designer Collection Page under product section to see a sample.
Related Questions

What do we have to do once we receive an e-mail report from RL?

Random Lengths: Information Services for the Forest Products...
The PDF file with the report will arrive as an “attachment” to an e-mail message. The software that you use to access your e-mail will determine where the attachment is stored and what you have to do to view and print it.
Related Questions

What do we get when we receive the e-mail report from RL?

Random Lengths: Information Services for the Forest Products...
The report is sent to you as an “attachment” to an e-mail message. The software that you use to access your e-mail will determine where the attachment is stored. The attachment is an executable self-extracting compressed (ZIP) file. You must execute the attachment before you can use the file or files that it contains. NO.
Related Questions

How can I get a sample of the XLS or DBF files from RL?

Random Lengths: Information Services for the Forest Products...
You can view/download samples of XLS and DBF files from the Random Lengths Web site at http://www.rlpi.com. Go to Internet & Fax Services, then Internet E-mail Spreadsheet and Database. Links to the samples pages are available under each publication, so you can view or download the file you wish to see. OR you can request samples from Random Lengths. Voice: 888-686-9925 (in the U.S. and Canada) or 541-686-9925, Fax: 800-874-7979 or 541-686-9629, E-mail: rlonline@rlpi.com.
Related Questions

Ace, can you fix what RL Page hath wrought?

Paul Gigot & WSJ Editorial Board: The Right Is "Not Even Rat...
Geeez, you guys are cranky. What's the matter ? Reality beginning to intrude on the dream of a "permanent Republican majority?" Nice to see a group of people express such diverse ideas and question each other's assumptions so vigourously! Meanwhile the Supreme Court is rolling along with Alito and Roberts kicking ass. Sorry if I do not care about the pet issue of the day.
Related Questions

Simple question, how many RL hours to a game month, is it 3 or 4 hours?

FAQ
The most important conversions are:- 4 RL hours = 1 game month 1 RL day = 6 game months 2 RL days = 1 game year
Related Questions

I worked on Saturday (after 1pm), I should get RL, but system reject my application, what to do?

LMS - Help Centre
We developed LMS based on the policy given by the President of Cosmopoint, nothing we (LMS developer) can do until Mr. President ask us to change it. Meanwhile please contact President Office to make any suggestion or comment about this matter!
Related Questions

Is the RL-i Series really an unadulterated TC2+ driver?

SoundSplinter.com - Frequently Asked Questions ]
Yes. Aside from the cosmetics, the RL-i drivers are mechanically identical to the TC2+ standard driver engineered by the savvy sub connoisseurs over at TC Sounds.
Related Questions

Why don't you recommend vented enclsoures for the RL-i series?

SoundSplinter.com - Frequently Asked Questions ]
Simply stated, because the Premium series is geared better for the job. The RL-i Series is certainly capable of being used in a vented application, however ported enclosures cause the driver(s) to absorb more power from the amplifier, thus the risk of thermal failure is greater on the RL-i's 2" voicecoil with respect to the RL-p's 3" voicecoil. The RL-i's Qt parameter is also not optimal for vented enclosures, optimal being a Qt of 0.4 or less.
Related Questions

How low can analytes be detected? What is an MDL? LOQ? RL? PQL? DLR?

BSK Analytical Laboratories
There are many terms used in the environmental laboratory business to describe the lowest detectable quantities of analytes in a sample matrix. While many are similar, BSK Labs uses these definitions for final calculating and reporting of results: MDL (Method Detection Limit): Minimum concentration of a substance that can be reported with 99% confidence that the analyte concentration is greater than zero. Empirically determined by each laboratory for each analyte, method, instrument, and matrix.
Related Questions

initially purchased - what comes with the RL 550 B or the XL 650?

Dillon Precision Reloading Frequently Asked Questions
Automatic Powder Measure and Powder Die, inc. both Small (pistol) and Large (rifle) powder charge bars. (Handles 99% of all calibers.)
Related Questions

Got A Question? Ask Our Community!


© Copyright 2007-2008 QueryCAT
About • Webmasters • Contact