Managing Input
October 18th, 2009You know, I’m pretty lazy. I don’t want to do anything. I’d be happy to just read cracked.com and tvtropes.org all day. But I’m also a megalomaniac. I wanna know everything. And I mean everything. I wanna understand about the Great Vowel Shift, the colonization of Australia, the evolution of the influenza virus, the performance of text matching… everything. Unfortunately, for now at least, the human brain can’t know everything. You can’t just pour stuff in until you run out of it. Upload Wikipedia and be done with it. You need to actually work for it and study. Slowly, painfully and did I mention slowly?
Sure, you can use a SRS to manage information so that you don’t forget the stuff again. You only learn it once and then do your daily repetitions so it stays in your brain. That’s pretty cool already, but there are two drawbacks: it’s a little boring and, worse, tiring. It’s in fact so tiring that I had to give up an otherwise pretty neat sleeping schedule. I read a few articles and forum posts on this and everyone seems to agree – you can do about 200 repetitions per day, maybe 300. More than that is too tiring, too time-consuming, too boring and maybe just plain impossible. Sure, for a few days you can do more, but I never saw anyone maintain this. You just burn out. (But you can study more, just not via SRS.)
Of course, old repetitions and new facts compete for resources. You can’t just add and add facts. Soon your daily repetitions surpass 200 and you will forget them anyway. But what is the optimal course of action? Is it better to add facts slowly to minimize the risk of burn-out? Soon, university starts again and I’ll have to study for exams. That means adding facts I don’t really wanna learn that much (at least right now), but I’m also learning other things (2 languages, for example). So on one hand, I want to learn as comfortable as possible, but on the other hand, I need to get done in time, so I’d better be fast.
What do you do in such a situation? You run a simulation, of course! I expanded my handy SRS simulator and tested a few possible configurations. I’m just going to show you the 3 most interesting ones:
A few short notes first.
I set the amount of facts to learn to 3400. That’s my current number of new, unseen facts. Don’t ask how I got that many (*shame*, *shame*), but it’s not unusual for many learners. Of course, if you are learning basically forever, there won’t be a “last new fact” – you’ll constantly add new ones. In this case, just look at the first graph and find the plateau for each curve. You will never drop below this.
The 3 configurations are:
- A -> a maximum of 100 repetitions per day; add up to 50 new facts
- B -> a maximum of 200 repetitions per day; add up to 50 new facts
- G -> a maximum of 200 repetitions per day; add up to 200 new facts
The maximum is only respected when adding new facts, not when doing due facts.
Ok, now let’s have a look at the actual graphs.
The first and most obvious thing is that G totally fails at it’s goal. After about a week, the due cards explode right in your face and you’ll have to face up to 350 repetitions per day. And this continues to happen all the time, so you are pretty likely, at some point, to just give up. You feel so bad about the extreme and unexpected workload you’re facing that it becomes counter-productive. So to just “fill up” until you hit your maximum number of comfortable repetitions a day is a really bad idea.
Furthermore, we can see that both A and B are pretty good at keeping the maximum under control, A is pretty inefficient at it. You have lazy and busy days in a pretty regular pattern, but you never max out. Not really that good, but workable nonetheless. However, what you can see is that if you are working through a fixed amount of facts, B gets you into the “lazy phase” much sooner. A has many lazy days, but on average it is actually more work than B, not less.
The second graph shows us how fast we are progressing. It takes G about 50 days to work through all facts, B needs about 80 days and A about 150. Sure, G is pretty fast, but actually not much faster than B, at only about 40% less time. However, it is quite clear that A really is slow. It needs at least twice as long to work through the same amount of facts than the other two approaches.
Conclusion
I think the data allows us to make two important conclusions.
First, do not add facts like crazy. Trying to just add facts until you fall asleep might work for a few days, but very soon you hit the point where all those repetitions burn you out. They would demand up to twice the amount of work you were capable of just a week ago, so most likely, you will just fail to do them and forget everything again. Basically, you end up really tired, demotivated and not much smarter than before. A good waste of your time.
Second, to be lazy, work harder. It might sound counter-intuitive at first, but it makes sense. If you try to be lazy early on by working below your actual threshold, you will, on most days, actually have to work more than you expected. You will learn much slower and spend a lot more time on your repetitions. The reason for this is that you are spreading out your early repetitions, which are by far the hardest. If you work a bit more and get those early repetitions over with, it will get much easier later on. If you work consistently at your threshold, it wil be easier to make a routine out of it and your progress will be faster.
Finally, the simulation allows me to pick some more useful values. If you have another look at the second graph again, you can see that A and G look a bit like stairs, but B is smooth. This means there is an optimal value of new facts per day to pick that consistently maxes out your workload by looking at the slope of the curve. This optimal value, interestingly enough, is about 42. I do not believe this to be a coincidence.







