Difference between revisions of "Flash Cards as Cognitive Test"

From Personal Science Wiki
Jump to navigation Jump to search
Line 34: Line 34:
  
 
=== Specific cognitive health tests ===
 
=== Specific cognitive health tests ===
Goal of anki skill is minimize user time required to remember information on a card to a target future date. Decomposition of the general skill<ref>[[wikipedia:Job_analysis|en.wikipedia.org/wiki/Job_analysis]]</ref><ref>Foundations of Fluency: An Exploration: Reading Psychology: Vol 26, No 2 (tandfonline.com) www.tandfonline.com/doi/abs/10.1080/02702710590930519</ref> by looking for sources of time loss results in learning speed, relearning speed, forgetting, ability to remember effectively today, slowness to answer, and distraction. Each corresponds to a potential test and it's validations.       
+
Goal of anki skill is minimize user time required to remember information on a card to a target future date. Decomposition of the general skill<ref>[[wikipedia:Job_analysis|en.wikipedia.org/wiki/Job_analysis]]</ref><ref>Foundations of Fluency: An Exploration: Reading Psychology: Vol 26, No 2 (tandfonline.com) www.tandfonline.com/doi/abs/10.1080/02702710590930519</ref> by looking for sources of time loss results in learning speed, relearning speed, forgetting, ability to remember effectively today, slowness to answer, and distraction. Each corresponds to a potential test and it's validations. New theory of Disuse separates memory into retrieval strength and storage strength or middle and long term memory.<ref>www.learningscientists.org/blog/2016/5/10-1</ref><ref>www.researchgate.net/profile/Robert-Bjork-2/publication/281322665_A_new_theory_of_disuse_and_an_old_theory_of_stimulus_fluctuation/links/58b6f20945851591c5d55e96/A-new-theory-of-disuse-and-an-old-theory-of-stimulus-fluctuation.pdf</ref>      
 
==== Recall<ref>supermemo.guru/wiki/Recall</ref>====
 
==== Recall<ref>supermemo.guru/wiki/Recall</ref>====
  

Revision as of 23:40, 31 July 2023

Project Infobox Question-icon.png
Self researcher(s) User:DG
Related tools Anki, Spaced Repetition
Related topics Tools for Cognitive Testing

Builds on project(s)
Spaced Listening, Spaced Repetition: A Cognitive QS Method for Knowledge Acquisition
Has inspired Projects (0)


Flash cards are cards with question on one side and answers on opposite. They are used for memorization[1], making explicit[2] (requires effort to remember) declarative[3] semantic[4] memory though the goal of language learning is to make each memory automatic and therefore implicit.[5] Several computer apps automated the process and have recorded a lot of data. I expected electronic flashcard data to be useful as a cognitive test, so I started a project to analyze the data that Anki records. Turns out the project will teach users about learning and allow them to experiment with and optimize their own learning process. All flashcard apps already optimize their student's learning but do not open the process to the user, except Super Memo 18. Resulting visualizations also encourage studying by illustrating success in different way than existing apps, similar to gamification.

The project is a work in progress and much of the intended functionality is not yet working. I cannot guarantee that each of the goals will be all that useful to the end user. Project will take about ten thousand lines of code to complete, so I expect to burn out a few times before it is done. No AI or LLM was used in the making of this project.

Other Goals

Simply adding colorful plots that Anki does not already have will encourage many people to study more. Several variables may potentially correlate with general mood and feelings towards Anki.

The goal of teaching is almost as easy to guarantee because analyzing the data requires delving into the learning process. This goal may actually be the most impactful.

Project will advise user on how to optimize their studying by comparing what actually happened with what would have happened if they had done something different, according to a machine learning model. Effects will be illustrated using Partial Dependence Plots and ICE[6].

To help user conduct experiments, project will compare one part of time series with another or progress on one set of cards with another. User may have to use another tool like Open Humans.

Potential Tests of Emotion

Tests that should mostly measure user's opinion of Anki itself and possibly general mood. Possibly strongly influenced by outside factors. Useful for optimizing Anki performance but not cognitive testing. For example, expectation of reward (candy) right after session could improve performance by improving mood or opinion of Anki.

  • Starting early in the day
  • Not skipping days
  • Not getting distracted
  • Reviewing cards fast
  • Reviewing many cards

Confounders and Artifacts of Procedure

Any decent skill test will detect when subject is severely sick. For a test to be useful for optimization and experimentation, it must detect more subtle patterns. Statistical tests detect plenty of patterns in my data that are both subtle enough and clearly not generated by the same process as the rest of the data.[7] Unfortunately, those patterns could easily be artifacts of the analysis or of the test taking process. Including all available variables that should NOT correlate with target in a machine learning model and then using resulting residual errors as the final test results should help, but not all variables that cause artifacts are available. Some independent variables will not be useable by the ML model in which case I will plot them against test series with changepoints.

Some of the artifacts will differ between tests so comparing results from multiple tests may result in a single decent time series strongly dependent on skill. If these solutions are not enough, one of the tests may correlate and transfer to an established validated cognitive test. That would still allow experimentation on and optimization of a skill, but through a cognitive ability.

Cognitive Health

Everyone should track their cognitive ability as much as health-conscious people track their heartrate and exercise. IQ tests are supposed to have high 'reliability' and not change much between days for any individual. Formal cognitive testing takes too much time and effect on daily life of the specific thing the cognitive test tests is often questioned. Skill trainers and testers like typing tutors have none of the mentioned problems. However, even if a skill test is useful for optimizing the skill, things like dependence on psychological factors may not make it a good cognitive test. The skill test will have to correlate with validated cognitive tests or obviously important health things or at least transfer to other tests to become validated[8] as a cognitive or health test. The tests are unlikely to be pure in the sense of Quantified Mind' science page. This makes them better checks for general unhealth but harder to diagnose a specific problem with.

Wozniak of supermemo has found correlation between sleep and flash card skill.[9][10]

Specific cognitive health tests

Goal of anki skill is minimize user time required to remember information on a card to a target future date. Decomposition of the general skill[11][12] by looking for sources of time loss results in learning speed, relearning speed, forgetting, ability to remember effectively today, slowness to answer, and distraction. Each corresponds to a potential test and it's validations. New theory of Disuse separates memory into retrieval strength and storage strength or middle and long term memory.[13][14]

Recall[15]

Remembering on a given day. Wozniak explains it better but I would say that this should be separated in to real forgetting that changes state for several reviews or just momentary lapse. Both are measured separately making two tests. There are also more mild state of card consolidation changes that may happen that are not just forgetting. This may be a third test.

Consolidation of Memory[16]

If on a set day the cards were memorized better or worse as will be seen on their next review. This is like the following two potential identical tests but more generally about change and not about speed?

Time taken to learn a new word

If considering only the first session, learning new words in Anki is like cued verbal learning like in the CVLT, a cognitive test. Though in the CVLT[17] the cue is much weaker. And verbal learning transfers to other abilities: "There is considerable evidence that verbal learning correlates reasonably strongly with performance in a number of important practical tasks. For instance, verbal learning tests have been demonstrated to be highly correlated with prospective remembering in real life, which means remembering to perform a planned action at the appropriate time [29]."[18] Sever sleep deprivation harms verbal learning.[19] Most of time spent on this task may be on short term memory (10 min) and not long term memory (1 day) so the real test could be percent of successful learnings of new words and not speed.

Time taken to relearn a forgotten word

Like learning new word but with stronger cue or part of the memory still in mind.

Forgetting between sessions

Forgetting as a consequence of something bad happening between test days, like a concussion. This is similar to multiple sequential days being bad for longer term cards other but over a much longer period of test days. So lets say a concussion makes all cards from week to month since last review more likely to be forgotten but only those in mid term state. When the cards would be reviewed, they would affect the score and so a lowering of overall memory for some time period (before concussion or after depending). So this is not a test but rather a pattern in other tests. In each test and each session results should be subdivide by original state as well in case informational pattern?

Speed answering cards

Fluency is a goal of language learning[20] so less time per card is good.

Vs Restraint. Taking enough time before answering a prompt rather than speeding through it.

Time taken to answer a question will be compared against ML model's recommended amount. Both are likely influenced more by emotion than mental ability.

Predicting optimal interval

Choosing optimal interval (day, month) for next review after seeing answer. Also compared against ML model's recommended amount.

References

  1. en.wikipedia.org/wiki/Testing_effect
  2. en.wikipedia.org/wiki/Explicit_memory
  3. en.wikipedia.org/wiki/Declarative_learning
  4. en.wikipedia.org/wiki/Semantic_memory
  5. en.wikipedia.org/wiki/Implicit_memory
  6. Visualizing ML Models with LIME · UC Business Analytics R Programming Guide (uc-r.github.io)
  7. en.wikipedia.org/wiki/Internal_consistency
  8. en.wikipedia.org/wiki/Validity_(statistics)
  9. supermemo.guru/wiki/Sleep_and_learning#Studying_sleep_and_learning_with_SuperMemo
  10. supermemo.guru/wiki/Biphasic_life#Biphasic_learning
  11. en.wikipedia.org/wiki/Job_analysis
  12. Foundations of Fluency: An Exploration: Reading Psychology: Vol 26, No 2 (tandfonline.com) www.tandfonline.com/doi/abs/10.1080/02702710590930519
  13. www.learningscientists.org/blog/2016/5/10-1
  14. www.researchgate.net/profile/Robert-Bjork-2/publication/281322665_A_new_theory_of_disuse_and_an_old_theory_of_stimulus_fluctuation/links/58b6f20945851591c5d55e96/A-new-theory-of-disuse-and-an-old-theory-of-stimulus-fluctuation.pdf
  15. supermemo.guru/wiki/Recall
  16. supermemo.guru/wiki/Memory_consolidation
  17. en.wikipedia.org/wiki/California_Verbal_Learning_Test
  18. www.quantified-mind.com/science
  19. pubmed.ncbi.nlm.nih.gov/10688201/
  20. What's the story? The tale of reading fluency told at speed - Benjamin - 2012 - Human Brain Mapping - Wiley Online Library onlinelibrary.wiley.com/doi/abs/10.1002/hbm.21384