Talk:Finding relations between variables in time series

From Personal Science Wiki
Jump to navigation Jump to search

Categorization[edit | edit source]

Currently this page is sorted in as a tool, though it's rather a meta-article. I wonder whether it would make more sense to file it under Topic instead but would love to hear second opinions on this! - Gedankenstuecke (talk) 12:53, 30 November 2021 (UTC)

if you think thats better then do it DG (talk)
I've moved it to a topic page and also restructured/renamed the page slightly to fit into the topic dimension. - Gedankenstuecke (talk) 08:56, 2 December 2021 (UTC)

todo suggestion[edit | edit source]

mp April 7th at 1:51 PM @ oh an additional thought, post chat — I think a “fast up / slow down” pattern would be reflected in a Markov probability distribution where any given number has a low probability of later numbers being higher than it. e.g. if prior was 7, then 7-or-lower are likely (gradual decline), but 8+ very unlikely.I was struggling to remember the language here, but I think a “first order” Markov model is where probability distributions at any given step are based only according to the previous step (and no further “memory” in the system). A “second order” is influenced by the two previous steps (a bit more memory).

First! Carefully compare variance internally to days and between days. If internal is too high this variable has too poor "distinctness". Can also look for "stability" in derivatives and between any time measured such as 12 hours or a month... or maybe strongly unlikely Markov model? Also maybe first, multimodality and outlier and anomalies.

potential sources of solutions[edit | edit source]

Convert time series into bunch of splines? Especially MARS! wow really so much I forgot! Anomalies and Events and Periodicity! mentions QS : suggested by gedankenstuek for raw sensor data. personalized time series machine learning it is. Fairly commonly recommended procedures to Data scientists. I suspect faults from not takeing into account issues with time series; no mention of unit root for example. "Analytics code is available upon request from the corresponding author." just linear lasso but lag and other issues discussed. intense math for flashcard prediction and timing adjustment! "physIQ is the only company that uses FDA-cleared, AI-based analytics to “learn” and detect even the most subtle changes in an individual’s own unique physiology 24/7." formal elf experiment support app from brown. definitely not advanced analytics but still

do not forget is also a source where to headhunt data scientists

other potential source[edit | edit source] just let microsoft do it. analysis before designing an intervention hoogendoorn and funke

to test algorithm generate data but is it time series?


maybe ask here again maybe datasets

correlation analysis what was their meta-analysis?? It might be relevant for individual analysis not just meta.

even more sources[edit | edit source]

From and more at; just survival analysis also EHR time series EHR rare binary features possibly automatic generation ER time series "robustness of this approach" deep learning interpretable DEEP, irregular time intervals, EHR, Long-term Dependencies and Short-term Correlations !!!? marked temporal point processes DEEP missing events DEEP !!!!! Gaus mix model, autoencoders similarities among different time series distance metric ! anomaly detection Time Series Imputation autoregressive models score-based diffusion models !! irregular sampling, graph neural network, dynamics of sensors purely from observational data, classify time series, healthcare ... ? LSTM DEEP REINFORCEMENT recommend exercise routines to user sepcific needs extreme events RNN s&p500 stocks !!! ML for time series LSTM ....walk-forward algorithm that also calculates point-wise confidence intervals for the predictions !!!!!!! healthcare predictive analytics, DEEP ?feature selection is not an issue? " feature engineering to capture the sequential nature of patient data, which may not adequately leverage the temporal patterns" " representations of key factors (e.g., medical concepts or patients) and their interactions from high-dimensional raw" summarises key research streams EHR predicting "The common time-series representation is indirect in extracting such information from EHR because it focuses on detailed dependencies between values in successive observations, not cumulative information. " !!! Disease prediction with ML. heterogeneity complex factors at the individual level. phase diagram "the g-formula (i.e., standardization, back-door adjustment) under serial interfer- ence. It estimates stable recurring effects, as is done in n-of-1 trials and single case experimental designs. We compare our approach to standard methods (with possible confounding) to show how to use causal inference to make better personalized recommendations for health behavior change, " crisis easy nice read. Since will be data dredging eventually and making stats test this useful." Well, the experimental design of n-of-1 trials and SCEDs actually checks for causation, not just correlation. This is why randomized controlled trials (RCTs) in clinical research are a gold-standard technique for figuring out if a new intervention or treatment actually works. “Flipping the coin” in a way balances everything else that might confuse or “confound” the way the treatment might impact the health-related outcome." 2 rules- takeways You are not measuring what you think you are measuring but enough data sources and types of things you measure you may find out what that is. IMPORTANT

also eric j daza 's papers.

funny on this stat analysis[edit | edit source]

Quick write I made here for later.[edit | edit source]

Collection is really just a matter of finding the right devices and taking the time to use them. Analysis outside of immediate obvious effect can become difficult. If the effect is subtle and drowned in other effects, or hard to measure. If the intervention is not something user can easily or wants to reproduce. If the effect take long time to build up, or is shifted in time from intervention. If the successful effect only happens under several conditions or several interventions together. If the spray and pray approach is dangerous. If the spray and pray approach only hits gold once in a while. Multiple comparison problem (see wikipedia). If user is bad at keeping records. There are probably more. There are many many apps that just do correlation and none that do anything more. Here is a list of both problems and apps.

new section : single variable validity[edit | edit source]

how to prove that what you are measuring really is what you are trying to measure. aka construct validation

quick way; compare to a scientifically validated standard.

also consider many tests all agree more or less and "divergent validity" they do not correlate with things that they should not several constructs and their relationships to each other such as ageing causes memory loss

some more thoughts[edit | edit source] the question is not answered but in the side bar similar questions are well answered. Copulas may be important? The difference between time series and non-time series seems to be that with time series the patterns are cyclical, not a specific type of pattern/shape that happens every so often. Health tracking data seems to need both. I imagine blood sugar spike after meals but meals are not eaten at constant time. Long term trend and smoothing is covered in some Arima like models. Also changepoint which is like trend but in shorter time. And outliers? THat is like changepoint. results in a pretty visualization illustration of a single time series. If any of the apps were serious this would appear there. Maybe multiple moving averages as which kernel width fits best. Maybe ecg decomposition with DWT and ICA This all called: single subject longitudinal analysis . how about Temporal Dynamics?

research chat suggests[edit | edit source]

look at the extremes of the predictor variables; like three day long terrible sleep and three day long good sleep and compare cognitive ability

another aggregator[edit | edit source]

Series analysis is not it[edit | edit source]

Old concept about constantly doing old school statistical testing to know when there is enough data to stop something like a clinical trial.