You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The tk_summary_diagnostics() function is meant to provide us insight into our time series. Part of its usefulness come from being able to pinpoint irregular time series by looking at the mean/median difference between time stamps to check for gaps. The diff columns in this function display the distribution between timestamps is in terms of seconds, and while this may not be a problem when dealing with many daily time series, it can become problematic to think of weeks, months, or years in terms of seconds when every diff column is displayed in scientific notation. I'd like to see this updated to be able to display the diff statistics in terms of days, weeks, months, or years respectively to help give users a better handle on whether any irregular time series or gaps exist in the data.
Example:
# from High-Performance Time Series section 1.3.1
mailchimp_users_tbl <- read_rds("00_data/mailchimp_users.rds")
optins_day_tbl <- mailchimp_users_tbl %>%
summarize_by_time(
.date_var = optin_time,
.by = "day",
optins = n()
)
optins_day_tbl %>% tk_summary_diagnostics(.date_var = optin_time)
# monthly data set given as an example in the tk_summary_diagnostics() documentation where differences are harder to interpret
m4_monthly %>%
group_by(id) %>%
tk_summary_diagnostics()
The text was updated successfully, but these errors were encountered:
The tk_summary_diagnostics() function is meant to provide us insight into our time series. Part of its usefulness come from being able to pinpoint irregular time series by looking at the mean/median difference between time stamps to check for gaps. The diff columns in this function display the distribution between timestamps is in terms of seconds, and while this may not be a problem when dealing with many daily time series, it can become problematic to think of weeks, months, or years in terms of seconds when every diff column is displayed in scientific notation. I'd like to see this updated to be able to display the diff statistics in terms of days, weeks, months, or years respectively to help give users a better handle on whether any irregular time series or gaps exist in the data.
Example:
The text was updated successfully, but these errors were encountered: