Error calculation for a time series using autocorrelation

Ask Question

Asked 1 month ago

Modified 30 days ago

Viewed 84 times

I'm trying to calculate the autocorrelation time for a time series generated from molecular-dynamics trajectory frames. I used several methods to estimate the autocorrelation time because I need it to compute the statistical error of an observable. My first attemp was block averaging. The block-averaging plot for the standard error of the mean (SEM) shows no clear plateau. The code I used for block-averaging:

sem=[]
            
            #return sems
            for i in range(1,1000):
                block_size.append(i)
                block_mean=[]
                no_blocks=0
                for l in range(0, len(nums),i):
                    if len(nums[l:l+i])==i:
                        block_mean.append(np.mean(nums[l:l+i]))
                        no_blocks=no_blocks+1
                sem.append(np.std(block_mean)/np.sqrt(no_blocks))

Example output (block-averaging SEM vs block size): My question: If there is no clear plateau in the SEM from block averaging, is it still meaningful to attempt to estimate the statistical error for these data? If so, what methods/precautions should I use to get a reliable error estimate (or a conservative bound) from a single long trajectory of Molecular Dynamics frames?

I tried using other methods such as statsmodel.tsa.stattools.acf for autocorrelation time calculation the error I end up getting is 0.0008712 which seems a bit unrealistic in my opinion. Is there a different way to calculate a more accurate error for these data?

I used the following code to calculate error using statsmodel.tsa.stattools.acf,

def tau_cal(acf_out,nums):
    positive_acf =np.where(acf_out > 0.0000)[0]
                
    result = [acf_out[x] for x in positive_acf]
    tau = 1 + 2 * np.sum(result[1:-1])
    Neff = len(nums)/tau
    error = (statistics.stdev(nums))/np.sqrt(Neff)
    return error
time = statsmodel.tsa.stattools.acf(nums)
error = tau_cal(acf_out,nums)

edited Oct 22 at 16:04

asked Oct 22 at 4:58

Dimuthu Kodituwakku

631 silver badge7 bronze badges

I don't quite follow the calculation you're doing. It sounds like you're trying to divide your time series up into blocks with N samples apiece, take the mean of those blocks individually, then compute summary statistics on the block means. However, the code you posted won't do that. e.g. If you have 100 numbers, and a block size of 5, the code you posted will use statistics.stdev(), which will divide variance by 100, rather than dividing it by 20, which will lead an underestimate in your block variance calculation.

Nick ODell
– Nick ODell

2025-10-22 06:46:26 +00:00
Commented Oct 22 at 6:46
1

I'm sorry about the confusion. I have attached the block averaging code in the question now.

Dimuthu Kodituwakku
– Dimuthu Kodituwakku

2025-10-22 13:22:53 +00:00
Commented Oct 22 at 13:22

Add a comment |

0 Your Answer

Sign up or log in

Post as a guest

Name

Required, but never shown

Post as a guest

Name

Required, but never shown

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.

Collectives™ on Stack Overflow

Error calculation for a time series using autocorrelation

0

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

0

Know someone who can answer? Share a link to this question via email, Twitter, or Facebook.

Your Answer

Sign up or log in

Post as a guest