Working with text file in Python

Question

I am reading a text file with >10,000 number of lines.

results_file = open("Region_11_1_micron_o", 'r')

I would like to skip to the line in the file after a particular string "charts" which occurs at around line no. 7000 (different for different files). Is there a way to conveniently do that without having to read each single line of the file?

Possible duplicate of Reading specific lines only (Python)

Van Peer
– Van Peer

2017-11-06 17:23:38 +00:00
Commented Nov 6, 2017 at 17:23 — Van Peer
– Van Peer, Commented Nov 6, 2017 at 17:23

Miraj50 · Accepted Answer · 2017-11-06 17:29:18Z

5

If you know the precise line number then you can use python's linecache module to read a particular line. You don't need to open the file.

import linecache

line = linecache.getline("test.txt", 3)
print(line)

Output:

chart

If you want to start reading from that line, you can use islice.

from itertools import islice

with open('test.txt','r') as f:
    for line in islice(f, 3, None):
        print(line)

Output:

chart
dang!
It
Works

If you don't know the precise line number and want to start after the line containing that particular string, use another for loop.

with open('test.txt','r') as f:
    for line in f:
        if "chart" in line:
            for line in f:
                # Do your job
                print(line)

Output:

dang!
It    
Works

test.txt contains:

hello
world!
chart
dang!
It
Works

I don't think you can directly skip to a particular line number. If you want to do that, then certainly you must have gone through the file and stored the lines in some format or the other. In any case, you need to traverse atleast once through the file.

edited Nov 6, 2017 at 17:29

answered Nov 6, 2017 at 16:18

Miraj50

4,4471 gold badge25 silver badges37 bronze badges

Sign up to request clarification or add additional context in comments.

5 Comments

erhesto Over a year ago

linecache internally reads whole file into memory, so it's contradiction to OPs 'Is there a way to conveniently do that without having to read each single line of the file' need.

Miraj50 Over a year ago

@erhesto Yes, but I think if you want to go somewhere, you need to have the data somewhere, right? Take for example a list. How will you go to a particular line when you don't have the data stored somewhere. Correct me If I am wrong.

erhesto Over a year ago

Well, I totally agree with you! I'd just add this information to your answer that it might be problematic to find deterministic algorithm which might accomplish this task without reading the file at least once. Of course, it might be possible in some cases (for example - if we have predefined number of characters per line - in other words, we do know exact places of line breaks), but not in general.

DPdl Over a year ago

Thank you. The thing I do not always know the exact line number. I have to look for a certain string in the text file and start with the next line.

Miraj50 Over a year ago

@DPdl Then in that case you will have to go line by line. I shall update my answer. But if you have a rough idea of the line number then probably you can make it faster by skipping some of the lines as given in my answer.

A.Bau · Accepted Answer · 2017-11-06 16:27:04Z

1

You can use itertools.dropwhile to consume the lines up to the point you want.

from itertools import dropwhile, islice

with open(fname) as fin:
    start_at = dropwhile(lambda L: 'Abstract' not in L.split(), fin)
    for line in islice(start_at, 1, None):
        print line

answered Nov 6, 2017 at 16:27

A.Bau

821 silver badge10 bronze badges

Comments

gboffi · Accepted Answer · 2017-11-06 17:07:29Z

1

If your text file has lines whose length is evenly distributed across your file you could try with seeking into thefile

from os import stat
size = stat(your_file).st_size
start = int(0.65*size)
f = open(your_file)
f.seek(start)
buff = f.read() 
n = buff.index('\nchart\n')
start = n+len('\nchart\n')
buff = buff[start:]

answered Nov 6, 2017 at 17:07

gboffi

25.4k10 gold badges62 silver badges98 bronze badges

Collectives™ on Stack Overflow

Working with text file in Python

3 Answers 3

5 Comments

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

5 Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related