How to replace a word in pdf with Python [duplicate]

Question

i want to replace a word in a pdf but when i try to do that it always returns me same pdf. Here is my code block. Currentyle i am using pypdf2 but if is there any suggestion i can switch it. What is the missing part at my code?

  with open(file_path, 'rb') as file:
        pdf_reader = PdfFileReader(file)
        # Encrypt the word in the PDF content
        encrypted_word = self.cipher.encrypt(word_to_encrypt_bytes)
        encrypted_word_b64 = base64.b64encode(encrypted_word)
        # Write the encrypted PDF content to a new PDF file
        pdf_writer = PdfFileWriter()
        for i in range(pdf_reader.getNumPages()):
            page = pdf_reader.getPage(i)
            page_content = page.extractText()
            page_content_b = page_content.encode('utf-8')
            page_content_b = page_content_b.replace(word_to_encrypt.encode(), encrypted_word_b64)
            page_content = page_content_b.decode('utf-8')
            pdf_writer.addPage(page)

        output_path = os.path.join(file_dir, file_name_without_ext + '_encryptedm' + ext)
        with open(output_path, 'wb') as output_file:
            pdf_writer.write(output_file)

I want to place a word in my pdf.

This has already been answered : stackoverflow.com/questions/41769120/… — S M
– S M, Commented Feb 20, 2023 at 10:11

Jorj McKie · Accepted Answer · 2023-02-25 20:43:23Z

0

Forgive me when suggesting a solutiuon with PyMuPDF. Example page text:

Suppose we want to correct the misspelling, we can use this snippet:

In [1]: import fitz  # PyMuPDF
...
In [9]: doc=fitz.open("test.pdf")
In [10]: page=doc[0]
In [11]: words=page.get_text("words")  # extract variant by single words
...
In [13]: for word in words:
    ...:     if word[4] == "Currentyle":
    ...:         page.add_redact_annot(word[:4],text="Currently")
    ...:
In [14]: page.apply_redactions()
Out[14]: True
In [15]: doc.ez_save("text-replaced.pdf")

Gives us this:

answered Feb 25, 2023 at 20:43

Jorj McKie

3,2831 gold badge17 silver badges24 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

dsDjango Over a year ago

Thanks a lot. I have one more question the how can fix the position of new item? Because as you realized it moves a little bit down and also i have another problem some words not readable maybe because of Turkish alphabet and my font size and font is different how can i fix this?

Jorj McKie Over a year ago

Of course I showed the most basic approach only. With a little more effort, you can position the replacing text centered (or otherwise aligned) in the free space. As well it is possible to position the new text vertically at the precise position. You can also extract all font information of the old text, and then use all this for inserting the new text. If you need this level of precision (I realize you do!) more code is obviously needed - but everything can be done. May I invite you to continue this discussion here.

Collectives™ on Stack Overflow

How to replace a word in pdf with Python [duplicate]

1 Answer 1

2 Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

2 Comments

Linked

Related