How do I split a markdown file into separate files at the heading

Question

I have a book in markdown format. I want to split it into separate files at the chapter headings. How can I do this?

So far I've used pandoc to convert a docx file into a markdown file. I've not tried anything else. I usually use php but I would imagine that trying to use regex to match the chapter headings isn't the most reliable. — weaveoftheride
– weaveoftheride, Commented Nov 24, 2015 at 10:22

Raniere Silva · Accepted Answer · 2015-11-24 10:53:41Z

8

have a book in markdown format. I want to split it into separate files at the chapter headings. How can I do this?

If you are using Pandoc, you can convert your Markdown file to EPUB, unzip the EPUB file and convert the HTML files into Markdown. Not the perfect solution but you can accomplish it with a few lines of bash script like

pandoc -f markdown -t epub -o my-book.epub my-book.md
unzip my-book.epub
for chapter in *.html
do
pandoc -f html -t markdown -o ${chapter/html/md} ${chapter}
done

You need to fix the path to the HTML files.

If you want to program something and you have some experience, shouldn't be hard to write a Python/... script to split the file.

answered Nov 24, 2015 at 10:53

Raniere Silva

2,7351 gold badge22 silver badges39 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Pier-Eric Chamberland · Accepted Answer · 2021-10-27 05:50:57Z

6

I stumbled upon a straightforward solution. Credit goes Christian Tietze and mediapathic!

`gcsplit --prefix='novelname' --suffix-format='%03d.md'  novel-file.md /##/ "{*}"`

https://christiantietze.de/posts/2019/12/markdown-split-by-chapter/

Other options:

https://github.com/marceljs/markdown-split

https://github.com/accraze/split-md

edited Oct 27, 2021 at 5:50

answered Oct 27, 2021 at 1:06

Pier-Eric Chamberland

911 silver badge6 bronze badges

Comments

evod · Accepted Answer · 2023-05-20 13:41:34Z

5

I needed that exact functionality and was not content with the solutions provided in the other answers mostly because heading tags within code blocks were not respected which lead to problems with my documents.

So I went ahead and wrote a small python tool named mdsplit to do the job. Install it via pip (pip install mdsplit) and then run this to e.g. split at level 2 headings:

mdsplit input.md --max-level 2

Only later I found out there is already a C++ based tool named mdsplit as well that does about the same:

mdsplit -i input.md -l 2

answered May 20, 2023 at 13:41

evod

1771 silver badge4 bronze badges

Comments

henyxia · Accepted Answer · 2025-10-01 13:30:39Z

0

I also needed this functionality but in an environment without Python nor additional downloadable tools.

I ended up with the following solution using awk only.

cat myfile.md |awk '{if ($0~/^## /) {++count} if (count>1) {exit} print $0}'

It will stop printing lines after the second ## header 2 is found.

answered Oct 1 at 13:30

henyxia

1291 silver badge5 bronze badges

Collectives™ on Stack Overflow

How do I split a markdown file into separate files at the heading

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

4 Answers 4

Comments

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related