In the previous few chapters, we looked at working with CSV files and then extended our scope to Excel worksheets. While CSV files use a simple text format, Excel files are stored in a binary format.
In this chapter, we will discuss two more binary file formats: .pdf and .docx. You'll learn how to generate and read PDF files, copy them, and even manipulate them to build your own header and footer formats. Did you know that you can merge many PDF files with a simple Python recipe?
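To make "binary format" a little more concrete, here is a minimal sketch that tells the two formats apart by their first few bytes, using only the standard library. A PDF file begins with the marker `%PDF-`, and a .docx file is a ZIP container, so it begins with the ZIP signature `PK\x03\x04`. The function names `sniff_format` and `make_zip_bytes` are hypothetical helpers made up for this illustration; they are not part of PyPDF2 or python-docx. Note that the ZIP signature only identifies the container, so other ZIP-based files would match it as well.

```python
import io
import zipfile

# Leading bytes ("magic numbers") of the two containers.
PDF_MAGIC = b"%PDF-"        # every PDF file starts with this marker
ZIP_MAGIC = b"PK\x03\x04"   # ZIP local file header; .docx is a ZIP container


def sniff_format(data: bytes) -> str:
    """Guess the container type from the first bytes (hypothetical helper)."""
    if data.startswith(PDF_MAGIC):
        return "pdf"
    if data.startswith(ZIP_MAGIC):
        # Could be .docx, but any other ZIP-based file matches too.
        return "zip-container"
    return "unknown"


def make_zip_bytes() -> bytes:
    """Build a tiny in-memory ZIP archive to stand in for a .docx file."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        zf.writestr("word/document.xml", "<w:document/>")
    return buf.getvalue()


if __name__ == "__main__":
    print(sniff_format(b"%PDF-1.4\n"))           # pdf
    print(sniff_format(make_zip_bytes()))        # zip-container
    print(sniff_format(b"name,age\nAda,36\n"))   # unknown (plain CSV text)
```

In practice you would read the first few bytes of a file opened in `"rb"` mode and pass them to the function. The dedicated libraries covered in this chapter take care of everything beyond this first check.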
This chapter also takes you on a journey of working with Word documents. You'll learn how to read data from Word files and write data into them. Adding tables, images, charts: you name it, and this chapter covers it. Sounds interesting? Then this chapter is definitely for you!
Specifically, we will focus on the following Python modules in this chapter:
PyPDF2 (https://pythonhosted.org/PyPDF2/)
python-docx (http://python-docx...