PDF TO WORD CONVERTER

Premises

[UPDATE 2024] This project was developed rapidly, based on my skills and knowledge at the time (2022). It's important to note that during development, I may not have strictly adhered to commonly accepted architectural patterns such as the Model-View-Controller (MVC) pattern.

The primary purpose of this project was to create a study support tool for university exams, with the aim of facilitating my own learning process and that of my peers. Therefore, the main focus was on implementing the necessary features to achieve this specific goal, rather than rigidly adhering to more complex design conventions.

Due to the rapid development cycle and the focus on functionality, the graphical user interface (GUI) may not have been meticulously polished. It was functional and aimed at providing a straightforward user experience, but detailed design considerations were not prioritized.

After a year of continuous use, I can confirm that this tool has fully achieved its sole creation objective. It has proven to be a valuable ally in studying, providing practical support throughout the academic journey.

I hope that this project can be equally useful for other students in similar situations, providing them with an additional option to enhance the efficiency and effectiveness of their learning process.

Overview

The goal of this software is to convert pdf files into docx files where:

Notes on the pdf (taken with normal software such as Adobe Acrobat, Edge etc.) are converted into editable text in word.
The pdf page becomes an image in word.

The final result is then an editable docx file where for each page of the pdf there will be a docx page with the annotations in text format and a screenshot of the pdf page they were on. The image below shows just the final result.

For example given the following pdf in input

We get the following docx as output

N.B: It's possible through an option within the software to clean the images by eliminating the annotations before taking the screenshot, in this case we get the following output:

Instructions

Requirements (by downloading the source code)

Requirements (by downloading the .exe)

.exe

Interface explanation

Brief preamble

The graphical interface has not been treated in detail due to lack of time, it is minimalistic with few and simple instructions. Anyone who wants to update the graphical interface is welcome, all they have to do is work on the MainWindow.py class

The first checkbox: if checked it will be possible to select an entire folder. This will be inspected and will convert all pdf's inside.
The first checkbox: if checked it will delete the clipboard from the pdf page before taking the screenshot. (WARNING: this option will not modify the original pdf in any way so the notes will continue to be there at the end of the process)
The third element: allows you to select the pdf/folder containing the pdfs.
The fourth element: start the conversion process. (WARNING: if the file extension is not pdf the process will do nothing)
The last element: is a bar that indicates the progress of the process.

How to install all the necessary libraries

In order to do this, it is necessary to go from the terminal to the "PDF-to-Docx-converter-with-annotations" folder and type the command:

pip install -r requirements.txt

pip will take care of doing all the work.

License ☢️

PDF-to-Docx-converter-with-annotations is licensed under the GNU General Public License v3.0. Please, see the license.

Contacts 🪪

[mail] renato [ dot ] esposito1999 [ at ] outlook [ dot ] com (you can write to me in english or italian).

04/01/2023

Name		Name	Last commit message	Last commit date
Latest commit History 81 Commits
.vscode		.vscode
PDF		PDF
Resources		Resources
env		env
.DS_Store		.DS_Store
BatchPdfController.py		BatchPdfController.py
DocxController.py		DocxController.py
EntryPoint.py		EntryPoint.py
IGNORE.gitignore		IGNORE.gitignore
LICENSE		LICENSE
MainWindow.py		MainWindow.py
PathWidget.py		PathWidget.py
PdfController.py		PdfController.py
ProgressBarController.py		ProgressBarController.py
README.md		README.md
main.py		main.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

PDF TO WORD CONVERTER

Table of Contents

Premises

Overview

Instructions

Requirements (by downloading the source code)

Requirements (by downloading the .exe)

Interface explanation

How to install all the necessary libraries

License ☢️

Contacts 🪪

About

Releases 1

Packages

Languages

License

RenatoEsposito1999/Clipboard-PDF-to-Docx-converter

Folders and files

Latest commit

History

Repository files navigation

PDF TO WORD CONVERTER

Table of Contents

Premises

Overview

Instructions

Requirements (by downloading the source code)

Requirements (by downloading the .exe)

Interface explanation

How to install all the necessary libraries

License ☢️

Contacts 🪪

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages