A simple Java application (a set of Java classes) that creates PDFs, reads text from them, extracts individual words, and fetches the coordinates of each word in the document.
This application uses the open-source Apache PDFBox library, which supports creating new PDF documents, manipulating existing documents, and extracting content from documents.
Reference link: https://pdfbox.apache.org/index.html
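
As a rough illustration of the word-coordinate step, the sketch below groups positioned characters into words and computes a bounding box for each one. It is not taken from this project's classes. PDFBox reports per-character positions as TextPosition objects (for example inside a PDFTextStripper subclass); here a hypothetical Glyph class stands in for them so the example runs with the JDK alone. The names Glyph, WordBox and WordGrouper, and the convention that (x, y) is the top-left corner of a box, are assumptions made for this example.

```java
import java.util.ArrayList;
import java.util.List;

/** Hypothetical stand-in for one positioned character (PDFBox would supply a TextPosition). */
final class Glyph {
    final String text;
    final float x, y, width, height;

    Glyph(String text, float x, float y, float width, float height) {
        this.text = text;
        this.x = x;
        this.y = y;
        this.width = width;
        this.height = height;
    }
}

/** A word together with its bounding box; (x, y) is the top-left corner in this sketch. */
final class WordBox {
    final String text;
    final float x, y, width, height;

    WordBox(String text, float x, float y, float width, float height) {
        this.text = text;
        this.x = x;
        this.y = y;
        this.width = width;
        this.height = height;
    }

    @Override
    public String toString() {
        return text + " @ (" + x + ", " + y + ") size " + width + " x " + height;
    }
}

final class WordGrouper {
    /** Splits a glyph sequence on whitespace and merges each run into one WordBox. */
    static List<WordBox> group(List<Glyph> glyphs) {
        List<WordBox> words = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        float minX = 0, minY = 0, maxX = 0, maxY = 0;

        for (Glyph g : glyphs) {
            if (g.text.trim().isEmpty()) {
                // Whitespace closes the word collected so far, if any.
                if (current.length() > 0) {
                    words.add(new WordBox(current.toString(), minX, minY, maxX - minX, maxY - minY));
                    current.setLength(0);
                }
                continue;
            }
            if (current.length() == 0) {
                // First character of a new word starts a fresh bounding box.
                minX = g.x;
                minY = g.y;
                maxX = g.x + g.width;
                maxY = g.y + g.height;
            } else {
                // Later characters grow the box to cover themselves.
                minX = Math.min(minX, g.x);
                minY = Math.min(minY, g.y);
                maxX = Math.max(maxX, g.x + g.width);
                maxY = Math.max(maxY, g.y + g.height);
            }
            current.append(g.text);
        }
        // Flush the last word when the input does not end in whitespace.
        if (current.length() > 0) {
            words.add(new WordBox(current.toString(), minX, minY, maxX - minX, maxY - minY));
        }
        return words;
    }

    public static void main(String[] args) {
        List<Glyph> glyphs = new ArrayList<>();
        glyphs.add(new Glyph("H", 10f, 100f, 5f, 12f));
        glyphs.add(new Glyph("i", 15f, 100f, 2f, 12f));
        glyphs.add(new Glyph(" ", 17f, 100f, 3f, 12f));
        glyphs.add(new Glyph("y", 20f, 100f, 4f, 12f));
        glyphs.add(new Glyph("o", 24f, 100f, 4f, 12f));
        for (WordBox w : group(glyphs)) {
            System.out.println(w);
        }
    }
}
```

With PDFBox on the classpath, the Glyph values would come from each TextPosition instead of being written by hand. This sketch splits only on whitespace characters; real PDFs often omit explicit spaces, so a gap-based threshold between neighbouring characters may be needed as well.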