resumeanalyser.text_reading

Module Contents

Functions

docx_to_text(filepath)

Basic function to extract text from a Word document, given a specified file path.

pdf_to_text(filepath)

Basic function to extract text from a PDF file, given a specified file path.

resumeanalyser.text_reading.docx_to_text(filepath)[source]

Basic function to extract text from a Word document, given a specified file path.

Parameters: filepath (str): A string containing the filepath.

Returns: text (str): A string containing the extracted text.

Example: >>> pdf_to_text(‘~/alphabet.docx’) ‘abcdefghijklmnopqrstuvwxyz’

resumeanalyser.text_reading.pdf_to_text(filepath)[source]

Basic function to extract text from a PDF file, given a specified file path.

Parameters: filepath (str): A string containing the filepath.

Returns: text (str): A string containing the extracted text.

Example: >>> pdf_to_text(‘~/alphabet.pdf’) ‘abcdefghijklmnopqrstuvwxyz’