In this tutorial, we’ll guide you through the process of extracting text from a batch of PDF invoices using FileDrop’s OCR (Optical Character Recognition) tools. Whether you have a stack of invoices, documents, or any other PDF files that need text extraction, FileDrop’s bulk OCR feature can save you time and effort.
Let’s get started.
Batch Extract Text from Invoices
Step 1: Accessing FileDrop’s OCR Tools
First, navigate to the “Tools” menu within FileDrop. Within the “Tools” menu, locate the “Bulk OCR” option. Click on it to begin the process. You will be prompted to provide some necessary information.
Step 3: Adding Folder ID
To proceed, you need to add your folder ID. You can find your folder ID within FileDrop. Simply copy the folder ID and paste it into the designated field.
Step 4: Language Selection
Select the language of the text in your PDF files. If your language isn’t listed, choose English as a default option.
Step 5: Choosing Output Format
Decide on the output format for the extracted text. By default, the tool will extract the text and place it into a Google Sheet. This allows each file’s content to be stored in a separate cell. You can also extract the text in a Google Doc, each file in it’s own Google Sheet, or each file in it’s own Google Doc.
Step 6: Start the OCR Process
After configuring the settings, initiate the OCR process by clicking the “Start” button. The tool will begin processing all the PDF files in your specified folder.
Depending on the number and size of your PDF files, the OCR process may take a few minutes. Keep in mind that there might be limits on the size and page count of individual PDFs.
Step 7: Receiving Email Confirmation and Results
Once the OCR process is completed, you will receive an email confirmation. This email will contain a link to access the resulting file.
Open the email and click the provided link to access the extracted text. Note that only pdf and images are supported, and there could be limits on PDF page counts.
Batch OCR Video Tutorial
Conclusion
FileDrop’s bulk OCR feature is a convenient tool for extracting text from PDF files. It’s suitable for various types of PDFs and offers the flexibility to work with different languages.