Hi all, my first post here.
Over the years, I've scanned many thousands of documents using various settings and scanners. Now I want to OCR them. I don't have any document imaging system (should I?), I just keep them in folders. I did Acrobat Action to insert a bunch of files and save them as PDFs. This worked, but created a number of problems:
1) it's sloooow
2) I have not checked yet, but I was told it results in image loss? That's the most important part I guess. Any way to wrap my jpg into pdf as is? I was told it can be done programmatically!
3) somewhere in the middle of conversion, Acrobat froze, and now I have thousands of files that have been converted, and thousands more than have not.
I am actually a programmer, but I am under a time crunch and want to do this fast, so while tips and tricks for learning to program Adobe would be very interesting, I just want to get it done at the simplest level possible, for now. I don't care if the process is very slow, as long as it's automated. The one exception is, if there are any command line arguments that could help me, I'd definitely be game (I have cygwin). Or if I get precise instructions.
While I am at it, are there any 3rd party products that do what I want?
Edit: I was told I want to look at DCTDecode
filter, but I am very new to Adobe (in fact, this is my second day); how do I do that?