re: TensorFlow to filter PDF files

re: I would use pdf-to-text and then feed the data to github.com/vi3k6i5/flashtext with annotated dictionary of keywords. Elasticsearch seems to com...
 

Hey Alex, thanks for the answer, it makes sense to me!

I have a question: so what you are saying is to create a dictionary of keywords, extract each PDF to text, and then filter the CVs by which keywords they contain, right?
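To check I understood, here is a rough sketch of what I have in mind. It uses only the standard library, so the regex matching stands in for flashtext, and the keyword names and the two helper functions are made up for illustration. It also assumes the CVs have already been converted to plain text.

```python
import re

# Hypothetical annotated dictionary: canonical skill -> spellings to look for.
KEYWORDS = {
    "python": ["python", "py3"],
    "machine learning": ["machine learning", "ml"],
    "tensorflow": ["tensorflow", "tf"],
}


def find_keywords(text, keywords=KEYWORDS):
    """Return the set of canonical keywords whose variants appear in text."""
    found = set()
    lowered = text.lower()
    for canonical, variants in keywords.items():
        for variant in variants:
            # \b keeps short variants like "ml" from matching inside "html".
            if re.search(r"\b" + re.escape(variant) + r"\b", lowered):
                found.add(canonical)
                break
    return found


def filter_cvs(cv_texts, required):
    """Return the names of CVs whose text mentions every required keyword."""
    required = set(required)
    return [name for name, text in cv_texts.items()
            if required <= find_keywords(text)]
```

So `filter_cvs({"a.txt": text_a, "b.txt": text_b}, ["python", "tensorflow"])` would keep only the CVs that mention both skills.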

How would you do the PDF -> txt conversion?

 
 

Why don't you just use OCR to extract the text from the PDF?
