I would use pdf - to- text and then feed the data to github.com/vi3k6i5/flashtext with annotated dictionary of keywords. Elasticsearch seems to complex solution since you only want some basic filtering like Qa/Devops/Java Dev/Python dev to group resumes by category.
If you still want to play with tensorflow I suggest you to think about what kind of feature you can extract from image and what kind of output to expect. That's 90% of success the rest 10% is just to code up tensorflow model
Hey, Alex thanks for the answer makes sense to me!
I have a question: So what you are saying is to create a dictionary with the keywords and then extract the pdf to text and filter the cv correspondingly? Right?
I would use pdf - to- text and then feed the data to github.com/vi3k6i5/flashtext with annotated dictionary of keywords. Elasticsearch seems to complex solution since you only want some basic filtering like Qa/Devops/Java Dev/Python dev to group resumes by category.
If you still want to play with tensorflow I suggest you to think about what kind of feature you can extract from image and what kind of output to expect. That's 90% of success the rest 10% is just to code up tensorflow model
Hey, Alex thanks for the answer makes sense to me!
I have a question: So what you are saying is to create a dictionary with the keywords and then extract the pdf to text and filter the cv correspondingly? Right?
How will you do the PDF -> txt?
Well that's for you to figure out :)
Why don't you just do OCR to extract the PDF to txt?