ZF-7169: Zend_Search_Lucene_Document_Pdf


This is a feature request for such a component.


What's the status?

There seems to be a way to implement this: http://github.com/philipnorton42/PDFSearch

not tested ...

A possible solution is in the attached file.

Example usage:

Zend_Loader::loadClass('Zend_Search_Lucene_Document_Pdf');
$content = Zend_Search_Lucene_Document_Pdf::loadPdfFile('example.pdf');

The solution posted here only works with English characters.

On line 178 you are using:

if (substr($chunk["filter"], "FlateDecode") !== false) {

it should be:

if (strpos($chunk["filter"], "FlateDecode") !== false) {

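substr() expects an int (the start offset) as its second parameter, not a string, so this call generates a lot of PHP warnings. strpos() is the function that searches for a substring.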
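For reference, here is a minimal sketch of the corrected check in isolation. The helper name chunkUsesFlate() and the sample filter strings are hypothetical, made up for illustration; only the $chunk["filter"] key and the strpos() test come from the attached file as quoted above.

```php
<?php
// Hypothetical helper wrapping the corrected check from line 178.
// strpos() returns the int offset of the match, or false when the needle
// is absent, so the strict !== false comparison is required: a match at
// offset 0 would otherwise be treated as "not found".
function chunkUsesFlate(array $chunk)
{
    return isset($chunk['filter'])
        && strpos($chunk['filter'], 'FlateDecode') !== false;
}

// Made-up sample chunks, shaped like the array the check reads from.
var_dump(chunkUsesFlate(array('filter' => '/FlateDecode')));    // match at offset 1
var_dump(chunkUsesFlate(array('filter' => 'FlateDecode')));     // match at offset 0
var_dump(chunkUsesFlate(array('filter' => '/ASCIIHexDecode'))); // no match
```

The isset() guard is an addition, not part of the original line; it only avoids a notice when a chunk has no filter entry.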