2025-03-31
To do
https://pdfbox.apache.org/
https://stackoverflow.com/questions/18098400/how-to-get-raw-text-from-pdf-file-using-Java
https://stackoverflow.com/questions/50692771/multiple-pdf-file-to-txt-in-Java
https://stackoverflow.com/questions/30570196/how-to-convert-pdf-into-text-file-using-itext-liberary
https://stackoverflow.com/questions/23813727/how-to-extract-text-from-a-pdf-file-with-apache-pdfbox
https://stackoverflow.com/questions/583615/pdf-to-text-tool-or-java-library
https://stackoverflow.com/questions/17986305/how-can-i-convert-pdf-file-to-word-file-using-java
Lucene full-text search
https://www.toptal.com/database/full-text-search-of-dialogues-with-apache-lucene
(https://github.com/dougsparling/lucene-testbed)
https://stackoverflow.com/questions/6807701/lucene-full-text-search
https://medium.com/@wkrzywiec/full-text-search-with-hibernate-search-lucene-part-1-e245b889aa8e
(https://github.com/wkrzywiec/Library-Spring/tree/163fbbac65750b199cc665a2ba61fd4b80fc2ff6)
https://blog.csdn.net/forfuture1978/article/details/4711308
https://blog.csdn.net/yerenyuan_pku/article/details/72582979
https://blog.csdn.net/u014704496/article/details/40408387
https://www.baeldung.com/lucene-file-search
(https://github.com/eugenp/tutorials/tree/master/lucene)
https://github.com/tantivy-search/tantivy
https://www.wave-access.com/public_en/blog/2014/october/02/full-text-search-by-using-apache-lucene.aspx
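The links above all build on one core idea, the inverted index: a map from each term to the documents that contain it. The sketch below is a toy illustration of that idea in plain Java. It is not Lucene and does not use Lucene's API; the class name TinyInvertedIndex, its methods, and the sample documents are all hypothetical. It only shows how extracted PDF text could be indexed and queried with AND semantics.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeSet;

// Toy inverted index for illustration only. This is NOT Lucene's API;
// every name here is hypothetical.
public class TinyInvertedIndex {
    // term -> sorted set of ids of the documents containing that term
    private final Map<String, SortedSet<Integer>> postings = new HashMap<>();

    // Tokenize the text and record docId under every term it contains.
    public void add(int docId, String text) {
        for (String term : tokenize(text)) {
            postings.computeIfAbsent(term, t -> new TreeSet<>()).add(docId);
        }
    }

    // AND query: return ids of documents that contain every query term.
    public SortedSet<Integer> search(String query) {
        SortedSet<Integer> result = null;
        for (String term : tokenize(query)) {
            SortedSet<Integer> docs =
                postings.getOrDefault(term, Collections.<Integer>emptySortedSet());
            if (result == null) {
                result = new TreeSet<>(docs);   // first term seeds the result
            } else {
                result.retainAll(docs);         // intersect with later terms
            }
        }
        return result == null ? new TreeSet<>() : result;
    }

    // Lowercase the text and split on anything that is not a letter or digit.
    private static List<String> tokenize(String text) {
        List<String> terms = new ArrayList<>();
        for (String t : text.toLowerCase(Locale.ROOT).split("[^\\p{L}\\p{N}]+")) {
            if (!t.isEmpty()) {
                terms.add(t);
            }
        }
        return terms;
    }

    public static void main(String[] args) {
        TinyInvertedIndex index = new TinyInvertedIndex();
        index.add(1, "Extract raw text from a PDF file");
        index.add(2, "Full-text search with Lucene");
        index.add(3, "Index extracted PDF text for search");
        System.out.println(index.search("pdf text")); // prints [1, 3]
        System.out.println(index.search("search"));   // prints [2, 3]
    }
}
```

A real Lucene setup adds analyzers, relevance scoring, and on-disk segments on top of this structure; the tutorials linked above cover the actual API.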
Extracting the table of contents (outline) from a PDF:
https://pdfbox.apache.org/docs/2.0.2/javadocs/org/apache/pdfbox/pdmodel/PDDocument.html
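Copyright notice: The content of this article was submitted by a user, and copyright belongs to the original author. This site does not own the copyright and assumes no corresponding legal liability. If you find content on this site that appears to be plagiarized or misrepresented, please contact us at jiasou666@gmail.com; once verified, the infringing content will be removed within 24 hours.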