圖書標籤: lucene 搜索引擎 信息檢索 java IR Lucene 自然語言處理 計算機科學
发表于2025-02-25
Lucene in Action, Second Edition pdf epub mobi txt 電子書 下載 2025
HIGHLIGHT New edition of top-selling book on the new version of Lucene--the core open-source technology behind most full-text search and "Intelligent Web" applications. DESCRIPTION When Lucene first hit the scene five years ago, it was nothing short of amazing. By using this open-source, highly scalable, super-fast search engine, developers could integrate search into applications quickly and efficiently. A lot has changed since then--search has grown from a "nice-to-have" feature into an indispensable part of most enterprise applications. Lucene now powers search in diverse companies including Akamai, Netflix, LinkedIn, Technorati, HotJobs, Epiphany, FedEx, Mayo Clinic, MIT, New Scientist Magazine, and many others. Some things remain the same, though. Lucene still delivers high-performance search features in a disarmingly easy-to-use API. Due to its vibrant and diverse open-source community of developers and users, Lucene is relentlessly improving, with evolutions to APIs, significant new features such as payloads, and a huge increase (as much as 8x) in indexing speed with Lucene 2.3. And with clear writing, reusable examples, and unmatched advice on best practices, Lucene in Action, Second Edition is still the definitive guide to developing with Lucene. KEY POINTS * Completely revised and updated to current Lucene 2.3 APIs. * Practical coverage, like how to index MS Word, PDF, HTML, and XML. * Full introduction to Intelligent Web topics like smart searching, sorting, and filtering.
MICHAEL MCCANDLESS has been building search engines for over a decade. In 1999,with three other people, he founded iPhrase Technologies, a startup providing usercentric enterprise search engine software, written in Python and C++. After IBM acquired iPhrase in 2005, Michael became involved in Lucene and started contributing patches, becoming a committer in 2006 and PMC member in 2008. Michael received his B.S., M.S and Ph.D. from MIT, and now lives in Lexington, MA along with his wonderful wife, Jane, and four delightful kids, Mia, Kyra, Joel and Kyle. Michael’s blog is at http://chbits.blogspot.com.
ERIK HATCHER codes, writes, and speaks on technical topics that he finds fun and challenging. He has written software for a number of diverse industries using many different technologies and languages. Erik coauthored Java Development with Ant (Manning,2002) with Steve Loughran, a book that has received industry acclaim. Since the release of Erik’s first book, he has spoken at numerous venues including the No Fluff, Just Stuff symposium circuit, JavaOne, O’Reilly’s Open Source Convention, JavaZone, devoxx, user groups, and even sometimes webinars. As an Apache Software Foundation member, he is an active contributor and committer on several Apache projects including Lucene and Solr. Erik proudly presents his favorite technologies passionately, recently notables are Solr, Solritas, Flare, Blacklight, and solr-ruby—preferring to dabble at the intersection of user experiences and Solr. Erik cofounded Lucid Imagination, where he helps carry the torch for open-source search goodness. Erik keeps fit and serene in central Virginia.
OTIS GOSPODNETIC ′ has been a Lucene developer since before Lucene became Apache Lucene. He is the co-founder of Sematext, a company that focuses on providing services and products around search (focusing on Lucene, Solr, and Nutch) and analytics (think BigData, Hadoop, etc.). Otis has given talks about Lucene and Solr over the years and some of his previous technical publications include articles about Lucene, published by O’Reilly Network and IBM developerWorks. Years ago, Otis also wrote To Choose and Be Chosen: Pursuing Education in America, a guidebook for foreigners wishing to study in the United States; it’s based on his own experience. Otis currently lives in New York City where he runs the NY Search & Discovery Meetup.
中文版翻譯得實在太差瞭,原版的第2章、3、4章值得好好讀下,雖然lucene都到8.2版本瞭,但這些內容並不過時。
評分附錄B關於Lucene索引格式的說明非常棒
評分因為工作需要開始瞭Lucene的學習, 雖然纔開始但覺得是一門非常有用的技術。雖然它的搜索領域還是有局限的, 但核心就是 現在的信息太多, 我們如何能夠獲取我們想要的信息, 是一個很重要的領域。 其實像豆瓣FM, Jing.FM,在我看來就是個性化的IR, 我們身邊不缺音樂,而是根據我們的偏好和情緒來選擇相應的音樂, 可能它們並沒有用到Lucene但是核心沒變, 從海量音樂中截取顧客最喜歡的。
評分附錄B關於Lucene索引格式的說明非常棒
評分開源的IR係統中lucene是做得最好最有名,本書詳細介紹瞭重要的模塊。但是我最喜歡的是最後的例子:LinkedIn,SIREn他們所使用的技術和實現方法。在一個更高層次的觀欖全局,真的讓我學到瞭很多東西。
抛去翻译的问题,还是一本不错的lucene入门读物。最少可以让读者知道怎么简单的使用Lucene,进行简单的性能调整。不过现在lucuen已经扩展出太多的应用,无论是中文分词,文件系统调整或者动态的及时索引更新等问题都是没有讨论。当然作者是老外人家不分词,这个我忘记了。有兴...
評分昨天去图书城,在最显眼的位置就是一堆Lucene实战!花了点时间翻了翻,个人感觉翻译得一般,很多翻译的都很直白,在因为中很多有前后语义逻辑关系的,翻译过后就看不出有这层关系了。不过可以理解的是,原版是09年6月左右出的,然后联系出版社,翻译,校对等等都是很需...
評分不错的一本书,对Lucene,或者说,Search中的一些关键点都有详细的讲述。 看完后再去看源代码,可以做到事半功倍。
評分做Lucene也只有这本书能参考了,没啥选择。还不错,全面,重要的细节也讲了,做Lucene必备参考书。
Lucene in Action, Second Edition pdf epub mobi txt 電子書 下載 2025