Anamnesis is a clipboard history manager. It stores the clipboard history and offers an easy interface (GUI and command line) for performing full-text searches on it.
Stupa is an associative search engine. It lets you search related documents with high performance and high precision. Since document data and inverted indexes are kept in memory, Stupa reflects updates of documents in search results in real time. A server implementation of Stupa is possible by using Thrift.
bot_recognizer is a PHP class that can be used to recognize Web robots and handle them specially. It can check the IP address of the computer or the user agent of the browser program currently accessing the Web server to determine if it is within a range of IP addresses known to be of Web robots like search engine site crawlers or even malicious crawlers. The class can call different callback functions depending on the type of crawler that was identified. It can also be set on debug mode by taking a given IP address or string as user agent instead of the user agent string sent by the accessing browser. The Web robots information is stored in a database. The class can load that database from a text data file.
mimescan searches for a file within another file using the MIME database. It works similarly to the Unix command "file", but instead of searching only the header, it advances byte by byte, looking for the second file type. It uses the libmagic library.
G2P is a tool to help you get the music you want. It opens a Web browser to Google and instructs it to search certain criteria to find a link that lets you download the music you want, whether it be a full album or just one song.
Open Semantic Search is a personal search engine that features full-text search, faceted search (interactive filters), tagging, annotation, and vizualisations (trend charts and, tag cloud, and word cloud) for documents, files, images, videos, tables, sheets, and news from different data sources such as file servers, RSS, databases, wiki, or CMS. It integrate Apache Solr with UI, connectors, semantic wiki for metadata management (tagging, annotation, and structured notes), OCR, and other tools.
Waterstone Web OS is a Web application that uses the latest Web technologies to provide an integrated Webtop experience. Its features include a blog, application dock, image/video gallery, document search, MP3 Player, calendar, file manager, and a contact manager.
Ackr is a very small subset of grep/ack/rak, for lazy developers. grep is a great tool, a very powerful tool, but often too powerful for simple needs. Ackr looks for a search string in all text files and in all subfolders from the working directory, is case insensitive, has no options, doesn't look into hidden folders/files, and displays the search term in a bold font.
InstaSearch is an Eclipse IDE plug-in for performing quick and advanced searches of source code files. It uses the Apache Lucene library for indexing and fast searching of files in the workspace. The search is performed instantly as you type, and resulting files are displayed in an Eclipse view. Each file then can be previewed using a few of the most closely matching and relevant lines. A double-click on the match leads to the matching line in the file.
InvestiGateIX is a live system for search empowering journalists to set up their own Open Source search engine on an encrypted external device to search large amounts of documents and files
The goal of Funani is to solve the management of large image and other media collections in a practical way. To help you find the data you want in big data collections, this software allows you to sort it in many different ways. Advanced queries on dates, locations, events, or people, or more generically categories and tags, allow you to narrow the search quickly. Funani can be thought of primarily as a safe. Every file put into the system remains in the system. It is an additive system, which makes unintended deletion impossible. A maintenance mode can still remove unwanted files, but standard users cannot perform this operation. Funani is also a framework designed for extensibility. When new features are requested, they can be implemented by adding new functionality and leaving the existing system alone. Each feature is completely isolated from others as long as it does not depend on them.
HogTrans provides an automatic word translation engine built on statistics of text translations used for free software. It basically provides an automatically created dictionary with multiple translations and example usages for each. HogTrans can import translations from standard GNU .mo-files.
x2search is a crawler based on machine learning algorithms that finds pages and documents that are similar to given positive and different to given negative examples. The learned classifiers can be exported and saved for later reuse. It features multiple settings for searching by domain/server, etc. and has a plug-in mechanism for adding document types to be searched.
Explicitor allows a user to scan through and download lyrics for their entire iTunes library and search for profanity or other words. It is useful for DJs or music providers who need a means of searching for profanity-free songs.
Search Engine Referrals Confluence Plugin is a Confluence plugin that displays the most recent searches on Google. When someone enters search terms on Google and clicks on one of the search results, the search terms are sent to the found Web site in the "referer" part of the HTTP request. This plugin collects this information and displays it to the user. A click on the search result opens the page Google has found. A click on the search engine icon at the left displays the corresponding Google search result page.