Produce application:
Document database, Books database, Regulations database, Enterprise database, Paten database, Population database, Personal document database, Heterogeneous databases
Overview of the project
At present, the majorities of users use the database to store and publish mass information, with the growth of business, the amount of user information sustained high-speed growth, the complexity of the data management substantially increased, especially on a higher demand on the database search function.
Most software rely on search functions of the commercial database, such as SQL language implemented retrieval, the speed and accuracy can not be guaranteed, and more difficult to satisfy the depth of database mining and the effective demand, it severely limit the use of user information.
Based on many heterogeneous databases, different document formats, low speed retrieval, we have introduced a Goonie database content retrieval solutions, this project is based on information retrieval systems, it inherited the efficient retrieval technology and could achieve accurate and fast database query.
Technology Architecture
Features
Data collection of database
Goonie information retrieve technology is the core of goonie database retrieve solution project. As deep depth, and the high precision collection and high speed crawling characteristics of professional users, it has been optimized and adopted a distributed multi-threaded architecture with instruction execution, 95% information accessed to the local index in the sec level.
Support the mass visit
In the ideal environment, stand-alone systems can provide millions of times/day to visit, the response time of single visit less than 0.5 seconds, and support 100 people query per second at the same time.
Multi-database support
Now, support such as Oracle, Sybase, DB2, SQL-Server, and other mainstream relational database.
Support a variety of document formats retrieval
Support ppt, doc, xls, pdf, txt, htm, html files, and other types of document identification.
Real-Time Incremental Index
Realize the real time Index update of records add, modify, and delete and so on to database and so. From the interval of database operation to the information that users can search, it is Second-class level, and it is strong in real-time data synchronization capability.
Comprehensive search function
Provide accurate full-text search function foe the database, database multiple word fields retrieval; multiple incremental retrieval; both Chinese and English mixed retrieval; intelligent fuzzy retrieval; a wide range of results sort.
WEB-based systems management platform
Using standard Web template management interface, simply through the regular browser, administrators can remote set up the parameters of the retrieval server intuitively and control kinds of process services.
Provides a wealth of standard API interface
Systems provide Java/COM interface to meet a second development for users, seamless connect for various kinds of customer system easily ensure the stable operation of retrieval system.
Running environment
Microsoft Windows XP/NT/2000/2003/
Linux/Unix/Aix