中文 | English  
 
 
  • Goonie search engines solution project
  • 2008-01-10 10:54:08

  • Applications direction:
    Inner network search engine, Outer network cluster search engine, Industry search engine, Local search engine
    With the development of the Internet, the website has become the most important gateway to the public of corporate or institutional. Every day, a large number of potential customers, partners, investors and analysts will log enterprises or institutions website, the experience of the website will be a direct impact on the evaluation of their units.
    People have a strong direction and region about information, so user more and more accustomed to use search engines to access their specific information conveniently, but the world's best search engine can only collected 40 % information in the internet, so it unrealistic and impossible to complete our own search or website clusters search fully expect to these search engine.
      Information retrieve is the core of Goonie search engine solution project. Goonie provide a super website for special area and special industry which has real-time and fast index, strong information retrieve function. You can found information engine in inner web, web group, industry portals easily.
    Application value
    Goonie search engine is a new type of search engine capable platform; it built intelligence, personality characteristics. And it is also a powerful tool about the establishment of enterprise-level search engine. Specific applications include the following:
    Construction industries / local search engine
    Search engine can collect network information in a variety of different websites efficiently on the specific regions or specific industries, establish real-time index, provide a powerful intelligent information retrieval function, so you can access specific industries or local network resources easily, establish local industries vertical search portal and establish industry leadership.
    Construction of the network search engine
    Search engine include data content distribute services modules, integrated document, e-mail, photographs, and other unstructured data, provide a complete, intelligent, security, personalized rich enterprise search engines. It provides services of competitive intelligence, knowledge management, and decision support for enterprises and organizations.
    Website cluster search engine services
    With its advanced collection and retrieval technology, it can monitor information changes on the website periodically, establish index automatically on the changed information, manage database, document and other kinds of resource in view of full-text search in the webpage content and characteristics of Web search in various attributes webpage, provide "full, accurate, fast," retrieval services of information for users.
    Overview of the program
    Goonie search engine is consists of three parts: collection, indexing, retrieval, collection collect network information or unstructured information in the enterprise, indexing provides vast amounts of information storage and the real-time index, retrieval provides full-text retrieval and a variety of features search capabilities and multiple output processing functions.
    Main features
    Information collection
    Integration search of heterogeneous resources
    Search engine will not only search the Webpage, but also search all kinds of file systems, as well as scattered office documents, photographs, and other unstructured data in all corners of the enterprise, thus provide more comprehensive information search application.
    Synchronous search and distributed dispose
    Search engine robots adopt multi-threaded synchronous search technology adjust the number of threads dynamically according to the actual situation to realize multi-thread synchronous search. Distributed collection of information will improve collection efficiency, and shorter collection time.
    Multiple collection strategy and update strategies
    Support includes priority breadth, depth and kinds of collection strategy to provide efficient updates, to the website which has been collected, only to collect the new changes and the resources added to ensure the effectiveness of information.
    Information process
    Coding
    Identify a variety of characters automatically, including Chinese, English, Simplified Chinese, Traditional Chinese, and can be integrated into GBK encoding format.
    Content extraction
    Can analyze and filter the content of webpage, remove advertisement, copyright, columns, and other useless information automatically, accurate access to contents of the main goals.
    Release automatically
    Identify the relationship of classified article automatically through identification related-content technology, if it found that the article described the same incident remove the duplicate parts automatically.
    Information Storage and Retrieval
    Powerful unstructured data management functions
    Support TEXT, HTML, RTF, MS OFFICE, PDF, S2/PS2/PS, MARC format, and other storage, indexing and retrieval.
    Support distributed architecture of massive information processing improve the Perfect information retrieval, accuracy and speed
      Retrieve service in search engine provide kinds of function for user. Retrieve engine give kinds of special retrieve support include whole text retrieve in standard search engine such as keywords search, headline search, URL search, special phrase search, extent search, resemble search and so on intellectual function. And give kinds of sort operation function supports.
    Technology Architecture
     
    Technical features
    △Collect high-speed real-time incremental webpage collection, monitor information of website changing.
    △Ensure real-time collected and retrieve based real-time index technology.
    △Response mass information instantaneously and retrieve in Sec-level.
    △Support 1000 times per second synchronously.
    △Unique relevant retrieval and synonymous retrieval features to meet the needs of a variety of retrieval.
    △Support web page snapshot features, and view the original page.
    △Advanced Chinese change technology to avoid ambiguity and one word many meanings phenomenon, ensure accurate search results.
    △The manner of showing search results is multiplicities, it can show the different presentation for different users.
 

TEL:86(010)-52107060 Mailgooniesoft@126.com

    谷尼国际软件(北京)有限公司 版权所有
    Copyright Gooniesoft Co.,Ltd All Rights Reserved
    京ICP备09060067号

谷尼国际软件,网络舆情监控系统,互联网舆情监控系统,舆情监控系统,网络舆情监控分析系统,企业竞争情报系统,网络舆情分析报告,网络舆情软件,全文检索软件,全文检索系统,网络口碑监测系统

谷尼内容管理系统http://www.goonie.cn 授权用户:http://www.goonie.cn