US20110099164A1
2011-04-28
12/909,703
2010-10-21
A system for the support and management of search for documents is presents. The system includes knowledge-database, query interface and communication to a database of documents to be searched. Information generated during a search session is collected by the system and is added to the knowledge-database. The information is ranked automatically according to the usage of that information by the user. During successive search session, or during search made by other users, the system uses the knowledge-database to support the users with keywords, queries and reference to documents. The knowledge-database is used for improvement of advertising targeting.
Get notified when new applications in this technology area are published.
H04L43/065 » CPC main
Arrangements for monitoring or testing data switching networks; Generation of reports related to network devices
G06F16/3322 » CPC further
Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data; Querying; Query formulation using system suggestions
G06N7/005 » CPC further
Computing arrangements based on specific mathematical models Probabilistic networks
H04L41/14 » CPC further
Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks Network analysis or design
H04L43/045 » CPC further
Arrangements for monitoring or testing data switching networks; Processing captured monitoring data, e.g. for logfile generation for graphical visualisation of monitoring data
H04L43/0805 » CPC further
Arrangements for monitoring or testing data switching networks; Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
H04L43/16 » CPC further
Arrangements for monitoring or testing data switching networks Threshold monitoring
The invention is related to the field of digital information management and, in particular, to search and retrieval of the same.
The growth of the Internet introduced broad access to information not available before. Access to digital information sources, once an asset of a small group of professional, holding information technology skills, has become a daily tool for millions of the Internet users.
Lower skill levels for search of such information is now an inseparable part of searches made for information by most of the users. Skill and knowledge aspects became limiting factors for successful search and retrieve of such information. Such limitations present themselves in the composition of Boolean queries, knowledge of relevant information sources such as the AltaVista index at www.altavista.digital.com (maximum coverage of Internet documents by a single index is less then 40%). Many of the documents are accessed only by a direct reference from a person with a specific knowledge. Many of the information provided by querying an index such as Yahoo are not relevant. The search process is long, queries are modified many times and many information pieces are missed.
Natural language query is one suggested replacement for Boolean query method. However, a research from Search Insider (www.searchinsider.com) indicates clearly that users prefer the Boolean query methods and that search engines such as AltaVista and Yahoo (www.yahoo.com) provide require much improvement.
The present status of information search and retrieval in the Internet is characterized by long and tedious search process, poor relevancy of retrieved documents and low level of success in retrieval of relevant documents.
It is the purpose of the present invention to provide a method and tools for efficient search and retrieval of documents. The description in reference to the Internet is maid in a way of example only. It would be appreciated by those skilled in the art, that the proposed method is applicable to any digital information source.
In a preferred embodiment of the present invention, the system comprises a Server, Clients having a Human Interface (H/I) and communication with the server, a Database, a connection to the Internet and Software of that system. A User of the Client is using the H/I to compose a query view various information pieces including retrieved documents or titles or summary of those documents. The Software traces the user's search activity, collecting data such as queries and relevant or irrelevant documents and information sources. The data is evaluated and stored in the Database.
The data is also compared to other data, collected from all users, available from the Database. A data that is relevant to assist the current search process is presented to the user. This data include highly ranked queries that are similar to the query composed by the current user and highly ranked documents retrieved in the past by those queries.
As the Database is the accumulation of many such search processes, made by a single user or by many users, in the same subject, the current user can benefit from the many hours invested by himself or by others to find information in this subject.
This method is effective for a single user searching the internet or his own computer and it is particularly effective when used within an organization or work group, whereas the organization members search for information that is relevant to the activity of that organization. Thus the Database generated overtime is highly relevant to all members of such organization or work group.
The invention will be better understood in reference to the following Figures:
A general block diagram of a preferred embodiment of the system of the present invention.
A description of a preferred embodiment of a Human Interface of the present invention.
FIG. 3A and FIG. 3B (referred to as FIG. 3).
A workflow chart, describing a preferred process of working with the system of the present invention.
Reference is made now to FIG. 1, which is a general description of one preferred embodiment of the system, presented in the form of a block diagram. A preferred embodiment of the invention is presented. System 100 for search and retrieval of documents includes:
6. System 100 is also connected to at least one source of digital information 116 by communication means 118. In the preferred embodiment presented here, such a source of digital information is the Internet. This connection may be done using any method, some of the methods are described in âUsing the Internetâ, 2nd edition by Mary Ann Pike, © 1995.
It would be appreciated that the above embodiment is provided as example and that, for example, all the functionality of system 100 can be implemented in a single computer, on a web-server or using any other infrastructure or technology.
Initial search is characterized by no data in System Database 108. At this stage System 100 can not assist the user to find the information he is looking for. In this stage System 100 traces the user's search session, collects data from the search session, processes the data and stores the results in the database.
In the present embodiment of the invention, the user composes a Boolean query using Human Interface 114. The user then submits the query to a search engine such as AltaVista. A set of titles and summaries is presented to the user by the search engine.
The user may now select some of the titles for a detailed review and ignore the others.
When the detailed document is presented to the user it is desired that the evaluation of the document, by the user, will be provided to the System. For that purpose a window is displayed with the document. The window contains a dual button interface indicating Y and N. To change from the document display back to the search session the user must click either the Y or the N. Y indicates that the document is relevant to the user and N indicates irrelevant document. System 100 also traces activities such as printing a document or saving a document on disk. These actions are considered as parts of positive evaluations of a document.
The user, repetitively composes improved queries, submits them to the search engine and evaluates results (direct or indirect evaluation). During the search session the user may submit queries to different sources such as AltaVista, Yahoo and Lycos (www.lycos.com). Also meta-search engines such as MetaCrawler (www.metacrawler.com) may be used.
When the user finishes the search session, the system has collected raw data ready for process. This data includes at least some details such as the following:
A process of the data is begun at this stage to produce a set of valuable information in the System Database, the information contains details such as:
Rank=K1Ă(v1/n1)+K2Ă(v3/n3)+K3ĂT1ââ(1)
| car | automobile | vehicle | bus | road | sand | mud | |
| car | â | OR | OR | NOT | AND | AND | AND |
| automobile | OR | â | OR | NOT | AND | AND | AND |
| vehicle | OR | OR | â | NOT | AND | AND | AND |
| bus | NOT | NOT | NOT | â | NOT | NOT | NOT |
| road | AND | AND | AND | NOT | â | OR | OR |
| sand | AND | AND | AND | NOT | OR | â | OR |
| mud | AND | AND | AND | NOT | OR | OR | â |
The data collected during the search session is then stored in the System Database together with the set of parameters and calculated ranks. This also includes submitted queries, URLs, summaries, terms and their relations and ranking.
The amount of such data accumulates more rapidly when the System is used by a group of users. This is the case in organizations, where many members of the organization search for information regarding similar subjects.
Reference is made now to FIG. 2, which demonstrates a preferred embodiment of Human Interface 114 of FIG. 1.
In order to enable a comprehensive explanation, it is assumed that an amount of data has already been collected in System Database 108 by the process described hereinabove and this data is available for assistance of searches made by individuals.
In a preferred embodiment of the invention, Query Composition section 200 contains Query window 202 where the current query is composed and Related Queries window 204 for display of relevant queries stored in System Database 108. It will be appreciated that although this preferred embodiment is explained in reference to complex Boolean queries, other embodiments of the present invention may be implemented without or partial Boolean relations. Example: Ther query âTerm1 OR Term2 AND Term3â maybe used in a relaxed form âTerm1 Term2 Term3â and allow the search engine default Boolean interpretation to take place.
Basic query composition is done in Query window 202, by typing words, expressions, Boolean operators and conventional singes. Following are some examples:
Related queries are displayed in Related Queries window 204. In a simple implementation of the present invention, terms from Query window 202 are used to search System Database for stored queries that contains such terms. The queries are displayed in Related Query window according to their rank, highest rank first.
In a more sophisticated implementation of the present invention, the queries containing terms of Query window 202 are specially ranked for display in Related Queries window 204. This ranking may first rank the queries in accordance to the number of terms from Query window 202 that are contained in these queries and then, a sub-level sorting is performed, where all queries of the same ranking are sub-sorted by their own rank (as stored in System Database 108). Other ranking schemes are possible.
In default, the top query of Related Query window 204 is highlighted. Any other query of window 204 can be highlighted by clicking on it. More then one query may be highlighted simultaneously. Double clicking a query of Query window 204 will copy the query to Query window 202, instead of what was previously in that window. Related Query window 204 will refresh then, according to the new query in Query window 202.
More queries may be observed in Related Queries window 204 by using Roll Slider 236.
Related Terms section 206 within Query Composition section 200 supports the process of query composition by listing terms that are related to the terms of the query in Query window 202. Related Terms section 206 contains four windows:
Nested Boolean relations are supported. For example, query of the type
| Key | |||
| Term | Or | And | Not |
| A | B | C, D | E, F |
| B | A | C, D | E, F |
| C | D | A, B | E, F |
| D | C | A, B | E, F |
| E | F | A, B, C, D | |
| F | E | A, B, C, D | |
Button âGoâ, 216, is used to indicate that the composition of the query is complete and the system is to retrieve information in accordance to that query.
It would be appreciated that the above example of preferred embodiment can be used in a more simple form such as displaying to the user only a list of Related Terms, in a single window. For example, the Related Terms section may display the list of âTerm1, Term2, . . . , Term9â in a single window without any additional information.
Document section 218 is used to display titles and summaries of documents in accordance to the highlighted Query in Related Queries section 204.
The titles of the documents are displayed in Title window 220 are titles that are highly relater to the highlighted query of window 204, Query2 in the example of FIG. 2.
The titles are available from System Database 108 and are displayed in order according to their ranking. High rank displayed first.
If the number of such titles is lower then No. of Titles 228 (20 in the example of FIG. 2), the next group of titles is extracted from System Database 108. These titles are related to the non-highlighted title with the highest rank in window 208. If this does not provide the required number of titles, the next group of titles is extracted from System Database 108. These titles are related to the non-highlighted title with the second highest rank in window 208. This procedure is repeated until the required number of titles is provided.
Each such group of titles is sub-sorted according to the rank of the titles.
Titles are also filtered for display by Which Title? window 232. In the example of FIG. 2, only New titles are presented. These are titles that have not been reviewed yet by the present user. The filtering action of the preferences is explained in more details hereinbelow, in the Preference section part.
A single click on a title of window 220 will effect the display of summaries in Summary window 222 as explained hereinbelow.
A double click on a title of window 220 will display the document of that title.
Window 224 is an indicative window that displays the URL of the highlighted title of Window 220.
The summaries of the documents that are displayed in Summary window 222 are related to the list of titles in window 220. The summary of the highlighted title, Title3 of window 220 in the example of FIG. 2, is displayed in bold underlined characters, at the top of window 222, indicated as Summary3. Unused area, below Summary3, is used for summaries of subsequent tiles, as provided by the size of window 222. These summaries are presented in the same order as the titles of window 220. The roll slider of this window allows both up-rolling for summaries preceding Sammary3 and down-rolling for summaries succeeding Sammary3.
Double clicking a summary will display the document of that summary.
Preferences section 226 is used for input, by the user, of preferences for the search session. A maximum number of titles to be presented is indicated in window 228, adjustable by the user. Date Range windows 230 indicate the earliest and latest dates of that rang. Only documents that were updated in System Database 108 within this range will be presented. The dates are adjustable by the user. Which Title window 232 contains multi-choice options. Three such options may be:
Database windows 234 indicates the database to be used by the system.
Local window 238 is used to select the preferred local database to perform as System Database 108. This may be one of few local databases or a portion of a database. The different options may be indicated by names such as:
This arrangement is described in more details in Israeli patent application number 119183 dated Sep. 2, 1996 (Haim Zvi Melman et al.).
The selected database is used during the user's interaction with the human interface of FIG. 2 including when a query is submitted for search (old or new query).
Internet window 240 is used to select the preferred Internet database sources. This is useful for submission of queries for search of new documents (or titles) that are not in the local database. Pre-configured selections of sources are available at this window.
The aided search session will be described hereinbelow in reference to FIG. 2 and FIG. 3.
For simplicity, all elements of FIG. 2 are indicated with the digit 2 for the hundreds digit, such as 202, 206 and 214.
All elements of FIG. 3 are indicated with the digit 3 for the hundreds digit such as 302, 306 and 314.
It will be appreciated that the system can handle multiple simultaneous search sessions, conducted by different users, independently. Each user may use his own preferences configuration in Preferences window 226. The data collected from each user is added to System Database 108 to serve all users. Parts of System Database 108 may be limited for use by only few users, not accessible to others.
In this example, the preferences of window 226 will be as indicated in FIG. 2:
| No. of Titles: | 20 | ||
| Which Title: | New | ||
| Date Range | Earliest: | 14 Feb. 1997 | |
| Latest: | 01 Jan. 1998 | ||
| Source: | Local: | Mechanics | |
| Internet: | Set 5 | ||
The user may view documents, step back to the search process modifying his search activity in a variety of ways such as composing new queries, editing queries and changing preferences of window 226.
Following predetermined conditions (such as every 10 minutes or end of search session or every logout) System 100 is processing the new search information and generates new data for System Database 108. This data is added to the previously stored data for future aided search sessions, as described hereinabove (section of Initial search process).
Aided Search with Key Terms and Queries External to Accumulated Database
It is appreciated that when a user uses the system for the first time or searches a new subject, the database might not contain Related Terms or Related Queries to support the user with the query composition process.
In the following preferred embodiment of the invention, a method is presented to support the query composition in such a case.
In this embodiment, the query composed by the user is submitted to the internet sources, such as Alta Vista (www.altavista.com). When server 102 finds no Related Terms or Related Queries; the server analyses the results page received from the internet source in search for relevant terms and or expressions.
In the following example the query âsharonâ was placed to www.altavista.com. A section of the results page received is displayed here (the term âsharonâ is underlined with double-line for easy reference):
| â>More |
| pages from en.wikipedia.org</A> <BR><BR><BR class=lb><A |
| class=res |
| href=âhttp://av.rds.yahoo.com/_ylt=A0geulwYBzhlGxABCitrCqMX;_ylu=X3 |
| oDMTBvdmM3bGlxBHBndANhdl93ZWJfcmVzdWx0BHNlYwNzcg-- |
| /SIG=12lb2st5a/EXP=1211717784/**http://www.jewishvirtuallibrary.org/jsource |
| /biography/sharon.htmlâ>Jewish Virtual Library: Ariel |
| <B>Sharon</B></A><BR><SPAN class=s>Brief profile of Ariel |
| <B>Sharon</B>, the former Prime Minister of Israel.<BR></SPAN><SPAN |
| class=ngrn>www.jewishvirtuallibrary.org/jsource/biography/sharon.html |
| </SPAN><SPAN class=rgy></SPAN><BR><A class=rgy |
| href=http://av.rds.yahoo.com/_ylt=A0geulwYBzhlGxABCytrCqMX;_ylu=X3 |
| oDMTBoMXBjOWUxBHBndANhdl93ZWJfcmVzdWx0/SIG=13ik8r56t/EXP |
| =1211717784/**http://www.altavista.com/web/results?sc=off&q=sharon |
| +domain%3Ajewishvirtuallibrary.org |
| >More |
| pages from jewishvirtuallibrary.org</A> <BR><BR><BR |
| class=lb><A class=res |
| href=âhttp://av.rds.yahoo.com/_ylt=A0geulwYBzhlGxABDCtrCqMX;_ylu=X |
| 3oDMTBvdmM3bGlxBHBndANhdl93ZWJfcmVzdWx0BHNlYwNzcg-- |
| /SIG=11n7bfa4b/EXP=1211717784/**http://en.wikipedia.org/wiki/Sharonâ> |
| <B>Sharon</B> |
| - Wikipedia, the free encyclopedia</A><BR><SPAN |
| class=s><B>Sharon</B> Mann, American voice actress working in Paris, |
| France. <B>Sharon</B> Osbourne, music promoter and TV <B>...</B> |
| <B>Sharon</B> Stone, American actress, model and producer |
| <B>...</B><BR></SPAN><SPAN |
| class=ngrn>en.wikipedia.org/wiki/Sharon </SPAN><SPAN |
| class=rgy></SPAN><BR><A class=rgy |
| href=âhttp://av.rds.yahoo.com/_ylt=A0geulwYBzhlGxABDStrCqMX;_ylu=X |
| 3oDMTBoMXBjOWUxBHBndANhdl93ZWJfcmVzdWx0/SIG=13al9kbnr/EXP |
| =1211717784/**http://www.altavista.com/web/results?sc=off&q=sharon |
| +domain%3Aen.wikipedia.orgâ>More |
Related terms search algorithm may consist of a simple method such as: fins first word adjacent to the term âsharonâ. Select only single words that are separated from âsharonâ by expressions <B> or </B>, ignore connection words and punctuation marks.
This simple algorithm will provide the following Related Terms from the above section of Alta Vista results page (underlined with a single-line for easy reference):
These terms may be offered to the user as Related Terms to help him improve his query.
It would be appreciated that, using other terms extraction algorithms, additional Related terms and even Related queries may be extracted from the results pages of the internet resources.
For example, the complete term âSharon Mann, American voice actress working in Paris, Franceâ can be extracted from the result page and offered as a Related Query.
In yet another preferred embodiment of the invention, the commonly html search results of the internet source can be saved as a text file using the save function of the operation system such as Microsoft Windows XP.
A section of the results page after saving as a text file is displayed here (the term âsharonâ is underlined with double-line for easy reference):
Also here, a simple algorithm will extract relevant Related Terms. For example:
Select one word before or after the term âSharonâ with one space or new-line separation, ignore connection words and punctuation marks.
This will extract the terms underlined above with a single line:
These terms can be ranked using on-page criteria such as number of appearance in the page to provide a higher relevancy presentation order.
It is understood by those skilled in the art that expressions to serve as queries may also be extracted form this page by using more then first neighbor to the term âSharonâ.
In yet additional preferred embodiment of the present invention, the Related Queries presented to the user as response to his query may contain a composition of Related Queries extracted from the internet source result page and system database 108. Same way, Related Terms presented to the user as response to his query may contain a composition of Related Terms extracted from the internet source result page and system database 108.
As it might be desired by the user to distinguish Related Terms (or Queries) from internet sources and Related Terms (or Queries) from system database 108, the terms from the different sources may be displayed using different colors (example: blue for terms extracted from Internet resources search results and black for terms extracted from system database 108). Other distinguishing methods can be used such as underlining one group or presenting each group in a separate window.
This will enable the user to see which support is based on general algorithm and which support is based on his (or his group) pervious work.
In another embodiment of the invention, the user may select display options that provide any of the combinations such as:
Simple queries such as of one or two Key Terms are the most intuitive and are frequently the first ones composed in a search session. In another embodiment of the invention, the user may submit such a simple query that in a present art search system will fail to produce useful results and provide many falls results. In the present invention, submission of such a query will actually result in submission of a number of more sophisticated queries, these are Related Queries available from System Database 108. These queries may be those of a rank above a predetermined threshold. The results will be those that are associated with the highly ranked Related Queries. This methods enables valuable results from a simple and intuitive queryânot effective by itself.
A method is presented hereinbelow, to associate such simple queries that, in most cases, do not provide the desired results, with more sophisticated queries that are usually not intuitive and are composed only after few interaction with search results and query modifications accordingly.
A search session is defined to be the process of searching information related to a specific subject. A search session includes the composing of queries, submission of queries to a search engine, evaluation of results, modification of queries as a response to such search results, submission of such modified queriesâand so on.
Two sessions are different if the subject is different.
In the present embodiment of the invention, semi-automatic session tracing is performed by the system. Queries are identified to belong to the same search session in the following way:
New search session starts by a query (normally after the application is activated). Each new or modified query is compared to all the previous queries. If at least one keyword of this query is used in one of the previous queries of that sessionâthe new query is belongs of the same session.
If the user go through strategy change in his queries there may be no keyword relation anymore to previous queries even if this is the same search session. For example, a user may search for information on 35 mm film dimensions. He may start with a query âfilm and 35 mmâ and change strategy to search for standard organizations with the query âisoâ. To overcome this problem in the present embodiment, when this occurs, the system responds to the user with a question: âHave you started a new search session? Y/Nâ. If the user answers âNâ the queries âfilm and 35 mmâ and âisoâ are associated with the same search session.
Next time that the query âfilm and 35 mmâ will be composed by a user, a reference will also be made to the query âisoâ as a relevant query. Useful titles retrieved by the query âisoâ will be available then also for the query âfilm and 35 mmâ.
If the user replies âYâ, the previous set of queries are associated with one search session while the new query is the first one of the new search session.
In yet another embodiment of the invention, the data that is collected and processed trough the search sessions is used to generate and support direct information exchange and communication among the members of an organization.
As described hereinabove, during the search session Related Queries and Titles from System Database 108 are associated with the search activity of the user. Since such queries and titles are related also to other users who used them in the past, the names of such users can be presented to the current user for user to user communication.
In the present example, after the user has worked through at least a part of his search session, such a relation to Related Queries and Titles of System Database 108 has been established. The user interface of FIG. 2 may contain âNamesâ indicator. By clicking this indicator, System 100, opens a new window. This window contains names, email addresses and phone numbers of the users who used the Related Queries and Titles in the past.
The order of names in the window may by according to the order of Related Queries of window 236 of FIG. 2. Where two or more names are related to a query the order of presentation of these names would be according to the number of Related Queries with which the names are associated, names associated with more queries to appear first.
The names are adjacent to checkboxes. The user may now, for example, use the checkboxes to select a subgroup of the names and then select an Email option. An email form addressed to the checked names will be opened. The user may then write an email to this group of user, asking for more information about their activity in the subject of his search session.
Of course, the user may also select to use the phone numbers to call these people.
An example that highlights the usefulness of this method is provided hereinbelow:
A user may look for an international standard in a specific matter. By looking at the ISO Internet site (www.iso.ch) He may find a reference to the standard he needs but the standard itself is not available there. The standard has to be ordered separately, be paid for and delivered by mail.
In this example the user may place a relevant query in query window 202 in order to get names of people, in the organization, who are involved in that subject. Then he can email them an inquiry to find whether they have in possession the requested document. By doing so he may shorten the time and cost to get the required document.
In yet another preferred embodiment of the invention, the information accumulated in the system database 108 can be used to provide the user with Search Targeted Promotions.
In this embodiment, a Promotions Database containing items such as advertisements, coupons and special discount or other types of information is stored in the system database or another database in the system computer or any other storage accessible by the system.
These items are associated with the providers of the promotions. The association may be made in a variety of ways. In the present example the association is made by the URL of the provider. As demonstrated in the following table:
| Promotion | URL | |
| Ad1 | www.a.com | |
| Coupon1 | www.a.com | |
| Ad2 | www.b.com | |
| Coupon2 | www.c.com | |
Ad2 for example is associated with URL www.b.com.
An example step-by-step usage of the accumulated information in system database 108 is described hereinbelow:
It would be appreciated that searching for a related URL in the database is not limited to the complete and exact URL only. A more sophisticated algorithm may search for a useful fraction of the complete URL. For example, when given a title-related URL such as âhttp://www.coveo.com/en/News/Default.aspxâ, the system can extract only â.coveo.comâ and search for this string in the Promotion Database. In such a case a related URL such as âhttp://www.coveo.com/en/Contact/ContactUs.aspxâ may be the result of this process. The string used to search a related URL can be extracted from the complete URL using simple algorithms such as:
The promotion may be presented in a variety of ways including, but not limited to, on a specific area in the results page, a separate pop-up page or a banner.
The order of displaying such promotions may be according to the order of the titles in the search results page. This will fit an already implemented relevancy algorithm generated by the user activity and thus, be of the same relevancy value to the user. If more then one promotion is in the list for the same URL, the system can be designed to display them in a pre-determined order such as alphabetic order, present only one promotion for each URL, selected arbitrarily from the promotions available for this URL or any other method.
In the following section a specific example is provided for Search Targeted Promotions.
In this example the user query database includes the following list of queries with related titles, associated URLs and ranking.
The list should be interpreted per the following key:
The list in this example:
In this example the promotion database includes the following items:
| Promotion | URL | |
| Ad1 | www.crestaurant.com | |
| Coupon1 | www.amazon.com | |
| Ad2 | www.ebay.com | |
| Coupon2 | www.pinkpearl.com | |
The step by step process is described herein as a specific and non limiting example:
| Promotion | URL | |
| Ad1 | www.crestaurant.com | |
| Coupon2 | www.pinkpearl.com | |
It would be appreciated that in a case of a database created and used by multiple users, the user may be presented with promotions that are selected with the above process except that the queries, titles and URLs used in the process are an aggregate of at least a part of the queries, titles and URLs from all users or from a users sub-group selected by the user.
In yet another embodiment of the present invention, the accumulated database can be used to target advertising or promotions to the user without the use of promotions database. In this embodiment the accumulated database is used to provide the advertising source with more information that is normally available, to support this source with better selection of more suitable advertisement. This method will be explained by the following example.
In this example the user query database includes the following queries with related titles, associated URLs and ranking [R]:
In addition, in this example the following terms database and terms ranking is available from the query database:
| Key Term | Related Key Terms [and ranking] | |
| Vancouver | Restaurants[14], Indian[2], Chinese[6] | |
| Restaurants | Vancouver[7], Indian[3], Chinese[15] | |
| Indian | Vancouver[5], Restaurants[7] | |
| Chinese | Vancouver[8], Restaurants[3] | |
| Beethoven | Symphony[7], 5[1] | |
| Symphony | 5[2], Beethoven[21] | |
| 5 | Beethoven[3], Symphony[4] | |
| Racing | Car[6], 5[2], Liter[8] | |
| Car | Racing[7], 5[1], Liter[3] | |
| 5 | Racing[2], Car[1], Liter[4] | |
| Liter | Racing[7], Car[19], 5[2] | |
The Related Key Terms are related to the Key Terms because the user once included them in a single query, meaning that he considers this relation valid for himself.
We also assume here an Internet source that includes functionality to receive a query and deliver advertisement in response to the query. Such an example is Goggle (www.google.com) Sponsored Links enabled by AdSense and AdWords programs. The term âSponsored Adsâ will be used here to refer to these advertisements.
The usage of this database for delivering of better-targeted promotions or advertising is explained here step-by-step.
It would also be appreciated that the above process can be executed by the Internet source. In this example www.Google.com may receive both queries and generate the combined result page and send it to the user already containing the search results for âVancouverâ and the Sponsored Ads for âVancouver restaurantsâ.
If the user database is available to the Internet source, the whole process can take place at the Internet source, such as www.google.com.
It would be appreciated that submitting a single term as the preliminary query is suggested hereinabove only as an example and that any query can be used at this stage.
The invention is not limited to the above usage of related terms. Also complete queries used in the past by the user can be used for this process. For example, when submitting the query âVancouverâ the system can search for a related query and find âVancouver Indian restaurantsâ as a highly ranked query in the user database. This query can be used to search for relevant Sponsored Ads.
Also more then one related term might be added to a query, and more then one subsequent queries can be submitted to extract relevant Sponsored Ads.
The above methods described using the user's related terms or query database to enhance the relevancy of Sponsored Ads to the specific user.
In addition to the user's related terms or query database, the system can use also documents that are registered to the user. Such documents can be registered to the user via the URLs in his database, as described in this invention or via âFavoritesâ as provided by Internet Explorer of Microsoft Corporation. Related terms can be extracted from such documents using the method of âAided search with Key Terms and Queries external to accumulated databaseâ described hereinabove. These terms can be used to compose the secondary query aimed for the extraction of Sponsored Ads that are better targeted to the specific user.
It would be appreciated, throughout the above examples, that at least a part of databases of different users may be integrated to serve a given user so that he can benefit from search activity of these users. So are documents and other similar sources that might resident on separate and unrelated databases. A typical such integration might be based on importing such data to a single database.
In another embodiment of the present invention, ranking can be a weighted balance between older ranking and newer ranking. This approach serves to fit better possible changes in user activity.
One such weighting method can be time driven. The ranking of an item (query, URL or any other item) is eroded as a function of time that passed since it was last used.
In this example, existing ranking of an item (R) might be multiplied each month by a factor (F). By doing so, the ranking of such item is reduced over time when not used.
For example, selecting F=0.99, the ranking of the item relative to the ranking it was last used (or ranked) will be as shown in the table below, using the following formula:
R(now)=R(last)*Fm.ââ(2)
R(now) is the current rank
R(last) is the rank of the item following the last ranking action
F is the monthly rank reduction factor
m is the number of months passed since the last ranking action
| Year | Relative ranking | |
| 1 | 0.89 | |
| 2 | 0.79 | |
| 3 | 0.70 | |
| 4 | 0.62 | |
| 5 | 0.55 | |
| 6 | 0.48 | |
| 7 | 0.43 | |
| 8 | 0.38 | |
| 9 | 0.34 | |
| 10 | 0.30 | |
If the user is ranking now and already ranked item, the ranking process may take a weighted sum of the old ranking (R0) and the new one (R1) to create the new ranking (R):
R=R0*Fm+R1ââ(3)
The above specific example provides few time-depended features for ranking management:
In another embodiment of the current invention the âranking erosionâ may be used on a basing of âordered ranking weightsâ and not as a function of time. This can be implemented using equation (3) above without the time parameter m:
R=R0*F+R1ââ(4)
In such a case a ranking of an item does not change due to passing time without ranking. The contribution of each ranking event to the accumulated ranking is lowest for the first ranking and highest for the last ranking.
Both equations (3) and (4) represent methods for weighted ranking reflecting the usage and interest of the users. The specific implementation of these or other methods depends on the needs of the user and can be configured to combine these equations and also configured to use other weighting methods.
In another embodiment of the present invention the search results can be presented with checkbox associated with each reference, as shown by reference number 242 in FIG. 2.
Part of these checkboxes, or all of them, can be checked to indicate that a specific action is desired in relation to the associated references. The action can be selected by a variety of means, such as a pull-down menu shown by reference number 244 in FIG. 2.
A particular action is a generation of a new results page containing only the items selected by checking the checkboxes. The selections made by the user are fed-back to the system. The program is configured to generate a new document containing only the selected items.
This new document containing only the selected results can be used to serve a variety of needs:
It would be appreciated by those skilled in the art that efficiency of System 100 depends on the computers in use, communication networks and other device parameters.
The flow of process, as described hereinabove may be modified to suit less efficient devices by avoiding updating the windows of FIG. 2 following any change in any window. Instead, update may be performed as a response to a predetermined partial group of changes or only by an explicit request from the user.
It is also appreciated that non-Boolean query systems, such as Natural Language Queries, may be used in the present invention.
The hereinabove embodiments are described in a way of example only and do not restrict the scope of the invention.
The scope of the invention is defined solely by the claims provided hereinbelow:
1. A computer implemented method for search and retrieval comprising at least one of: a computer, a storage device, display means, input means, a search engine; and
comprising the steps of:
submitting a first search query;
submitting at least one second search query, said at least one second query is not identical to said first query; and
display at least one results page that contains items retrieved in response to said first query and also items retrieved in response to said at least one second query.
2. The method according to claim 1 whereas said first query is used for search in a first results repository and said at least one second query is used for search in at least one second results repository.
3. The method according to claim 17 whereas said stored items are those that are registered in reference to any of the following:
a. the current user submitting said first query;
b. a user included in a predetermined group of users, said predetermination is made in relation to the current user;
c. a predetermined group of users selected by the current user; and
d. at least two predetermined groups of users, said at least two groups of users are selected by the current user.
4. The method according to claim 3 whereas said registration in reference to users is made to those users who used said items.
5. The method according to claim 4 whereas said stored items are at least one of:
a. queries submitted for search process;
b. keywords from queries submitted for search process; and
c. keywords from items retrieved in a search process in response to a query.
6. The method according to claim 1 including the steps of:
after submitting said first query; searching in a storage device for at least one stored item, containing at list a part of said first query; and
submitting at least a part of said at least one stored item as at least one of said second queries.
7. The method of claim 6 whereas said first query is used for search in a first results repository and said at least one second query is used for search in at least one second results repository.
8. The method of claim 1 whereas said at least one second query includes at least one stored keyword;
said at least one stored keyword is extracted according to matching of said at least one stored keyword to at least one keyword of said first query; and
said matching criteria is having said at least one stored keyword included in at least one submitted query that is stored in a storage device whereas said at least one stored query contains also said at least one keyword of said first query.
9. The method according to claim 1 additionally comprising the steps of:
selecting, by the user, at least one item in a results page;
indication, by the user, at least one target location;
automatic composition of a new page containing only the selected items; and
sending said new page to said at least one target location.
10. A computer implemented method for search and retrieval comprising at least one computer, storage device, display means, input means, and a search engine;
said storage device stores:
queries that have been submitted by users;
reference to documents that were retrieved using said stored queries, said references to documents are saved with a registration of which of said stored queries were used to retrieve said documents;
said references to documents include the path to said documents;
each reference to a document is stored if a user submitting said stored query that retrieved said reference to a document indicated, through his interaction with the search results, that this document is relevant to said stored query; and
a list of at least a part of paths to documents, whereas at list one of said path parts is associated with at least one stored item;
comprising the steps of:
submitting, by the user, a first search query;
identifying at least one of said stored queries as containing at least a part of said first query;
identifying at least one references to documents that is registered to said at least one identified stored query;
finding a match between a unique part of the path included in said identified at least one references to documents and at least one of the stored at least a part of paths; and
displaying at least one of said stored items associated with said matching at least one of the stored at least a part of paths.
11. The method according to claim 10 whereas at least a part of:
a. said stored queries; and
b. said stored references to documents;
is registered in reference to any of the following:
a. the current user submitting said first query;
b. a user included in a predetermined group of users, said predetermination is made in relation to the current user;
c. a predetermined group of users selected by the current user; and
d. at least two predetermined groups of users, said at least two groups of users are selected by the current user.
12. The method according to claim 11 whereas at least one of the following conditions is maintained:
a. said registration of a stored query in reference to a user is made to a user who used said query; and
b. said registration of a reference to a document in reference to a user is made to a user who interacted with said reference to a document.
13. The method according to claim 10 whereas said displaying is arranged in order according to at least one of:
a. ranking of said stored queries; and
b. ranking of said stored references to documents.
14. A computer implemented method for delivering search results comprising the steps of:
submitting, by a user, of a first search query;
displaying search results page including results retrieved with said first query;
selecting, by the user, at least one result in said results page;
indication, by the user, at least one target location;
automatic composition of a new results page containing only the results selected by the user; and
sending said new results page to said at least one target location.
15. The method of claim 14 whereas said at least one target location is at least one of the following:
a. display device;
b. a folder on a storage device; and
c. an email address.
16. The method of claim 14 whereas said results page includes
also items retrieved with at least one second search query;
said at least one second query is not identical to said first query; and
said automatically composed new results page contains also items retrieved with said second search query.
17. The method according to claim 1 including the steps of:
after submitting said first query; searching in a storage device for at least one stored item, containing at list a part of said first query; and
submitting at least a part of said at least one stored item as at least a part of one of said second queries.
18. The method according to claim 10 including the steps of:
selecting, by the user, at least one item in a results page;
indication, by the user, at least one target location;
automatic composition of a new page containing only the selected items; and
sending said new page to said at least one target location.
19. The method according to claim 16 whereas said first query is used for search in a first results repository and said at least one second query is used for search in a second results repository.
20. The method according to claim 16 including the steps of
after submitting said first query; searching in a storage device for at least one stored item, containing at list a part of said first query; and
submitting at least a part of said at least one stored item as at least a part of one of said second queries.