Google Website Optimizer has announced three flexible service plans that will be offered through their Authorized Consultants. The plans can help users with the design, implementation, training and analysis of Website Optimizer tests. Here are the details of the three plans:
Earlier this week, I spoke with Tom Leung, Senior Product Manager of Website Optimizer. He told me that some companies paid upwards of $10,000 a month for testing. He also said that if your competitors are doing testing and analytics and you're not, that it's like going into a gunfight with a knife.
The new service plans are certainly more affordable and accessible to small businesses and startups, and can help you be more competitive in your industry or niche.
Posted by Nathania Johnson at 9:27 AM | Permalink | Comments (2)
Last week, Google launched Google Merchant Search. This week, Edward Cowell, Director of digital marketing agency Guava; says, ““Google Merchant Search will truly put the cat amongst pigeons for some of Google’s biggest search advertisers, the financial services comparison websites.”
Price comparison websites are big business in the UK and all the major industry players advertise heavily on Google. Research by Resolution Foundation shows that 45% of UK adults used a comparison site to help them make a financial decision in the last year and that the price-comparison market is estimated to be growing at 30% a year.
That's why the launch of the new service comes at a critical time for Google and its financial services advertisers. Says Cowell, “Most big financial services websites are just coming to terms with a marked increase in their paid search advertising budgets due to the recent changes in Google’s trademark bidding policies, so Merchant Search could be rubbing salt into the wound.”
Sites such as Ebay have boycotted Google Adwords by withdrawing its adverting when Google attempted to encroach on PayPal’s territory. So, uprisings are not unheard of in the search arena.
That's why Cowell and the rest of the industry is waiting to see how the price comparisons companies react to the launch of Merchant Search.
Posted by Greg Jarboe at 9:11 AM | Permalink | Comments (2)
Seems Google has turned its eyes on the Wikipedia space and has a spin that may get a lot of attention from knowledgeable authors.
They just announced a beta project called "Knol" - a unit of knowledge - that gets experts in various field to write for an aggregated collection of knowledge. Why sign on for this project as opposed to Wikipedia etc.? Well Google has smartly added bylines - their articles will reference the author which could have all sorts of future implications.
Who wouldn't want to be listed as an expert on a topic by Google?
"Our goal is to encourage people who know a particular subject to write an authoritative article about it. The tool is still in development and this is just the first phase of testing. For now, using it is by invitation only. But we wanted to share with everyone the basic premises and goals behind this project," the Google Blog explained.
I have already had people ask me if I can get them on the beta - this one is going to be hot. GMail hot I am starting to think.... remember when people were selling gmail accounts on EBay!
"The key idea behind the knol project is to highlight authors. Books have authors' names right on the cover, news articles have bylines, scientific articles always have authors -- but somehow the web evolved without a strong standard to keep authors names highlighted. We believe that knowing who wrote what will significantly help users make better use of web content. At the heart, a knol is just a web page; we use the word "knol" as the name of the project and as an instance of an article interchangeably. It is well-organized, nicely presented, and has a distinct look and feel, but it is still just a web page. Google will provide easy-to-use tools for writing, editing, and so on, and it will provide free hosting of the content. Writers only need to write; we'll do the rest.A knol on a particular topic is meant to be the first thing someone who searches for this topic for the first time will want to read. The goal is for knols to cover all topics, from scientific concepts, to medical information, from geographical and historical, to entertainment, from product information, to how-to-fix-it instructions. Google will not serve as an editor in any way, and will not bless any content. All editorial responsibilities and control will rest with the authors. We hope that knols will include the opinions and points of view of the authors who will put their reputation on the line. Anyone will be free to write. For many topics, there will likely be competing knols on the same subject. Competition of ideas is a good thing' the blog explained.
Posted by Frank Watson at 10:43 AM | Permalink
Seems Google is getting ready to launch another of their G ideas. This time it is GPay - a mobile payment method that seems similar to that used in Europe and Asia already.
The patent outlines "a computer-implemented method of effectuating a payment, comprising: receiving at a computer server system a text message from a payor containing a payment request comprising a payment amount sent by a payor device operating independently of the computer server system; debiting a payor account for an amount corresponding to the amount of the payment request; and crediting a payee account that is independent of the computer server system," according to PC World.
Using a mobile device to make payments has been around Europe and Asia for some time. If the other people who started this have not filed a patent in the US it will be interesting to see if Google gets it by just being the first to register it here.
Philipp Lenssen does a great job with diagrams at BlogoScoped.
Posted by Frank Watson at 3:58 PM | Permalink
Okay this is just getting more and more science fiction like. Google has submitted a patent for psychological profiling of users. The patent outlines the ability to profile game users by their chat conversations and other text based interaction with the games.
"The company thinks it can glean information about an individual's preferences and personality type by tracking their online behaviour, which could then be sold to advertisers. Details such as whether a person is more likely to be aggressive, hostile or dishonest could be obtained and stored for future use," The Age reported earlier today.
The patent says user dialogue may be used to characterise the user as, for example, profane, blunt, polite, cautious, aggressive, non-confrontational, stealthy, honest, cooperative or uncooperative.
The information could be used to make ads that appear inside the game more "relevant to the user", the patent stated.
I guess we can gather whatever information we want all for the good of better marketing. We joke about this quite a bit, but Google is fast changing into Big Brother. When our screens record what we are doing I think it will be all over - hey it does that now.......
Posted by Frank Watson at 3:56 PM | Permalink
Google Website Optimizer, which was launched in beta in October, is now being made widely available. The A/B and multivariate page testing tool is being billed by Google as the "third leg" of a "three-legged stool" that is comprised of AdWords to drive traffic, Google Analytics to measure that traffic, and Website Optimizer to convert that traffic into customers.
Zachary Rodgers has the details in his ClickZ News story, Google Unwraps Multi-Variate Site Testing, Anoints Partners: "The main problem we're trying to solve is to get people out of the dark ages in terms of how they develop pages," Tom Leung, Google's product manager for Website Optimizer, told ClickZ News. "All too often, they'll just put a page together and maybe the designer will do a few mock-ups, and they'll point to the one they feel is going to be the best one."
Google has also teamed with five consulting firms that specialize in conversion marketing to provide options for users that would rather not do it themselves, or who want additional professional services layered in. Those partners are FutureNow, Optimost, EpikOne, ROI Revolution, and SiteTuners.com.
Posted by Kevin Newcomb at 8:01 AM | Permalink
The New York Times had a great piece on how Google is doing in their efforts in the offline media world, newspapers, radio etc.
Seems the bigger publishers like NYT are pleased with the efforts to date, while Google is pleased as they are nearly double what they expected, the article stated.
Google is building a TV ad sales team, have radio and print in beta, and their recent purchases for video game advertising, YouTube and other buys show they are building a company that really wants to be an all-emcompassing media company.
Posted by Frank Watson at 4:47 PM | Permalink
Okay I was asked to fill out a survey from Google today. We all get them from various places, sometimes we fill them out other times we hit delete...
Today I thought okay I have a few minutes and I am glad I did. It was for a study of advertising across traditional media and hints at the tools and methods Google may be unvieling.
They gathered info on radio, tv and print ad costs, design costs, regularity of use and other fun things.
They obviously have named some of these areas and are using the survey to further develop features. The Creative Market Place seems to concentrate on design costs and ask if you would use a bid system to outsource design work to creative agencies.
The Online Ad-Creative Tool "let's you create and customize your ads yourself".
All areas surveyed asked about usage if integrated with AdWords. The pricing feelers covered creating, running and managing traditional media.
Guess Google is gearing up to really take on traditional media.
Posted by Frank Watson at 1:16 PM | Permalink
Niall Kennedy has a good summary of Google's Universal Gadgets that can now be put on the Google Personalized Homepage, Google Desktop, Google Pages or your own web site using the Google Gadgets For Your Webpage collection of applets. The Google announcement on this is here and tells you how you can even have your own pony. Google also announced the release of Google AJAX Search API that enables you to place a Google search box on your site. Google's allowed this for ages, but with AJAX, people can search without having to leave your web site.
Posted by Barry Schwartz at 8:43 AM | Permalink
Gary Price reported that Google registered a few new interesting domain names including bench-index.com, benchindex.com, index-bench.com and indexbench.com. Since then Garett Rogers speculated that this may mean Google is building a product to allow users to build their own flavor of the Google search engine, much like how Rollyo does. Philipp Lenssen guesses that Google may be releasing some sort of Alexa engine. Some folks at Philipp's forum suggest that "IndexBench could be tools that measure the quality of an index." Loren Baker leans to siding with Garett Rogers's guess. Me? I have no better guess at this time.
Posted by Barry Schwartz at 9:23 AM | Permalink
The Google Code Blog announced that Google has "re-released" the Tesseract OCR software to the open source community. OCR, optical character recognition, is the technology for converting text on a physical paper into computer based text. So if you have a ton of papers you typed up in your college days and you want them stored in digital format, you can use OCR to translate those documents for you.
Tesseract was originally developed by HP between the years of 1985 and 1995. In 2005 HP and University of Nevada in Las Vegas opened it to the community. Google claims that Tesseract OCR is "far more accurate than any other Open Source OCR package out there." Some more detail at Computing.co.uk.
Posted by Barry Schwartz at 8:26 AM | Permalink
Google has announced that its Google Talk instant messaging platform now allows you to share files with other Google Talk users by dropping files or entire folders into the client. Photo files get special treatment, showing up in your client so you can talk about them with someone else, as covered more here. Listening to music? Another new feature, music status, allows other Google Talkers to see what hip (or embarrassing) song you're listening to, if you use one of these supported players. Along with music status comes a new Google Music Trends feature we mentioned earlier, which allows you to see what music is most popular across the entire Google Talk network of users. Finally, want to talk by voice using Google Talk but your contact isn't around? Now you can leave them up to 10 minutes of voicemail, through that new feature. Note that some Google Talk users already got these new features a few weeks ago. Now they are rolling out to everyone.
Posted by Danny Sullivan at 6:05 AM | Permalink
Via Google Blogoscoped, What's in Google's Sandbox? from Tony Ruscoe has him stumbling upon new services that Google may plan to release such as Google Events, Google Real Estate Search and "Google Guess."
Want to try it out yourself? Go to https://sandbox.google.com/. It looks like Google Checkout, but ignore that. Don't try to sign in with an existing Google Account that you have. You need to create a new one just for this sandbox service, Ruscoe says. And that seems to involve registering your credit card, so I gave it a pass.
Postscript: Tony contacted me to say there's a way to register via the sandbox area and not have to enter credit card details. He emailed:
This isn't the case. All you need to do is remove everything after the "service=sierra" parameter from the URL you're directed to (which is for the Google Checkout service) and you'll be able to register an account without entering your credit card details... or just follow this link:
https://sandbox.google.com/accounts/NewAccount
You'll then be able to append "?service=codename" to that URL to add each of the services I included in my post.
I went to https://sandbox.google.com/accounts/NewAccount, opened a new account, then went back and did this:
https://sandbox.google.com/accounts/NewAccount?service=re
See the part in bold? The re part? That's the codename for one of these Google services, which Tony has listed in his post. Doing that let me sign up for Google Real Estate Search. After the screen to enroll came up, I got an error message and kicked back out into a personalized Google home page.
That's OK. Now go to https://sandbox.google.com/accounts/ManageAccount and sign back in. The next screen will show your account, and you'll see that Google Real Estate Search is now one of your subscribed services. Clicking on the link doesn't do anything, but at least you can make cool screenshots like everyone else :)
To add more services, keep going back as above and use different code names.
Posted by Danny Sullivan at 7:24 AM | Permalink
Barry noted on Search Engine Roundtable that some people in the UK are having problems connecting to Google. I'm one of those unlucky ones, and so far, it remains a mystery to Google about what's going on.
I've talked with a Google engineer this evening for about a half hour, trying various things to figure out what's wrong. Google still isn't certain. For me, it means that I cannot connect to:
I can reach things like Blogger or Google UK, rather than Google.com. I can also receive Gmail via POP, but I can't send.
Oddly, if I shift over to Internet Explorer rather than Firefox, I can reach Google Analytics except that the log-in window, which comes off a secure server, fails to load.
I access Google through BT Broadband, and a few others using BT seem to be having similar problems. So far, it's not a ton of people -- but enough that if you're having problems, it's not just your imagination. Short answer is, Google's aware of the issue and looking into it. For myself, I'm calling it a night and figuring that when I wake up, things will likely have cleared up.
Problems Reaching Google From The UK is a thread I've started over in our Search Engine Watch Forums on the issue. If anyone else is having problems, feel free to contribute what your situation is, in hopes that might help get things resolved.
Posted by Danny Sullivan at 4:18 PM | Permalink
Problems Connecting To Google From The UKBarry noted on Search Engine Roundtable that some people in the UK are having problems connecting to Google. I'm one of those unlucky ones, and so far, it remains a mystery to Google about what's going on.
I've talked with a Google engineer this evening for about a half hour, trying various things to figure out what's wrong. Google still isn't certain. For me, it means that I cannot connect to:
I can reach things like Blogger or Google UK, rather than Google.com. I can also receive Gmail via POP, but I can't send.
Oddly, if I shift over to Internet Explorer rather than Firefox, I can reach Google Analytics except that the log-in window, which comes off a secure server, fails to load.
I access Google through BT Broadband, and a few others using BT seem to be having similar problems. So far, it's not a ton of people -- but enough that if you're having problems, it's not just your imagination. Short answer is, Google's aware of the issue and looking into it. For myself, I'm calling it a night and figuring that when I wake up, things will likely have cleared up.
Problems Reaching Google From The UK is a thread I've started over in our Search Engine Watch Forums on the issue. If anyone else is having problems, feel free to contribute what your situation is, in hopes that might help get things resolved.
Posted by Kevin Heisler at 4:18 PM | Permalink
Problems Connecting To Google From The UKBarry noted on Search Engine Roundtable that some people in the UK are having problems connecting to Google. I'm one of those unlucky ones, and so far, it remains a mystery to Google about what's going on.
I've talked with a Google engineer this evening for about a half hour, trying various things to figure out what's wrong. Google still isn't certain. For me, it means that I cannot connect to:
I can reach things like Blogger or Google UK, rather than Google.com. I can also receive Gmail via POP, but I can't send.
Oddly, if I shift over to Internet Explorer rather than Firefox, I can reach Google Analytics except that the log-in window, which comes off a secure server, fails to load.
I access Google through BT Broadband, and a few others using BT seem to be having similar problems. So far, it's not a ton of people -- but enough that if you're having problems, it's not just your imagination. Short answer is, Google's aware of the issue and looking into it. For myself, I'm calling it a night and figuring that when I wake up, things will likely have cleared up.
Problems Reaching Google From The UK is a thread I've started over in our Search Engine Watch Forums on the issue. If anyone else is having problems, feel free to contribute what your situation is, in hopes that might help get things resolved.
Posted by Kevin Heisler at 4:18 PM | Permalink
Problems Connecting To Google From The UKBarry noted on Search Engine Roundtable that some people in the UK are having problems connecting to Google. I'm one of those unlucky ones, and so far, it remains a mystery to Google about what's going on.
I've talked with a Google engineer this evening for about a half hour, trying various things to figure out what's wrong. Google still isn't certain. For me, it means that I cannot connect to:
I can reach things like Blogger or Google UK, rather than Google.com. I can also receive Gmail via POP, but I can't send.
Oddly, if I shift over to Internet Explorer rather than Firefox, I can reach Google Analytics except that the log-in window, which comes off a secure server, fails to load.
I access Google through BT Broadband, and a few others using BT seem to be having similar problems. So far, it's not a ton of people -- but enough that if you're having problems, it's not just your imagination. Short answer is, Google's aware of the issue and looking into it. For myself, I'm calling it a night and figuring that when I wake up, things will likely have cleared up.
Problems Reaching Google From The UK is a thread I've started over in our Search Engine Watch Forums on the issue. If anyone else is having problems, feel free to contribute what your situation is, in hopes that might help get things resolved.
Posted by Kevin Heisler at 4:18 PM | Permalink
Four patent applications from Google describe fighting spam in emails, providing product review searches, moving large amounts of data, and autolinking. Yahoo matches, and raises with five patent filings. One on watching deletions to choose better ads, another on serving dynamic information through a additional browser interface, and three more on multimedia and RSS.
Microsoft goes TV 2.0 with an electronic program guide, and describes a way of matching advertising content with certain search queries before those searches are made. IBM comes up with a unique way of presenting the results of a search from more than one search engine, and a way of reducing the amount of irrelevant results in a search by analyzing an initial set of results, identifying an appropriate additional query term from those results, and searching the original results again but with the additional query term included in the search.
Go Daddy describes a way of fighting spam in emails. Xerox employs collaborative filtering from previous users' searches to predict search results. Apostolos Gerasoulis, from Ask.com, with a couple of co-inventors, ranks and displays pages (objects) based upon linkage and textual data, and then defines a way to identifiy and assign topics to them.
Email Spam
Emails with links in them could be considered spam if the links point to pages that are in a conceptual category considered spammy. This patent application really doesn't describe the concept categorization part of the process. That's done in a related patent application mentioned within this document, and the related document lists Georges Harik as one inventor. Dr. Harik's name is on a very large percentage of the patent applications involving Gmail-type processes.
Method and system to detect e-mail spam using concept categorization of linked content Invented by Johnny Chen US Patent Application 20060122957 Published June 8, 2006 Filed December 3, 2004
Abstract
A system and method for detecting undesired electronic messages (e.g., spam) using concept categorization of hyperlinks is disclosed. A server receives an electronic message and retrieves web pages that correspond to hyperlinks in the message. The server performs concept categorization on the retrieved web pages based on semantic relationships in the received information to determine whether the electronic message meets predefined criteria associated with undesired messages.Searching and Aggregating Product Reviews
If Google wanted to get into the product or services review business, the next patent filing describes a blue print for the process that might make an effective and innovative system.
Method and system for finding and aggregating reviews for a product Invented by Jan Matthias Ruhl and Mayur D. Datar US Patent Application 20060129446 Published June 15, 2006 Filed December 14, 2004
Abstract
The embodiments disclosed herein include new, more efficient ways to collect product reviews from the Internet, aggregate reviews for the same product, and provide an aggregated review to end users in a searchable format. One aspect of the invention is a graphical user interface on a computer that includes a plurality of portions of reviews for a product and a search input area for entering search terms to search for reviews of the product that contain the search terms.Scaling and Distributing Data
Arvind Jain is the head of Research and Development in Google's Bangalore office, and has spoken at a number of conferences on infrastructure projects and issues involving such things as Google's crawl and indexing system, distributed file replication system, and compression techniques for large scale storage systems. He's listed as the inventor for this next Google filing.
System and method for scalable data distribution Invented by Arvind Jain US Patent Application 20060126201 Published June 15, 2006 Filed December 10, 2004
Abstract
A system having a resource manager, a plurality of masters, and a plurality of slaves, interconnected by a communications network. To distribute data, a master determined that a destination slave of the plurality slaves requires data. The master then generates a list of slaves from which to transfer the data to the destination slave. The master transmits the list to the resource manager. The resource manager is configured to select a source slave from the list based on available system resources. Once a source is selected by the resource manager, the master receives an instruction from the resource manager to initiate a transfer of the data from the source slave to the destination slave. The master then transmits an instruction to commence the transfer.Autolinking
Google's Autolink raised a lot of eyebrows, and brought some negative reactions. A Search Engine Watch Blog post from Danny Sullivan, Google Toolbar's AutoLink & The Need For Opt-Out defined many of the issues around the toolbar feature. The following patent application explains how such a system might work from the search engine's perspective.
Providing useful information associated with an item in a document Invented by Gueorgui Djabarov US Patent Application 20060129910 Published June 15, 2006 Filed December 14, 2004
Abstract
A method includes recognizing an item within a first document based on a pattern associated with the item but not the exact content of the item. The method further includes identifying a link for the item and providing a second document that includes information associated with the item when the link for the item is selected.Yahoo
Choosing Better Ads through User Behavior
Some queries involve the use of concepts and units, as described in at least five Yahoo patent filings (see previous patent posts in the Yahoo sections from Yahoo Units and Microsoft Redundancy Filters and More Yahoo Concepts and Google Predictive Searches.)
But sometimes a two term query isn't a concept as much as it is a couple of keywords that someone may use to search for something. If that person performs a second search after deleting one of the words, then the record of that deletion and second search might help Yahoo calculate "deletion probability scores" for words being used in these kind of two term queries.
This can be helpful when there isn't a good keyword based advertising match for that query, but there might be a good match individually for each of the terms that make up the query. The "deletion probability scores" can help determine which of the two terms to show keyword-based advertising for in search results.
System and methods for ranking the relative value of terms in a multi-term search query using deletion prediction Invented by Rosemary Jones and Daniel C. Fain US Patent Application 20060129534 Published June 15, 2006 Filed December 14, 2004
Abstract
The likely relevance of each term of a search-engine query of two or more terms is determined by their deletion probability scores. If the deletion probability scores are significantly different, the deletion probability score can be used to return targeted ads related to the more relevant term or terms along with the search results. Deletion probability scores are determined by first gathering historical records of search queries of two or more terms in which a subsequent query was submitted by the same user after one or more of the terms had been deleted. The deletion probability score for a particular term of a search query is calculated as the ratio of the number of times that particular term was itself deleted prior to a subsequent search by the same user divided by the number of times there were subsequent search queries by the same user in which any term or terms including that given term was deleted by the same user prior to the subsequent search. Terms are not limited to individual alphabetic words.Browser Interface Helpers
This next document describes some ways to provide additional dynamic information to someone via a toolbar styled interface, while they are browsing pages on the web.
Method of controlling an Internet browser interface and a controllable browser interface Invented by Thomas J. Shafron Assigned to Yahoo US Patent Application 20060129937 Published June 15, 2006 Filed February 2, 2006
Abstract
The present invention is directed to a method of dynamically controlling and displaying an Internet browser interface, and to a dynamically controllable Internet browser interface. In accordance with the present invention, a browser interface may be customized using a controlling software program that may be provided by an Internet content provider, an ISP, or that may reside on an Internet user's computer. The controlling software program enables the Internet user, the content provider, or the ISP to customize and control the information and/or functionality of a user's browser and browser interface.RSS Enhancements
The following three Yahoo filings all list the same inventors, including John Thrall who is the head of media search engineering, for Yahoo Search. They provide different aspects of using RSS with multimedia files.
Syndicating multiple media objects with RSS Invented by Andrew R. Volk, David D. Hall, and John J. Thrall US Patent Application 20060129917 Published June 15, 2006 Filed December 1, 2005
Abstract
System and method for syndicating more than one media object in an element using Real Simple Syndication (RSS). In one embodiment, multiple media objects with at least one shared characteristic are syndicated under the same element. For example, a single media object can come in multiple formats and/or compression rates.Syndicating multimedia information with RSS Invented by Andrew R. Volk, David D. Hall, John J. Thrall US Patent Application 20060129907 Published June 15, 2006 Filed December 1, 2005
Abstract
System and method for adding descriptive information to a Real Simple Syndication (RSS) document. The descriptive information describes the content of media objects syndicated through the document. The descriptive information can be used to provided additional information to a subscriber, and can be used in searching for syndicated media content.RSS rendering via a media player Invented by Andrew R. Volk, David D. Hall, John J. Thrall US Patent Application 20060129916 Published June 15, 2006 Filed December 1, 2005
Abstract
System and method for syndicating media objects through a link to a media player using Real Simple Syndication (RSS). A content provider may not want to give direct access to a media object to a subscriber. Instead a content provider can give the subscriber a link to a media player that can access the media object.Microsoft
Searching electronic program guide data Invented by Pradhan S. Rao, David Hendler Sloo, Daniel Danker, and George K. Nyako Assigned to Microsoft US Patent Application 20060130098 Published June 15, 2006 Filed December 15, 2004
Abstract
Searching electronic program guide (EPG) data is described. The EPG data may be compartmentalized into channel metadata that describes characteristics of one or more channels and content metadata that describes characteristics of one or more content items. In a implementation, a method includes searching channel metadata and content metadata. A result of the searching is formed for output in conjunction with an electronic program guide (EPG).System and method for indexing and prefiltering Invented by Brian Burdick, Joshua J. Forman, Kevin P. Kornelson, Murali Vajjiravel, and Rajeev Prasad Assigned to Microsoft US Patent Application 20060129555 Published June 15, 2006 Filed December 9, 2004
Abstract
A method and system are provided for selecting advertisements for presentation to a user in response to a user search query. The system may include a keyword server for parsing the user search query and an index server for receiving the parsed search query. The index server may include an index of advertising phrases and pre-filtering components for comparing index entries to the parsed user search query in order to discard non-matching index entries and locate matching entries. The pre-filtering components may include either a phrase length pre-filtering component or a word hash pre-filtering component. The system may additionally include a listing server for sorting through the matching entries located by the index server and further filtering the matching entries for retrieval and presentation to the user.IBM
Ring method, apparatus, and computer program product for managing federated search results in a heterogeneous environment Invented by Wade Shelby Beavers and David Joseph Borrillo Assigned to IBM US Patent Application 20060129530 Published June 15, 2006 Filed December 9, 2004
Abstract
A method, apparatus and computer program product are provided for managing federated search results in a heterogeneous environment. A user enters a search term and the search term is submitted to multiple selected search engines. Search results are gathered from each selected search engine. A search ring is generated including a ring section to represent each of the selected search engines for enabling the user to view search results from one or more of the selected search engines.Method and system for suggesting search engine keywords Invented by Cary Lee Bates Assigned to IBM US Patent Application 20060129531 Published June 15, 2006 Filed December 9, 2004
Abstract
A search engine receives a search query having one or more keywords. The documents in the result set from that search query are analyzed to identify one or more additional keywords that further segment, or separate, the initial result set. These additional keywords are presented to the user who then selects whether to include or exclude documents matching the additional keywords. In this way, the number of documents in the initial result set is reduced in a relatively quick and effortless manner.Go Daddy
Email filtering system and method Invented by Brad Owen and Jason Steiner US Patent Application 20060129644 Published June 15, 2006 Filed December 14, 2004
Abstract
Systems and methods of the present invention allow filtering out spam and phishing email messages based on the links embedded into the email messages. In a preferred embodiment, an Email Filter extracts links from the email message and obtains desirability values for the links. The Email Filter may route the email message based on desirability values. Such routing includes delivering the email message to a Recipient, delivering the message to a Quarantine Mailbox, or deleting the message.Xerox
Personalized web search method Invented by Lisa S. Purvis Assigned to Xerox Corporation US Patent Application 20060129533 Published June 15, 2006 Filed December 15, 2004
Abstract
A method for contextualizing search results is disclosed. The method includes performing a traditional web query that returns a set of result pages, using collaborative filtering techniques to generate a set of predicted pages, comparing the set of predicted pages with the set of result pages, and ranking the set of result pages so that result pages that are also included in the set of predicted pages are ranked higher than those that are not. Methods herein also contemplate using the search history of the user or others to refine the results of searches.Ask.com
Relevancy-based database retrieval and display techniques Invented by Tao Yang, Wei Wang, and Apostolos Gerasoulis US Patent Application 20060129552 Published June 15, 2006 Filed February 2, 2006
Abstract
Techniques to retrieve, rank and display data objects retrieved form a database are described. In particular, methods to assign a global ranking value to a data object based on a combination of that object's link-based (e.g., vector-space cluster analysis) and text-based (e.g., word frequency) ranks are described. Additional techniques to determine a set of concepts, topics or key words associated with each retrieved data objects are described.My usual reminder about patents: Some of the processes and technology described in patents are created in house, and some are developed with the assistance of contractors and partners. A percentage are never developed in a tangible manner, but may serve as a way to attempt to exclude others from using the technology, or even to possibly mislead competitors into exploring an area that they might not have an interest in (sometimes skepticism is good.)
There are times when a Google or Yahoo acquires a company to gain access to the intellectual property of that company, or the intellectual prowess and expertise of that company's employees. And sometimes patents are just purchased.
Want to comment or discuss? Visit our Search Technology & Relevancy area of the Search Engine Watch Forums.
Posted by Bill Slawski at 8:42 PM | Permalink
New Search Patent Applications: June 19, 2006 - Autolinking, and Better Advertising through Deletion PredictionsFour patent applications from Google describe fighting spam in emails, providing product review searches, moving large amounts of data, and autolinking. Yahoo matches, and raises with five patent filings. One on watching deletions to choose better ads, another on serving dynamic information through a additional browser interface, and three more on multimedia and RSS.
Microsoft goes TV 2.0 with an electronic program guide, and describes a way of matching advertising content with certain search queries before those searches are made. IBM comes up with a unique way of presenting the results of a search from more than one search engine, and a way of reducing the amount of irrelevant results in a search by analyzing an initial set of results, identifying an appropriate additional query term from those results, and searching the original results again but with the additional query term included in the search.
Go Daddy describes a way of fighting spam in emails. Xerox employs collaborative filtering from previous users' searches to predict search results. Apostolos Gerasoulis, from Ask.com, with a couple of co-inventors, ranks and displays pages (objects) based upon linkage and textual data, and then defines a way to identifiy and assign topics to them.
Email Spam
Emails with links in them could be considered spam if the links point to pages that are in a conceptual category considered spammy. This patent application really doesn't describe the concept categorization part of the process. That's done in a related patent application mentioned within this document, and the related document lists Georges Harik as one inventor. Dr. Harik's name is on a very large percentage of the patent applications involving Gmail-type processes.
Method and system to detect e-mail spam using concept categorization of linked content Invented by Johnny Chen US Patent Application 20060122957 Published June 8, 2006 Filed December 3, 2004
Abstract
A system and method for detecting undesired electronic messages (e.g., spam) using concept categorization of hyperlinks is disclosed. A server receives an electronic message and retrieves web pages that correspond to hyperlinks in the message. The server performs concept categorization on the retrieved web pages based on semantic relationships in the received information to determine whether the electronic message meets predefined criteria associated with undesired messages.Searching and Aggregating Product Reviews
If Google wanted to get into the product or services review business, the next patent filing describes a blue print for the process that might make an effective and innovative system.
Method and system for finding and aggregating reviews for a product Invented by Jan Matthias Ruhl and Mayur D. Datar US Patent Application 20060129446 Published June 15, 2006 Filed December 14, 2004
Abstract
The embodiments disclosed herein include new, more efficient ways to collect product reviews from the Internet, aggregate reviews for the same product, and provide an aggregated review to end users in a searchable format. One aspect of the invention is a graphical user interface on a computer that includes a plurality of portions of reviews for a product and a search input area for entering search terms to search for reviews of the product that contain the search terms.Scaling and Distributing Data
Arvind Jain is the head of Research and Development in Google's Bangalore office, and has spoken at a number of conferences on infrastructure projects and issues involving such things as Google's crawl and indexing system, distributed file replication system, and compression techniques for large scale storage systems. He's listed as the inventor for this next Google filing.
System and method for scalable data distribution Invented by Arvind Jain US Patent Application 20060126201 Published June 15, 2006 Filed December 10, 2004
Abstract
A system having a resource manager, a plurality of masters, and a plurality of slaves, interconnected by a communications network. To distribute data, a master determined that a destination slave of the plurality slaves requires data. The master then generates a list of slaves from which to transfer the data to the destination slave. The master transmits the list to the resource manager. The resource manager is configured to select a source slave from the list based on available system resources. Once a source is selected by the resource manager, the master receives an instruction from the resource manager to initiate a transfer of the data from the source slave to the destination slave. The master then transmits an instruction to commence the transfer.Autolinking
Google's Autolink raised a lot of eyebrows, and brought some negative reactions. A Search Engine Watch Blog post from Danny Sullivan, Google Toolbar's AutoLink & The Need For Opt-Out defined many of the issues around the toolbar feature. The following patent application explains how such a system might work from the search engine's perspective.
Providing useful information associated with an item in a document Invented by Gueorgui Djabarov US Patent Application 20060129910 Published June 15, 2006 Filed December 14, 2004
Abstract
A method includes recognizing an item within a first document based on a pattern associated with the item but not the exact content of the item. The method further includes identifying a link for the item and providing a second document that includes information associated with the item when the link for the item is selected.Yahoo
Choosing Better Ads through User Behavior
Some queries involve the use of concepts and units, as described in at least five Yahoo patent filings (see previous patent posts in the Yahoo sections from Yahoo Units and Microsoft Redundancy Filters and More Yahoo Concepts and Google Predictive Searches.)
But sometimes a two term query isn't a concept as much as it is a couple of keywords that someone may use to search for something. If that person performs a second search after deleting one of the words, then the record of that deletion and second search might help Yahoo calculate "deletion probability scores" for words being used in these kind of two term queries.
This can be helpful when there isn't a good keyword based advertising match for that query, but there might be a good match individually for each of the terms that make up the query. The "deletion probability scores" can help determine which of the two terms to show keyword-based advertising for in search results.
System and methods for ranking the relative value of terms in a multi-term search query using deletion prediction Invented by Rosemary Jones and Daniel C. Fain US Patent Application 20060129534 Published June 15, 2006 Filed December 14, 2004
Abstract
The likely relevance of each term of a search-engine query of two or more terms is determined by their deletion probability scores. If the deletion probability scores are significantly different, the deletion probability score can be used to return targeted ads related to the more relevant term or terms along with the search results. Deletion probability scores are determined by first gathering historical records of search queries of two or more terms in which a subsequent query was submitted by the same user after one or more of the terms had been deleted. The deletion probability score for a particular term of a search query is calculated as the ratio of the number of times that particular term was itself deleted prior to a subsequent search by the same user divided by the number of times there were subsequent search queries by the same user in which any term or terms including that given term was deleted by the same user prior to the subsequent search. Terms are not limited to individual alphabetic words.Browser Interface Helpers
This next document describes some ways to provide additional dynamic information to someone via a toolbar styled interface, while they are browsing pages on the web.
Method of controlling an Internet browser interface and a controllable browser interface Invented by Thomas J. Shafron Assigned to Yahoo US Patent Application 20060129937 Published June 15, 2006 Filed February 2, 2006
Abstract
The present invention is directed to a method of dynamically controlling and displaying an Internet browser interface, and to a dynamically controllable Internet browser interface. In accordance with the present invention, a browser interface may be customized using a controlling software program that may be provided by an Internet content provider, an ISP, or that may reside on an Internet user's computer. The controlling software program enables the Internet user, the content provider, or the ISP to customize and control the information and/or functionality of a user's browser and browser interface.RSS Enhancements
The following three Yahoo filings all list the same inventors, including John Thrall who is the head of media search engineering, for Yahoo Search. They provide different aspects of using RSS with multimedia files.
Syndicating multiple media objects with RSS Invented by Andrew R. Volk, David D. Hall, and John J. Thrall US Patent Application 20060129917 Published June 15, 2006 Filed December 1, 2005
Abstract
System and method for syndicating more than one media object in an element using Real Simple Syndication (RSS). In one embodiment, multiple media objects with at least one shared characteristic are syndicated under the same element. For example, a single media object can come in multiple formats and/or compression rates.Syndicating multimedia information with RSS Invented by Andrew R. Volk, David D. Hall, John J. Thrall US Patent Application 20060129907 Published June 15, 2006 Filed December 1, 2005
Abstract
System and method for adding descriptive information to a Real Simple Syndication (RSS) document. The descriptive information describes the content of media objects syndicated through the document. The descriptive information can be used to provided additional information to a subscriber, and can be used in searching for syndicated media content.RSS rendering via a media player Invented by Andrew R. Volk, David D. Hall, John J. Thrall US Patent Application 20060129916 Published June 15, 2006 Filed December 1, 2005
Abstract
System and method for syndicating media objects through a link to a media player using Real Simple Syndication (RSS). A content provider may not want to give direct access to a media object to a subscriber. Instead a content provider can give the subscriber a link to a media player that can access the media object.Microsoft
Searching electronic program guide data Invented by Pradhan S. Rao, David Hendler Sloo, Daniel Danker, and George K. Nyako Assigned to Microsoft US Patent Application 20060130098 Published June 15, 2006 Filed December 15, 2004
Abstract
Searching electronic program guide (EPG) data is described. The EPG data may be compartmentalized into channel metadata that describes characteristics of one or more channels and content metadata that describes characteristics of one or more content items. In a implementation, a method includes searching channel metadata and content metadata. A result of the searching is formed for output in conjunction with an electronic program guide (EPG).System and method for indexing and prefiltering Invented by Brian Burdick, Joshua J. Forman, Kevin P. Kornelson, Murali Vajjiravel, and Rajeev Prasad Assigned to Microsoft US Patent Application 20060129555 Published June 15, 2006 Filed December 9, 2004
Abstract
A method and system are provided for selecting advertisements for presentation to a user in response to a user search query. The system may include a keyword server for parsing the user search query and an index server for receiving the parsed search query. The index server may include an index of advertising phrases and pre-filtering components for comparing index entries to the parsed user search query in order to discard non-matching index entries and locate matching entries. The pre-filtering components may include either a phrase length pre-filtering component or a word hash pre-filtering component. The system may additionally include a listing server for sorting through the matching entries located by the index server and further filtering the matching entries for retrieval and presentation to the user.IBM
Ring method, apparatus, and computer program product for managing federated search results in a heterogeneous environment Invented by Wade Shelby Beavers and David Joseph Borrillo Assigned to IBM US Patent Application 20060129530 Published June 15, 2006 Filed December 9, 2004
Abstract
A method, apparatus and computer program product are provided for managing federated search results in a heterogeneous environment. A user enters a search term and the search term is submitted to multiple selected search engines. Search results are gathered from each selected search engine. A search ring is generated including a ring section to represent each of the selected search engines for enabling the user to view search results from one or more of the selected search engines.Method and system for suggesting search engine keywords Invented by Cary Lee Bates Assigned to IBM US Patent Application 20060129531 Published June 15, 2006 Filed December 9, 2004
Abstract
A search engine receives a search query having one or more keywords. The documents in the result set from that search query are analyzed to identify one or more additional keywords that further segment, or separate, the initial result set. These additional keywords are presented to the user who then selects whether to include or exclude documents matching the additional keywords. In this way, the number of documents in the initial result set is reduced in a relatively quick and effortless manner.Go Daddy
Email filtering system and method Invented by Brad Owen and Jason Steiner US Patent Application 20060129644 Published June 15, 2006 Filed December 14, 2004
Abstract
Systems and methods of the present invention allow filtering out spam and phishing email messages based on the links embedded into the email messages. In a preferred embodiment, an Email Filter extracts links from the email message and obtains desirability values for the links. The Email Filter may route the email message based on desirability values. Such routing includes delivering the email message to a Recipient, delivering the message to a Quarantine Mailbox, or deleting the message.Xerox
Personalized web search method Invented by Lisa S. Purvis Assigned to Xerox Corporation US Patent Application 20060129533 Published June 15, 2006 Filed December 15, 2004
Abstract
A method for contextualizing search results is disclosed. The method includes performing a traditional web query that returns a set of result pages, using collaborative filtering techniques to generate a set of predicted pages, comparing the set of predicted pages with the set of result pages, and ranking the set of result pages so that result pages that are also included in the set of predicted pages are ranked higher than those that are not. Methods herein also contemplate using the search history of the user or others to refine the results of searches.Ask.com
Relevancy-based database retrieval and display techniques Invented by Tao Yang, Wei Wang, and Apostolos Gerasoulis US Patent Application 20060129552 Published June 15, 2006 Filed February 2, 2006
Abstract
Techniques to retrieve, rank and display data objects retrieved form a database are described. In particular, methods to assign a global ranking value to a data object based on a combination of that object's link-based (e.g., vector-space cluster analysis) and text-based (e.g., word frequency) ranks are described. Additional techniques to determine a set of concepts, topics or key words associated with each retrieved data objects are described.My usual reminder about patents: Some of the processes and technology described in patents are created in house, and some are developed with the assistance of contractors and partners. A percentage are never developed in a tangible manner, but may serve as a way to attempt to exclude others from using the technology, or even to possibly mislead competitors into exploring an area that they might not have an interest in (sometimes skepticism is good.)
There are times when a Google or Yahoo acquires a company to gain access to the intellectual property of that company, or the intellectual prowess and expertise of that company's employees. And sometimes patents are just purchased.
Want to comment or discuss? Visit our Search Technology & Relevancy area of the Search Engine Watch Forums.
Posted by Kevin Heisler at 8:42 PM | Permalink
New Search Patent Applications: June 19, 2006 - Autolinking, and Better Advertising through Deletion PredictionsFour patent applications from Google describe fighting spam in emails, providing product review searches, moving large amounts of data, and autolinking. Yahoo matches, and raises with five patent filings. One on watching deletions to choose better ads, another on serving dynamic information through a additional browser interface, and three more on multimedia and RSS.
Microsoft goes TV 2.0 with an electronic program guide, and describes a way of matching advertising content with certain search queries before those searches are made. IBM comes up with a unique way of presenting the results of a search from more than one search engine, and a way of reducing the amount of irrelevant results in a search by analyzing an initial set of results, identifying an appropriate additional query term from those results, and searching the original results again but with the additional query term included in the search.
Go Daddy describes a way of fighting spam in emails. Xerox employs collaborative filtering from previous users' searches to predict search results. Apostolos Gerasoulis, from Ask.com, with a couple of co-inventors, ranks and displays pages (objects) based upon linkage and textual data, and then defines a way to identifiy and assign topics to them.
Email Spam
Emails with links in them could be considered spam if the links point to pages that are in a conceptual category considered spammy. This patent application really doesn't describe the concept categorization part of the process. That's done in a related patent application mentioned within this document, and the related document lists Georges Harik as one inventor. Dr. Harik's name is on a very large percentage of the patent applications involving Gmail-type processes.
Method and system to detect e-mail spam using concept categorization of linked content Invented by Johnny Chen US Patent Application 20060122957 Published June 8, 2006 Filed December 3, 2004
Abstract
A system and method for detecting undesired electronic messages (e.g., spam) using concept categorization of hyperlinks is disclosed. A server receives an electronic message and retrieves web pages that correspond to hyperlinks in the message. The server performs concept categorization on the retrieved web pages based on semantic relationships in the received information to determine whether the electronic message meets predefined criteria associated with undesired messages.Searching and Aggregating Product Reviews
If Google wanted to get into the product or services review business, the next patent filing describes a blue print for the process that might make an effective and innovative system.
Method and system for finding and aggregating reviews for a product Invented by Jan Matthias Ruhl and Mayur D. Datar US Patent Application 20060129446 Published June 15, 2006 Filed December 14, 2004
Abstract
The embodiments disclosed herein include new, more efficient ways to collect product reviews from the Internet, aggregate reviews for the same product, and provide an aggregated review to end users in a searchable format. One aspect of the invention is a graphical user interface on a computer that includes a plurality of portions of reviews for a product and a search input area for entering search terms to search for reviews of the product that contain the search terms.Scaling and Distributing Data
Arvind Jain is the head of Research and Development in Google's Bangalore office, and has spoken at a number of conferences on infrastructure projects and issues involving such things as Google's crawl and indexing system, distributed file replication system, and compression techniques for large scale storage systems. He's listed as the inventor for this next Google filing.
System and method for scalable data distribution Invented by Arvind Jain US Patent Application 20060126201 Published June 15, 2006 Filed December 10, 2004
Abstract
A system having a resource manager, a plurality of masters, and a plurality of slaves, interconnected by a communications network. To distribute data, a master determined that a destination slave of the plurality slaves requires data. The master then generates a list of slaves from which to transfer the data to the destination slave. The master transmits the list to the resource manager. The resource manager is configured to select a source slave from the list based on available system resources. Once a source is selected by the resource manager, the master receives an instruction from the resource manager to initiate a transfer of the data from the source slave to the destination slave. The master then transmits an instruction to commence the transfer.Autolinking
Google's Autolink raised a lot of eyebrows, and brought some negative reactions. A Search Engine Watch Blog post from Danny Sullivan, Google Toolbar's AutoLink & The Need For Opt-Out defined many of the issues around the toolbar feature. The following patent application explains how such a system might work from the search engine's perspective.
Providing useful information associated with an item in a document Invented by Gueorgui Djabarov US Patent Application 20060129910 Published June 15, 2006 Filed December 14, 2004
Abstract
A method includes recognizing an item within a first document based on a pattern associated with the item but not the exact content of the item. The method further includes identifying a link for the item and providing a second document that includes information associated with the item when the link for the item is selected.Yahoo
Choosing Better Ads through User Behavior
Some queries involve the use of concepts and units, as described in at least five Yahoo patent filings (see previous patent posts in the Yahoo sections from Yahoo Units and Microsoft Redundancy Filters and More Yahoo Concepts and Google Predictive Searches.)
But sometimes a two term query isn't a concept as much as it is a couple of keywords that someone may use to search for something. If that person performs a second search after deleting one of the words, then the record of that deletion and second search might help Yahoo calculate "deletion probability scores" for words being used in these kind of two term queries.
This can be helpful when there isn't a good keyword based advertising match for that query, but there might be a good match individually for each of the terms that make up the query. The "deletion probability scores" can help determine which of the two terms to show keyword-based advertising for in search results.
System and methods for ranking the relative value of terms in a multi-term search query using deletion prediction Invented by Rosemary Jones and Daniel C. Fain US Patent Application 20060129534 Published June 15, 2006 Filed December 14, 2004
Abstract
The likely relevance of each term of a search-engine query of two or more terms is determined by their deletion probability scores. If the deletion probability scores are significantly different, the deletion probability score can be used to return targeted ads related to the more relevant term or terms along with the search results. Deletion probability scores are determined by first gathering historical records of search queries of two or more terms in which a subsequent query was submitted by the same user after one or more of the terms had been deleted. The deletion probability score for a particular term of a search query is calculated as the ratio of the number of times that particular term was itself deleted prior to a subsequent search by the same user divided by the number of times there were subsequent search queries by the same user in which any term or terms including that given term was deleted by the same user prior to the subsequent search. Terms are not limited to individual alphabetic words.Browser Interface Helpers
This next document describes some ways to provide additional dynamic information to someone via a toolbar styled interface, while they are browsing pages on the web.
Method of controlling an Internet browser interface and a controllable browser interface Invented by Thomas J. Shafron Assigned to Yahoo US Patent Application 20060129937 Published June 15, 2006 Filed February 2, 2006
Abstract
The present invention is directed to a method of dynamically controlling and displaying an Internet browser interface, and to a dynamically controllable Internet browser interface. In accordance with the present invention, a browser interface may be customized using a controlling software program that may be provided by an Internet content provider, an ISP, or that may reside on an Internet user's computer. The controlling software program enables the Internet user, the content provider, or the ISP to customize and control the information and/or functionality of a user's browser and browser interface.RSS Enhancements
The following three Yahoo filings all list the same inventors, including John Thrall who is the head of media search engineering, for Yahoo Search. They provide different aspects of using RSS with multimedia files.
Syndicating multiple media objects with RSS Invented by Andrew R. Volk, David D. Hall, and John J. Thrall US Patent Application 20060129917 Published June 15, 2006 Filed December 1, 2005
Abstract
System and method for syndicating more than one media object in an element using Real Simple Syndication (RSS). In one embodiment, multiple media objects with at least one shared characteristic are syndicated under the same element. For example, a single media object can come in multiple formats and/or compression rates.Syndicating multimedia information with RSS Invented by Andrew R. Volk, David D. Hall, John J. Thrall US Patent Application 20060129907 Published June 15, 2006 Filed December 1, 2005
Abstract
System and method for adding descriptive information to a Real Simple Syndication (RSS) document. The descriptive information describes the content of media objects syndicated through the document. The descriptive information can be used to provided additional information to a subscriber, and can be used in searching for syndicated media content.RSS rendering via a media player Invented by Andrew R. Volk, David D. Hall, John J. Thrall US Patent Application 20060129916 Published June 15, 2006 Filed December 1, 2005
Abstract
System and method for syndicating media objects through a link to a media player using Real Simple Syndication (RSS). A content provider may not want to give direct access to a media object to a subscriber. Instead a content provider can give the subscriber a link to a media player that can access the media object.Microsoft
Searching electronic program guide data Invented by Pradhan S. Rao, David Hendler Sloo, Daniel Danker, and George K. Nyako Assigned to Microsoft US Patent Application 20060130098 Published June 15, 2006 Filed December 15, 2004
Abstract
Searching electronic program guide (EPG) data is described. The EPG data may be compartmentalized into channel metadata that describes characteristics of one or more channels and content metadata that describes characteristics of one or more content items. In a implementation, a method includes searching channel metadata and content metadata. A result of the searching is formed for output in conjunction with an electronic program guide (EPG).System and method for indexing and prefiltering Invented by Brian Burdick, Joshua J. Forman, Kevin P. Kornelson, Murali Vajjiravel, and Rajeev Prasad Assigned to Microsoft US Patent Application 20060129555 Published June 15, 2006 Filed December 9, 2004
Abstract
A method and system are provided for selecting advertisements for presentation to a user in response to a user search query. The system may include a keyword server for parsing the user search query and an index server for receiving the parsed search query. The index server may include an index of advertising phrases and pre-filtering components for comparing index entries to the parsed user search query in order to discard non-matching index entries and locate matching entries. The pre-filtering components may include either a phrase length pre-filtering component or a word hash pre-filtering component. The system may additionally include a listing server for sorting through the matching entries located by the index server and further filtering the matching entries for retrieval and presentation to the user.IBM
Ring method, apparatus, and computer program product for managing federated search results in a heterogeneous environment Invented by Wade Shelby Beavers and David Joseph Borrillo Assigned to IBM US Patent Application 20060129530 Published June 15, 2006 Filed December 9, 2004
Abstract
A method, apparatus and computer program product are provided for managing federated search results in a heterogeneous environment. A user enters a search term and the search term is submitted to multiple selected search engines. Search results are gathered from each selected search engine. A search ring is generated including a ring section to represent each of the selected search engines for enabling the user to view search results from one or more of the selected search engines.Method and system for suggesting search engine keywords Invented by Cary Lee Bates Assigned to IBM US Patent Application 20060129531 Published June 15, 2006 Filed December 9, 2004
Abstract
A search engine receives a search query having one or more keywords. The documents in the result set from that search query are analyzed to identify one or more additional keywords that further segment, or separate, the initial result set. These additional keywords are presented to the user who then selects whether to include or exclude documents matching the additional keywords. In this way, the number of documents in the initial result set is reduced in a relatively quick and effortless manner.Go Daddy
Email filtering system and method Invented by Brad Owen and Jason Steiner US Patent Application 20060129644 Published June 15, 2006 Filed December 14, 2004
Abstract
Systems and methods of the present invention allow filtering out spam and phishing email messages based on the links embedded into the email messages. In a preferred embodiment, an Email Filter extracts links from the email message and obtains desirability values for the links. The Email Filter may route the email message based on desirability values. Such routing includes delivering the email message to a Recipient, delivering the message to a Quarantine Mailbox, or deleting the message.Xerox
Personalized web search method Invented by Lisa S. Purvis Assigned to Xerox Corporation US Patent Application 20060129533 Published June 15, 2006 Filed December 15, 2004
Abstract
A method for contextualizing search results is disclosed. The method includes performing a traditional web query that returns a set of result pages, using collaborative filtering techniques to generate a set of predicted pages, comparing the set of predicted pages with the set of result pages, and ranking the set of result pages so that result pages that are also included in the set of predicted pages are ranked higher than those that are not. Methods herein also contemplate using the search history of the user or others to refine the results of searches.Ask.com
Relevancy-based database retrieval and display techniques Invented by Tao Yang, Wei Wang, and Apostolos Gerasoulis US Patent Application 20060129552 Published June 15, 2006 Filed February 2, 2006
Abstract
Techniques to retrieve, rank and display data objects retrieved form a database are described. In particular, methods to assign a global ranking value to a data object based on a combination of that object's link-based (e.g., vector-space cluster analysis) and text-based (e.g., word frequency) ranks are described. Additional techniques to determine a set of concepts, topics or key words associated with each retrieved data objects are described.My usual reminder about patents: Some of the processes and technology described in patents are created in house, and some are developed with the assistance of contractors and partners. A percentage are never developed in a tangible manner, but may serve as a way to attempt to exclude others from using the technology, or even to possibly mislead competitors into exploring an area that they might not have an interest in (sometimes skepticism is good.)
There are times when a Google or Yahoo acquires a company to gain access to the intellectual property of that company, or the intellectual prowess and expertise of that company's employees. And sometimes patents are just purchased.
Want to comment or discuss? Visit our Search Technology & Relevancy area of the Search Engine Watch Forums.
Posted by Kevin Heisler at 8:42 PM | Permalink
New Search Patent Applications: June 19, 2006 - Autolinking, and Better Advertising through Deletion PredictionsFour patent applications from Google describe fighting spam in emails, providing product review searches, moving large amounts of data, and autolinking. Yahoo matches, and raises with five patent filings. One on watching deletions to choose better ads, another on serving dynamic information through a additional browser interface, and three more on multimedia and RSS.
Microsoft goes TV 2.0 with an electronic program guide, and describes a way of matching advertising content with certain search queries before those searches are made. IBM comes up with a unique way of presenting the results of a search from more than one search engine, and a way of reducing the amount of irrelevant results in a search by analyzing an initial set of results, identifying an appropriate additional query term from those results, and searching the original results again but with the additional query term included in the search.
Go Daddy describes a way of fighting spam in emails. Xerox employs collaborative filtering from previous users' searches to predict search results. Apostolos Gerasoulis, from Ask.com, with a couple of co-inventors, ranks and displays pages (objects) based upon linkage and textual data, and then defines a way to identifiy and assign topics to them.
Email Spam
Emails with links in them could be considered spam if the links point to pages that are in a conceptual category considered spammy. This patent application really doesn't describe the concept categorization part of the process. That's done in a related patent application mentioned within this document, and the related document lists Georges Harik as one inventor. Dr. Harik's name is on a very large percentage of the patent applications involving Gmail-type processes.
Method and system to detect e-mail spam using concept categorization of linked content Invented by Johnny Chen US Patent Application 20060122957 Published June 8, 2006 Filed December 3, 2004
Abstract
A system and method for detecting undesired electronic messages (e.g., spam) using concept categorization of hyperlinks is disclosed. A server receives an electronic message and retrieves web pages that correspond to hyperlinks in the message. The server performs concept categorization on the retrieved web pages based on semantic relationships in the received information to determine whether the electronic message meets predefined criteria associated with undesired messages.Searching and Aggregating Product Reviews
If Google wanted to get into the product or services review business, the next patent filing describes a blue print for the process that might make an effective and innovative system.
Method and system for finding and aggregating reviews for a product Invented by Jan Matthias Ruhl and Mayur D. Datar US Patent Application 20060129446 Published June 15, 2006 Filed December 14, 2004
Abstract
The embodiments disclosed herein include new, more efficient ways to collect product reviews from the Internet, aggregate reviews for the same product, and provide an aggregated review to end users in a searchable format. One aspect of the invention is a graphical user interface on a computer that includes a plurality of portions of reviews for a product and a search input area for entering search terms to search for reviews of the product that contain the search terms.Scaling and Distributing Data
Arvind Jain is the head of Research and Development in Google's Bangalore office, and has spoken at a number of conferences on infrastructure projects and issues involving such things as Google's crawl and indexing system, distributed file replication system, and compression techniques for large scale storage systems. He's listed as the inventor for this next Google filing.
System and method for scalable data distribution Invented by Arvind Jain US Patent Application 20060126201 Published June 15, 2006 Filed December 10, 2004
Abstract
A system having a resource manager, a plurality of masters, and a plurality of slaves, interconnected by a communications network. To distribute data, a master determined that a destination slave of the plurality slaves requires data. The master then generates a list of slaves from which to transfer the data to the destination slave. The master transmits the list to the resource manager. The resource manager is configured to select a source slave from the list based on available system resources. Once a source is selected by the resource manager, the master receives an instruction from the resource manager to initiate a transfer of the data from the source slave to the destination slave. The master then transmits an instruction to commence the transfer.Autolinking
Google's Autolink raised a lot of eyebrows, and brought some negative reactions. A Search Engine Watch Blog post from Danny Sullivan, Google Toolbar's AutoLink & The Need For Opt-Out defined many of the issues around the toolbar feature. The following patent application explains how such a system might work from the search engine's perspective.
Providing useful information associated with an item in a document Invented by Gueorgui Djabarov US Patent Application 20060129910 Published June 15, 2006 Filed December 14, 2004
Abstract
A method includes recognizing an item within a first document based on a pattern associated with the item but not the exact content of the item. The method further includes identifying a link for the item and providing a second document that includes information associated with the item when the link for the item is selected.Yahoo
Choosing Better Ads through User Behavior
Some queries involve the use of concepts and units, as described in at least five Yahoo patent filings (see previous patent posts in the Yahoo sections from Yahoo Units and Microsoft Redundancy Filters and More Yahoo Concepts and Google Predictive Searches.)
But sometimes a two term query isn't a concept as much as it is a couple of keywords that someone may use to search for something. If that person performs a second search after deleting one of the words, then the record of that deletion and second search might help Yahoo calculate "deletion probability scores" for words being used in these kind of two term queries.
This can be helpful when there isn't a good keyword based advertising match for that query, but there might be a good match individually for each of the terms that make up the query. The "deletion probability scores" can help determine which of the two terms to show keyword-based advertising for in search results.
System and methods for ranking the relative value of terms in a multi-term search query using deletion prediction Invented by Rosemary Jones and Daniel C. Fain US Patent Application 20060129534 Published June 15, 2006 Filed December 14, 2004
Abstract
The likely relevance of each term of a search-engine query of two or more terms is determined by their deletion probability scores. If the deletion probability scores are significantly different, the deletion probability score can be used to return targeted ads related to the more relevant term or terms along with the search results. Deletion probability scores are determined by first gathering historical records of search queries of two or more terms in which a subsequent query was submitted by the same user after one or more of the terms had been deleted. The deletion probability score for a particular term of a search query is calculated as the ratio of the number of times that particular term was itself deleted prior to a subsequent search by the same user divided by the number of times there were subsequent search queries by the same user in which any term or terms including that given term was deleted by the same user prior to the subsequent search. Terms are not limited to individual alphabetic words.Browser Interface Helpers
This next document describes some ways to provide additional dynamic information to someone via a toolbar styled interface, while they are browsing pages on the web.
Method of controlling an Internet browser interface and a controllable browser interface Invented by Thomas J. Shafron Assigned to Yahoo US Patent Application 20060129937 Published June 15, 2006 Filed February 2, 2006
Abstract
The present invention is directed to a method of dynamically controlling and displaying an Internet browser interface, and to a dynamically controllable Internet browser interface. In accordance with the present invention, a browser interface may be customized using a controlling software program that may be provided by an Internet content provider, an ISP, or that may reside on an Internet user's computer. The controlling software program enables the Internet user, the content provider, or the ISP to customize and control the information and/or functionality of a user's browser and browser interface.RSS Enhancements
The following three Yahoo filings all list the same inventors, including John Thrall who is the head of media search engineering, for Yahoo Search. They provide different aspects of using RSS with multimedia files.
Syndicating multiple media objects with RSS Invented by Andrew R. Volk, David D. Hall, and John J. Thrall US Patent Application 20060129917 Published June 15, 2006 Filed December 1, 2005
Abstract
System and method for syndicating more than one media object in an element using Real Simple Syndication (RSS). In one embodiment, multiple media objects with at least one shared characteristic are syndicated under the same element. For example, a single media object can come in multiple formats and/or compression rates.Syndicating multimedia information with RSS Invented by Andrew R. Volk, David D. Hall, John J. Thrall US Patent Application 20060129907 Published June 15, 2006 Filed December 1, 2005
Abstract
System and method for adding descriptive information to a Real Simple Syndication (RSS) document. The descriptive information describes the content of media objects syndicated through the document. The descriptive information can be used to provided additional information to a subscriber, and can be used in searching for syndicated media content.RSS rendering via a media player Invented by Andrew R. Volk, David D. Hall, John J. Thrall US Patent Application 20060129916 Published June 15, 2006 Filed December 1, 2005
Abstract
System and method for syndicating media objects through a link to a media player using Real Simple Syndication (RSS). A content provider may not want to give direct access to a media object to a subscriber. Instead a content provider can give the subscriber a link to a media player that can access the media object.Microsoft
Searching electronic program guide data Invented by Pradhan S. Rao, David Hendler Sloo, Daniel Danker, and George K. Nyako Assigned to Microsoft US Patent Application 20060130098 Published June 15, 2006 Filed December 15, 2004
Abstract
Searching electronic program guide (EPG) data is described. The EPG data may be compartmentalized into channel metadata that describes characteristics of one or more channels and content metadata that describes characteristics of one or more content items. In a implementation, a method includes searching channel metadata and content metadata. A result of the searching is formed for output in conjunction with an electronic program guide (EPG).System and method for indexing and prefiltering Invented by Brian Burdick, Joshua J. Forman, Kevin P. Kornelson, Murali Vajjiravel, and Rajeev Prasad Assigned to Microsoft US Patent Application 20060129555 Published June 15, 2006 Filed December 9, 2004
Abstract
A method and system are provided for selecting advertisements for presentation to a user in response to a user search query. The system may include a keyword server for parsing the user search query and an index server for receiving the parsed search query. The index server may include an index of advertising phrases and pre-filtering components for comparing index entries to the parsed user search query in order to discard non-matching index entries and locate matching entries. The pre-filtering components may include either a phrase length pre-filtering component or a word hash pre-filtering component. The system may additionally include a listing server for sorting through the matching entries located by the index server and further filtering the matching entries for retrieval and presentation to the user.IBM
Ring method, apparatus, and computer program product for managing federated search results in a heterogeneous environment Invented by Wade Shelby Beavers and David Joseph Borrillo Assigned to IBM US Patent Application 20060129530 Published June 15, 2006 Filed December 9, 2004
Abstract
A method, apparatus and computer program product are provided for managing federated search results in a heterogeneous environment. A user enters a search term and the search term is submitted to multiple selected search engines. Search results are gathered from each selected search engine. A search ring is generated including a ring section to represent each of the selected search engines for enabling the user to view search results from one or more of the selected search engines.Method and system for suggesting search engine keywords Invented by Cary Lee Bates Assigned to IBM US Patent Application 20060129531 Published June 15, 2006 Filed December 9, 2004
Abstract
A search engine receives a search query having one or more keywords. The documents in the result set from that search query are analyzed to identify one or more additional keywords that further segment, or separate, the initial result set. These additional keywords are presented to the user who then selects whether to include or exclude documents matching the additional keywords. In this way, the number of documents in the initial result set is reduced in a relatively quick and effortless manner.Go Daddy
Email filtering system and method Invented by Brad Owen and Jason Steiner US Patent Application 20060129644 Published June 15, 2006 Filed December 14, 2004
Abstract
Systems and methods of the present invention allow filtering out spam and phishing email messages based on the links embedded into the email messages. In a preferred embodiment, an Email Filter extracts links from the email message and obtains desirability values for the links. The Email Filter may route the email message based on desirability values. Such routing includes delivering the email message to a Recipient, delivering the message to a Quarantine Mailbox, or deleting the message.Xerox
Personalized web search method Invented by Lisa S. Purvis Assigned to Xerox Corporation US Patent Application 20060129533 Published June 15, 2006 Filed December 15, 2004
Abstract
A method for contextualizing search results is disclosed. The method includes performing a traditional web query that returns a set of result pages, using collaborative filtering techniques to generate a set of predicted pages, comparing the set of predicted pages with the set of result pages, and ranking the set of result pages so that result pages that are also included in the set of predicted pages are ranked higher than those that are not. Methods herein also contemplate using the search history of the user or others to refine the results of searches.Ask.com
Relevancy-based database retrieval and display techniques Invented by Tao Yang, Wei Wang, and Apostolos Gerasoulis US Patent Application 20060129552 Published June 15, 2006 Filed February 2, 2006
Abstract
Techniques to retrieve, rank and displ