Extraction analysis techniques biased, in part, by query frequency information from a query log file and/or search engine cache are employed along with machine learning processes to determine candidate keywords and/or phrases of web documents. Web oriented features associated with the candidate keywords...http://www.google.com.tw/patents/US8135728?utm_source=gb-gplus-share專利 US8135728 - Web document keyword and phrase extraction