Techniques are disclosed that locate implicitly defined semantic structures in a document, such as, for example, implicitly defined lists in an HTML document. The semantic structures can be used in the calculation of distance values between terms in the documents. The distance values may be used, for...http://www.google.com.tw/patents/US7716216?utm_source=gb-gplus-share專利 US7716216 - Document ranking based on semantic distance between terms in a document