Product details
Description: Extracting relations from the Web
The WWW is a vast resource for information. At the same time, it is extremely distributed. A particular type of data, such as restaurant lists, may be scattered across thousands of independent information sources in many different formats. In this paper, we consider the problem of extracting a relation for such a data type from all of these sources automatically. We present a technique which exploits the duality between sets of patterns and relations to grow the target relation starting from a small sample. To test our technique, we use it to extract a relation of (author, title) pairs from the WWW.
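The pattern/relation duality mentioned in the description can be illustrated with a minimal sketch. The abstract does not say how patterns are represented or matched, so everything below is an assumption made for illustration: each "source" is a single line of text, a pattern is a hypothetical (author_first, middle) pair that records the field order and the text between the two fields, and the names induce_patterns, apply_patterns, and grow_relation are invented here. It is not the paper's actual method.

```python
# Illustrative sketch only: the pattern representation and all function
# names are assumptions, not taken from the paper.

def induce_patterns(pairs, lines):
    """Derive (author_first, middle) patterns from lines that contain a known pair."""
    patterns = set()
    for line in lines:
        for author, title in pairs:
            for first, second, author_first in ((author, title, True),
                                                (title, author, False)):
                if (line.startswith(first) and line.endswith(second)
                        and len(line) > len(first) + len(second)):
                    # The text between the two fields becomes the pattern.
                    middle = line[len(first):len(line) - len(second)]
                    patterns.add((author_first, middle))
    return patterns


def apply_patterns(patterns, lines):
    """Extract (author, title) pairs from every line that matches a pattern."""
    pairs = set()
    for line in lines:
        for author_first, middle in patterns:
            left, sep, right = line.partition(middle)
            if sep and left and right:
                pairs.add((left, right) if author_first else (right, left))
    return pairs


def grow_relation(seeds, lines, max_rounds=10):
    """Alternate between inducing patterns and extracting pairs until nothing new appears."""
    relation = set(seeds)
    for _ in range(max_rounds):
        patterns = induce_patterns(relation, lines)
        found = apply_patterns(patterns, lines)
        if found <= relation:  # no new pairs: stop
            break
        relation |= found
    return relation


# Made-up sample data standing in for Web sources.
lines = [
    "Foundation by Isaac Asimov",
    "Dune by Frank Herbert",
    "Frank Herbert: Dune",
    "William Gibson: Neuromancer",
]
seeds = {("Isaac Asimov", "Foundation")}
result = grow_relation(seeds, lines)
```

In this toy run, the single seed pair yields the " by " pattern, which finds a second pair; that pair in turn appears in a second format, yielding a ": " pattern that finds a third pair. A real system would need safeguards against overly general patterns, which this sketch omits.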
File list:
