Application of semantic similarity calculation in parameter matching of detection data

Abstract:

Aligning inline inspection datasets helps to improve the utilization rate of the data. Researchers in China and abroad have established preliminary alignment methods. However, these methods still do not address the complexity and diversity of the Chinese characters used in inline inspection reports. Here, Chinese semantic similarity calculation is used to determine the degree of matching between fields, select the matching fields from a large number of candidate fields, and align data from different inspection companies. The method is an improvement on Synonym Forest-based similarity calculation and is applied to actual fields taken from inline inspection reports. The improved method can distinguish between different fields and is well suited to aligning data from multiple inspections.

Key words: semantic similarity; inline inspection; data alignment; Synonym Forest; long-distance pipeline

Received: 2018-04-20

Corresponding Author: shdong@cup.edu.cn

Cite this article: ZHANG Hewei, JIN Jian, DONG Shaohua, ZHANG Laibin, LI Ning. Application of semantic similarity calculation in parameter matching of detection data. Petroleum Science Bulletin, 2018, 04: 446-451.
