On the basis of analyzing current problems existing in data cleaning , especially after abundant researching on exploring and eliminating approximately duplicated records , this paper brings forward record matching method and eliminating approximately duplicated records method based on rdbms , expecting to eliminate approximately duplicated records in data warehouse 本文在对当前的数据清洗问题,特别是探测和消除重复记录方面,做了充分的研究后,提出了基于rdbms的记录匹配方法和消除数据仓库中相似重复记录的方法,以期消除数据仓库中的相似重复记录。