Data fusion is the process of combining information obtained from many sensors, on many platforms, into a single composite picture of the environment. In this case, the "sensors" are the databases that exist in separate, non-integrated organizations on various operational platforms. The Data Fusion & Analysis solution is implemented by the CiteLink platform, which is designed to address the data fusion and link analysis challenge. CiteLink is based on the Semantic Web Server, which approaches the data fusion problem from the point of view of Semantic Web technology and architecture.
The Data Fusion solution draws its data from the databases through the ETL layer, in accordance with a data presentation vocabulary. The ETL step defines the types of data to be exported from the databases and creates the data vocabulary. The vocabulary is written semantically and sits on top of the ETL layer, where it mediates between the raw data from diverse, differently structured databases and the analyst's queries, which are expressed in a common, non-programming language. Analysis of the data is driven by various inference rules and alerts. In addition to the data presented directly, the solution offers data inferred on the basis of inheritance and other domain-applicable rules.
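The flow described above can be illustrated with a minimal sketch. It is not CiteLink's implementation: the source names (`hr_db`, `crm_db`), field names, class hierarchy, and all function names are hypothetical, and the sketch assumes that records from different databases refer to the same entity when they share an `id` value. It shows a vocabulary that renames source-specific fields to shared terms, a fusion step that merges rows per entity, and a simple inheritance rule that adds every parent class of an entity's class.

```python
# Hypothetical vocabulary: maps each source's column names to shared terms.
VOCABULARY = {
    "hr_db": {"emp_name": "name", "emp_id": "id", "dept": "department"},
    "crm_db": {"full_name": "name", "customer_ref": "id", "phone_no": "phone"},
}

# Hypothetical class hierarchy used for inheritance-based inference.
SUBCLASS_OF = {"Employee": "Person", "Customer": "Person", "Person": "Entity"}

# Hypothetical assignment of a class to the records of each source.
SOURCE_CLASS = {"hr_db": "Employee", "crm_db": "Customer"}


def to_common_terms(source, row):
    """Rename a raw row's fields to the shared vocabulary; drop unmapped fields."""
    mapping = VOCABULARY[source]
    return {mapping[k]: v for k, v in row.items() if k in mapping}


def inferred_classes(cls):
    """Return cls plus every ancestor reached by following SUBCLASS_OF."""
    result = [cls]
    while cls in SUBCLASS_OF:
        cls = SUBCLASS_OF[cls]
        result.append(cls)
    return result


def fuse(rows):
    """Merge (source, row) pairs into one record per shared 'id' (an assumption)."""
    fused = {}
    for source, row in rows:
        record = to_common_terms(source, row)
        entity = fused.setdefault(record["id"], {"classes": set()})
        entity.update(record)
        # Inferred data: the entity also belongs to all parent classes.
        entity["classes"].update(inferred_classes(SOURCE_CLASS[source]))
    return fused


rows = [
    ("hr_db", {"emp_name": "A. Cohen", "emp_id": "42", "dept": "R&D", "salary": 1}),
    ("crm_db", {"full_name": "A. Cohen", "customer_ref": "42", "phone_no": "555-0100"}),
]
fused = fuse(rows)
```

Here `fused["42"]` holds the department from one source and the phone number from the other, and its `classes` set includes `Person` and `Entity` even though neither database states them explicitly. A real deployment would need a far more careful entity-matching step than an exact `id` comparison.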
Large enterprises and government agencies need a single point of access to data about the same entity, whether that data is located in various databases within the organization or available outside it, for further investigation and research.