We present an on-going research project addressing the problem of massive amounts of unstructured data that is generated on a daily basis in most business organisations, regardless of size. Our motivation is to support in particular small and medium seized enterprises to gain a competitive advantage in the market. The goal is to improve their processes for extracting valuable business information from such disorganised data. To achieve this, we introduce a flexible and scalable data analysis framework capable of transforming various types of documents into semantically annotated structures. This includes emails, text files in various formats, slide presentations, blog entries, etc. Additionally, the solution provides a semantic search engine for structured retrieval of the analyzed information and a graphical layer to dynamically visualize the search results as an interactive graph. Throughout the paper, the architecture of two main engines that are responsible for data and text analysis and semantic search are described. We conclude that semantic processing of unstructured sources signi cantly improves data management and data integration within the enterprises.
Adam Bartusiak has been a research associate at Enterprise Application Development group at University of Applied Sciences Zittau/Görlitz since October 2014, where he works on research projects in the fields of data analysis, semantic web and cognitive computing. He has a five year experience as a software engineer, developing mobile applications for IT-companies in Poland. He received the B.Sc degree in Communcation Management from the Neisse University in 2006 and the M.Sc. degree in Computer Science from the University of Applied Sciences Zittau/Görlitz in 2008. Adam lives with his wife Anne and children Alwin, 3 and Artur, 1 in Görlitz, Germany. In his spare time, he enjoys flying sailplanes, sailing and listening to good music.