Unleashing Semantics of Research Data

Research depends to a large degree on the availability and quality of primary research data, i.e., data generated through experiments and evaluations. While the Web in general and Linked Data in particular provide a platform and the necessary technologies for sharing, managing and utilizing research data, an ecosystem supporting those tasks is still missing. The vision of the CODE project is the establishment of a sophisticated ecosystem for Linked Data. Here, the extraction of knowledge encapsulated in scientific research paper along with its public release as Linked Data serves as the major use case. Further, Visual Analytics approaches empower end users to analyse, integrate and organize data. During these tasks, specific Big Data issues are present.

[1]  S. Soderland,et al.  - based Named Entity Disambiguation to Arbitrary Web Text , 2009 .

[2]  Alexandros Labrinidis,et al.  Challenges and Opportunities with Big Data , 2012, Proc. VLDB Endow..

[3]  Tobias Bürger,et al.  LMF: A Framework for Linked Media , 2011, 2011 Workshop on Multimedia on the Web.

[4]  Ying Liu,et al.  An Efficient Pre-processing Method to Identify Logical Components from PDF Documents , 2011, PAKDD.

[5]  Rajeev Rastogi,et al.  Entity disambiguation with hierarchical topic models , 2011, KDD.

[6]  Tim Berners-Lee,et al.  Linked Data - The Story So Far , 2009, Int. J. Semantic Web Inf. Syst..

[7]  Roman Kern,et al.  A comparison of layout based bibliographic metadata extraction techniques , 2012, WIMS '12.

[8]  Jiawei Han,et al.  Graph cube: on warehousing and OLAP multidimensional networks , 2011, SIGMOD '11.

[9]  Abraham Bernstein,et al.  The Semantic Web - ISWC 2009, 8th International Semantic Web Conference, ISWC 2009, Chantilly, VA, USA, October 25-29, 2009. Proceedings , 2009, SEMWEB.

[10]  Andreas Harth,et al.  Transforming statistical linked data for use in OLAP systems , 2011, I-Semantics '11.

[11]  Alvaro Barreiro,et al.  Improving the Extraction of Text in PDFs by Simulating the Human Reading Order , 2012, J. Univers. Comput. Sci..

[12]  Alistair Moffat,et al.  Improvements that don't add up: ad-hoc retrieval results since 1998 , 2009, CIKM.

[13]  Roman Kern,et al.  TeamBeam - Meta-Data Extraction from Scientific Literature , 2012, D Lib Mag..

[14]  Mark Dredze,et al.  Entity Disambiguation for Knowledge Base Population , 2010, COLING.

[15]  Martin Gaedke,et al.  Discovering and Maintaining Links on the Web of Data , 2009, SEMWEB.

[16]  Dietrich Rebholz-Schuhmann,et al.  Annotation and Disambiguation of Semantic Types in Biomedical Text: A Cascaded Approach to Named Entity Recognition , 2006, NLPXML@EACL.