251st American Chemical Society National Meeting & Exposition

Conference
March 13, 2016 to March 17, 2016
Venue: San Diego, United States

Linguamatics looks forward to seeing you at the 251st American Chemical Society National Meeting & Exposition in San Diego, March 2016.

Jeff Nauss (Linguamatics) will be speaking on:

Chemistry, Data, & the Semantic Web: An Important Triple to Advance Science

What the presentation will cover:

Chemical data representation is a challenge that has been addressed using different methodologies. Representation includes not only a set of unique chemical descriptors for the molecules themselves, but also, in the form of metadata, the linking processes (reactions) that they belong to. The structured nature of this data makes it easy to store in structured databases. However, one common issue remains: the low quality of the metadata associated with each chemical entity. This can hinder the extraction of meaningful knowledge from the stored information without time-consuming human intervention. Efforts have been made in a) the optimization of chemical and reaction representation in order to achieve real-time text and data mining and b) the integration of chemical information with semantic analysis of surrounding text generated by researchers. In this talk we will focus on addressing the first issue in detail and discuss strategies for the second part.

We will provide background on the chemical and reaction representations used by Dotmatics and on the tools, combined into a comprehensive chemistry toolkit, that enable chemists to generate them. Additionally, this talk will cover how chemistry descriptors can be converted into computer fingerprints or bit-strings, allowing high-performance searching (superstructure and substructure searching) and ranking of chemistry data. These solutions also take advantage of advanced memory mapping and threading to provide interactive capabilities beyond those normally available on standard laptop computers. This enables data discovery to be done at the application level instantly and without recourse to large-scale server infrastructure.
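
As a rough illustration of the fingerprint idea mentioned in the abstract, the sketch below hashes a set of structural features into a bit-string, uses a bitwise test as a substructure pre-screen, and ranks entries by Tanimoto similarity. It is not the Dotmatics implementation: the feature labels (such as "C-O"), the function names, and the 1024-bit length are all assumptions made for the example, and a real toolkit would derive features from the molecular graph and confirm screen hits with an exact structure match.

```python
import zlib
from typing import Dict, Iterable, List, Tuple


def fingerprint(features: Iterable[str], n_bits: int = 1024) -> int:
    """Hash each feature label to one bit position; the int acts as a bit-string."""
    fp = 0
    for feature in features:
        # crc32 is used only because it is deterministic across runs (an assumption
        # of this sketch, not a claim about any particular chemistry toolkit).
        fp |= 1 << (zlib.crc32(feature.encode("utf-8")) % n_bits)
    return fp


def may_contain(query_fp: int, target_fp: int) -> bool:
    """Substructure pre-screen: every bit set in the query must be set in the target.

    A True result is only a candidate (hash collisions give false positives);
    a False result safely rules the target out.
    """
    return query_fp & target_fp == query_fp


def tanimoto(a: int, b: int) -> float:
    """Tanimoto similarity: shared bits divided by bits set in either fingerprint."""
    union = bin(a | b).count("1")
    if union == 0:
        return 0.0
    return bin(a & b).count("1") / union


def rank_by_similarity(query_fp: int, library: Dict[str, int]) -> List[Tuple[str, float]]:
    """Return (name, similarity) pairs, most similar first."""
    scored = [(name, tanimoto(query_fp, fp)) for name, fp in library.items()]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical feature sets, written by hand for illustration only.
library = {
    "ethanol": fingerprint(["C-C", "C-O", "O-H"]),
    "acetic acid": fingerprint(["C-C", "C=O", "C-O", "O-H"]),
    "ethane": fingerprint(["C-C"]),
}
hydroxyl_query = fingerprint(["C-O", "O-H"])

candidates = [name for name, fp in library.items() if may_contain(hydroxyl_query, fp)]
ranking = rank_by_similarity(library["ethanol"], library)
```

Because both operations reduce to bitwise AND/OR and bit counts, a screen of this kind can run over a large in-memory collection quickly, which is the property the abstract relies on for interactive searching.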