I primarily work on the entity coreference problem in the Semantic Web: detecting coreferent (owl:sameAs) links between identifiers (ontology instances) from heterogeneous information sources. I am particularly interested in the domain independence and scalability of entity coreference.
I am also interested more broadly in research that involves Semantic Web technologies and in the settings and applications that adopt Semantic Web techniques.
Contact Information:
Address:
Dept. of Computer Science and Engineering
Lehigh University
19 Memorial Drive West
Bethlehem, PA 18015
Automatically Generating Data Linkages Using a Domain-Independent Candidate Selection Approach (PDF). November, 2011. Beijing, China. This is a talk given at Beijing Institute of Technology about my ISWC 2011 work.
Automatically Generating Data Linkages Using a Domain-Independent Candidate Selection Approach (PDF, Video). October, 2011. Bonn, Germany. This is my presentation at the 10th International Semantic Web Conference.
Named Entity Coreference: A Brief Survey (PDF). October, 2011. Bethlehem, USA. This is my Depth Study talk on the topic of entity coreference. It briefly introduces the techniques and models related to entity coreference in general.
DAE: A Platform for Document Analysis Research (PDF). December, 2010. Bethlehem, USA. This talk summarizes my work on the DARPA DAE project. It was given at the Graduate Research Seminar Series of the Department of Computer Science and Engineering of Lehigh University.
I worked as a student intern at the Division of Biomedical Statistics and Informatics, Mayo Clinic, Rochester, MN. My primary duty at Mayo was to develop a Protege plugin to support ontology-based semi-automatic annotation of clinical narratives. This tool supports manually annotating clinical documents with the classes and properties in an ontology. It also enables automatic annotation by connecting to state-of-the-art tools, such as cTAKES and the NCBO annotator. For more details about this tool, please refer to the official website of Semantator.
NSF project: Structuring, Reasoning, and Querying in a Very Large Medical Image Database
I worked on the NSF-funded project Structuring, Reasoning, and Querying in a Very Large Medical Image Database. In this project, I explored how to apply entity coreference techniques to assist the diagnosis of cervical cancer. Since the purpose of entity coreference is to determine whether different mentions refer to the same real-world entity, we explored how such techniques, together with image processing techniques, can help locate similar patient cases of cervical cancer. We obtained interesting results when applying our method to real-world datasets.
DARPA project: Document Analysis and Exploitation (DAE)
I worked on the DARPA-funded Document Analysis and Exploitation (DAE) project. In this project, I designed a database schema for storing document analysis data, experiment results, and data provenance. I also explored the technical and economic feasibility of using cloud computing for data storage and computation. I gave a talk (PDF) on this project in a research seminar organized by the Department of Computer Science and Engineering at Lehigh University.