An Annotated Corpus for Machine Reading of Instructions in Wet Lab Protocols
Abstract
We describe an effort to annotate a corpus of natural language instructions consisting of 622 wet lab protocols to facilitate automatic or semi-automatic conversion of protocols into a machine-readable format and benefit biological research. Experimental results demonstrate the utility of our corpus for developing machine learning approaches to shallow semantic parsing of instructional texts. We make our annotated Wet Lab Protocol Corpus available to the research community.
- Publication:
-
arXiv e-prints
- Pub Date:
- May 2018
- DOI:
- 10.48550/arXiv.1805.00195
- arXiv:
- arXiv:1805.00195
- Bibcode:
- 2018arXiv180500195K
- Keywords:
-
- Computer Science - Computation and Language;
- Computer Science - Artificial Intelligence