The Author Curation to Knowledgebase project (ACKnowledge, previously known as “Author First Pass”) aims at facilitating author curation to biological knowledgebases.
Biological knowledgebases serve as a critical resource for researchers and accelerate scientific discoveries by providing manually curated, machine-readable data collections. At present, the aggregation and manual curation of biological data is a labor-intensive process that relies almost entirely on professional biocurators. Biocuration, however, is greatly aided when authors directly participate in the process.
ACKnowledge employs a semi-automated curation pipeline that identifies biological entities and data types (for example, genes and anatomic expression) from newly published research articles and then automatically emails authors to review the extracted information through a dedicated web interface. During the curation process, authors may modify extracted data, an important quality control measure. Participating authors are publicly acknowledged for their contribution in the knowledgebase.
The ACKnowledge system, currently implemented by WormBase for the C. elegans literature and soon to be implemented for other species, couples statistical methods and state-of-the-art text mining algorithms to identify entities and data types.
Additional tools in the ACKnowledge system include software to send automated reminder emails to authors who have not completed their submissions, a notification system to inform knowledgebase curators about new author submissions, and a dashboard for curators to monitor submission status.
The ACKnowledge project is supported by grant R01 OD023041 from the National Library of Medicine at the US National Institutes of Health.
If you have questions about the ACKnowledge project, please contact help.acknowledge@wormbase.org.
ACKnowledge Github repositoryRelease 4.0 marks an exciting transition for our project. With support from the National Library of Medicine, the Author First Pass (AFP) project is now officially known as ACKnowledge (Author Curation to Knowledgebase). This change reflects our commitment to expanding artificial intelligence (AI)-enhanced community curation to capture more detailed experimental findings and to introduce our curation pipeline to additional model organism communities and their respective knowledgebases. While busy transitioning our project and enhancing our AI methods, we also implemented further improvements to the current pipeline based on continuing valuable feedback from authors. These improvements, listed below, will help authors add more information about their papers and readily find previously published papers processed by our pipeline for curation using our Author Curation Portal.
Based on valuable user feedback, we continue to improve the ACKnowledge experience. In addition to providing more streamlined ways to enter entities, we’ve added further instructions for adding new entities, and a link to look up specific entities in WormBase. We now flag papers for enzymatic activity and include a free-text field to describe other gene functions. We updated the FAQs and provide a link to the ACKnowledge webinar presented in February 2021. Lastly, to thank authors for their valuable contributions, we now acknowledge authors on WormBase person and paper pages.
We have implemented updates and improvements to our curator dashboard. The dashboard is a tool available to WormBase curators to keep track of submissions and perform quality control of submitted data.
We improved the ACKnowledge system based on feedback received from authors and statistics collected during one year of ACKnowledge v1.0 author submissions. We improved the definition of data types in the form and we refined our text mining algorithms to obtain better results.
We are proud to introduce our new ACKnowledge system, which leverages text mining to help authors curate their papers and send data to WormBase! The system replaces the previous form which required authors to submit data manually through free-text forms.