I recently found out that the NLM Administrative Supplement for Informationist Services that I am included on received funding! This opportunity is very exciting to me because I will be working on an interesting project with a great group of people.
I will be providing data curation services for an R01 project by Dr. Katerina Kechris that generated a Next Generation Sequencing (NGS) dataset from an inbred mouse panel. The mice are closely related, but have known genetic differences. They also exhibit an array of behavioral traits that relate to alcohol use disorders, such as ethanol sensitivity, tolerance and consumption. The NGS dataset is limited to small RNA molecules known as microRNAs (miRNAs). These molecules typically regulate gene expression rather than being read by ribosomes to make protein, as the central dogma dictates. The goal of this project is to discern whether expression of any of these miRNAs correlates with the alcohol use phenotypes mentioned above. Additionally, these miRNAs are closely related to those in humans, so the results could give clues to the mechanisms of alcohol use disorders in humans.
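To make that goal a little more concrete, here is a minimal sketch of what "correlates with a phenotype" could mean computationally. It is not the project's actual analysis: the miRNA names, the per-strain values, and the `pearson` helper are all made up for illustration, and a real analysis would also need normalization and multiple-testing correction.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation between two equal-length lists of numbers."""
    n = len(xs)
    if n != len(ys) or n < 2:
        raise ValueError("need two lists of equal length with at least 2 values")
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant values")
    return cov / sqrt(var_x * var_y)


# Hypothetical toy data: one value per mouse strain, in the same strain order.
# These numbers are invented and do not come from the real dataset.
expression = {
    "miR-A": [1.0, 2.0, 3.0, 4.0],
    "miR-B": [4.0, 3.0, 2.0, 1.0],
}
phenotype = [2.0, 4.0, 6.0, 8.0]  # e.g. a made-up ethanol consumption score

# Correlate each miRNA's expression with the phenotype across strains.
correlations = {
    name: pearson(values, phenotype) for name, values in expression.items()
}

for name, r in correlations.items():
    print(f"{name}: r = {r:.2f}")
```

In this toy setup, a value of r near 1 or -1 would flag a miRNA whose expression tracks the trait across strains, which is the kind of candidate the project is looking for.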
The mouse panel that the NGS samples came from can be used for much more than this alcohol use disorder study, and Dr. Kechris had already written in her R01 proposal that she wanted to share this resource with the research community in the PhenoGen database. Thus, we proposed the following Aims to increase the usability of this dataset by other research groups:
Aim 1: Make the NGS data, appropriate metadata, and code publicly available.
I will deposit the raw data in the NCBI databases along with appropriate metadata (data that describe the dataset) to give it context and make it reusable. I will also deposit the code that the group used to clean and analyze the data on GitHub, so other people can repeat the analyses. This aim also supports a web programmer who will add functionality to the PhenoGen database to support this new dataset. We are also creating an entry in our institutional repository to link all of this information together and connect it to our campus.
Aim 2: Create tutorials to show other researchers how to use these data.
All the information is on the web, so it should be usable, right? Well, we're going to make these data even easier to use by creating tutorials in a variety of formats: video, text with static images, and Guide on the Side. These resources will also be referenced in the repository entry.
Aim 3: Evaluate the efficacy of Aims 1 and 2.
Finally, we will evaluate whether the first two aims are effective. I will do this by tracking data download and citation statistics, and by building assessments into the tutorials to measure how well they work.
I’m so excited about this project! I can’t wait to get started. Now I just need to figure out how grant funding works here.
Questions and feedback are, of course, welcome.
– Tobin