A new, open-source genetic database has been developed by Australian and North American researchers.

The MaveDB database is a repository for data from experiments – called multiplex assays of variant effect (MAVEs) – that systematically measure the impact of thousands of individual sequence variants on a gene’s function.

These experiments can provide valuable information about how proteins produced by that gene function, how variants in that gene may contribute to disease, and how to engineer synthetic versions of naturally occurring proteins that are more effective than the original protein.

MaveDB is the first publicly accessible database for this data. Its development was led by  Dr Alan Rubin from Australia’s Walter and Eliza Hall Institute.

MAVEs have revolutionised the ability of researchers to understand the function of genes and their roles in disease, Dr Rubin says.

“In the past, researchers had to focus on a handful of changes in a gene to understand its function,” he said.

“It was too complex to generate the data from an exhaustive scan of variants of a gene that might be hundreds or thousands of bases in length.

“The development of MAVEs provided a way for researchers to experimentally measure every single genetic change in a gene with its functional consequence. These assays can handle tens of thousands of genetic variants, allowing researchers to home in on the relevant changes and place them in context.”

Until now, MAVE data from experiments has existed in isolation, with data from individual studies uploaded to journal websites when research papers are published, or provided upon request to other researchers.

“This made it hard for researchers to access the data of other groups, or even know that a particular MAVE experiment had been done. So it potentially hindered collaborations and the progress of genomics research,” Dr Rubin said.

“MaveDB makes it easier for scientists to share their datasets in a single location, using a flexible format that is applicable to multiple research fields, and enables other scientists to easily access this data to enhance their research.

“We’ve also ensured MaveDB can ‘talk’ to other databases to add an extra level of collaborative capacity. For the growing field of MAVE research this database is an important step towards open science and reproducibility by ensuring data is made available.”

A paper describing the development is accessible here.