Researchers Beef Up DNA Storage Density By Adding More Letters

ExtremeTech
3 min readMar 8, 2022

by Ryan Whitwam

This site may earn affiliate commissions from the links on this page. Terms of use.

We have become very good at storing data with hard drives closing in on 20 terabytes, but even our best 21st-century engineering can’t come close to the elegance and density of DNA. Most of the cells in your body contain a complete genetic copy of what makes you a human being, and DNA is surprisingly durable compared to chips and spinning platters that will probably end up in a landfill inside of a decade. DNA might even be viable for storing digital data, but we’re not limited by the way human DNA works. Researchers from the University of Illinois Urbana-Champaign have expanded the capabilities of DNA data storage by adding more letters to its alphabet.

The genetic information in your cells relies on four primary base pairs, also known as nucleotides or nucleic acid. There’s adenine, guanine, cytosine, and thymine — the A, G, C, and T you’ve seen when genetic information is written out. The human body also uses another base called uracil in place of thymine when translating genetic information into RNA to make proteins.

Even without any modifications, DNA is a very dense storage medium. The researchers note that the world creates several petabytes of new data every day, and a single gram of DNA could store it all. That’s what you get with the standard four-base system from life on Earth, but there are plenty more nucleotides in chemistry that can link up to form a DNA strand. The team created an encoding scheme relying on 11 different bases, which gives the synthetic DNA much higher data density than a system of just four bases.

So why aren’t we all using DNA hard drives? While DNA can last for thousands of years without irreparable data loss, it’s difficult to encode and decode that data. You need advanced laboratory equipment, and most tools can’t even interpret the 11-base DNA strands created in the new study. The team found that ring-like proteins known as MspA nanopores, which are commonly used in DNA sensing, could correctly read the synthetic…

ExtremeTech

ExtremeTech is the Web’s top destination for news and analysis of emerging science and technology trends, and important software, hardware, and gadgets.