Approximate DNA Storage with High Robustness and Density for Images
Presented by Bingzhe Li, Assistant Professor, University of Texas at Dallas
Virtual Conference, September 28-29, 2021
2023 SNIA. All Rights Reserved.

Big Data Era
Data volume doubles almost every 2 years
44 zettabytes in 2020
175 zettabytes in 2025
Image from: https:/

Why DNA Storage?
Large gap between generated data and installed storage capacity.
1 EB data center (Fort Worth, TX): 25,000 x 8 TB HDDs; 750,000 sq ft; 5-10 years of warranty
DNA: 1 gram DNA [1]; several centuries [2]
Photo: Tara Brown/UW

[1] Allentoft et al. The half-life of DNA in bone: measuring decay kinetics in 158 dated fossils. Proceedings of the Royal Society B: Biological Sciences, 279(1748):4724-4733, 2012.
[2] Grass et al. Robust chemical preservation of digital information on DNA in silica with error-correcting codes. Angewandte Chemie International Edition, 54(8):2552-2555, 2015.
[3] Figure source: IDC

What is DNA Storage?
Nucleotides/Bases: A, T, C, G
Simple encoding (data bits to bases):
  Bit  Base
  00   A
  01   T
  10   G
  11   C
Example: 1001001100110110 encodes to GTACACTG
Strand length: 150-300 bases
Strand figure labels: primer, primer, metadata, payload; sequence fragments TACAGT and GCT
Write path: Encoding -> Assembling -> Synthesis
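The simple bit-to-base mapping on this slide (00 to A, 01 to T, 10 to G, 11 to C) can be illustrated with a minimal Python sketch. The function names `encode_bits` and `decode_bases` are hypothetical, chosen only for this illustration, and the sketch covers the payload mapping alone; primers, metadata, and any error handling used in the presented system are not modeled.

```python
# Minimal sketch of the slide's simple 2-bits-per-nucleotide mapping.
# encode_bits / decode_bases are hypothetical names used for illustration only.

BITS_TO_BASE = {"00": "A", "01": "T", "10": "G", "11": "C"}
BASE_TO_BITS = {base: bits for bits, base in BITS_TO_BASE.items()}


def encode_bits(bits: str) -> str:
    """Map a bit string to bases, two bits per nucleotide."""
    if len(bits) % 2 != 0:
        raise ValueError("bit string length must be even")
    if set(bits) - {"0", "1"}:
        raise ValueError("bit string may contain only '0' and '1'")
    return "".join(BITS_TO_BASE[bits[i:i + 2]] for i in range(0, len(bits), 2))


def decode_bases(bases: str) -> str:
    """Map bases back to bits; raises KeyError on a symbol other than A, T, G, C."""
    return "".join(BASE_TO_BITS[base] for base in bases)


if __name__ == "__main__":
    payload = "1001001100110110"             # bit string shown on the slide
    strand = encode_bits(payload)
    print(strand)                            # GTACACTG
    print(decode_bases(strand) == payload)   # True
```

With an error-free strand the round trip is lossless, which is why this mapping reaches the ideal density of 2 bits per nucleotide.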
Read path: Sequencing -> Disassembling -> Decoding

Issues of DNA Storage
DNA storage is:
  Error-prone
  Expensive (e.g., $1 million/GB)
  Slow (e.g., hours/GB)
  Special preservation
  Low encoding density (ideal one is 2 bits/nt): 00-A, 01-T, 10-C, 11-G.
Errors of DNA storage:
  Original sequence:  GTACA
  Substitution error: GTGCA
  Deletion error:     GTCA
  Insertion error:    GTACAGAGAG
Some patterns may increase error rates:
  Consecutive identical nucleotides (e.g., "AAAA")
  Hairpin structure/secondary structure, etc.

Conclusion: One nucleotide error causes a series of errors in its subseq