(19)
(11)EP 3 559 269 B1

(12)EUROPEAN PATENT SPECIFICATION

(45)Mention of the grant of the patent:
16.09.2020 Bulletin 2020/38

(21)Application number: 17826466.9

(22)Date of filing:  15.12.2017
(51)International Patent Classification (IPC): 
C12Q 1/6806(2018.01)
C12Q 1/6855(2018.01)
(86)International application number:
PCT/EP2017/083115
(87)International publication number:
WO 2018/114706 (28.06.2018 Gazette  2018/26)

(54)

SINGLE STRANDED CIRCULAR DNA LIBRARIES FOR CIRCULAR CONSENSUS SEQUENCING

EINSTRANGIGE KREISFÖRMIGE DNA-BIBLIOTHEKEN FÜR KREISFÖRMIGE KONSENSUSSEQUENZIERUNG

BIBLIOTHÈQUES D'ADN CIRCULAIRE SIMPLE BRIN POUR LE SÉQUENÇAGE D'UNE SÉQUENCE CONSENSUS CIRCULAIRE


(84)Designated Contracting States:
AL AT BE BG CH CY CZ DE DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR

(30)Priority: 20.12.2016 US 201662436819 P

(43)Date of publication of application:
30.10.2019 Bulletin 2019/44

(73)Proprietors:
  • F. Hoffmann-La Roche AG
    4070 Basel (CH)
    Designated Contracting States:
    AL AT BE BG CH CY CZ DK EE ES FI FR GB GR HR HU IE IS IT LI LT LU LV MC MK MT NL NO PL PT RO RS SE SI SK SM TR 
  • Roche Diagnostics GmbH
    68305 Mannheim (DE)
    Designated Contracting States:
    DE 

(72)Inventors:
  • GUETTOUCHE, Toumy
    Pleasanton California 94588 (US)
  • CHEN, Rui
    Pleasanton California 94588 (US)
  • RICHARDSON, Aaron
    Pleasanton California 94588 (US)

(74)Representative: Hildebrandt, Martin K. E. et al
Roche Diagnostics GmbH Nonnenwald 2
82377 Penzberg (DE)


(56)References cited:
WO-A1-2014/196863
WO-A2-2015/188192
  
  • MARK T. GREGORY ET AL: "Targeted single molecule mutation detection with massively parallel sequencing", NUCLEIC ACIDS RESEARCH, 17 September 2015 (2015-09-17), page gkv915, XP055241840, ISSN: 0305-1048, DOI: 10.1093/nar/gkv915
  
Note: Within nine months from the publication of the mention of the grant of the European patent, any person may give notice to the European Patent Office of opposition to the European patent granted. Notice of opposition shall be filed in a written reasoned statement. It shall not be deemed to have been filed until the opposition fee has been paid. (Art. 99(1) European Patent Convention).


Description

FIELD OF THE INVENTION



[0001] The invention relates to the field of nucleic acid sequencing. More specifically, the invention relates to the field of creating libraries of circular template DNA for single molecule sequencing.

BACKGROUND OF THE INVENTION



[0002] The current generation of nucleic acid sequencing methods utilizes libraries of target molecules from which each individual molecule is sequenced. Each molecule in the library comprises a target sequence to be analyzed conjugated to artificial sequences ("adaptors") necessary for the chosen sequencing method and sequencing instrument. Single molecule sequencing is often performed on double stranded DNA (dsDNA) molecules that have the same adaptor on both sides. Typically, sequencing these molecules yields data from both sense and anti-sense strand of each molecule in one read. In order to create sequencing libraries from only one strand, circularization of the target molecule with an adaptor or splint can be used. However, existing methods of generating circular single stranded libraries are inefficient and limited by the size of original target molecules. The method described herein is able to efficiently generate libraries of single stranded circular nucleic acid molecules regardless of the original molecule size.

SUMMARY OF THE INVENTION



[0003] In some embodiments, the invention is a method of making a library of circular single stranded target nucleic acid molecules from a sample comprising a plurality of double-stranded target nucleic acid molecules, the method comprising: ligating an adaptor to each end of the double-stranded target molecule, thereby forming an adaptor-ligated double-stranded molecule; denaturing the adaptor-ligated double-stranded molecule, thereby forming two strands of the adaptor-ligated molecule; annealing a capture molecule to each strand of the adaptor-ligated molecule, wherein the capture molecule is a circular single-stranded nucleic acid molecule comprising two sequences complementary to at least a portion of the adaptor, thereby forming a hybrid molecule comprising the capture molecule hybridized to the adaptor sequences at the 5'-end and the 3'-end of the strand of the adaptor-ligated molecule; extending the 3'-end of the strand of adaptor-ligated molecule to reach the 5'-end of the strand of adaptor-ligated molecule; ligating the 5'-end and the 3'-end of the strand of adaptor-ligated molecule, thereby forming a hybrid molecule comprising the capture molecule and a circularized strand of adaptor-ligated molecule; and separating the capture molecule from the circularized strand of adaptor-ligated molecule, thereby forming a library of circular single stranded target nucleic acid molecules.

[0004] In some embodiments, the adaptor comprises at least one double-stranded region and at least one single-stranded region, each comprising two strands. In some embodiments the adaptor comprises at least one barcode and at least one primer binding site. In some embodiments, the capture molecule comprises two sequences complementary to at least a portion of the single-stranded region of the adaptor. In some embodiments, the capture molecule comprises two sequences complementary to the single-stranded region and the double stranded region of the adaptor. In some embodiments the barcode is a multiplex sample identifying barcode (MID) or a unique molecular identifying barcode (UID). In some embodiments the primer is a sequencing primer. In some embodiments the sequences complementary to at least a portion of the adaptor are located diametrically opposite one another in the capture molecule.

[0005] In some embodiments the capture molecule comprises one or more or all of a barcode, a primer binding site and a binding moiety for being captured by a solid support. In some embodiments, the capture molecule is biotinylated. In some embodiments, the capture molecule is immobilized on the solid support such as a streptavidin-coated bead or surface during binding to the target molecule.

[0006] In some embodiments, the invention is a method of sequencing target nucleic acids in a sample comprising a plurality of target molecules, the method comprising: creating a library of circular target nucleic acid molecules from the sample using the method described above, wherein the adaptors further comprise a binding site for a sequencing primer; annealing the sequencing primer to the binding site; and extending the sequencing primer, thereby obtaining the sequence of the target nucleic acid. In some embodiments, the sequencing primer is extended by a DNA polymerase such as Phi 29 polymerase. In some embodiments the sequence is obtained by measuring the incorporation of labeled nucleotides during primer extension. In some embodiments, the sequence is obtained by a nanopore-based method.

[0007] In some embodiments, the invention is an alternative method of making a library of circular single stranded target nucleic acid molecules from a sample comprising a plurality of double-stranded target nucleic acid molecules, the method comprising: ligating an adaptor to each end of the double-stranded target molecule, thereby forming an adaptor-ligated double-stranded molecule; denaturing the adaptor-ligated double-stranded molecule, thereby forming two strands of the adaptor-ligated molecule; annealing a capture molecule to each strand of the adaptor-ligated molecule, wherein the capture molecule is a different circular single-stranded nucleic acid molecule comprising two adjacent sequences complementary to at least a portion of the adaptor, thereby forming a hybrid molecule comprising the capture molecule hybridized to the adaptor sequences at the 5'-end and the 3'-end of the strand of the adaptor-ligated molecule; ligating the 5'-end and the 3'-end of the strand of the adaptor-ligated molecule hybridized to adjacent sequences on the capture molecule, thereby forming a hybrid molecule comprising the capture molecule and a circularized strand of the adaptor-ligated molecule; separating the capture molecule from the circularized strand of the adaptor-ligated molecule, thereby forming a library of circular single stranded target nucleic acid molecules.

[0008] In some embodiments, the invention is an alternative method of making a library of circular single stranded target nucleic acid molecules from a sample comprising a plurality of double-stranded target nucleic acid molecules, the method comprising: denaturing the double-stranded molecule, thereby forming two strands of the target molecule; annealing a capture molecule to each strand of the target molecule, wherein the capture molecule is a circular single-stranded nucleic acid molecule comprising two sequences complementary to at least a portion of the target molecule, thereby forming a hybrid molecule comprising the capture molecule hybridized to the sequences at the 5'-end and the 3'-end of the strand of the target molecule; extending the 3'-end of the strand of the target molecule to reach the 5'-end of the strand of the target molecule; ligating the 5'-end and the 3'-end of the strand of the target molecule, thereby forming a hybrid molecule comprising the capture molecule and a circularized strand of the target molecule; separating the capture molecule from the circularized strand of the target molecule, thereby forming a library of circular single stranded target nucleic acid molecules.

[0009] In some embodiments, the invention is a library of target nucleic acid molecules created using the method described above.

BRIEF DESCRIPTION OF THE DRAWINGS



[0010] Figure 1 is a diagram of the method of generating a library of circular single stranded nucleic acid molecules according to the invention.

DETAILED DESCRIPTION OF THE INVENTION


Definitions



[0011] The following definitions aid in understanding of this disclosure.

[0012] The term "sample" refers to any composition containing or presumed to contain target nucleic acid. This includes a sample of tissue or fluid isolated from an individual for example, skin, plasma, serum, spinal fluid, lymph fluid, synovial fluid, urine, tears, blood cells, organs and tumors, and also to samples of in vitro cultures established from cells taken from an individual patient or from a model organism, including the formalin-fixed paraffin embedded tissues (FFPET) and nucleic acids isolated therefrom. A sample may also include cell-free material, such as cell-free blood fraction that contains cell-free DNA (cfDNA) or circulating tumor DNA (ctDNA).

[0013] The term "nucleic acid" refers to polymers of nucleotides (e.g., ribonucleotides and deoxyribonucleotides, both natural and non-natural) including DNA, RNA, and their subcategories, such as cDNA, mRNA, etc. A nucleic acid may be single-stranded or double-stranded and will generally contain 5'-3' phosphodiester bonds, although in some cases, nucleotide analogs may have other linkages. Nucleic acids may include naturally occurring bases (adenosine, guanosine, cytosine, uracil and thymidine) as well as non-natural bases. Some examples of non-natural bases include those described in, e.g., Seela et al., (1999) Helv. Chim. Acta 82:1640. The non-natural bases may have a particular function, e.g., increasing the stability of the nucleic acid duplex, inhibiting nuclease digestion or blocking primer extension or strand polymerization.

[0014] The terms "polynucleotide" and "oligonucleotide" are used interchangeably. Polynucleotide is a single-stranded or a double-stranded nucleic acid. Oligonucleotide is a term sometimes used to describe a shorter polynucleotide. An oligonucleotide may be comprised of at least 6 nucleotides or about 15-30 nucleotides. Oligonucleotides are prepared by any suitable method known in the art, for example, by a method involving direct chemical synthesis as described in Narang et al. (1979) Meth. Enzymol. 68:90-99; Brown et al. (1979) Meth. Enzymol. 68:109-151; Beaucage et al. (1981) Tetrahedron Lett. 22:1859-1862; Matteucci et al. (1981) J. Am. Chem. Soc. 103:3185-3191.

[0015] The term "primer" refers to a single-stranded oligonucleotide which hybridizes with a sequence in a target nucleic acid ("primer binding site") and is capable of acting as a point of initiation of synthesis along a complementary strand of nucleic acid under conditions suitable for such synthesis. The primer binding site can be unique to each target or can be added to all targets ("universal priming site" or "universal primer binding site").

[0016] The term "adaptor" means a nucleotide sequence that may be added to another sequence so as to impart additional properties to that sequence. An adaptor is typically an oligonucleotide that can be single- or double-stranded, or may have both a single-stranded portion and a double-stranded portion. An adaptor may contain sequences such as barcodes and universal primer or probe sites.

[0017] The term "ligation" refers to a condensation reaction joining two nucleic acid strands wherein a 5'-phosphate group of one molecule reacts with the 3'-hydroxyl group of another molecule. Ligation is typically an enzymatic reaction catalyzed by a ligase or a topoisomerase. Ligation may join two single strands to create one single-stranded molecule. Ligation may also join two strands each belonging to a double-stranded molecule thus joining two double-stranded molecules. Ligation may also join both strands of a double-stranded molecule to both strands of another double-stranded molecule thus joining two double-stranded molecules. Ligation may also join two ends of a strand within a double-stranded molecule thus repairing a nick in the double-stranded molecule.

[0018] The term "barcode" refers to a nucleic acid sequence that can be detected and identified. Barcodes can be incorporated into various nucleic acids. Barcodes are sufficiently long, e.g., 2, 5, 10 nucleotides, so that in a sample, the nucleic acids incorporating the barcodes can be distinguished or grouped according to the barcodes.

[0019] The terms "multiplex identifier" and "MID" refer to a barcode that identifies a source of target nucleic acids (e.g., a sample from which the nucleic acid is derived, which is needed when nucleic acids from multiple samples are combined). All or substantially all the target nucleic acids from the same sample will share the same MID. Target nucleic acids from different sources or samples can be mixed and sequenced simultaneously. Using the MIDs, the sequence reads can be assigned to the individual samples from which the target nucleic acids originated.
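The read assignment described in this paragraph can be illustrated with a minimal Python sketch. It is an illustration only, not part of the claimed method: the MID sequences, the sample names, the position of the MID at the start of each read, and the exact-match rule are all assumptions made for the example.

```python
from collections import defaultdict

# Hypothetical MID-to-sample table; the sequences are illustrative only.
MID_TO_SAMPLE = {"ACGT": "sample_1", "TGCA": "sample_2"}


def demultiplex(reads, mid_to_sample, mid_length):
    """Group reads by the sample whose MID matches the first bases of the read.

    Assumes (for illustration) that the MID occupies the first `mid_length`
    bases of every read and that only exact matches are accepted.
    """
    by_sample = defaultdict(list)
    for read in reads:
        mid, insert = read[:mid_length], read[mid_length:]
        sample = mid_to_sample.get(mid, "unassigned")
        by_sample[sample].append(insert)
    return dict(by_sample)


reads = ["ACGTGGGTTT", "TGCACCCAAA", "ACGTGGGTTA", "NNNNGGGGGG"]
assignments = demultiplex(reads, MID_TO_SAMPLE, mid_length=4)
# assignments["sample_1"] -> ["GGGTTT", "GGGTTA"]
```

In practice a tolerance for sequencing errors in the MID would likely be needed; exact matching is used here only to keep the sketch short.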

[0020] The terms "unique molecular identifier" and "UID" refer to a barcode that identifies a nucleic acid to which it is attached. All or substantially all the target nucleic acids from the same sample will have different UIDs. All or substantially all of the progeny (e.g., amplicons) derived from the same original target nucleic acid will share the same UID.
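As a hedged illustration of how a UID groups the progeny of one original molecule, the following Python sketch collects reads into families by UID and takes a per-position majority vote within each family. The UID position (the first bases of the read), the equal read lengths, and the majority-vote rule are assumptions of the example and are not prescribed by this disclosure.

```python
from collections import Counter, defaultdict


def group_by_uid(reads, uid_length):
    """Collect the non-UID part of each read under its UID.

    Assumes (for illustration) that the UID is the first `uid_length` bases.
    """
    families = defaultdict(list)
    for read in reads:
        families[read[:uid_length]].append(read[uid_length:])
    return families


def consensus(sequences):
    """Per-position majority vote over sequences assumed to be equal in length."""
    return "".join(
        Counter(column).most_common(1)[0][0] for column in zip(*sequences)
    )


reads = ["AAAGATTACA", "AAAGATTACA", "AAAGATTGCA", "CCCTTTTGGG"]
families = group_by_uid(reads, uid_length=3)
consensus_by_uid = {uid: consensus(seqs) for uid, seqs in families.items()}
# consensus_by_uid -> {"AAA": "GATTACA", "CCC": "TTTTGGG"}
```

The single G in the third read of the "AAA" family is outvoted by the two reads carrying A at that position, which is the sense in which a UID family supports error correction.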

[0021] The terms "universal primer" and "universal primer binding site" or "universal priming site" refer to a primer and a primer binding site present in (typically, in vitro added to) different target nucleic acids. For example, the universal priming site may be included in an adaptor ligated to the plurality of target nucleic acids. The universal priming site may also be a part of target-specific (non-universal) primers, for example by being added to the 5'-end of a target-specific primer. The universal primer can bind to and direct primer extension from the universal priming site.

[0022] As used herein, the terms "target sequence", "target nucleic acid" or "target" refer to a portion of the nucleic acid sequence in the sample which is to be detected or analyzed. The term target includes all variants of the target sequence, e.g., one or more mutant variants and the wild type variant.

[0023] The term "sequencing" refers to any method of determining the sequence of nucleotides in the target nucleic acid.

[0024] Single molecule sequencing is often performed on double stranded DNA (dsDNA) molecules that have the same adaptor on both sides, here called symmetrically adapted sequencing template. Typically, sequencing these molecules yields data from at least a part of the sense and anti-sense strands in one sequencing read. (See U.S. Patent No. 8,822,150). In other technologies, the template is a topologically circular single stranded molecule containing two complementary strands linked together (See U.S. Patent No. 9,404,146). In order to create sequencing libraries from only one strand, circularization of the target molecule using an adaptor (See U.S. Provisional Application "Barcoded circular library construction for identification of chimeric products" Ser. No. 62/415,245 filed on October 31, 2016) or splint (See U.S. Application Pub. No. 20120003657) can be used. However, this procedure is size limited due to CIRCLIGASE™ restrictions (up to 500bp) in the former case or inefficient in the latter case. The method described herein allows the separation of the sense and anti-sense strands in two sequencing template molecules and is not limited by the size of the original double-stranded target molecule. WO2015188192 discloses circular sequencing templates with unique molecular identifiers and patient specific barcodes.

[0025] In one embodiment, the invention is a method of generating a library of single-stranded circular nucleic acids for sequencing. Figure 1 depicts an example of the method according to the invention.

[0026] In the first step, a plurality of double stranded DNA molecules is provided. The double stranded DNA molecules may be isolated genomic DNA or genomic DNA of reduced complexity (e.g., amplified selected regions of the genome or captured selected regions of the genome such as exome).

[0027] In the next step, the double stranded DNA molecules are ligated to adaptors on each end. The adaptor may comprise at least one ligatable double-stranded portion and at least one single stranded portion. In the example in Figure 1, it is a Y-shaped adaptor. The non-complementary region may assume any configuration, e.g., a fork structure (Y-adaptors) or a stem-loop structure. The non-complementary region may contain one or two strands. The two strands may be of the same or different lengths. The non-complementary regions do not form stable hybrids at the reaction conditions and remain single stranded during the steps of the method of the invention. The adaptor may contain more than one double stranded region and more than one single stranded region. For example, a single-stranded region may be flanked by two double-stranded regions.

[0028] The double stranded target nucleic acid must comprise ends suitable for ligation of a double stranded adaptor. In some embodiments, the ends of the target nucleic acids are "polished," i.e., extended with a nucleic acid polymerase to ensure double-stranded ends. In some embodiments, the 5'-ends of the target nucleic acids are phosphorylated. In some embodiments, the ligation is a blunt-end ligation. In some embodiments, the ligation is a cohesive end ligation. In a cohesive end ligation, for example, the 3'-ends of the target nucleic acid are extended with a single nucleotide (e.g., A) and the adaptor is engineered to contain a complementary overhang (e.g., T) at the 3'-ends.

[0029] In the next step, the adaptor-ligated target DNA molecules are denatured and contacted with single stranded capture DNA circles (sscDNA molecules). Creation of small single-stranded DNA circles containing desired sequences is routine in the art and such circles are commercially available (Bio-Synthesis, Inc., Lewisville, Tex). In the present invention, the circles have regions of complementarity to each of the two non-complementary sequences in the adaptors (Figure 1). In some embodiments, the regions of complementarity can be separated by a desired distance. As will be seen from the following steps of the method, the sequence between the regions of complementarity is to be copied into the library molecules and thus may be used to incorporate additional sequences into the library molecules. In some embodiments, the additional sequences are selected from primer binding sites, restriction enzyme sites, barcodes, etc.

[0030] In some embodiments, the sscDNA molecules can be attached to a solid support. In some embodiments, the attachment to the solid support is via a biotin-streptavidin linkage effected by a biotin-labeled sscDNA molecule. In some embodiments, the solid support is a bead present in solution. In some embodiments, the bead is a polystyrene bead, a paramagnetic bead, an adsorbing bead, or a charged bead. In other embodiments, the solid support is a surface, e.g., a slide or an array. In the example in Figure 1, the circular single-stranded nucleic acids molecule comprises a capturable moiety, e.g., is conjugated to biotin. The hybridization complex between the circular single-stranded nucleic acids molecule and the single stranded molecule with an adaptor sequence at each end can be captured, e.g., using streptavidin conjugated to a solid support such as a polymer bead.

[0031] The ratio of sscDNAs and denatured target molecules can be optimized for annealing of a single sscDNA to each strand of the adaptor-ligated target DNA molecule. Because the sscDNA molecule has two complementary sequences for each single strand of the target molecule, the spatial proximity will facilitate the binding of the second end of the target molecule to form the structure shown in Figure 1. Binding of the sscDNA molecule and a strand of the target DNA molecule creates a structure with a free extendable 3'-end.

[0032] In the next step, the extendable 3'-end of the target DNA strand annealed to the sscDNA is extended with a DNA polymerase going around the sscDNA molecule to reach the 5'-end of the target DNA strand. In some embodiments, the DNA polymerase is a non-strand displacing polymerase. In some embodiments, the polymerase may be selected from Taq, Klenow, Bst, Pfu, T4, T7, E. coli pol I, and Sulfolobus sp. pol IV DNA polymerases.

[0033] In some embodiments, polymerase extension is not necessary. For example, the regions of the capture molecule complementary to the adaptor are adjacent to each other on the capture molecule. After annealing to the capture molecule, the ends of the adaptor can be directly ligated. In some embodiments, an asymmetric adaptor is used wherein the single-stranded regions of the adaptor are of unequal length. The regions of the capture molecule complementary to the asymmetric adaptor are adjacent to each other on the capture molecule. After annealing to the capture molecule, the longer and the shorter ends of the adaptor can be directly ligated.

[0034] In the next step, the extended 3'-end of the target nucleic acid strand is ligated to the 5'-end of the target nucleic acid strand, creating a hybrid molecule containing the adaptor-ligated target nucleic acid strand, a portion of which is annealed to a part of the sscDNA molecule (Figure 1). This hybrid molecule, which consists of two partially complementary single stranded circular molecules, can be melted to separate the sscDNA molecule. In some embodiments, the melting is by heating. In other embodiments, the melting is chemical, e.g., by exposure to alkali or a similar nucleic acid duplex denaturing agent.

[0035] In some embodiments, the separated sscDNA molecule is removed by size separation or chromatography (beads, columns or gel electrophoresis). In embodiments where the sscDNA is biotinylated, it can be captured and removed by forming a biotin-streptavidin complex, e.g., with streptavidin-conjugated polymer coated magnetic or paramagnetic bead. In other embodiments, the sscDNA may be engineered to contain a nuclease digestion site. In some embodiments, the sscDNA is engineered to contain deoxyuracils. Such DNA can be removed by treatment with Uracil DNA N-glycosylase (UNG) and heating to convert the circular DNA into a linear form that can be digested with an exonuclease. In yet other embodiments, the sscDNA may be engineered to contain a photocleavable linker.

[0036] In some embodiments, the invention is a library of single-stranded circular molecules for nucleic acid sequencing wherein each circle comprises only one strand of the original target nucleic acid produced using the method of the invention. Each target nucleic acid in the library will contain the sequences of two adaptors and a portion of the sscDNA sequence.
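The composition of a library molecule stated above (one target strand, two adaptor sequences, and a copy of a portion of the sscDNA sequence) can be made concrete with a small sequence-level sketch in Python. It is a simplified model under stated assumptions, not a description of the actual reaction: all sequences are invented, hybridization is treated as exact reverse-complement matching, and the function name `fill_in_circularize` is hypothetical.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return seq.translate(COMPLEMENT)[::-1]


def fill_in_circularize(strand, capture_circle, arm5, arm3):
    """Model extension and ligation of one adaptor-ligated strand on a capture circle.

    `strand` is written 5'->3' and is assumed to start with `arm5` and end with
    `arm3` (the adaptor arms). `capture_circle` is a circular sequence written
    5'->3' from an arbitrary start. The returned string is the circularized
    strand, linearized at the original 5'-end.
    """
    if not (strand.startswith(arm5) and strand.endswith(arm3)):
        raise ValueError("strand must carry the adaptor arms at its ends")
    site5, site3 = reverse_complement(arm5), reverse_complement(arm3)
    doubled = capture_circle * 2  # lets a site span the arbitrary start point
    start = doubled.find(site5)
    if start < 0:
        raise ValueError("capture circle lacks the site for the 5'-arm")
    rotated = doubled[start:start + len(capture_circle)]
    site3_pos = rotated.find(site3, len(site5))
    if site3_pos < 0:
        raise ValueError("capture circle lacks the site for the 3'-arm")
    # The 3'-end is extended across the circle sequence lying between the two
    # sites; the new DNA is the reverse complement of that template stretch.
    gap = rotated[len(site5):site3_pos]
    return strand + reverse_complement(gap)


strand = "ACCA" + "TTTTT" + "GGAC"   # arm5 + target + arm3 (invented)
circle = "AATGGTCAGTGTCCAA"          # contains TGGT, CAGT, GTCC in that order
product = fill_in_circularize(strand, circle, arm5="ACCA", arm3="GGAC")
# product -> "ACCATTTTTGGACACTG"
```

When the two binding sites are adjacent on the capture circle, the copied stretch is empty and the function returns the strand unchanged, which corresponds to the embodiments in which the ends are ligated directly without polymerase extension.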

[0037] In some embodiments, the invention is a method of sequencing nucleic acids via creation of a library of single-stranded circular nucleic acid molecules as described herein.

[0038] The present invention comprises generating a library of target nucleic acids from a sample for nucleic acid sequencing. Multiple nucleic acids, including all the nucleic acids in a sample, may be converted into library molecules using the method and compositions described herein. In some embodiments, the sample is derived from a subject or a patient. In some embodiments the sample may comprise a fragment of a solid tissue or a solid tumor derived from the subject or the patient, e.g., by biopsy. The sample may also comprise body fluids (e.g., urine, sputum, serum, plasma or lymph, saliva, sweat, tear, cerebrospinal fluid, amniotic fluid, synovial fluid, pericardial fluid, peritoneal fluid, pleural fluid, cystic fluid, bile, gastric fluid, intestinal fluid, or fecal samples). The sample may comprise whole blood or blood fractions where normal or tumor cells may be present. In some embodiments, the sample, especially a liquid sample, may comprise cell-free material such as cell-free DNA or RNA including cell-free tumor DNA or tumor RNA or cell-free fetal DNA. In some embodiments, the sample is a cell-free sample, e.g., cell-free blood-derived sample where cell-free tumor DNA or tumor RNA are present. In other embodiments, the sample is a cultured sample, e.g., a culture or culture supernatant containing or suspected to contain nucleic acids derived from the cells in the culture or from an infectious agent present in the culture. In some embodiments, the infectious agent is a bacterium, a protozoan, a virus or a mycoplasma. The sample may also be an environmental sample containing or suspected to contain nucleic acids from organisms.

[0039] A target nucleic acid is the nucleic acid of interest that may be present in the sample. In some embodiments, the target nucleic acid is a gene or a gene fragment. In some embodiments, all the genes, gene fragments and intergenic regions (entire genome) constitute target nucleic acids. In some embodiments, only a portion of the genome, e.g., only coding regions of the genome (exome) constitute target nucleic acids. In some embodiments, the target nucleic acid contains a locus of a genetic variant, e.g., a polymorphism, including a single nucleotide polymorphism or variant (SNP or SNV), or a genetic rearrangement resulting e.g., in a gene fusion. In some embodiments, the target nucleic acid comprises a biomarker, i.e., a gene whose variants are associated with a disease or condition. In other embodiments, the target nucleic acid is characteristic of a particular organism and aids in identification of the organism or a characteristic of the pathogenic organism such as drug sensitivity or drug resistance. In yet other embodiments, the target nucleic acid is characteristic of a human subject, e.g., the HLA or KIR sequence defining the subject's unique HLA or KIR genotype.

[0040] In an embodiment of the invention, one or a plurality of target nucleic acids is converted into the template configuration of the invention. In some embodiments, the target nucleic acid occurs in nature in a single-stranded form (e.g., RNA, including mRNA, microRNA, viral RNA; or single-stranded viral DNA). In other embodiments, the target nucleic acid occurs in nature in a double-stranded form. One of skill in the art would recognize that the method of the invention has multiple embodiments. A single stranded target nucleic acid can be converted into double-stranded form and then subjected to the steps shown in Figure 1. Longer target nucleic acids may be fragmented, although in some applications longer target nucleic acids may be desired to achieve a longer read. In some embodiments, the target nucleic acid is naturally fragmented, e.g., circulating cell-free DNA (cfDNA) or chemically degraded DNA such as that found in chemically preserved or archived samples.

[0041] One of the advantages of the present invention is the ability to create single-stranded circular nucleic acids of unlimited length. The method of the invention does not have the low size limitations inherent in the single-stranded circle ligation (e.g., using CIRCLIGASE™, WO2010094040). The method also avoids the kinetic inefficiency of a splint ligation (See U.S. Application Pub. No. 20120003657).

[0042] The present invention utilizes adaptor molecules. In some embodiments, the adaptor is a double-stranded nucleic acid that at one end is capable of ligating to either end of the target nucleic acid. In some embodiments, the adaptor is phosphorylated at at least one 5'-end. In some embodiments, the adaptor contains an overhang of one or more nucleotides to match the corresponding overhang created on the target nucleic acid.

[0043] In some embodiments, the adaptor comprises a double stranded region at one end and a single-stranded region at the other end. The double stranded region contains hybridized strands of nucleic acid while the single stranded region contains one strand or two strands not hybridized with each other. The end comprising the single stranded region is not capable of ligation to the target nucleic acid. In some embodiments, the adaptor is a Y-shaped adaptor (See Prashar and Weissman, (1996) Proc. Natl. Acad. Sci. USA 93:659). In some embodiments, the Y-adaptor is a symmetric Y-adaptor having single stranded regions that are the same or approximately the same length. In other embodiments, the adaptor is an asymmetric Y-adaptor having one single stranded region that is substantially longer than the other region.

[0044] In other embodiments, the adaptor has a stem-loop structure where the single stranded region is a linker connecting two strands of the double stranded region.

[0045] As described in further detail below, the double stranded end of the adaptor is ligated to each end of a double stranded target nucleic acid molecule. Ligation of double stranded nucleic acid molecules is well known in the art (See Green M., and Sambrook, J., Molecular Cloning, 2012 CSHL Press), and improvements on the general method are described herein. In some embodiments, the adaptor molecules are in vitro synthesized artificial sequences. In other embodiments, the adaptor molecules are in vitro synthesized naturally-occurring sequences. In yet other embodiments, the adaptor molecules are isolated naturally occurring molecules or isolated non naturally-occurring molecules.

[0046] In some embodiments, the adaptor comprises one or more barcodes. A barcode can be a multiplex sample ID (MID) used to identify the source of the sample where samples are mixed (multiplexed). The barcode may also serve as a unique molecular ID (UID) used to identify each original molecule and its progeny. The barcode may also be a combination of a UID and an MID. In some embodiments, a single barcode is used as both UID and MID.

[0047] In some embodiments, each barcode comprises a predefined sequence. In other embodiments, the barcode comprises a random sequence. Barcodes can be 1-20 nucleotides long.
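For orientation, the number of distinct barcodes available at a given length follows from simple counting: with four bases per position, a barcode of length n admits 4^n sequences. The short Python sketch below evaluates this for a few lengths in the stated 1-20 nucleotide range; the function name is hypothetical and the restriction to the four standard bases is an assumption.

```python
def barcode_space(length):
    """Number of distinct sequences of the given length over A, C, G and T."""
    if length < 1:
        raise ValueError("length must be at least 1")
    return 4 ** length


sizes = {n: barcode_space(n) for n in (1, 5, 10, 20)}
# sizes -> {1: 4, 5: 1024, 10: 1048576, 20: 1099511627776}
```

These figures are upper bounds; any minimum edit distance imposed between barcodes in a pool reduces the usable number.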

[0048] In some embodiments, the unique barcode (UID) is present in the double stranded portion of the adaptor. In these embodiments, each strand has a copy of the barcode (or the barcode complement) allowing for consensus sequencing and error correction as further described below and in U.S. App. Pub No. 20150044687.

[0049] In embodiments of the present invention, each target molecule is ligated to two adaptors. In some embodiments, each molecule has two unique barcodes (UID). In some embodiments, each molecule also carries the same multiplex sample ID (MID) barcode to identify the sample from which the target nucleic acid was derived.

[0050] In some embodiments, the invention comprises a pool of adaptors for creating a library of single stranded circular barcoded molecules. Each adaptor within the pool has a unique barcode that is separated from every other barcode in the pool by an edit distance of at least 1 or at least 3. One of skill in the art would be able to determine what edit distance is optimal for a particular experiment based on typical error rates of a sequencing technology. Generally, a greater edit distance means that fewer barcodes can be used in one pool. However, if the sequencing technology or a manufacturing process has a high error rate, a greater edit distance will be required. For example, the oligonucleotide manufacturing process used to make adaptors may have a high error rate. Similarly, a nucleic acid polymerase used in DNA amplification or primer extension in the sequencing-by-synthesis workflow can have a high error rate. These error rates would require increasing the edit distance among the barcodes in adaptors of the pool. Conversely, improving the accuracy of each of the methods mentioned above will allow decreasing the edit distance among the barcodes in adaptors of the pool.
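The trade-off between edit distance and pool size can be illustrated with a minimal sketch. It assumes that "edit distance" means Levenshtein distance (substitutions, insertions and deletions) and that the pool is chosen by a simple greedy pass over all candidate sequences; neither assumption is stated in the specification, and the function names `edit_distance` and `select_barcode_pool` are hypothetical.

```python
from itertools import product


def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance between two barcodes (assumed metric)."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def select_barcode_pool(length: int, min_distance: int) -> list[str]:
    """Greedily keep every candidate at least min_distance from all kept barcodes.

    Illustrative only: a greedy pass does not guarantee the largest possible pool.
    """
    pool: list[str] = []
    for candidate in ("".join(p) for p in product("ACGT", repeat=length)):
        if all(edit_distance(candidate, kept) >= min_distance for kept in pool):
            pool.append(candidate)
    return pool
```

With 3-nucleotide barcodes, a minimum distance of 1 keeps all 64 candidates, whereas a minimum distance of 3 leaves only a handful, which is the sense in which a greater edit distance means fewer usable barcodes per pool.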

[0051] In some embodiments, the invention comprises an article of manufacture represented by a single vial containing the entire pool of adaptors. Alternatively, an article of manufacture can comprise a kit where one or more adaptors of the pool are present in separate vials.

[0052] In some embodiments, the adaptor further comprises a primer binding site for at least one universal primer. A primer binding site is a sequence complementary to the primer, to which the primer can bind and facilitate strand elongation.

[0053] In some embodiments, the adaptor has more than one, e.g., two, primer binding sites. In some embodiments, one primer is used for amplification, e.g., by PCR (including asymmetric PCR), linear amplification or rolling circle replication (RCA).

[0054] In some embodiments, the invention includes a step of preparing the target DNA for ligation of adaptors. In some embodiments, this step includes "polishing", e.g., converting molecules with strand overhangs into fully double stranded form by extending recessed 3'-ends with a DNA polymerase or digesting protruding 3'-ends with a 3'-5' exonuclease such as Mung bean exonuclease.

[0055] In some embodiments, the double stranded ligation is a blunt-end ligation. In other embodiments, the double stranded ligation is a T-A ligation or other overhang ligation. In some embodiments, the method includes a step of adding a strand overhang to the target nucleic acid matching (i.e., complementary to) the overhang on the adaptor. In some embodiments, the overhang can be an added A nucleotide at one or both ends of the target nucleic acid while the adaptor is designed to contain a T nucleotide at the end to be ligated. The single nucleotide can be artificially synthesized during the in vitro synthesis of the adaptor molecule. The single nucleotide can also be enzymatically added, e.g., by Taq polymerase or terminal transferase, to one or both ends of the target nucleic acid.

[0056] The invention utilizes a single stranded circular capture DNA molecule (sscDNA). In some embodiments, the circular molecule is between 30 and 500 bases long. The molecule preferably consists of an artificial sequence or a modified naturally occurring sequence designed (or modified) to avoid self-complementarity within the circle and to ensure the single stranded conformation under the reaction conditions described herein.
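As a rough illustration of screening a candidate capture sequence for self-complementarity, the sketch below reports whether any k-mer of a circular sequence also occurs as its reverse complement elsewhere in the circle. The k-mer criterion, the default `k = 6` and the function names are assumptions made for illustration; the specification does not prescribe a screening method, and a real design would more likely rely on thermodynamic folding predictions.

```python
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case DNA sequence."""
    return seq.translate(_COMPLEMENT)[::-1]


def has_self_complementarity(circle: str, k: int = 6) -> bool:
    """Crude screen: True if any k-mer and its reverse complement both occur.

    The sequence is treated as circular, so k-mers spanning the junction
    between the last and first base are included. Palindromic k-mers
    (equal to their own reverse complement) are also flagged.
    """
    if k > len(circle):
        raise ValueError("k must not exceed the length of the circle")
    extended = circle + circle[: k - 1]  # wrap around the junction
    kmers = {extended[i : i + k] for i in range(len(circle))}
    return any(reverse_complement(kmer) in kmers for kmer in kmers)
```

A sequence that passes this check is not guaranteed to remain single stranded under reaction conditions; the check only removes the most obvious intramolecular pairing candidates.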

[0057] The sscDNA molecule comprises at least two regions of complementarity with the adaptor sequences. The two regions of complementarity are positioned within the sscDNA molecule to ensure an energetically favorable topology of the hybrid molecule formed by the adaptor-ligated target DNA strand and the sscDNA molecule. In some embodiments, the two regions of complementarity with the adaptor sequences are spaced 1, 2, 5, 10 or more bases apart. In some embodiments, two regions of complementarity with the adaptor sequences are placed at a maximum distance from each other (diametrically opposite) in the circle.

[0058] In some embodiments, the sscDNA molecule contains additional artificial sequences not present in the adaptors. The sscDNA molecule may contain one or more primer binding sites, one or more barcodes, one or more restriction enzyme sites or any other sequences that need to be incorporated into the target DNA molecule.

[0059] In some embodiments, an adaptor is not used. Instead, the capture molecule comprises target-specific regions to which a native target nucleic acid (not having exogenous sequences) can hybridize. In some embodiments, a limited library of target nucleic acids or a single species of target nucleic acid (e.g., the sequence of a pathogen, such as a viral pathogen e.g., HIV, or a bacterial pathogen, or a group of pathogens, e.g., Streptococcus sp.) can be detected. A limited library of capture molecules having a limited number of target-specific regions or a single species of capture molecules having two target-specific regions can be used.

[0060] In some embodiments, the capture molecules can be used to detect gene fusions. In such embodiments, the capture molecule has two target-specific regions, each capable of hybridizing to one of the fusion partners.

[0061] In some embodiments, the invention utilizes enzymes. The enzymes may include a DNA polymerase (including sequencing polymerase), a DNA ligase and a terminal transferase.

[0062] In some embodiments, the DNA polymerase is a non-strand displacing polymerase. In some embodiments, the polymerase may be selected from Taq, Klenow, Bst, Pfu, T4, T7, E. coli pol I and Sulfolobus sp. pol IV DNA polymerases.

[0063] In some embodiments, the invention also utilizes a DNA ligase. In some embodiments, T4 DNA ligase or E. coli DNA ligase is used.

[0064] In some embodiments, the invention also utilizes a template-independent DNA polymerase, e.g., a terminal transferase or a DNA polymerase with the activity of adding one or more nucleotides in a template-independent manner. In some embodiments, the invention uses a mammalian terminal transferase or Taq polymerase.

[0065] The library of single-stranded circular barcoded molecules generated from the sample can be subjected to nucleic acid sequencing. The template libraries created by the method of the present invention are especially advantageous in sequencing technologies adapted for sequencing circular templates of unlimited length or repeatedly reading a circular molecule, e.g., via rolling circle replication. Examples of such technologies include the Pacific Biosciences platform utilizing the SMRT® technology (Pacific Biosciences, Menlo Park, Cal.) or a platform utilizing nanopore technology such as those manufactured by Oxford Nanopore Technologies (Oxford, UK) or Roche Genia (Santa Clara, Cal.) and any other presently existing or future single-molecule sequencing technology that is suitable for sequencing circular templates of unlimited length or for repeatedly reading circular molecules. The sequencing step may utilize platform-specific sequencing primers. Binding sites for these primers may be introduced in adaptors used in the present invention. In some embodiments, binding sites for sequencing primers are introduced in the copied portion of the sscDNA. During the strand extension step connecting the 3'-end and the 5'-end of the target DNA molecule, these primer binding sites will become incorporated into the target DNA molecules.

[0066] In some embodiments, the sequencing step involves sequence analysis. Sequence analysis may comprise secondary analysis, e.g., analysis performed on the sequence assembled by the instrument after it converts the collected signals into base calls (primary analysis). In some embodiments, the analysis includes a step of sequence aligning. In some embodiments, aligning is used to determine a consensus sequence from a plurality of sequences, e.g., a plurality having the same barcodes (UID). Such a plurality of sequences with the same UID may be a product of amplification of the target nucleic acid molecule or of repeated reads of the circular nucleic acid molecules during sequencing, e.g., via rolling circle replication by a DNA polymerase or reading by the sequencing polymerase. In some embodiments, the barcodes (UIDs) are used to establish consensus sequences from the two strands of the target nucleic acid molecules. Although these strands become segregated into two separate single-stranded circular molecules, the two original strands carry the same UID from the adaptors (Figure 1).

[0067] In other embodiments, generation of consensus sequences using barcodes (UIDs) comprises a step of eliminating artifacts, i.e., variations existing in some but not all sequences having an identical barcode (UID). Such artifacts can be eliminated from the consensus sequence because they likely result from amplification errors or sequencing errors.
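The artifact elimination described above can be sketched as a per-position majority vote over reads sharing a UID. This is a simplified illustration, not the specification's algorithm: it assumes the reads have already been aligned and trimmed to equal length, and the function names `consensus` and `consensus_by_uid` are hypothetical.

```python
from collections import Counter, defaultdict


def consensus(reads: list[str]) -> str:
    """Per-position majority vote over pre-aligned, equal-length reads.

    On a tie, the base seen first at that position wins (Counter ordering).
    """
    if not reads:
        raise ValueError("at least one read is required")
    if len({len(r) for r in reads}) != 1:
        raise ValueError("reads must be aligned to the same length")
    return "".join(Counter(column).most_common(1)[0][0] for column in zip(*reads))


def consensus_by_uid(tagged_reads: list[tuple[str, str]]) -> dict[str, str]:
    """Group (uid, sequence) pairs by UID and build one consensus per UID."""
    groups: dict[str, list[str]] = defaultdict(list)
    for uid, seq in tagged_reads:
        groups[uid].append(seq)
    return {uid: consensus(seqs) for uid, seqs in groups.items()}
```

A variation present in only a minority of reads with a given UID is outvoted at its position and therefore does not appear in the consensus for that UID.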

[0068] In some embodiments, the copy number of each sequence in the sample can be quantified by quantifying relative numbers of sequences with each barcode (UID) in the sample. Each UID represents a single molecule in the original sample and counting different UIDs associated with each sequence variant can determine the fraction of each sequence in the original sample. A person skilled in the art will be able to determine the number of sequence reads necessary to determine a consensus sequence. In some embodiments, the relevant number is reads per UID ("sequence depth") necessary for an accurate quantitative result. In some embodiments, the desired depth is 5-50 reads per UID.
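The UID-based quantification can be sketched as follows: each UID is counted once, regardless of how many reads carry it, and UIDs below a chosen read depth are ignored. The default threshold of 5 reads per UID is taken from the lower end of the range given above; the data layout and the function name `variant_fractions` are assumptions made for illustration.

```python
from collections import Counter


def variant_fractions(
    uid_consensus: dict[str, str],
    reads_per_uid: dict[str, int],
    min_depth: int = 5,
) -> dict[str, float]:
    """Fraction of original molecules carrying each sequence variant.

    uid_consensus maps each UID to its consensus sequence; reads_per_uid maps
    each UID to the number of reads supporting it. UIDs with fewer than
    min_depth reads are excluded before counting.
    """
    kept = [
        seq
        for uid, seq in uid_consensus.items()
        if reads_per_uid.get(uid, 0) >= min_depth
    ]
    counts = Counter(kept)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {seq: n / total for seq, n in counts.items()}
```

Because the count is per UID rather than per read, a molecule that was amplified or read many times contributes no more to the estimated fraction than one that was read just often enough to pass the depth threshold.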

EXAMPLES


Example 1 (prophetic) Creating a library of single-stranded barcoded circular molecules



[0069] In this example, DNA is isolated from a patient's sample. In some instances, RNA is isolated from the sample and reverse-transcribed into cDNA that is treated in subsequent steps the same way as DNA isolated directly from the sample.

[0070] The DNA is end-repaired and A-tailed with T4 DNA polymerase. The addition of the A-tail allows for a subsequent efficient adaptor ligation, avoiding complications from blunt ligation. Next, a Y-shaped adaptor is ligated to both ends of the DNA using a T4 DNA ligase. The adaptor is pretreated with terminal transferase to add a T at each 3'-end. The adaptor is also pretreated with T4 polynucleotide kinase to add a phosphate group to each 5'-end. The Y-shaped adaptor comprises a double stranded region that takes part in the ligation. The Y-shaped adaptor also comprises a single stranded region composed of two single strands that are not complementary and remain unhybridized. Following the ligation, the adaptor-ligated target molecules are heat-denatured and stored on ice.

[0071] A single stranded circular capture DNA molecule 30-500 bases in length is added to the sample. The capture molecules contain at least one biotinylated nucleotide. The capture molecules are attached to a surface of a magnetic bead decorated with streptavidin. The capture molecule contains two sequences, each complementary to each of the non-complementary strands in the adaptor. In the region between the two adaptor-complementary sequences the capture molecule contains a sample barcode and a primer binding site.

[0072] The capture molecule is added to the sample containing the denatured adaptor-ligated target molecules under conditions favoring specific DNA hybridization. The capture molecules are added in an optimal ratio so that each adaptor-ligated target molecule attaches to a capture molecule by at least one of its Y-adaptor ends. Following the first hybridization, the other end of the target molecule is in spatial proximity to the adaptor-complementary sequence in the capture molecule and the likelihood of binding and second hybridization is high. The hybridization results in the formation of a hybrid molecule wherein the linear single stranded target nucleic acid is coupled to a capture molecule at its 3'- and 5'-ends.

[0073] The 3'-end of the target nucleic acid is extended in the presence of Pfu polymerase, dNTPs and magnesium at a suitable temperature. The polymerase travels around the circular capture molecule until it reaches the 5'-end of the adaptor-ligated target nucleic acid that is bound to the capture molecule.

[0074] Next, T4 DNA ligase is added under conditions suitable for ligation. Ligation creates a hybrid molecule where a portion of the circular target molecule is hybridized to a portion of the circular capture molecule.

[0075] The hybrid molecule is captured and isolated from the sample using streptavidin decorated paramagnetic beads binding to the biotin-labeled capture molecule. The hybrid molecule is heat-denatured and the single stranded capture molecule is captured and removed again using streptavidin beads.

[0076] Once the creation of the library of circular target DNA molecules is completed, it can be used for circular consensus sequencing. Each circular molecule originating from one strand of the original DNA molecule is sequenced using SMRT® technology on the Pacific BioSciences instrument or using nanopore technology on a Genia instrument. The complementary strand will be sequenced in a different reaction.

[0077] The sequencing is followed by bioinformatic analysis. The two strands are bioinformatically associated and a consensus sequence is generated.
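One possible way to associate the two strands, once a per-strand consensus is available for a shared UID, is sketched below. It assumes that the two consensus sequences are exact-length reverse complements of each other apart from errors, and it masks disagreeing positions with `N`; both the masking rule and the function names are illustrative assumptions rather than part of the example.

```python
_COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq: str) -> str:
    """Reverse complement of an upper-case DNA sequence (N stays N)."""
    return seq.translate(_COMPLEMENT)[::-1]


def duplex_consensus(top: str, bottom: str) -> str:
    """Combine per-strand consensus sequences of one original duplex.

    The bottom-strand consensus is converted to top-strand orientation and
    compared base by base; positions where the strands disagree become 'N'.
    """
    bottom_as_top = reverse_complement(bottom)
    if len(top) != len(bottom_as_top):
        raise ValueError("strand consensus sequences must have equal length")
    return "".join(a if a == b else "N" for a, b in zip(top, bottom_as_top))
```

For instance, a top-strand consensus `ACGTTG` paired with the bottom-strand consensus `CAACGT` is returned unchanged, whereas a single discordant base on either strand yields an `N` at that position.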


Claims

1. A method of making a library of circular single stranded target nucleic acid molecules from a sample comprising a plurality of double-stranded target nucleic acid molecules, the method comprising:

a. ligating an adaptor to each end of the double-stranded target molecule, thereby forming an adaptor-ligated double-stranded molecule;

b. denaturing the adaptor-ligated double-stranded molecule, thereby forming two strands of the adaptor-ligated molecule;

c. annealing a capture molecule to each strand of the adaptor-ligated molecule, wherein the capture molecule is a circular single-stranded nucleic acid molecule comprising two sequences complementary to at least a portion of the adaptor, thereby forming a hybrid molecule comprising the capture molecule hybridized to the adaptor sequences at the 5'-end and the 3'-end of the strand of the adaptor-ligated molecule;

d. extending the 3'-end of the strand of the adaptor-ligated molecule to reach the 5'-end of the strand of the adaptor-ligated molecule;

e. ligating the 5'-end and the 3'-end of the strand of the adaptor-ligated molecule, thereby forming a hybrid molecule comprising the capture molecule and a circularized strand of the adaptor-ligated molecule;

f. separating the capture molecule from the circularized strand of the adaptor-ligated molecule, thereby forming a library of circular single stranded target nucleic acid molecules.


 
2. The method of claim 1, wherein the adaptor comprises at least one double-stranded region and at least one single-stranded region, each comprising two strands.
 
3. The method of claim 1, wherein the adaptor comprises at least one barcode.
 
4. The method of claim 1, wherein the adaptor comprises at least one primer binding site, which is optionally a sequencing primer binding site.
 
5. The method of claim 2, wherein the capture molecule comprises two sequences complementary to at least a portion of the single-stranded region of the adaptor.
 
6. The method of claim 2, wherein the capture molecule comprises two sequences complementary to the single-stranded region and the double stranded region of the adaptor.
 
7. The method of claim 3, wherein the barcode is a multiplex sample identifying barcode (MID).
 
8. The method of claim 3, wherein the barcode is a unique molecular identifying barcode (UID).
 
9. The method of claim 1, wherein the sequences complementary to at least a portion of the adaptor are located diametrically opposite one another in the capture molecule.
 
10. The method of claim 1, wherein the capture molecule comprises a barcode.
 
11. The method of claim 1, wherein the capture molecule comprises a primer binding site.
 
12. The method of claim 1, wherein the capture molecule comprises a binding moiety for being captured by a solid support.
 
13. A method of sequencing target nucleic acids in a sample comprising a plurality of target molecules, the method comprising:

a. creating a library of circular target nucleic acid molecules from the sample by the method of claims 1-12, wherein the adaptors further comprise a binding site for a sequencing primer;

b. annealing the sequencing primer to the binding site; and

c. extending the sequencing primer, thereby obtaining the sequence of the target nucleic acid.


 
14. A method of making a library of circular single stranded target nucleic acid molecules from a sample comprising a plurality of double-stranded target nucleic acid molecules, the method comprising:

a. ligating an adaptor to each end of the double-stranded target molecule, thereby forming an adaptor-ligated double-stranded molecule;

b. denaturing the adaptor-ligated double-stranded molecule, thereby forming two strands of the adaptor-ligated molecule;

c. annealing a capture molecule to each strand of the adaptor-ligated molecule, wherein the capture molecule is a circular single-stranded nucleic acid molecule comprising two adjacent sequences complementary to at least a portion of the adaptor, thereby forming a hybrid molecule comprising the capture molecule hybridized to the adaptor sequences at the 5'-end and the 3'-end of the strand of the adaptor-ligated molecule;

d. ligating the 5'-end and the 3'-end of the strand of the adaptor-ligated molecule hybridized to adjacent sequences on the capture molecule, thereby forming a hybrid molecule comprising the capture molecule and a circularized strand of the adaptor-ligated molecule;

e. separating the capture molecule from the circularized strand of the adaptor-ligated molecule, thereby forming a library of circular single stranded target nucleic acid molecules.


 
15. A method of making a library of circular single stranded target nucleic acid molecules from a sample comprising a plurality of double-stranded target nucleic acid molecules, the method comprising:

a. denaturing the double-stranded molecule, thereby forming two strands of the target molecule;

b. annealing a capture molecule to each strand of the target molecule, wherein the capture molecule is a circular single-stranded nucleic acid molecule comprising two sequences complementary to at least a portion of the target molecule, thereby forming a hybrid molecule comprising the capture molecule hybridized to the sequences at the 5'-end and the 3'-end of the strand of the target molecule;

c. extending the 3'-end of the strand of the target molecule to reach the 5'-end of the strand of the target molecule;

d. ligating the 5'-end and the 3'-end of the strand of the target molecule, thereby forming a hybrid molecule comprising the capture molecule and a circularized strand of the target molecule;

e. separating the capture molecule from the circularized strand of the target molecule, thereby forming a library of circular single stranded target nucleic acid molecules.


 


Ansprüche

1. Verfahren zum Erstellen einer Bibliothek von ringförmigen einzelsträngigen Zielnukleinsäuremolekülen aus einer Probe, umfassend eine Vielzahl von doppelsträngigen Zielnukleinsäuremolekülen, wobei das Verfahren Folgendes umfasst:

a. Ligieren eines Adaptors an jedes Ende des doppelsträngigen Zielmoleküls, wodurch ein adaptorligiertes doppelsträngiges Molekül gebildet wird;

b. Denaturieren des adaptorligierten doppelsträngigen Moleküls, wodurch zwei Stränge des adaptorligierten Moleküls gebildet werden;

c. Annealing eines Fängermoleküls an jeden Strang des adaptorligierten Moleküls, wobei das Fängermolekül ein ringförmiges einzelsträngiges Nukleinsäuremolekül ist, das zwei Sequenzen umfasst, die mindestens zu einem Teil des Adaptors komplementär sind, wodurch ein Hybridmolekül gebildet wird, das das Fängermolekül umfasst, das am 5'-Ende und am 3'-Ende des Stranges des adaptorligierten Moleküls an die Adaptorsequenzen hybridisiert ist;

d. Verlängern des 3'-Endes des Stranges des adaptorligierten Moleküls, um das 5'-Ende des Stranges des adaptorligierten Moleküls zu erreichen;

e. Ligieren des 5'-Endes und des 3'-Endes des Stranges des adaptorligierten Moleküls, wodurch ein Hybridmolekül gebildet wird, das das Fängermolekül und einen ringförmig gemachten Strang des adaptorligierten Moleküls umfasst;

f. Trennen des Fängermoleküls von dem ringförmig gemachten Strang des adaptorligierten Moleküls, wodurch eine Bibliothek von ringförmigen einzelsträngigen Zielnukleinsäuremolekülen gebildet wird.


 
2. Verfahren nach Anspruch 1, wobei der Adaptor mindestens eine doppelsträngige Region und mindestens eine einzelsträngige Region umfasst, die jeweils zwei Stränge umfassen.
 
3. Verfahren nach Anspruch 1, wobei der Adaptor mindestens einen Barcode umfasst.
 
4. Verfahren nach Anspruch 1, wobei der Adaptor mindestens eine Primer-Bindungsstelle umfasst, die gegebenenfalls eine Sequenzierungsprimer-Bindungsstelle ist.
 
5. Verfahren nach Anspruch 2, wobei das Fängermolekül zwei Sequenzen umfasst, die mindestens zu einem Teil der einzelsträngigen Region des Adaptors komplementär sind.
 
6. Verfahren nach Anspruch 2, wobei das Fängermolekül zwei Sequenzen umfasst, die zu der einzelsträngigen Region und der doppelsträngigen Region des Adaptors komplementär sind.
 
7. Verfahren nach Anspruch 3, wobei der Barcode ein Multiplex-Probenidentifizierungs-Barcode (MID) ist.
 
8. Verfahren nach Anspruch 3, wobei der Barcode ein eindeutiger Molekülidentifizierungs-Barcode (UID) ist.
 
9. Verfahren nach Anspruch 1, wobei die Sequenzen, die mindestens zu einem Teil des Adaptors komplementär sind, diametral entgegengesetzt zueinander in dem Fängermolekül liegen.
 
10. Verfahren nach Anspruch 1, wobei das Fängermolekül einen Barcode umfasst.
 
11. Verfahren nach Anspruch 1, wobei das Fängermolekül eine Primer-Bindungsstelle umfasst.
 
12. Verfahren nach Anspruch 1, wobei das Fängermolekül eine Bindungseinheit umfasst, die von einem festen Träger gefangen wird.
 
13. Verfahren zum Sequenzieren von Zielnukleinsäuren in einer Probe, umfassend eine Vielzahl von Zielmolekülen, wobei das Verfahren Folgendes umfasst:

a. Erzeugen einer Bibliothek von ringförmigen Zielnukleinsäuremolekülen aus einer Probe durch das Verfahren nach den Ansprüchen 1-12, wobei die Adaptoren ferner eine Bindungsstelle für einen Sequenzierungsprimer umfassen;

b. Annealing des Sequenzierungsprimers an die Bindungsstelle und

c. Verlängern des Sequenzierungsprimers, wodurch die Sequenz der Zielnukleinsäure erhalten wird.


 
14. Verfahren zum Erstellen einer Bibliothek von ringförmigen einzelsträngigen Zielnukleinsäuremolekülen aus einer Probe, umfassend eine Vielzahl von doppelsträngigen Zielnukleinsäuremolekülen, wobei das Verfahren Folgendes umfasst:

a. Ligieren eines Adaptors an jedes Ende des doppelsträngigen Zielmoleküls, wodurch ein adaptorligiertes doppelsträngiges Molekül gebildet wird;

b. Denaturieren des adaptorligierten doppelsträngigen Moleküls, wodurch zwei Stränge des adaptorligierten Moleküls gebildet werden;

c. Annealing eines Fängermoleküls an jeden Strang des adaptorligierten Moleküls, wobei das Fängermolekül ein ringförmiges einzelsträngiges Nukleinsäuremolekül ist, das zwei benachbarte Sequenzen umfasst, die mindestens zu einem Teil des Adaptors komplementär sind, wodurch ein Hybridmolekül gebildet wird, das das Fängermolekül umfasst, das am 5'-Ende und am 3'-Ende des Stranges des adaptorligierten Moleküls an die Adaptorsequenzen hybridisiert ist;

d. Ligieren des 5'-Endes und des 3'-Endes des Stranges des adaptorligierten Moleküls, das an benachbarte Sequenzen an dem Fängermolekül hybridisiert ist, wodurch ein Hybridmolekül gebildet wird, das das Fängermolekül und einen ringförmig gemachten Strang des adaptorligierten Moleküls umfasst;

e. Trennen des Fängermoleküls von dem ringförmig gemachten Strang des adaptorligierten Moleküls, wodurch eine Bibliothek von ringförmigen einzelsträngigen Zielnukleinsäuremolekülen gebildet wird.


 
15. Verfahren zum Erstellen einer Bibliothek von ringförmigen einzelsträngigen Zielnukleinsäuremolekülen aus einer Probe, umfassend eine Vielzahl von doppelsträngigen Zielnukleinsäuremolekülen, wobei das Verfahren Folgendes umfasst:

a. Denaturieren des doppelsträngigen Moleküls, wodurch zwei Stränge des Zielmoleküls gebildet werden;

b. Annealing eines Fängermoleküls an jeden Strang des Zielmoleküls, wobei das Fängermolekül ein ringförmiges einzelsträngiges Nukleinsäuremolekül ist, das zwei Sequenzen umfasst, die mindestens zu einem Teil des Zielmoleküls komplementär sind, wodurch ein Hybridmolekül gebildet wird, das das Fängermolekül umfasst, das am 5'-Ende und am 3'-Ende des Stranges des Zielmoleküls an die Sequenzen hybridisiert ist;

c. Verlängern des 3'-Endes des Stranges des Zielmoleküls, um das 5'-Ende des Stranges des Zielmoleküls zu erreichen;

d. Ligieren des 5'-Endes und des 3'-Endes des Stranges des Zielmoleküls, wodurch ein Hybridmolekül gebildet wird, das das Fängermolekül und einen ringförmig gemachten Strang des Zielmoleküls umfasst;

e. Trennen des Fängermoleküls von dem ringförmig gemachten Strang des Zielmoleküls, wodurch eine Bibliothek von ringförmigen einzelsträngigen Zielnukleinsäuremolekülen gebildet wird.


 


Revendications

1. Procédé de fabrication d'une bibliothèque de molécules cibles d'acide nucléique circulaire simple brin à partir d'un échantillon comprenant une pluralité de molécules cibles d'acide nucléique double brin, le procédé comprenant :

a. la ligature d'un adaptateur à chaque extrémité de la molécule cible double brin, formant ainsi une molécule double brin liée à l'adaptateur ;

b. la dénaturation de la molécule double brin liée à l'adaptateur, formant ainsi deux brins de la molécule liée à l'adaptateur ;

c. la renaturation d'une molécule de capture à chaque brin de la molécule liée à l'adaptateur, dans laquelle la molécule de capture est une molécule d'acide nucléique circulaire simple brin comprenant deux séquences complémentaires d'au moins une partie de l'adaptateur, formant ainsi une molécule hybride comprenant la molécule de capture hybridée aux séquences de l'adaptateur à l'extrémité 5' et à l'extrémité 3' du brin de la molécule liée à l'adaptateur ;

d. l'allongement de l'extrémité 3' du brin de la molécule liée à l'adaptateur pour atteindre l'extrémité 5' du brin de la molécule liée à l'adaptateur ;

e. la ligature de l'extrémité 5' et de l'extrémité 3' du brin de la molécule liée à l'adaptateur, formant ainsi une molécule hybride comprenant la molécule de capture et un brin circularisé de la molécule liée à l'adaptateur ;

f. la séparation de la molécule de capture du brin circularisé de la molécule liée à l'adaptateur, formant ainsi une bibliothèque de molécules cibles d'acide nucléique circulaire simple brin.


 
2. Procédé selon la revendication 1, dans lequel l'adaptateur comprend au moins une région double brin et au moins une région simple brin, chacune comprenant deux brins.
 
3. Procédé selon la revendication 1, dans lequel l'adaptateur comprend au moins un code-barres.
 
4. Procédé selon la revendication 1, dans lequel l'adaptateur comprend au moins un site de liaison à une amorce, qui est éventuellement un site de liaison à une amorce de séquençage.
 
5. Procédé selon la revendication 2, dans lequel la molécule de capture comprend deux séquences complémentaires d'au moins une partie de la région simple brin de l'adaptateur.
 
6. Procédé selon la revendication 2, dans lequel la molécule de capture comprend deux séquences complémentaires de la région simple brin et de la région double brin de l'adaptateur.
 
7. Procédé selon la revendication 3, dans lequel le code-barres est un code-barres d'identification d'échantillon multiplexé (MID).
 
8. Procédé selon la revendication 3, dans lequel le code-barres est un code-barres d'identification moléculaire unique (UID).
 
9. Procédé selon la revendication 1, dans lequel les séquences complémentaires d'au moins une partie de l'adaptateur sont situées diamétralement opposées l'une par rapport à l'autre dans la molécule de capture.
 
10. Procédé selon la revendication 1, dans lequel la molécule de capture comprend un code-barres.
 
11. Procédé selon la revendication 1, dans lequel la molécule de capture comprend un site de liaison à une amorce.
 
12. Procédé selon la revendication 1, dans lequel la molécule de capture comprend une fraction de liaison destinée à être capturée par un support solide.
 
13. Procédé de séquençage d'acides nucléiques cibles dans un échantillon comprenant une pluralité de molécules cibles, le procédé comprenant :

a. la création d'une bibliothèque de molécules cibles d'acide nucléique circulaire à partir de l'échantillon par le procédé selon les revendications 1 à 12, dans laquelle les adaptateurs comprennent en outre un site de liaison pour une amorce de séquençage ;

b. la renaturation de l'amorce de séquençage au site de liaison ; et

c. l'allongement de l'amorce de séquençage, obtenant ainsi la séquence de l'acide nucléique cible.


 
14. Procédé de fabrication d'une bibliothèque de molécules cibles d'acide nucléique circulaire simple brin à partir d'un échantillon comprenant une pluralité de molécules cibles d'acide nucléique double brin, le procédé comprenant :

a. la ligature d'un adaptateur à chaque extrémité de la molécule cible double brin, formant ainsi une molécule double brin liée à l'adaptateur ;

b. la dénaturation de la molécule double brin liée à l'adaptateur, formant ainsi deux brins de la molécule liée à l'adaptateur ;

c. la renaturation d'une molécule de capture à chaque brin de la molécule liée à l'adaptateur, dans laquelle la molécule de capture est une molécule d'acide nucléique circulaire simple brin comprenant deux séquences adjacentes complémentaires d'au moins une partie de l'adaptateur, formant ainsi une molécule hybride comprenant la molécule de capture hybridée aux séquences d'adaptateur à l'extrémité 5' et à l'extrémité 3' du brin de la molécule liée à l'adaptateur ;

d. la ligature de l'extrémité 5' et de l'extrémité 3' du brin de la molécule liée à l'adaptateur hybridée aux séquences adjacentes sur la molécule de capture, formant ainsi une molécule hybride comprenant la molécule de capture et un brin circularisé de la molécule liée à l'adaptateur ;

e. la séparation de la molécule de capture du brin circularisé de la molécule liée à l'adaptateur, formant ainsi une bibliothèque de molécules cibles d'acide nucléique circulaire simple brin.


 
15. Procédé de fabrication d'une bibliothèque de molécules cibles d'acide nucléique circulaire simple brin à partir d'un échantillon comprenant une pluralité de molécules cibles d'acide nucléique double brin, le procédé comprenant :

a. la dénaturation de la molécule double brin, formant ainsi deux brins de la molécule cible ;

b. la renaturation d'une molécule de capture à chaque brin de la molécule cible, dans laquelle la molécule de capture est une molécule d'acide nucléique circulaire simple brin comprenant deux séquences complémentaires d'au moins une partie de la molécule cible, formant ainsi une molécule hybride comprenant la molécule de capture hybridée aux séquences à l'extrémité 5' et à l'extrémité 3' du brin de la molécule cible ;

c. l'allongement de l'extrémité 3' du brin de la molécule cible pour atteindre l'extrémité 5' du brin de la molécule cible ;

d. la ligature de l'extrémité 5' et de l'extrémité 3' du brin de la molécule cible, formant ainsi une molécule hybride comprenant la molécule de capture et un brin circularisé de la molécule cible ;

e. la séparation de la molécule de capture du brin circularisé de la molécule cible, formant ainsi une bibliothèque de molécules cibles d'acide nucléique circulaire simple brin.


 




Drawing







