FIELD OF THE INVENTION
[0001] The present invention relates to mammalian cells modified to provide for improved
expression of a recombinant protein of interest. In particular, the invention relates
to CHO cells and other host cells in which the expression of one or more endogenous
secreted proteins has been disrupted, as well as to the preparation, identification
and use of such cells in the production of recombinant proteins.
BACKGROUND OF THE INVENTION
[0002] Chinese hamster ovary (CHO) cells are the primary expression system for manufacturing
biopharmaceuticals, accounting for 35.5% of the total cumulative (1982-2014) biopharmaceutical
product approvals (Walsh, 2014). Because of their importance in commercial production
of therapeutic recombinant proteins, significant efforts have been made in order to
characterize and optimize CHO cells for this purpose (Kim et al., 2012; Datta et al.,
2013; Lee et al., 2015; Chaudhuri et al., 2015). These include studies of the CHO
cell proteome and secretome, the latter representing the Host Cell Proteins (HCPs)
secreted from CHO cells (Baycin-Hizal et al., 2012; Slade et al., 2012; Zhang, 2010).
[0003] WO 2007/006808 A1 relates to CHO cells with knocked-down or knocked-out genes encoding host cell proteins
contaminating recombinant production of vitamin K-dependent proteins. For example,
endogenous Protein S was reported to contaminate preparations of recombinant Factor
VIIa, when this protein was produced in CHO cells.
[0004] WO 2012/082509 A2 relates to knockdown or knockout of the biomarkers Acsl3, Akrlb8, Anapc10, Arl6ip1,
Cnih2, Ctsd, Ctsl, Derl1, Dse, Ebpl, Ecm1, Elk3, Fth1, Gjb3, Hmox1, Itgb1bp1, Ldha,
Lgals1, Lgals3, Lrpap1, Myl6b, Pit1, Pomp, Rps26, Sen2, Sh2d3c, S100a4, Tmed1, Tnfrsf12a,
Tinagl, Tnfrsf25, Trappc6b, 0610007C21Rik and 2610209M04Rik for the purpose of improving
cell growth or viability, and enhancing protein production in CHO cells.
[0005] Despite these and other advances in the art, there is still a need for host cells
such as CHO cells which are capable of high-yield production of recombinant proteins
of interest.
SUMMARY OF THE INVENTION
[0006] It has been found by the present inventor(s) that mammalian host cells may be modified,
such as by disruption by knockout of specific endogenous genes, for improved or higher
expression of a recombinant protein of interest being produced by the host cell, such
as a CHO cell.
[0007] First of all a production by the mammalian cells of these host cell proteins may
be limiting the cells ability to produce a recombinant protein of interest, simply
by the use of the same protein production machinery of the cell, or by the supply
of nutrients for the proteins. By knocking out these endogenous genes, the host cell
do not spend ressources on the production of these endogenous proteins product of
these genes.
[0008] Secondly, some of these endogenous proteins product may be very difficult to completely
remove from a media of a host cell producing one or more of these host cell proteins
in addition to a recombinant protein of interest.
[0009] Another object of embodiments of the present invention is to provide mammalian host
cells , which cells are able to grow to a higher cell density. This has a significant
advantage in that a higher production of a recombinant protein may be obtained by
having more cells in the same batch producing such a recombinant protein.
[0010] So, in a first aspect the present invention relates to a CHO cell modified to disrupt
expression of one or more endogenous proteins selected from the group consisting of
TIMP1, NID1-1, NID1-2, and LGALS3BP.
[0011] In a second aspect the present invention relates to a method of producing a recombinant
protein of interest, comprising
- (a) culturing the CHO cell of some embodiments of the invention in a culture medium
such that the protein of interest is expressed from the nucleic acid sequence;
- (b) harvesting the recombinant protein of interest from the culture medium; and
- (c) optionally, further purifying the harvested recombinant protein of interest.
[0012] In a third aspect the present invention relates to a method for preparing the CHO
cell of the invention, comprising the steps of
- (a) modifying the CHO cell to knock-down or knock-out the endogenous gene or genes
encoding one or more of TIMP1, NID1-1, NID1-2, and LGALS3BP; and
- (b) optionally, transfecting the modified CHO cell with a nucleic acid sequence encoding
a protein of interest.
BRIEF DESCRIPTION OF THE DRAWINGS
[0013]
Figure 1: Rituximab expression from a multiplex secretory KO and individual secretory
KO clones(normalized to CHO-S WT). Rituximab titers from transiently transfected CHO
cell lines grown in shakerflasks. All titers are normalised to titers from wt CHO
cells.
Figure 2: Maximum Cell Density of a multiplex secretory KO and individual secretory
KO clones(normalized to CHO-S WT. Maximum reached cell density in shakeflasks, for
the 6SecKO and for Cell lines with the genes knocked out individually, compared to
wt.
Figure 3: Presence of Host Cell Proteins (HCP) (normalised to wt CHO-S) in used media
measured by quantity (BCA and Nanodrop) and immunogenicity (HCP). CDCHO is HCP free
CHO media.
Figure 4: Analysis of the HCP present in used media from cells with the genes TIMP1,
BGN, NID1-1 and NID1-2 knocked out individually. Numbers are normalized to wt being
100.
DETAILED DISCLOSURE OF THE INVENTION
[0014] The present invention relates to mammalian cells and in particular CHO cells modified
to provide for improved expression of a recombinant protein of interest. In particular,
the expression of one or more endogenous secreted proteins has been significantly
reduced or disrupted.
[0015] It will be understood that in principle any method or technique for reducing or disrupting
the expression of one or more endogenous secreted proteins may be used. The examples
of such methods including siRNA targeting, targeted gene knock-out, transfection with
a transcriptional factor, and site-specific cleavage of the DNA strands encoding protein
contaminants are not to be construed limiting in any way. In principle, any molecular
biology, cell biology, or selection method may be used to reduce or disrupt the expression
level of particular endogenous secreted proteins.
Definitions
[0016] The term "disrupt" as used herein refers to the significant reduction to complete
removal of the expression of one or more endogenous proteins in a CHO cell, such as
be knockdown or knockout. This may be measured as presence of this one or more endogenous
proteins in a culture medium of the CHO cell, such as by mass spectrometry wherein
the total content of a endogenous protein may be less than at the most 500 ppm, such
as at the most 100 ppm, such as at the most 10 ppm, e.g. at the most 1 ppm, or at
the most 0.1 ppm, or non-detectable.
[0017] The term "knock-down", "reduction" or "depletion" in the context of gene expression
as used herein refers to experimental approaches leading to reduced expression of
a given gene compared to expression in a control cell. Knock-down of a gene can be
achieved by various experimental means such as introducing nucleic acid molecules
into the cell which hybridize with parts of the gene's mRNA leading to its degradation
(e.g. shRNAs, RNAi, miRNAs) or altering the sequence of the gene in a way that leads
to reduced transcription, reduced mRNA stability or diminished mRNA translation.
[0018] A complete inhibition of expression of a given gene is referred to as "knock-out".
Knockout of a gene means that no functional transcripts are synthesized from said
gene leading to a loss of function normally provided by this gene. Gene knock-out
is achieved by altering the DNA sequence leading to disruption or deletion of the
gene or its regulatory sequences. Knock-out technologies include the use of homologous
recombination techniques to replace, interrupt or delete crucial parts or the entire
gene sequence or the use of DNA- modifying enzymes such as zink- finger nucleases
to introduce double strand breaks into DNA of the target gene.
[0019] One specific method for knocking out a specific gene according to the invention is
the CRISPR-Cas9 methods as described in e.g.
Ronda, Carlotta; Pedersen, Lasse Ebdrup; Hansen, Henning Gram; Kallehauge, Thomas
Beuchert; Betenbaugh, Michael J; Nielsen, Alex Toftgaard; Kildegaard, Helene Faustrup;,
Accelerating genome editing in CHO cells using CRISPR Cas9 and CRISPy, a web-based
target finding tool, Biotechnology and bioengineering,111,8,1604-1616,2014, or in
Grav, Lise Marie; Lee, Jae Seong; Gerling, Signe; Kallehauge, Thomas Beuchert; Hansen,
Anders Holmgaard; Kol, Stefan; Lee, Gyun Min; Pedersen, Lasse Ebdrup; Kildegaard,
Helene Faustrup; One-step generation of triple knockout CHO cell lines using CRISPR/Cas9
and fluorescent enrichment, Biotechnology journal, 2015.
[0020] The term "eukaryotic host cell line" as used herein refers to any suitable cell line
from eukaryotic origin that can be used for the expression of recombinant proteins.
Non-limiting examples of suitable host cell lines includes CHO cells, BKH cells, HKBI
I cells, SP2/0 cells, and NSO cells, L-M cells (ATCC CCL 1), HEK293 cells (ATCC CCL
1555), HEK293 (ATCC CRL 1573;
Graham et al., J. Gen. Virol. 36: 59-72, 1977), CV-1 cells (ATCC CCL 70), COS-1 cells (ATCC CRL 1650), COS-7 cells (ATCC CRL 1651),
3T3 cells (ATCC CCL 92), NIH/3T3 cells (ATCC CRL 1658), HeLa cells(ATCC CCL 2), C127I
cells (ATCC CRL 1616), BS-C-1 cells (ATCC CCL 26) and MRC-5 cells (ATCC CCL 171).
One particular suitable cell is a CHO cell (e. g. , ATCC CCL 61), such as one selected
from a CHO-K1, CHO-S and CHO DG44 cells.
[0021] The term "Protein of interest" as used herein refers to any protein that may be produced
by recombinant means, typically a protein that require expression in a eukaryotic
host cell due to post translational modifications including glycosylation. This includes
proteins of human and animal origin, but also proteins of other sources such as plants,
insects, etc., and mutated, artificial, synthetic, fusion or chimeric proteins. In
particular "protein of interest" includes plasma proteins, peptide hormones, growth
factors, cytokines and antibodies. In more detail, plasma proteins include human and
animal blood clotting factors such as fibrinogen, prothrombin, thrombin, FX, FXa,
FIX, FIXa, FVII, FVIIa, FVIII, FVIIIa, FXI, FXIa, FXII, FXIIa, FXIII, FXIIIa, von
Willebrand factor etc., transport proteins such as albumin, transferrin, ceruloplasmin,
haptoglobin, hemoglobin, hemopexin, etc., protease inhibitors such as β-antithrombin,
o-antithrombin, o2-macroglobulin, Cl-inhibitor, tissue factor pathway inhibitor (TFPI),
heparin cofactor II, protein C inhibitor (PAI-3), Protein C, Protein S, etc., antiangionetic
proteins such as latent-antithrombin, etc., highly glycosylated proteins including
alfa-1-acid glycoprotein, antichymotrypsin, inter-α-trypsin inhibitor, α-2-HS glycoprotein,
C-reactive protein, etc. and other proteins such as histidine-rich glycoprotein, mannan
binding lectin, C4-binding protein, fibronectin, GC-globulin, plasminogen, blood factors
such as erythropoeitin, interferon, tumor factors, tPA, gCSF and derivatives and muteins
thereof. The antibody may be any therapeutic antibody selected from the group consisting
of abagovomab, adalimumab, alemtuzumab, aurograb, bapineuzumab, basiliximab, belimumab,
bevacizumab, briakinumab, canakinumab, catumaxomab, certolizumab pegol, cetuximab,
daclizumab, denosumab, efalizumab, galiximab, gemtuzumab ozogamicin, golimumab, ibritumomab
tiuxetan, infliximab, ipilimumab, lumiliximab, mepolizumab, motavizumab, muromonab,
mycograb, natalizumab, nimotuzumab, ocrelizumab, ofatumumab, omalizumab, palivizumab,
panitumumab, pertuzumab, ranibizumab, reslizumab, rituximab, teplizumab, tocilizumab/atlizumab,
tositumomab, trastuzumab, Proxinium
™, Rencarex
™, ustekinumab, and zalutumumab.
[0022] "TIMP1" or TIMP-1 refers to Tissue inhibitors of metalloproteases type 1 endogenously
derived from the host cell, such as Chinese Hamster TIMP-1 of a CHO cell.
[0023] "LGALS3BP " refers to lectin, galactoside-binding soluble 3 binding protein endogenously
derived from the host cell, such as Chinese Hamster LGALS3BP of a CHO cell.
[0024] NID1 or NID1-1 as used herein refers to Nidogen 1. On the CHO K1 reference genome
this gene exists in 2 isoforms, hence the NID1-1 and NID1-2.
[0025] "BGN" refers to gene of biglycan.
[0026] "CTSD" refers to a Cathepsin D protein endogenously derived from the host cell, such
as Chinese Hamster CTSD of a CHO cell.
Timp1 identifiers from NCBI:
[0027]
Gene ID: 100770193; mRNA: XM_007630118.1, XM_007630119.1, JP061669.1, JI873443.1
Protein: XP_003512290.1, XP_007628308.1, XP_007628309.1, EGW14768.1, ERE65804.1
LGALS3BP identifiers from NCBI:
[0028]
Gene ID: 100767541; mRNA: XM_003498817.2, XM_007622088.1
Protein: XP_007620278.1, XP_003498865.1, EGV97878.1
NID1 identifiers from NCBI:
[0029]
Gene ID: 100766031; mRNA: XM_003509885.2, XM_003507587.2, XM_007632645.1, KE673075.1
Proteins: ERE78436.1, XP_007630835.1, XP_003507635.2, XP_003509933.2
BGN identifiers from NCBI:
[0030]
Gene ID: 100771022; mRNA: XM_003506540.2, XM_007628144.1, KE685970.1:158610-162415
(+), JH000676.1:731964-735640 (-)
Protein: XP_003506588.1, XP_007626334.1, ERE63792.1, EGV92105.1
CTSD identifiers from NCBI:
[0031]
Gene ID: 100766628; mRNA, XM_003510234.2, XM_007611606.1, JI884212.1, JP051918.1,
JH001272.1:176080-187386 (-), KE672442.1:201083-212389 (+)
Protein, EGW04839.1, ERE79325.1, XP_007609796.1, XP_003510282.1
Specific embodiments of the invention
[0032] The present invention relates to a CHO cell modified to disrupt expression of one
or more endogenous proteins selected from the group consisting of TIMP1, NID1-1, NID1-2,
and LGALS3BP.
[0033] In some embodiments the CHO cell is modified to disrupt expression of TIMP1.
[0034] In some embodiments the CHO cell is modified to disrupt expression of LGALS3BP.
[0035] In some embodiments the CHO cell is modified to disrupt expression of NID1-1, NID1-2,
or both.
[0036] In some embodiments the CHO cell is modified to disrupt expression of the endogenous
proteins TIMP1, NID1-1, NID1-2, and LGALS3BP.
[0037] In some embodiments the CHO cell is modified to disrupt expression of
- (a) at least 5 endogenous proteins;
- (b) at most 50 endogenous proteins; or
- (c) between 5 and 50 endogenous proteins.
[0038] In some embodiments the CHO cell is modified to disrupt expression of TIMP1, LGALS3BP,
NID1-1, and NID1-2.
[0039] In some embodiments the expression of at least one endogenous protein is disrupted
by knockdown of the gene encoding the endogenous protein.
[0040] In some embodiments the expression of at least one endogenous protein is disrupted
by knockout of the gene encoding the endogenous protein.
[0041] In some embodiments the CHO cell of the invention comprises a nucleic acid sequence
encoding a recombinant protein of interest, the nucleic acid sequence being extrachromosomal
or chromosomally integrated and under the control of an inducible or constitutive
promoter.
[0042] In some embodiments the cell of the invention produce higher levels of said recombinant
protein of interest and/or has an increased cell-density as compared to a relevant
control cell without disrupted expression of one or more endogenous proteins, such
as a wild type CHO cell.
[0043] In some embodiments the CHO cell according to the invention is selected from a CHO-K1,
CHO-S and CHO DG44 cells.
[0044] According to the cells and methods of the invention, the gene expression of certain
genes are "disrupted" by at least one step selected from the group consisting of gene
silencing, gene knock-down, gene knock-out, delivery of a dominant negative construct,
conditional gene knock-out, and/or by gene alteration with respect to a specific gene.
[0045] The term "gene expression", as used herein, is meant to encompass at least one step
selected from the group consisting of DNA transcription into mRNA, mRNA processing,
noncoding mRNA maturation, mRNA export, translation, protein folding and/or protein
transport.
[0046] The inhibition or reduction of gene expression of a gene refers to methods which
directly interfere with gene expression, encompassing, but not restricted to, inhibition
or reduction of DNA transcription, e.g., by use of specific promoter-related repressors,
by site specific mutagenesis of a given promoter, by promoter exchange, or inhibition
or reduction of translation, e.g., by RNAi induced post-transcriptional gene silencing.
The expression of a dysfunctional, or inactive gene product with reduced activity,
can, for example, be achieved by site specific or random mutagenesis, insertions or
deletions within the coding gene.
[0047] The inhibition or reduction of the activity of gene product can, for example, be
achieved by administration of, or incubation with, an inhibitor to the respective
enzyme, prior to or simultaneously with protein expression. Examples for such inhibitors
include, but are not limited to, an inhibitory peptide, an antibody, an aptamer, a
fusion protein or an antibody mimetic against said enzyme, or a ligand or receptor
thereof, or an inhibitory peptide or nucleic acid, or a small molecule with similar
binding activity.
[0048] Other ways to inhibit the enzyme are the reduction of specific cofactors of the enzyme
in the medium, like copper, which is a PAM specific ion cofactor (e.g., in the form
of CuS04), ascorbate, which acts as an electron donor for PAM, molecular oxygen, catalase
and others known today to the skilled artisan, or yet to be discovered in the future.
[0049] Gene silencing, gene knock-down and gene knock-out refers to techniques by which
the expression of a gene is reduced, either through genetic modification or by treatment
with an oligonucleotide with a sequence complementary to either an mRNA transcript
or a gene. If genetic modification of DNA is done, the result is a knock-down or knock-out
organism. If the change in gene expression is caused by an oligonucleotide binding
to an mRNA or temporarily binding to a gene, this results in a temporary change in
gene expression without modification of the chromosomal DNA and is referred to as
a transient knock-down.
[0050] In a transient knock-down, which is also encompassed by the above term, the binding
of this oligonucleotide to the active gene or its transcripts causes decreased expression
through blocking of transcription (in the case of gene-binding), degradation of the
mRNA transcript (e.g. by small interfering RNA (siRNA) or RNase-H dependent antisense)
or blocking either mRNA translation, pre-mRNA splicing sites or nuclease cleavage
sites used for maturation of other functional RNAs such as miRNA (e.g., by Morpholino
oligos or other RNase-H independent antisense). Other approaches involve the use of
shRNA (small hairpin RNA, which is a sequence of RNA that makes a tight hairpin turn
that can be used to silence gene expression via RNA interference), esiRNA (Endoribonuclease-prepared
siRNAs, which are a mixture of siRNA oligos resulting from cleavage of long double-stranded
RNA (dsRNA) with an endoribonuclease), or the activation of the RNA-induced silencing
complex (RISC).
[0051] Other approaches to carry out gene silencing, knock-down or knock-out are known to
the skilled person from the respective literature, and their application in the context
of the present invention is considered as routine.
[0052] Gene knock-out refers to techniques by which the expression of a gene is fully blocked,
i.e. the respective gene is inoperative, or even removed. Methodological approaches
to achieve this goal are manifold and known to the skilled person. Examples are the
production of a mutant which is dominantly negative for the given gene. Such mutant
can be produced by site directed mutagenesis (e.g., deletion, partial deletion, insertion
or nucleic acid substitution), by use of suitable transposons, or by other approaches
which are known to the skilled person from the respective literature, the application
of which in the context of the present invention is thus considered as routine. One
example for a newly developed technique which the skilled person would consider as
useful in the context of the present invention is knock-out by use of targeted Zinc
Finger Nucleases. A respective Kit is provided by Sigma Aldrich as "CompoZR knockout
ZFN". Another approach encompasses the use of Transcription activator-like effector
nucleases (TALENs).
[0053] The delivery of a dominant negative construct involves the introduction of a sequence
coding for a dysfunctional enzyme, e.g., by transfection. Said coding sequence is
functionally coupled to a strong promoter, in such way that the gene expression of
the dysfunctional enzyme overrules the natural expression of the wild type enzyme,
which, in turn, leads to an effective physiological defect of the respective enzyme
activity.
[0054] A conditional gene knock-out allows to block gene expression in a tissue- or time-specific
manner. This is done, for example, by introducing short sequences called IoxP sites
around the gene of interest. Again, other approaches are known to the skilled person
from the respective literature, and their application in the context of the present
invention is considered as routine.
[0055] One other approach is gene alteration which may lead to a dysfunctional gene product
or to a gene product with reduced activity. This approach involves the introduction
of frame shift mutations, nonsense mutations (i.e., introduction of a premature stop
codon) or mutations which lead to an amino acid substitution which renders the whole
gene product dysfunctional, or causing a reduced activity. Such gene alteration can
for example be produced by mutagenesis (e.g., deletion, partial deletion, insertion
or nucleic acid substitution), either unspecific (random) mutagenesis or site directed
mutagenesis.
[0056] Protocols describing the practical application of gene silencing, gene knock-down,
gene knock-out, delivery of a dominant negative construct, conditional gene knock-out,
and/or gene alteration are commonly available to the skilled artisan, and are within
his routine. The technical teaching provided herein is thus entirely enabled with
respect to all conceivable methods leading to an inhibition or reduction of gene expression
of a gene product, or to the expression of a dysfunctional, or inactive gene product,
or with reduced activity.
Preparation of protein of interest
[0057] The invention also relates to a method of preparing a protein of interest as mentioned
above. The protein of interest described herein may be produced by means of recombinant
nucleic acid techniques. In general, a cloned nucleic acid sequence is modified to
encode the desired protein. This modified sequence is then inserted into an expression
vector, which is in turn transformed or transfected into host cells. Higher eukaryotic
cells, in particular cultured mammalian cells, are preferred as host cells. The complete
nucleotide and amino acid sequences for human
[0058] Optional amino acid sequence alterations may be accomplished by a variety of techniques.
Modification of the nucleic acid sequence may be by site-specific mutagenesis. Techniques
for site-specific mutagenesis are well known in the art and are described in, for
example,
Zoller and Smith (DNA 3: 479-488,1984) or "
Splicing by extension overlap", Horton et al., Gene 77,1989, pp. 61-68. Thus, using the nucleotide and amino acid sequences of the protein of interest,
one may introduce the alteration (s) of choice. Likewise, procedures for preparing
a DNA construct using polymerase chain reaction using specific primers are well known
to persons skilled in the art (cf.
PCR Protocols, 1990, Academic Press, San Diego, California, USA).
[0060] The nucleic acid construct encoding the protein of interest may also be prepared
synthetically by established standard methods, e. g. the phosphoamidite method described
by Beaucage and
Caruthers, Tetrahedron Letters 22 (1981), 1859-1869, or the method described by
Matthes et al., EMBO Journal 3 (1984), 801-805. According to the phosphoamidite method, oligonucleotides are synthesised, e. g.
in an automatic DNA synthesiser, purified, annealed, ligated and cloned in suitable
vectors. The protein of interest may also be prepared by polymerase chain reaction
using specific primers, for instance as described in
US 4,683, 202,
Saiki et al., Science 239 (1988), 487-491, or Sambrook et al., supra.
[0061] Furthermore, the nucleic acid construct may be of mixed synthetic and genomic, mixed
synthetic and cDNA or mixed genomic and cDNA origin prepared by ligating fragments
of synthetic, genomic or cDNA origin (as appropriate), the fragments corresponding
to various parts of the entire nucleic acid construct, in accordance with standard
techniques.
[0062] The nucleic acid construct is preferably a DNA construct. DNA sequences for use in
producing the protein of interest will typically encode a pre-pro polypeptide at the
amino-terminus of the protein of interest to obtain proper posttranslational processing
(e. g. gamma-carboxylation of glutamic acid residues) and secretion from the host
cell.
[0063] The DNA sequences encoding the protein of interest are usually inserted into a recombinant
vector which may be any vector, which may conveniently be subjected to recombinant
DNA procedures, and the choice of vector will often depend on the host cell into which
it is to be introduced. Thus, the vector may be an autonomously replicating vector,
i. e. a vector, which exists as an extrachromosomal entity, the replication of which
is independent of chromosomal replication, e. g. a plasmid. Alternatively, the vector
may be one which, when introduced into a host cell, is integrated into the host cell
genome and replicated together with the chromosome(s) into which it has been integrated.
[0064] The vector is preferably an expression vector in which the DNA sequence encoding
the protein of interest is operably linked to additional segments required for transcription
of the DNA. In general, the expression vector is derived from plasmid or viral DNA,
or may contain elements of both. The term "operably linked" indicates that the segments
are arranged so that they function in concert for their intended purposes, e. g. transcription
initiates in a promoter and proceeds through the DNA sequence coding for the polypeptide.
[0065] Expression vectors for use in expressing a protein of interest will comprise a promoter
capable of directing the transcription of a cloned gene or cDNA. The promoter may
be any DNA sequence, which shows transcriptional activity in the host cell of choice
and may be derived from genes encoding proteins either homologous or heterologous
to the host cell.
[0066] Examples of suitable promoters for directing the transcription of the DNA encoding
a protein of interest in mammalian cells are the SV40 promoter (
Subramani et al., Mol. Cell Biol. 1 (1981), 854-864), the MT-1 (metallothionein gene) promoter (
Palmiter et al., Science 222 (1983), 809-814), the CMV promoter (
Boshart et al., Cell 41 : 521-530,1985) or the adenovirus 2 major late promoter (
Kaufman and Sharp, Mol. Cell. Biol, 2: 1304-1319, 1982).
[0067] The DNA sequences encoding the protein of interest may also, if necessary, be operably
connected to a suitable terminator, such as the human growth hormone terminator (
Palmiter et al., Science 222,1983, pp. 809-814) or the TPI 1 (
Alber and Kawasaki, J. Mol. Appl. Gen. 1, 1982, pp. 419-434) or ADH3 (
McKnight et al., The EMBO J. 4,1985, pp. 2093-2099) terminators. Expression vectors may also contain a set of RNA splice sites located
downstream from the promoter and upstream from the insertion site for the sequence
itself encoding the protein of interest. Preferred RNA splice sites may be obtained
from adenovirus and/or immunoglobulin genes. Also contained in the expression vectors
is a polyadenylation signal located downstream of the insertion site. Particularly
preferred polyadenylation signals include the early or late polyadenylation signal
from SV40 (Kaufman and Sharp, ibid. ), the polyadenylation signal from the adenovirus
5 Elb region, the human growth hormone gene terminator (
DeNoto et al. Nucl. Acids Res. 9: 3719-3730,1981) or the polyadenylation signal from the gene encoding the protein of interest. The
expression vectors may also include a noncoding viral leader sequence, such as the
adenovirus 2 tripartite leader, located between the promoter and the RNA splice sites;
and enhancer sequences, such as the SV40 enhancer.
[0068] To direct the protein of interest into the secretory pathway of the host cells, a
secretory signal sequence (also known as a leader sequence, prepro sequence or pre
sequence) may be provided in the recombinant vector. The secretory signal sequence
is joined to the DNA sequences encoding the protein of interest in the correct reading
frame. Secretory signal sequences are commonly positioned 5'to the DNA sequence encoding
the peptide. The secretory signal sequence may be that, normally associated with the
protein or may be from a gene encoding another secreted protein.
[0069] The procedures used to ligate the DNA sequences coding for the protein of interest,
the promoter and optionally the terminator and/or secretory signal sequence, respectively,
and to insert them into suitable vectors containing the information necessary for
replication, are well known to persons skilled in the art (cf. , for instance,
Sambrook et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor, New York,
1989).
[0070] Methods of transfecting mammalian cells and expressing DNA sequences introduced in
the cells are described in e. g.
Kaufman and Sharp, J. Mol. Biol. 159 (1982), 601-621;
Southern and Berg, J. Mol. Appl. Genet. 1 (1982), 327-341;
Loyter et al., Proc. Natl. Acad. Sci. USA 79 (1982), 422-426;
Wigler et al., Cell 14 (1978), 725;
Corsaro and Pearson, Somatic CellGenetics 7 (1981), 603,
Graham and van der Eb, Virology 52 (1973), 456; and
Neumann et al., EMBO J. 1 (1982), 841-845. Cloned DNA sequences are introduced into cultured mammalian cells by, for example,
calcium phosphate-mediated transfection (
Wigler et al., Cell 14: 725-732, 1978;
Corsaro and Pearson, Somatic Cell Genetics 7: 603-616,1981 ;
Graham and Van der Eb, Virology 52d: 456-467, 1973) or electroporation (
Neumann et al., EMBO J. 1: 841-845,1982).
[0071] To identify and select cells that express the exogenous DNA, a gene that confers
a selectable phenotype (a selectable marker) is generally introduced into cells along
with the gene or cDNA of interest. Preferred selectable markers include genes that
confer resistance to drugs such as neomycin, hygromycin, and methotrexate. The selectable
marker may be an amplifiable selectable marker. A preferred amplifiable selectable
marker is a dihydrofolate reductase (DHFR) sequence. Selectable markers are reviewed
by
Thilly (Mammalian Cell Technology, Butterworth Publishers, Stoneham, MA). The person skilled in the art will easily be able to choose suitable selectable
markers.
[0072] Selectable markers may be introduced into the cell on a separate plasmid at the same
time as the gene of interest, or they may be introduced on the same plasmid. If, on
the same plasmid, the selectable marker and the gene of interest may be under the
control of different promoters or the same promoter, the latter arrangement producing
a dicistronic message. Constructs of this type are known in the art (for example,
Levinson and Simonsen, U. S. 4,713, 339). It may also be advantageous to add additional DNA, known as"carrier DNA, "to the
mixture that is introduced into the cells.
[0073] After the cells have taken up the DNA, they are grown in an appropriate growth medium,
typically 1-2 days, to begin expressing the gene of interest. As used herein the term
"appropriate growth medium" means a medium containing nutrients and other components
required for the growth of cells and the expression of the human Protein of interest
of interest. Media generally include a carbon source, a nitrogen source, essential
amino acids, essential sugars, vitamins, salts, phospholipids, protein and growth
factors. For production of gamma-carboxylated proteins, the medium will contain vitamin
K, preferably at a concentration of about 0.1 llg/ml to about 5 Lg/ml. Drug selection
is then applied to select for the growth of cells that are expressing the selectable
marker in a stable fashion. For cells that have been transfected with an amplifiable
selectable marker the drug concentration may be increased to select for an increased
copy number of the cloned sequences, thereby increasing expression levels. Clones
of stably transfected cells are then screened for expression of the protein of interest.
[0074] The host cell into which the DNA sequence encoding the protein of interest is introduced
may be any cell, such as one which is capable of producing the posttranslational modified
proteins, such as higher eukaryotic cells.
Example 1
Identification of the top-6 secreted proteins in our CHO cell line
[0075] First protein mass spectroscopy data of the extracellular media from several CHO
cell cultures was analyzed by a series of processes implemented and executed in python
(
https://www.python.org/):
Relevant data was extracted from mass spec software data output files, e.g. short
and long identifiers, top 3 matched peptide intensity sums and identifiers for potential
homologs.
[0076] Homologs were collapsed by sum and intensity sums were normalized by converting into
percentage of total peptide intensity sums.
[0077] External information about run conditions was added to the data.
[0078] Top 20 proteins were then selected based on the criteria that they were present in
at least half of our samples and have the 20 highest median normalized intensity sum
values.
[0079] These 20 proteins were then further analyzed by gene ontology to exclude proteins
for various reasons:
Gene ontology, as compiled by e.g. the Gene Ontology Consortium (GO,
http)://geneontology.org/), can be used to infer the origin of gene products. The means by which proteins reach
the media of a cell culture may be by e.g. controlled secretion or cell lysis. We
were primarily interested in proteins that had reached the media by secretion, and
specifically not interested in proteins that had reached in by cell lysis. If, by
gene ontology, we saw that a protein mainly resides intracellularly, we assumed that
its presence in the media was not due to organized secretion, but rather cell lysis
and consequently knock down or knock out is unlikely to free up secretory capacity.
[0080] We assigned gene ontology terms to our top 20 proteins by using the GO Cross-species
Homology database (GO CHO,
https://ebdrup.biosustain.dtu.dk/gocho/) which extracts terms for a gene product by looking for similarly named gene product
in a selection of related organisms under the assumption that gene function, and thus
GO terms, are largely conserved between closely related species.
[0081] Using the GO CHO interface, we then exclude proteins from the top 20 list that are
not known to be present in the extracellular media
[0082] Known from the GeneOntology database. This is a database of manual (literature) and
electronic (literature and various methods) annotation for gene products..
[0083] Additionally we excluded proteins form the top 20 list, that are known to be essential
or important for the functioning of the cell.
[0084] These processes ultimately resulted in the identification of 6 secreted proteins.
(TIMP1, BGN, CTSD; NID1-1, NID1-2 and LGALS3BP).
[0085] The selected gRNA was synthesised as synthetic oligoes, annealed to make dsDNA, and
cloned into an expression vector using USER cloning (New England Biolabs) according
to the manufacturer's protocol.
[0086] First all 6 genes were knocked out individually in CHO-S cells. Thereafter 5 genes
were knocked out in the knock out cell line that already contained TIMP1 to create
a cell lines that contained disrupted coding sequences for all 6 genes.
[0087] The annealed oligo's were cloned into expression vectors using USER cloning, according
to the manufacturers (New England Biolabs) protocol.
Example 2. Preparation of CHO KO cells:
[0089] The sgRNA expression constructs were designed by fusing tracrRNA and crRNA into a
chimeric sgRNA (Jinek et al. , 2012) and located immediately downstream of a U6 promoter
(Chang et al. , 2013). Initially, the sgRNA expression cassette was synthesized as
a gBlock (Integrated DNA Technologies) and subcloned into the pRSFDuet-1 vector (Novagen,
Merck) using Kpnl and Hindlll restriction sites. This pRSFDuet-1/sgRNA expression
vector was used as backbone in a PCR-based uracil specific excision reagent (USER)
cloning method. This method was designed to easily and rapidly change the 19 bp-long
variable region (N19) of the sgRNA in order to generate our sgRNA constructs. From
the pRSFDuet-1/sgRNA expression vector, a 4221 bp-long amplicon (expression vector
backbone) was generated by PCR (1x: 98'C for 2 min; 30x: 98'C for 10 s, 57'C for 30
s, 72'C for 4 min 12 s; 1x: 72'C for 5 min) using two uracil-containing primers (sgRNA
Backbone fw and sgRNA Backbone rv, Integrated DNA Technologies, Table 2) and the X7
DNA polymerase. Subsequent to Fastdigest Dpnl (Thermo Fisher Scientific) treatment,
the amplicon was purified from a 2% agarose TBE gel using the QIAEX II Gel Extraction
Kit (Qiagen). In parallel, 54 bp-long and 53 bp-long single stranded oligos, (sense
and antisense strand, respectively) comprising the variable region of the sgRNA, were
synthesized (TAG Copenhagen, Table 2). Annealing of the sense and antisense single
stranded oligos (100 pM) were done with slow cooling from 95 °C to 25°C over 15 min
in a thermocycler in buffer 4 from New England Biolabs. The annealed oligos were then
mixed with the gel purified expression vector backbone and treated with USER enzyme
(New England Biolabs) according to manufacturer's recommendations. After USER enzyme
treatment, the reaction mixture was transformed into E. coli Mach1 competent cells
(Life Technologies) according to standard procedures. Transformant clones were selected
on 50 pg/mL Kanamycin (Sigma-Aldrich) LB plates. All constructs were verified by sequencing
and purified by NucleoBond Xtra Midi EF (Macherey-Nagel) according to manufacturer's
guidelines.
[0090] The CHO codon optimized Cas9 expression vector applied in the study is described
in
[0092] CHO cells e.g. CHO-S cells from Life Technologies were grown in appropriate medium
e.g. CD CHO medium (Life Technologies) supplemented with 8 mM L-Glutamine and cultivated
in shake flasks. The cells were incubated at 37°C, 5% CO2, with 120 rpm shaking and
passaged every 2-3 days. Transfection was performed with the expression vectors encoding
CHO optimized Cas9 and sgRNA targeting the respective KO sites, described above. For
each sample, 3 × 10
6 cells were transfected with a total of 3.75 pg of DNA. 16 hours post transfection,
the samples were incubated at 30°C for 32 hours before transferred back to 37°C. The
transfected cells were single cell sorted into 96 well plates, using Fluorescence-activated
cell sorting (FACS) to harvest GFP positive cells and cultivated at 37°C, 5% CO2 for
3 weeks. Drying out of the wells was avoided by topping up with fresh media as needed.
Growing clones were collapsed into single 96 well plates and a sample pr growing clone
was taken out for genomic analysis to identify gene KO's. Genomic DNA is prepared
using Quick Extract kit (Epicentre) according to the manufacturers recommendations.
Amplicons covering the targeted sites were made using target specific miSeg primers
(tableX) using Phusion polymerase in a thermocycler using the following programme:
1x: 95°C for 7 min; 20x: 95°C for 45 s, 69.5°C for 30 s (Δ -0.5°C/cycle) , 72°C for
30s; 1x: 72°C for 7 min
[0093] The resulting amplicons are purified using AMPure XP Beads (Beckman Coulter) according
to the manufacturers recommendations.
[0094] The purified amplicons are indexed using the Nextera XT indexing kit from Illumina
using the following protocol:
| Mix |
x1 |
| DNA |
5,0µL |
| Nextera XT Index 1 |
2,5µL |
| Nextera XT Index 2 |
2,5µL |
| 2x KAPA HiFI HotStart Ready Mix |
12,5µL |
| H2O |
2,5uL |
| |
25µl |
and run on a thermal cycler using the following program: 1x: 95°C for 3 min; 8x: 95°C
for 30 sec, 55°C for 30 sec 72°C for 30 sec; 1x: 72°C for 5 min
[0095] The resulting indexed libraries are purified using AMPure XP Beads (Beckman Coulter)
according to the manufacturers recommendations.
[0096] The libraries are normalized to 10nM, pooled and run on an miSeq DNA sequencer (Illumina)
using the following protocol:
Bring 10nM Library to 4nM by combining:
- 4µl of 10nM library
- 6µl Hybridisation buffer1 (HT1) (Illumina)
Combine together:
- 5µl of 4nM library
- 5µl of 0.2N NaOH
Vortex briefly and spin 1 minute at 280g.
Incubate for 5 minutes at room temperature for denaturing DNA into single strands.
Add 990µl pre-chilled HT1 (library is now 20pM & denatured in 1mM NaOH).
Kept on ice from now on.
[0097] The denatured libraries are diluted to 10pM with pre-chilled HT1 and spiked with
30µl denatured Illumina PhiX Control (Illumina), immediately loaded on a miSeg cartridge
and run on a Illumina miSeq Desktop sequencer.
[0098] After MiSeq the sequence data are analyzed by running them through a series of steps
executed using python. First paired sequences are merged into one sequence. Then sequences
are quality checked and trimmed or discarded if below threshold. Sequences are then
aligned to wild type version of the target sites. Each clone is then identified on
the basis of its unique index pair and analyzed in regard to the genomic situation
at each of the target sites. If it is clear that all target sites have been disrupted
in a manner that results in a frameshift and the data is generally plausible and clean,
the clone is saved for further propagation.
[0099] The KO's were all generated as single KO's first, and thereafter the TIMP1 KO was
used as basis for the KO of the other 5 gene targets in one single procedure, where
all 5 gRNA vectors were co-transfected with the Cas9 vector, to generate a CHO Cell
line with all 6 genes knocked out (6XSecKO), using the protocols described above.
Example 3: Characterization of CHO KO cells
[0100] The CHO cells were analyzed for growth and rituximab expression using the following
protocol:
Thaw cell from Cell Bank
Maintain and propagate cells for 3 passages
On Day 1: Wash cells and culture o/n without anti-clumping agent
On Day 2: Count cells and adjust to 1,000,000 cells/mL (3 × 125 mL shake flasks with
30 mL for each cell line), transfect cells with Rituximab expression vector according
to protocol described above
On day 3, Take out a 1 mL sample from each flask for cell count and determination
of transfection efficiency and replace with 1 mL fresh medium (containing 60 uL Anti-clumping
agent per mL)
On day 4 Take out a 1 mL sample from each flask for cell count and determination of
transfection efficiency and replace with 1 mL fresh medium
On day 5: Take out a 1 mL sample from each flask for cell count and determination
of transfection efficiency and replace with 1 mL fresh medium, and harvest supernatants
for analysis of Rituximab titer.
[0101] All incubations are done at 37°C, 5%CO2 120 rpm in a Multitron Incubator (Inforss).
[0102] Rituximab titers were determined using the Octet RED96 system from Pall according
to the manufacturer's recommendations for detection of IgG.
[0103] Cells were counted using a Nucleocounter NC-200 from Chemometec according to the
manufacturers recommendations for counting CHO cells
[0104] From figure 1 is seen a positive effect of knocking out all genes except maybe for
BGN.
[0105] From figure 2 is seen that the 6XSecKO reach significantly higher maximal cell densities
than the wt CHO-S cells. The gene KO that seems to be the major contributor to this
phenotype is the knocking out of the NID1-1 genes.
Example 4 Quantification of the host cell protein (HCP) content in the supernatant
[0106] Used media from CHO-S, 6XsecKO and 11XsecKO, was harvested and the HCP were TCA precipitated
using the following protocol:
- 1. Add 1 volume of TCA stock to 4 volumes of protein sample. (250 ul TCA + 1000 ul
SN) i.e. in 1.5ml tube with maximum vol., add 250µl TCA to 1.0ml sample.
- 2. Incubate 10 min at 4°C.
- 3. Spin tube in microcentrifuge at 14K rpm, 5 min.
- 4. Remove supernatant, leaving protein pellet intact. Pellet should be formed from
whitish, fluffy ppt.
- 5. Wash pellet with 200µl cold acetone.
- 6. Spin tune in microfuge at 14K rpm, 5min.
- 7. Repeat steps 4-6 for a total of 2 acetone washes.
- 8. Dry pellet by placing tube in 95°C heat block for 5-10 min to drive off acetone.
- 9. Dissolve the pellets in 200 ul 1% SDS, 50 mM TRIS, pH 7.5 at 95C
[0107] The protein content of the redissolved HCP was determined using Bradford Protein
assay (Pierce) according to the suppliers instructions, and using absorbation at 280
nm on a nanodrop (Thermo Fischer). The presence of HCP was also determined using an
antibody based CHO cell HCP kit (FortéBIO).
[0108] Analysis of the HCP present in the used media shows that the 2 KO's contain significantly
lower amounts of HCP in the used media that the CHO-S cell line (between approx 55%
and 70% compared to CHO-S,) and even less immunogenic HCP (approx 35%), shown by the
antibody based HCP assay. (See fig 3).
[0109] Analysis of the HCP present in used media from cells with the genes knocked out individually,
shows that especially TIMP1, BGN, NID1-1 and NID1-2 removal reduces the immunogenic
signal seen in the antibodybased assay (see Fig 4)
List of references
[0110]
WO 2007/006808 A1
WO 2012/082509 A2
Baycin-Hizal et al., J Proteome Res 2012;11(11):5265-5276
Chaudhuri et al., J Bioprocess Biotech 2015;5:7
Datta et al., Biotechnol Bioeng 2013;110(5):1255-1271
Kim et al., Appl Microbiol Biotechnol 2012;93:917-930
Slade et al., J Proteome Res 2012;11:6175-6186
Zhang, "Mammalian Cell Culture for Biopharmaceutical production", In: Manual of Industrial
Microbiology and Biotechnology, Edition: 3rd ed., Chapter: 12, Publisher: ASM Press,
Washington, DC., pp.157 - 178 (2010)
Lee et al., Biotechnology Journal 2015; 10(7):979-994
Walsh, Nat Biotechnol 2014;32:992-1000.