INDUCED PLURIPOTENT STEM CELLS – Regents of the University of …

This application is a continuation application and claims the benefit of priority of U.S. patent application Ser. No. 13/811,572, filed Apr. 5, 2013, which is a national stage application under 35 U.S.C. 371 of PCT/US2011/044995, filed Jul. 22, 2011, and published as WO 2012/012708 on Jan. 26, 2012, which claims priority from U.S. Provisional Application Ser. Nos. 61/366,821 filed Jul. 22, 2010 and 61/390,454 filed Oct. 6, 2010, which applications are herein incorporated by reference in their entirety.

This invention was made with Government support under United States Grant No. R01 DK082430-01 awarded by the National Institute of Diabetes and Digestive and Kidney Diseases, National Institutes of Health. The government has rights in the invention.

Nuclear reprogramming, the process of converting one cell type into another by resetting the pattern of gene expression, can be achieved through forced expression of defined transcription factors. One example is the induced pluripotent stem cells (iPSCs) prepared by transducing four genes (e.g., Oct4, Sox2, Kif4 and c-Myc, called OSKM hereafter) into a cell type to be dedifferentiated. iPSCs are a type of pluripotent stem cell artificially derived by reprogramming a somatic cell. iPSCs are morphologically similar to embryonic stem cells and are capable of differentiating into a variety of different somatic cell types. This technology allows researchers to obtain pluripotent stem cells for use in a research setting. iPSCs also have therapeutic uses for the treatment of disease without the need for stem cells derived from an embryonic source.

However, generally less than 1% of transduced cells are reprogrammed to form iPSCs, and the entire process of establishing iPSC clones is long (over a month).

Described herein is a novel approach to nuclear reprogramming using a fusion protein (a protein created through the joining of two or more genes or portions thereof in any orientation or copy number (e.g., from about 1 to about 2, about 3, about 4, about 5 or more copies of genes for example) which originally coded for separate proteins or portions thereof) of a transcription activation domain (TAD) of a gene, for example, MyoD and a transcription factor, for example, Oct4 (such a fusion protein is designated herein as M3O) that greatly improves the efficiency of reprogramming and accelerates iPSC production. iPSC colonies emerged five days after transduction of Sox2, Klf4 and c-Myc (SKM) and M3O into fibroblasts, with colonies rapidly enlarging in the absence of feeder cells. The pluripotency of iPSCs was confirmed by genome-wide gene expression analysis, teratoma formation, and chimera formation, including germline transmission. Transduction of M3O and SKM increased chromatin accessibility at the Oct4 promoter, facilitated recruitment of the Oct4-binding Paf1 complex, and remodeled many histone modifications at pluripotency genes to an embryonic stem cell (ESC)-like state more efficiently than transduction of OSKM. Thus, discussed herein is a novel approach to nuclear reprogramming in which a wide variety of TADs can be combined with related or unrelated transcription factors to reprogram the pattern of gene expression, with applications ranging from induction of pluripotency to direct transdifferentiation.

One embodiment provides iPSCs derived by nuclear reprogramming of a somatic cell with a fusion protein. The somatic cell can be a mammalian cell, for example a mouse cell or a human cell. One embodiment provides a fusion protein for induction of pluripotent stem cells. Another embodiment provides such a pluripotent stem cell, wherein the reprogramming comprises contacting the somatic cell with a fusion protein or DNA encoding the fusion protein. The disclosed methods and fusion proteins can be used to conveniently and reproducibly establish iPSCs having pluripotency and growth ability similar to that of ES cells (ESCs).

One embodiment provides a method for preparing an induced pluripotent stem cell by nuclear reprogramming of a somatic cell. which comprises introducing a nucleic acid sequence, by methods available to one of skill in the art, coding for a fusion protein of an unrelated/heterologous transactivation domain and a transcription factor into the somatic cell. One embodiment provides an induced pluripotent stem cell obtained by such a method. The fusion protein can be the fusion of an unrelated/heterologous transactivation domain and a transcription factor (e.g., the TAD is not normally associated with the transcription factor), such as the transactivation domain of MyoD (sequence information for MyoD is provided, for example, at NM_002478.4; NM_010866.2; NP_002469.2; NP_034996.2) or VP16 fused with Oct4 (full length or a bioactive fragment thereof; octamer-binding transcription factor 4 also known as POU5F1 (POU domain, class 5, transcription factor 1); sequence includes, for example, NM_002701; NM_013633.2; NP_002692; NP_038661.2; NM_001009178; NP_001009178; NM_131112; NP_571187). Additional trans-activating domains can include, for example, but are not limited to, those found in p53, VP16, MLL, E2A, HSF1, NF-IL6, NFAT1 and NF-B.

Additional factors to be introduced into the cell, and/or used to generate a fusion protein with a transactivation domain, can include, but is not limited to, a gene from the Sox family (e.g., SOX genes encode a family of transcription factors that bind to the minor groove in DNA, and belong to a super-family of genes characterized by a homologous sequence called the HMG (high mobility group) box and include, but are not limited to, SoxA, SRY (e.g., NM_003140.1; NM_011564; NP_003131.1; NP_035694), SoxB1, Sox1 (e.g., NM_005986), Sox2 (e.g., NM_003106; NM_011443; NP_003097; NP_035573). Sox3 (e.g., NM_005634; XM_988206; NP_005625; XP_993300), SoxB2, Sox14 (e.g., NM_004189; XM_284529; NP_004180; XP_284529), Sox21 (e.g., NM_007084; XM_979432; NP_009015; XP_984526), SoxC, Sox4 (e.g., NM_003107; NM_009238; NP_003098; NP_033264), Sox11 (e.g., XM_001128542; NM_009234; XP_001128542; NP_033260), Sox12 (e.g., NM_006943; XM_973626: NP_008874; XP_978720). SoxD, Sox5 (e.g., NM_006940; NM_011444; NP_008871; NP_035574), Sox6 (e.g., NM_017508; NM_001025560; NP_059978; NP_001020731), Sox13 (e.g., NM_005686; NM_011439; NP_005677; NP_035569), SoxE, Sox8 (e.g., NM_014587; NM_011447; NP_055402; NP_035577), Sox9 (e.g., NM_000346; NM_011448; NP_000337; NP_035578), Sox10 (e.g., NM_006941; XM_001001494; NP_008872; XP_001001494), SoxF, Sox7, Sox17, Sox18 (e.g., NM_018419; NM_009236; NP_060889; NP_033262), SoxG, Sox15 (e.g., NM_006942; NM_009235; NP_008873; NP_033261), SoxH, Sox30), the Klf (Krueppel-like factor) family (e.g., KLF1 (e.g., NM_006563), KLF2 (e.g., NM_016270; XM_982078; NP_057354; XP_987172), KLF3 (e.g., NM_016531; XM_994052; NP_057615; XP_999146), KLF4 (e.g., NM_004235; NM_010637; NP_004226; NP_034767), KLF5 (e.g., NM_001730; NM_009769; NP_001721; NP_033899), KLF6 (e.g., NM_001008490; NM_011803; NP_001008490; NP_035933), KLF7 (e.g., NM_003709; XM_992457; NP_003700; XP_997551), KLF8 (e.g., NM_007250; NM_173780; NP_009181; NP_776141), KLF9 (e.g., NM_001206; XM_988516; NP_001197; XP_993610), KLF10 (e.g., NM_001032282; NM_013692; NP_001027453; NP_038720), KLF11 (e.g., XM_001129527; NM_178357; XP_001129527: NP_848134), KLF12 (e.g., NM_016285; NM_010636; NP_057369; NP_034766), KLF13 (e.g., NM_015995; NM_021366; NP_057079; NP_067341). KLF14 (e.g., NM_138693; NM_001135093; NP_619638; NP_001128565), KLF15 (e.g., NM_014079; NM_023184; NP_054798; NP_075673), KLF16, KLF17 (e.g., NM_173484.3; NM_029416.2; NP_775755.3; NP_083692.2)), the Myc family (e.g., c-Myc (e.g., NM_002467.4; NM_010849; NP_002458.2; NP_034979)), nanog (e.g., NM_024865.2; NM_028016.2; NP_079141.2; NP_082292.1), Lin28 (e.g., NM_024674; NM_145833; NP_078950: NP_665832) or a combination thereof. Additionally, the cell can also be contacted with a cytokine, such as basic fibroblast growth factor (bFGF) and/or stem cell factor (SCF). In one embodiment, the somatic cell is further contacted with a DNA demethylation reagent.

One embodiment provides a somatic cell derived by inducing differentiation of an induced pluripotent stem cell as disclosed herein. One embodiment also provides a method for stem cell therapy comprising: (1) isolating and collecting a somatic cell from a subject; (2) inducing said somatic cell from the subject into an iPSC (3) inducing differentiation of said iPSCs, and (4) transplanting the differentiated cell from (3) into the subject (e.g., a mammal, such as a human).

FIGS. 1A, 1B, 1C, 1D, 1E, 1F, 1G and 1H. Establishment of mouse iPSCs with M3O-SKM. (A) Schematic drawing of MyoD-Oct4 chimeric constructs. Numbers indicate amino acid positions delimiting MyoD fragments. The basic helix-loop-helix (bHLH) domain of MyoD corresponds to amino acids 108-167, which was not used in these chimeric constructs. EO indicates a polypeptide consisting of one methionine and a chain of 20 glutamic acids fused to Oct4 (E for glutamic acid). Right column shows percentage of GFP-positive colonies derived from mouse embryonic fibroblasts (MEFs) transduced with each MyoD-Oct4 chimeric construct along with SKM and cultured on feeder cells (FIG. 1B, Protocol A). Data represent the meanSEM from three independent experiments. (B) Schematic drawings of two protocols for iPSC creation. Whereas transduced MEFs were transferred onto feeder cells on day 4 in Protocol A, MEFs were maintained feeder-free until the end of experiments in Protocol B. (C) Emergence of GFP-positive colonies obtained with M3O-SKM with Protocol B. Bar, 200 m. (D) Summary of the efficiency of making GFP-positive colonies with various combinations of the M3O, Sox2, Klf4, and c-Myc genes with Protocol B. Number of GFP-positive colonies peaked by day 14. (E) Drawings of various combinations of the M3 domain and Oct4. The efficiency of making GFP-positive colonies with Protocol B in the presence of SKM is shown on the right. (F) Drawings of TAD replacement constructs in which TADs of Oct4 were replaced with the M3 domain. Constructs were transduced with SKM. (G) Drawings of fusion constructs between the M3 domain and Sox2 or Klf4. Sox2 mutants were transduced with OKM or M3O-KM. The Klf4 mutant was transduced with OSM or M3O-SM. (H) Drawings of fusion constructs between Oct4 and TADs taken from other transactivators. Constructs were transduced with SKM.

FIGS. 2A, 2B and 2C. Characterization of mouse iPSCs prepared with M3O-SKM (M3O-iPSCs). (A) Comparison of GFP-positivity between colonies obtained with M3O-SKM and OSKM using Protocol B. Representative images of the GFP expression patterns used to categorize colonies are shown (top). Percentages of colonies with different GFP expression patterns were calculated from 300 colonies for M3O-SKM and OSKM (bottom). Bar, 200 m. (B) qRT-PCR analysis of expression levels of three pluripotency genes in MEFs and GFP-positive colonies obtained with M3O-SKM and OSKM. PCR primers specific to endogenous Oct4 and Sox2 were used for these two genes. Although GFP-positive colonies were harvested on different days based on the time when the GFP signal first emerged for M3O-SKM (day 5) and OSKM (day 10), the intervals between time points is equivalent (bottom of graphs). Expression level of each gene in ESCs (CGR8.8 cells) was defined as 1.0. Five colonies were examined for each condition. Results represent the mean+SEM of three independent experiments. (C) qRT-PCR analysis of expression levels of three fibroblast-enriched genes in MEFs and GFP-positive colonies obtained with M3O-SKM and OSKM.

FIGS. 3A, 3B, 3C, 3D, 3E and 3F. Verification of pluripotency of mouse M3O-iPSCs. (A) Expression level of transcripts in M3O-iPSCs and ESCs relative to MEFs. Log 2 ratios are plotted for transcripts in ESCs/MEFs and iPSCs/MEFs. Red lines indicate a 4-fold difference in transcript levels. Transcripts in M3O-iPSCs were assayed 60 days after transduction. (B) Hematoxylin and eosin staining of teratoma sections derived from M3O-iPSCs. Neural tube and epidermis (ectoderm), striated muscle and bone (mesoderm), and mucous gland and respiratory epithelium (endoderm) are shown. Bar, 50 m. (C) X gal staining for cells expressing the lacZ gene in a chimeric embryo prepared with M3O-iPSCs and a control embryo at 13.5 dpc. (D) Chimeric mice prepared with M3O-iPSCs. The agouti coat color indicates a high (right) and low (left) contribution of iPSCs to the skin. The host embryos used to generate mice were derived from the albino mouse strain ICR. (E) Germline contribution of M3O-iPSCs as shown by GFP expression in the gonad of a 13.5 dpc chimeric embryo. (F) Pups obtained from crossing a wild-type ICR female (bottom) with an M3O-iPSC chimeric male (left mouse in panel D).

FIGS. 4A, 4B, 4C, 4D, 4E and 4F. Characterization of human iPSCs established with M3O-SKM. (A) Immunofluorescence staining of NANOG and SSEA4 in human iPSC colonies on day 8 and 15 obtained with M3O-SKM without subculture after day 3 when transferred onto Matrigel. Bar, 100 m for (A) and (B). Note that day 15 colonies are substantially larger than day 8 colonies as indicated by the different magnifications. (B) Comparison of the efficiency of making NANOG-positive colonies between M3O-SKM and OSKM. The number of NANOG-positive colonies was divided by the number of seeded dermal fibroblasts at each time point. (C) Immunofluorescence staining of pluripotency markers in cloned human iPSCs obtained with M3O-SKM on day 28 after four passages. (D) Quantitative RT-PCR analysis of pluripotency genes expressed in cloned human iPSCs prepared with M3O-SKM. Ten colonies were harvested on day 30 and the mean+SEM was obtained. The expression level of each gene in human ESCs H9 was defined as 1.0. Endogenous genes were amplified for OCT4, SOX2. KLF4 and c-MYC. (E) Karyotype analysis of a human iPSC established with M3O-SKM. (F) Hematoxylin and eosin staining of teratoma sections derived from human iPSCs prepared with M3O-SKM. Bar, 100 m.

FIGS. 5A, 5B, 5C, 5D, 5E and 5F. Chromatin analyses of the Oct4 gene in MEFs transduced with M3O-SKM (M3O-MEFs) and those with OSKM (O-MEFs). (A) DNA methylation patterns at the proximal promoter of the Oct4 gene analyzed with bisulfite sequencing. Black circles indicate methylated CpG and open circles, unmethylated CpG. The proportion of unmethylated CpG sites was calculated by dividing the number of unmethylated CpG sites by the total number of CpG sites in each cell type. (B) Flow cytometry of O-MEFs and M3O-MEFs prepared with Protocol B and harvested on day 9. (C) ChIP analyses of the binding levels of Oct4, Sox2, and the Paf1 complex subunits at the distal enhancer (Region 1) and initiation site (Region 2) of the Oct4 gene in M3O-MEFs and 0-MEFs. Data represent the mean+SEM of three independent experiments. All y axes indicate relative enrichment (fold). Relative enrichment in ESCs was defined as 1.0. ESCs and MEFs were mixed at a 13:87 ratio in the sample labeled as ESCs+MEFs (blue). The difference of the values between the two samples indicated by an asterisk was statistically significant (p<0.01). (D) Analyses of the accessibility of the restriction enzyme NsiI to chromatin at the distal enhancer of the Oct4 gene by Southern blotting. Locations of the enzyme recognition site and probe are shown in relation to the distal enhancer of the Oct4 gene (top). The transcription initiation site was defined as position 1. Appearance of new DNA fragments following digestion with NsiI are shown (bottom). Percentage of digested chromatin was obtained by dividing the combined signal intensity of the bands at 752 and 652 bp by the combined signal intensity of the two bands and the band at 1404 bp. Cloned O-iPSCs and M3O-iPSCs were used for day 30 lanes. GFP-negative population was collected by a FACS and analyzed for the day 9 GFP () lane of M3O-MEFs (far right). (E) ChIP analyses of the levels of three histone modifications associated with active genes at the initiation site (Region 2) and a coding region (Region 3) of the Oct4 gene. (F) ChIP analyses of the levels of two histone modifications associated with inactive genes at a coding region of the Oct4 gene (Region 3). Relative enrichment in MEFs was defined as 1.0.

FIGS. 6A, 6B, 6C, 6D and 6E. Effects of M3O-SKM and OSKM on expression of pluripotency markers and cell proliferation. (A) Temporal profiles of expression patterns of alkaline phosphatase. Bar, 100 m. (B) Temporal profiles of expression patterns of SSEA1. Bar, 100 m. (C) Flow cytometry comparing the expression level of SSEA1 between MEFs transduced with OSKM and those transduced with M3O-SKM. (D) Cell proliferation patterns of MEFs transduced with M3O or Oct4. Means+SEM of three independent experiments are shown. (E) Cell proliferation patterns of MEFs transduced with M3O-SKM or OSKM.

FIGS. 7A, 7B, 7C, 7D, 7E and 7F. Chromatin analyses of day 9 at the Oct4 gene comparing transduction of MEFs with different gene combinations. (A) Flow cytometry of MEFs transduced with M3O-SK and OSK. (B) DNA methylation analysis by bisulfite sequencing. MEFs were transduced with one (1F), two (2F), or three (3F) transcription factor genes. (C) ChIP studies on transcription factor binding at the distal enhancer. (D) Chip analyses on histone modifications associated with active genes. (E) ChIP studies on histone modifications associated with suppressed genes. (F) Hypothetical summary of epigenetic remodeling induced by M3O-SKM (right) in comparison to the lack of remodeling with OSKM (left). Binding sites for Oct4 and Sox2 are located adjacent to each other at the distal enhancer of Oct41. Transduced Oct4 and Sox2 cannot bind to their respective binding sites (blue box and gray box, respectively) in the majority of O-MEFs due to condensed chromatin. In contrast, M3O and Sox2 can effectively bind to each binding site in M3O-MEFs through the effects of the unidentified binding proteins to the MyoD TAD domain. Recruitment of these proteins eventually contributes to DNA demethylation at the proximal promoter and a histone modification pattern typical of active genes at the coding region.

FIG. 8. Immunoblotting of MyoD-Oct4 fusion proteins. Expression of transduced MyoD-Oct4 fusion genes was evaluated with an antibody against Oct4 (top). Expression of histone H2A was examined as a loading control (bottom). Bands correspond to the predicted molecular mass of each protein. Identities of extra bands marked with asterisks are unknown.

FIGS. 9A, 9B and 9C. Chip analyses of the Sox2 gene. (A) Binding of Oct4 and Sox2 at the enhancer. (B) Binding of parafibromin and the levels of histone modifications associated with active genes on day 9. (C) Levels of histone modifications associated with suppressive genes on day 9.

FIGS. 10A and 10B. ChIP analyses on day 9 of the Oct4 gene comparing transduction of one (1F), two (2F), three (3F) and four (4F) transcription factor genes. (A) Transcription factor binding. (B) Histone modifications associated with gene activation.

FIGS. 11A and 11B. ChIP analyses on day 9 of the Sox2 gene comparing transduction of one (1F), two (2F), three (3F) and four (4F) transcription factor genes. (A) Transcription factor binding at the enhancer. (B) Histone modifications associated with gene activation and suppression.

iPSC technology is the process of converting an adult specialized cell, such as a skin cell, into a stem cell, a process known as dedifferentiation. iPSCs can be very useful in clinical as well as preclinical settings. For example, iPSCs can be created from human patients and differentiated into many tissues to provide new materials for autologous transplantation, which can avoid immune rejection of the transplanted tissues. For example, pancreatic beta cells differentiated from a patient's iPSCs can be transplanted into the original patient to treat diabetes. Also, iPSCs derived from a patient can be differentiated into the ailing tissue to be used in an in vitro disease model. For example, study of dopaninergic neurons differentiated from a Parkinson's disease patient can provide unprecedented clues for the pathogenesis of the disease. In vitro-differentiated cells derived from iPSCs can be used for drug screening. For instance, many drugs are metabolized in the liver, but there have been no ideal liver cells that can be cultured for a long term for in vitro screening of drug toxicity. Also, iPSCs provide a new opportunity to understand the mechanisms underlying the plasticity of cell differentiation. Thus, the potential of iPSCs for many fields of life science is tremendous.

However, the process of generating iPSCs is slow and inefficient. With the standard protocol, MEFs are transduced with OSKM on day 1 and the cells are transferred onto feeder cells composed of irradiated fibroblasts, which provide a poorly characterized, but optimal environment for the generation of iPSCs, on day 5. iPSC colonies emerge around day 10, which are then picked up and expanded over the next two to three weeks on feeder cells to establish purified iPSC lines. Eventually, only 0.1% of the transduced fibroblasts turn into iPSCs. This slow process and extremely low efficiency make production of iPSCs costly.

It is disclosed herein that a fusion protein combining, for example, the stem cell factor Oct4 (a homeodomain transcription factor associated with undifferentiated cells) with a portion of another protein factor, for example, a transactivation domain, such as that of MyoD, can accelerate the process of making iPSCs. It is also shown herein that heterologous transactivation domains, including the MyoD TAD, promote global chromatin remodeling of stem cell genes. Thus, the process disclosed herein improves the efficiency and quality of iPSCs.

As used herein, the terms below are defined by the following meanings:

Induced pluripotent stem cells, commonly abbreviated as iPSCs, are a type of pluripotent stem cell obtained from a non-pluripotent cell, typically an adult somatic cell (a cell of the body, rather than gametes or an embryo), by inducing a forced expression of certain genes. iPSCs are believed to be similar to natural pluripotent stem cells, such as ESCs in many respects, such as the expression of certain stem cell genes and proteins, chromatin methylation patterns, doubling time, embryoid body formation, teratoma formation, viable chimera formation, and potency and differentiability.

iPSCs are not adult stem cells, but rather reprogrammed cells (e.g., epithelial cells) given pluripotent capabilities. Using genetic reprogramming with protein transcription factors, pluripotent stem cells equivalent to embryonic stem cells have been derived from human adult skin tissue. Shinya Yamanaka and his colleagues at Kyoto University used the transcription factors Oct3/4, Sox2, c-Myc, and Klf4 in their experiments on cells from humans. Junying Yu, James Thomson, and their colleagues at the University of Wisconsin-Madison used a different set of factors, Oct4, Sox2, Nanog and Lin28, and carried out their experiments using cells from human foreskin to generate iPS cells.

The term isolated refers to a factor(s), cell or cells which are not associated with one or more factors, cells or one or more cellular components that are associated with the factor(s), cell or cells in vivo.

Cells include cells from, or the subject is, a vertebrate, such as a mammal, including a human. Mammals include, but are not limited to, humans, farm animals, sport animals and companion animals. Included in the term animal is dog, cat, fish, gerbil, guinea pig, hamster, horse, rabbit, swine, mouse, monkey (e.g., ape, gorilla, chimpanzee, orangutan), rat, sheep, goat, cow and bird.

An effective amount generally means an amount which provides the desired local or systemic effect and/or performance.

Pluripotency refers to a stem cell that has the potential to differentiate into one, two or three of the three germ layers: endoderm (e.g., interior stomach lining, gastrointestinal tract, the lungs), mesoderm (e.g., muscle, bone, blood, urogenital), or ectoderm (e.g., epidermal tissues and nervous system). Pluripotent stem cells can give rise to any fetal or adult cell type.

Transdifferentiation is when a non-stem cell transforms into a different type of cell, or when an already differentiated stem cell creates cells outside its already established differentiation path.

A transcription factor (sometimes called a sequence-specific DNA-binding factor) is a protein that binds to specific DNA sequences, thereby controlling the transfer (or transcription) of genetic information from DNA to mRNA. Transcription factors perform this function alone or with other proteins or factors in a complex, by promoting (as an activator), or blocking (as a repressor) the recruitment of RNA polymerase (the enzyme that performs the transcription of genetic information from DNA to RNA) to specific genes. Generally, a defining feature of transcription factors is that they contain one or more DNA-binding domains (DBDs), which attach to specific sequences of DNA adjacent to the genes that they regulate.

A transcription activation domain, transactivation domain or trans-activating domain is generally that portion of a transcription factor that is responsible for recruitment of the transcription machinery needed to transcribe RNA. Transactivation is an increased rate of gene expression triggered either by biological processes or by artificial means. Transactivation can be triggered either by endogenous cellular or viral proteinstransactivators. These protein factors act in trans (i.e., intermolecularly). An unrelated or heterologous transactivation domain refers to a transactivation domain that is not normally associated with the gene/protein (e.g., transcription factor) of interest (not wild-type).

By pure it is meant that the population of cells has the desired purity. For example, iPSC populations can comprise mixed populations of cells. Those skilled in the art can readily determine the percentage of iPSCs in a population using various well-known methods, such as fluorescence activated cell sorting (FACS). Preferable ranges of purity in populations comprising iPSCs are about 1 to about 5%, about 5 to about 10%, about 10 to about 15%, about 15 to about 20%, about 20 to about 25%, about 25 to about 30%, about 30 to about 35%, about 35 to about 40%, about 40 to about 45%, about 45 to about 50%, about 50 to about 55%, about 55 to about 60%, about 60 to about 65%, about 65 to about 70%, about 70 to about 75%, about 75 to about 80%, about 80 to about 85%, about 85 to about 90%, about 90% to about 95% or about 95 to about 100%. Purity of the cells can be determined for example according to the cell surface marker profile within a population.

The terms comprises. comprising, and the like can have the meaning ascribed to them in U.S. Patent Law and can mean includes, including and the like. As used herein, including or includes or the like means including, without limitation.

Rapid and Efficient Production of iPSCs

Through the processes disclosed herein, iPSC colonies emerge as early as about five days (day 5) after transduction of a transactivator domain (or a portion thereof) fused to a transcription factor (or a portion thereof), e.g., M3O (short transactivation domain of MyoD (about 50 to 60 amino acids) fused to the amino terminus of the full-length Oct4), Sox2, Klf4, and c-Myc without feeder cells. The preparation of the nucleic acid molecule coding for the fusion protein(s) as well as the construct(s) of Sox, Klf, c-Myc etc. (either singly or on a polycistronic RNA) can be carried out by methods available to an art worker as well as the transduction thereof into cells (see, for example, Sambrook, Molecular Cloning: A Laboratory Manual).

iPSCs established with the standard OSKM protocol frequently contain partially reprogrammed cells and even established iPSCs occasionally lose pluripotency during prolonged cultures. In contrast, the iPSCs disclosed herein retain pluripotency more tightly and heterogeneity among different colonies is much less apparent than that with the OSKM iPSCs. In addition, iPSC colonies can be obtained without c-Myc (use only M3O, Sox2 and Klf4) at the efficiency of 0.44% around day 7. iPSCs have been prepared without c-Myc (use OSK) before, but the efficiency was low (<0.01%) and it generally took 30 to 40 days for iPSCs to emerge2,3. Additionally, this transactivation domain-based strategy can be applied to amplify the effects of other transcription factors to facilitate their reprogramming capability of cell differentiation. In summary, the use of a TAD, such as the M3 domain, has made iPSC production faster, easier, feeder-free and more efficient than the standard OSKM or other protocols.

Thus, as discussed above, the fusion technology, such as the M3O, technology disclosed herein has significant advantages over wild-type Oct4 (or other transcription factors) in generating iPSCs. First, the fusion technology is faster. While iPSC colonies appear at about day 10 with the standard OSKM protocol (see, Cell Stem Cell 2008, 3, 595 for a general protocol for making iPSCs), iPSC colonies emerge on day 5 with the fusion technology (e.g., M3O-SKM). Second, efficiency of making iPSCs is more than 50-fold higher with the fusions technology (e.g., M3O-SKM) than that with OSKM. Third, purer iPSCs populations can be obtained with the fusions technology described herein (e.g., M3O-SKM) compared with OSKM. Fourth, the fusion technology described herein (e.g., M3O-SKM) does not require feeder cells unlike OSKM. This is noted especially for making iPSCs for transplantation purposes because one would generally need to use patient-derived fibroblasts as feeder cells to avoid immune rejection. Also, the use of feeder cells adds an extra step to make iPSCs. Feeder-free iPSCs have been reported, but they are derived from already undifferentiated cells, such as adipose stem cells. Fibroblasts generally require feeder cells to become iPSCs. Finally, iPSCs can be prepared using only M3O, Sox2 and Klf4 (without c-Myc).

Generally, genes which can be used to create induced pluripotent stem cells, either singly, in combination or as fusions with transactivation domains, include, but are not limited to, one or more of the following: Oct4 (Oct3/4, Pou5f1), Sox (e.g., Sox1, Sox2, Sox3, Sox18, or Sox15), Klf (e.g., Klf4, Klf1, Klf3, Klf2 or Klf5), Myc (e.g., c-myc, N-myc or L-myc), nanog, or LIN28. As examples of sequences for these genes and proteins, the following accession numbers are provided: Mouse MyoD: M84918, NM_010866; Mouse Oct4 (POU5F1): NM_013633; Mouse Sox2: NM_011443; Mouse Klf4: NM_010637; Mouse c-Myc: NM_001177352, NM_001177353, NM_001177354 Mouse Nanog: NM_028016; Mouse Lin28: NM_145833: Human MyoD: NM_002478; Human Oct4 (POU5F1): NM_002701, NM_203289, NM_001173531; Human Sox2: NM_003106; Human Klf4: NM_004235; Human c-Myc: NM_002467; Human Nanog: NM_024865; and/or Human Lin28: NM_024674, for portions or fragments thereof and/or any related sequence available to an art worker (these sequences are incorporated by referenced herein). For example, sequences for use in the invention have at least about 50% or about 60% or about 70%, about 71%, about 72%, about 73%, about 74%, about 75%, about 76%, about 77%, about 78%, or about 79%, or at least about 80%, about 81%, about 82%, about 83%, about 84%, about 85%, about 86%, about 87%, about 88%, or about 89%, or at least about 90%, about 91%, about 92%, about 93%, or about 94%, or at least about 95%, about 96%, about 97%, about 98%, or about 99% sequence identity compared to the sequences and/or accession numbers provided herein and/or any other such sequence available to an art worker, using one of alignment programs available in the art using standard parameters or hybridization techniques. In one embodiment, the differences in sequence are due to conservative amino acid changes. In another embodiment, the protein sequence or DNA sequence has at least 80% sequence identity with the sequences disclosed herein and is bioactive (e.g., retains activity).

Methods of alignment of sequences for comparison are available in the art. Thus, the determination of percent identity between any two sequences can be accomplished using a mathematical algorithm. Computer implementations of these mathematical algorithms can be utilized for comparison of sequences to determine sequence identity. Such implementations include, but are not limited to: CLUSTAL in the PC/Gene program (available from Intelligenetics, Mountain View, Calif.); the ALIGN program (Version 2.0) and GAP, BESTFIT, BLAST, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Version 8 (available from Genetics Computer Group (GCG), 575 Science Drive. Madison, Wis., USA). Alignments using these programs can be performed using the default parameters.

During and after preparation of iPSCs, the cells can be cultured in culture medium that is established in the art and commercially available from the American Type Culture Collection (ATCC), Invitrogen and other companies. Such media include, but are not limited to, Dulbecco's Modified Eagle's Medium (DMEM), DMEM F12 medium, Eagle's Minimum Essential Medium, F-12K medium, Iscove's Modified Dulbecco's Medium, Knockout DMEM, or RPMI-1640 medium. It is within the skill of one in the art to modify or modulate concentrations of media and/or media supplements as needed for the cells used. It will also be apparent that many media are available as low-glucose formulations, with or without sodium pyruvate.

Also contemplated is supplementation of cell culture medium with mammalian sera. Sera often contain cellular factors and components that are needed for viability and expansion. Examples of sera include fetal bovine serum (FBS), bovine serum (BS), calf serum (CS), fetal calf serum (FCS), newborn calf serum (NCS), goat serum (GS), horse serum (HS), human serum, chicken serum, porcine serum, sheep serum, rabbit serum, rat serum (RS), serum replacements (including, but not limited to, KnockOut Serum Replacement (KSR, Invitrogen)), and bovine embryonic fluid. It is understood that sera can be heat-inactivated at 55-65 C. if deemed needed to inactivate components of the complement cascade. Modulation of serum concentrations, or withdrawal of serum from the culture medium can also be used to promote survival of one or more desired cell types. In one embodiment, the cells are cultured in the presence of FBS/or serum specific for the species cell type. For example, cells can be isolated and/or expanded with total serum (e.g., FBS) or serum replacement concentrations of about 0.5% to about 5% or greater including about 5% to about 15% or greater, such as about 20%, about 25% or about 30%. Concentrations of serum can be determined empirically.

Additional supplements can also be used to supply the cells with trace elements for optimal growth and expansion. Such supplements include insulin, transferrin, sodium selenium, and combinations thereof. These components can be included in a salt solution such as, but not limited to, Hanks' Balanced Salt Solution (HBSS), Earle's Salt Solution, antioxidant supplements, MCDB-201 supplements, phosphate buffered saline (PBS), N-2-hydroxyethylpiperazine-N-ethanesulfonic acid (HEPES), nicotinamide, ascorbic acid and/or ascorbic acid-2-phosphate, as well as additional amino acids. Many cell culture media already contain amino acids; however some require supplementation prior to culturing cells. Such amino acids include, but are not limited to, L-alanine, L-arginine. L-aspartic acid. L-asparagine, L-cysteine, L-cysteine, L-glutamic acid, L-glutamine, L-glycine, L-histidine, L-inositol, L-isoleucine. L-leucine, L-lysine. L-methionine, L-phenylalanine, L-proline, L-serine, L-threonine, L-tryptophan, L-tyrosine, and L-valine.

Antibiotics are also typically used in cell culture to mitigate bacterial, mycoplasmal, and fungal contamination. Typically, antibiotics or anti-mycotic compounds used are mixtures of penicillin/streptomycin, but can also include, but are not limited to, amphotericin (Fungizone), ampicillin, gentamicin, bleomycin, hygromycin, kanamycin, mitomycin, mycophenolic acid, nalidixic acid, neomycin, nystatin, paromomycin, polymyxin, puromycin, rifampicin, spectinomycin, tetracycline, tylosin, and zeocin.

Hormones can also be advantageously used in cell culture and include, but are not limited to, D-aldosterone, diethylstilbestrol (DES), dexamethasone, -estradiol, hydrocortisone, insulin, prolactin, progesterone, somatostatin/human growth hormone (HGH), thyrotropin, thyroxine, and L-thyronine. -mercaptoethanol can also be supplemented in cell culture media.

Lipids and lipid carriers can also be used to supplement cell culture media, depending on the type of cell and the fate of the differentiated cell. Such lipids and carriers can include, but are not limited to cyclodextrin (, , ), cholesterol, linoleic acid conjugated to albumin, linoleic acid and oleic acid conjugated to albumin, unconjugated linoleic acid, linoleic-oleic-arachidonic acid conjugated to albumin, oleic acid unconjugated and conjugated to albumin, among others. Albumin can similarly be used in fatty-acid free formulation.

Cells in culture can be maintained either in suspension or attached to a solid support, such as extracellular matrix components and synthetic or biopolymers. Cells often require additional factors that encourage their attachment to a solid support (e.g., attachment factors) such as type I, type II, and type IV collagen, concanavalin A, chondroitin sulfate, fibronectin, superfibronectin and/or fibronectin-like polymers, gelatin, laminin, poly-D and poly-L-lysine, Matrigel, thrombospondin, and/or vitronectin.

Cells can be cultured at different densities, e.g., cells can be seeded or maintained in the culture dish at different densities. For example, for cells to be dedifferentiated or iPSCs, the cells can be seeded or maintained at low or high cell densities. For example, at densities, including, but not limited to, densities of less than about 2000 cells/well of a 12-well plate (for example, 12-well flat-bottom growth area: 3.8 cm2 well volume: 6.0 ml or well IDdepth (mm) 22.117.5, well capacity (ml) 6.5, growth area (cm2) 3.8), including less than about 1500 cells/well of a 12-well plate, less than about 1,000 cells/well of a 12-well plate, less than about 500 cells/well of a 12-well plate, or less than about 200 cells/well of a 12-well plate. The cells can also be seeded or maintained at higher densities, for example, great than about 2,000 cells/well of a 12-well plate, greater than about 2,500 cells/well of a 12-well plate, greater than about 3,000 cells/well of a 12-well plate, greater than about 3,500 cells/well of a 12-well plate, greater than about 4,000 cells/well of a 12-well plate, greater than about 4,500 cells/well of a 12-well plate, greater than about 5,000 cells/well of a 12-well plate, greater than about 5,500 cells/well of a 12-well plate, greater than about 6,000 cells/well of a 12-well plate, greater than about 6,500 cells/well of a 12-well plate, greater than about 7,000 cells/well of a 12-well plate, greater than about 7,500 cells/well of a 12-well plate or greater than about 8,000 cells/well of a 12-well plate.

The maintenance conditions of cells cultures can also contain cellular factors that allow cells, such as the iPSCs of the invention, to remain in an undifferentiated form. It may be advantageous under conditions where the cell must remain in an undifferentiated state of self-renewal for the medium to contain epidermal growth factor (EGF), platelet derived growth factor (PDGF), leukemia inhibitory factor (LIF), basic fibroblast growth factor (bFGF) and combinations thereof. It is apparent to those skilled in the art that supplements that allow the cell to self-renew (e.g., to produce replicate daughter cells having differentiation potential that is identical to those from which they arose; a similar term used in this context is proliferation), but not differentiate should be removed from the culture medium prior to differentiation. It is also apparent that not all cells will require these factors.

The following examples are provided in order to demonstrate and further illustrate certain embodiments and aspects of the present invention and are not to be construed as limiting the scope thereof.

Full-length and deletion mutants of mouse Oct4 cDNA were fused with various TADs and inserted into the pMXs-IP vector4. Polycistronic cDNAs encoding Sox2, Klf4 and c-Myc were transferred from the 4F2A lentiviral vector5 to the pMXs-IP vector, pMXs-IP vectors encoding OSKM separately (Addgene) were also used in some experiments. These pMXs-IP vectors were transfected into Plat-E cells6 with Fugene 6 (Roche). Virus supernatant was harvested 48 and 72 hr later and filtered through a 0.45 Cpm syringe filter. MEFs were prepared from Oct4-GFP mice which harbour an IRES-green fluorescence protein (GFP) fusion cassette downstream of the stop codon of the Oct4 gene (Jackson Laboratory #008214)7. All animal experiments were conducted in accordance with the animal experiment guidelines of University of Minnesota. For chimera experiments, MEFs were prepared from mice that harbour the Oct4-GFP allele and ROSA26-lacZ allele. MEFs were seeded at 3105 cells/6 cm dish on day 2 in DMEM with 10% fetal bovine serum (FBS). Fresh virus supernatant was added to MEFs on day 1 and day 0 with 10 g/ml polybrene. Culture medium was then changed to iPSC medium (DMEM, 15% fetal bovine serum, 100 M MEM non-essential amino acids, 55 M 2-mercaptoethanol, 2 mM L-glutamine and 1000 u/ml leukemia inhibitory factor) on day 1. Transduced MEFs were subcultured onto irradiated SNL feeder cells at 2105 cells/6 cm dish on day 4 and maintained on the feeder cells in Protocol A. The maximum number of GFP-positive colonies obtained around day 18 was divided by 2105 to obtain the efficiency of making iPSCs. In Protocol B, transduced MEFs were maintained without feeder cells. GFP-positive colonies were picked up around day 10 to clone without feeder cells for pluripotency analyses. Retrovirus titer was measured using NIH3T3 cells as described 8. All recombinant DNA research was conducted following the NIH guidelines.

Preparation of Human iPSCs

Full-length human OCT4 cDNA fused with the M3 domain of human MYOD at the amino terminus was inserted into the pMXs-IP vector. pMXs-IP vectors encoding human M3O, OCT4, SOX2, KLF4 and c-MYC (Addgene) were transfected into Plat-A cells (Cell Biolabs) with Lipofectamin 2000 (Invitrogen). Virus supernatant was harvested 48 and 72 hrs later (day 1 and 0, respectively below), filtered through a 0.45 m syringe filter and transduced into dermal fibroblasts obtained from a 34-year-old Caucasian female (Cell Applications). On day 2, 2.7104 fibroblasts were plated in each well of a 12-well plate in DMEM with 10% fetal bovine serum. Fresh virus supernatant was added to the fibroblasts on day 1 and day 0 with 10 g/ml polybrene. On day 3 cells were harvested with trypsin and subcultured at 1.7104 cells per well in 12-well plates coated with BD Matrigel hESC-qualified Matrix (BD Biosciences) in human iPSC medium (KnockOut DMEMF-12 (Invitrogen), 20% Knockout Serum Replacement (Invitrogen), 100 M MEM non-essential amino acids, 1% insulin-transferrin-selenium (Invitrogen), 0.1 mM 2-mercaptoethanol, 2 mM L-glutamine and 4 ng/ml basic FGF). The medium was changed every other day.

One million cells were resuspended in ice-cold lysis buffer containing 0.1% NP40 and incubated on ice for 5 min as previously described9. Nuclei were isolated with centrifugation at 4,000g for 5 min and digested with 200 u/ml NsiI for 2 hr at 37 C. DNA was purified and double-digested with MspI and BamHI, followed by Southern blotting using the radioactive probe shown in FIG. 5D.

MEFs were transduced with MyoD-Oct4 fusion genes and analyzed with immunoblotting five days after transduction. All antibodies are listed in supplemental Table 1. SuperSignal West Dura (Thermo Scientific) was used to detect chemiluminescence signal.

iPSCs were fixed with 4% formaldehyde for 10 min and permeabilized with 0.5% Triton X-100 for 3 min. Cells were then incubated with primary antibody and secondary antibody for 1 hr each at 25 C. DNA was counterstained with Hoechst 33342. Used antibodies are listed in Table 1. Fluorescence signal was captured with a 10 A-Plan Phi Var1 objective (numerical aperture 0.25) and an AxioCam charge coupled device camera attached to an Axiovert 200M fluorescence microscope (all from Zeiss). Photoshop 7.0 (Adobe Systems) was used for image processing.

Alkaline phosphatase was detected with an Alkaline Phosphatase Detection Kit (Millipore SCR004).

The percentage of GFP-positive or SSEA1-positive cells at each time point was determined with a FACSCalibur flow cytometer and analyzed using CellQuest Pro software (both BD Biosciences).

Quantitative RT-PCR (qRT-PCR)

cDNA for mRNA was prepared from iPSC colonies using a Cells-to-cDNA II kit (Ambion). qRT-PCR was performed with GoTaq qPCR Master mix (Promega) on a Realplex 2S system (Eppendorf). PCR primer sequences are listed in Table 2. Expression level of glyceraldehyde 3-phosphate dehydrogenase (GAPDH) was used to normalize the expression levels of mRNAs. The feeder-free ESC line CGR8.8 was used as a positive control.

RNA was prepared from CGR8.8 cells, MEFs, and a mouse iPSC clone prepared with the fusion gene between the M3 domain of MyoD and Oct4 (M3O-iPSC) on day 60 with the PureLink RNA total RNA purification system (Invitrogen). RNA was amplified and labeled using the Agilent Quick AmpLabeling Kit (Agilent Technologies) following the manufacturer's protocol. cRNA was hybridized overnight to Agilent Whole Murine Genome Oligo Microarray using the Agilent Gene Expression Hybridization Kit. The fluorescence signals of the hybridized microarrays were detected using Agilent's DNA Microarray Scanner. The Agilent Feature Extraction Software was used to read out and process the image files. Data were processed and visualized with Spotfire DecisionSite for Functional Genomics software. DNA microarray data have been deposited in the NCBI GEO database under the accession number GSE22327.

Adherent cells were arrested with colcemid, harvested, treated with 75 mM KCl hypotonic solution, and fixed with methanol and acetic acid at 3:1. The cells were spread onto glass slides and stained with Wright-Giemsa stain. G-banded metaphases were evaluated using an Olympus BX61 microscope outfitted with 10 and 100 objectives. Metaphase cells were imaged and karyotyped using Applied Spectral Imaging (ASI) software.

Ten M3O-iPSCs of a cloned line were transferred into a microdrop of KSOMaa solution (Millipore) with a zona-free 8-cell stage mouse embryo of the ICR strain (albino) after brief exposure to acidic Tyrode's solution (Millipore). Aggregated morula stage embryos at 2.5 days post coitum (dpc) that contained GFP-positive iPSCs were transferred into the uteri of 2.5 dpc pseudopregnant recipient mice. Embryos at 13.5 dpc were analyzed for chimera formation with X gal stain or for germline transmission with a fluorescence microscope. To prepare teratomas, one million cloned mouse or human M3O-iPSCs were injected into the limb muscle of NOD/SCID mice. Teratomas were fixed with 10% formalin and embedded with paraffin after three weeks for mouse iPSCs and eight weeks for human iPSCs. Five-m thick sections were stained with haematoxylin and eosin for histological analysis.

ChIP was performed as described in the instruction of the EZ Magna ChIP G kit (Millipore). All antibodies are listed in Table 1. PCR primer sequences are listed in Table 2. PCR amplification levels were first normalized against the value obtained with control IgG. The normalized values with ESCs or MEFs were then defined as 1.0 depending on antibodies to obtain relative expression levels in other cells.

Genomic DNA from mouse iPSCs was treated with bisulfite with an EZ DNA Methylation-Gold kit (Zymo Research). The DNA sequence at the Oct4 proximal promoter region was amplified with PCR using the primers listed in Table 2 and cloned into the pCR2.1-TOPO vector (Invitrogen) for sequencing.

Generation of Mouse IPSCs with Heterologous Transactivation Domains

Full-length mouse Oct4 was fused with various fragments of mouse MyoD (FIG. 1A). The basic helix-loop-helix (bHLH) domain of MyoD, used for dimerization and DNA binding, was not included in these constructs to avoid activation of MyoD-target genes. Each chimeric gene was co-transduced with a polycistronic retroviral vector encoding mouse Sox2, Klf4, and c-Myc (SKM)5 into MEFs derived from Oct4-GFP mice, which contain the GFP gene knocked into the Oct4 locus7. In this model, formation of GFP-positive colonies indicates that individual MEFs develop into Oct4-expressing cells capable of clonal growth. Expression of chimeric proteins was confirmed through immunoblotting with antibodies against Oct4 (FIG. 8). As a control, MEFs were transduced with OSKM (O-MEFs) on day 1 and 0 and transferred these cells onto SNL feeder cells on day 4 following a standard protocol (FIG. 1B, Protocol A). GFP-positive colonies emerged around day 10, gradually increasing in number until reaching a peak by day 18. To calculate the percentage of MEFs that were reprogrammed into iPSCs, the number of GFP-positive colonies were divided by the total number of MEFs seeded in a culture dish. It was estimated that 0.080.09% of O-MEFs were converted into GFP-positive cells, which is similar to previous reports8,10 (FIG. 1A, right column). MEFs were then transduced with each chimeric gene along with SKM and followed the protocol described above (Protocol A). M3O with SKM (M3O-SKM) increased the percentage of GFP-positive colonies most drastically, with 5.100.85% of MEFs (M3O-MEFs) being transformed into GFP-positive cells by day 15. The M3 region encompasses the acidic transactivation domain (TAD) of MyoD (amino acids 3-56)11. However, the simple presence of acidity was insufficient to facilitate iPSC formation, as evidenced by a lack of increase in GFP-positive colonies in MEFs transduced with M6O, which also contains the main acidic amino acid cluster, or a chain of 20 glutamic acids attached to Oct4 (EO) (FIG. 1A). The high efficiency with which M3O created iPSCs as compared to Oct4 was not simply due to a difference in the retrovirus titer for the two virus suspensions. The titer for the M3O virus and Oct4 virus was 1.80.2107 and 2.10.4107 colony forming units/ml, respectively.

While conducting the above experiments, it was noticed that GFP-positive colonies emerged from M3O-MEFs on about day 5 without transfer onto feeder cells (FIG. 1B, Protocol B), and these colonies steadily increased in size and number (FIG. 1C). By around day 12, 3.60.5% of M3O-MEFs formed GFP-positive colonies in the absence of feeder cells, perhaps supported by the surrounding MEFs serving as autologous feeder cells (FIG. 1D). In contrast, GFP-positive colonies emerged from O-MEFs between day 16 and 18 at an extremely low efficiency (0.00350.0006%) with the same protocol. It was next tested if GFP-positive colonies could be obtained without Sox2, Klf4, or c-Myc in the presence of M3O with Protocol B (FIG. 1D). Although M3O still required Sox2 and Klf4, c-Myc was dispensable. Previous studies have reported that iPSCs can be established without c-Myc2,3; however, the uniqueness of M3O-SK lies in the speed and efficiency with which GFP-positive colonies form. While it requires three to four weeks and the presence of feeder cells for OSK to induce GFP-positive colonies at an efficiency of around 0.01%2,3, M3O-SK could generate GFP-positive colonies without feeder cells by day 7 after transduction at an efficiency of 0.44%, over 40-fold more efficient than OSK.

These striking differences between M3O and Oct4 prompted the evaluation of the specificity of the M3O configuration in relation to other host factors and TADs taken from other transcription factors using Protocol B. First, the location and number of the M3 domains in the fusion protein with Oct4 were changed (FIG. 1E). Second, the two TADs in Oct412 were replaced with the M3 domain in various combinations (FIG. 1F). Third, the M3 domain was fused to Sox2 or Klf4 and tested in combination with other members of OSKM and M3O (FIG. 1G). OM3 was as effective as M3O in iPSC creation. In a fourth experiment, TADs taken from other powerful transactivators were fused to Oct4 (FIG. 1H), including the TADs from Tax of human T-lymphotropic virus type 1 (HTLV-1)13, Tat of human immunodeficiency virus type 1 (HIV-1)14,15, Gata416,17 and Mef2c17.

The GFP-positive colonies that emerged on day 5 following transduction with M3O-SKM using Protocol B contained 31-143 cells in 12 colonies, with a median of 43 cells/colony. This number of cells would be produced after less than seven cell divisions assuming even division for each cell, which is strikingly small compared to the median of 70 cell divisions needed before GFP-positive cells appear with OSKM Is. The colonies that emerged with M3O-SKM were usually homogenously GFP-positive from the beginning. On day 7 over 97% of these colonies were homogeneously GFP-positive with Protocol B compared to around 5% of colonies derived with OSKM obtained on day 12 with Protocol A (FIG. 2A). Protocol A was used for OSKM. As a result, GFP-positive colonies were harvested at different time points corresponding to two days after the onset of GFP activation.

The quality of GFP-positive colonies obtained with M3O-SKM and OSKM were compared by quantitative RT-PCR (qRT-PCR) analysis of three pluripotency genes (endogenous Oct4, endogenous Sox2, and Nanog) and three fibroblast-enriched genes (Thy1, Col6a2, and Fgf7)19,21. Homogeneously GFP-positive colonies obtained with M3O-SKM using Protocol B and those with OSKM using Protocol A were selected to represent the colonies for each group. Although cells were harvested at different time points corresponding to the onset of GFP activation, the interval between time points is the same. For OSKM, expression of the three pluripotency genes gradually increased during the initial week after emergence of GFP-positive colonies, indicating a slow maturation process toward pluripotency (FIG. 2B). For M3O-SKM, in contrast, levels of these transcripts reached or exceeded those seen in ESCs at the time of the emergence of GFP-positive colonies and remained at similar levels until day 30. This differential efficiency of transcriptional reprogramming was also evident with suppression of the three fibroblast-enriched genes. For M3O-SKM, expression levels of these genes on day 5 when the GFP signal was apparent were comparable to those seen in ESCs, but it took around one week after the activation of GFP for OSKM to accomplish the same level of gene suppression (FIG. 2C). Together, these results indicate that M3O-SKM can reprogram MEFs to an iPSC state more efficiently than OSKM.

The pluripotency of iPSC clones prepared with M3O-SKM following Protocol B (M3O-iPSCs) was verified using three standard approaches. First, genome-wide transcript analysis demonstrated highly similar gene expression in M3O-iPSCs and ESCs. Out of 41,160 probes, 3,293 were greater than 4-fold differentially expressed (up- or down-regulated) in both ESCs and cloned iPSCs compared to MEFs (FIG. 3A). The commonly up-regulated genes included eight ECS-enriched genes, such as Oct4, Sox2 and Nanog. In addition, Thy1, Col6a2 and Fgf7 were down-regulated more than 16-fold in both ESCs and iPSCs. Second, intramuscular injection of M3O-iPSCs into an NOD/SCID mouse resulted in teratoma formation as shown by the presence of various tissues derived from the three germ layers (FIG. 3B). Third, aggregation of 8-cell stage embryos of the ICR strain with M3O-iPSCs containing the Oct4-GFP allele and ROSA26-lacZ allele formed chimeric mice (FIG. 3C. 3D). M3O-iPSCs contributed to germ cells in some chimeric mice (FIG. 3E). When one of the chimeric males (FIG. 3D left) was crossed with a wild-type female ICR mouse (FIG. 3F, white adult at bottom), all 11 pups showed agouti or black coat color (FIG. 3F).

Establishment of Human iPSCs with M3O-SKM

Next it was evaluated if M3O could also facilitate generation of human iPSCs in comparison to OSKM. Human M3O-SKM and OSKM were transduced into human dermal fibroblasts prepared from a 34-year-old female. Because these cells did not harbor a transgene that could be used as a convenient marker for reprogramming, expression of the pluripotency protein NANOG was monitored by immunofluorescence staining as an iPSC indicator. NANOG-positive human ESC-like colonies emerged around day 8 with M3O-SKM, with the number increasing by around day 15 when 0.300.033% of the original fibroblasts were converted to iPSC colonies (FIG. 4A, 4B). In contrast, when OSKM was transduced, NANOG-positive colonies did not emerge until around day 12 and eventually only 0.00520.0018% of the fibroblasts were turned into iPSC colonies. This indicates 58-fold increased efficiency with M3O-SKM in comparison to OSKM. Furthermore, while less than 10% of the colonies that appeared with OSKM were NANOG positive, more than 90% of the colonies produced with M3O-SKM were NANOG-positive, consistent with the results for mouse iPSCs. Cloned iPSCs prepared with M3O-SKM also expressed endogenous OCT4 and surface markers SSEA4, TRA-1-60 and TRA-1-81 on day 28 (FIG. 4C). Transduced M3O was suppressed by this day (not shown). In addition, iPSCs prepared with M3O-SKM expressed twelve pluripotency genes as demonstrated by quantitative RT-PCR (FIG. 4D). All twenty mitotic spreads prepared from a cloned M3O-SKM iPSCs demonstrated normal karyotypes (FIG. 4E). Finally, they formed teratomas when injected into an NOD/SCID mouse (FIG. 4F), proving pluripotency of the cells.

To understand how M3O-SKM facilitated nuclear reprogramming at the molecular level, several chromatin changes at the Oct4 gene were examined during the early phase of iPSC generation. All analyses were performed with Protocol B on all MEFs in a culture dish including GFP-positive and -negative cells without subculture for 9 days. First, changes in DNA methylation at the promoter of the Oct4 gene were studied. CpG dinucleotides at the proximal promoter of the Oct4 gene are heavily methylated in MEFs, unlike in ESCs and iPSCs22 (FIG. 5A), and this serves as a major inhibitory mechanism for Oct4 transcription. While the number of unmethylated CpG sites remained essentially the same on day 9 in O-MEFs, the number increased approximately twofold in M3O-MEFs on the same day (FIG. 5A, 25.5% vs 55.5%). The more advanced demethylation in M3O-MEFs than in O-MEFs is consistent with the higher percentage of GFP-positive cells in M3O-MEFs than in O-MEFs on day 9 (12.77% vs 0.52%) as shown by flow cytometry (FIG. 5B).

Next, the binding of Oct4 and Sox2 to the distal enhancer of the Oct4 gene1 using chromatin immunoprecipitation (ChIP) was studied. The binding of Oct4 and Sox2 to the distal enhancer remained low with O-MEFs (FIG. 5C). However, Oct4, which was identical to M3O in this case, was already highly bound to the Oct4 distal enhancer in M3O-MEFs as early as day 3 when no GFP-positive colonies had yet appeared (FIG. 5C, the red column in the Oct4 panel). The Oct4-binding level gradually increased subsequently, eventually reaching the level comparable to that seen in ESCs on day 9. The chromatin binding of Sox2 displayed a similar tendency. The binding levels of these two proteins in the mixture of ESCs and MEFs at a 13:87 ratio was studied. This study showed substantially lower binding of Oct4 and Sox2 in comparison to the day 9 levels in M3O-MEFs (FIG. 5C, ESCs+MEFs in blue). This observation indicates that Oct4 and Sox2 were bound to the Oct4 enhancer in the majority of M3O-MEFs including GFP-negative cells on day 9. The increased binding of these two proteins to chromatin in M3O-MEFs prompted us to investigate if chromatin accessibility at the distal enhancer was also increased in M3O-MEFs. Increased chromatin accessibility is generally indicated by higher sensitivity to DNAses23. Chromatin from M3O-MEFs and O-MEFs was digested with the restriction enzyme NsiI and analyzed DNA fragments using Southern blotting. Indeed, chromatin accessibility was consistently higher in M3O-MEFs compared to O-MEFs between day 5 and day 9 (FIG. 5D). Additionally, GFP-negative M3O-MEFs were selected with a FACS on day 9 followed by NsiI digestion analysis. This GFP-negative population also demonstrated increased sensitivity to NsiI (FIG. 5D, far right), indicating that the minor GFP-positive population did not significantly influence the overall result of chromatin accessibility.

Previous reports have shown that the Paf1 complex is recruited to the distal enhancer of the Oct4 gene through binding to the Oct4 protein24,25 and then generally moves to the coding region of the gene26. Three Paf1 complex subunitsparafibromin, Leo1 and Paf1displayed a gradual increase of binding to the distal enhancer and coding region of the Oct4 gene in M3O-MEFs, but not in O-MEFs, between days 3 and 9 following transduction (FIG. 5C). The Paf1 complex recruits the histone methyltransferase complex COMPASS, which catalyzes trimethylation of lysine 4 on histone H3 (H3K4me3)26. This histone modification, a marker for active genes, was also increased specifically in M3O-MEFs in the coding region of the Oc4 gene (FIG. 5E). Two other markers for active genes, acetylation of lysines 9 and 14 on histone H3 (H3K9ac and H3K14ac)27, were also increased in M3O-MEFs (FIG. 5E). In addition, two markers for suppressed genes, trimethylation of H3K9 (H3K9me3) and H3K27 (H3K27me3)27, were more decreased in M3O-MEFs than those in O-MEFs (FIG. 5F). Similar results were observed at the Sox2 locus (FIG. 9). Among these chromatin changes, the levels of H3K9me3 and H3K27me3 in M3O-MEFs most quickly reached the levels observed in ESCs (FIG. 5F), suggesting that the loss of these suppressive histone markers precedes other chromatin modifications. Taken together, these results demonstrate that chromatin at Oct4 and Sox2 loci was generally remodeled in majority of M3O-MEFs, including the GFP-negative population, toward an ESC pattern during the first ten days after transduction, while chromatin in the majority of O-MEFs was not significantly altered.

In addition to global chromatin remodeling, M3O-SKM also elicited wider expression of two pluripotency markers than OSKM: alkaline phosphatase and SSEA1. Alkaline phosphatase was almost ubiquitously expressed by day 9 in M3O-MEFs, unlike the weak and heterogeneous expression observed in O-MEFs (FIG. 6A). SSEA1 was also more widely expressed in M3O-MEFs than in O-MEFs by day 9 as shown by immunofluorescence microscopy and flow cytometory (FIG. 6B, 6C). While alkaline phosphatase and SSEA1 are not exclusively expressed in pluripotent cells, these findings support the interpretation that M3O-SKM remodeled the chromatin in much more wider population of the cells to a certain degree unlike OSKM. Rapid cell proliferation is known to facilitate iPSC generation as shown using p53-null MEFs18; however, neither M3O-SKM nor M3O alone facilitated MEF proliferation during the initial 9 days after transduction (FIG. 6D, 6E).

Chromatin Analyses of Pluripotency Genes without c-Myc M3O-SK induced GFP-positive colonies over 100-fold more efficiently than OSKM with Protocol B (0.44% with M3O-SK in FIG. 1D vs 0.0035% with OSKM in FIG. 1F). This observation suggests that the M3 domain could compensate for the lack of c-Myc when Oct4 activation was used as an indicator. Although several roles of c-Myc have been proposed, its precise function in iPSC formation remains elusive28. To further understand the roles of c-Myc in the activation of pluripotency genes, chromatin analyses at the Oct4 and Sox2 loci were repeated comparing MEFs transduced with three genes (M3O-SK or OSK) and four genes (M3O-SKM of OSKM) on day 9 when the effects of M3O-SKM were readily detectable. One gene (M3O or Oct4) and two genes (M3O+Sox2 or Oct4+Sox2) were transduced for comparison. At this time point, 3.16% of MEFs were GFP-positive with M3O-SK (FIG. 7A), and no GFP-positive cells were observed with other combinations of one, two, or three genes. However, M3O-SK did not significantly decrease the overall level of DNA methylation compared with other gene combinations (FIG. 7B).

As for transcription factor binding to the enhancer, M3O facilitated binding of Oct4, Sox2, and parafibromin in combination with Sox2 or Sox2 and Klf4 (FIG. 7C, red), with some of these binding levels comparable to levels achieved with M3O-SKM. However, Leo1 and Paf1 were not recruited to the enhancer without c-Myc (FIG. 7C). The binding of parafibromin, Leo1, and Paf1 to the initiation site of Oct4 was also weak without c-Myc (FIG. 10A). Consistent with this partial assembly of the Paf1 complex at the Oct4 gene, the level of H3K4me3 remained low without c-Myc (FIG. 7D, 10B). Another active gene marker, H3K9ac, also remained low without c-Myc (FIG. 7D, 10B). Whereas H3K9me3 was effectively decreased by M3O-S and M3O-SK, H3K27me3 was more resistant to demethylation by any of the gene combinations without c-Myc (FIG. 7E). At the Sox2 gene, compared to the Oct4 gene. M3O did not substantially increase the binding of Oct4 or Sox2 to the enhancer alone or in combination with Sox2 or Sox2 and Klf4 (FIG. 11A). The changes in the levels of H3K4me3, H3K9ac, H3K9me3 and H3K27me3 were all weak in the absence of c-Myc (FIG. 11B). Together, these chromatin studies indicate that while M3O could facilitate formation of GFP-positive colonies without c-Myc, the overall level of chromatin remodeling in GFP-negative MEFs was low in the absence of c-Myc.

The present study advances the field of iPSC biology by showing that one of the rate-limiting steps in iPSC formation with OSKM is poor chromatin accessibility at pluripotency genes and that a strong transactivating domain can overcome this problem. Because iPSC formation was dramatically improved with M3O-SKM, the factors required to increase chromatin accessibility most likely already exist within MEFs but are not effectively recruited to pluripotency genes when using OSKM. Our current working model is that the MyoD TAD overcomes the barrier of closed chromatin by effectively attracting chromatin modifying proteins and thereby facilitating the binding of Oct4 and other regulatory proteins as well as epigenetic modifications at pluripotency genes (FIG. 7F). Myc family proteins have been proposed to globally relax chromatin in part through activation of the histone acetyltransferase GCN5 and in part through direct binding to thousands of genomic loci28,29. The results also support c-Myc's potential roles in chromatin remodeling.

One of the central questions related to the molecular mechanisms of iPSC formation is how closed chromatin at the loci of Oct4, Sox2, and Nanog are opened by OSKM. Little is known about this mechanism. One potential mechanism is that chromatin disruption occurs during repeated DNA replication as suggested by a report that 92% of B lymphocytes derived from inducible OSKM transgenic mice become iPSCs after 18 weeks of culture18. Additionally, knockdown of p53 in B cells shortened both cell doubling time and the time required to form iPSCs by twofold. However, this does not seem to be the case for M3O-SKM since it did not facilitate cell proliferation. Additionally, emerging GFP-positive colonies contained far less cells than their counterparts obtained from B cells. It has been difficult to perform biochemical analysis of the early process of iPSC formation, such as epigenetic remodeling at pluripotency genes, because of the presence of feeder cells and non-responsive MEFs that comprise more than 90% of transduced cells. However, the MyoD TAD eliminated the requirement for feeder cells and achieved significant levels of epigenetic remodeling even in those MEFs that eventually fell short of activating GFP with Protocol B. Thus, the MyoD TAD is expected to facilitate the dissection of epigenetic processes during the early phase of iPSC formation.

By combining transcription factors with TADs, this approach to nuclear reprogramming is expected to have a range of applications from inducing pluripotency, as shown in this study, to inducing direct conversion from one differentiated cell type to another without transitioning through iPSCs17,33,34. The strategy of TAD-fusion to potentiate transactivators will further advance the study of nuclear reprogramming. The effect of each TAD may be on dependent on cell types, host transcription factors, and target genes. Other TADs have been used to amplify the activity of transcription factors. For instance, the TAD of VP16 was fused to the transcription factor Pdxl to facilitate conversion of hepatocytes to pancreatic cells36,37. However, the MyoD TAD has not been used in nuclear reprogramming. The TAD-fusion approach is applicable to combinations of many other transcription factors and TADs to amplify the activity of the host transcription factor and control cell fate decisions.

Following is a list of plasmid constructs used in the above work as well as two constructs based on the VP16 gene and data therefor.

The M3 domain of the mouse MyoD cDNA was fused to the amino terminus of the full-length mouse Oct4 cDNA using PCR and inserted into the EcoRI site of the pMXs-IP vector.

The cDNA encoding the M3 domain of mouse MyoD (amino acids 1-62) was amplified with two primer sets, MyoDOct4F4 (GAGAATTCGCCATGGAGCTTCTATCGCCGCCAC; SEQ ID NO:1) and MO63-109R1 (CAGGTGTCCAGCCATGTGCTCCTCCGGTTTCAG; SEQ ID NO:2). Full length Oct4 cDNA was amplified with two primer sets, MO63-109F1 (CTGAAACCGGAGGAGCACATGGCTGGACACCTG; SEQ ID NO:3) and MyoDOct4R5 (CGGAATTCTCTCAGTTGAATGCATGGGAGAG; SEQ ID NO:4). The two PCR products of each first PCR were used as a template for the secondary PCR with the primer set MyoDOct4F4 and MyoDOct4R5. M3O was directly subcloned into EcoRI site of pMXs-IP.

The M3 domain of the mouse MyoD cDNA was fused to the carboxy terminus of the mouse full length Oct4 cDNA.

The M3 domain was prepared with PCR using the primer pair M3F1 and M3R1 and inserted into the EcoRI and the XhoI sites of the pMXs-IP vector to create the pMXs-IP M3 vector. Oct4 was then PCR amplified with the primer pair Oct4F1 and Oct4R1, and inserted into the EcoRI site of pMXs-IP M3 vector.

Activity Test of Making iPSCs OM3 converts 3.2% of MEFs to iPSCs. 3) Mouse M3OM3

Mouse M3 was fused to both the amino and carboxy termini of mouse Oct4.

PCR for Mouse M3OM3

Mouse M3 domain was prepared from the mouse MyoD cDNA with PCR using the primer pair M3OF1 and M3OR1. Mouse full length Oct4 was prepared with PCR using the primer set M3OF2 and Oct4R1. To make M3O, the above two PCR products were used as templates for PCR with the primer pair M3OF1 and Oct4R1. Finally, to make M3OM3, M3O was inserted into the EcoRI site of the pMXs-IP M3 vector prepared in the OM3 construct above.

The M3 domain of the human MyoD cDNA was fused to the amino terminus of the full-length human Oct4 cDNA using PCR and inserted into the EcoRI site of the pMXs-IP vector.

The M3 domain of human MyoD was PCR amplified with the primer pair of hM3OF1 (see below for sequence) and hM3OR1. Human full length Oct4 was PCR amplified with the primer pair of hM3OF2 and hM3OR2. These two PCR products were used as templates for the third PCR with the primers hM3OF1 and hM3OR2.

The full length of the TAD (amino acids 411-490) of VP16 was fused to the amino terminus of the mouse full-length Oct4 cDNA. VP16 is a protein expressed by the herpes simplex virus type I and its transactivation domain is highly powerful.

The cDNA encoding the transactivation domain of VP16 (amino acids 411-490) was amplified by PCR and inserted into the BamHI and XhoI sites of the pMXs-IP vector to create the pMXs VP16-IP vector. Then the full-length mouse Oct4 cDNA was inserted into the EcoRI and XhoI sites of the pMXs VP16-IP vector.

Human herpesvirus 1 complete genome: X14112.1 Tegument protein VP16 from human herpes simplex virus type 1: NP_044650 Activity Test of Making iPSCs VP16LO-SKM converts around 0.5% of mouse embryonic fibroblasts to iPSCs, which is lower than M3O-SKM (5.3%) but still higher than OSKM (0.08%). In addition, VP16LO-SKM does not require feeder cells, unlike OSKM, to make iPSCs.

A part of the TAD (amino acids 446-490) of VP16 was fused to the amino terminus of the mouse full-length Oct4 cDNA.

The cDNA encoding a part of the transactivation domain of VP16 (amino acids 446-490) was amplified with two primer sets, V16F4 (CGAGAATTCGCCATGTTGGGGGACGGGGATC; SEQ ID NO: 28) and V16OR (CAGGTGTCCAGCCATCCCACCGTACTCGTC; SEQ ID NO:29). Full length Oct4 cDNA was amplified with two primer sets. VP16OF (GACGAGTACGGTGGGATGGCTGGACACCTG; SEQ ID NO:30) and Oct4R1 (GCGCTCGAGTCTCAGTTTGAATGCATGGGAGAG; SEQ ID NO:31). The two PCR products of each first PCR were used as a template for the secondary PCR with the primer set V16F4 and Oct4R1. VP16OS was directly subcloned into EcoRI and XhoI site of pMXs-IP.

INDUCED PLURIPOTENT STEM CELLS - Regents of the University of ...

Related Post