Download - (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

Transcript
Page 1: (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

Functional conservation and divergence of GmCHLI genes inpolyploid soybean

Qing Li1,2, Chao Fang1,2, Zongbiao Duan1,2, Yucheng Liu1,2, Hao Qin3, Jixiang Zhang1,2, Peng Sun4, Wenbin Li5,

Guodong Wang3 and Zhixi Tian1,*1State Key Laboratory of Plant Cell and Chromosome Engineering, Institute of Genetics and Developmental Biology, Chinese

Academy of Sciences, Beijing 100101, China,2University of Chinese Academy of Sciences, Beijing 100039, China,3State Key Laboratory of Plant Genomics, Institute of Genetics and Developmental Biology, Chinese Academy of Sciences,

Beijing 100101, China,4Affiliated Hospital of Hebei University, Baoding 071000, China, and5Key Laboratory of Soybean Biology in Chinese Ministry of Education, Northeast Agricultural University, Harbin 150030,

China

Received 20 June 2016; revised 17 July 2016; accepted 21 July 2016.

*For correspondence (e-mail [email protected]).

Accession numbers: RNA-seq raw sequence data generated from this study have been deposited in the Genome Sequence Archive (GSA) database in the BIG

Data Center (http://gsa.big.ac.cn/index.jsp) under accession number PRJCA000242.

SUMMARY

Polyploidy is prevalent in nature. As the fate of duplicated genes becomes more complicated when the

encoded proteins function as oligomers, functional investigations into duplicated oligomer-encoding genes

in polyploid genomes will facilitate our understanding of how traits are expressed. In this study, we identi-

fied GmCHLI1, a gene encoding the I subunit of magnesium (Mg)-chelatase, which functions in hexamers as

responsible for the semi-dominant etiolation phenotype in soybean. Four GmCHLI copies derived from two

polyploidy events were identified in the soybean genome. Further investigation with regard to expression

patterns indicated that these four copies have diverged into two pairs; mutation in the other copy of the

pair that includes GmCHLI1 also resulted in a chlorophyll-deficient phenotype. Protein interaction assays

showed that these four GmCHLIs can interact with each other, but stronger interactions were found with

mutated subunits. The results indicate that, in polyploidy, deficiency in each copy of duplicated oligomer-

encoding genes could result in a mutant phenotype due to hetero-oligomer formation, which is different

from the model of allelic dosage or functional redundancy. In addition, we interestingly found an increase in

isoflavonoids in the heterozygous etiolated plants, which might be useful for improving soybean seed

quality.

Keywords: duplicated genes, GmCHLIs, oligomers, evolution, soybean.

INTRODUCTION

Polyploidy, or whole-genome duplication (WGD), is com-

mon in nature, particularly in plants (Wendel, 2000; Kon-

drashov et al., 2002), and the duplicated genes generated

through polyploidy provide important raw genetic material

for evolution (Zhang, 2003; Conant and Wolfe, 2008). Inves-

tigations of the fate of duplicated genes will help our

understanding of allelic divergence, novel phenotypic gen-

eration and adaptive evolution (Conant and Wolfe, 2008).

Since the first report of gene duplication in Drosophila in

1936 (Bridges, 1936), duplicated genes have been exten-

sively studied at phylogenetic, functional and genomic

levels (Adams and Wendel, 2005; Innan and Kondrashov,

2010). During evolution, one copy of a duplicated pair may

be lost, leaving the other copy as a singleton. However,

when both copies of a duplicated pair persist, the two

copies will generally evolve with different fates and can

maintain all or only part of their original functions, exhibit

divergent functions and gain new functions; one or both

copies can also become pseudogenes (Flagel and Wendel,

2009).

Divergence between duplicate genes can arise via diver-

gence of their expression patterns or via accumulated

© 2016 The AuthorsThe Plant Journal © 2016 John Wiley & Sons Ltd

1

The Plant Journal (2016) doi: 10.1111/tpj.13282

Page 2: (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

mutations (Innan and Kondrashov, 2010). Studies have

indicated that sub-functionalization at the expression level

might be a relatively rapid process compared with sub-

functionalization through mutations (Doyle et al., 2008;

Roulin et al., 2012). One example is soybean, an important

crop providing major sources of animal feed protein and

cooking oil. It has been reported that the soybean genome

has undergone two polyploidy events during the last 60

million years (Shoemaker et al., 2006), which have resulted

in approximately 75% of its genes being present in more

than one copy (Schmutz et al., 2010). Additionally, gen-

ome-wide expression profiling has demonstrated that

approximately 50% of the paralogs in soybean are differen-

tially expressed (Roulin et al., 2012). For instance, only one

copy of the duplicated genes homologous to Arabidopsis

TERMINAL FLOWER controls growth habit in soybean; the

other copy does not due to a divergent transcriptional

pattern (Tian et al., 2010).

In addition to evolving into divergent functional pairs,

some duplicated pairs maintain a similar function (Zhang,

2003). Investigations of the coordinated functions of these

conserved duplicated copies are interesting and will help

to clarify how a trait is determined, which is important for

crops because almost all crop plants are polyploid (Udall

and Wendel, 2006). One popular model of conserved dupli-

cated gene cooperation is the control of a trait by allelic

dosage. For instance, studies indicate that the different

copies of FLOWERING LOCUS C (FLC) in Brassica napus, a

polyploid crop, affect flowering time incrementally, which

explains a large portion of the phenotypic variance

observed in field trials (Schranz et al., 2002; Quijada et al.,

2006; Udall et al., 2006). Another popular model in trait

control is functional redundancy. For example, D1 and D2,

homologs of the STAY-GREEN (SGR) gene, are duplicated

as a result of the most recent WGD in soybean: only the

d1d2 double mutant was found to exhibit the stay-green

phenotype, whereas the single mutant did not, suggesting

that D1 and D2 are redundantly involved in chlorophyll

degradation (Fang et al., 2014).

In nature, as opposed to acting directly as monomers,

many proteins must assemble as oligomers to perform

their functions. Following polyploidy, hetero-oligomers

instead of homo-oligomers may form. Thus, the evolution

of conserved duplicated genes becomes more complicated

when their encoded proteins function as oligomers. Func-

tional and evolutionary investigations of these duplicated

oligomer-encoding genes in polyploid genomes will pro-

vide important information regarding how traits are

expressed.

Magnesium (Mg)-chelatase, which catalyzes the first

step of chlorophyll biosynthesis in a heterotrimeric com-

plex, is composed of three distinct subunits in plants: CHLI,

CHLD and CHLH (Rissler et al., 2002). Structural analyses

have suggested that CHLI forms hexamers and belongs to

the family of ATPases associated with various cellular

activities (AAA+) (Fodje et al., 2001; Hansson et al., 2002).

In barley and maize, CHLI is encoded by a single gene, and

mutation of this gene results in the semi-dominant chlo-

rina phenotype (Hansson et al., 1999; Sawers et al., 2006).

Most genes in soybean are present in multiple copies due

to the WGDs that have occurred (Schmutz et al., 2010). In

this study, we performed map-based cloning and found

that a CHLI homolog in soybean, GmCHLI1, is responsible

for the etiolation phenotype. Four paralogous copies are

presented in the soybean genome, and we comprehen-

sively investigated functional conservation and divergence

among these four copies. We found that the duplication

status of soybean CHLIs has greatly influenced their role

in chlorophyll biosynthesis due to the formation of

hetero-oligomers.

RESULTS

GmCHLI is the gene responsible for the etiolation

phenotype in soybean

We found that one soybean accession in our germplasm

pool always exhibited segregating phenotypes in the pro-

geny population, including green, pale-green, and yellow

leaves (Figure 1a). Further investigation revealed lethality

of the yellow plants shortly after germination and that the

green plants did not generate segregating progeny. How-

ever, the progeny derived from the pale-green plants

exhibited a semi-dominant segregation ratio of 1:2:1 for

green:pale-green:yellow phenotypes. Therefore, we deter-

mined the etiolation phenotype (termed the y mutant in

this study) to be controlled by a single semi-dominate

gene. The green, pale-green and yellow plants were of the

Y/Y, Y/y and y/y genotypes, respectively.

Consistent with their phenotypes, the y/y unifoliolate

leaf contained much smaller chlorophyll (both Chl a and

Chl b) and carotenoid contents than those of the Y/Y unifo-

liolate leaf (Figure 1b). Ultrastructural investigation by

transmission electron microscopy suggested that the

chloroplasts of the mutant were also affected (Figure 1c).

Compared with the chloroplasts of Y/Y unifoliolate leaf, the

chloroplasts of y/y unifoliolate leaf showed abnormal

structures, with no clear prolamellar body and only a few

prothylakoid membranes extending across the length of

the chloroplasts (Figure 1c). To determine whether photo-

synthesis efficiency was affected in the mutant, the chloro-

phyll fluorescence parameters Fv/Fm (maximal PS II

quantum yield), NPQ (nonphotochemical quenching) and

qP (coefficient of photochemical quenching) were mea-

sured. The Fv/Fm, NPQ and qP values of the y/y unifolio-

late leaf were significantly decreased compared with those

of the Y/Y unifoliolate leaf (Figure 1d–f). Interestingly,

although the Y/y unifoliolate leaf exhibited intermediate

deficiency in chlorophyll and carotenoid levels (Figure 1b)

© 2016 The AuthorsThe Plant Journal © 2016 John Wiley & Sons Ltd, The Plant Journal, (2016), doi: 10.1111/tpj.13282

2 Qing Li et al.

Page 3: (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

and chloroplast structures (Figure 1c), its chlorophyll fluo-

rescence parameters were similar to those of the Y/Y unifo-

liolate leaf (Figure 1d–f). This situation indicates that

although chlorophyll deficiency occurs in Y/y plants, the

chlorophyll level is still above the threshold for maintain-

ing normal photosynthesis.

To identify the y locus, we crossed the Y/y mutant with

Williams 82. Initial mapping using an F2 population con-

sisting of approximately 350 individuals anchored the y

locus in a 0.22 Mb region between ‘CAPS marker 25’ and

‘CAPS marker 23’ on chromosome 13 (Figure 1g). In total,

37 annotated genes are present in this 0.22 Mb region.

Except for Glyma13g30560, the annotated functions of

these genes are either unknown or do not involve chloro-

phyll metabolism (Table S1). Glyma13g30560 encodes a

CHLI homolog, the I subunit of Mg-chelatase, and CHLI

mutation results in a semi-dominant chlorophyll-deficient

phenotype in various plant species (Figure S1) (Hansson

et al., 1999; Sawers et al., 2006; Huang and Li, 2009).

Comparison with the Y/Y genomic DNA sequence demon-

strated four SNPs in y/y plants (Figure 1g). One SNP is a

non-synonymous A->G mutation in the third exon, which

converts a Gln to an Arg at residue 275; the remaining

SNPs are synonymous mutations in non-coding regions.

The Y/y plants were found to be heterozygous at these

positions. Thus, we speculated that Glyma13g30560 is a

candidate gene responsible for the semi-dominant etiola-

tion phenotype of the y mutant and that A->G is the

responsible mutation. This speculation was confirmed by

a recent study, in which the same missense mutation in

GmCHLI was identified in an investigation of a semi-

dominant chlorophyll-deficient mutant of y11 through a

candidate gene sequencing method (Campbell et al.,

2015).

Y/Y Y/y y/y

Y/Y Y/y y/y

Y/Y Y/y y/y

1 µm

Fv/F

m

NP

Q

qP

Pig

men

ts (m

g·g-

1 fre

shw

eigh

t)

0

0.2

0.4

0.6

0.8

1

0

1

2

3

4

5 Chl aChl bCar

0

0.2

0.4

0.6

0.8

0

0.1

0.2

0.3

0.4

0.5

b ac

a a

b

aa

b

(a) (b)

(c)

(d) (e) (f)

(g)

Recombinants

Chr. 13

4 Mb

32.5

4 M

b34

.39

Mb

100 kb

CA

PS

28

CA

PS

22

3 2 1 110 0 1 1 2 3 7 15

Candidate region223 kb

10 kb

500 bp

5735 bpGlyma13g30560

UTR Exon

ATG TGAG->A

A->G A->TC->T

y

1 cm

1 µm1 µm

Y/Y Y/y y/y Y/Y Y/y y/y Y/Y Y/y y/y

Figure 1. Identification of the y locus.

(a) Fourteen-day-old Y/Y, Y/y and y/y seedlings.

(b) Chlorophyll a (Chl a), chlorophyll b (Chl b) and carotenoid (Car) contents in unifoliolate leaf, as shown in (a). The error bar indicates the standard deviation

of three replicates.

(c) Ultrastructure of the chloroplasts in unifoliolate leaf, as shown in (a).

(d–f) Chlorophyll fluorescence values of Fv/Fm (d), NPQ (e) and qP (f) in unifoliolate leaf, as shown in (a). The error bar indicates the standard deviation of at

least three replicates. Statistical significance at the 5% level was determined using Duncan’s multiple range test.

(g) Map-based cloning of the y locus.

© 2016 The AuthorsThe Plant Journal © 2016 John Wiley & Sons Ltd, The Plant Journal, (2016), doi: 10.1111/tpj.13282

Evolution of GmCHLIs in soybean 3

Page 4: (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

Four GmCHLI copies are present in the soybean genome

Soybean is a paleopolyploid plant that has experienced at

least two rounds of WGD events (Shoemaker et al., 2006),

resulting in approximately 75% of its genes being present

in multiple copies (Schmutz et al., 2010). A homolog

search showed four CHLI copies in the soybean genome

(Figures 2a and S1). To distinguish them from each other,

we termed Glyma13g30560 as GmCHLI1 and the other

three copies as GmCHLI2 (Glyma15g08680), GmCHLI3 (Gly-

ma07g32550) and GmCHLI4 (Glyma13g24050). These four

copies were further grouped into two pairs: GmCHLI1 and

GmCHLI2; GmCHLI3 and GmCHLI4 (Figure 2a). The

sequence similarity within each pair was higher than that

between the pairs (Figure 2a and Table S2). Based on a

previous WGD analysis (Schmutz et al., 2010), we deter-

mined that the copies within each pair came from the

younger WGD at 13 million years ago (MYA) and that the

two pairs were derived from the older WGD at 59 MYA

(Figure S2).

The existence of duplicated genes provides opportuni-

ties for functional divergence (Force et al., 1999; Lynch

et al., 2001), and it would be valuable to determine how

the four GmCHLI copies coordinately evolved to control

chlorophyll biosynthesis. An expression assay using previ-

ously reported transcriptome data (Shen et al., 2014) sug-

gested that GmCHLI1 and GmCHLI2 exhibit significantly

different expression patterns from those of GmCHLI3 and

GmCHLI4 (Figure 2b). Consistent with their function in

photosynthesis, GmCHLI1 and GmCHLI2 exhibited con-

stantly high expression in most tissues, except for the root,

senescent leaf and later-developed seed. In contrast,

GmCHLI3 and GmCHLI4 exhibited significantly lower

expression in most tissues and presented only relatively

higher expression in young tissues (Figure 2b). These

(a)

(b)

(c)

(d)

(e)

AtCHLI2

HvCHLI78

100

100

100

99

99

90

0.02

GmCHLI1GmCHLI2

GmCHLI3GmCHLI4

AtCHLI1

NtCHLIZmCHLI

OsCHLI

LUC

/RE

N

0

0.0002

0.0004

0.0006

0.0008

0.0010

0.0012

0.0014

PAtCHLI1 PAtCHLI2 PGmCHLI1 PGmCHLI2

PGmCHLI3 PGmCHLI4 pGreen CK

GmCHLI1

GmCHLI2

GmCHLI3

GmCHLI4

LB RB

KpnI/XhoI SpeIPromoter

REN LUC35S

Roo

tS

tem

-1S

tem

-2S

hoot

mer

iste

mC

otyl

edon

-1C

otyl

edon

-2Le

af b

ud-1

Leaf

bud

-2Le

af b

ud-3

Leaf

-1Le

af-2

Leaf

-3Fl

ower

-1Fl

ower

-2Fl

ower

-3Fl

ower

-4Fl

ower

-5P

od+s

eed-

1P

od+s

eed-

2P

od+s

eed-

3P

od-1

Pod

-2P

od-3

See

d-1

See

d-2

See

d-3

See

d-4

See

d-5

Figure 2. Phylogenetic analysis and transcriptional assay of different GmCHLI copies.

(a) Phylogenetic analysis of CHLI homologs/paralogs in soybean and other plants. A neighbor-joining tree was constructed with MEGA 6.0 software based on

amino acid sequences. The protein sequences are from soybean (GmCHLI1, Glyma13 g30560; GmCHLI2, Glyma15 g08680; GmCHLI3, Glyma07 g32550 and

GmCHLI4, Glyma13 g24050), Arabidopsis (AtCHLI1, At4 g18480; AtCHLI2, At5 g45930), Oryza sativa (OsCHLI, Os03 g36540.1), Zea mays (ZmCHLI,

GRMZM2G419806_T01), Hordeum vulgare (HvCHLI, ABF72535.2) and Nicotiana tabacum (NtCHLI, AAG35472.1).

(b) Transcriptional assay of the four GmCHLI copies in 28 tissues/organs. The size of the number indicates earlier to later developmental stages within the same

tissue/organ. The FPKM value was recognized as the gene expression level; the data are shown in base 2 logarithmic form (log2 FPKM).

(c) Diagram showing the structure of the Dual-Luciferase reporter vector.

(d) LUC activities driven by the indicated CHLI promoters in N. benthamiana.

(e) Dual-Luciferase assay of the relative CHLI promoter activity. The error bar indicates the standard deviation of three replicates.

© 2016 The AuthorsThe Plant Journal © 2016 John Wiley & Sons Ltd, The Plant Journal, (2016), doi: 10.1111/tpj.13282

4 Qing Li et al.

Page 5: (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

results suggest that the two pairs (GmCHLI1 and GmCHLI2;

GmCHLI3 and GmCHLI4) have evolved differential func-

tions; GmCHLI3 and GmCHLI4 may no longer be the major

genes controlling chlorophyll biosynthesis in soybean or at

least not in all tissues.

To investigate associations between differential tran-

scription levels and promoter divergence, we assessed the

promoter activities of the four GmCHLIs using a reported

transient transcription Dual-Luciferase assay system (Hel-

lens et al., 2005) in which tobacco leaves were infected

with Agrobacterium tumefaciens harboring one plasmid

expressing a Dual-Luciferase reporter (Figure 2c, d). In Ara-

bidopsis, CHLI is encoded by two copies, AtCHLI1 and

AtCHLI2 (Rissler et al., 2002), with AtCHLI1 being

expressed at a much higher level than AtCHLI2 and thus

acting as the major functional gene (Huang and Li, 2009).

To validate the reliability of our experimental system, the

promoter activities of AtCHLI1 and AtCHLI2 were also

quantified as positive controls. The results showed that the

transcriptional activity (ratio of LUC to REN) of PAtCHLI1

was approximately six times higher than that of PAtCHLI2

(Figure 2e), which is consistent with the mRNA ratio of

AtCHLI1 to AtCHLI2 (5.86 � 1.31) in Arabidopsis seedlings

(Huang and Li, 2009) and demonstrated the reliability of

the experimental system. Consistent with their expression

levels, the promoters of GmCHLI1 and GmCHLI2 exhibited

significantly stronger transcription activities than the pro-

moters of GmCHLI3 and GmCHLI4 (Figure 2e). Our results

indicate that evolution of the promoter activities of the

different copies of GmCHLI may greatly affect their

transcription patterns.

Mutation of GmCHLI2 also results in a chlorophyll-

deficient phenotype

Analyses of expression and promoter activities suggested

that GmCHLI2 has characteristics similar to those of

GmCHLI1 (Figure 2b, e), indicating that GmCHLI2 may

have similar functions as GmCHLI1 with regard to chloro-

phyll biosynthesis. If so, mutation in GmCHLI2 may also

result in a chlorophyll-deficient phenotype. A previous

study reported that the y9 mutant resulted in a chlorina

foliage phenotype (Chen et al., 1999). y9 is located in the

region between the two markers ‘Satt720’ and ‘A069_2’ on

chromosome 15 (Cregan et al., 1999) (http://www.soy-

base.org/), and GmCHLI2 is also located in this interval.

Therefore, GmCHLI2 is a possible candidate for y9. Geno-

mic DNA sequencing of GmCHLI2 from the y9 mutant (PI

548172) and its parent Illini (PI 548348) showed a single

non-synonymous G->A mutation in the third exon, result-

ing in the change of amino acid Asp to Asn at residue 278

(Figure 3a). The same amino acid substitution in HvCHLI

was reported for the barley Clo-125 mutant (Figure S1)

(Hansson et al., 1999). These results confirmed that

GmCHLI2 is a candidate for y9, a recessive mutant with

greenish yellow leaves (Figure 3b) and notably decreased

chlorophyll and carotenoid contents compared with Illini

(Figure 3c). Consistently, the grana stacks of y9 chloro-

plasts were looser than those of Illini (Figure 3d).

Functional analysis of the four GmCHLIs

The above analyses suggested that GmCHLI1 and

GmCHLI2 may have similar functions, and when duplicated

genes have similar functions, they usually exhibit func-

tional redundancy in trait control. Nevertheless, mutation

in either GmCHLI1 or GmCHLI2 could result in chlorophyll

biosynthesis deficiency. Moreover, although GmCHLI3 and

GmCHLI4 exhibited different expressional patterns com-

pared with GmCHLI1 and GmCHLI2, high similarities were

observed in the protein sequences of these four GmCHLIs

(Table S2). To clarify the functional divergence of these

GmCHLIs at the protein level, we first investigated their

subcellular localization. All four GmCHLIs were localized to

the chloroplasts, as is AtCHLI1, suggesting that their sub-

cellular localization had not diverged during evolution fol-

lowing the two WGD events. Moreover, we found that

neither mGmCHLI1 from the homozygous y mutant nor

mGmCHLI2 from the y9 mutant exhibited altered subcellu-

lar localization (Figure S3).

Subsequently, we performed a complementary test to

study the functions of these four GmCHLI copies. To

(a) GmCHLI27510 bp

UTR Exon 500 bp

ATG TGA

G->A

Illini y9

Illini y9

Illini y9

Pig

men

ts (m

g·g-

1 fre

shw

eigh

t)

0

1

2

3Car

Chl aChl b

1 µm

5 cm

y Q->R y9 D->N

(b) (c)

(d)

1 µm

Figure 3. Phenotypes of the y9 mutant.

(a) Mutation site in the GmCHLI2 gene and protein in the y9 mutant.

(b) Fourteen-day-old Illini and y9 seedlings.

(c) Chl a, Chl b and carotenoid (Car) contents in unifoliolate leaf, as shown

in (b). The error bar indicates the standard deviation of three replicates.

(d) Ultrastructure of chloroplasts in unifoliolate leaf, as shown in (b).

© 2016 The AuthorsThe Plant Journal © 2016 John Wiley & Sons Ltd, The Plant Journal, (2016), doi: 10.1111/tpj.13282

Evolution of GmCHLIs in soybean 5

Page 6: (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

eliminate the potential influence of different GmCHLI

copies in the soybean genome, we used the Arabidopsis

cs215 mutant as the background because it has been

reported that CHLI is functionally conserved in various spe-

cies and only AtCHLI1 functions as the main I subunit of

Mg-chelatase in Arabidopsis (Huang and Li, 2009). cs215 is

a chli1 mutant with a non-synonymous mutation and pre-

sents a semi-dominant chlorophyll-deficient phenotype

(Huang and Li, 2009). The results showed that each of the

wild-type GmCHLIs driven by the 35S promoter could res-

cue the chlorophyll-deficient phenotype of the cs215

mutant, similar to the results for AtCHLI1 (Figure 4 and

Table S3). The transgenic plants presented foliage pheno-

types that ranged from yellow to fully green, and the

degree of rescue was generally positively related to the

level of transgenic expression (Figure S4a–d). In contrast,

the mutant types of mGmCHLI1 and mGmCHLI2 could not

rescue the cs215 mutant (Table S3). Moreover, some

mGmCHLI1/mGmCHLI2 transgenic plants in the Enk

(wild-type) background were slightly green and even etio-

lated (Figure 4 and Table S3). Overall, the chlorina level

was also positively related to high transgenic expression

(Figure S4e, f). It is noteworthy that the chlorophyll-

deficient phenotype of the 35S::mGmCHLI2 transgenic

plants was not as serious as that of the 35S::mGmCHLI1

transgenic plants (Figure 4 and Table S3), which is consis-

tent with the corresponding phenotypes of the y and y9

mutants. These results indicate that all four GmCHLIs can

function as the CHLI subunit but that their functions could

be dominant negatively affected by non-synonymous

mutation of either copy of GmCHLI1 and GmCHLI2, further

suggesting that the different copies might interacted in

nature.

The four GmCHLIs interact with each other but exhibit

stronger interactions with mGmCHLI1 or mGmCHLI2

To reveal how these GmCHLIs cooperate to control of

chlorophyll synthesis, their interactions were evaluated

using a point-to-point yeast two-hybrid (Y2H) assay. All of

the combinations of wild-type GmCHLIs showed moderate

interactions at 6 days of incubation (Figure 5a). These

interactions were further confirmed using a bimolecular

fluorescence complementation (BIFC) assay (Figure S5).

Unexpectedly, the mutated subunits (mGmCHLI1 and

mGmCHLI2) not only showed interactions with the other

GmCHLIs, but the interactions were even slightly stronger

(Figure 5a). To validate this result, we examined the inter-

action capacities of the GmCHLI1 and GmCHLI2 with their

wild-type and mutated subunits in yeast cells using b-galassays (Figure 5b). The results showed that the mutated

subunits indeed exhibited stronger interactions compared

with the wild-type subunits, particularly between different

paralogs (GmCHLI1 with mGmCHLI2 and mGmCHLI1 with

GmCHLI2).

In addition to CHLI, CHLD and CHLH are also required

for the formation of a functional Mg-chelatase heterotri-

meric complex (Hansson et al., 2002; Rissler et al., 2002).

We found two CHLD copies (GmCHLD1 for Glyma01g43630

and GmCHLD2 for Glyma11g01840) and three CHLH copies

(GmCHLH1 for Glyma03g29330, GmCHLH2 for Gly-

ma10g20570 and GmCHLH3 for Glyma19g32070) in the

soybean genome. We determined that GmCHLD1 and

GmCHLD2 as well as GmCHLH1 and GmCHLH3 are

Enk cs215/+ cs215/cs215

35S::AtCHLI1 (cs215/cs215)

35S::GmCHLI1 (cs215/cs215)

35S::GmCHLI2 (cs215/cs215)

35S::GmCHLI3 (cs215/cs215)

35S::GmCHLI4 (cs215/cs215)

35S::mGmCHLI1 (Enk)

35S::mGmCHLI2 (Enk)

2 cm

Figure 4. Complementation tests of GmCHLIs in Arabidopsis.

Photographs of approximately 3-week-old Enk, cs215/+, cs215/cs215 and

representative transgenic lines are shown.

© 2016 The AuthorsThe Plant Journal © 2016 John Wiley & Sons Ltd, The Plant Journal, (2016), doi: 10.1111/tpj.13282

6 Qing Li et al.

Page 7: (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

duplication pairs from the recent WGD (Figure S2). Subcel-

lular localization showed that all of the GmCHLD and

GmCHLH copies localize to the chloroplast (Figure S3). We

then performed the BIFC assay to investigate interactions

between all four of the GmCHLIs and the different copies

of GmCHLD or GmCHLH, and the results indicated that

they all interacted (Figure S6). We also assessed interac-

tions between GmCHLDs and GmCHLHs, as well as the

interactions of GmCHLDs or GmCHLHs themselves, with

positive results for all combinations (Figure S7).

Expression of GmCHLIs, GmCHLDs and GmCHLHs are

affected in y and y9 mutants

Because all of the GmCHLIs, GmCHLDs and GmCHLHs

were found to interact with each other, mutation of

GmCHLI1 and GmCHLI2 might affect expression of these

subunits. We then explored the gene expression of these

subunits in the root, stem, cotyledon and unifoliolate leaf

of Y/Y, Y/y and y/y plants by RT-qPCR. First, the results

indicated much higher expression for GmCHLI1 and

GmCHLI2 relative to GmCHLI3 and GmCHLI4 in unifoliolate

leaf, which is consistent with the transcriptome results

(Figure 6a–d). GmCHLD1 showed a significantly higher

expression level than GmCHLD2 (Figure 6e, f), and the

mRNA levels of GmCHLH1 and GmCHLH3 were signifi-

cantly higher relative to that of GmCHLH2 (Figure 6g–i).Second, expect for GmCHLI4, all of the genes were highly

expressed in the unifoliolate leaf compared with other

tested tissues/organs (Figure 6). Third, expression of

almost all of the genes was decreased in the y/y unifolio-

late leaf in comparison with that of the Y/Y unifoliolate leaf,

with exception of GmCHLD2 (Figure 6). Consistent with the

phenotype (Figure 1), the total mRNA levels of these sub-

units in the Y/y unifoliolate leaf were comparable with

those in the Y/Y unifoliolate leaf, or at an intermediate level

between the Y/Y and y/y unifoliolate leaves (Figure 6). Sim-

ilarly, transcription of these subunits was evaluated in the

y9 and Illini plants (Figure S8), with expression in the y9

mutant either unchanged or slightly decreased in

comparison with Illini. These results indicate that the

GmCHLI1 and GmCHLI2 missense mutations not only

affect interaction capacity but also influence the transcript

abundance of Mg-chelatase subunits and subsequently

block chlorophyll biosynthesis.

Isoflavonoids are increased in Y/y plants

Isoflavonoids are special metabolites in legumes and also

one of the key parameters for evaluating the nutritional

quality of soybean seeds (Dhaubhadel et al., 2003). Isofla-

vonoids are synthesized via the general phenylpropanoid

pathway and legume-specific isoflavonoid branch path-

ways catalyzed by a series of enzymes, including pheny-

lalanine ammonia lyase (PAL), chalcone synthase (CHS),

chalcone reductase (CHR), isoflavone synthase (IFS), cinna-

mate-4-hydroxylase (C4H), 4-coumarate CoA ligase (4CL)

and chalcone isomerase (CHI) (Du et al., 2010; Chu et al.,

2014). Among these enzymes, IFS is directly involved in

the conversion of flavanone substrates to isoflavonoids

(Cheng et al., 2008). Interestingly, our transcriptome assay

of four tissues/organs in Y/Y and Y/y plants showed up-

regulation of the transcript abundance of nearly all of

the enzymes involved in isoflavonoid biosynthesis in the

12–14 mm seed and 14th trifoliolate leaf of Y/y plants com-

pared with Y/Y plants (Figure 7a). Conversely, considerable

changes in transcripts of these enzymes were not observed

in the 8–10 mm seed of Y/y plants vs. Y/Y plants (Figure

7a).

The increased transcripts of isoflavonoid biosynthesis

enzymes in the 12–14 mm seed of Y/y plants may improve

seed accumulation of isoflavonoid products. Thus, we sub-

sequently assayed primary and secondary metabolites in

mature seeds harvested from Y/Y and Y/y plants (Tables

S4, S5). At the same time, primary and secondary metabo-

lites in the sprouts generated from these seeds were also

examined (Tables S6, S7) because sprouts are a common

and favorite food in Asia. Among the considerable changes

in a wide range of metabolites in the seeds and sprouts

between Y/Y and Y/y plants (Tables S4, S5, S6 and S7), the

mG

mC

HLI1

mG

mC

HLI2

AtCHLI1

GmCHLI1

GmCHLI2

GmCHLI3

mGmCHLI1

GmCHLI4

mGmCHLI2

BD

AtC

HLI1

Gm

CH

LI1

Gm

CH

LI2

Gm

CH

LI3

Gm

CH

LI4

AD

Prey

Bait

-Leu/-Trp/-His/-Ade

0

0.4

0.8

1.2

1.6

2

AD-GmCHLI1

β-ga

l act

ivity

(uni

t)

AD-mGmCHLI2AD-GmCHLI2

AD

AD-mGmCHLI1

bcab

d

a

e

bc

a

c

b

d

(a) (b)Figure 5. Interaction analysis among GmCHLIs

using the yeast two-hybrid system.

(a) Interactions among GmCHLIs. AtCHLI1 was used

as a positive control.

(b) Comparison of the interaction capacities of the

indicated proteins in yeast cells based on b-galassays. The error bar indicates the standard devia-

tion of three replicates. Statistical significance at

the 5% level was determined using Duncan’s multi-

ple range test.

© 2016 The AuthorsThe Plant Journal © 2016 John Wiley & Sons Ltd, The Plant Journal, (2016), doi: 10.1111/tpj.13282

Evolution of GmCHLIs in soybean 7

Page 8: (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

total content of isoflavonoids was found to be significantly

elevated in the seeds from Y/y plants (Figure 7b). However,

the genotype of the individual seed from Y/y plants had lit-

tle effect on the increase in isoflavonoids (Figure 7b). Simi-

lar results were observed for the sprouts (Figure 7b).

These findings suggest that the greater accumulation of

isoflavonoids in seeds from Y/y plants is mainly deter-

mined by maternal nutrition.

DISCUSSION

Evolution of oligomer-encoding GmCHLIs following

polyploidy in soybean

Polyploidy is one of the major forces promoting genome

evolution (Adams and Wendel, 2005; Flagel and Wendel,

2009), and clarification of how duplicated genes diverge

after polyploid events is important for understanding trait

determination. Mg-chelatase catalyzes the first critical step

in chlorophyll biosynthesis at the chlorophyll and heme

branch point of tetrapyrrole biosynthesis. The enzyme con-

sists of three distinct subunits: CHLI, CHLD and CHLH (Riss-

ler et al., 2002). The sequences and functions of these

subunits are highly conserved in different photosynthetic

organisms, and all three subunits are essential for proper

Mg-chelatase function (Hansson et al., 1999, 2002; Sawers

et al., 2006). It has been reported that CHLI needs to form

as a hexamer and then to interact with CHLD and CHLH to

function as a Mg-chelatase (Hansson et al., 2002; Masuda,

2008). Due to the two WGD events (Schmutz et al., 2010),

four CHLI paralogs, two CHLD paralogs and three CHLH

paralogs are present in the soybean genome; with the

exception of GmCHLH2, these genes are present as dupli-

cated pairs (Figure S2). Therefore, it would be useful to

determine how the different GmCHLI copies coordinate to

form a hexamer prior to becoming functional.

Our transgenic Arabidopsis assay demonstrated that all

four of the GmCHLIs can function as a CHLI subunit (Fig-

ure 4). However, our transcriptional investigation indicated

that the four copies have diverged into two pairs according

to expression patterns (Figure 2). Among these GmCHLIs,

GmCHLI1 and GmCHLI2 exhibited constantly high expres-

sion in most tissues, suggesting that they may be the

major CHLI isoforms. In contrast, GmCHLI3 and GmCHLI4

showed significantly low expression in most tissues, indi-

cating that they may no longer be the main genes encod-

ing the CHLI subunit in soybean, at least not in all tissues.

GmCHLI1 GmCHLI2 GmCHLI3

GmCHLI4 GmCHLD1 GmCHLD2

GmCHLH1 GmCHLH2 GmCHLH3

Y/Y Y/y y/y

00.010.020.030.040.050.06

0

0.02

0.04

0.06

0.08

0.10

0

0.0005

0.0010

0.0015

0.0020

0.0025

00.0020.0040.0060.0080.0100.012

0

0.05

0.10

0.15

0.200.25

00.0030.0060.0090.0120.0150.018

0

Root

Stem

Cotyled

on

Unifoli

olate

leaf

Root

Stem

Cotyled

on

Unifoli

olate

leaf

Root

Stem

Cotyled

on

Unifoli

olate

leaf

369

121518

00.0010.0020.0030.0040.0050.0060.007

0

2

4

6

8

(a) (b) (c)

(d) (e) (f)

(g) (h) (i)

Rel

ativ

e ex

pres

sion

Rel

ativ

e ex

pres

sion

Rel

ativ

e ex

pres

sion

Figure 6. Relative expression of GmCHLIs, GmCHLDs and GmCHLHs in the y mutant.

Expression patterns of GmCHLI1 (a), GmCHLI2 (b), GmCHLI3 (c), GmCHLI4 (d), GmCHLD1 (e), GmCHLD2 (f), GmCHLH1 (g), GmCHLH2 (h) and GmCHLH3 (i) in

different tissues/organs of the y mutant. RNA was isolated from seedlings when their unifoliolate leaves were fully expanded. ACT11 was used as an internal

control. The error bar indicates the standard deviation of at least three replicates.

© 2016 The AuthorsThe Plant Journal © 2016 John Wiley & Sons Ltd, The Plant Journal, (2016), doi: 10.1111/tpj.13282

8 Qing Li et al.

Page 9: (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

The similarity of GmCHLI1 and GmCHLI2 at both the mRNA

and protein levels suggest that these genes are functionally

redundant. Nevertheless, unlike other duplicated gene pairs

for which only a double mutant can generate a mutant phe-

notype (Fang et al., 2014), mutation in each copy of the

duplicated GmCHLI1 and GmCHLI2 pair can result in a

chlorophyll-deficient phenotype (Figures 1 and 3). This situ-

ation may be because all the GmCHLIs have the ability to

form both homo-hexamers and hetero-hexamers (Figures 5

and S5), and then further interact with GmCHLD and

GmCHLH to generate the Mg-chelatase complex.

Photosynthesis provides the energy required for the life

activities of almost all organisms and thus is one of the

most important biochemical reactions on Earth. Genetically

modifying genes involved in photosynthesis has long been

considered as a potentially efficient approach to increasing

crop yields, though it has met with significant challenges

(Horton, 2000; Long et al., 2006). Many proteins involved in

photosynthesis exist as oligomers to form functional com-

plexes (Liu et al., 2004; Guskov et al., 2009; Qin et al.,

2015; Suga et al., 2015), and even the chaperonins that

assist in protein assembly also need to function in a multi-

meric manner (Bai et al., 2015). It has been reported that

almost all crops have experienced polyploidy events (Wen-

del, 2000; Kondrashov et al., 2002), resulting in oligomer

formation by different duplicated copies. Interactions

among the subunits in a protein complex are critical for

activity, which could be affected by mutation in any of the

subunits (Zhong et al., 2003; Imai et al., 2008; Daras et al.,

2009; Kim et al., 2013; Kakehi et al., 2015; Zhang et al.,

2016), and our results support this notion. We found that

non-functional mGmCHLI1 or mGmCHLI2 strongly com-

peted with functional GmCHLIs in assembly of the CHLI

hexamer, thus forming a dysfunctional Mg-chelatase com-

plex (Figure 5). Therefore, we infer that the evolution of

duplicated oligomer-encoding genes will be more restricted

because they may be affected by other paralogs. This is

partly supported by the fact that in 302 resequenced soy-

bean accessions, no homozygous amino acid mutation is

found in conservative positions of the four GmCHLIs (Zhou

et al., 2015). Our results indicate that increasing crop yield

via modification of photosynthesis-related genes might be

a complex process, particularly for genes involved in multi-

mer formation because their modifications will affect coop-

erative interactions among different duplicated copies.

However, simultaneous modification of paralogs at the

same amino acid position is a potential strategy.

Application of GmCHLI1 for breeding

In addition to photosynthesis, the chloroplast is the site of

the biosynthesis of many metabolites such as organic

acids, amino acids and sugars. In general, abnormal

chloroplast ultrastructure would affect some metabolism

processes, thereby possibly altering the abundance of

metabolites in the leaves and other tissues/organs (Luo

et al., 2013; Satou et al., 2014). In this study, we found

impaired chloroplast ultrastructure in the chlorophyll-

deficient y mutant (Figure 1c), and our metabolomic data

revealed large changes in a number of metabolites in the

seeds and sprouts from heterozygosis mutant plants

(Tables S4, S5, S6 and S7). Interestingly, the total content

of isoflavonoids was significantly elevated in the seeds

and sprouts from Y/y plants (Figure 7b). As a class of

plant-specialized secondary metabolites that are almost

exclusive to the legume family, isoflavonoids play impor-

tant roles in plant–environment interactions and in human

health (Du et al., 2010). The increase of isoflavonoid levels

has long been considered in soybean breeding because of

their health-related benefits (Dhaubhadel et al., 2003). Our

investigation further revealed that the accumulation of iso-

flavonoids in the seeds and sprouts from Y/y plants is

mainly determined by the maternal genotype (Figure 7b).

(a)

(b)Seeds Sprouts

Y/y plants

0

10

20

30

40

50

0102030405060

Isof

lavo

noid

s (μ

mol

g–1

)

Isof

lavo

noid

s (μ

mol

g–1

)

eb d c a

d

ab c b a

Y/Y plants Y/y plantsPALC4H4CL

CHSCHRCHIIFS

Y/y plants

Figure 7. Increased isoflavonoids in Y/y plants.

(a) Gene expression of the enzymes involved in isoflavonoid biosynthesis in

trifoliolate leaf and seed from Y/y and Y/Y plants. The total FPKM values of

the paralogs that encode the same enzyme were recognized as the mRNA

level for this enzyme, the data are shown in base 2 logarithmic form (log2

Total FPKM).

(b) Total isoflavonoids in the mature seeds and sprouts from Y/y and Y/Y

plants. Total isoflavonoids are the sum of the mole content of each isoflavo-

noid type. Single Y/Y indicates seeds or sprouts harvested from Y/Y plants;

‘Y/y mix’ denotes seeds or sprouts of unknown genotype harvested from Y/

y plants; Y/Y, Y/y and y/y above the Y/y plants represent seeds or sprouts of

corresponding genotypes harvested from Y/y plants. The error bar indicates

the standard deviation of three replicates. Statistical significance at the 5%

level was determined using Duncan’s multiple range test.

© 2016 The AuthorsThe Plant Journal © 2016 John Wiley & Sons Ltd, The Plant Journal, (2016), doi: 10.1111/tpj.13282

Evolution of GmCHLIs in soybean 9

Page 10: (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

Similarly, a maternal effect on seed isoflavonoid content

was noted in the reciprocal crosses between soybean culti-

vars that differ in seed isoflavonoids (Dhaubhadel et al.,

2003). The isoflavonoid content of F1 seeds more closely

resembles that of the maternal parent. Moreover, the Y/y

heterozygosis mutant did not show a significant difference

in yield compared with wild-type plants. Therefore, Y/y

plants may be useful for future breeding of high-isoflavo-

noid soybean.

EXPERIMENTAL PROCEDURES

Plant materials and growth conditions

The soybean plants used for quantitative analysis of pigments,observation of chloroplast ultrastructure, determination of chloro-phyll fluorescence parameters and RT-qPCR were grown in agreenhouse (22–25°C, 16 h light/8 h dark photoperiod,130–180 lmol m�2 sec�1 light intensity and 50–60% relativehumidity). Soybean plants for map-based cloning, transcriptomeand metabolome analyses were grown at the farm of the Instituteof Genetics and Developmental Biology with the same cultivationmanagement. All of the Arabidopsis and N. benthamiana plantswere grown in a greenhouse (22–25°C, 16 h light/8 h dark pho-toperiod, 130–180 lmol m�2 sec�1 light intensity and 50–60% rela-tive humidity).

Pigments analysis

Chlorophylls and carotenoids were extracted using 100% acetoneand quantified spectrophotometrically using the following previ-ously published formulas (Lichtenthaler, 1987):

Concentration of chlorophyllaðCaÞ ¼ ð11:24A662� 2:04A645Þ � v/w

Concentration of chlorophyllbðCbÞ ¼ ð20:13A645� 4:19A662Þ � v/w

Concentration ofcarotenoidsðCcarÞ ¼ðð1000A470� 1:90Ca� 63:14CbÞ=214Þ � v/w

(V indicates the volume of acetone; W represents the freshweight of the sample).

Transmission electron microscopy analysis

Leaf sections (approximately 1 mm 9 1 mm) from the same posi-tion were first placed in 1.5 mL primary fixation solution (2.5% glu-taraldehyde allocated with 0.1 M phosphate buffer) and maintainedunder vacuum for 3 h for full pre-fixation. After washing five timeswith 1 mL of 0.1 M phosphate buffer, the samples were post-fixedfor 2 h with 0.5 mL of 1% osmium tetroxide in the same buffer. Thesamples were then dehydrated in a graded acetone series and sub-sequently infiltrated with a graded epoxy resin series. The polymer-ization reaction was performed in 100% epoxy resin for 24 h at60°C. Ultra-thin sections were obtained using a Leica EM UC7 ultra-microtome (Leica, Wetzlar, Germany) and stained with uranyl acet-ate for 20 min and lead citrate for 5 min. The observations andrecording of images were performed using a JEM-1400 transmis-sion electron microscope (JEOL, Tokyo, Japan).

Chlorophyll fluorescence analysis

Fv/Fm, NPQ and qP were measured using an IMAGING-PAM M-series Chlorophyll Fluorometer with the MAXI version (HeinzWalz, Effeltrich, Germany). Plants were dark adapted for 30 min

before measurements. The operation and analysis were per-formed according to the manufacturer’s instructions.

Map-based cloning of the y locus

Heterozygous Y/y plants were crossed with the sequenced cultivarWilliams 82, and the pale-green F1 plants were selfed to generatethe F2 mapping population. DNA was isolated from F2 individualsusing the CTAB method. Positional cloning was performed usingthe combinations of sequence length polymorphism and cleavedamplified polymorphic sequence markers provided in Table S8.The markers were designed based on the DNA sequences of Wil-liams 82 and y mutant resequencing.

RNA extraction and RT-qPCR

Total RNA was extracted using an RNAprep Pure Plant Kit (TIAN-GEN, Beijing, China) according to the manufacturer’s instructions.cDNA synthesis was performed with M-MLV reverse transcriptase(Invitrogen, Carlsbad, CA, USA) in accordance with the manufac-turer’s protocol. RT-qPCR was performed using the LightCycler480system (Roche, Mannheim, Germany) and LightCycler� 480 SYBRGreen I Master Kit (Roche) according to the manufacturer’sinstructions. The relative expression level was calculated usingthe 2�DCt method. All of the primers used for RT-qPCR are listed inTable S9.

Plasmid construction

The plasmids used for different purposes, the construct methodsand their correspondingly amplified primers are listed inTable S10. For Arabidopsis transformation, the full-length CDSs ofAtCHLI1 and GmCHLIs were amplified using specific primers, dou-ble digested with XhoI and SpeI, and ligated into the correspond-ingly digested PFGC5941 vector. Similarly, the full-length CDSs ofAtCHLI1, GmCHLIs, GmCHLDs and GmCHLHs were cloned into thePucGFP vector without stop codons for fusion with GFP for subcel-lular location and into the pGADT7 and pGBKT7 vectors for theY2H assay by digestion and ligation using the combinations ofrestriction enzymes shown in Table S10. For the BIFC assay, thefull-length CDSs of AtCHLI1, GmCHLIs, GmCHLDs and GmCHLHswithout stop codons were first introduced into the entry vectorpCRTM8/GW/TOPO� by TA cloning and then transferred to the desti-nation vectors of nEYFP/pUGW2 and cEYFP/pUGW2 by recombina-tion using LR Clonase Enzyme (Invitrogen). For the promoteractivity analysis of AtCHLIs and GmCHLIs, the inferred promoters(from ~ 3 kb upstream of the 50 UTR to ATG start codon) werecloned into the pGreen-0800-LUC vector by digestion and ligationusing the combinations of restriction enzymes shown in Table S10.

Yeast two-hybrid assay

The Y2H assay was performed using the Matchmaker GAL4 Two-Hybrid System 3 (Clontech, Palo Alto, CA, USA) according to theClontech yeast protocols handbook. The full-length GmCHLIs werefused with both the GAL4 activation domain (AD) in pGADT7 andthe GAL4 binding domain (BD) in pGBKT7. Subsequently, differentcombinations of plasmids were co-transformed into yeast strainAH109. The resulting yeast co-transformants were first screenedat 30°C on plates containing double-dropout medium (-Leu/-Trp)and then scraped onto high-stringency plates containing quadru-ple-dropout medium (-Leu/-Trp/-Ade/-His) and grown at 30°C. Theb-gal assay for quantifying protein–protein interactions was per-formed according to the manufacturer’s instructions, with slightmodifications; chlorophenol red-b-D-galactopyranoside (CPRG,Roche) was used as the substrate.

© 2016 The AuthorsThe Plant Journal © 2016 John Wiley & Sons Ltd, The Plant Journal, (2016), doi: 10.1111/tpj.13282

10 Qing Li et al.

Page 11: (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

Subcellular localization and BIFC assay

To examine subcellular localization, PucGFP or its recombinantplasmid ligated to the target gene was transformed into Arabidop-sis protoplast cells as previously described (Yoo et al., 2007), withslight modifications. After incubation in the dark at 22°C for atleast 12 h, GFP fluorescence was observed using a confocal laserscanning microscope Leica SP8 (Leica). For the BIFC assay, Xgene-nEYFP/pUGW2 and Y gene-cEYFP/pUGW2 were co-trans-formed into Arabidopsis protoplast cells following the methoddescribed above, and YFP fluorescence was observed.

Transient transcription dual-luciferase assay

A transient Dual-Luciferase assay using N. benthamiana was per-formed following a previously described method (Hellens et al.,2005), with certain modifications. Agrobacterium EHA105 contain-ing a pGreen or pGreen-PCHLIs-LUC reporter plasmid was trans-formed into N. benthamiana leaves. After 3 days of incubation, asmall amount of the infected leaves were sprayed with 1 mMluciferin (Promega, Madison, WI, USA) supplemented with 0.01%Triton X-100 and photographed using a low-light cooled CCDimaging apparatus NightOWL II LB983 with Indigo software(BERTHOLD TECHNOLOGIES, Bad Wildbad, Germany) to assessLUC expression. The remaining leaves infected with the sametype of reporter plasmid were harvested and temporarily storedin liquid nitrogen for assay using the Dual-Luciferase� Reporter(DLRTM) Assay System (Promega). In the Dual-Luciferase assay,each sample was first ground into a powder using liquid nitro-gen, and ~ 50 mg of the powder was suspended and homoge-nized in 500 lL passive lysis buffer. A 20 lL aliquot of the crudeextract was mixed with 100 lL of LUC assay buffer, and LUCactivity was measured using a GloMax�-96 microplate lumin-ometer (Promega). Then, 100 lL of Stop and Glow buffer wasadded to test REN activity. Promoter activity was calculated bythe ratio of LUC to REN activities.

RNA sequence and data analysis

Sequencing libraries were constructed as previously described(Severin et al., 2010) and sequenced using an HiSeq 2500 system(Illumina, San Diego, CA, USA). The reads were mapped to thesoybean reference genome (Wm82. a2.v1) using HISAT (2.0.0)(Kim et al., 2015). Reference genome-referred transcript assemblyand calculation of the estimated expression was achieved with theStringTie (v1.0.4) package (Pertea et al., 2015). The FPKM valuewas used to calculate transcript abundance. As the enzymesinvolved in isoflavonoid biosynthesis in soybean are multi-copyencoded, to simplify the complexity of the transcript analysis ofthese enzymes, the total FPKM values of the multiple copies wererecognized as the mRNA level for each enzyme. The correspond-ing paralogs encoding the enzymes involved in isoflavonoidbiosynthesis and their FPKM values are shown in Table S11.

Metabolome analysis

Primary metabolites were measured using gas chromatography–mass spectrometry following a previously described method(Fiehn et al., 2000), with certain modifications. The relative massof each primary metabolite was calculated using ribitol as aninternal standard, and the unit of the relative mass is presented asmicrogram per gram of dry weight (lg g�1 dry weight). Secondarymetabolites were assessed using liquid chromatography–tandemmass spectrometry as previously described (Chang et al., 2012),with certain modifications. The relative mole content of each

secondary metabolite was calculated using daidzin as an externalstandard, and the unit of the relative mole content is presented asmicromole per gram of dry weight (lmol/g dry weight).

ACKNOWLEDGEMENTS

We thank Dr Li Hsou-min (Institute of Molecular Biology, Acade-mia Sinica) for kindly providing Arabidopsis cs215 seeds. Thiswork was supported by the National Key Research and Develop-ment Program for ‘Seven Crop Breeding’ Pilot Project (Grant No.2016YFD101001), the National Natural Science Foundation ofChina (Grant Nos. 91531304 and 31525018) and the ‘Strategic Pri-ority Research Program’ of the Chinese Academy of Sciences(Grant No. XDA08020202).

CONFLICT OF INTEREST

The authors have declared that no competing interests

exist.

SUPPORTING INFORMATION

Additional Supporting Information may be found in the onlineversion of this article.Figure S1. Alignment of GmCHLI1 and related proteins.Figure S2. Duplications of GmCHLIs, GmCHLDs and GmCHLHs insoybean.Figure S3. Subcellular localization of GmCHLIs, GmCHLDs andGmCHLHs.Figure S4. Association analysis between the level of GmCHLIstransgenic expression and the chlorophyll content of transgenicplants.Figure S5. Interaction analysis among GmCHLIs by BIFC.Figure S6. Interaction analysis between GmCHLIs and GmCHLDsor GmCHLHs by BIFC.Figure S7. Interaction analysis among GmCHLDs, GmCHLHs, andGmCHLDs with GmCHLHs by BIFC.Figure S8. Relative expression of GmCHLIs, GmCHLDs andGmCHLHs in the y9 mutant.

Table S1. The gene information in map-based cloning locus.Table S2. The protein identity analysis of GmCHLIs, GmCHLDsand GmCHLHs.Table S3. The phenotype summary of the T1 transgenic plants.Table S4. The primary metabolome of y mutant seeds.Table S5. The secondary metabolome of y mutant seeds.Table S6. The primary metabolome of y mutant sprouts.Table S7. The secondary metabolome of y mutant sprouts.Table S8. Information of the molecular markers used for map-based cloning of the y locus.Table S9. Information of the RT-qPCR primers used in this study.Table S10. Information of the vectors used in this study.Table S11. Gene expression information for the enzymes involvedin the isoflavonoid biosynthesis pathway in the y mutant.

REFERENCES

Adams, K.L. and Wendel, J.F. (2005) Polyploidy and genome evolution in

plants. Curr. Opin. Plant Biol. 8, 135–141.Bai, C., Guo, P., Zhao, Q. et al. (2015) Protomer roles in chloroplast chaper-

onin assembly and function. Mol Plant, 8, 1478–1492.Bridges, C.B. (1936) The Bar ‘gene’ - A duplication. Science, 83, 210–211.Campbell, B.W., Mani, D., Curtin, S.J., Slattery, R.A., Michno, J.M., Ort, D.R.,

Schaus, P.J., Palmer, R.G., Orf, J.H. and Stupar, R.M. (2015) Identical sub-

stitutions in magnesium chelatase paralogs result in chlorophyll-deficient

soybean mutants. G3-Genes Genomes. Genetics, 5, 123–131.Chang, Y.W., Zhao, C.X., Wu, Z.M., Zhou, J., Zhao, S.M., Lu, X. and Xu,

G.W. (2012) Chip-based nanoflow high performance liquid

© 2016 The AuthorsThe Plant Journal © 2016 John Wiley & Sons Ltd, The Plant Journal, (2016), doi: 10.1111/tpj.13282

Evolution of GmCHLIs in soybean 11

Page 12: (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

chromatography coupled to mass spectrometry for profiling of soybean

flavonoids. Electrophoresis, 33, 2399–2406.Chen, X.F., Imsande, J. and Palmer, R.G. (1999) Eight new mutants at the k2

Mdh1-n y20 chromosomal region in soybean. J. Hered. 90, 399–403.Cheng, H., Yu, O. and Yu, D.Y. (2008) Polymorphisms of IFS1 and IFS2 gene

are associated with isoflavone concentrations in soybean seeds. Plant

Sci. 175, 505–512.Chu, S.S., Wang, J., Cheng, H., Yang, Q. and Yu, D.Y. (2014) Evolutionary

study of the isoflavonoid pathway based on multiple copies analysis in

soybean. BMC Genet. 15, 76.

Conant, G.C. and Wolfe, K.H. (2008) Turning a hobby into a job: how dupli-

cated genes find new functions. Nat. Rev. Genet. 9, 938–950.Cregan, P.B., Jarvik, T., Bush, A.L. et al. (1999) An integrated genetic linkage

map of the soybean genome. Crop Sci. 39, 1464–1490.Daras, G., Rigas, S., Penning, B., Milioni, D., McCann, M.C., Carpita, N.C.,

Fasseas, C. and Hatzopoulos, P. (2009) The thanatos mutation in Ara-

bidopsis thaliana cellulose synthase 3 (AtCesA3) has a dominant-nega-

tive effect on cellulose synthesis and plant growth. New Phytol. 184,

114–126.Dhaubhadel, S., McGarvey, B.D., Williams, R. and Gijzen, M. (2003) Isoflavo-

noid biosynthesis and accumulation in developing soybean seeds. Plant

Mol. Biol. 53, 733–743.Doyle, J.J., Flagel, L.E., Paterson, A.H., Rapp, R.A., Soltis, D.E., Soltis, P.S.

and Wendel, J.F. (2008) Evolutionary genetics of genome merger and

doubling in plants. Annu. Rev. Genet. 42, 443–461.Du, H., Huang, Y.B. and Tang, Y.X. (2010) Genetic and metabolic engineer-

ing of isoflavonoid biosynthesis. Appl. Microbiol. Biotechnol. 86, 1293–1312.

Fang, C., Li, C., Li, W. et al. (2014) Concerted evolution of D1 and D2 to reg-

ulate chlorophyll degradation in soybean. Plant J. 77, 700–712.Fiehn, O., Kopka, J., Trethewey, R.N. and Willmitzer, L. (2000) Identification

of uncommon plant metabolites based on calculation of elemental com-

positions using gas chromatography and quadrupole mass spectrome-

try. Anal. Chem. 72, 3573–3580.Flagel, L.E. and Wendel, J.F. (2009) Gene duplication and evolutionary nov-

elty in plants. New Phytol. 183, 557–564.Fodje, M.N., Hansson, A., Hansson, M., Olsen, J.G., Gough, S., Willows,

R.D. and Al-Karadaghi, S. (2001) Interplay between an AAA module and

an integrin I domain may regulate the function of magnesium chelatase.

J. Mol. Biol. 311, 111–122.Force, A., Lynch, M., Pickett, F.B., Amores, A., Yan, Y.L. and Postlethwait, J.

(1999) Preservation of duplicate genes by complementary, degenerative

mutations. Genetics, 151, 1531–1545.Guskov, A., Kern, J., Gabdulkhakov, A., Broser, M., Zouni, A. and Saenger,

W. (2009) Cyanobacterial photosystem II at 2.9-A resolution and the role

of quinones, lipids, channels and chloride. Nat. Struct. Mol. Biol. 16, 334–342.

Hansson, A., Kannangara, C.G., von Wettstein, D. and Hansson, M. (1999)

Molecular basis for semidominance of missense mutations in the

XANTHA-H (42-kDa) subunit of magnesium chelatase. Proc. Natl Acad.

Sci. USA, 96, 1744–1749.Hansson, A., Willows, R.D., Roberts, T.H. and Hansson, M. (2002) Three

semidominant barley mutants with single amino acid substitutions in

the smallest magnesium chelatase subunit form defective AAA(+) hex-

amers. Proc. Natl Acad. Sci. USA, 99, 13944–13949.Hellens, R.P., Allan, A.C., Friel, E.N., Bolitho, K., Grafton, K., Templeton,

M.D., Karunairetnam, S., Gleave, A.P. and Laing, W.A. (2005) Transient

expression vectors for functional genomics, quantification of promoter

activity and RNA silencing in plants. Plant Methods, 1, 13.

Horton, P. (2000) Prospects for crop improvement through the genetic

manipulation of photosynthesis: morphological and biochemical aspects

of light capture. J. Exp. Bot. 51 (Suppl 1), S475–S485.Huang, Y.S. and Li, H.M. (2009) Arabidopsis CHLI2 Can Substitute for CHLI1.

Plant Physiol. 150, 636–645.Imai, A., Komura, M., Kawano, E., Kuwashiro, Y. and Takahashi, T. (2008) A

semi-dominant mutation in the ribosomal protein L10 gene suppresses

the dwarf phenotype of the acl5 mutant in Arabidopsis thaliana. Plant J.

56, 881–890.Innan, H. and Kondrashov, F. (2010) The evolution of gene duplications:

classifying and distinguishing between models. Nat. Rev. Genet. 11, 97–108.

Kakehi, J.I., Kawano, E., Yoshimoto, K., Cai, Q.Q., Imai, A. and Takahashi, T.

(2015) Mutations in ribosomal proteins, RPL4 and RACK1, suppress the

phenotype of a thermospermine-deficient mutant of Arabidopsis thali-

ana. PLoS ONE, 10, e0117309.

Kim, S.R., Yang, J.I. and An, G. (2013) OsCpn60 alpha 1, encoding the plas-

tid chaperonin 60 alpha subunit, is essential for folding of rbcL. Mol.

Cells, 35, 402–409.Kim, D., Landmead, B. and Salzberg, S.L. (2015) HISAT: a fast spliced

aligner with low memory requirements. Nat. Methods, 12, 357–U121.

Kondrashov, F.A., Rogozin, I.B., Wolf, Y.I. and Koonin, E.V. (2002) Selection

in the evolution of gene duplications. Genome Biol. 3, RESEARCH0008.

Lichtenthaler, H.K. (1987) Chlorophylls and carotenoids – pigments of pho-

tosynthetic biomembranes. Methods Enzymol. 148, 350–382.Liu, Z., Yan, H., Wang, K., Kuang, T., Zhang, J., Gui, L., An, X. and Chang,

W. (2004) Crystal structure of spinach major light-harvesting complex at

2.72 A resolution. Nature, 428, 287–292.Long, S.P., Zhu, X.G., Naidu, S.L. and Ort, D.R. (2006) Can improvement

in photosynthesis increase crop yields? Plant, Cell Environ. 29, 315–330.

Luo, T., Luo, S., Araujo, W.L., Schlicke, H., Rothbart, M., Yu, J., Fan, T.T.,

Fernie, A.R., Grimm, B. and Luo, M.Z. (2013) Virus-induced gene silenc-

ing of pea CHLI and CHLD affects tetrapyrrole biosynthesis, chloroplast

development and the primary metabolic network. Plant Physiol. Bio-

chem. 65, 17–26.Lynch, M., O’Hely, M., Walsh, B. and Force, A. (2001) The probability of

preservation of a newly arisen gene duplicate. Genetics, 159, 1789–1804.

Masuda, T. (2008) Recent overview of the Mg branch of the tetrapyrrole

biosynthesis leading to chlorophylls. Photosynth. Res. 96, 121–143.Pertea, M., Pertea, G.M., Antonescu, C.M., Chang, T.C., Mendell, J.T. and

Salzberg, S.L. (2015) StringTie enables improved reconstruction of a

transcriptome from RNA-seq reads. Nat. Biotechnol. 33, 290–295.Qin, X., Suga, M., Kuang, T. and Shen, J.R. (2015) Photosynthesis. Struc-

tural basis for energy transfer pathways in the plant PSI-LHCI supercom-

plex. Science, 348, 989–995.Quijada, P.A., Udall, J.A., Lambert, B. and Osborn, T.C. (2006) Quantitative

trait analysis of seed yield and other complex traits in hybrid spring

rapeseed (Brassica napus L.): 1. Identification of genomic regions from

winter germplasm. Theor. Appl. Genet. 113, 549–561.Rissler, H.M., Collakova, E., DellaPenna, D., Whelan, J. and Pogson, B.J.

(2002) Chlorophyll biosynthesis. Expression of a second Chl I gene of

magnesium chelatase in Arabidopsis supports only limited chlorophyll

synthesis. Plant Physiol. 128, 770–779.Roulin, A., Auer, P.L., Libault, M., Schlueter, J., Farmer, A., May, G., Stacey,

G., Doerge, R.W. and Jackson, S.A. (2012) The fate of duplicated genes

in a polyploid plant genome. Plant J. 73, 143–153.Satou, M., Enoki, H., Oikawa, A. et al. (2014) Integrated analysis of transcrip-

tome and metabolome of Arabidopsis albino or pale green mutants with

disrupted nuclear-encoded chloroplast proteins. Plant Mol. Biol. 85, 411–428.

Sawers, R.J.H., Viney, J., Farmer, P.R., Bussey, R.R., Olsefski, G., Anufri-

kova, K., Hunter, C.N. and Brutnell, T.P. (2006) The maize Oil yellow1

(Oy1) gene encodes the I subunit of magnesium chelatase. Plant Mol.

Biol. 60, 95–106.Schmutz, J., Cannon, S.B., Schlueter, J. et al. (2010) Genome sequence of

the palaeopolyploid soybean. Nature, 463, 178–183.Schranz, M.E., Quijada, P., Sung, S.B., Lukens, L., Amasino, R. and Osborn,

T.C. (2002) Characterization and effects of the replicated flowering time

gene FLC in Brassica rapa. Genetics, 162, 1457–1468.Severin, A.J., Woody, J.L., Bolon, Y.T. et al. (2010) RNA-Seq Atlas of Gly-

cine max: a guide to the soybean transcriptome. BMC Plant Biol. 10, 160.

Shen, Y.T., Zhou, Z.K., Wang, Z. et al. (2014) Global dissection of alternative

splicing in paleopolyploid soybean. Plant Cell, 26, 996–1008.Shoemaker, R.C., Schlueter, J. and Doyle, J.J. (2006) Paleopolyploidy and gene

duplication in soybean and other legumes. Curr. Opin. Plant Biol. 9, 104–109.Suga, M., Akita, F., Hirata, K. et al. (2015) Native structure of photosystem II

at 1.95 �A resolution viewed by femtosecond X-ray pulses. Nature, 517,

99–103.Tian, Z., Wang, X., Lee, R., Li, Y., Specht, J.E., Nelson, R.L., McClean, P.E.,

Qiu, L. and Ma, J. (2010) Artificial selection for determinate growth habit

in soybean. Proc. Natl Acad. Sci. USA, 107, 8563–8568.

© 2016 The AuthorsThe Plant Journal © 2016 John Wiley & Sons Ltd, The Plant Journal, (2016), doi: 10.1111/tpj.13282

12 Qing Li et al.

Page 13: (2016) doi: 10.1111/tpj.13282 Functional ... - Geneticstianlab.genetics.ac.cn/TianLab_Publications/pdf... · Functional conservation and divergence of GmCHLI genes in polyploid soybean

Udall, J.A. and Wendel, J.F. (2006) Polyploidy and crop improvement. Crop

Sci. 46, S3–S14.Udall, J.A., Quijada, P.A., Lambert, B. and Osborn, T.C. (2006) Quantitative

trait analysis of seed yield and other complex traits in hybrid spring

rapeseed (Brassica napus L.): 2. Identification of alleles from unadapted

germplasm. Theor. Appl. Genet. 113, 597–609.Wendel, J.F. (2000) Genome evolution in polyploids. Plant Mol. Biol. 42,

225–249.Yoo, S.D., Cho, Y.H. and Sheen, J. (2007) Arabidopsis mesophyll proto-

plasts: a versatile cell system for transient gene expression analysis. Nat.

Protoc. 2, 1565–1572.

Zhang, J.Z. (2003) Evolution by gene duplication: an update. Trends Ecol.

Evol. 18, 292–298.Zhang, S.J., Zhou, H., Yu, F., Bai, C.C., Zhao, Q., He, J.H. and Liu, C.M.

(2016) Structural insight into the cooperation of chloroplast chaperonin

subunits. BMC Biol. 14, 29.

Zhong, R.Q., Morrison, W.H., Freshour, G.D., Hahn, M.G. and Ye, Z.H. (2003)

Expression of a mutant form of cellulose synthase AtCesA7 causes domi-

nant negative effect on cellulose biosynthesis. Plant Physiol. 132, 786–795.Zhou, Z., Jiang, Y., Wang, Z. et al. (2015) Resequencing 302 wild and culti-

vated accessions identifies genes related to domestication and improve-

ment in soybean. Nat. Biotechnol. 33, 408–414.

© 2016 The AuthorsThe Plant Journal © 2016 John Wiley & Sons Ltd, The Plant Journal, (2016), doi: 10.1111/tpj.13282

Evolution of GmCHLIs in soybean 13