JXB Advance Access published online on October 3, 2008
Journal of Experimental Botany, doi:10.1093/jxb/ern239
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
RESEARCH PAPER |
Variation of metabolic profiles in developing maize kernels up- and down-regulated for the hda101 gene
1Dipartimento di Chimica, Università degli Studi di Roma La Sapienza, Piazzale Aldo Moro 5, 00185-Roma, Italia
2CRA-Unità di Ricerca per la Maiscoltura, Bergamo, Italia
* To whom correspondence should be addressed: E-mail: c.manetti{at}caspur.it
Received 12 June 2008; Revised 1 August 2008 Accepted 14 August 2008
| Abstract |
|---|
|
|
|---|
To shed light on the specific contribution of HDA101 in modulating metabolic pathways in the maize seed, changes in the metabolic profiles of kernels obtained from hda101 mutant plants have been investigated by a metabonomic approach. Dynamic properties of chromatin folding can be mediated by enzymes that modify DNA and histones. The enzymes responsible for the steady-state of histone acetylation are histone acetyltransferase and histone deacetylase (HDA). Therefore, it is interesting to evaluate the effects of up- and down-regulation of a Rpd-3 type HDA on the development of maize seeds in terms of metabolic changes. This has been reached by analysing nuclear magnetic resonance spectra by different chemometrician approaches, such as Orthogonal Projection to Latent Structure-Discriminant Analysis, Parallel Factors Analysis, and Multi-way Partial Least Squares-Discriminant Analysis (N-PLS-DA). In particular, the latter approaches were chosen because they explicitly take time into account, organizing data into a set of slices that refer to different steps of the developing process. The results show the good discriminating capabilities of the N-PLS-DA approach, even if the number of samples ought be increased to obtain better predictive capabilities. However, using this approach, it was possible to show differences in the accumulation of metabolites during development and to highlight the changes occuring in the modified seeds. In particular, the results confirm the role of this gene in cell cycle control.
Key words: Metabolomics, metabonomics, multi-way analysis, NMR, Rpd3, Zea mays
| Introduction |
|---|
|
|
|---|
Eukaryotic genes are regulated by a complex interplay of transcriptional factors and chromatin proteins that pack chromosome DNA into the confined space of the nucleus, while preparing genes for activation or repression (Kadonaga, 1998). Evidence suggests that various levels of chromatin folding ensure the organization of DNA into a tightly packaged environment, which must be highly flexible to switch between repressive condensed to active accessible states. This dynamic property of chromatin can be mediated by enzymes that modify DNA and histones (Fischle et al., 2003).
A variety of post-translational modifications of histones has been identified, including acetylation, methylation, phosphorylation, and ubiquitination (Peterson and Laniel, 2004). Among different histone modifications, acetylation of N-terminal lysine residues correlates with transcriptional activation (Shahbazian and Grunstein, 2007). In addition, acetylation is frequently associated with other histone marks, thus forming specific histone modification patterns. These patterns constitute the histone code, which is written in response to intra- and extracellular signals, by enzymes that modify histones in specific amino acid residues and is interpreted by regulatory factors that regulate the chromatin structure to modulate gene and genome activity (Jenuwein and Allis, 2001; Berger, 2007).
The enzymes responsible for the steady-state of histone acetylation are histone acetyltransferases (HATs) and histone deacetylases (HDACs). These enzymes are members of distinct gene families and exist as multiprotein complexes (Carrozza et al., 2003; Thiagalingam et al., 2003). They can be targeted to specific promoters through interaction with sequence-specific transcription factors to modify both histones and non-histone proteins locally and can also act globally to modulate the turnover of histone acetylation throughout the genome (Pfluger and Wagner, 2007). HATs, HDACs, as well as other factors involved in the modulation of chromatin structure, are highly conserved in eukaryotes, including plants (Loidl, 2004). However, the peculiarities of plant development and the response to environmental cues can result in marked differences, including the presence of plant-specific HDACs and distinct regulatory mechanisms involved in the establishment and maintenance of epigenetic information. In the genome of different plant species, several potentially functional HDACs have been identified and classified into three distinct families: (i) the Rpd3/Hda1 super-family, (ii) the Sir2-related, and (iii) the plant-specific HD2-like HDACs (http://www.chromdb.org; Pandey et al., 2002). The characterization of Arabidopsis (Arabidopsis thaliana) HDAC mutants revealed that members of different classes and within the same group have evolved specific functions (Probst et al., 2004; Tian and Chen, 2001; Zhou et al., 2005; Long et al., 2006). Nevertheless, many aspects of the HDACs involvement in plant development, as well as the mechanisms responsible for HDAC-mediated control of gene/genome activity, remain elusive.
To date, 15 genes encoding putative HDACs (i.e. 10 Rpd3/Hda1-, 1 Sir2-, and 4 HD2-like genes; http://www.chromdb.org) have been identified in the maize genome, and members of all three HDAC families have been biochemically characterized (Lusser et al., 2001). Studies of maize Rpd3-type HDACs function has revealed that members of this family are differentially expressed during plant development and can physically interact with the maize retinoblastoma-related protein, a key regulator of cell cycle progression (Rossi et al., 2003; Varotto et al., 2003). Furthermore, in cereals, the role of these enzymes in controlling cell division was confirmed by the finding that over-expression of a rice (Oryza sativa) Rpd3 gene leads to alterations in growth rate and plant architecture (Jang et al., 2003). More recently, Rossi et al. (2007) used maize plants with specific up- and down-regulation of hda101 expression to characterize functionally a member of the maize Rpd3-type HDAC family, i.e the hda101 gene. Their results indicated that gene expression, including the transcription of important regulators of meristem function and of vegetative to reproductive transition, was affected, suggesting a role of hda101 in modulating plant development, genome activity, and the modification of histone marks. Collectively, the results from the functional characterization of HDA101 indicate that this enzyme affects, either directly or indirectly, the expression of genes involved in various metabolic pathways.
To shed light on the specific contribution of HDA101 in modulating metabolic pathways in the maize seed, changes in the metabolic profiles of kernels obtained from hda101 mutant plants have been investigated. In the field of metabolomics, the analysis of metabolic changes in time is a fundamental aspect of understanding the biochemical response of an organism to an external perturbation (Lindon et al., 2001). As processes develop through time, the metabolic responses also exhibit dynamic variation. Therefore, monitoring these changes results in characteristic patterns for each type of perturbation. Principal component trajectories have been constructed from Nuclear Magnetic Resonance (NMR) data to investigate the changing multivariate biochemical profile during the development of a toxic lesion (Keun et al., 2004). However, this kind of analysis, although effective for trajectory analysis, is not suitable for the simultaneous comparison of several parallel systems, and thus the use of alternate multi-way tools for optimally extracting metabolic trajectory and biomarker information have been investigated (Antti et al., 2002; Dyrby et al., 2005a). Multi-way analysis (Bro, 1997) of NMR data is the extension of the traditional multivariate analysis, already applied to metabonomic studies of maize (Manetti et al., 2004, 2006) and permits the direct study of development through time.
Among the existing procedures, several approaches were used in this study to distill information from the large amount of experimental data. As a first step, PARallel FACtor analysis (PARAFAC) was chosen (Bro, 1997); this is a multi-way decomposition method, which can be compared to bi-linear Principal Component Analysis (PCA), or rather it is one generalization of bi-linear PCA.
The important difference between PCA and PARAFAC is that in PARAFAC there is no need for orthogonality to identify the model. Furthermore, the PARAFAC model has the advantage of being a unique solution (Bro, 1997), an important characteristic when the dynamics of changes are studied. PARAFAC was applied successfully in many areas, ranging from the monitoring of the photocatalytic degradation of phenol in aqueous suspensions of TiO2 by fluorescence spectroscopy (Bosco et al., 2006) to the quantification of nitrite in water and meat samples by kinetic-spectrophotometric analysis (Niazi et al., 2005), and to the prediction of sensory qualities of different potatoes (Povlsen et al., 2003).
Subsequently, multi-way Partial Least-Squares-Discriminant Analysis (N-PLS-DA) was applied; this is the regression method for the analysis of a higher order array (Bro, 1996). As for the traditional two-way PLS, it searches for a compromise between better fit (i.e. less error in describing the array of the independent variables) and better prediction (i.e. less error in evaluating the response space). N-PLS has been applied successfully in many areas, ranging from the analysis of food characteristics by fluorescence (Christensen et al., 2006) and gas chromatography (Guimet et al., 2005; Durante et al., 2006), to the simultaneous determination of ions by electrochemical sensors (Chow et al., 2006). Recently, the use of N-PLS in the quantification of lipo-protein fractions obtained by 2-D diffusion-edited NMR spectra has been evaluated (Dyrby et al., 2005b).
In summary, in this paper, the application of multi-way methods to NMR spectra to evaluate the effect of the up- and down-regulation of HDA101 activity in terms of metabolite concentrations during maize seed development is described, comparing the two approaches one with the other. In addition, the results are compared with those of Orthogonal Projection to Latent Structure-Discriminant Analysis (OPLS-DA), an abundantly used bivariate approach in the metabonomic field (Cloarec et al., 2005a; Trygg and Wold, 2002).
| Materials and methods |
|---|
|
|
|---|
Plant material
Kernel samples from the B73 inbred lines (WT) and two early isogenic versions of this inbred, containing a modified ZmRpd3-101 maize gene (Rossi et al., 1998), in sense and antisense orientation (OE1 and AS33), respectively, were used in this study. A detailed description of the origin of the transgenic lines was recently reported by Rossi et al. (2007) and a summary will be given here. Briefly, the constitutive maize ubiquitin promoter (Christensen et al., 1992) was cloned upstream or downstream of the full-length hda101 cDNA sequence, into the pGEM3-ZmRpd31 plasmid (Rossi et al., 2003), to obtain sense and antisense hda101 constructs, respectively. The polyadenylation domain of the nopaline synthase gene was inserted opposite to the ubiquitin promoter. The resulting cassettes were used to generate the pRpd3-53 and pRpd3-35 plasmids. These plasmids were employed to transform protoplasts from the maize suspension cell line HE-89 (Morocz et al., 1990) using the PEG method. Regenerated T0 plants were converted to the B73 inbred by two backcrosses, to minimize a mixed genetic background influence, and selfed twice to obtain homozygous plants, which were used for analysis. The presence of the transgenes in plants was assessed for resistance to gluphosinate using PCR screening.
For the current study two lines were chosen, i.e. AS33 and OE1, which displayed the most pronounced differences in the levels of both hda101 mRNA and protein compared with the wild type; differences in hda101 expression was observed in seedlings, developing ears, and kernels harvested at different developmental stages (Rossi et al., 2007).
Plants of the inbred line B73 and its transgenic versions were grown in experimental plots under containment, according to guidelines of the Italian laws for bio-safety. At flowering, plants were self-pollinated; and a minimum of six well-filled ears of each genotype were harvested at four stages of kernel development, i.e. 8, 13, 18, and 23 days after pollination (DAP) and frozen immediately in liquid N2. Kernels harvested at physiological maturity were also used in this study. For each genotype, the kernel samples were conserved in sealed plastic bags at –80 °C until NMR analysis.
NMR sample preparation
For each genotype, a sample of single maize seed was weighed and then frozen in a stainless steel mortar using liquid N2, before being pulverized to a fine powder with a pestle chilled in liquid N2 and maintained in a liquid N2 bath during the pulverization procedure.
Three ml of methanol/chloroform mixture (2:1 v/v) were added to the entire pulverizated sample. The powder was stirred and 1 ml of chloroform and 1.2 ml of water were added (Bligh-Dyer modified; Miccheli et al., 1988; Ricciolini et al., 1994). The sample was stored at 4 °C for 1 h and then centrifuged at 10 000 g for 20 min at 4 °C. The resulting upper hydro-alcoholic and lower chloroformic phases were separated. The extraction procedure was performed twice, once on the powder and once on the pellet, in order to obtain a quantitative extraction. After the second extraction, the two hydro-alcoholic phases obtained were pooled, dried under N2 flux, and stored at –80 °C prior to analysis.
NMR data collection
To obtain NMR spectra, the dried sample was dissolved in 1 ml of 0.5 mM TSP (sodium salt of 3-(trimethylsilyl)propionic-2,2,3,3-d4 acid) solution in D2O PBS buffer (pH 7.4) to avoid chemical-shift changes due to pH variation. The dissolved extracts were transferred to a 5 mm NMR tube.
NMR spectra were recorded on a Bruker (Bruker GmbH, Rheinstetten, Germany) DRX 500 spectrometer, operating at 1H frequency of 500.13 MHz. Single pulse spectra were acquired using a solvent pre-saturation pulse sequence to suppress residual water resonances, so that it was possible to evaluate signals near the solvent region better. Possible bias was taken into account in the post-processing procedure, eliminating variables (i.e. buckets, see below) corresponding to the water region. Spectra were obtained at T=27 °C, 256 scans were acquired, with data collected into 64 k data points, and a spectral width of 12 ppm, using a 20 s delay for a full relaxation condition. Prior to Fourier transformation, an exponential multiplication was performed, using a line broadening equal to 0.09 Hz.
The spectra were phased, baseline corrected using the usual ACD routines (Advanced Chemistry Development Inc., 90 Adelaide Street West, Toronto, Ontario, M5H 3V9, Canada), and they were referenced to TSP for chemical shift (0.00 ppm).
NMR data pre-processing treatment
The 1H spectra were reduced to 499 buckets to produce a matrix of sequentially integrated regions of 0.02 ppm in width between –0.5 ppm and 9.5 ppm, using ACD software: column 1 corresponds to the bucket –0.5 ppm to –0.48 ppm. The spectra were normalized according to the area of the TSP peak, fixed to 10. The region between 4.7 ppm and 4.9 ppm was removed to eliminate baseline effects due to presaturation of the water signal.
Statistical analysis
Orthogonal Projection to Latent Structure-Discriminant Analysis (OPLS-DA):
The reduced and normalized NMR spectral data were inported into Matlab (version 7.4, The Mathworks, Natick, MA).
OPLS-DA has gained increasing interest in the metabonomic field, due to the possibility of obtaining models that allow a thorough interpretation of the results. This is achieved by a separate modelling of predictive (variation of interest) and class-related (variation not related to the responses) variation in the X-matrix (the spectral descriptors being metabolite concentrations or buckets) through the identification of Y-orthogonal variation (Bylesjö et al., 2007) (Y being the matrix made of the features of interest such as treatment classes).
In this case, X has been considered as a matrix containing the bucketed spectra and Y as a dummy matrix containing information on the seed genotype. In particular, this input matrix has a row, for each sample, containing 1 for the y variable corresponding to the right group and 0 for all the others.
Data have been mean centred and unit variance scaled before investigation and analysed using in-house routines. Note that this approach collapses the time dimension by concatenating the NMR spectra. In this way, explicit reference to time is lost while this dimension can be crucial to study a developmental process.
Multi-way analysis:
In order to include the time dimension, data were arranged in a three-way data box with kernels as mode 1, bucketed NMR spectra as mode 2, and time as mode 3, giving a data cube of dimension 12x484x5, as it consists of 4 replicatesx3 genotypesx5 growth stages as rows and 484 NMR regions as variables.
Matlab was equipped with the N-way Toolbox version 3.0 (obtained from R Bro at http://www.models.life.ku.dk/source; Andersson and Bro, 2000).
Prior to analysis, the data were mean centred and scaled at unit variance across the sample mode.
Initially, data were analysed by an unsupervised multi-way approach, such as PARallel FACtor analysis (PARAFAC), where decomposition of the data is made into triads or trilinear components. Instead of one score vector and one loading vector as in bilinear PCA, each component consists of one score vector and two loading vectors. A PARAFAC model of a three-way array is given by three loading matrices, A, B, and C with typical elements aif, bjf, and ckf and it is defined by the structural model
|
|
The second multi-way approach used was multi-way partial least-squares (N-PLS), which allows the regression of NMR data against classes present in the data set. For this procedure also, time was explicitly included and data were arranged in a box, putting behind each other data sheets ordered in time. A Y matrix was constructed, where each column defines a group that corresponds to the genotypes of the kernels, whose values are dummy variables. In particular, the column contains 1 for the samples belonging to the group and zeros for all the others. The method could therefore be defined as N-PLS-DA. Figure 1 shows a schematic representation of the strategy that has been applied here.
|
As a result of the N-PLS-DA model, the X array is decomposed in terms of a scores matrix (T) relative to the sample mode, and two weights matrices (WJ, WK), relative to the NMR and time modes, respectively, while the Y array is decomposed in terms of a score matrix (U) relative to the first mode, and one loadings matrix QM relative to the second mode. The expression relating the two decomposition models of X and Y is:
|
|
Prior to analysis, the data were mean centred and scaled at unit variance across the sample mode. Following the approach applied by Durante et al. (2006), the model was validated using a leave-one-genotype-out-at-a-time: samples belonging to the same genotypes were always left in the same validation segment.
| Results |
|---|
|
|
|---|
Transgenic plants with up- and down-regulation of hda101 expression
Maize plants with up- and down-regulation of hda101 transcription were generated by transformation with plasmids that constitutively over-express hda101 sense (OE) or antisense (AS) transcript (Rossi et al., 2007). A phenotypic characterization of the two transgenic progenies of plants revealed that the perturbation of hda101 expression induces alterations in plant growth and development, including kernel size (Table 1). In particular, the kernel size, measured as the 100-kernel weight, was definitively smaller in OE1 mature kernels (16.5±0.4 g) compared with wild-type (27.9±1.4 g) and AS33 (22.5±1.0 g).
|
Evaluation of 1H NMR spectra of maize seed
In the current study, the 1H-NMR spectra of hydro-alcoholic extracts of maize kernels derived from AS33, OE1, and WT lines and harvested at five different developmental stages have been analysed. The stages investigated reflect: (i) 8 DAP, corresponding to the embryogenesis stage, when the embryo differentiates; (ii) 13 DAP, corresponding to the initial stage of maturation characterized by cell expansion; (iii) 18 DAP, corresponding to the central phase of kernel maturation when the maximum accumulation of starch and storage proteins occurred; (iv) 23 DAP, corresponding to the end phase of maturation, when their water content starts to decrease; and (v) mature seeds. The assignment of the peaks was achieved in data based on the literature (Fan, 1996; Le Gall et al., 2003) and on 2D spectra, previously obtained for maize seed samples in our laboratory (Manetti et al., 2006).
In Fig. 2, NMR spectra for all the developmental stages of the WT seeds are reported. The list of the assigned metabolites is reported in Table 2. A number of assignable amino acids, organic acids, and sugars were identified. In particular, the following metabolites dominated the spectra: (i) amino acids such as threonine, alanine, glutamate, glutamine, aspartate, asparagine, tyrosine, phenylalanine, valine, isoleucine, and acid
-amino butyric acid (GABA); (ii) organic acids such as pyruvate, succinate, 3-hydroxybutyrate, and choline; and (iii) sugars such as
- and β-glucose and sucrose. Significant ratios compared with the wild type were observed in the transgenic kernels for several metabolites during the different stages of sampling. The differences of the metabolite levels in transgenic compared to wild-type kernels are shown in Table 3 and Table 4 and are particularly evident in mature kernels. Specifically, down-regulation of hda101 (AS33 line) gave rise to a reduction in the level of several metabolites (Table 3). In fact, these seeds were associated with a decrease in the level of sugars, organic acids such as acetate, 3-hydroxybutyric acid, and succinate, and amino acids such as asparagine, glutamine, isoleucine, valine, and GABA. Considering the TCA-cycle intermediates, succinate was the only metabolite that displayed a major reduction in content. Kernels derived from plants with up-regulation of hda1 (OE1 line) were generally associated with a significant accumulation of the metabolite concentrations with respect to wild type, ranging from 1.3–4.3-fold, with the exception of 3-hydroxybutyric acid, acetate, and formate for which the ratios, are <1 (Table 4). In detail, there was a significant accumulation in the content of most of the amino acids and, more notably, in the non-proteogenic amino acid GABA. Considering glycolysis and the TCA-cycle, the intermediates pyruvate, succinate, and malate, were the only metabolites that displayed a considerable increase in their content. The amount of many metabolites was also altered in AS33 and OE1 kernels harvested at different developmental stages, although the number of assignable metabolites showing variation is smaller than the one observed in mature kernels.
|
|
|
|
Analysis by OPLS-DA
To extend the knowledge obtained by univariate analysis (i.e. variations in levels of single metabolites) on the dynamics of processes and on differences among genotypes, a metabonomic approach was applied. As a first step, data were investigated by OPLS-DA, to remove variations in X (NMR spectra) unrelated to Y (genotype). The resulting model was constituted by ten latent variables explaining 66.12% of X-variance and 72.72% of Y-variance. All three genotypes formed distinct clusters in the space spanned by the scores of the first two latent variables (Fig. 3). Furthermore, the orthogonal matrix obtained by the analysis was investigated by PCA: it contains characteristics correlated to the developing process, as shown by the score plot (Fig. 4). The samples are ordered according to their development stage on the second principal component, even if the data matrix does not explicitly take time information into account.
|
|
Analysis by multi-way methods
To extend the analysis, a multi-way approach was chosen to investigate the data set by considering the characteristics of the system (Castro and Manetti, 2007), containing two factors that simultaneously change (i.e. genotype and kernel development).
PARAFAC analysis was carried out on the array containing the NMR bucketed spectra. As a result, a model with two latent variables was obtained, explaining 67.25% of variance. In Fig. 5, a score plot is shown. It is noticeable that both factors manage to discriminate between OE1 derived kernels and those of the other two genotypes.
|
N-PLS-DA regression was developed between the array containing the bucketed NMR spectra and a dummy matrix containing information about samples groups. As a result, two factors were found to be significant, explaining 46.33% of X variance and 77.63% of Y variance. The total explained variation of X for this model is relatively low because many regions of the spectra contained only noise, and the autoscaling of the corresponding variables contributes to increasing the random variance, which is impossible to model. However, 76% of Y variance can be related to this variation in X.
In Fig. 6, the score plot is shown. It is evident that it is possible to separate the three genotypes. In particular, it is noticeable that the first factor manages to discriminate between OE1 derived kernels, characterized by positive values of the scores, and those of the other two genotypes, with negative values. Furthermore, the second factor allows the discrimination between the wild-type kernels, which have positive values, and the two genetically modified kernels with negative values.
|
In Fig. 7, the plot of the loadings corresponding to the NMR mode is shown. For the first latent variable, positive values are obtained for NMR descriptors, corresponding to all the resonances of sucrose,
and β-glucose, and choline. For the second latent variable, alanine, glutamine, and sucrose have positive loading values, while
and β-glucose and guanine have negative ones.
|
The efficacy of the method in describing a dynamic trajectory is shown in Fig. 8, where the loadings corresponding to the time mode are plotted; this analysis shows the discriminant trajectories among the genotypes and this is easily correlatable to the evolution of the sysyem in time as seen by NMR spectra (Fig. 2).
|
| Discussion |
|---|
|
|
|---|
In this study the metabolic profiles of three maize genotypes were compared to highlight the consequences of the up- and down- regulation of hda101 gene in developing kernels. The discrimination among the genotypes was obtained, taking into account the growth pattern of the three groups.
Data have been analysed by different multivariate analysis methods. In particular, N-PLS-DA has been applied, considering as the response block a dummy matrix, which records samples membership, and obtaining a model based on the discriminating characteristics among the groups analogous to the PLS-Discriminant Analysis (PLS-DA) approach, that is abundantly used in metabonomics. The advantage of a multi-way analysis in comparison to the two-way multivariate technique is not to obtain a better fit, but rather to obtain more adequate, robust, and interpretable models (Bro, 1997). In fact, the constructed models integrate the information contained in the entire structure of the multi-way array, without any loss due to the unfolding procedure.
Our aim is the analysis of changes in the developing process due to a genetic modification perturbing the expression of genes involved in various metabolic pathways by a change in the organization of histone.
The chosen approach is based on the study of metabolism through different stages of development by an NMR technique, whose non-targeted characteristics allow the observation of variations on the entire bucketed spectra with hundreds variables for each spectra. The comparison between variable and sample numbers could suggest that it is impossible to build a model suitable for prediction and not only for summarizing huge data sets. However, it is crucial to keep in mind that high correlation exists among many variables (spectrum buckets). In fact, NMR sensibility limits the observable metabolites to some tens, and the spectra are constituted by many signals arising from different parts of the same molecule, signals that are a duplication of information. In fact, correlation analysis of these signals, highlighted by specific pulse sequences (COrrelation SpectroscopY, COSY; Total Correlation SpectroscopY, TOCSY), allows spectral assignment: this purely spectroscopic approach found a chemometrician equivalent (Statistical TOtal Correlation SpectroscpY, STOCSY) when the metabolomic approach tried to highlight correlations among metabolites due to system biochemistry, i.e. metabolic pathways (Cloarec et al., 2005b).
At the beginning, the metabolomic approach was mainly focused on classification problems, i.e. the characterization of healthy versus ill patients (score plot analysis) (Nicholson et al., 2002), nowadays, this approach is extended to metabolic trajectories study by the analysis of original variable contributions to the definition of the new latent variables (loading analysis) driving the observer to go in-depth to the metabolic significance of data (Manetti et al., 2006; Coen et al., 2008).
This second aim is less correlated to the predictive capabilities of the model, but it highlights some of the discriminant characteristics among sample classes. The classification obtained can be driven by multiple regression techniques (e.g. PLS-DA) that give the possibility of observing the correlation structure from the desired point of view. Analysis of the loading obtained (Fig. 7) allows correlation among metabolites: this correlation is not always direct, but can be guaranteed by the metabolites involved in other pathways. For these reasons, in some cases the perturbation of the system can be detected observing that the factor loading of some metabolites have significantly different values, i.e they do not correlate with the same component in the control samples and in the perturbed ones. This variation can be concerted, underlying the invariant characteristics of the relation between metabolites that is resistant to the perturbation: groups of metabolites keep their correlation with the same component, while other do not keep it. In other cases, the signature of the perturbation can be revealed by a change of the sign of factor loading: metabolites positively correlated in the control samples have loading with the opposite sign in the perturbed ones (Camacho et al., 2005; Manetti et al., 2006).
When the metabonomic data set contains data collected at different times for different sample types, traditional multivariate analysis increases the sample dimension of the unit/variable matrix but it distorts the multivariate space spanned by the latent variables, mixing up the characteristics due to sample mode with those due to time mode. OPLS-DA tries to solve this problem by introducing a dimension where the uninteresting variation can be projected and treated as noise.
In the N-way approach, time is explicitly considered and it generates more interpretable trajectories and is able clearly to represent the analysed samples whose characteristics have been known since the beginning.
It is important to underline that, in biological contexts, the number of samples is often limited and for this reason it is important not to forget any possible overfitting problems and not to stress the predictive capabilities of the models. For this reason, in this study, a non-supervised multi-way approach (PARAFAC) has also been used, in addition to the supervised one, confirming the results but at the same time showing that N-PLS-DA allows a better interpretation. In PARAFAC, no class information is presented to the model at the beginning, i.e. no Y matrix is defined.
Note that the estimated N-way model cannot be rotated without a loss of fit, as opposed to two-way analysis where one may rotate scores and loadings without changing the fit of the model: consequence of this characteristic (uniqueness of the solution) is more interpretable latent variables (Bro, 1997).
The peculiar dynamic in the early development of maize seeds comes out as being characteristic of the perturbation due to biological processes related to the different traits of the system (up- and down-regulation of hda101). In fact, putting together the results of score plot and loading plot analysis (Figs 6, 8), it is possible to observe that the factor distinguishing OE1-derived kernels from those of the other two genotypes has a time loading with different characteristics from the time loading of the second latent variable that discriminates between WT and modified kernels.
From the comparison of the two loadings, it is possible to say that the early stages of development are influenced by the introduction of up- and down-regulation of hda101. This can constitute a framework where results of univariate analysis can be collocated.
Our results indicated that kernels obtained from hda101 over-expressing plants (OE1 line) exhibit a tendency to accumulate several metabolites compared with the levels observed in wild-type kernels (Tables 3, 4). On the contrary, kernels with down-regulation of hda101 expression (AS33) show a tendency to a general decrease of metabolite levels. These different behaviours may be related to the involvement of hda101 in controlling cell cycle progression, probably reflected in an alteration of the kernel size in hda101 mutants in comparison to the wild type. In this respect, previous studies (Rossi et al., 2003; Varotto et al., 2003) have pointed out the involvement of hda101 in cell cycle control, suggesting that hda101 physically interacts with the maize homologues of the mammalian retinoblastoma protein, a key regulator of G1/S transition (Harbour and Dean, 2000). Therefore, it may be predicted that the up-regulation of HDA101 activity in the kernels of OE1 line induces aberrant effects on the normal progression of the cell cycle and, in particular, a general reduction of the cell number in developing seeds. This appears related to the decrease of the 100-kernel weight (Table 1) observed in the OE1 line compared to wild type (Rossi et al., 2007) and to the profiles of the loadings relative to the time mode obtained by N-PLS-DA analysis (Fig. 8). The general repression of cell cycle progression may also result in a reduced capacity of the metabolites utilization and transformation in the various metabolic pathways, leading to increased levels of metabolite accumulation observed in OE1 kernels. Conversely, the down-regulation of HDA101 activity in AS33 kernels may induce an increase in the rate of metabolites utilization to sustain the increased cell cycle activity. However, additional efforts are required to clarify this point. A similar general effect, determining an overall slow-down of early kernel development, was also observed by the over-expression of the activator ZmOCL1, a member of the ZmOCL (Outer Cell Layer) family, encoding putative transcription factors of the HD-ZIP IV class (Khaled et al., 2005). In this context, the fact that HDA101 is usually considered as a repressor of gene transcription (Rossi et al., 2007; Shahbazian and Grunstein, 2007), suggests that the opposite tendency of change in the metabolite levels in OE1 and AS33 kernels is due to opposite effects on the regulation of transcription of genes encoding for key regulators of metabolite synthesis and/or utilization.
Furthermore, these results reveal that the switch between the prestorage or cell division phase (8 DAP) to the storage or differentiation phase and mature seed is also accompanied by a switch from a much reduced hexose (
and β-glucose) and sucrose ratio to a high sucrose/hexose ratio within the embryo in the AS33 kernels to an increase of these metabolites in the OE1 kernels, in comparison to wild-type kernels (Tables 3, 4). It has been shown that a high hexose/sucrose ratio favours cell division whereas a high sucrose/hexose ratio favours differentiation to storage parenchyma cells. N-PLS-DA analysis also shows that sugars are key metabolites in discriminating among the three genotypes (Fig. 7). In fact, sugars such as glucose and sucrose can act as signals to trigger changes in gene expression in plants (Lam et al., 1998) and post-transcriptional modification of proteins (Cotelle et al., 2000) associated with nitrogen metabolism. These results have implicated a model in which genes involved in C and N metabolism are cis-regulated by both C and N signals (Coruzzi and Bush, 2001; Coruzzi and Zhou, 2001). Moreover, Price et al. (2004) found that glucose regulates a broader range of genes, incuding genes associated with carbohydrate metabolism, signal transduction, and metabolite transport. In addition, a large number of stress-responsive genes were also induced by glucose, indicating a role of sugar in environmental responses. It was also revealed that significant interactions exist between glucose and nitrogen in regulating gene expression, since glucose can modulate the effects of nitrogen and vice-versa.
A further observation which originates from the metabolic experiments was a significant variation in alanine content (Tables 3, 4; Fig. 7). This appears from our analyses one of the most important metabolites for discriminating among the three genopypes investigated here. In this context, it was shown that synthesis of alanine may occur at the expense of the amino acids, glutamate and aspartate (Stewart and Larher, 1980) and occurs concomitantly with the accumulation of 4-aminobutyrate or GABA (Ratcliffe, 1995), mediated by the enzyme alanine aminotransferase. This enzyme belongs to a pyridoxal phosphate multigene family widely distributed in animals, plants, algae, yeast, and bacteria (Vedavathi et al., 2004). It catalyses transamination reactions using several amino donor:acceptor combinations, including the reversible transfer of an amino group from glutamate to pyruvate to form 2-oxoglutarate and alanine (Ricoult et al., 2006). The observations are consistent with our finding and with that of Rossi et al. (2007) who reported from a functional analyses of the genotypes investigated here an appreciable alteration in expression level of the gene encoding alanine aminotransferase among the same genotypes investigated in this study.
In conclusion, this study shows results that clearly reveal the influence of hda101 on maize metabolism, and at the same time it underlines the possibilities of multivariate analysis, used not to build a predictive model, but to describe the evolution of the system. Obviously, in this frame, multivariate analysis is a useful tool for physiologists, as they can synthesize experimental information in an interpretable form, that increases knowledge on the process studied in a way not possible by a traditional univariate approach.
| Acknowledgements |
|---|
The authors want to thank Professor ME Di Cocco and Dr L Casciani for their contribution in the discussion of the NMR data. This work was supported by Ministero delle Politiche Agricole, Alimentari e Forestali, Rome – Italy, special grant: ZEAGEN'. Furthermore, the authors thank the anonymous referees whose suggestions contributed to the improvement of the paper.
| References |
|---|
|
|
|---|
Andersson CA, Bro R. The N-way toolbox for MATLAB. Chemometrics and Intelligent Laboratory Systems (2000) 52:1–4.[CrossRef][Web of Science]
Antti H, Bollard ME, Ebbels T, Keun H, Lindon JC, Nicholson JK, Holmes E. Batch statistical processing of 1H-NMR-derived urinary spectral data. Journal of Chemometrics (2002) 16:461–468.[CrossRef][Web of Science]
Berger SL. The complex language of chromatin regulation during transcription. Nature (2007) 447:407–412.[CrossRef][Web of Science][Medline]
Bosco MV, Garrido M, Larrechi MS. Determination of phenol in the presence of its principal degradation products in water during a TiO2-photocatalytic degradation process by three-dimensional excitation–emission matrix fluorescence and parallel factor analysis. Analytica Chimica Acta (2006) 559:240–247.[CrossRef][Web of Science]
Bro R. Multi-way calibration. Multilinear PLS. Journal of Chemometrics (1996) 10:47–61.[CrossRef][Web of Science]
Bro R. PARAFAC. Tutorial and applications. Chemometrics and Intelligent Laboratory Systems (1997) 38:149–171.[CrossRef][Web of Science]
Bylesjö M, Rantalainen M, Cloarec O, Nicholson JK, Holmes E, Trygg J. OPLS discriminant analysis: combining the strength of PLS-DA and SIMCA classification. Journal of Chemometrics (2007) 20:341–351.[CrossRef][Web of Science]
Camacho D, de la Fuente A, Mendes P. The origin of correlations in metabolomics data. Metabolomics (2005) 1:53–63.[CrossRef][Web of Science]
Carrozza MJ, Utley RT, Workman JL, Cote J. The diverse functions of histone acetyltransferase complexes. Trends in Genetics (2003) 19:321–329.[CrossRef][Web of Science][Medline]
Castro C, Manetti C. A multi-way approach to analyze metabonomic data: a study of maize seeds development. Analytical Biochemistry (2007) 371:194–200.[CrossRef][Web of Science][Medline]
Chow E, Ebrahimi D, Gooding JJ, Hibbert DB. Application of N-PLS calibration to the simultaneous determination of Cu2+, Cd2+ and Pb2+ using peptide modified electrochemical sensors. Analyst (2006) 131:1051–1057.[CrossRef][Medline]
Christensen AH, Sharrock RA, Quail PH. Maize polyubiquitin genes: structure, thermal perturbation of expression and transcript splicing, and promoter activity following transfer to protoplasts by electroporation. Plant Molecular Biology (1992) 18:675–689.[CrossRef][Web of Science][Medline]
Christensen J, Nörgaard L, Bro R, Engelsen SB. Multivariate autofluorescence of intact food systems. Chemistry Reviews (2006) 106:1979–1994.[CrossRef]
Cloarec O, Dumas ME, Craig A, et al. Statistical total correlation spectroscopy: an exploratory approach for latent biomarker identification from metabolic 1H NMR data sets. Analytical Chemistry (2005b) 77:1282–1289.[Medline]
Cloarec O, Dumas ME, Trygg J, Craig A, Barton RH, Lindon JC, Nicholson JK, Holmes E. Evaluation of the orthogonal projection on latent structure model limitations caused by chemical shift variability and improved visualization of biomarker changes in 1H-NMR spectroscopic metabonomic studies. Analytical Chemistry (2005a) 77:517–526.[Medline]
Coen M, Holmes E, Lindon JC, Nicholson JK. NMR-based metabolic profiling and metabonomic approaches to problems in molecular toxicology. Chemical Research in Toxicology (2008) 21:9–27.[CrossRef][Web of Science][Medline]
Cotelle V, Meek SE, Provan F, Milne FC, Morrice N, MacKintosh C. 14–3–3s regulate global cleavage of their diverse binding partners in sugar-starved Arabidopsis cells. EMBO Journal (2000) 19:2869–2876.[CrossRef][Web of Science][Medline]
Coruzzi G, Bush DR. Nitrogen and carbon nutrient and metabolite signaling in plants. Plant Physiology (2001) 125:61–64.
Coruzzi G, Zhou L. Carbon and nitrogen sensing and signaling in plants: emerging matrix effects. Current Opinion in Plant Biology (2001) 4:247–253.[CrossRef][Web of Science][Medline]
Durante C, Cocchi M, Grandi M, Marchetti A, Bro R. Application of N-PLS to gas chromatographic and sensory data of traditional balsamic vinegars of Modena. Chemometrics and Intelligent Laboratory Systems (2006) 83:54–65.[CrossRef][Web of Science]
Dyrby M, Baunsgaard D, Bro R, Engelsen SB. Multi-way chemometric analysis of the metabolic response to toxins monitored by NMR. Chemometrics and Intelligent Laboratory Systems (2005a) 76:79–89.[CrossRef][Web of Science]
Dyrby M, Peterson M, Whittaker AK, Lambert L, Nørgaard L, Bro R, Engelsen SB. Analysis of lipoproteins using 2D diffusion-edited NMR spectroscopy and multi-way chemometrics. Analytica Chimica Acta (2005b) 531:209–216.[CrossRef][Web of Science]
Fan T. Metabolite profiling by one- and two-dimensional NMR analysis of complex mixtures. Progress in Nuclear Magnetic Resonance Spectroscopy (1996) 28:161–219.[CrossRef][Web of Science]
Fischle W, Wang Y, Allis CD. Histone and chromatin cross-talk. Current Opinion in Cell Biology (2003) 15:172–183.[CrossRef][Web of Science][Medline]
Guimet F, Ferreä J, Boqueä R, Vidal M, Garcia J. Excitation-emission fluorescence spectroscopy combined with three-way methods of analysis as a complementary technique for olive oil characterization. Journal of Agricultural and Food Chemistry (2005) 53:9319–9328.[CrossRef][Web of Science][Medline]
Harbour JW, Dean DC. The Rb/E2F pathway: expanding roles and emerging paradigms. Genes and Development (2000) 14:2393–2409.
Jang IC, Pahk YM, Song SI, Kwon HJ, Nahm BH, Kim JK. Structure and expression of the rice class-I type histone deacetylase genes OsHDAC1–3: OsHDAC1 overexpression in transgenic plants leads to increased growth rate and altered architecture. The Plant Journal (2003) 33:531–541.[CrossRef][Web of Science][Medline]
Jenuwein T, Allis CD. Translating the histone code. Science (2001) 293:1074–1080.
Kadonaga JT. Eukaryotic transcription: an interlaced network of transcription factors and chromatin-modifying machines. Cell (1998) 92:307–313.[CrossRef][Web of Science][Medline]
Khaled AS, Vernoud V, Ingram GC, Perez P, Sarda X, Rogowsky PM. Engrailed-ZmOCL1 fusions cause a transient reduction of kernel size in maize. Plant Molecular Biology (2005) 58:123–139.[CrossRef][Web of Science][Medline]
Keun HC, Ebbels TMD, Bollard ME, Beckonert O, Antti H, Holmes E, Lindon JC, Nicholson JK. Geometric Trajectory Analysis of metabolic responses to toxicity can define treatment specific profiles. Chemical Research in Toxicology (2004) 17:579–587.[CrossRef][Web of Science][Medline]
Lam HM, Chiu J, Hsieh MH, Meisel L, Oliveira IC, Shin M, Coruzzi G. Glutamate-receptor genes in plants. Nature (1998) 396:125–126.[CrossRef][Web of Science][Medline]
Le Gall G, Colquhoun IJ, Davis AL, Collins GJ, Verhoeyen ME. Metabolite profiling of tomato (Lycopersicon esculentum) using 1H NMR spectroscopy as a tool to detect potential unintended effects following a genetic modification. Journal of Agricultural and Food Chemistry (2003) 51:2447–2456.[CrossRef][Web of Science][Medline]
Lindon JC, Holmes E, Nicholson JK. Pattern recognition methods and application in biomedical magnetic resonance. Progress in Nuclear Magnetic Resonance Spectroscopy (2001) 39:1–40.[CrossRef][Web of Science]
Loidl P. A plant dialect of the histone language. Trends in Plant Science (2004) 9:84–90.[CrossRef][Web of Science][Medline]
Long JA, Ohno C, Smith ZR, Meyerowitz EM. TOPLESS regulates apical embryonic fate in Arabidopsis. Science (2006) 312:1520–1523.
Lusser A, Kölle D, Loidl P. Histone acetylation: lessons from plant kingdom. Trends in Plant Science (2001) 6:59–65.[CrossRef][Web of Science][Medline]
Manetti C, Bianchetti C, Bizzarri M, et al. NMR-based metabonomics study of transgenic maize. Phytochemistry (2004) 65:3187–3198.[CrossRef][Web of Science][Medline]
Manetti C, Bianchetti C, Casciani L, Castro C, Di Cocco ME, Miccheli A, Motto M, Conti F. A metabonomic study of transgenic maize (Zea mays) seeds revealed variation in osmolytes and branched amino acids. Journal of Experimental Biology (2006) 57:2613–2625.
Miccheli A, Aureli T, Delfini M, Di Cocco ME, Viola P, Gobetto R, Conti F. Study on influence of inactivation enzyme techniques and extraction procedures on cerebral phosphorylated metabolite levels by P-31 NMR spectroscopy. Cellular and Molecular Biology (1988) 34:591–603.[Web of Science][Medline]
Morocz S, Donn G, Nemeth J, Dudits D. An improved system to obtain fertile regenerants via maize protoplasts isolated from a highly embryogenic suspension culture. Theoretical and Applied Genetics (1990) 80:721–726.[Web of Science]
Niazi A, Ghasemi J, Yazdanipour A. PARAFAC decomposition of three-way kinetic-spectrophotometric spectral matrices based on phosphomolybdenum blue complex chemistry for nitrite determination in water and meat samples. Analytical Letters (2005) 38:2377–2392.[CrossRef][Web of Science]
Nicholson JK, Connelly J, Lindon JC, Holmes E. Metabonomics: a platform for studying drug toxicity and gene function. Nature Reviews Drug Discovery (2002) 1:153–161.[CrossRef][Web of Science][Medline]
Pandey R, Muller A, Napoli CA, Selinger DA, Pikaard CS, Richards EJ, Bender J, Mount DW, Jorgensen RA. Analysis of histone acetyltransferase and histone deacetylase families of Arabidopsis thaliana suggests functional diversification of chromatin modification among multicellular eukaryotes. Nucleic Acids Research (2002) 30:5036–5055.
Peterson CL, Laniel M. Histones and histone modifications. Current Biology (2004) 14:R546–R551.[CrossRef][Web of Science][Medline]
Pfluger J, Wagner D. Histone modifications and dynamic regulation of genome accessibility in plants. Current Opinion in Plant Biology (2007) 10:645–652.[CrossRef][Web of Science][Medline]
Povlsen VT, Rinnan Å, van den Berg F, Andersen HJ, Thybo AK. Direct decomposition of NMR relaxation profiles and prediction of sensory attributes of potato samples. LWT-Food Science and Technology (2003) 36:423–432.
Price J, Laxmi A, St Martin SK, Jang JC. Global transcription profiling reveals multiple sugar signal transduction mechanisms in Arabidopsis. The Plant Cell (2004) 16:2128–2150.
Probst AV, Fagard M, Proux F, et al. Arabidopsis histone deacetylase HDA6 is required for maintenance of transcriptional gene silencing and determines nuclear organization of rDNA repeats. The Plant Cell (2004) 16:1021–1034.
Ricciolini R, Miccheli A, Di Cocco ME, Piccolella E, Marino A, Sammartino MP, Conti F. Dexamethasone-dependent modulation of human lymphoblastoid B cell line through sphingosine production. Biochimica et Biophysica Acta (1994) 1221:103–108.[Medline]
Ratcliffe RG. Metabolic aspects of the anoxic response in plant tissue. In: Environment and plant metabolism: flexibily and acclimatation—Smirnoff N, ed. (1995) Oxford, UK: Bios Scientific. 111–189.
Ricoult C, Echeverria HO, Cliquet J-B, Limani AM. Charatecterization of alanine aminotransferase (AlaAT) multigene family and hypoxic response in young seedlinds of the model legume Medicago truncatula. Journal of Experimental Biology (2006) 37:3079–3089.
Rossi V, Hartings H, Motto M. Identification and characterisation of an RPD3 homologue from maize (Zea mays L.) that is able to complement an rpd3 null mutant of Saccharomyces cerevisiae. Molecular and General Genetics (1998) 258:288–296.[CrossRef][Web of Science]
Rossi V, Locatelli S, Lanzanova C, et al. A maize histone deacetylase and retinoblastoma-related protein physically interact and cooperate in repressing gene transcription. Plant Molecular Biology (2003) 51:401–413.[CrossRef][Web of Science][Medline]
Rossi V, Locatelli S, Varotto S, Donn G, Pirona R, Henderson DA, Hartings H, Motto M. Maize histone deacetylase hda101 is involved in plant development, gene transcription, and sequence-specific modulation of histone modification of genes and repeats. The Plant Cell (2007) 19:1145–1162.
Shahbazian MD, Grunstein M. Functions of site-specific histone acetylation and deacetylation. Annual Review of Biochemistry (2007) 76:1–26.[CrossRef][Medline]
Stewart GR, Larher F. Accumulation of amino acids and related compounds in relation to environmental stress. In: The biochemistry of plants—Miflin BJ, ed. (1980) Vol 5. New York, NY: Academic Press. 609–635.
Thiagalingam S, Cheng KH, Lee HJ, Mineva N, Thiagalingam A, Ponte JF. Histone deacetylases: unique players in shaping the epigenetic histone code. Annals of the New York Academy of Science (2003) 983:84–100.[Web of Science][Medline]
Tian L, Chen ZJ. Blocking histone deacetylation in Arabidopsis induces pleiotropic effects on plant gene regulation and development. Proceedings of the National Academy of Sciences, USA (2001) 98:200–205.
Trygg J, Wold S. Orthogonal projection to latent structure. Journal of Chemometrics (2002) 16:283–293.[CrossRef][Web of Science]
Varotto S, Locatelli S, Canova S, Pipal A, Motto M, Rossi V. Expression profile and cellular localization of maize Rpd3-type histone deacetylase during plant development. Plant Physiology (2003) 133:606–617.
Vedavathi M, Girish KS, Kumar MK. Isolation and characterization of cytosolic alanine aminotransferase isoforms from starved rat liver. Molecular and Cellular Biochemistry (2004) 267:13–23.[CrossRef][Web of Science][Medline]
Zhou C, Zhang L, Duan J, Miki B, Wu K. Histone Deacetylase19 is involved in jasmonic acid and ethylene signaling of pathogen response in Arabidopsis. The Plant Cell (2005) 17:1196–1204.
![]()
CiteULike
Connotea
Del.icio.us What's this?
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||







