Analyses of chlorogenic acids and related cinnamic acid derivatives from Nicotiana tabacum tissues with the aid of UPLC-QTOF-MS/MS based on the in-source collision-induced dissociation method

Background Chlorogenic acids (CGAs) are a class of phytochemicals formed as esters between derivatives of cinnamic acid and quinic acid. In plants, accumulation of these compounds has been linked to several physiological responses against various stress factors; however, their biosynthesis differs from one plant to another. Although CGA molecules are structurally simple, their analysis on modern analytical platforms remains challenging. The objective of the study was to compare the profiles of CGAs and related derivatives from differentiated tobacco leaf tissues and undifferentiated cell suspension cultures. Results Using a UHPLC-Q-TOF-MS/MS fingerprinting method based on the in-source collision-induced dissociation (ISCID) approach, a total of 19 different metabolites with a cinnamic acid core moiety were identified. These metabolites were either present in both leaf tissue and cell suspension samples or in only one of the two plant systems. The profile differences point to underlying biochemical similarities or differences between the two systems. Conclusion Using this method, the regio- and geometric-isomer profiles of chlorogenic acids in the two tissue types of Nicotiana tabacum were obtained. The method was also shown to be applicable to the detection of other related molecules containing a cinnamic acid core.


Background
Chlorogenic acids (CGAs) are a family of ester phytochemicals formed between cinnamic acid derivatives and quinic acid. These compounds are present in almost all plants and contribute a significant fraction of the total phenol intake in the daily human diet. Moreover, they possess some notable biomedical or pharmacological properties [1][2][3]. CGAs are phenolic compounds produced through the shikimate and phenylpropanoid pathways [4,5], and have been implicated in responses against both biotic and abiotic stressors [6]. The most common naturally occurring cinnamic acid derivatives reported to be utilized during the biosynthesis of these molecules are p-coumaric, caffeic and ferulic acid, which give rise to p-coumaroylquinic acid (pCoQA), caffeoylquinic acid (CQA) and feruloylquinic acid (FQA), respectively [1,2]. Other acids, such as sinapic acid, are rarely found.
Plants are known to synthesize predominantly the trans-isomers rather than the cis-isomers of CGA compounds. The latter have been reported to form in tissues or extracts previously exposed to UV light, during mechanical processing of coffee, and under the electric field applied during MS data acquisition [7]. However, our latest study on tobacco cells treated with different chemical and biological priming inducers shows that the cis-isomer of 5-CQA was induced or up-regulated from a pre-existing pool when compared to non-treated cells. These results suggest the existence of possible enzymatic machinery responsible for the production of the cis-isomers in plants [8]. From an analytical perspective, CGA molecules are challenging owing to their structural similarity and complexity. As such, scientists still spend a great deal of time developing appropriate methodologies, even though reliable methods for the analysis have been developed in the past [8,9]. Most of the approaches for the analysis of these compounds, however, rely on ion trap MS-based platforms [1,2,10-12]. Due to the limited availability of such instruments, emphasis is being placed on the development of alternative, equivalent methods. The development of the ion trap hierarchical approach has contributed significantly to the detailed analysis of CGA molecules [10][11][12][13][14]. Apart from the structural hierarchy, there exists regio- and geometric-isomerism, which adds yet another dimension of complexity to the analysis of these molecules. Such ion trap-MS methods have enabled discrimination between regio-isomers, but not geometric-isomers, of CGAs. As such, chromatographic methods have also been optimized for proper annotation. The order of elution of the mono-acylated CGAs seems to remain constant on a reverse-phase column [12]. Using this knowledge, other methods based on Q-TOF-MS have been developed [7], albeit in most cases with several difficulties which ultimately affect the accuracy of the metabolite annotations.
To circumvent this problem, we have recently developed a Q-TOF-MS fingerprinting method based on the in-source collision induced dissociation (ISCID) approach for the analysis of chlorogenic acid derivatives [8]. This method has proved to generate very stable and reproducible results compared to previously published data. To substantiate our method, the current study profiles the CGA content of tobacco plant systems (leaf tissue vs. cultured cells) which have been reported to be a good source of a variety of bioactive constituents including CGAs [15].
For plant-related studies, mass production of secondary metabolites can be achieved by using plant cell suspensions. Cultured cells not only provide a cost-effective alternative but are also environmentally friendly. The main advantage of this system is that it can be easily manipulated for biotechnological purposes [16,17]. However, it should be noted that cells in suspension culture differ significantly from mature organ tissue such as leaves: they grow and divide rapidly, do not contain any traces of chlorophyll, and multiply in an aqueous environment containing stimulatory phytohormones [18]. Thus, the response to stress factors of intact plant leaves and of cell suspensions may vary. The evident biochemical differences between cell suspensions and leaf tissues have also been attributed to the different environmental conditions to which they are exposed [19]. Furthermore, it has been reported that the biosynthesis of CGAs is highly dependent on the developmental stage of the tissues [20]. Cell type-associated localization of CGAs during maturation (i.e. tissue development) has also been reported [21].
As such, the aim of the current study was to comprehensively profile the CGA content of both tobacco leaf tissue and cell suspensions. The results of the current study are also expected to contribute to the identification of any underlying biochemical differences with regard to CGA biosynthesis between the two systems.

Plant material
Nicotiana tabacum cv. Samsun cell cultures were grown in a Murashige and Skoog (MS) medium containing 0.25 mg/L 2,4-dichlorophenoxyacetic acid and 0.25 mg/L kinetin (pH 5.8) [22] at room temperature on a shaker at 120 rpm with a light/dark cycle of 12 h/12 h, and low light intensity of 30 μmol/m²/s. Tobacco plants were grown in composted soil under greenhouse conditions: temperature min 10°C and max 22°C, light/dark cycle of 12 h/12 h, and light intensity of 60 μmol/m²/s.

Extraction of metabolites
The cells were harvested by filtration on 55 mm filter paper circles using a vacuum filtration system (Millipore, Billerica, MA, USA) and washed with 20 mL MS medium without vitamins, while tobacco leaves were ground with a mortar and pestle in liquid nitrogen. Two grams (2 g) of each sample were weighed and homogenized in 20 mL (1:10 m/v) 80% methanol using a probe sonicator (Bandelin Sonopuls, Germany) set at 55% power for 4 cycles of 15 s each. The crude extract was centrifuged at 4100 × g for 15 min at room temperature. The supernatant was evaporated to approximately 1 mL using a rotary evaporator set at 55°C before being dried completely in a heating block set at 55°C overnight. The dried residues were reconstituted in 300 μL 50% (v/v) UHPLC-grade methanol in milli-Q water, and filtered through a 0.22 μm nylon filter into glass vials fitted with 500 μL inserts. The filtered extracts were stored at −20°C until analysis. For reproducibility of the results, the experiments consisted of 5 independent biological repeats and each extract was analyzed in duplicate (2 technical repeats).
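The extraction ratio and replicate design described above can be summarized in a short Python sketch. This is only an illustration under stated assumptions: the sample labels, the function names and the run-naming scheme are hypothetical and do not come from the original protocol.

```python
from itertools import product

# Values taken from the text; the labels themselves are hypothetical.
SAMPLE_TYPES = ("leaf", "cell_suspension")  # the two tobacco systems compared
BIOLOGICAL_REPEATS = 5                       # independent biological repeats
TECHNICAL_REPEATS = 2                        # duplicate analyses per extract


def solvent_volume_ml(sample_mass_g, ratio=10):
    """Solvent volume (mL) for a 1:ratio (m/v) extraction of a sample."""
    return sample_mass_g * ratio


def injection_list():
    """Enumerate one hypothetical run name per sample type, repeat and duplicate."""
    return [
        f"{sample}_B{bio}_T{tech}"
        for sample, bio, tech in product(
            SAMPLE_TYPES,
            range(1, BIOLOGICAL_REPEATS + 1),
            range(1, TECHNICAL_REPEATS + 1),
        )
    ]


if __name__ == "__main__":
    print(solvent_volume_ml(2))   # 2 g of sample at 1:10 m/v
    print(len(injection_list()))  # total number of analyses across both systems
```

With 2 g of material the 1:10 (m/v) ratio gives the 20 mL of 80% methanol stated in the text, and the design amounts to 20 analyses in total (2 systems × 5 biological repeats × 2 technical repeats).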

Ultra-high performance liquid chromatography (UPLC)
The extracts were chromatographically analyzed on a UHPLC high-definition quadrupole time-of-flight MS instrument (UPLC-qTOF SYNAPT G1 HDMS system, Waters Corporation, Manchester, UK) fitted with an Acquity HSS T3 column (1.7 μm, 2.1 × 150 mm; Waters Corporation). A binary solvent system consisting of eluent A: 0.1% formic acid in water and eluent B: 0.1% formic acid in acetonitrile (Romil Chemistry, UK) was used. A 20 min gradient method at a constant flow rate of 0.4 mL/min was used for analyte separation, with the following conditions: 2% B over 0-1 min, 2-3% B over 1-3 min, 3-8% B over 3-4 min, 8-12% B over 4-12 min, 12-95% B over 12-13 min, held constant at 95% B over 13-15 min to wash the column, and 95-5% B over 15-16 min. Thereafter, the column was returned to initial conditions at 16 min and allowed to equilibrate for 4 min. Chromatographic separation was monitored using a photodiode array (PDA) detector (Waters Corporation, UK) with a scanning range of 200-500 nm, 1.2 nm bandwidth resolution and a sampling rate of 20 points/s.
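For readers who wish to reproduce the gradient, the programme above can be written as a table of linear segments. The following Python sketch is illustrative only: the function and variable names are hypothetical, and it assumes that "initial conditions" means 2% B held from 16 to 20 min.

```python
# Each segment: (start_min, end_min, %B at start, %B at end), transcribed from the text.
GRADIENT = [
    (0.0, 1.0, 2.0, 2.0),
    (1.0, 3.0, 2.0, 3.0),
    (3.0, 4.0, 3.0, 8.0),
    (4.0, 12.0, 8.0, 12.0),
    (12.0, 13.0, 12.0, 95.0),
    (13.0, 15.0, 95.0, 95.0),   # column wash
    (15.0, 16.0, 95.0, 5.0),
    (16.0, 20.0, 2.0, 2.0),     # assumption: re-equilibration at the initial 2% B
]

FLOW_RATE_ML_MIN = 0.4  # constant flow rate stated in the text


def percent_b(t_min):
    """Return %B at time t_min by linear interpolation within the matching segment."""
    if not 0.0 <= t_min <= GRADIENT[-1][1]:
        raise ValueError("time outside the 0-20 min method")
    for start, end, b_start, b_end in GRADIENT:
        if start <= t_min < end:
            return b_start + (b_end - b_start) * (t_min - start) / (end - start)
    return GRADIENT[-1][3]  # t_min equals the end of the method


def solvent_per_run_ml():
    """Total mobile phase consumed in one run at constant flow."""
    return FLOW_RATE_ML_MIN * GRADIENT[-1][1]


if __name__ == "__main__":
    for t in (0.5, 2.0, 8.0, 14.0, 18.0):
        print(t, percent_b(t))
```

Half-open segments are used so that each time point belongs to exactly one segment; at 16 min the sketch therefore already reports the re-equilibration value.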

Results and discussion
Due to the lack of authentic standards and the low concentration of metabolites, the current study only presents qualitative data. In most metabolite fingerprinting studies, metabolite identification is challenging and requires authentic standards for definitive identification. However, due to the unavailability of most plant standards, we opted for cider and coffee as surrogate extracts for some of the metabolites identified herein [12]. It is also worth mentioning that the introduction of instruments such as Q-TOF-MS has contributed significantly to overcoming this problem because of their ability to achieve mass accuracy below 3 ppm [23]. For all 19 metabolites detected (Figure 1, Table 1), the mass error was below 5 ppm. As previously mentioned, proper annotation of structurally similar isomers of cinnamic acid derivatives is a difficult task [9]. To avoid ambiguity during identification, and using the IUPAC numbering system, we have considered all possible factors ranging from chromatographic separation to mass spectrometric behaviour, as shown in previously published data [10,12]. In order to simplify identification, metabolites harboring common cinnamic acid moieties in their core structure were monitored using a single mass extracted ion chromatogram (XIC). As an example, four peaks with m/z 353, representative of three regio-isomers of CQA, were detected in extracts from both cells and leaves (Figure 2). This enabled proper identification of CGAs based on previously published data, taking into consideration all possible isomers at both the positional and geometrical level [1,2,10-12,23,24]. By monitoring these ions, CGAs and the related derivatives could be positively annotated as discussed below. All mass spectra shown in Figures 3, 4, 5, 6 and 7 were generated at a collision energy of 30 eV and 60 eV cone voltage as described in the Experimental section.
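The mass error quoted above is conventionally expressed in parts per million as (measured − theoretical) / theoretical × 10^6. The Python sketch below illustrates the calculation for the deprotonated ion of a caffeoylquinic acid (C16H18O9, nominal m/z 353). The function names are hypothetical, and the "measured" value is an invented number used purely for illustration, not a result from this study.

```python
# Monoisotopic masses of the most abundant isotopes (u).
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503, "O": 15.99491462}
PROTON = 1.00727647  # mass of a proton (u)


def neutral_mass(formula):
    """Monoisotopic mass of a neutral molecule given as {element: count}."""
    return sum(MONOISOTOPIC[element] * count for element, count in formula.items())


def mz_deprotonated(formula):
    """Theoretical m/z of the singly charged [M-H]- ion."""
    return neutral_mass(formula) - PROTON


def ppm_error(measured_mz, theoretical_mz):
    """Signed mass error in parts per million."""
    return (measured_mz - theoretical_mz) / theoretical_mz * 1e6


if __name__ == "__main__":
    cqa = {"C": 16, "H": 18, "O": 9}   # caffeoylquinic acid
    theoretical = mz_deprotonated(cqa)
    measured = 353.0890                 # hypothetical reading, for illustration only
    print(round(theoretical, 4), round(ppm_error(measured, theoretical), 1))
```

An annotation would pass the 5 ppm criterion used here when the absolute value returned by ppm_error is below 5.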
Characterization of p-coumaroylquinic acids
p-Coumaroylquinic acid has a molecular weight (Mr) of 338, and four peaks at m/z 337 were detected in leaf extracts whilst only three were detected in cells (XIC not shown). These ion peaks, according to the mass spectra (Figure 3), were identified as trans-4-pCoQA (1), trans-5-pCoQA (2), cis-5-pCoQA (3) and cis-4-pCoQA (4). The leaf samples were found to contain 1, 2, 3 and 4, while the cells were found to contain only 1, 2 and 4. The absence of 3 (cis-5-pCoQA) in cells could not be entirely explained but is possibly due to the fact that some molecules are more prone to isomerization than others, a phenomenon which has been observed in other plants but without a supporting explanation [8]. Another important factor to note is that trans- and cis-isomers have very similar fragmentation patterns [12]. In the current study, the mass spectra of the trans-isomers are shown throughout. The notable absence of 3-pCoQA in both leaf and cell extracts represents a very interesting biochemical phenotype. This molecule has been found to accumulate in other plants such as legume forages and birch trees [25][26][27].
both leaf and cell suspension samples. Another interesting observation was that cis-5-CQA (9) appeared at a relatively high intensity compared to cis-3-CQA in both the leaf and cell samples. The same was also noted in our previous study, where the same molecule was shown to be induced by activators of plant defence and priming responses [8]. Differences in the metabolite profiles (including CGAs and phenolic content) between cells and leaf tissue have been previously reported [21,29]. However, to the best of our knowledge, this is the first report focusing on the differences between cells and leaf tissue samples that takes into account both regio- and geometric isomerism (cis and trans configurations) of CGAs. Together with our previous work [8], these observations suggest that the cis-5-CQA molecule could be a natural product of tobacco plant systems, pointing to an interesting biochemical phenotype which is not fully explained in other plant species.

Characterization of feruloylquinic acids
Feruloylquinic acids have an Mr of 368. As with pCoQA and CQA, molecules harboring ferulic acid moieties were also identified (Figure 5). However, only three peaks were successfully identified, as trans-5-FQA (10), cis-5-FQA (11) and trans-4-FQA (12), respectively (Table 1). As with pCoQA, the 3-acyl molecule was absent in both cell and leaf extracts. All three FQAs were identified in leaf tissue; however, only trans-5-FQA (10) and cis-5-FQA (11) were identified in cells, suggesting an underlying biochemical difference between the two systems, possibly due to enzymes differing in their substrate specificities.
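The presence/absence pattern reported for the pCoQA and FQA isomers can be expressed as a simple set comparison. The Python sketch below encodes only the compound numbers stated in the text for these two classes; the dictionary layout and function name are hypothetical.

```python
# Compound numbers as reported in the text for each plant system.
LEAF = {"pCoQA": {1, 2, 3, 4}, "FQA": {10, 11, 12}}
CELLS = {"pCoQA": {1, 2, 4}, "FQA": {10, 11}}


def compare_profiles(leaf, cells):
    """Split each compound class into shared, leaf-only and cell-only members."""
    return {
        compound_class: {
            "shared": sorted(leaf[compound_class] & cells[compound_class]),
            "leaf_only": sorted(leaf[compound_class] - cells[compound_class]),
            "cells_only": sorted(cells[compound_class] - leaf[compound_class]),
        }
        for compound_class in leaf
    }


if __name__ == "__main__":
    for compound_class, groups in compare_profiles(LEAF, CELLS).items():
        print(compound_class, groups)
```

For these two classes the comparison returns cis-5-pCoQA (3) and trans-4-FQA (12) as the only leaf-specific members, with no cell-specific members.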

Characterization of caffeoylglycoside and feruloylglycoside
Caffeoylglycoside and feruloylglycoside have molecular weights of 342 and 356, respectively. Two molecules with pseudomolecular peaks at m/z 341 (Figure 6A) were identified as isomers of caffeoylglycoside (13)  The identification of these molecules was also found to be consistent with published data [9]. Interestingly, this molecule was only identified in cells and not in leaf samples. The detection of feruloylglycoside molecules only in cell suspensions again suggests a very interesting biochemical characteristic of cultured cells in suspension that is absent from leaf tissue.

Characterization of di-caffeoylquinic acid and caffeoylquinic acid glucoside
Both di-caffeoylquinic acid and caffeoylquinic acid glucoside have an Mr of 516. Here, a maximum of nine peaks at m/z 515 were identified and, based on the accurate masses and fragmentation patterns, these ions were distinguished as either di-caffeoylquinic acids or caffeoylquinic acid glycosides (Figure 7). As previously reported, both di-CQA and CQA-glycoside produce an isobaric pseudomolecular ion at m/z 515. As such, molecules 15-19 were annotated as either di-CQA or CQA glycosides. Based on the accurate mass, the diCQA were detected with an average m/z of 515.1463 (C 25 [30,31] reported the hierarchical fragmentation scheme of similar molecules; briefly, it was noted that CQA forms a glycoside through an ether bond at either C-3 or C-4 on the aromatic caffeoyl ring. During MS fragmentation, these molecules give rise to ions at m/z 341, which predominate in both cases; however, a peak at m/z 323 is characteristic of glucosyl attachment at C-3 [30]. Thus, molecule 18 was putatively identified as 3-O-(4'-O-caffeoyl glucosyl) quinic acid, since it also produced fragmentation similar to that of 3-CQA. In turn, 5-O-(3'-O-caffeoyl glucosyl) quinic acid (19) produced fragmentation similar to that of 5-CQA (Figure 8). Interestingly, the CQA glycosides were only present in leaf tissue samples. A previous report also showed the accumulation of similar glycosides in Moringa leaves [32] and, recently, in Lonicera leaves [30].
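The diagnostic-ion reasoning above (m/z 341 present in both glycoside types, m/z 323 indicative of glucosyl attachment at C-3) can be sketched as a simple peak-list check. The Python example below is a hypothetical illustration: the function names, the 0.5 Da matching tolerance, the 5% relative intensity threshold and both peak lists are assumptions made for demonstration and are not taken from the study.

```python
def has_ion(peaks, target_mz, tol=0.5, min_rel_intensity=5.0):
    """True if a peak within tol of target_mz reaches the relative intensity threshold.

    peaks is a list of (m/z, intensity) pairs; the tolerance and threshold
    defaults are illustrative assumptions.
    """
    if not peaks:
        return False
    base_peak = max(intensity for _, intensity in peaks)
    return any(
        abs(mz - target_mz) <= tol and 100.0 * intensity / base_peak >= min_rel_intensity
        for mz, intensity in peaks
    )


def suggests_c3_glucosyl(peaks):
    """Both m/z 341 and the diagnostic m/z 323 ion must be present."""
    return has_ion(peaks, 341.0) and has_ion(peaks, 323.0)


if __name__ == "__main__":
    # Invented peak lists for illustration only.
    spectrum_a = [(515.14, 40.0), (341.09, 100.0), (323.08, 30.0), (191.06, 20.0)]
    spectrum_b = [(515.14, 40.0), (341.09, 100.0), (179.03, 25.0)]
    print(suggests_c3_glucosyl(spectrum_a), suggests_c3_glucosyl(spectrum_b))
```

Such a check only flags candidate spectra; as in the text, a putative assignment would still rest on comparison with the fragmentation of the corresponding CQA regio-isomer.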

Conclusion
Using the described UHPLC-Q-TOF-MS/MS fingerprinting method, based on the ISCID approach, a total of 19 different metabolites with a cinnamic acid core moiety were identified. Chlorogenic acids and related compounds are important because their involvement in plant defence responses is becoming apparent [8,33,34]. In a separate study, the distribution of these molecules was found to differ across the different parts of the plant [35], which indicates that there could be a localised function associated with a particular content and distribution. Similarly, the results of the current study indicate that the CGA profiles of tobacco leaf tissue and cell suspensions differ markedly. Possibly, the differences stem from different biochemical pathways leading to the biosynthesis of CGA molecules in the two biological systems. As already mentioned, the biochemical differences between cells and leaf tissue could be a result of the different environmental conditions to which the cells and the leaves are exposed, as well as the level of tissue differentiation. This finding is in agreement with other published data demonstrating that the CGA content varies with plant developmental stage. Therefore, there is a need to further investigate the underlying biochemical differences by studying the same plant systems at the transcriptomic and proteomic levels in conjunction with MS-based metabolite profiling.