Advertisement

Structure ID of unknowns (small molecule, metabolomics)

Discussions about GC-MS, LC-MS, LC-FTIR, and other "coupled" analytical techniques.

5 posts Page 1 of 1
So frustrated, got nowhere after a few days of thinking, database searching, and literature reading… Please help!

Background:
Biomarker discovery from wild type mice serum sample by LC-MS/MS analysis.

Here is my approach

1. Obtain full scan accurate mass on Q-TOF (Agilent 6510), mass accuracy < 2 ppm;
2. Use Agilent “Qualitative Analysisâ€

David Sparkman will be running a 6-week course on MS Interpretation starting in a couple of weeks:
http://www.sepsci.com/chromforum/viewtopic.php?t=7540
-- Tom Jupille
LC Resources / Separation Science Associates
tjupille@lcresources.com
+ 1 (925) 297-5374

A course will help but won't offer you an instant solution for your problem, interpreting a complicated MS requires solid backgroud in organic/organic synthesis.

I would suggest you to play with the masses a liitle bit to see if you can get any hit. For example, subsctract a H20 or CH3 or other possible functional group to your M (M+n), them see what happens. Moreover, since you have the accurate MS capability, simply search a few major daughter ions (e.g. D1+D2+D3=M), to find out what possible structure could be. if you are luck enough, you may hit some new compounds/metabolites.

This reminds me our previous topic about API 4000 Q-trap. If you have that capacity, a 3rd MS may give you some information about the major MS2 daughter ions (using MS2-MS3 as regular MS-MS for datebase serach).
The issue you raise, namely, the difficulty of getting an unequivocal structure ID of a "total unknown" in a complex mixture solely by chromatography-based mass spectrometry is far more difficult than most practicing scientists realize.

Even with accurate mass combined with MS^n, getting a structure with just chrom+MS alone is very challenging. For example, consider that a single elemental formula, C15 H22 O2, which has a nominal mass of 234 Da, has over 100 billion (yes, > 10^11) valid chemical structures.

I have performed mass spectrometry based structure elucidation in a number of roles, which have included working for a large pharmaceutical company supporting medicinal chemistry, as well as drug metabolism and pharmacokinetics. In addition I have also performed structure elucidation for metabolomics studies.

Of these cases, the metabolomics work presented the greatest challenge for the following reason. In my pharma role, I usually had some sort of information beyond just the mass spec data about the structure I was being asked to elucidate. For example, in the case of synthesis, the chemist knew what starting materials were present in the reaction, as well as the expected products. Often, they could produce different analogues.

In the case of drug metabolism, I always had the parent compound, and its fragmentation pattern combined with a knowledge of "typical" biotransformations facilitated ID of the metabolites.

Now, for metabolomics, the number of possible structures is far less constrained than in my previous roles. Having a comprehensive library of compounds will give you a good start.

You have searched a good collection of databases, but I recommend that you try ChemSpider (http://www.chemspider.com/). It's free, and they currently index over 20 million compounds, including all of KEGG and HMDB just to name a few.

Recognize that many endogenous biochemicals can undergo the same transformations as xenobiotics (e.g. oxidations, sulfations, glucuronidations, phosphorylations, glutathione conjugations, etc), and that many of these conjugated forms will not be listed in databases (yet).

Here's the typical process I follow when trying to elucidate a "total" unknown.

1) Acquire accurate mass, MS^n data for at least n >=3.

2) Use accurate mass + isotopic information to reduce number of possible empirical formulas (Oliver Fiehn's "Seven Golden Rules" paper is a good reference).

3) Using the formula(s), search ChemSpider. Be aware that in LC/MS, not all compounds ionize via protonation. For example, carnitines and cholines, which contain a trimethyl-ammonium functionality, show up as the M+ species, not M+H like many others. Checking for Ring/Double/Triple bond equivalents is useful.

4) Frequently, you won't get any hits, so as one previous poster stated, look in the primary MS for common losses. For example glucuronides (-176), sulfate/phosphate (-80, but distinguishiable with accurate mass), acetic acid (-60), formic acid (-46), methanol (-32), etc all give clues to the structure of the compound. Also, sometimes, a match for the parent compound won't be found, but you will get a match for one of these decomposed species.

All of the vendors of accurate mass instrumentation have greatly oversold the capability of mass spectrometry for performing total unknown eludication. However, occasionally, you can identify a potential structure for an unknown, but, in the absence of some other information besides the mass spec data, in most cases, you'll be left with many possibilities.

Good Luck!

Thank you all for the inputs!

I will try searching with different MS, which considered the possible losses, and the chemspider databases.

I know structure ID of total unknown is very challenging, but I thought if I work hard enough, I should be able to narrow it down to what group of a certain biomarker belong to, although locate the position a double bond or -OH on the ring might be impossible. However, I am often get lost when facing a situation like: observe M-176 from the MSMS spectrum, but the generated molecular formula only has 4 oxygens? Things are really complicated...Anyway, I will keep trying, and thanks again!

JWL
5 posts Page 1 of 1

Who is online

In total there are 230 users online :: 0 registered, 0 hidden and 230 guests (based on users active over the past 5 minutes)
Most users ever online was 10230 on Thu Dec 04, 2025 5:56 am

Users browsing this forum: No registered users and 230 guests

Latest Blog Posts from Separation Science

Separation Science offers free learning from the experts covering methods, applications, webinars, eSeminars, videos, tutorials for users of liquid chromatography, gas chromatography, mass spectrometry, sample preparation and related analytical techniques.

Subscribe to our eNewsletter with daily, weekly or monthly updates: Food & Beverage, Environmental, (Bio)Pharmaceutical, Bioclinical, Liquid Chromatography, Gas Chromatography and Mass Spectrometry.

Liquid Chromatography

Gas Chromatography

Mass Spectrometry