You are here Biopharmaceutical/ Genomic glossary homepage/Search > FAQs for biopharmaceutical glossaries

Tips & FAQs for the Biopharmaceutical glossaries
Evolving terminology for emerging technologies
Suggestions? Comments? Questions? Mary Chitty mchitty@healthtech.com
Last revised May 12, 2008


New Page 1

Please register for CHI's Genomics Glossaries & Taxonomies website. This sign-in box with then disappear from each page, if you accept cookies. Use of this site will continue to be free, but better demographic data on who is accessing this material helps us to justify the expense of maintaining this resource. Registration policy has details.

Registered users of the Genomics Glossaries & Taxonomies will automatically be signed up for CHI's complimentary email monthly newsletter, GenomeLink, unless you choose to opt out of receiving it.

Mr.     Ms.     Mrs.     Dr.     Prof.

First:

         

Last:

Title:

Dept.:

Company:

Address:

City:

State:

Zip:

Country:

Email:

Opt-out of Email

YES    NO

Telephone:

Would you like to receive CHI event updates via fax? 
Yes       No 

Fax:


1. What can you do with the search engine?
2. Must a term be found a certain number of times to be included in the glossary? 

3. How do you determine the relative prevalence/ popularity of variant terms? Does this change over time? 
4. Why is this called Glossaries and taxonomies?  What's  a taxonomy anyway?
5. How often do you check for dead links?
6. Why are you missing some very basic terms while you include others which seem even more elementary?
7.  Why don't you have an A-Z section of definitions?
8. Why are there terms without any definitions?
9. Why do you quote so much?
10. Aren't some of the terms in rather odd or idiosyncratic categories?  How do you categorize them, and how consistent are you?
 11. Why don't you link to specific definitions in the glossaries you cross-reference?

12. Why aren't the definitions shorter?
13. What's your budget for this project? 
14. Why are you compiling these glossaries/taxonomies?

15. What happened to the Genomic Glossaries & Taxonomies?


1. What can you do with the search engine?

It does support truncation (use an asterisk *).

It is sensitive to punctuation. In particular, try words and phrases with and without hyphens if you don't retrieve any results.

You can't search with numbers.  You won't find 2D gel electrophoresis (although it is there).  I've tried to put in cross references to two-D however.

It isn't case- sensitive.

It doesn't seem to support searching by phrases. Two words come up with fewer results than either one of the words alone. This implies a Boolean AND function, but I'm not convinced of this.  I don't think there are Boolean OR or NOT functions.  

The site is supposed to be re-indexed at least once every week. I'm still figuring out just what the search engine is doing, and would welcome your observations.  mchitty@healthtech.com  Sometimes I use Google, by adding the words healthtech and glossary to whatever I'm looking for instead of the website search engine. 

See How to look for other unfamiliar  terms if you don't find terms with these tips.  There are always more terms to add, and I'm concentrating on ones which can't be found in the standard (specialized) print dictionaries cited in How to look.

2. Do you require a certain number of times for a term to be found to include it in the glossary? 
No, but I do like to find terms on the web or in print to get a sense of the spelling and context. I find Google incredibly useful for tracking down very recent words. (See the Omes and –Omics glossary) and for any situation where variant usages (and/or spellings) come into play. 

3. How do you determine the relative preferability (or popularity) of variant terms?
Chemoinformatics is one good example.

May 15, 2002
cheminformatics 6,340
chemoinformatics 2,200

On Oct. 16, 2001
cheminformatics had about 5,010 (Google) documents, Alta Vista 3198 results (3198 documents); 
chemoinformatics about 1,640 (Google) and AltaVista 839 results (1560 documents) 
chemiinformatics about 26 (Google) and AltaVista 16 results (19 documents).

Searching AltaVista and Google on July 18, 2000 for variants had  these results
chemoinformatics AltaVista = 1043 word count Google = 684 documents
cheminformatics AltaVista = 63 word count Google = 39  documents
chemiinformatics AltaVista = 13 word count Google = 6 documents 

By May 23, 2001 the relative predominance had shifted (using Google) to  
cheminformatics about 3000
chemoinformatics about 1270
chemiinformatics about 25. 

I changed the title of the Cheminformatics/Chemoinformatics Glossary from Chemoinformatics Glossary in Nov. 2001.  

 Computers & computing, Data & information management, Microarray & protein chips categories and -Omes & -omics glossary now have Google word/ phrase counts for most entries. 

Unless indicated otherwise multi- word phrases are searched in quotation marks, and match the term entry (singular or plural).  Terms with hyphens - for example data mining- gene expression is searched as "data mining" "gene expression".

Example: A Google search on Dec. 15, 2000 showed only about one-third the number of occurrences of  "oncogenomics" as "cancer genomics", with no overlap between the two terms.

Example AltaVista works (with truncation *) better than Google for biochip* "DNA array*" "gene chip*" microarray* and other variants but Google would usually the first search engine I try for a single word (bioinformatics) or a phrase like "functional genomics". Advanced Search in Google now permits limited Boolean "or" searching, but I haven't used this feature much yet. 

As far as I can tell Google still does not permit truncation.) No search engine covers the entire web (or even a majority of it) and several weeks to months are needed before some sites are indexed (particularly noticeable with very recent events. 

4. Why did you change the name to glossaries and taxonomies?  What is a taxonomy anyway?
You can find various definitions for different kinds of taxonomies in  Information management & interpretation

l. I realized I'd envisioned the hierarchical nature of the definitions almost from the beginning, and I realized how helpful additional related terms, narrower terms and broader terms) references could be.  I am still working on adding to these and would welcome your comments on possible additions (or modifications, or deletions).  I am convinced that there is a fundamental reorganization of biology going on at both the molecular and biochemical levels, as we have new technologies, completed genomes and lots of bright and motivated people working on really difficult problems.

Going to a meeting (sponsored by the Special Libraries Association, in Northampton MA April 4, 2001) on Taxonomies gave me a lot to think about (and incorporate into this website).  I gave a talk on taxonomies and information overload for the Pharmaceutical division at the annual Special Libraries Association meeting, June 2002, Los Angeles. Virtual presentation: Information overload, taxonomies and beyond http://www.genomicglossaries.com/presentation/SLA_outline.asp

5. How often do you check for dead links?
Not very often.  This is one process that hasn't scaled for me.  I need a more automated solution which checks not only if the URL has changed, but if the content has.  Some sites which seem to be only temporarily unavailable. I sometimes leave a definition in from a no longer working website until I find another (and as good or better) one.  These are the ones with a URL  without a live link. Please let me know if you find any non- working or incorrect links now.  I'd be very interested in hearing about options for more automated checking for linkrot 

6. Why are you missing some very basic terms while you include others which seem even more elementary?
My choices are somewhat arbitrary at times.  See Basic genetics (& genomics) especially the glossary in the Primer on Molecular Genetics for a good basic glossary and How to look for other unfamiliar terms for some strategies for finding terms I don't include (and email mchitty@healthtech.com to suggest terms you don't find.  Often I just haven't thought about a particular term and would welcome your suggestions.

Some words (like gene or phenotype) are familiar to almost everyone, but are in the process of being redefined by genomics. When I realize this I collect a variety of definitions (and background material) and eventually try to synthesize it and see what I can make of it.  Sometimes I put the research material on the web, sometimes it is only on my computer and in piles in my office.  The definitions get longer before they get shorter.  I don't want to lose something crucial even if it eventually seems redundant, before I understand enough to sort out subtleties.

7.  Why don't you have an A-Z section of definitions?
I did for some months before the glossaries were up on the web, but it was unwieldy and labor- intensive. This website really needs to be in a database instead of FrontPage.  I'd welcome suggestions for implementing, or partnering on this.

I was heartened to hear John Simpson, editor of the Oxford English Dictionary in Boston in the spring of 2000, on tour talking about the web OED version, and asked him if they were using a database. (He had talked about the early days of data conversion into WordPerfect.)  He said they couldn't manage without a very sophisticated one, run by Highwire Press out of Palo Alto (US), and had profited from consultation with people at both MIT and Stanford.

I kept alphabetical and category indexes manually for about a year, but simply couldn't keep up.  Now i really need to switch to a database  

8. Why are there terms without any definitions?
These are definitions I'm working on and haven't had time to get in yet.  E-mail me if you want a work in progress definition.  I probably have something to send, and would be interested to know if a term or concept is in demand.

9. Why do you quote so much?
This is essentially a descriptive glossary, not a prescriptive one.  I'm trying to document just how words and phrases are being used, and to distinguish nuances and differences, particularly if I can identify the influences of  different disciplines.  

I also suspect that there is a fair amount of unrecognized miscommunication as a result of differing interpretations taken for granted by varying disciplines, without these being articulated very often.  

I'd be interested in hearing from people who can give me specific examples.  The classic example is bioinformatics, which has biologists and computer scientists coming at the same data from very different perspectives and assumptions.

10. Aren't some of the terms in rather odd or idiosyncratic categories?  How do you categorize them, and how consistent are you?
Absolutely there are oddities. I move terms around a lot at times, and have trouble remembering where they are. Even the broad categories (applications, informatics, technologies and biology aren't mutually exclusive.  

There is a lot of overlap between what i categorize as applications (genomics, proteomics, etc.) and technologies (which is both instrumentation and other techniques).  Informatics overlays, and is involved with almost all of the categories.  I try for consistency, but don't claim to be anywhere near it.  Please let me know if you think something is wildly out of place.  I do try to define a term only once (except for those included in the overviews) in an effort to not have fragmented differing definitions in too many places.

I've tried to make the different sections topical guidebooks to a field for those less familiar with the general area. - browsable  and scannable (and many sites link to specific sections most relevant to theirs).  I started by circulating paper glossaries before I was ready to put this up on the web and found that nobody wanted to read 15 pages worth at a go.  It was only when I got the handouts down to one (double- sided) page that I started to get feedback.  As glossaries grow I  tend to split them off, but maintain ample cross- references,.

11. Why don't the links to specific definitions in the glossaries always work or even exist?
1. I've been busy trying to catch up with terms I want to input (a considerable backlog).  I'm writing, editing, organizing and typing (and cutting and pasting) as fast as I can.  FrontPage bookmarks seem to disappear at times.  And the please fill out this form seems to interfere with links.  

12. Why aren't the definitions shorter?
I try to keep shortening them, but am also adding more -- and some of them just haven't had a chance to be edited down sufficiently.  Since I am looking for evolving definitions I hesitate to discard anything that looks potentially useful.  But please let me know if you can suggest more succinct wording,  find redundant sections, or have any other comments.

13. What's your budget for this project?
Minimal, except for my time, my company contributes the website and software (and supports some of my time) and a few hundred dollars for print books and subscriptions. (Not everything is on the web.) 

14. Why are you doing this?
I’m having a good time, learning a lot and putting pieces together in a way I couldn't before.  This serves internal uses - it's (to an extent) the basis for  in- house indexing and information retrieval. 

In my job I am always trying to figure out what are the incremental improvements that make life easier for people in the labs or clinics, without missing true paradigm shifts (which come along only occasionally - see new paradigm in the Biotech & pharmaceutical applications overview.). As a librarian I’ve spent a lot of time trying to learning specialized vocabulary in order to be able to ask (reasonably) intelligent questions.

15. What happened to the Genomic Glossaries & Taxonomies? This website has evolved into a Biopharmaceutical Glossary and Taxonomies.  Genomics and proteomics are still part of it.   

Mary Chitty mchitty@healthtech.com
 alternate e-mail mgchitty@msn.com
 

Bibliography

Alpha glossary index
  

Contact | Privacy Statement | Alphabetical Glossary List | Tips & glossary FAQs | Site Map