[ssml] Parsing taxonomy from blast output

Robert Citek rwcitek at alum.calberkeley.org
Fri Apr 1 12:38:35 EST 2005


Hello Goel,

On Friday, Apr 1, 2005, at 10:36 US/Central, Goel, Manisha wrote:
> I need to parse the blast ouput to get the taxonomy information.
> If I could get the taxonomy nodes associted with each gi number .. This
> would also work.

If I understand you correctly, you want to display the taxonomy lineage 
given a gi that you've parsed out of a BLAST report.  Is that right?

For example, let's say one of your blast queries hit gi 1392161.  Since 
this is for a human cDNA clone, its associated taxid would be 9606 and 
therefore its taxonomy lineage would be this:

+--------+------------------------+
| tax_id | name_txt               |
+--------+------------------------+
|      1 | root                   |
| 131567 | cellular organisms     |
|   2759 | Eukaryota              |
|  33154 | Fungi/Metazoa group    |
|  33208 | Metazoa                |
|   6072 | Eumetazoa              |
|  33213 | Bilateria              |
|  33316 | Coelomata              |
|  33511 | Deuterostomia          |
|   7711 | Chordata               |
|  89593 | Craniata               |
|   7742 | Vertebrata             |
|   7776 | Gnathostomata          |
| 117570 | Teleostomi             |
| 117571 | Euteleostomi           |
|   8287 | Sarcopterygii          |
|  32523 | Tetrapoda              |
|  32524 | Amniota                |
|  40674 | Mammalia               |
|  32525 | Theria                 |
|   9347 | Eutheria               |
|   9443 | Primates               |
|   9526 | Catarrhini             |
|   9604 | Hominidae              |
| 207598 | Homo/Pan/Gorilla group |
|   9605 | Homo                   |
|   9606 | Homo sapiens           |
+--------+------------------------+

Is that what you are looking for?

Regards,
- Robert



More information about the ssml-general mailing list