Comprehensive Survey of Processed Pseudogenes in the Human Genome
We have identified ~8000 processed pseudogenes plus ~4000 duplicated pseudogenes in the latest GoldenPath human draft genome.
You can either interatively search an online database, or download the relevant data and texts from this page.
Publications
Results
- online database
 - EBI human proteome set used as TBLASTN query sequences (9.7 MB) names only (0.5 MB)
 - Help on the file format
 - Processed pseudogenes
 
- annotations txt (1.2 MB), excel (3.0 MB), help
 - nucleotide sequences (6.5 MB) help
 - amino acid sequence (4.7 MB) help
 - individual sequence alignment: clustal format (24.4 MB)
 - multiple sequence alignment: clustal format (14.6 MB), phylip format (12.1 MB)
 - multiple sequence alignment, including putative pseudogenes: clustal format (15.6 MB), phylip format (12.9 MB)
 - Processed pseudogenes grouped by chromosomes
 - Occurrences of processed pseudogenes among human proteins
 
- Putative Processed pseudogenes
 
- annotations (0.11 MB) help
 - nucleotide sequences (0.45 MB) help
 - amino acid sequence (0.33 MB) help
 - individual sequence alignment: clustal format (1.7 MB)
 - Putative processed pseudogenes grouped by chromosomes
 - The processed pseudogenes in the mouse genome.
 ![]()
Comments to Zhaolei Zhang