Comprehensive Survey of Processed Pseudogenes in the Human Genome
We have identified ~8000 processed pseudogenes plus ~4000 duplicated pseudogenes in the latest GoldenPath human draft genome.
You can either interatively search an online database, or download the relevant data and texts from this page.
Publications
Results
- online database
- EBI human proteome set used as TBLASTN query sequences (9.7 MB) names only (0.5 MB)
- Help on the file format
- Processed pseudogenes
- annotations txt (1.2 MB), excel (3.0 MB), help
- nucleotide sequences (6.5 MB) help
- amino acid sequence (4.7 MB) help
- individual sequence alignment: clustal format (24.4 MB)
- multiple sequence alignment: clustal format (14.6 MB), phylip format (12.1 MB)
- multiple sequence alignment, including putative pseudogenes: clustal format (15.6 MB), phylip format (12.9 MB)
- Processed pseudogenes grouped by chromosomes
- Occurrences of processed pseudogenes among human proteins
- Putative Processed pseudogenes
- annotations (0.11 MB) help
- nucleotide sequences (0.45 MB) help
- amino acid sequence (0.33 MB) help
- individual sequence alignment: clustal format (1.7 MB)
- Putative processed pseudogenes grouped by chromosomes
- The processed pseudogenes in the mouse genome.
![]()
Comments to Zhaolei Zhang