Eukaryotic Pseudogenes

Comprehensive Survey of Processed Pseudogenes in the Human Genome
We have identified ~8000 processed pseudogenes plus ~4000 duplicated pseudogenes in the latest GoldenPath human draft genome.
You can either interatively search an online database, or download the relevant data and texts from this page.

Publications

Zhang et al. Genome Research (2003) in press.

Results

online database

EBI human proteome set used as TBLASTN query sequences (9.7 MB) names only (0.5 MB)

Help on the file format

Processed pseudogenes

annotations txt (1.2 MB), excel (3.0 MB), help

nucleotide sequences (6.5 MB) help

amino acid sequence (4.7 MB) help

individual sequence alignment: clustal format (24.4 MB)

multiple sequence alignment: clustal format (14.6 MB), phylip format (12.1 MB)

multiple sequence alignment, including putative pseudogenes: clustal format (15.6 MB), phylip format (12.9 MB)

Processed pseudogenes grouped by chromosomes

annotations

nucleotide sequences

amino acid sequence

Occurrences of processed pseudogenes among human proteins

txt (0.4 MB) excel (0.8 MB) help

Putative Processed pseudogenes

annotations (0.11 MB) help

nucleotide sequences (0.45 MB) help

amino acid sequence (0.33 MB) help

individual sequence alignment: clustal format (1.7 MB)

Putative processed pseudogenes grouped by chromosomes

annotations

nucleotide sequences

amino acid sequence

The processed pseudogenes in the mouse genome.

Comments to Zhaolei Zhang