Table Name
|
Size (kb),
Format
|
Links
|
Fields
(keys bold)
|
Description
|
tm segs
|
5 k, tab delim.
|
data,
head
|
id_, start_I, stop_n, energy_f
|
Transmembrane segments.
(version 2, revised 971113)
|
tm histo
|
1 k, tab delim.
|
data,
head
|
ntm_I, prots_n
|
Histogram of frequency of transmembrane segments.
|
signal segs
|
1 k, tab delim.
|
data,
head
|
id_, start_I, stop_n
|
Signal sequences.
|
seq MBY pdb MBY lcl MBY tms MBY lnk STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq_MBY_pdb_MBY_lcl_MBY_tms with
the mask linkers to generate the masked fasta file seq_MBY_pdb_MBY_lcl_MBY_tms_MBY_lnk.
|
seq MBY pdb MBY lcl MBY tms MBY lnk COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq_MBY_pdb_MBY_lcl_MBY_tms with
the mask linkers to generate the masked fasta file seq_MBY_pdb_MBY_lcl_MBY_tms_MBY_lnk.
|
seq MBY pdb MBY lcl MBY tms MBY lnk
|
202 k, fasta
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq_MBY_pdb_MBY_lcl_MBY_tms
with the mask linkers
|
id ntm
|
14 k, tab delim.
|
data,
head
|
id_, signalp, ntm_n
|
This table contains the number of transmembrane segments for each ORF.
Its definition of TM-segment is after filtering.
It also has signal sequence data, based on simple criteria.
|
all masks
|
5 k, tab delim.
|
data,
head
|
gid_, start_I, stop_n, tool_, score
|
This file concatenates the results of
creating all the masks for genome PS.
|
aa freq histo
|
1 k, tab delim.
|
data,
head
|
aa_, freq_n
|
Histogram of frequency of the various amino acids
|
aafreq histo
|
1 k, tab delim.
|
data,
head
|
aa_, freq_n
|
Histogram of frequency of the various amino acids
|
alla segs
|
2 k, tab delim.
|
data,
head
|
id_, start_I, stop_n
|
all-a segments
|
allb segs
|
1 k, tab delim.
|
data,
head
|
id_, start_I, stop_n
|
all-b segments
|
characterized domains
|
6 k, tab delim.
|
data,
head
|
id_, start_I, stop_n
|
Already characterized domains (the borders between
linker regions).
|
full len segs
|
16 k, tab delim.
|
data,
head
|
id_, start_I, stop_n
|
Full length segments.
|
gorss
|
201 k, fasta
|
data,
head
|
gid_, gorss
|
This fasta file is the result of running GOR sec. struc. prediction
on the genome PS
|
gorss MBY nul
|
201 k, fasta
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking gorss
with the mask full_len_segs
|
gorss MBY nul COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file gorss with
the mask full_len_segs to generate the masked fasta file gorss_MBY_nul.
|
gorss MBY nul STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file gorss with
the mask full_len_segs to generate the masked fasta file gorss_MBY_nul.
MASKED_CHARS = number of characters masked with the application of this mask.
Masked_Seqs = number of sequences masked with the application of this mask.
Masking_Segs = number of segments used in the application of the mask
|
gorss MBY ucd
|
201 k, fasta
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking gorss
with the mask unchar_domains
|
gorss MBY ucd COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file gorss with
the mask unchar_domains to generate the masked fasta file gorss_MBY_ucd.
|
gorss MBY ucd STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file gorss with
the mask unchar_domains to generate the masked fasta file gorss_MBY_ucd.
MASKED_CHARS = number of characters masked with the application of this mask.
Masked_Seqs = number of sequences masked with the application of this mask.
Masking_Segs = number of segments used in the application of the mask
|
id ntm nofilt
|
14 k, tab delim.
|
data,
head
|
id_, signalp, ntm_n
|
This table contains data on whether there is a signal sequence
and the number of transmembrane segments.
(version 2, revised 971113).
(Renamed table on 980101: id_ntm --> id_ntm_nofilt)
|
linkers
|
6 k, tab delim.
|
data,
head
|
id_, start_I, stop_n
|
Linker regions between two other defined segments,
which are less in length than 50
|
low complexity long
|
4 k, tab delim.
|
data,
head
|
id_, start_I, stop_n, cplxity_f
|
Low complexity regions generated with the
following seg command: seg/seg tmp.fa 45 3.4 3.75 -l
|
low complexity short
|
7 k, tab delim.
|
data,
head
|
id_, start_I, stop_n, cplxity_f
|
Low complexity regions generated with the
following seg command: seg/seg tmp.fa 25 3.0 3.3 -l
|
null mask
|
1 k, tab delim.
|
data,
head
|
|
|
pdb40d135 soluble matches
|
1 k, tab delim.
|
data,
head
|
|
|
seq
|
202 k, Hidden
|
data,
head
|
-
|
-
|
seq MBY cdo
|
1 k, Bad!
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq
with the mask characterized_domains
|
seq MBY cdo COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq with
the mask characterized_domains to generate the masked fasta file seq_MBY_cdo.
|
seq MBY cdo STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq with
the mask characterized_domains to generate the masked fasta file seq_MBY_cdo.
|
seq MBY lcl
|
202 k, fasta
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq
with the mask low_complexity_long
|
seq MBY lcl COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq with
the mask low_complexity_long to generate the masked fasta file seq_MBY_lcl.
|
seq MBY lcl STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq with
the mask low_complexity_long to generate the masked fasta file seq_MBY_lcl.
MASKED_CHARS = number of characters masked with the application of this mask.
Masked_Seqs = number of sequences masked with the application of this mask.
Masking_Segs = number of segments used in the application of the mask
|
seq MBY lcs
|
8 k, Bad!
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq
with the mask low_complexity_short
|
seq MBY lcs COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq with
the mask low_complexity_short to generate the masked fasta file seq_MBY_lcs.
|
seq MBY lcs STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq with
the mask low_complexity_short to generate the masked fasta file seq_MBY_lcs.
|
seq MBY lnk
|
1 k, Bad!
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq
with the mask linkers
|
seq MBY lnk COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq with
the mask linkers to generate the masked fasta file seq_MBY_lnk.
|
seq MBY lnk STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq with
the mask linkers to generate the masked fasta file seq_MBY_lnk.
|
seq MBY nul
|
202 k, fasta
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq
with the mask full_len_segs
|
seq MBY nul COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq with
the mask full_len_segs to generate the masked fasta file seq_MBY_nul.
|
seq MBY nul STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq with
the mask full_len_segs to generate the masked fasta file seq_MBY_nul.
MASKED_CHARS = number of characters masked with the application of this mask.
Masked_Seqs = number of sequences masked with the application of this mask.
Masking_Segs = number of segments used in the application of the mask
|
seq MBY pdb
|
11 k, Bad!
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq
with the mask pdb40d135_soluble_matches
|
seq MBY pdb COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq with
the mask pdb40d135_soluble_matches to generate the masked fasta file seq_MBY_pdb.
|
seq MBY pdb MBY lcl
|
10 k, Bad!
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq_MBY_pdb
with the mask low_complexity_long
|
seq MBY pdb MBY lcl COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq_MBY_pdb with
the mask low_complexity_long to generate the masked fasta file seq_MBY_pdb_MBY_lcl.
|
seq MBY pdb MBY lcl MBY lcs
|
9 k, Bad!
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq_MBY_pdb_MBY_lcl
with the mask low_complexity_long
|
seq MBY pdb MBY lcl MBY lcs COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq_MBY_pdb_MBY_lcl with
the mask low_complexity_long to generate the masked fasta file seq_MBY_pdb_MBY_lcl_MBY_lcs.
|
seq MBY pdb MBY lcl MBY lcs MBY tms
|
6 k, Bad!
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq_MBY_pdb_MBY_lcl_MBY_lcs
with the mask tm_segs
|
seq MBY pdb MBY lcl MBY lcs MBY tms COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq_MBY_pdb_MBY_lcl_MBY_lcs with
the mask tm_segs to generate the masked fasta file seq_MBY_pdb_MBY_lcl_MBY_lcs_MBY_tms.
|
seq MBY pdb MBY lcl MBY lcs MBY tms STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq_MBY_pdb_MBY_lcl_MBY_lcs with
the mask tm_segs to generate the masked fasta file seq_MBY_pdb_MBY_lcl_MBY_lcs_MBY_tms.
|
seq MBY pdb MBY lcl MBY lcs STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq_MBY_pdb_MBY_lcl with
the mask low_complexity_long to generate the masked fasta file seq_MBY_pdb_MBY_lcl_MBY_lcs.
|
seq MBY pdb MBY lcl MBY tms
|
2 k, Bad!
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq_MBY_pdb_MBY_lcl
with the mask tm_segs
|
seq MBY pdb MBY lcl MBY tms COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq_MBY_pdb_MBY_lcl with
the mask tm_segs to generate the masked fasta file seq_MBY_pdb_MBY_lcl_MBY_tms.
|
seq MBY pdb MBY lcl MBY tms MBY lnk MBY alp
|
202 k, fasta
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq_MBY_pdb_MBY_lcl_MBY_tms_MBY_lnk
with the mask alla_segs
|
seq MBY pdb MBY lcl MBY tms MBY lnk MBY alp COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq_MBY_pdb_MBY_lcl_MBY_tms_MBY_lnk with
the mask alla_segs to generate the masked fasta file seq_MBY_pdb_MBY_lcl_MBY_tms_MBY_lnk_MBY_alp.
|
seq MBY pdb MBY lcl MBY tms MBY lnk MBY alp MBY bet
|
202 k, fasta
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq_MBY_pdb_MBY_lcl_MBY_tms_MBY_lnk_MBY_alp
with the mask allb_segs
|
seq MBY pdb MBY lcl MBY tms MBY lnk MBY alp MBY bet COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq_MBY_pdb_MBY_lcl_MBY_tms_MBY_lnk_MBY_alp with
the mask allb_segs to generate the masked fasta file seq_MBY_pdb_MBY_lcl_MBY_tms_MBY_lnk_MBY_alp_MBY_bet.
|
seq MBY pdb MBY lcl MBY tms MBY lnk MBY alp MBY bet STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq_MBY_pdb_MBY_lcl_MBY_tms_MBY_lnk_MBY_alp with
the mask allb_segs to generate the masked fasta file seq_MBY_pdb_MBY_lcl_MBY_tms_MBY_lnk_MBY_alp_MBY_bet.
MASKED_CHARS = number of characters masked with the application of this mask.
Masked_Seqs = number of sequences masked with the application of this mask.
Masking_Segs = number of segments used in the application of the mask
|
seq MBY pdb MBY lcl MBY tms MBY lnk MBY alp STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq_MBY_pdb_MBY_lcl_MBY_tms_MBY_lnk with
the mask alla_segs to generate the masked fasta file seq_MBY_pdb_MBY_lcl_MBY_tms_MBY_lnk_MBY_alp.
MASKED_CHARS = number of characters masked with the application of this mask.
Masked_Seqs = number of sequences masked with the application of this mask.
Masking_Segs = number of segments used in the application of the mask
|
seq MBY pdb MBY lcl MBY tms STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq_MBY_pdb_MBY_lcl with
the mask tm_segs to generate the masked fasta file seq_MBY_pdb_MBY_lcl_MBY_tms.
|
seq MBY pdb MBY lcl STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq_MBY_pdb with
the mask low_complexity_long to generate the masked fasta file seq_MBY_pdb_MBY_lcl.
|
seq MBY pdb MBY tms
|
2 k, Bad!
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq_MBY_pdb
with the mask tm_segs
|
seq MBY pdb MBY tms COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq_MBY_pdb with
the mask tm_segs to generate the masked fasta file seq_MBY_pdb_MBY_tms.
|
seq MBY pdb MBY tms STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq_MBY_pdb with
the mask tm_segs to generate the masked fasta file seq_MBY_pdb_MBY_tms.
|
seq MBY pdb STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq with
the mask pdb40d135_soluble_matches to generate the masked fasta file seq_MBY_pdb.
|
seq MBY tms
|
202 k, fasta
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq
with the mask tm_segs
|
seq MBY tms COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq with
the mask tm_segs to generate the masked fasta file seq_MBY_tms.
|
seq MBY tms STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq with
the mask tm_segs to generate the masked fasta file seq_MBY_tms.
MASKED_CHARS = number of characters masked with the application of this mask.
Masked_Seqs = number of sequences masked with the application of this mask.
Masking_Segs = number of segments used in the application of the mask
|
seq MBY ucd
|
1 k, Bad!
|
data,
head
|
gid_, masked_seq
|
This fasta file is the result of masking seq
with the mask unchar_domains
|
seq MBY ucd COMP
|
1 k, tab delim.
|
data,
head
|
aa_, count_n
|
This is the aa composition of the
masked file from masking the fasta file seq with
the mask unchar_domains to generate the masked fasta file seq_MBY_ucd.
|
seq MBY ucd STAT
|
1 k, tab delim.
|
data,
head
|
stat_, value
|
This are the statistics from masking the fasta file seq with
the mask unchar_domains to generate the masked fasta file seq_MBY_ucd.
|
seq lengths
|
13 k, tab delim.
|
data,
head
|
gid_, length_n
|
Length of each sequence in genome.
|
tm segs filtered
|
4 k, tab delim.
|
data,
head
|
id_, start_I, stop_n, energy_f
|
Transmembrane segment definitions after removing pdb matches and (most
importantly) low-complexity regions. The tm_segs table is just
the raw data.
This is based on looking at the masked the file seq_MBY_pdb_MBY_lcl_MBY_tms_MBY_lnk for the TM
segments (annotated with a 3).
|
unchar domains
|
15 k, tab delim.
|
data,
head
|
id_, start_I, stop_n
|
Linker regions between two other defined segments,
which are greater in length than 50
That is, these are uncharacterized protein domains.
|