CuGenDBv2

Tan0004571 (gene) of Snake gourd v1 genome

Gene ID: Tan0004571
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG09:10529030..10529251
RNA-Seq Expression: Tan0004571
Synteny: Tan0004571
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]; e value 2.5e-19; identity 72.6%
Query:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL
        M SLQ MFGQ S  +++DALKY+YN+RM E  SVREHV NMMVHFNVAE+NG VIDE SQVSFI+ESL +SFL
Subjt:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL

KAA0050670.1 gag/pol protein [Cucumis melo var. makuwa]; e value 1.5e-19; identity 72.6%
Query:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL
        M SLQ MFGQ S  +++DAL Y+YN+RM E  SVREHV NMMVHFNVAE+NG VIDE SQVSFI+ESLL+SFL
Subjt:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL

XP_022155999.1 uncharacterized protein LOC111022974 [Momordica charantia]; e value 6.7e-20; identity 72.6%
Query:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL
        M SL+ MFGQPS   R++ALK++YNSRMKE TS++EHV N+MVHFNVAE+NG VIDE SQVSFI+ESL KSFL
Subjt:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL

XP_022158197.1 uncharacterized protein LOC111024734 [Momordica charantia]; e value 3.0e-20; identity 73.97%
Query:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL
        M SLQ+MFGQPS   R++ALK+VYNSRMKE +SVREHV N+MVHFNVAE NG+VIDE SQ SFI+ESL K+FL
Subjt:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL

XP_038904417.1 uncharacterized protein LOC120090784 [Benincasa hispida]; e value 8.7e-20; identity 79.71%
Query:  QAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL
        + MFGQ SSSVR+DALKYV+NSRMKE  SVREHV +MMV+FN+ EVNGVVIDE SQVSFIMESL KSFL
Subjt:  QAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL
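
The % identity values in this table can be reproduced from the alignment match line. The sketch below is a minimal illustration, assuming the usual BLAST convention (letters mark identical residues, `+` marks conservative substitutions, blanks mark mismatches) and assuming the reported value is identical positions divided by alignment length for an ungapped alignment; the page itself does not state how the figure is computed. The function name `percent_identity` is illustrative.

```python
# Match line copied from the XP_038904417.1 alignment above.
MIDLINE = "+ MFGQ SSSVR+DALKYV+NSRMKE  SVREHV +MMV+FN+ EVNGVVIDE SQVSFIMESL KSFL"


def percent_identity(midline):
    """Identical positions (letters in the match line) over alignment length.

    Assumes an ungapped alignment whose match line spans the full
    aligned region, so len(midline) equals the alignment length.
    """
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(midline)


print(round(percent_identity(MIDLINE), 2))
```

The same function can be applied to the match line of any other hit in the table, provided the line is copied with its internal spacing intact.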

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein; e value 1.2e-19; identity 72.6%
Query:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL
        M SLQ MFGQ S  +++DALKY+YN+RM E  SVREHV NMMVHFNVAE+NG VIDE SQVSFI+ESL +SFL
Subjt:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL

A0A5A7U676 Gag/pol protein; e value 7.2e-20; identity 72.6%
Query:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL
        M SLQ MFGQ S  +++DAL Y+YN+RM E  SVREHV NMMVHFNVAE+NG VIDE SQVSFI+ESLL+SFL
Subjt:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL

A0A5D3CPJ6 Gag/pol protein; e value 1.2e-19; identity 72.6%
Query:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL
        M SLQ MFGQ S  +++DALKY+YN+RM E  SVREHV NMMVHFNVAE+NG VIDE SQVSFI+ESL +SFL
Subjt:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL

A0A6J1DRZ2 uncharacterized protein LOC111022974; e value 3.2e-20; identity 72.6%
Query:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL
        M SL+ MFGQPS   R++ALK++YNSRMKE TS++EHV N+MVHFNVAE+NG VIDE SQVSFI+ESL KSFL
Subjt:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL

A0A6J1DWL0 uncharacterized protein LOC111024734; e value 1.4e-20; identity 73.97%
Query:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL
        M SLQ+MFGQPS   R++ALK+VYNSRMKE +SVREHV N+MVHFNVAE NG+VIDE SQ SFI+ESL K+FL
Subjt:  MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL

SwissProt top hits: No hits found
Arabidopsis top hits: No hits found

Sequences
CDS sequence
ATGGGATCATTACAGGCCATGTTTGGACAACCGTCCTCATCGGTCCGTTATGATGCTCTCAAGTACGTTTACAACTCTCGAATGAAGGAGGAAACTTCTGTTAGGGAGCA
TGTCTTTAATATGATGGTCCACTTCAACGTGGCAGAGGTAAACGGGGTTGTCATAGATGAGAACAGTCAGGTCAGCTTTATAATGGAATCTCTTCTGAAGAGTTTCCTGT
AG
mRNA sequence
ATGGGATCATTACAGGCCATGTTTGGACAACCGTCCTCATCGGTCCGTTATGATGCTCTCAAGTACGTTTACAACTCTCGAATGAAGGAGGAAACTTCTGTTAGGGAGCA
TGTCTTTAATATGATGGTCCACTTCAACGTGGCAGAGGTAAACGGGGTTGTCATAGATGAGAACAGTCAGGTCAGCTTTATAATGGAATCTCTTCTGAAGAGTTTCCTGT
AG
Protein sequence
MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL
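
As a consistency check on the sequences listed above, the following minimal Python sketch translates the CDS with the standard genetic code and compares the result with the listed protein sequence. It also derives the length of the genome location span. The function names `translate` and `span_length` are illustrative, and the span calculation assumes the coordinates are 1-based and inclusive, which the page does not state.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Sequences and location copied from the record above.
CDS = (
    "ATGGGATCATTACAGGCCATGTTTGGACAACCGTCCTCATCGGTCCGTTATGATGCTCTCAAGTACGTTTACAACTCTCGAATGAAGGAGGAAACTTCTGTTAGGGAGCA"
    "TGTCTTTAATATGATGGTCCACTTCAACGTGGCAGAGGTAAACGGGGTTGTCATAGATGAGAACAGTCAGGTCAGCTTTATAATGGAATCTCTTCTGAAGAGTTTCCTGT"
    "AG"
)
PROTEIN = "MGSLQAMFGQPSSSVRYDALKYVYNSRMKEETSVREHVFNMMVHFNVAEVNGVVIDENSQVSFIMESLLKSFL"
LOCATION = "LG09:10529030..10529251"


def translate(cds):
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    residues = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def span_length(location):
    """Length of 'chrom:start..end', assuming 1-based inclusive coordinates."""
    _, coords = location.split(":")
    start, end = (int(x) for x in coords.split(".."))
    return end - start + 1


print(translate(CDS) == PROTEIN)
print(len(CDS), span_length(LOCATION))
```

Comparing `len(CDS)` with `span_length(LOCATION)` shows whether the gene model is a single exon covering the whole span under the inclusive-coordinate assumption.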