CuGenDBv2

Tan0021955 (gene) of Snake gourd v1 genome

Gene ID: Tan0021955
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG02:75795368..75795970
RNA-Seq Expression: Tan0021955
Synteny: Tan0021955
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
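
The genome location listed above can be turned into a span length with a few lines of Python. This is only a minimal sketch: the helper name `parse_location` is hypothetical, and it assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'SEQID:START..END' string into (seqid, start, end, length)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Assumption: coordinates are 1-based and inclusive, so both ends count.
    length = abs(end - start) + 1
    return seqid, start, end, length


print(parse_location("LG02:75795368..75795970"))
# ('LG02', 75795368, 75795970, 603)
```

Under that assumption the span is 603 bp, which equals the 200-residue protein listed on this page plus one stop codon (201 codons).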


Homology
GenBank top hits | e value | % identity
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] | 4.1e-72 | 67.5
Query:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN
        M+S  + +L +++L G NY +WK  +NT+L++DDL FVL E+CP VPA N  ++V++ Y+RW KAN+K + YILAS+SEVLAK HE M++AREIM SLQ 
Subjt:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa] | 4.1e-72 | 67.5
Query:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN
        M+S  + +L +++L G NY +WK  +NT+L++DDL FVL E+CP VPA N  ++V++ Y+RW KAN+K + YILAS+SEVLAK HE M++AREIM SLQ 
Subjt:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | 4.1e-72 | 67.5
Query:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN
        M+S  + +L +++L G NY +WK  +NT+L++DDL FVL E+CP VPA N  ++V++ Y+RW KAN+K + YILAS+SEVLAK HE M++AREIM SLQ 
Subjt:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa] | 1.9e-72 | 68
Query:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN
        M+S  + +L +++L G NY  WK  +NT+L++DDL FVL E+CP VPA N  Q+V++ Y+RW KAN+K + YILAS+SEVLAK HE M++AREIM SLQ 
Subjt:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | 4.1e-72 | 67.5
Query:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN
        M+S  + +L +++L G NY +WK  +NT+L++DDL FVL E+CP VPA N  ++V++ Y+RW KAN+K + YILAS+SEVLAK HE M++AREIM SLQ 
Subjt:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG

TrEMBL top hits | e value | % identity
A0A5A7SMH8 Gag/pol protein | 2.0e-72 | 67.5
Query:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN
        M+S  + +L +++L G NY +WK  +NT+L++DDL FVL E+CP VPA N  ++V++ Y+RW KAN+K + YILAS+SEVLAK HE M++AREIM SLQ 
Subjt:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG

A0A5A7TU93 Gag/pol protein | 2.0e-72 | 67.5
Query:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN
        M+S  + +L +++L G NY +WK  +NT+L++DDL FVL E+CP VPA N  ++V++ Y+RW KAN+K + YILAS+SEVLAK HE M++AREIM SLQ 
Subjt:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG

A0A5A7TWB9 Gag/pol protein | 2.0e-72 | 67.5
Query:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN
        M+S  + +L +++L G NY +WK  +NT+L++DDL FVL E+CP VPA N  ++V++ Y+RW KAN+K + YILAS+SEVLAK HE M++AREIM SLQ 
Subjt:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG

A0A5A7V6N0 Gag/pol protein | 9.0e-73 | 68
Query:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN
        M+S  + +L +++L G NY  WK  +NT+L++DDL FVL E+CP VPA N  Q+V++ Y+RW KAN+K + YILAS+SEVLAK HE M++AREIM SLQ 
Subjt:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG

A0A5D3CPJ6 Gag/pol protein | 2.0e-72 | 67.5
Query:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN
        M+S  + +L +++L G NY +WK  +NT+L++DDL FVL E+CP VPA N  ++V++ Y+RW KAN+K + YILAS+SEVLAK HE M++AREIM SLQ 
Subjt:  MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+SVREHVL++MVHFNVAEMNGAVIDE SQVSFILESLP+SFLQF +NAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG
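
In the alignments above, the line between Query and Subjt marks each column: a letter for an identical residue, `+` for a similar one, and a space for a mismatch. That is the usual BLAST-style convention and is assumed here, since the page does not spell it out. A rough sketch of how a % identity figure could be recomputed from one alignment block, using a hypothetical helper `percent_identity`:

```python
def percent_identity(query, midline):
    """Percentage of alignment columns whose midline character is a letter."""
    # Trailing spaces in the midline are often lost when text is copied,
    # so pad it back out to the width of the query line.
    midline = midline.ljust(len(query))
    if not query or len(midline) != len(query):
        raise ValueError("query and midline must cover the same columns")
    # Assumption: a letter in the midline means the two residues are identical.
    identical = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identical / len(query)


# First ten columns of the first alignment block on this page.
print(percent_identity("MSSPIIALLK", "M+S  + +L "))  # 30.0
```

To compare against a reported value, the Query and midline strings of every block belonging to one hit would need to be concatenated first, because the reported figure presumably covers the whole alignment rather than a single block.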

SwissProt top hits | e value | % identity
No hits found
Arabidopsis top hits | e value | % identity
No hits found

Sequences
CDS sequence
ATGTCGTCCCCAATTATAGCCTTACTGAAAAGCGAACGTTTAACTGGTGAAAACTATACTACATGGAAGTACAACCTGAATACGATTCTTGTTGTTGACGACCTTTGGTT
TGTACTGACTGAGAAATGTCCTCATGTCCCTGCTCGAAACACTCCTCAATCTGTTAAGAAGACGTACGACCGCTGGATCAAGGCCAATGATAAGACCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTTTAGCCAAAAACCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGTTGCAGAACATGTTTGGACAACCATCTGGACAGCTTCGA
CACGAATCCCTCAAGTACGTTTATAACTCCCGTATGAAGGAGGGGTCGTCGGTGAGAGAACACGTTCTCGATCTGATGGTCCACTTCAACGTGGCTGAAATGAACGGCGC
TGTCATTGACGAGCAAAGTCAGGTCTCCTTCATCCTGGAATCTCTTCCGAAGAGTTTCCTGCAATTCCACAACAATGCGGTGATGAACAAGATAGAGTATGACCTGACTA
CTCTCCTTAATGAACTACAAACTTTCCAATCTCTTATGAAGAATAAGGGATAG
mRNA sequence
ATGTCGTCCCCAATTATAGCCTTACTGAAAAGCGAACGTTTAACTGGTGAAAACTATACTACATGGAAGTACAACCTGAATACGATTCTTGTTGTTGACGACCTTTGGTT
TGTACTGACTGAGAAATGTCCTCATGTCCCTGCTCGAAACACTCCTCAATCTGTTAAGAAGACGTACGACCGCTGGATCAAGGCCAATGATAAGACCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTTTAGCCAAAAACCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCGTTGCAGAACATGTTTGGACAACCATCTGGACAGCTTCGA
CACGAATCCCTCAAGTACGTTTATAACTCCCGTATGAAGGAGGGGTCGTCGGTGAGAGAACACGTTCTCGATCTGATGGTCCACTTCAACGTGGCTGAAATGAACGGCGC
TGTCATTGACGAGCAAAGTCAGGTCTCCTTCATCCTGGAATCTCTTCCGAAGAGTTTCCTGCAATTCCACAACAATGCGGTGATGAACAAGATAGAGTATGACCTGACTA
CTCTCCTTAATGAACTACAAACTTTCCAATCTCTTATGAAGAATAAGGGATAG
Protein sequence
MSSPIIALLKSERLTGENYTTWKYNLNTILVVDDLWFVLTEKCPHVPARNTPQSVKKTYDRWIKANDKTKVYILASVSEVLAKNHEGMVSAREIMSSLQNMFGQPSGQLR
HESLKYVYNSRMKEGSSVREHVLDLMVHFNVAEMNGAVIDEQSQVSFILESLPKSFLQFHNNAVMNKIEYDLTTLLNELQTFQSLMKNKG
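
The protein sequence above can be cross-checked against the CDS by translating it. The sketch below assumes the standard genetic code (NCBI translation table 1); `translate` is a hypothetical helper written for illustration and is not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove line breaks and spaces so a wrapped sequence can be pasted in.
    seq = "".join(cds.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(seq), 3):
        # Codons with ambiguous bases are reported as 'X'.
        aa = CODON_TABLE.get(seq[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First seven codons of the CDS listed on this page.
print(translate("ATGTCGTCCCCAATTATAGCC"))  # MSSPIIA
```

Passing the full CDS from this page should reproduce the listed protein sequence if the standard-code assumption holds for this gene.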