CuGenDBv2

Tan0003530 (gene) of Snake gourd v1 genome

Gene ID: Tan0003530
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG02:84671176..84671763
RNA-Seq Expression: Tan0003530
Synteny: Tan0003530
Gene Ontology terms:
  GO:0006508 - proteolysis (biological process)
  GO:0015074 - DNA integration (biological process)
  GO:0003676 - nucleic acid binding (molecular function)
  GO:0008234 - cysteine-type peptidase activity (molecular function)
  GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
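
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, though the resulting span is consistent with the CDS sequence listed for this gene), the span can be computed as shown here. The names `parse_location` and `span_length` are illustrative and are not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'LG02:84671176..84671763' style string into (sequence, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    seq_id, start, end = match.groups()
    return seq_id, int(start), int(end)


def span_length(start: int, end: int) -> int:
    # Assumption: coordinates are 1-based and inclusive.
    return end - start + 1


seq_id, start, end = parse_location("LG02:84671176..84671763")
print(seq_id, span_length(start, end))  # LG02 588
```

A span of 588 nucleotides equals 195 codons plus one stop codon, which matches a single-exon gene encoding the 195-residue protein shown in this record.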


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 9.3e-69 | % identity: 68.21
Query:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE
        M+S+ + +L + +L G N+ +WK+ +NT+L++DDL+FVL EECPQ+P  NA ++++E Y+RW KAN+KA  YILAS+SEVLAKKHE M++AREIM S  E
Subjt:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE

Query:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL
        MFGQ+S QI+H++LKY+YN+RM EG+SVREHVL+MMVHFNVAEMNGAVIDE SQVSFILESLP SFLQF SNAVM KI Y LTTLLNELQTF  L
Subjt:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 9.3e-69 | % identity: 68.21
Query:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE
        M+S+ + +L + +L G N+ +WK+ +NT+L++DDL+FVL EECPQ+P  NA ++++E Y+RW KAN+KA  YILAS+SEVLAKKHE M++AREIM S  E
Subjt:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE

Query:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL
        MFGQ+S QI+H++LKY+YN+RM EG+SVREHVL+MMVHFNVAEMNGAVIDE SQVSFILESLP SFLQF SNAVM KI Y LTTLLNELQTF  L
Subjt:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 9.3e-69 | % identity: 68.21
Query:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE
        M+S+ + +L + +L G N+ +WK+ +NT+L++DDL+FVL EECPQ+P  NA ++++E Y+RW KAN+KA  YILAS+SEVLAKKHE M++AREIM S  E
Subjt:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE

Query:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL
        MFGQ+S QI+H++LKY+YN+RM EG+SVREHVL+MMVHFNVAEMNGAVIDE SQVSFILESLP SFLQF SNAVM KI Y LTTLLNELQTF  L
Subjt:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 9.3e-69 | % identity: 68.21
Query:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE
        M+S+ + +L + +L G N+ +WK+ +NT+L++DDL+FVL EECPQ+P  NA ++++E Y+RW KAN+KA  YILAS+SEVLAKKHE M++AREIM S  E
Subjt:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE

Query:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL
        MFGQ+S QI+H++LKY+YN+RM EG+SVREHVL+MMVHFNVAEMNGAVIDE SQVSFILESLP SFLQF SNAVM KI Y LTTLLNELQTF  L
Subjt:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa] | e value: 9.3e-69 | % identity: 68.21
Query:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE
        M+S+ + +L + +L G N+ +WK+ +NT+L++DDL+FVL EECPQ+P  NA ++++E Y+RW KAN+KA  YILAS+SEVLAKKHE M++AREIM S  E
Subjt:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE

Query:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL
        MFGQ+S QI+H++LKY+YN+RM EG+SVREHVL+MMVHFNVAEMNGAVIDE SQVSFILESLP SFLQF SNAVM KI Y LTTLLNELQTF  L
Subjt:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein | e value: 4.5e-69 | % identity: 68.21
Query:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE
        M+S+ + +L + +L G N+ +WK+ +NT+L++DDL+FVL EECPQ+P  NA ++++E Y+RW KAN+KA  YILAS+SEVLAKKHE M++AREIM S  E
Subjt:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE

Query:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL
        MFGQ+S QI+H++LKY+YN+RM EG+SVREHVL+MMVHFNVAEMNGAVIDE SQVSFILESLP SFLQF SNAVM KI Y LTTLLNELQTF  L
Subjt:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL

A0A5A7TU93 Gag/pol protein | e value: 4.5e-69 | % identity: 68.21
Query:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE
        M+S+ + +L + +L G N+ +WK+ +NT+L++DDL+FVL EECPQ+P  NA ++++E Y+RW KAN+KA  YILAS+SEVLAKKHE M++AREIM S  E
Subjt:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE

Query:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL
        MFGQ+S QI+H++LKY+YN+RM EG+SVREHVL+MMVHFNVAEMNGAVIDE SQVSFILESLP SFLQF SNAVM KI Y LTTLLNELQTF  L
Subjt:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL

A0A5A7TWB9 Gag/pol protein | e value: 4.5e-69 | % identity: 68.21
Query:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE
        M+S+ + +L + +L G N+ +WK+ +NT+L++DDL+FVL EECPQ+P  NA ++++E Y+RW KAN+KA  YILAS+SEVLAKKHE M++AREIM S  E
Subjt:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE

Query:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL
        MFGQ+S QI+H++LKY+YN+RM EG+SVREHVL+MMVHFNVAEMNGAVIDE SQVSFILESLP SFLQF SNAVM KI Y LTTLLNELQTF  L
Subjt:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL

A0A5D3CPJ6 Gag/pol protein | e value: 4.5e-69 | % identity: 68.21
Query:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE
        M+S+ + +L + +L G N+ +WK+ +NT+L++DDL+FVL EECPQ+P  NA ++++E Y+RW KAN+KA  YILAS+SEVLAKKHE M++AREIM S  E
Subjt:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE

Query:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL
        MFGQ+S QI+H++LKY+YN+RM EG+SVREHVL+MMVHFNVAEMNGAVIDE SQVSFILESLP SFLQF SNAVM KI Y LTTLLNELQTF  L
Subjt:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL

A0A5D3CSZ6 Gag/pol protein | e value: 4.5e-69 | % identity: 68.21
Query:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE
        M+S+ + +L + +L G N+ +WK+ +NT+L++DDL+FVL EECPQ+P  NA ++++E Y+RW KAN+KA  YILAS+SEVLAKKHE M++AREIM S  E
Subjt:  MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAE

Query:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL
        MFGQ+S QI+H++LKY+YN+RM EG+SVREHVL+MMVHFNVAEMNGAVIDE SQVSFILESLP SFLQF SNAVM KI Y LTTLLNELQTF  L
Subjt:  MFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT4G00980.1 zinc knuckle (CCHC-type) family protein | e value: 1.6e-05 | % identity: 21.91
Query:  SSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDR-------WIKANDKANVYILASVSEVLAKKH-EGMVSARE
        SS   +++K  R  G+++  W S +   L    L +VLSE CP I +   P++      R       W++ +     +++ S+S+ L +++ +    A+E
Subjt:  SSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDR-------WIKANDKANVYILASVSEVLAKKH-EGMVSARE

Query:  IMSSFAEMFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCS
        +      ++     + +   ++     RM E   + E V       +     G  +DE   VS I+   P S+  FC+
Subjt:  IMSSFAEMFGQSSGQIRHESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCS
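
The middle line of each alignment block above summarizes the match between query and subject. As a hedged sketch, assuming the usual BLAST-style convention (a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap), percent identity for a segment can be estimated from that line alone. The function name `midline_stats` is hypothetical, and the input below is only the first ten columns of the first GenBank alignment, so the result is not the whole-alignment figure reported in the table.

```python
def midline_stats(query_segment: str, midline: str) -> dict[str, float]:
    """Estimate identity and positives for one aligned segment.

    Assumption: BLAST-style midline, where letters are identical residues
    and '+' marks conservative substitutions.
    """
    length = len(query_segment)
    # Trailing blanks are often stripped during extraction, so pad the midline.
    midline = midline.ljust(length)[:length]
    identical = sum(ch.isalpha() for ch in midline)
    positives = identical + midline.count("+")
    return {
        "identity_pct": 100 * identical / length,
        "positives_pct": 100 * positives / length,
    }


stats = midline_stats("MSSSIIALLK", "M+S+ + +L")
print(stats)  # {'identity_pct': 30.0, 'positives_pct': 70.0}
```

Applied to a complete alignment, the identical and positive counts from every block would be summed before dividing by the total aligned length, gap columns included.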


Sequences
CDS sequence
ATGTCATCCTCAATAATAGCCTTACTTAAAAGCGTCCGTTTAACTGGTGAGAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTCGTTGTTGACGACCTACAGTT
TGTACTGTCTGAGGAATGTCCTCAGATCCCTACTCGTAACGCTCCTCAATCTATTAAGGAGGCATATGACCGCTGGATCAAGGCCAATGATAAGGCCAATGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCATTTGCAGAAATGTTTGGACAATCGTCTGGGCAGATTCGG
CACGAATCCCTAAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATATGATGGTCCACTTCAATGTGGCTGAGATGAACGGAGC
GGTCATTGACGAGCAAAGTCAGGTATCATTCATTCTGGAATCTCTTCCGATGAGTTTCCTGCAATTCTGTAGCAATGCGGTGATGAAAAAGATAAGGTATAACCTGACTA
CTCTCCTTAATGAACTACAAACTTTCAGTCTCTTATGA
mRNA sequence
ATGTCATCCTCAATAATAGCCTTACTTAAAAGCGTCCGTTTAACTGGTGAGAATTTTACTACGTGGAAGTCCAACTTGAATACGATTCTCGTTGTTGACGACCTACAGTT
TGTACTGTCTGAGGAATGTCCTCAGATCCCTACTCGTAACGCTCCTCAATCTATTAAGGAGGCATATGACCGCTGGATCAAGGCCAATGATAAGGCCAATGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAGCACGAGGGCATGGTCTCAGCTCGTGAGATCATGAGTTCATTTGCAGAAATGTTTGGACAATCGTCTGGGCAGATTCGG
CACGAATCCCTAAAATACGTTTATAACTCCCGTATGAAGGAGGGGTCATCGGTGAGAGAACACGTTCTTGATATGATGGTCCACTTCAATGTGGCTGAGATGAACGGAGC
GGTCATTGACGAGCAAAGTCAGGTATCATTCATTCTGGAATCTCTTCCGATGAGTTTCCTGCAATTCTGTAGCAATGCGGTGATGAAAAAGATAAGGTATAACCTGACTA
CTCTCCTTAATGAACTACAAACTTTCAGTCTCTTATGA
Protein sequence
MSSSIIALLKSVRLTGENFTTWKSNLNTILVVDDLQFVLSEECPQIPTRNAPQSIKEAYDRWIKANDKANVYILASVSEVLAKKHEGMVSAREIMSSFAEMFGQSSGQIR
HESLKYVYNSRMKEGSSVREHVLDMMVHFNVAEMNGAVIDEQSQVSFILESLPMSFLQFCSNAVMKKIRYNLTTLLNELQTFSLL
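
The protein sequence above corresponds to a translation of the CDS. A minimal sketch of that step follows, assuming the standard genetic code (the record does not state which translation table was used) and a CDS that starts in frame. `translate` is an illustrative helper, not a CuGenDB function, and only the first and last codons of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate an in-frame CDS, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGTCATCCTCAATAATAGCC"))  # MSSSIIA (start of the CDS)
print(translate("TTCAGTCTCTTATGA"))        # FSLL (end of the CDS, then stop)
```

Both fragments agree with the start (`MSSSIIA`) and end (`FSLL`) of the protein sequence in this record.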