; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0018955 (gene) of Snake gourd v1 genome

Gene IDTan0018955
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionGag/pol protein
Genome locationLG01:30234570..30235172
RNA-Seq ExpressionTan0018955
SyntenyTan0018955
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa]3.9e-7066.5Show/hide
Query:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN
        M+S+ + +L +++L G NYA+WK+ +NT+L++DDL+F+L EECPQVPA NA ++V++ Y+ W KAN+K + YILAS+SEVLAKKHE M++ REIM SLQ 
Subjt:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+ VREHVL++MVH NVAEMN AVIDE SQVSFIL+SL +SFLQF SNAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNKG

KAA0061339.1 gag/pol protein [Cucumis melo var. makuwa]1.0e-7067Show/hide
Query:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN
        M+S+ + +L +++L G NYA WK+ +NT+L++DDL+F+L EECPQVPA NA Q+V++ Y+ W KAN+K + YILAS+SEVLAKKHE M++ REIM SLQ 
Subjt:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+ VREHVL++MVH NVAEMN AVIDE SQVSFIL+SL +SFLQF SNAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNKG

KAA0063887.1 gag/pol protein [Cucumis melo var. makuwa]9.2e-7270.85Show/hide
Query:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN
        MSSSIIALLK ++LTGENYATWKS LN ILV+ DL+F+L EECP  P +NA QSVKDAYDHW KANDK  +Y+LAS+S++L+KKHE MV+ R+IM SL+ 
Subjt:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNK
        MFGQPS Q++ E++KYVYN+RMKEG  VREHVL ++V+ NVAEMN A+ DE+SQVS+ILKSLSKSFLQF SN  MNKIEYN+TTLL ELQTFQSL   K
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNK

TYK15919.1 gag/pol protein [Cucumis melo var. makuwa]9.2e-7270.85Show/hide
Query:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN
        MSSSIIALLK ++LTGENYATWKS LN ILV+ DL+F+L EECP  P +NA QSVKDAYDHW KANDK  +Y+LAS+S++L+KKHE MV+ R+IM SL+ 
Subjt:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNK
        MFGQPS Q++ E++KYVYN+RMKEG  VREHVL ++V+ NVAEMN A+ DE+SQVS+ILKSLSKSFLQF SN  MNKIEYN+TTLL ELQTFQSL   K
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNK

XP_022158568.1 uncharacterized protein LOC111025021 [Momordica charantia]3.5e-7168Show/hide
Query:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN
        MS+S I LL S++L G+NY  WKSNLNTILV+DDL+F+LTEECP  PA NA ++V+DAYD W+KAN+K +VYILAS+SEVL+KKHE + +TREIM SLQ 
Subjt:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNKG
        +FGQPS  L H+++KYVYN RMKEGS VREHVL++MVH NVAE+ND V++E SQV FI++SL KS+ QF  NA+MNKIEY+LTTLLNELQ ++SL+KNKG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNKG

TrEMBL top hitse value%identityAlignment
A0A5A7SMH8 Gag/pol protein1.9e-7066.5Show/hide
Query:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN
        M+S+ + +L +++L G NYA+WK+ +NT+L++DDL+F+L EECPQVPA NA ++V++ Y+ W KAN+K + YILAS+SEVLAKKHE M++ REIM SLQ 
Subjt:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+ VREHVL++MVH NVAEMN AVIDE SQVSFIL+SL +SFLQF SNAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNKG

A0A5A7V6N0 Gag/pol protein4.9e-7167Show/hide
Query:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN
        M+S+ + +L +++L G NYA WK+ +NT+L++DDL+F+L EECPQVPA NA Q+V++ Y+ W KAN+K + YILAS+SEVLAKKHE M++ REIM SLQ 
Subjt:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNKG
        MFGQ S Q++H++LKY+YN+RM EG+ VREHVL++MVH NVAEMN AVIDE SQVSFIL+SL +SFLQF SNAVMNKI Y LTTLLNELQTF+SLMK KG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNKG

A0A5A7VA67 Gag/pol protein4.5e-7270.85Show/hide
Query:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN
        MSSSIIALLK ++LTGENYATWKS LN ILV+ DL+F+L EECP  P +NA QSVKDAYDHW KANDK  +Y+LAS+S++L+KKHE MV+ R+IM SL+ 
Subjt:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNK
        MFGQPS Q++ E++KYVYN+RMKEG  VREHVL ++V+ NVAEMN A+ DE+SQVS+ILKSLSKSFLQF SN  MNKIEYN+TTLL ELQTFQSL   K
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNK

A0A5D3D0D9 Gag/pol protein4.5e-7270.85Show/hide
Query:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN
        MSSSIIALLK ++LTGENYATWKS LN ILV+ DL+F+L EECP  P +NA QSVKDAYDHW KANDK  +Y+LAS+S++L+KKHE MV+ R+IM SL+ 
Subjt:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNK
        MFGQPS Q++ E++KYVYN+RMKEG  VREHVL ++V+ NVAEMN A+ DE+SQVS+ILKSLSKSFLQF SN  MNKIEYN+TTLL ELQTFQSL   K
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNK

A0A6J1DWG6 uncharacterized protein LOC1110250211.7e-7168Show/hide
Query:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN
        MS+S I LL S++L G+NY  WKSNLNTILV+DDL+F+LTEECP  PA NA ++V+DAYD W+KAN+K +VYILAS+SEVL+KKHE + +TREIM SLQ 
Subjt:  MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQN

Query:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNKG
        +FGQPS  L H+++KYVYN RMKEGS VREHVL++MVH NVAE+ND V++E SQV FI++SL KS+ QF  NA+MNKIEY+LTTLLNELQ ++SL+KNKG
Subjt:  MFGQPSGQLRHESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNKG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCGTCCTCAATAATAGCCTTACTCAAAAGTGAACGTTTAACTGGCGAGAATTATGCTACGTGGAAGTCCAACCTGAATACGATTCTTGTTGTTGATGACCTACAGTT
TATACTTACTGAGGAATGTCCTCAGGTCCCTGCTCGAAACGCTCCTCAATCTGTTAAGGATGCATACGACCATTGGATCAAGGCCAATGACAAGACCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAACACGAGGGCATGGTCTCAACTCGTGAGATCATGAGTTCGTTGCAGAATATGTTTGGACAACCGTCTGGACAGCTTAGG
CATGAATCCCTCAAGTACGTTTATAACTCCCGTATGAAGGAGGGATCTTTGGTGAGAGAACATGTTCTCGATGTGATGGTCCACTTAAACGTGGCAGAAATGAACGACGC
GGTCATCGACGAGCAAAGTCAGGTGTCCTTCATCCTGAAATCTCTTTCGAAGAGTTTCCTGCAATTCAGCAGCAATGCGGTGATGAACAAGATAGAATACAACTTGACTA
CTCTCCTCAATGAGCTACAGACTTTCCAGTCCCTTATGAAGAATAAGGGATAG
mRNA sequenceShow/hide mRNA sequence
ATGTCGTCCTCAATAATAGCCTTACTCAAAAGTGAACGTTTAACTGGCGAGAATTATGCTACGTGGAAGTCCAACCTGAATACGATTCTTGTTGTTGATGACCTACAGTT
TATACTTACTGAGGAATGTCCTCAGGTCCCTGCTCGAAACGCTCCTCAATCTGTTAAGGATGCATACGACCATTGGATCAAGGCCAATGACAAGACCAAGGTCTACATTT
TGGCTAGTGTTTCTGAAGTTCTGGCCAAAAAACACGAGGGCATGGTCTCAACTCGTGAGATCATGAGTTCGTTGCAGAATATGTTTGGACAACCGTCTGGACAGCTTAGG
CATGAATCCCTCAAGTACGTTTATAACTCCCGTATGAAGGAGGGATCTTTGGTGAGAGAACATGTTCTCGATGTGATGGTCCACTTAAACGTGGCAGAAATGAACGACGC
GGTCATCGACGAGCAAAGTCAGGTGTCCTTCATCCTGAAATCTCTTTCGAAGAGTTTCCTGCAATTCAGCAGCAATGCGGTGATGAACAAGATAGAATACAACTTGACTA
CTCTCCTCAATGAGCTACAGACTTTCCAGTCCCTTATGAAGAATAAGGGATAG
Protein sequenceShow/hide protein sequence
MSSSIIALLKSERLTGENYATWKSNLNTILVVDDLQFILTEECPQVPARNAPQSVKDAYDHWIKANDKTKVYILASVSEVLAKKHEGMVSTREIMSSLQNMFGQPSGQLR
HESLKYVYNSRMKEGSLVREHVLDVMVHLNVAEMNDAVIDEQSQVSFILKSLSKSFLQFSSNAVMNKIEYNLTTLLNELQTFQSLMKNKG