CuGenDBv2

Tan0007647 (gene) of Snake gourd v1 genome

Gene ID: Tan0007647
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG11:39281646..39282256
RNA-Seq Expression: Tan0007647
Synteny: Tan0007647
Gene Ontology terms:
GO:0006508 - proteolysis (biological process)
GO:0015074 - DNA integration (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008234 - cysteine-type peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domains: NA
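
The genome location field above can be split into its parts to get the length of the locus. The following is a minimal sketch; it assumes the coordinates are 1-based and inclusive (the record does not state the convention), and `parse_location` is a hypothetical helper name, not part of any CuGenDB tool.

```python
import re


def parse_location(location: str) -> tuple[str, int, int, int]:
    """Split 'SEQID:START..END' into (seqid, start, end, span_length).

    Assumption: START and END are 1-based and inclusive, so the span
    length is END - START + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    seqid = match.group(1)
    start = int(match.group(2))
    end = int(match.group(3))
    return seqid, start, end, end - start + 1


print(parse_location("LG11:39281646..39282256"))
# ('LG11', 39281646, 39282256, 611)
```

Under that assumption the locus spans 611 bp on LG11.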


Homology
GenBank top hits (e value, % identity, alignment)
KAA0035879.1 gag/pol protein [Cucumis melo var. makuwa], e value 1.2e-49, 62.99% identity
Query:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD
        M+S  + +LA+DKL G+N+ +WKN INT+ + +DL+FV  EE PQ+ ++ A+R+VR+ Y+RW KANE  + YI+ASLSEVLAKKHESM+T +EIMDSLQ+
Subjt:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD

Query:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ
        MFGQ S+Q++HD+LK+++NARM EG+SVREHVLNMM HFN+AEMN A IDE+SQ
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ

KAA0044955.1 gag/pol protein [Cucumis melo var. makuwa], e value 1.2e-49, 62.99% identity
Query:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD
        M+S  + +LA+DKL G+N+ +WKN INT+ + +DL+FV  EE PQ+ ++ A+R+VR+ Y+RW KANE  + YI+ASLSEVLAKKHESM+T +EIMDSLQ+
Subjt:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD

Query:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ
        MFGQ S+Q++HD+LK+++NARM EG+SVREHVLNMM HFN+AEMN A IDE+SQ
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ

KAA0051952.1 gag/pol protein [Cucumis melo var. makuwa], e value 1.2e-49, 62.99% identity
Query:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD
        M+S  + +LA+DKL G+N+ +WKN INT+ + +DL+FV  EE PQ+ ++ A+R+VR+ Y+RW KANE  + YI+ASLSEVLAKKHESM+T +EIMDSLQ+
Subjt:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD

Query:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ
        MFGQ S+Q++HD+LK+++NARM EG+SVREHVLNMM HFN+AEMN A IDE+SQ
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ

KAA0054490.1 gag/pol protein [Cucumis melo var. makuwa], e value 1.2e-49, 62.99% identity
Query:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD
        M+S  + +LA+DKL G+N+ +WKN INT+ + +DL+FV  EE PQ+ ++ A+R+VR+ Y+RW KANE  + YI+ASLSEVLAKKHESM+T +EIMDSLQ+
Subjt:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD

Query:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ
        MFGQ S+Q++HD+LK+++NARM EG+SVREHVLNMM HFN+AEMN A IDE+SQ
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ

TYK14550.1 gag/pol protein [Cucumis melo var. makuwa], e value 1.2e-49, 62.99% identity
Query:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD
        M+S  + +LA+DKL G+N+ +WKN INT+ + +DL+FV  EE PQ+ ++ A+R+VR+ Y+RW KANE  + YI+ASLSEVLAKKHESM+T +EIMDSLQ+
Subjt:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD

Query:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ
        MFGQ S+Q++HD+LK+++NARM EG+SVREHVLNMM HFN+AEMN A IDE+SQ
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SMH8 Gag/pol protein, e value 5.8e-50, 62.99% identity
Query:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD
        M+S  + +LA+DKL G+N+ +WKN INT+ + +DL+FV  EE PQ+ ++ A+R+VR+ Y+RW KANE  + YI+ASLSEVLAKKHESM+T +EIMDSLQ+
Subjt:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD

Query:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ
        MFGQ S+Q++HD+LK+++NARM EG+SVREHVLNMM HFN+AEMN A IDE+SQ
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ

A0A5A7TU93 Gag/pol protein, e value 5.8e-50, 62.99% identity
Query:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD
        M+S  + +LA+DKL G+N+ +WKN INT+ + +DL+FV  EE PQ+ ++ A+R+VR+ Y+RW KANE  + YI+ASLSEVLAKKHESM+T +EIMDSLQ+
Subjt:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD

Query:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ
        MFGQ S+Q++HD+LK+++NARM EG+SVREHVLNMM HFN+AEMN A IDE+SQ
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ

A0A5A7TWB9 Gag/pol protein, e value 5.8e-50, 62.99% identity
Query:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD
        M+S  + +LA+DKL G+N+ +WKN INT+ + +DL+FV  EE PQ+ ++ A+R+VR+ Y+RW KANE  + YI+ASLSEVLAKKHESM+T +EIMDSLQ+
Subjt:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD

Query:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ
        MFGQ S+Q++HD+LK+++NARM EG+SVREHVLNMM HFN+AEMN A IDE+SQ
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ

A0A5D3CPJ6 Gag/pol protein, e value 5.8e-50, 62.99% identity
Query:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD
        M+S  + +LA+DKL G+N+ +WKN INT+ + +DL+FV  EE PQ+ ++ A+R+VR+ Y+RW KANE  + YI+ASLSEVLAKKHESM+T +EIMDSLQ+
Subjt:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD

Query:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ
        MFGQ S+Q++HD+LK+++NARM EG+SVREHVLNMM HFN+AEMN A IDE+SQ
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ

A0A5D3CSZ6 Gag/pol protein, e value 5.8e-50, 62.99% identity
Query:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD
        M+S  + +LA+DKL G+N+ +WKN INT+ + +DL+FV  EE PQ+ ++ A+R+VR+ Y+RW KANE  + YI+ASLSEVLAKKHESM+T +EIMDSLQ+
Subjt:  MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD

Query:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ
        MFGQ S+Q++HD+LK+++NARM EG+SVREHVLNMM HFN+AEMN A IDE+SQ
Subjt:  MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found
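
The % identity reported for the hits above can be checked against the alignment text itself. The sketch below assumes the usual BLAST-style midline convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and takes the alignment length to be the number of query columns; `percent_identity` is a hypothetical helper name. The two query and midline segments are copied from the first GenBank hit.

```python
def percent_identity(query_segments, midline_segments):
    """Percent identity = identical columns / aligned columns * 100.

    Assumptions: letters in the midline mark identities, and the aligned
    length equals the total length of the query segments.
    """
    aligned_columns = sum(len(segment) for segment in query_segments)
    identities = sum(
        ch.isalpha() for segment in midline_segments for ch in segment
    )
    return round(100.0 * identities / aligned_columns, 2)


query = [
    "MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQD",
    "MFGQQSFQVRHDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQ",
]
midline = [
    "M+S  + +LA+DKL G+N+ +WKN INT+ + +DL+FV  EE PQ+ ++ A+R+VR+ Y+RW KANE  + YI+ASLSEVLAKKHESM+T +EIMDSLQ+",
    "MFGQ S+Q++HD+LK+++NARM EG+SVREHVLNMM HFN+AEMN A IDE+SQ",
]

print(percent_identity(query, midline))  # 62.99
```

Here 97 identical residues over 154 aligned columns give 62.99%, which agrees with the value listed for each hit.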

Sequences
CDS sequence
ATGTCTAGCTTAATAATCGCTTTACTAGCTTCCGACAAATTAGTGGGAGATAACTTCCAAACATGGAAGAATAATATTAACACGATTCAAGTAACTAACGACCTGAAGTT
CGTGCCTACTGAGGAGTATCCTCAGTTGTCGAGCTCGACTGCATCACGAAGTGTTCGTGATGCTTACGATCGATGGATCAAGGCCAATGAAAATGGCAAGGTCTATATAA
TTGCCAGCTTATCTGAAGTCTTAGCAAAGAAGCATGAGTCGATGGTCACCGGAAAGGAGATCATGGATTCGTTGCAGGACATGTTTGGACAACAGTCCTTTCAAGTCAGG
CACGATTCACTCAAACACGTCTTCAACGCCCGGATGAAAGAAGGGTCGTCTGTCCGTGAACATGTTCTAAACATGATGACCCACTTTAATCTTGCGGAGATGAACGAGGC
TTCGATCGACGAGTCGAGCCAGCAACGGTGTTATGAACAAGATAAACTACACTCTTACCACCATTCTCAACGAGCTACAGAACTTTCAGTCCTTGATGAGGATCAGGGCA
TCGAAATTTGA
mRNA sequence
ATGTCTAGCTTAATAATCGCTTTACTAGCTTCCGACAAATTAGTGGGAGATAACTTCCAAACATGGAAGAATAATATTAACACGATTCAAGTAACTAACGACCTGAAGTT
CGTGCCTACTGAGGAGTATCCTCAGTTGTCGAGCTCGACTGCATCACGAAGTGTTCGTGATGCTTACGATCGATGGATCAAGGCCAATGAAAATGGCAAGGTCTATATAA
TTGCCAGCTTATCTGAAGTCTTAGCAAAGAAGCATGAGTCGATGGTCACCGGAAAGGAGATCATGGATTCGTTGCAGGACATGTTTGGACAACAGTCCTTTCAAGTCAGG
CACGATTCACTCAAACACGTCTTCAACGCCCGGATGAAAGAAGGGTCGTCTGTCCGTGAACATGTTCTAAACATGATGACCCACTTTAATCTTGCGGAGATGAACGAGGC
TTCGATCGACGAGTCGAGCCAGCAACGGTGTTATGAACAAGATAAACTACACTCTTACCACCATTCTCAACGAGCTACAGAACTTTCAGTCCTTGATGAGGATCAGGGCA
TCGAAATTTGA
Protein sequence
MSSLIIALLASDKLVGDNFQTWKNNINTIQVTNDLKFVPTEEYPQLSSSTASRSVRDAYDRWIKANENGKVYIIASLSEVLAKKHESMVTGKEIMDSLQDMFGQQSFQVR
HDSLKHVFNARMKEGSSVREHVLNMMTHFNLAEMNEASIDESSQQRCYEQDKLHSYHHSQRATELSVLDEDQGIEI
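
The relationship between the CDS and protein sequences above can be illustrated with a small translation routine. This is a minimal sketch, assuming the standard nuclear genetic code applies to this gene; `translate` is a hypothetical helper written for illustration, not the database's own pipeline. The two example inputs are the first 30 and last 27 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from wrapped input
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


print(translate("ATGTCTAGCTTAATAATCGCTTTACTAGCT"))  # MSSLIIALLA
print(translate("GATGAGGATCAGGGCATCGAAATTTGA"))     # DEDQGIEI
```

Both fragments match the start and end of the protein sequence. The lengths are also consistent: the 561-nucleotide CDS corresponds to 186 codons plus one stop codon.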