CuGenDBv2

Tan0010338 (gene) of Snake gourd v1 genome

Gene ID: Tan0010338
Organism: Trichosanthes anguina (Snake gourd v1)
Description: Gag/pol protein
Genome location: LG09:2948337..2949207
RNA-Seq Expression: Tan0010338
Synteny: Tan0010338
Gene Ontology terms:
GO:0006807 - nitrogen compound metabolic process (biological process)
GO:0043170 - macromolecule metabolic process (biological process)
GO:0044238 - primary metabolic process (biological process)
GO:0046872 - metal ion binding (molecular function)
InterPro domains: IPR013103 - Reverse transcriptase, RNA-dependent DNA polymerase


Homology
GenBank top hits (e value, % identity, alignment)
KAA0025358.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 4.1e-68, identity: 60.79%)
Query:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK-----------------------------------------------
        M DVDKD+W+KAM+ EM+SMYFNSVWELVD P+GVKPIGCKWIYKRKR   GK                                               
Subjt:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK-----------------------------------------------

Query:  ------NVDEPCVYKKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKK
              NVDEPCVYKKI    VAFL+LYVDDILLIGNDVG LT++K WLA+QFQMKDLGEAQYVLG+QI++++KNK LA+S+A+YIDKML RY MQNSKK
Subjt:  ------NVDEPCVYKKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKK

Query:  GLLPFRHGVHLFKDQCPKTPQEVEDMR
        GLLPFRHGVHL K+QC KTPQEVED+R
Subjt:  GLLPFRHGVHLFKDQCPKTPQEVEDMR

KAA0051787.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 4.1e-68, identity: 65.85%)
Query:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKR--------------------------------GVDGKNVDEPCVYKKIINCT
        M DVD ++WIK M+ E++SMY NSVW LVDQP+ VKPIGCKWIYKRKR                                G + KNVDEPCVYK+IIN T
Subjt:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKR--------------------------------GVDGKNVDEPCVYKKIINCT

Query:  VAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHGVHLFKDQCPKTPQ
        VAFL+LYVDDILLIGNDVG L +IKEWLA+QFQMKDLG AQYVLG+QIV+N+KNK LAMS+ SYIDKMLSRYKMQNSKK LLP+R+G+HL K+QC KTPQ
Subjt:  VAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHGVHLFKDQCPKTPQ

Query:  EVEDM
        E+EDM
Subjt:  EVEDM

KAA0062926.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.4e-70, identity: 63.47%)
Query:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK---------------------------------------------NV
        M DVDKD+W+KAM+ EM+S+YFNSVWELVD P+GVKPIGCKWIYKRKR   GK                                             NV
Subjt:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK---------------------------------------------NV

Query:  DEPCVYKKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHG
        DEPCVYKKI    VAFL+LYVDDILLI NDVG+LT++K WLA+QFQMKDLGEAQYVLG+QI++++KNKTLA+S+A+YIDKML RY MQNSKKGLLPFRHG
Subjt:  DEPCVYKKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHG

Query:  VHLFKDQCPKTPQEVEDMR
        VHL K+QCPKTPQEVEDMR
Subjt:  VHLFKDQCPKTPQEVEDMR

TYK00851.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 5.4e-68, identity: 63.68%)
Query:  DVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK-----------------------------------------NVDEPCVY
        +VD D+WIK ++ EM+SMY NSVW LVDQ + VKPIGCKWIYKRKR   GK                                         NVDEPCVY
Subjt:  DVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK-----------------------------------------NVDEPCVY

Query:  KKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHGVHLFKD
        K+IIN TVAFL++YVDDILLIGNDVG LT+IK+WLA+QFQ+KDLG AQYVLG+QIV+N+KNKTLAMS+ SYIDKMLSRYKMQNSKKGLLP+R+G+HL K+
Subjt:  KKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHGVHLFKD

Query:  QCPKTPQEVEDM
        QCPKTPQEVEDM
Subjt:  QCPKTPQEVEDM

TYK16417.1 gag/pol protein [Cucumis melo var. makuwa] (e value: 3.4e-70, identity: 63.47%)
Query:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK---------------------------------------------NV
        M DVDKD+W+KAM+ EM+S+YFNSVWELVD P+GVKPIGCKWIYKRKR   GK                                             NV
Subjt:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK---------------------------------------------NV

Query:  DEPCVYKKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHG
        DEPCVYKKI    VAFL+LYVDDILLI NDVG+LT++K WLA+QFQMKDLGEAQYVLG+QI++++KNKTLA+S+A+YIDKML RY MQNSKKGLLPFRHG
Subjt:  DEPCVYKKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHG

Query:  VHLFKDQCPKTPQEVEDMR
        VHL K+QCPKTPQEVEDMR
Subjt:  VHLFKDQCPKTPQEVEDMR

TrEMBL top hits (e value, % identity, alignment)
A0A5A7SGT6 Gag/pol protein (e value: 2.0e-68, identity: 60.79%)
Query:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK-----------------------------------------------
        M DVDKD+W+KAM+ EM+SMYFNSVWELVD P+GVKPIGCKWIYKRKR   GK                                               
Subjt:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK-----------------------------------------------

Query:  ------NVDEPCVYKKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKK
              NVDEPCVYKKI    VAFL+LYVDDILLIGNDVG LT++K WLA+QFQMKDLGEAQYVLG+QI++++KNK LA+S+A+YIDKML RY MQNSKK
Subjt:  ------NVDEPCVYKKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKK

Query:  GLLPFRHGVHLFKDQCPKTPQEVEDMR
        GLLPFRHGVHL K+QC KTPQEVED+R
Subjt:  GLLPFRHGVHLFKDQCPKTPQEVEDMR

A0A5A7UBK6 Gag/pol protein (e value: 2.0e-68, identity: 65.85%)
Query:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKR--------------------------------GVDGKNVDEPCVYKKIINCT
        M DVD ++WIK M+ E++SMY NSVW LVDQP+ VKPIGCKWIYKRKR                                G + KNVDEPCVYK+IIN T
Subjt:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKR--------------------------------GVDGKNVDEPCVYKKIINCT

Query:  VAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHGVHLFKDQCPKTPQ
        VAFL+LYVDDILLIGNDVG L +IKEWLA+QFQMKDLG AQYVLG+QIV+N+KNK LAMS+ SYIDKMLSRYKMQNSKK LLP+R+G+HL K+QC KTPQ
Subjt:  VAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHGVHLFKDQCPKTPQ

Query:  EVEDM
        E+EDM
Subjt:  EVEDM

A0A5A7V901 Gag/pol protein (e value: 1.6e-70, identity: 63.47%)
Query:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK---------------------------------------------NV
        M DVDKD+W+KAM+ EM+S+YFNSVWELVD P+GVKPIGCKWIYKRKR   GK                                             NV
Subjt:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK---------------------------------------------NV

Query:  DEPCVYKKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHG
        DEPCVYKKI    VAFL+LYVDDILLI NDVG+LT++K WLA+QFQMKDLGEAQYVLG+QI++++KNKTLA+S+A+YIDKML RY MQNSKKGLLPFRHG
Subjt:  DEPCVYKKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHG

Query:  VHLFKDQCPKTPQEVEDMR
        VHL K+QCPKTPQEVEDMR
Subjt:  VHLFKDQCPKTPQEVEDMR

A0A5D3BM03 Gag/pol protein (e value: 2.6e-68, identity: 63.68%)
Query:  DVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK-----------------------------------------NVDEPCVY
        +VD D+WIK ++ EM+SMY NSVW LVDQ + VKPIGCKWIYKRKR   GK                                         NVDEPCVY
Subjt:  DVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK-----------------------------------------NVDEPCVY

Query:  KKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHGVHLFKD
        K+IIN TVAFL++YVDDILLIGNDVG LT+IK+WLA+QFQ+KDLG AQYVLG+QIV+N+KNKTLAMS+ SYIDKMLSRYKMQNSKKGLLP+R+G+HL K+
Subjt:  KKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHGVHLFKD

Query:  QCPKTPQEVEDM
        QCPKTPQEVEDM
Subjt:  QCPKTPQEVEDM

A0A5D3CWZ1 Gag/pol protein (e value: 1.6e-70, identity: 63.47%)
Query:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK---------------------------------------------NV
        M DVDKD+W+KAM+ EM+S+YFNSVWELVD P+GVKPIGCKWIYKRKR   GK                                             NV
Subjt:  MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGK---------------------------------------------NV

Query:  DEPCVYKKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHG
        DEPCVYKKI    VAFL+LYVDDILLI NDVG+LT++K WLA+QFQMKDLGEAQYVLG+QI++++KNKTLA+S+A+YIDKML RY MQNSKKGLLPFRHG
Subjt:  DEPCVYKKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHG

Query:  VHLFKDQCPKTPQEVEDMR
        VHL K+QCPKTPQEVEDMR
Subjt:  VHLFKDQCPKTPQEVEDMR

SwissProt top hits (e value, % identity, alignment)
P04146 Copia protein (e value: 7.2e-07, identity: 33.71%)
Query:  CVY---KKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQN
        C+Y   K  IN  + +++LYVDD+++   D+  + N K +L  +F+M DL E ++ +G++I + Q++K + +S+++Y+ K+LS++ M+N
Subjt:  CVY---KKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQN

P10978 Retrovirus-related Pol polyprotein from transposon TNT 1-94 (e value: 4.8e-19, identity: 39.34%)
Query:  KNVDEPCVY-KKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLP
        K   +PCVY K+        L+LYVDD+L++G D G +  +K  L+  F MKDLG AQ +LGM+IV+ + ++ L +S+  YI+++L R+ M+N+K    P
Subjt:  KNVDEPCVY-KKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLP

Query:  FRHGVHLFKDQCPKTPQEVEDM
            + L K  CP T +E  +M
Subjt:  FRHGVHLFKDQCPKTPQEVEDM

P92519 Uncharacterized mitochondrial protein AtMg00810 (e value: 3.9e-05, identity: 39.47%)
Query:  FLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSK
        +L+LYVDDILL G+    L  +   L+S F MKDLG   Y LG+QI  +     L +S+  Y +++L+   M + K
Subjt:  FLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSK

Q94HW2 Retrovirus-related Pol polyprotein from transposon RE1 (e value: 2.0e-04, identity: 33.33%)
Query:  TVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLP
        ++ ++++YVDDIL+ GND   L N  + L+ +F +KD  E  Y LG++    +    L +S+  YI  +L+R  M  +K    P
Subjt:  TVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLP

Arabidopsis top hits (e value, % identity, alignment)
AT4G23160.1 cysteine-rich RLK (RECEPTOR-like protein kinase) 8 (e value: 2.8e-06, identity: 45.45%)
Query:  WIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDG
        W  AM+ E+ +M     WE+   P   KPIGCKW+YK K   DG
Subjt:  WIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDG

ATMG00810.1 DNA/RNA polymerases superfamily protein (e value: 2.8e-06, identity: 39.47%)
Query:  FLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSK
        +L+LYVDDILL G+    L  +   L+S F MKDLG   Y LG+QI  +     L +S+  Y +++L+   M + K
Subjt:  FLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQYVLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSK

ATMG00820.1 Reverse transcriptase (RNA-dependent DNA polymerase) (e value: 9.0e-05, identity: 38.64%)
Query:  WIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDG
        W +AM +E+ ++  N  W LV  P     +GCKW++K K   DG
Subjt:  WIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDG


Sequences
CDS sequence
ATGGCTGACGTCGACAAAGACGAATGGATTAAAGCTATGAACCAGGAAATGCAGTCGATGTACTTCAATTCCGTCTGGGAGCTTGTAGACCAACCAGATGGGGTCAAACC
TATTGGTTGCAAGTGGATCTACAAGCGTAAACGTGGCGTAGATGGGAAGAATGTTGATGAGCCTTGTGTTTACAAGAAAATCATCAACTGCACTGTCGCATTTCTGATAT
TGTATGTGGATGATATCCTGCTCATTGGGAATGACGTAGGATTTCTTACTAACATTAAGGAATGGTTGGCTTCGCAATTCCAAATGAAAGATTTGGGAGAGGCCCAGTAT
GTTCTAGGTATGCAGATAGTCCAAAACCAGAAGAACAAAACACTAGCCATGTCTGAAGCATCTTACATTGACAAGATGTTGTCTCGGTATAAGATGCAAAACTCCAAGAA
GGGTTTGCTACCTTTCAGGCATGGGGTTCACTTGTTTAAGGATCAATGTCCTAAGACGCCTCAAGAGGTTGAGGATATGAGATGA
mRNA sequence
ATGGCTGACGTCGACAAAGACGAATGGATTAAAGCTATGAACCAGGAAATGCAGTCGATGTACTTCAATTCCGTCTGGGAGCTTGTAGACCAACCAGATGGGGTCAAACC
TATTGGTTGCAAGTGGATCTACAAGCGTAAACGTGGCGTAGATGGGAAGAATGTTGATGAGCCTTGTGTTTACAAGAAAATCATCAACTGCACTGTCGCATTTCTGATAT
TGTATGTGGATGATATCCTGCTCATTGGGAATGACGTAGGATTTCTTACTAACATTAAGGAATGGTTGGCTTCGCAATTCCAAATGAAAGATTTGGGAGAGGCCCAGTAT
GTTCTAGGTATGCAGATAGTCCAAAACCAGAAGAACAAAACACTAGCCATGTCTGAAGCATCTTACATTGACAAGATGTTGTCTCGGTATAAGATGCAAAACTCCAAGAA
GGGTTTGCTACCTTTCAGGCATGGGGTTCACTTGTTTAAGGATCAATGTCCTAAGACGCCTCAAGAGGTTGAGGATATGAGATGA
Protein sequence
MADVDKDEWIKAMNQEMQSMYFNSVWELVDQPDGVKPIGCKWIYKRKRGVDGKNVDEPCVYKKIINCTVAFLILYVDDILLIGNDVGFLTNIKEWLASQFQMKDLGEAQY
VLGMQIVQNQKNKTLAMSEASYIDKMLSRYKMQNSKKGLLPFRHGVHLFKDQCPKTPQEVEDMR