; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0007323 (gene) of Snake gourd v1 genome

Gene IDTan0007323
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase
Genome locationLG02:69767141..69767569
RNA-Seq ExpressionTan0007323
SyntenyTan0007323
Gene Ontology termsGO:0006508 - proteolysis (biological process)
GO:0003676 - nucleic acid binding (molecular function)
GO:0008233 - peptidase activity (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0026280.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]2.4e-2955.56Show/hide
Query:  QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKCFRVMRCPKDR
        +     +EA    G   S+ ESS P  E N+EEQ+  R+ QRL   +  +Q D EKKYGIERLKALGATTF GT +P DAE  L LIEKCF+V RC +DR
Subjt:  QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKCFRVMRCPKDR

Query:  KVDLAAFLLQKGADDWWEITESRKRDAWS--WKEF
        KV+LAAFLLQ GA+DWW + ESR+R      W EF
Subjt:  KVDLAAFLLQKGADDWWEITESRKRDAWS--WKEF

KAA0035225.1 DNA/RNA polymerases superfamily protein [Cucumis melo var. makuwa]1.4e-2956.12Show/hide
Query:  MARGGKRGR-QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKC
        M RG  R     +T      +  G   S+ ESS P  E NMEEQ+  R+ QRL   + S+Q D EKKYGIERLKALGATTF GT +PADAE  L LIEKC
Subjt:  MARGGKRGR-QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKC

Query:  FRVMRCPKDRKVDLAAFLLQKG----ADDWWEITESRKR
        FRV RCP+DRKV+LA+FLLQ G     +DWW + ESR+R
Subjt:  FRVMRCPKDRKVDLAAFLLQKG----ADDWWEITESRKR

XP_038877272.1 uncharacterized protein LOC120069556 [Benincasa hispida]2.0e-2852.9Show/hide
Query:  RGRQVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKCFRVMRCP
        R R+    T +ATS +  E  +GESSHPQ E   +EQ+  R  + LAE++G   VD +K + IERLKALGA+TFEGT +PADAE    ++EKCFRVM CP
Subjt:  RGRQVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKCFRVMRCP

Query:  KDRKVDLAAFLLQKGADDWWEITESRKR--DAWSWKEF
        +DRKV LA FLLQK A+DWW + + R+R  +  +W+EF
Subjt:  KDRKVDLAAFLLQKGADDWWEITESRKR--DAWSWKEF

XP_038887090.1 uncharacterized protein LOC120077268 [Benincasa hispida]1.5e-2855.83Show/hide
Query:  SEGESSHPQQEVN--MEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKCFRVMRCPKDRKVDLAAFLLQKGADD
        SEGES+ PQ   +  +++ +  +I QRLA SVGS + D+EKKYGIER KALGA TFEGTA+PA+AE+ L+++EKCF +M CP++RKV LA FLLQKGA+ 
Subjt:  SEGESSHPQQEVN--MEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKCFRVMRCPKDRKVDLAAFLLQKGADD

Query:  WWEITESRKR--DAWSWKEF
        WW++  +R+   +A  W EF
Subjt:  WWEITESRKR--DAWSWKEF

XP_038896416.1 uncharacterized protein LOC120084680 [Benincasa hispida]1.4e-2969.47Show/hide
Query:  MEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKCFRVMRCPKDRKVDLAAFLLQKGADDWWEITESRKR
        ME+++F RITQRLA SVGS Q D EKK+GIERLKALGATTF+GT DP DAE+ L LIEKCF+VMRCP+D KV+LA FLLQKG + WW++ E+  R
Subjt:  MEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKCFRVMRCPKDRKVDLAAFLLQKGADDWWEITESRKR

TrEMBL top hitse value%identityAlignment
A0A5A7SJ99 DNA/RNA polymerases superfamily protein1.1e-2955.56Show/hide
Query:  QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKCFRVMRCPKDR
        +     +EA    G   S+ ESS P  E N+EEQ+  R+ QRL   +  +Q D EKKYGIERLKALGATTF GT +P DAE  L LIEKCF+V RC +DR
Subjt:  QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKCFRVMRCPKDR

Query:  KVDLAAFLLQKGADDWWEITESRKRDAWS--WKEF
        KV+LAAFLLQ GA+DWW + ESR+R      W EF
Subjt:  KVDLAAFLLQKGADDWWEITESRKRDAWS--WKEF

A0A5A7T1M0 Reverse transcriptase1.1e-2754.07Show/hide
Query:  MARGGKRGR-QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKC
        M RG  R     E       +  G   S+ ESS P+ E N+EEQ+  R+ QRL   + S+Q D EKKYG ERLKALGATTF GT +P D E  L LIEKC
Subjt:  MARGGKRGR-QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKC

Query:  FRVMRCPKDRKVDLAAFLLQKGADDWWEITESRKR
        FRV R  +DRKV+LAAFLLQ  A+DWW + ESR+R
Subjt:  FRVMRCPKDRKVDLAAFLLQKGADDWWEITESRKR

A0A5A7U067 Retrotrans_gag domain-containing protein4.8e-2852.41Show/hide
Query:  MARGGKRGRQV-ETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKC
        M RG  R   V E       +  G E S+ ESS    EVN+EEQ+  R+ Q L   + S+Q +LEK + IERLKALGATTF GT + ADAE  L LIEKC
Subjt:  MARGGKRGRQV-ETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKC

Query:  FRVMRCPKDRKVDLAAFLLQKGADDWWEITESRKRDA--WSWKEF
        F+VMRC +DRKV+L  FLL+ G +DWW +TESR+R     SW EF
Subjt:  FRVMRCPKDRKVDLAAFLLQKGADDWWEITESRKRDA--WSWKEF

A0A5D3BB91 Reverse transcriptase1.3e-2853.1Show/hide
Query:  MARGGKRGR-QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKC
        M RG  R     E       +  G   S+ ESS P+ E N+EEQ+  R+ QRL   + S+Q D EKKYG ERLKALGATTF GT +P D E  L LIEKC
Subjt:  MARGGKRGR-QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKC

Query:  FRVMRCPKDRKVDLAAFLLQKGADDWWEITESRKRDA--WSWKEF
        FRV R  +DRKV+LAAFLLQ  A+DWW + ESR+R     SW EF
Subjt:  FRVMRCPKDRKVDLAAFLLQKGADDWWEITESRKRDA--WSWKEF

A0A5D3DES5 DNA/RNA polymerases superfamily protein6.7e-3056.12Show/hide
Query:  MARGGKRGR-QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKC
        M RG  R     +T      +  G   S+ ESS P  E NMEEQ+  R+ QRL   + S+Q D EKKYGIERLKALGATTF GT +PADAE  L LIEKC
Subjt:  MARGGKRGR-QVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKC

Query:  FRVMRCPKDRKVDLAAFLLQKG----ADDWWEITESRKR
        FRV RCP+DRKV+LA+FLLQ G     +DWW + ESR+R
Subjt:  FRVMRCPKDRKVDLAAFLLQKG----ADDWWEITESRKR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTCGAGGTGGCAAACGAGGCAGGCAAGTGGAGACTGGAACTCAAGAAGCTACTAGTGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGA
GGTGAACATGGAGGAACAAATCTTTACGAGGATAACTCAAAGATTAGCTGAAAGTGTTGGATCATCACAAGTAGATCTAGAAAAGAAGTATGGCATTGAAAGACTGAAAG
CCTTAGGTGCAACAACATTTGAAGGCACGGCAGATCCCGCTGATGCTGAGGTTTCGTTAAATCTGATTGAGAAGTGCTTTAGGGTCATGCGATGCCCTAAAGACAGGAAG
GTCGATTTAGCAGCATTCTTACTCCAGAAAGGGGCGGATGATTGGTGGGAGATAACAGAGAGCAGAAAACGGGATGCTTGGAGCTGGAAAGAGTTTTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCTCGAGGTGGCAAACGAGGCAGGCAAGTGGAGACTGGAACTCAAGAAGCTACTAGTGATAGAGGAAGAGAGGTATCAGAGGGAGAGTCTAGTCATCCTCAGCAAGA
GGTGAACATGGAGGAACAAATCTTTACGAGGATAACTCAAAGATTAGCTGAAAGTGTTGGATCATCACAAGTAGATCTAGAAAAGAAGTATGGCATTGAAAGACTGAAAG
CCTTAGGTGCAACAACATTTGAAGGCACGGCAGATCCCGCTGATGCTGAGGTTTCGTTAAATCTGATTGAGAAGTGCTTTAGGGTCATGCGATGCCCTAAAGACAGGAAG
GTCGATTTAGCAGCATTCTTACTCCAGAAAGGGGCGGATGATTGGTGGGAGATAACAGAGAGCAGAAAACGGGATGCTTGGAGCTGGAAAGAGTTTTGA
Protein sequenceShow/hide protein sequence
MARGGKRGRQVETGTQEATSDRGREVSEGESSHPQQEVNMEEQIFTRITQRLAESVGSSQVDLEKKYGIERLKALGATTFEGTADPADAEVSLNLIEKCFRVMRCPKDRK
VDLAAFLLQKGADDWWEITESRKRDAWSWKEF