; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016576 (gene) of Snake gourd v1 genome

Gene IDTan0016576
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationLG01:109518134..109522513
RNA-Seq ExpressionTan0016576
SyntenyTan0016576
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAF7140294.1 hypothetical protein RHSIM_Rhsim06G0103400 [Rhododendron simsii]1.7e-3031.73Show/hide
Query:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDIL
        DFNE++  +EKWGG+  S S++++F   +   ++ D+ +KG  Y W  +R     ++ER+DR L   ++  +FP+  +F    + SDH PL ++      
Subjt:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDIL

Query:  KHRKRKSQIFRFEEIWSLQDDCIDIIKDG-----------------RAKEK------------------LEDLLNEEEVYWRQRARVGWLKWGDRNSKWF
          RK     F+FE +W+   +C+  IKD                  +A +K                  +E ++  EE+Y  QR+RV WL +GDRNSK+F
Subjt:  KHRKRKSQIFRFEEIWSLQDDCIDIIKDG-----------------RAKEK------------------LEDLLNEEEVYWRQRARVGWLKWGDRNSKWF

Query:  HKCASHRRKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQN
        +     RR+ N I R+K+  G+W  +ED I+ EIH YF +LF ++G ++
Subjt:  HKCASHRRKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQN

XP_023918603.1 uncharacterized protein LOC112030149 [Quercus suber]4.8e-3034.89Show/hide
Query:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINL-GEDI
        DFNE+T+  EK GG +   SQM +FR+ +      D+G+ G  + W++  D   +  ERLDR + TN+++ LFP   + HL+   SDH PL I   G D 
Subjt:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINL-GEDI

Query:  LKHRKRKSQIFRFEEIWSLQDDCIDIIKD--------------------GRAKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFHKCASHRRKSNPIN
         + R      FRFE++W     C   I+                      + ++++ +LL++E   WRQRA+V WLK GDRN+K+FH  AS RR+ N I 
Subjt:  LKHRKRKSQIFRFEEIWSLQDDCIDIIKD--------------------GRAKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFHKCASHRRKSNPIN

Query:  RIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQN
         +   DG W +N  ++   +  ++Q+LFTS    N
Subjt:  RIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQN

XP_024177965.2 uncharacterized protein LOC112183883 [Rosa chinensis]2.2e-3032.4Show/hide
Query:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDIL
        D NE+    EK GGV   + QM  FR+ +   ++FD+G+ G P+ W     R   M+ RLDR + +  + D+F    + HL  ++ DH P+ + +   I 
Subjt:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDIL

Query:  KHRKRKSQIFRFEEIWSLQDDCIDIIKDGRAKE----------------KLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFHKCASHRRKSNPINRIKSK
            R+   FRFE  W+L + C +++K G   E                KLE LL EE+VYW+QR++V WL  GDRN+K+FH+ AS+RR  N +  +   
Subjt:  KHRKRKSQIFRFEEIWSLQDDCIDIIKDGRAKE----------------KLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFHKCASHRRKSNPINRIKSK

Query:  DGIWISNEDQISHEIHSYFQSLFTSNGKQNNLVLYNFLKKVNPCLSPEQD
        +G+W      +   +  YF S+F SN   N++ + N +  + P ++ + +
Subjt:  DGIWISNEDQISHEIHSYFQSLFTSNGKQNNLVLYNFLKKVNPCLSPEQD

XP_030497597.1 uncharacterized protein LOC115713255 [Cannabis sativa]5.7e-3131.97Show/hide
Query:  SLVGENMLLQGNLDIDLGENNKPPRSKRKKWRRINTEKELLLAGPLDSCKGKRPLEEDFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYP
        S+ GEN  +Q  L    GE N+  R  R  W +I   +EL     L  C     +  D N +T   +K GG  +    +D F + +    + DM   GYP
Subjt:  SLVGENMLLQGNLDIDLGENNKPPRSKRKKWRRINTEKELLLAGPLDSCKGKRPLEEDFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYP

Query:  YNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDILKHRKRKSQIFRFEEIWSLQDDCIDIIKDGR-------------
        Y W + R  +  ++ R+DR L +  + + FP   + +LE   SDH P+++    +++K      Q FRFE +W  +  C+ ++KD R             
Subjt:  YNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDILKHRKRKSQIFRFEEIWSLQDDCIDIIKDGR-------------

Query:  -------AKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFHKCASHRRKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQNNLV
               A+ +L+++L ++EV+WRQR++  WLK GD+NSK+FH  A+ R+++N I R+K+ +GIWI  E+ +++ +  YF +LFTS+    N V
Subjt:  -------AKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFHKCASHRRKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQNNLV

XP_042950258.1 uncharacterized protein LOC122282367 [Carya illinoinensis]2.8e-3032.51Show/hide
Query:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDIL
        DFNE+   +EK GGV     QM+ FR  +    + D+G+ G  + W   R+ +  +KERLDR L    + +LF    +  L   NSDH PL ++      
Subjt:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDIL

Query:  KHRKRKSQIFRFEEIWSLQDDCIDIIK----------------------------------------------DGRAKEKLED---LLNEEEVYWRQRAR
        +H  +K ++FR+E  WS + +  DI+                                               +G+ KE  E+   LL EEE+ WRQRA+
Subjt:  KHRKRKSQIFRFEEIWSLQDDCIDIIK----------------------------------------------DGRAKEKLED---LLNEEEVYWRQRAR

Query:  VGWLKWGDRNSKWFHKCASHRRKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQNNLVLYNFLKKVNPCLSPEQD
          WLK GDRNSK+FHKCA+HRRK N I RI  + G+  + +  +S    SY+Q LFTS+   ++  +   + KV   ++PE +
Subjt:  VGWLKWGDRNSKWFHKCASHRRKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQNNLVLYNFLKKVNPCLSPEQD

TrEMBL top hitse value%identityAlignment
A0A2N9EX83 Reverse transcriptase domain-containing protein2.7e-3437.55Show/hide
Query:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINL---GE
        DFNE+ + EEKWG V+  + QM  FR  +      D+G+ G PY W+  +     + ERLDR L T D+   FP   + HL+ + SDH PL + L   G 
Subjt:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINL---GE

Query:  DILKHRKRKSQIFRFEEIWSLQDDCIDIIKDG------------RAKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFHKCASHRRKSNPINRIKSKD
        ++   RKR    FRFEE+W+L   C D IK              +   +L DL  +EE  W+QR+R  WL+ GDRN+K+FH  A++R++ N I+ I+ + 
Subjt:  DILKHRKRKSQIFRFEEIWSLQDDCIDIIKDG------------RAKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFHKCASHRRKSNPINRIKSKD

Query:  GIWISNEDQISHEIHSYFQSLFTSNGKQN
        G+W S  +++ H I  Y++ LFT++   N
Subjt:  GIWISNEDQISHEIHSYFQSLFTSNGKQN

A0A2N9H936 Uncharacterized protein3.6e-3134.71Show/hide
Query:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDIL
        DFNEI + EEKWG V+   SQM +FR  +      D+G+ G PY W+  +     + ERLDR L T D+   FP   + HL  + SDH PL + L     
Subjt:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDIL

Query:  KHRKRKSQIFRFEEIWSLQDDCIDIIKDG----------------------------RAKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFHKCASHR
        K R R+ + FRFEE+W++   C D IK                              +   +L DL  +EE  W+QR+R  WL+ GD+N+K+FH  A++R
Subjt:  KHRKRKSQIFRFEEIWSLQDDCIDIIKDG----------------------------RAKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFHKCASHR

Query:  RKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQN
        ++ N I+ I+ + G+W S +  +   I  Y++ LFT++   N
Subjt:  RKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQN

B9FU75 Reverse transcriptase domain-containing protein4.0e-3032.47Show/hide
Query:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNW-YQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDI
        DFNEI    EK GG    Q+QMD FR+T+    + D+G++G  + W   S   +  ++E LDR +   ++   FP   + + +  +SDH P+ I      
Subjt:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNW-YQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDI

Query:  LKHRKRKSQIFRFEEIWSLQDDCID---------------------------------IIKDGRAKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFH
            +R S  FRFE  W  ++ C +                                 ++K+G  K KLE L  + + YWRQRA V WL+ GDRN+ +FH
Subjt:  LKHRKRKSQIFRFEEIWSLQDDCID---------------------------------IIKDGRAKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFH

Query:  KCASHRRKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQNNLVLYNFLKK-----VNPCLSPE
           S R++SN I R+K +DG W+ +E++    I  YF+++F SNG Q+   L + +KK     +N CL  E
Subjt:  KCASHRRKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQNNLVLYNFLKK-----VNPCLSPE

M5XQU7 Uncharacterized protein1.8e-3034.8Show/hide
Query:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDIL
        DFNE+    EK GG+     QM +FR+ I   ++ DMG++G  + W+ +R+  I  KERLDR L   ++  LFP  ++ HLE  +SDH P+ +     + 
Subjt:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDIL

Query:  KHRKRKSQIFRFEEIWSLQDDCIDIIKDG-----------------RAKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFHKCASHRRKSNPINRIKS
          R+R    FRFE +W+  +DC  II +                      +L+ LL+ EE +W+QR++V WLK GDRN+++FH+ AS+R++ N +  ++ 
Subjt:  KHRKRKSQIFRFEEIWSLQDDCIDIIKDG-----------------RAKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFHKCASHRRKSNPINRIKS

Query:  KDGIWISNEDQISHEIHSYFQSLFTSN
          G W  +E  + + +  YF  LFTS+
Subjt:  KDGIWISNEDQISHEIHSYFQSLFTSN

Q0D4Q0 Os07g0613900 protein4.0e-3032.47Show/hide
Query:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNW-YQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDI
        DFNEI    EK GG    Q+QMD FR+T+    + D+G++G  + W   S   +  ++E LDR +   ++   FP   + + +  +SDH P+ I      
Subjt:  DFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMFDMGYKGYPYNW-YQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDI

Query:  LKHRKRKSQIFRFEEIWSLQDDCID---------------------------------IIKDGRAKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFH
            +R S  FRFE  W  ++ C +                                 ++K+G  K KLE L  + + YWRQRA V WL+ GDRN+ +FH
Subjt:  LKHRKRKSQIFRFEEIWSLQDDCID---------------------------------IIKDGRAKEKLEDLLNEEEVYWRQRARVGWLKWGDRNSKWFH

Query:  KCASHRRKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQNNLVLYNFLKK-----VNPCLSPE
           S R++SN I R+K +DG W+ +E++    I  YF+++F SNG Q+   L + +KK     +N CL  E
Subjt:  KCASHRRKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQNNLVLYNFLKK-----VNPCLSPE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein9.8e-0530.88Show/hide
Query:  EVYWRQRARVGWLKWGDRNSKWFHKCASHRRKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSN
        E ++RQ++R+ WL+ GD N+++FHK     +  N I  ++  D + + N  Q+   I +Y+  L  S+
Subjt:  EVYWRQRARVGWLKWGDRNSKWFHKCASHRRKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAACTAACTGCTGAAGAAGAGAAACCTGCTGATTTACGTCCCTTTGCCCCGAAACAGAAACTGAGGGGAAAGGATTTTGAATTCTGTTTGATAGTGAAGAGTAGCTC
GAATAATGGTGATGGTAAGGAATTTGGAAGGAAGAAGATTGTGGAGAATGAATCTGTTCTCCCAATGAACGATGAAACTATTATTGAAAAGAGTAAATTTTCTAGTTCTT
GTAAGGATGATGGGAGGAAAGGGGTTGGTTTTTCGACATTAGAAGAAGTTGATACCTTTCCTGTTGTGGGGAACGTAGAGTCTATTGAGCCTGTGAGTATGGAAGGAATT
ATGGTAGATATGGGGGAAGAAAGGAAAGCTCTTTCAAAGGGATTTGACCTGAATTCCTTGGTAGGAGAAAATATGTTGTTGCAGGGGAATTTGGACATTGATCTAGGGGA
AAACAACAAACCTCCTCGTTCCAAAAGAAAGAAGTGGCGAAGAATCAATACTGAAAAAGAACTCTTACTGGCAGGGCCTCTAGATTCTTGTAAAGGTAAGAGACCGCTTG
AAGAAGATTTTAATGAAATTACTGAAGGAGAGGAAAAATGGGGAGGAGTCTCACATTCTCAATCTCAGATGGATAGTTTCAGGAATACAATCCATCACTACAATATGTTT
GACATGGGGTATAAGGGCTATCCTTATAATTGGTATCAATCTAGGGATAGACAAATCATTATGAAAGAAAGACTTGATCGAGACCTTTGTACTAATGACTTCTATGATCT
TTTTCCTTTTGTTTCAATCTTCCATTTAGAATTTATGAATTCTGATCACGACCCTTTGGAGATTAATCTTGGAGAGGATATTTTAAAGCATAGAAAAAGGAAGTCCCAGA
TTTTTCGCTTTGAGGAAATTTGGTCTCTTCAAGATGATTGTATTGATATTATTAAAGATGGGAGAGCCAAGGAGAAATTGGAGGATTTACTTAATGAAGAGGAGGTGTAT
TGGAGACAGAGAGCTAGGGTCGGCTGGCTGAAATGGGGAGATAGAAATTCCAAATGGTTCCATAAATGTGCTTCTCACAGAAGAAAGAGTAACCCGATTAATAGAATAAA
GAGTAAAGATGGCATTTGGATTTCTAATGAAGACCAAATTAGCCATGAGATACATTCATACTTTCAATCATTGTTTACCTCCAATGGTAAACAAAACAATCTTGTTCTTT
ACAATTTCTTGAAGAAAGTTAATCCTTGTCTATCACCAGAGCAAGATCTTTAG
mRNA sequenceShow/hide mRNA sequence
ATGAAACTAACTGCTGAAGAAGAGAAACCTGCTGATTTACGTCCCTTTGCCCCGAAACAGAAACTGAGGGGAAAGGATTTTGAATTCTGTTTGATAGTGAAGAGTAGCTC
GAATAATGGTGATGGTAAGGAATTTGGAAGGAAGAAGATTGTGGAGAATGAATCTGTTCTCCCAATGAACGATGAAACTATTATTGAAAAGAGTAAATTTTCTAGTTCTT
GTAAGGATGATGGGAGGAAAGGGGTTGGTTTTTCGACATTAGAAGAAGTTGATACCTTTCCTGTTGTGGGGAACGTAGAGTCTATTGAGCCTGTGAGTATGGAAGGAATT
ATGGTAGATATGGGGGAAGAAAGGAAAGCTCTTTCAAAGGGATTTGACCTGAATTCCTTGGTAGGAGAAAATATGTTGTTGCAGGGGAATTTGGACATTGATCTAGGGGA
AAACAACAAACCTCCTCGTTCCAAAAGAAAGAAGTGGCGAAGAATCAATACTGAAAAAGAACTCTTACTGGCAGGGCCTCTAGATTCTTGTAAAGGTAAGAGACCGCTTG
AAGAAGATTTTAATGAAATTACTGAAGGAGAGGAAAAATGGGGAGGAGTCTCACATTCTCAATCTCAGATGGATAGTTTCAGGAATACAATCCATCACTACAATATGTTT
GACATGGGGTATAAGGGCTATCCTTATAATTGGTATCAATCTAGGGATAGACAAATCATTATGAAAGAAAGACTTGATCGAGACCTTTGTACTAATGACTTCTATGATCT
TTTTCCTTTTGTTTCAATCTTCCATTTAGAATTTATGAATTCTGATCACGACCCTTTGGAGATTAATCTTGGAGAGGATATTTTAAAGCATAGAAAAAGGAAGTCCCAGA
TTTTTCGCTTTGAGGAAATTTGGTCTCTTCAAGATGATTGTATTGATATTATTAAAGATGGGAGAGCCAAGGAGAAATTGGAGGATTTACTTAATGAAGAGGAGGTGTAT
TGGAGACAGAGAGCTAGGGTCGGCTGGCTGAAATGGGGAGATAGAAATTCCAAATGGTTCCATAAATGTGCTTCTCACAGAAGAAAGAGTAACCCGATTAATAGAATAAA
GAGTAAAGATGGCATTTGGATTTCTAATGAAGACCAAATTAGCCATGAGATACATTCATACTTTCAATCATTGTTTACCTCCAATGGTAAACAAAACAATCTTGTTCTTT
ACAATTTCTTGAAGAAAGTTAATCCTTGTCTATCACCAGAGCAAGATCTTTAG
Protein sequenceShow/hide protein sequence
MKLTAEEEKPADLRPFAPKQKLRGKDFEFCLIVKSSSNNGDGKEFGRKKIVENESVLPMNDETIIEKSKFSSSCKDDGRKGVGFSTLEEVDTFPVVGNVESIEPVSMEGI
MVDMGEERKALSKGFDLNSLVGENMLLQGNLDIDLGENNKPPRSKRKKWRRINTEKELLLAGPLDSCKGKRPLEEDFNEITEGEEKWGGVSHSQSQMDSFRNTIHHYNMF
DMGYKGYPYNWYQSRDRQIIMKERLDRDLCTNDFYDLFPFVSIFHLEFMNSDHDPLEINLGEDILKHRKRKSQIFRFEEIWSLQDDCIDIIKDGRAKEKLEDLLNEEEVY
WRQRARVGWLKWGDRNSKWFHKCASHRRKSNPINRIKSKDGIWISNEDQISHEIHSYFQSLFTSNGKQNNLVLYNFLKKVNPCLSPEQDL