; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0019121 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0019121
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr5:38843936..38850069
RNA-Seq ExpressionLag0019121
SyntenyLag0019121
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR002156 - Ribonuclease H domain
IPR026960 - Reverse transcriptase zinc-binding domain


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004136915.2 uncharacterized protein LOC101209598 [Cucumis sativus]2.1e-9382.73Show/hide
Query:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVG
        MEEV+F HESKEKIF+IF EFMA VAKLDELGTLGSQLLSG++QGLELLRRP+I+ TSKLI+NVIE +NTE LRSYIEAGCI THDG QSTKKLHTCRVG
Subjt:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVG

Query:  LDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSL
        LDDHLKKARSLI+ELERL +D NI LE E+PLC+ST+SDEDLELDE EATV +KKPDA++YA+LMGI+KVM+KKNH+MQEKIISGLSLKSSSGELETY L
Subjt:  LDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSL

Query:  MWSLQPYIDDEIMGQAWKLI
        MWSLQPYIDDEIM  AWKL+
Subjt:  MWSLQPYIDDEIMGQAWKLI

XP_008455091.1 PREDICTED: uncharacterized protein LOC103495347 isoform X3 [Cucumis melo]3.9e-9282.27Show/hide
Query:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVG
        MEEVKF HESKEKIF+IF EFMA VAKLDELGT GSQLLSG++QGLELLRRP+I+ TSKLI+NVIET+NTE LR+YIEAGCI THDG QSTKKLHTCRVG
Subjt:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVG

Query:  LDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSL
        LDDHLKKARSLI+ELERL +D NI LE E+PLC+ST+SDEDLELDE EATV +KK DA++YALLMGI+KVM+KKNH+MQEKIISGLSLKSSSGELETY L
Subjt:  LDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSL

Query:  MWSLQPYIDDEIMGQAWKLI
        MWSLQPYIDD IM  AWKL+
Subjt:  MWSLQPYIDDEIMGQAWKLI

XP_016901720.1 PREDICTED: uncharacterized protein LOC103495347 isoform X2 [Cucumis melo]3.7e-9080.09Show/hide
Query:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGL------ELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKL
        MEEVKF HESKEKIF+IF EFMA VAKLDELGT GSQLLSG++QGL      ELLRRP+I+ TSKLI+NVIET+NTE LR+YIEAGCI THDG QSTKKL
Subjt:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGL------ELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKL

Query:  HTCRVGLDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGE
        HTCRVGLDDHLKKARSLI+ELERL +D NI LE E+PLC+ST+SDEDLELDE EATV +KK DA++YALLMGI+KVM+KKNH+MQEKIISGLSLKSSSGE
Subjt:  HTCRVGLDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGE

Query:  LETYSLMWSLQPYIDDEIMGQAWKLI
        LETY LMWSLQPYIDD IM  AWKL+
Subjt:  LETYSLMWSLQPYIDDEIMGQAWKLI

XP_038887975.1 uncharacterized protein LOC120077932 isoform X1 [Benincasa hispida]1.0e-9585.91Show/hide
Query:  MEEVKFH-ESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVG
        MEEVKF  +SKEKIF+IF EFMA VAKL+ELGTLGSQLLSG+QQGLELLRRPAI+RTSKLI+NVIETNNTE LRSYIEAGCI THD VQSTKKLH+CRVG
Subjt:  MEEVKFH-ESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVG

Query:  LDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSL
        LDDHLKKARS+I++LERLL+D NIALE E+P CSSTVSDEDLE +E EATV NKKPDA+EYALLMGIIKVMVKKNH MQEKIISGLSLKSSSGELETY L
Subjt:  LDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSL

Query:  MWSLQPYIDDEIMGQAWKLI
        MWSLQPYIDDEIM QAWKL+
Subjt:  MWSLQPYIDDEIMGQAWKLI

XP_038887977.1 uncharacterized protein LOC120077932 isoform X2 [Benincasa hispida]5.9e-8886.93Show/hide
Query:  MACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVGLDDHLKKARSLINELERLLDD
        MA VAKL+ELGTLGSQLLSG+QQGLELLRRPAI+RTSKLI+NVIETNNTE LRSYIEAGCI THD VQSTKKLH+CRVGLDDHLKKARS+I++LERLL+D
Subjt:  MACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVGLDDHLKKARSLINELERLLDD

Query:  ANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSLMWSLQPYIDDEIMGQAWKLI
         NIALE E+P CSSTVSDEDLE +E EATV NKKPDA+EYALLMGIIKVMVKKNH MQEKIISGLSLKSSSGELETY LMWSLQPYIDDEIM QAWKL+
Subjt:  ANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSLMWSLQPYIDDEIMGQAWKLI

TrEMBL top hitse value%identityAlignment
A0A0A0K2B6 Uncharacterized protein1.0e-9382.73Show/hide
Query:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVG
        MEEV+F HESKEKIF+IF EFMA VAKLDELGTLGSQLLSG++QGLELLRRP+I+ TSKLI+NVIE +NTE LRSYIEAGCI THDG QSTKKLHTCRVG
Subjt:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVG

Query:  LDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSL
        LDDHLKKARSLI+ELERL +D NI LE E+PLC+ST+SDEDLELDE EATV +KKPDA++YA+LMGI+KVM+KKNH+MQEKIISGLSLKSSSGELETY L
Subjt:  LDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSL

Query:  MWSLQPYIDDEIMGQAWKLI
        MWSLQPYIDDEIM  AWKL+
Subjt:  MWSLQPYIDDEIMGQAWKLI

A0A1S3C033 uncharacterized protein LOC103495347 isoform X16.4e-8872.98Show/hide
Query:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGL----------------------------ELLRRPAIDRTSKLIKNVIETNNTEI
        MEEVKF HESKEKIF+IF EFMA VAKLDELGT GSQLLSG++QGL                            ELLRRP+I+ TSKLI+NVIET+NTE 
Subjt:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGL----------------------------ELLRRPAIDRTSKLIKNVIETNNTEI

Query:  LRSYIEAGCIKTHDGVQSTKKLHTCRVGLDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMV
        LR+YIEAGCI THDG QSTKKLHTCRVGLDDHLKKARSLI+ELERL +D NI LE E+PLC+ST+SDEDLELDE EATV +KK DA++YALLMGI+KVM+
Subjt:  LRSYIEAGCIKTHDGVQSTKKLHTCRVGLDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMV

Query:  KKNHLMQEKIISGLSLKSSSGELETYSLMWSLQPYIDDEIMGQAWKLI
        KKNH+MQEKIISGLSLKSSSGELETY LMWSLQPYIDD IM  AWKL+
Subjt:  KKNHLMQEKIISGLSLKSSSGELETYSLMWSLQPYIDDEIMGQAWKLI

A0A1S3C0U3 uncharacterized protein LOC103495347 isoform X31.9e-9282.27Show/hide
Query:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVG
        MEEVKF HESKEKIF+IF EFMA VAKLDELGT GSQLLSG++QGLELLRRP+I+ TSKLI+NVIET+NTE LR+YIEAGCI THDG QSTKKLHTCRVG
Subjt:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVG

Query:  LDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSL
        LDDHLKKARSLI+ELERL +D NI LE E+PLC+ST+SDEDLELDE EATV +KK DA++YALLMGI+KVM+KKNH+MQEKIISGLSLKSSSGELETY L
Subjt:  LDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSL

Query:  MWSLQPYIDDEIMGQAWKLI
        MWSLQPYIDD IM  AWKL+
Subjt:  MWSLQPYIDDEIMGQAWKLI

A0A1S4E157 uncharacterized protein LOC103495347 isoform X21.8e-9080.09Show/hide
Query:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGL------ELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKL
        MEEVKF HESKEKIF+IF EFMA VAKLDELGT GSQLLSG++QGL      ELLRRP+I+ TSKLI+NVIET+NTE LR+YIEAGCI THDG QSTKKL
Subjt:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGL------ELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKL

Query:  HTCRVGLDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGE
        HTCRVGLDDHLKKARSLI+ELERL +D NI LE E+PLC+ST+SDEDLELDE EATV +KK DA++YALLMGI+KVM+KKNH+MQEKIISGLSLKSSSGE
Subjt:  HTCRVGLDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGE

Query:  LETYSLMWSLQPYIDDEIMGQAWKLI
        LETY LMWSLQPYIDD IM  AWKL+
Subjt:  LETYSLMWSLQPYIDDEIMGQAWKLI

A0A6J1D9B8 uncharacterized protein LOC1110187947.0e-8778.64Show/hide
Query:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVG
        MEEV F H SKEKIFKIF EFMA VAKL+ELGTLGS+ LSG QQGLELLRRPAI+R+SKLI++VIETNNTE L+SY EAGCI THDGVQSTKKLHTCR+G
Subjt:  MEEVKF-HESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVG

Query:  LDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSL
        LDDHLKKARSLI+ELE LL++ANIALE E+   +STVSDED+ELDE EATV ++K DA+EYAL MGIIK MVKK+++MQEKIISGLSLKSSSGELETY +
Subjt:  LDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSL

Query:  MWSLQPYIDDEIMGQAWKLI
        MWSL+PYIDDEI+ +AWKL+
Subjt:  MWSLQPYIDDEIMGQAWKLI

SwissProt top hitse value%identityAlignment
P0C2F6 Putative ribonuclease H protein At1g657508.4e-2926.67Show/hide
Query:  MPVQVGRNKGIMFKRIKERVEKTLQGWKEKLLSLAGKEAL------------------------------RQLLVGRNKGKKMSHWMSWRNMCKRKKSGY
        MPV   R     F  I ERV   + GW+EK LS AG+  L                              R  L G    KK  H + W  +C  KK G 
Subjt:  MPVQVGRNKGIMFKRIKERVEKTLQGWKEKLLSLAGKEAL------------------------------RQLLVGRNKGKKMSHWMSWRNMCKRKKSGY

Query:  LGFRQISALNQAMLAKQSWRIARNPNNILYKILRGRY----LKDGNFLEAPIGNAPSLTWRSICWG-RDLFQKGFRWRMGSGKMISIDRDPWISRKGNAR
        LG R   ++N+A+++K  WR+ +  N++   +L+ +Y    ++D  +L  P G+  S TWRSI  G RD+   G  W  G G+ I    D W+S K    
Subjt:  LGFRQISALNQAMLAKQSWRIARNPNNILYKILRGRY----LKDGNFLEAPIGNAPSLTWRSICWG-RDLFQKGFRWRMGSGKMISIDRDPWISRKGNAR

Query:  PTLIQDNLKGFCVAHMLDEKNQW----NEDMVRLDFF-----RMDAED-ILNTPTGSKESKDEIIWHPDKKGLFSVKSVYHLAMDLHEMNSASQSDSKIT
        P L  DN +       +  K+ W      D  ++D +     R++    +L+  TG   ++D + W   + G FSV+S Y       EM +  +      
Subjt:  PTLIQDNLKGFCVAHMLDEKNQW----NEDMVRLDFF-----RMDAED-ILNTPTGSKESKDEIIWHPDKKGLFSVKSVYHLAMDLHEMNSASQSDSKIT

Query:  TSVWKKLWKLNILHRSKTCTWKVIQDIIPTKANVLKKGVDLNPMCLFCKKKLETTTHLIWECKIPIEGWETFIPTTHKLFDLCRLSWDPKDYWWWLEENL
         S +  LWK+ +  R KT  W V    + T+    ++ +  + +C  CK  +E+  H++ +C   +  W   +P         +  +  K  + WL +NL
Subjt:  TSVWKKLWKLNILHRSKTCTWKVIQDIIPTKANVLKKGVDLNPMCLFCKKKLETTTHLIWECKIPIEGWETFIPTTHKLFDLCRLSWDPKDYWWWLEENL

Query:  ----NTEELARS---AIVIW
              E++  S   A++IW
Subjt:  ----NTEELARS---AIVIW

P93295 Uncharacterized mitochondrial protein AtMg003105.1e-1836.92Show/hide
Query:  KLLSLAGKEALRQLLVGRNKGKKMSHWMSWRNMCKRKK-SGYLGFRQISALNQAMLAKQSWRIARNPNNILYKILRGRYLKDGNFLEAPIGNAPSLTWRS
        KLL      A+ +      + K+   W++W+ +CK K+  G LGFR +   NQA+LAKQS+RI   P+ +L ++LR RY    + +E  +G  PS  WRS
Subjt:  KLLSLAGKEALRQLLVGRNKGKKMSHWMSWRNMCKRKK-SGYLGFRQISALNQAMLAKQSWRIARNPNNILYKILRGRYLKDGNFLEAPIGNAPSLTWRS

Query:  ICWGRDLFQKGFRWRMGSGKMISIDRDPWI
        I  GR+L  +G    +G G    +  D WI
Subjt:  ICWGRDLFQKGFRWRMGSGKMISIDRDPWI

Arabidopsis top hitse value%identityAlignment
AT3G26855.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.5e-0430.36Show/hide
Query:  LWKLNILHRSKTCTWKVIQDIIPTKANVLKKGVDLNPMCLFCKKKLETTTHLIWEC
        +W L I  + K   WK + + +P  A +L + + + P C  C +  ET TH+++ C
Subjt:  LWKLNILHRSKTCTWKVIQDIIPTKANVLKKGVDLNPMCLFCKKKLETTTHLIWEC

AT3G49645.1 unknown protein1.6e-5150.93Show/hide
Query:  ESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVGLDDHLKKA
        E K+KI +IF +FM  + +L+ELG   +  L   QQGL  L+RP I  +SKLI+N+I+ N T  L+SYIEAGCI  HD  QST+ LHT   GL DHL KA
Subjt:  ESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLELLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVGLDDHLKKA

Query:  RSLINELERLLDDANIALENEDPL-CSSTVSDEDLELDEREATV-ANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSLMWSLQP
        ++L+ ELERL D+A +A+E+   L   S+   + +  DE   TV   + P+ +EYA L+ +I  M+K+N++MQ+KI+  LSLKSSSGELETYSLMWSL+P
Subjt:  RSLINELERLLDDANIALENEDPL-CSSTVSDEDLELDEREATV-ANKKPDASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSLMWSLQP

Query:  YIDDEIMGQAWKLI
        +++DEI+ +AWK I
Subjt:  YIDDEIMGQAWKLI

AT4G29090.1 Ribonuclease H-like superfamily protein3.5e-3026.37Show/hide
Query:  RNKGK-KMSHWMSWRNMCKRKKSGYLGFRQISALNQAMLAKQSWRIARNPNNILYKILRGRYLKDGNFLEAPIGNAPSLTWRSICWGRDLFQKGFRWRMG
        RNK + K  HW +W ++   K  G +GF+ I A N A+L KQ WR+   P +++ K+ + RY    + L AP+G+ PS  W+SI   +++ ++G R  +G
Subjt:  RNKGK-KMSHWMSWRNMCKRKKSGYLGFRQISALNQAMLAKQSWRIARNPNNILYKILRGRYLKDGNFLEAPIGNAPSLTWRSICWGRDLFQKGFRWRMG

Query:  SGKMISIDRDPWISRKGNARPTLIQ-----------DNLKGFCVAHMLDEK-NQWNEDMVRLDFFRMDAEDILNTPTGSKESKDEIIWHPDKKGLFSVKS
        +G+ I I R  W+  K  +    +Q             LK   V+ ++DE   +W +D++ + F  ++ + I     G +   D   W     G ++VKS
Subjt:  SGKMISIDRDPWISRKGNARPTLIQ-----------DNLKGFCVAHMLDEK-NQWNEDMVRLDFFRMDAEDILNTPTGSKESKDEIIWHPDKKGLFSVKS

Query:  VYHLAMDLHEMNSASQSDSKITTS-VWKKLWKLNILHRSKTCTWKVIQDIIPTKANVLKKGVDLNPMCLFCKKKLETTTHLIWECKIPIEGW
         Y +   +    S+ Q  S+ + + +++K+WK     + +   WK + + +P    +  + +     C+ C    ET  HL+++C      W
Subjt:  VYHLAMDLHEMNSASQSDSKITTS-VWKKLWKLNILHRSKTCTWKVIQDIIPTKANVLKKGVDLNPMCLFCKKKLETTTHLIWECKIPIEGW

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein3.6e-1936.92Show/hide
Query:  KLLSLAGKEALRQLLVGRNKGKKMSHWMSWRNMCKRKK-SGYLGFRQISALNQAMLAKQSWRIARNPNNILYKILRGRYLKDGNFLEAPIGNAPSLTWRS
        KLL      A+ +      + K+   W++W+ +CK K+  G LGFR +   NQA+LAKQS+RI   P+ +L ++LR RY    + +E  +G  PS  WRS
Subjt:  KLLSLAGKEALRQLLVGRNKGKKMSHWMSWRNMCKRKK-SGYLGFRQISALNQAMLAKQSWRIARNPNNILYKILRGRYLKDGNFLEAPIGNAPSLTWRS

Query:  ICWGRDLFQKGFRWRMGSGKMISIDRDPWI
        I  GR+L  +G    +G G    +  D WI
Subjt:  ICWGRDLFQKGFRWRMGSGKMISIDRDPWI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCAAGCCGTCTAGGGGTTTGAGGCAAAGAGACCCCCTCTCCCCTACCTCTTCCTCCTATGTGCTGAAGGCTTCTCGACGATACGACAGTCTTATCTTCATCAAAGC
TCAGAGGAGGGACCTAAACTCTCTCAGGAGAACCTTGAGGATCTATGAAGAAGCGTCGGGTCAAACCATTAACATGGACAAATCTATGTTTATGACTAGCAAGATCATGA
GTGAAGAGGCTGCACGAAGTTGTGAAAGGATTTTGGGCATTAAGAAAAGTCACTGTCTTGGCCAGTATCTAAGGATGCCGGTGCAAGTAGGAAGAAACAAAGGGATAATG
TTCAAAAGAATTAAAGAGAGAGTGGAGAAGACTCTTCAGGGATGGAAAGAGAAACTCCTCTCACTGGCAGGGAAAGAGGCTTTGCGCCAACTTTTGGTGGGGAGAAACAA
AGGAAAGAAGATGAGTCATTGGATGAGTTGGAGGAACATGTGCAAGAGGAAAAAGTCGGGATACTTGGGATTTAGGCAAATCTCCGCTTTAAACCAAGCAATGCTGGCCA
AGCAAAGTTGGAGGATAGCCCGAAACCCGAATAACATCCTTTATAAAATATTGAGAGGTAGATACTTAAAAGATGGAAACTTCTTGGAAGCCCCAATAGGAAATGCTCCC
TCTCTGACTTGGAGAAGCATCTGTTGGGGTCGTGACTTGTTCCAAAAAGGTTTTCGGTGGAGAATGGGTAGCGGTAAAATGATTAGCATCGATAGAGATCCGTGGATTAG
TAGGAAAGGAAATGCTAGACCTACCCTTATCCAAGATAATCTCAAAGGTTTTTGTGTGGCCCACATGCTGGATGAGAAAAACCAATGGAATGAGGATATGGTAAGGCTAG
ACTTCTTTAGGATGGATGCAGAAGACATTCTGAATACTCCCACGGGAAGTAAAGAGTCCAAAGATGAAATCATTTGGCACCCGGACAAAAAAGGCCTTTTCTCTGTGAAG
AGTGTCTATCATCTAGCCATGGATCTCCATGAAATGAATTCAGCCTCTCAATCTGACTCCAAGATAACAACCTCAGTGTGGAAGAAACTATGGAAGCTTAACATCCTTCA
TAGATCTAAAACCTGCACCTGGAAAGTGATCCAAGACATCATCCCCACTAAGGCAAACGTGTTGAAAAAGGGAGTTGACTTAAATCCTATGTGCCTTTTTTGTAAAAAGA
AGTTGGAAACCACTACTCATCTCATTTGGGAGTGCAAAATTCCAATAGAGGGGTGGGAAACCTTTATTCCTACAACTCATAAACTCTTTGATTTATGCAGATTAAGCTGG
GACCCGAAGGATTATTGGTGGTGGCTGGAGGAAAATCTAAACACAGAGGAGTTGGCGAGAAGTGCAATAGTCATATGGAAGAAAATTTCCAAAAATTGGTCAATTAAAGT
CCTAGAAGCCAAAGCTCTGCTCGAGGGTTTGAAAGAGATTAGCTATACCTGCAATCGCCGCTCAATCTTCCTTGAGGTGGAATCTGACGCTCTCGAAGTCGTCGACGCTT
TGATCGGCTCTGTAGAGGATTTCTCAGACCTGAAAAACTTCACCGATCAGATCAAGGACATTGCTTCGAAGTCGTTCGGTATAGAATTTCGTCACTGTAGTATATTTTTT
AACACAGACGCGCACTGTGTTGTGAGAAAAGCCATGGATTTTCATTTTGCTTCTGATTTGGTTTCTGGTCATGCAACTGATCGTCGCCGCCGCCGCCGCCGCCTCTCGCC
GTTCATTGCTGCCGCCGTAGCCTTGCTCCTACGAGAAATGATAATGTTAAATTGCTCTGCTTGTTCGTTTGTCAGAATGGATATGGAAGAAGTTAAATTTCACGAGTCAA
AAGAGAAGATTTTCAAGATTTTTGGAGAGTTCATGGCTTGTGTTGCAAAGCTCGACGAATTGGGGACTTTAGGAAGCCAACTGCTTTCTGGCGTTCAGCAAGGACTTGAG
CTTCTTAGACGACCTGCAATAGATAGAACATCCAAGTTGATCAAGAATGTCATTGAAACTAACAATACGGAGATTCTTAGATCGTACATTGAAGCTGGATGCATCAAAAC
CCATGATGGCGTGCAAAGTACAAAGAAGTTGCATACATGCCGGGTTGGACTTGATGATCATTTGAAAAAAGCAAGGAGCTTAATCAATGAACTCGAGCGCCTACTCGATG
ATGCAAATATTGCATTGGAAAATGAAGATCCCCTGTGCTCTTCAACGGTCTCAGATGAAGATCTAGAATTGGATGAACGAGAAGCAACTGTTGCTAATAAGAAACCTGAT
GCTAGTGAATATGCTTTATTAATGGGGATCATCAAAGTCATGGTCAAGAAAAACCACCTAATGCAGGAGAAGATTATTTCTGGTCTAAGTCTCAAATCATCCTCGGGGGA
ACTCGAAACGTACAGCCTGATGTGGTCGTTACAACCGTATATAGATGATGAAATTATGGGTCAAGCTTGGAAACTCATTGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTCAAGCCGTCTAGGGGTTTGAGGCAAAGAGACCCCCTCTCCCCTACCTCTTCCTCCTATGTGCTGAAGGCTTCTCGACGATACGACAGTCTTATCTTCATCAAAGC
TCAGAGGAGGGACCTAAACTCTCTCAGGAGAACCTTGAGGATCTATGAAGAAGCGTCGGGTCAAACCATTAACATGGACAAATCTATGTTTATGACTAGCAAGATCATGA
GTGAAGAGGCTGCACGAAGTTGTGAAAGGATTTTGGGCATTAAGAAAAGTCACTGTCTTGGCCAGTATCTAAGGATGCCGGTGCAAGTAGGAAGAAACAAAGGGATAATG
TTCAAAAGAATTAAAGAGAGAGTGGAGAAGACTCTTCAGGGATGGAAAGAGAAACTCCTCTCACTGGCAGGGAAAGAGGCTTTGCGCCAACTTTTGGTGGGGAGAAACAA
AGGAAAGAAGATGAGTCATTGGATGAGTTGGAGGAACATGTGCAAGAGGAAAAAGTCGGGATACTTGGGATTTAGGCAAATCTCCGCTTTAAACCAAGCAATGCTGGCCA
AGCAAAGTTGGAGGATAGCCCGAAACCCGAATAACATCCTTTATAAAATATTGAGAGGTAGATACTTAAAAGATGGAAACTTCTTGGAAGCCCCAATAGGAAATGCTCCC
TCTCTGACTTGGAGAAGCATCTGTTGGGGTCGTGACTTGTTCCAAAAAGGTTTTCGGTGGAGAATGGGTAGCGGTAAAATGATTAGCATCGATAGAGATCCGTGGATTAG
TAGGAAAGGAAATGCTAGACCTACCCTTATCCAAGATAATCTCAAAGGTTTTTGTGTGGCCCACATGCTGGATGAGAAAAACCAATGGAATGAGGATATGGTAAGGCTAG
ACTTCTTTAGGATGGATGCAGAAGACATTCTGAATACTCCCACGGGAAGTAAAGAGTCCAAAGATGAAATCATTTGGCACCCGGACAAAAAAGGCCTTTTCTCTGTGAAG
AGTGTCTATCATCTAGCCATGGATCTCCATGAAATGAATTCAGCCTCTCAATCTGACTCCAAGATAACAACCTCAGTGTGGAAGAAACTATGGAAGCTTAACATCCTTCA
TAGATCTAAAACCTGCACCTGGAAAGTGATCCAAGACATCATCCCCACTAAGGCAAACGTGTTGAAAAAGGGAGTTGACTTAAATCCTATGTGCCTTTTTTGTAAAAAGA
AGTTGGAAACCACTACTCATCTCATTTGGGAGTGCAAAATTCCAATAGAGGGGTGGGAAACCTTTATTCCTACAACTCATAAACTCTTTGATTTATGCAGATTAAGCTGG
GACCCGAAGGATTATTGGTGGTGGCTGGAGGAAAATCTAAACACAGAGGAGTTGGCGAGAAGTGCAATAGTCATATGGAAGAAAATTTCCAAAAATTGGTCAATTAAAGT
CCTAGAAGCCAAAGCTCTGCTCGAGGGTTTGAAAGAGATTAGCTATACCTGCAATCGCCGCTCAATCTTCCTTGAGGTGGAATCTGACGCTCTCGAAGTCGTCGACGCTT
TGATCGGCTCTGTAGAGGATTTCTCAGACCTGAAAAACTTCACCGATCAGATCAAGGACATTGCTTCGAAGTCGTTCGGTATAGAATTTCGTCACTGTAGTATATTTTTT
AACACAGACGCGCACTGTGTTGTGAGAAAAGCCATGGATTTTCATTTTGCTTCTGATTTGGTTTCTGGTCATGCAACTGATCGTCGCCGCCGCCGCCGCCGCCTCTCGCC
GTTCATTGCTGCCGCCGTAGCCTTGCTCCTACGAGAAATGATAATGTTAAATTGCTCTGCTTGTTCGTTTGTCAGAATGGATATGGAAGAAGTTAAATTTCACGAGTCAA
AAGAGAAGATTTTCAAGATTTTTGGAGAGTTCATGGCTTGTGTTGCAAAGCTCGACGAATTGGGGACTTTAGGAAGCCAACTGCTTTCTGGCGTTCAGCAAGGACTTGAG
CTTCTTAGACGACCTGCAATAGATAGAACATCCAAGTTGATCAAGAATGTCATTGAAACTAACAATACGGAGATTCTTAGATCGTACATTGAAGCTGGATGCATCAAAAC
CCATGATGGCGTGCAAAGTACAAAGAAGTTGCATACATGCCGGGTTGGACTTGATGATCATTTGAAAAAAGCAAGGAGCTTAATCAATGAACTCGAGCGCCTACTCGATG
ATGCAAATATTGCATTGGAAAATGAAGATCCCCTGTGCTCTTCAACGGTCTCAGATGAAGATCTAGAATTGGATGAACGAGAAGCAACTGTTGCTAATAAGAAACCTGAT
GCTAGTGAATATGCTTTATTAATGGGGATCATCAAAGTCATGGTCAAGAAAAACCACCTAATGCAGGAGAAGATTATTTCTGGTCTAAGTCTCAAATCATCCTCGGGGGA
ACTCGAAACGTACAGCCTGATGTGGTCGTTACAACCGTATATAGATGATGAAATTATGGGTCAAGCTTGGAAACTCATTGCATGA
Protein sequenceShow/hide protein sequence
MVKPSRGLRQRDPLSPTSSSYVLKASRRYDSLIFIKAQRRDLNSLRRTLRIYEEASGQTINMDKSMFMTSKIMSEEAARSCERILGIKKSHCLGQYLRMPVQVGRNKGIM
FKRIKERVEKTLQGWKEKLLSLAGKEALRQLLVGRNKGKKMSHWMSWRNMCKRKKSGYLGFRQISALNQAMLAKQSWRIARNPNNILYKILRGRYLKDGNFLEAPIGNAP
SLTWRSICWGRDLFQKGFRWRMGSGKMISIDRDPWISRKGNARPTLIQDNLKGFCVAHMLDEKNQWNEDMVRLDFFRMDAEDILNTPTGSKESKDEIIWHPDKKGLFSVK
SVYHLAMDLHEMNSASQSDSKITTSVWKKLWKLNILHRSKTCTWKVIQDIIPTKANVLKKGVDLNPMCLFCKKKLETTTHLIWECKIPIEGWETFIPTTHKLFDLCRLSW
DPKDYWWWLEENLNTEELARSAIVIWKKISKNWSIKVLEAKALLEGLKEISYTCNRRSIFLEVESDALEVVDALIGSVEDFSDLKNFTDQIKDIASKSFGIEFRHCSIFF
NTDAHCVVRKAMDFHFASDLVSGHATDRRRRRRRLSPFIAAAVALLLREMIMLNCSACSFVRMDMEEVKFHESKEKIFKIFGEFMACVAKLDELGTLGSQLLSGVQQGLE
LLRRPAIDRTSKLIKNVIETNNTEILRSYIEAGCIKTHDGVQSTKKLHTCRVGLDDHLKKARSLINELERLLDDANIALENEDPLCSSTVSDEDLELDEREATVANKKPD
ASEYALLMGIIKVMVKKNHLMQEKIISGLSLKSSSGELETYSLMWSLQPYIDDEIMGQAWKLIA