; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0008985 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0008985
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr9:33443375..33446751
RNA-Seq ExpressionLag0008985
SyntenyLag0008985
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PKA49974.1 Putative ribonuclease H protein [Apostasia shenzhenica]2.1e-6637.19Show/hide
Query:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS
        NG   E+ TP RG+RQG   SPY F+LCAEG SALL        + GI  + +   + HL FADDSL F +A +   S IK+ L+ YE+ASGQ IN +KS
Subjt:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS

Query:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIPR-------
            + N++ D    I+  L +   +    YLGLP+ T +NK ++F+++K++VWK LQSWK+KLFS GG+EILIKA+AQA  ++TMS F++P+       
Subjt:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIPR-------

Query:  ------------------------------------------------DPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA
                                                          GWRI     SLMAR+L+ KY+    FL A+A  NSS +W+SI+WGR L  
Subjt:  ------------------------------------------------DPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA

Query:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPI-RVKETLKGKRVAEILNENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRG
         GL+W++GNG  + +  DPWL +   +  I      L   R+ E+L E+ SW  + I       D   IL MP  +   +D +IW  D KG    + G
Subjt:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPI-RVKETLKGKRVAEILNENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRG

PRQ55763.1 putative RNA-directed DNA polymerase [Rosa chinensis]5.6e-6729.65Show/hide
Query:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS
        NG P+    P+RG+RQGDS SPYLF++CAE LS L+ + E + +I G++I +    ++HLFFADDS  FFKA  +E   +K++ + YE ASGQMIN +KS
Subjt:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS

Query:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIPR-------
            +KN+   K  E++A L V++ +    YLGLPT+ S +K   F+ L ++V K  Q W+AK  S  GKEI+IKA+AQ+IP Y MS F +P+       
Subjt:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIPR-------

Query:  ------------------------------------------------DPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA
                                                          GWRI ++P SL+AR+ + KY+    F+KA+A   SS  WKSI++GR L  
Subjt:  ------------------------------------------------DPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA

Query:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRV-KETLKGKRVAEILN-ENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGT
        +GL+ +VGNG QI +  D WL     + PI   KE L+  RV E+++ E + W   ++ + F+P +   I S+P       D  +W  D KGM   + G 
Subjt:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRV-KETLKGKRVAEILN-ENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGT

Query:  CLLVVQSGKEDLGKSIVKTYQPQSSETQVSHDNWLPPEENQ--WKLNVDATWFDDADVGGVGWIIRDSHEGLRQTS-ENFKARPKLF-----DHELVVES
         +  +  G  +                ++     +PP+     W+L              +  I+   HE  ++ +  +F+    +F     +H L +  
Subjt:  CLLVVQSGKEDLGKSIVKTYQPQSSETQVSHDNWLPPEENQ--WKLNVDATWFDDADVGGVGWIIRDSHEGLRQTS-ENFKARPKLF-----DHELVVES

Query:  DASEIVKLINLDSEDLTDISLLIDEIRDLAPTTNVAKFVYSPRSTNFLAH-----SLACAVAWNRDFVFFYFGPSQILRKGVGVGWIVF-----------
        D S I     L   DL  + ++   + +     NV   +       F  +     S    + WN   VF  +  +Q   K +G    V            
Subjt:  DASEIVKLINLDSEDLTDISLLIDEIRDLAPTTNVAKFVYSPRSTNFLAH-----SLACAVAWNRDFVFFYFGPSQILRKGVGVGWIVF-----------

Query:  --------GPL-LNADASWIEGEGIGGLGCVVRDSNGCLIGTGCKKINNMWSIKCLEADNC
                G L +N D S+    G GG+G V+RD  G  +    +   N+ S   +EA+ C
Subjt:  --------GPL-LNADASWIEGEGIGGLGCVVRDSNGCLIGTGCKKINNMWSIKCLEADNC

XP_023881891.1 uncharacterized protein LOC111994244 [Quercus suber]9.9e-6437.78Show/hide
Query:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS
        NG    S TP RG+RQGD  SPY+F+LCA+G S+LL        I G+ I   C  + HLFFADDSL F KA  +E   +  ILQ YE ASGQ IN++KS
Subjt:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS

Query:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIPR-------
            + N  ++K  E+   LG +Q      YLGLP+   ++K  +F+ +K+RV + L  WK KL SVGG+EILIKA+AQAIP YTMS F+IP+       
Subjt:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIPR-------

Query:  ------------------------------------------------DPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA
                                                          GWR+  +P SL+A+I + +YY      +AK   + S  W+SI  G  +  
Subjt:  ------------------------------------------------DPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA

Query:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLKGK------RVAEILN-ENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKG
         G +W+VGNG++I I ED WL      TPI  K     K      RV+ +++ E   WK+D++ D FLP +A  ILS+P  +    D+IIW  + KG
Subjt:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLKGK------RVAEILN-ENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKG

XP_030498122.1 uncharacterized protein LOC115713779 [Cannabis sativa]9.9e-6428.05Show/hide
Query:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS
        NG       P+RGIRQGD  SPYLF++CAEGLS LL  +E+  +++G++++    +V+HLFFADDS+ F +A  + + +I+++L  Y  ASGQ +N +K 
Subjt:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS

Query:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIP--------
        +   + N  +         L +        YLGLP+ + R+K  +FS + D++WK+L SWK +LFS GGKE+L+KA+ QAIP Y MS FR+P        
Subjt:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIP--------

Query:  -----------------------------------------------RDPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA
                                                           WR+   P SL++RIL  +Y+   S L A      SL W+SIVWG+ L  
Subjt:  -----------------------------------------------RDPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA

Query:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLKGKRVAEILNENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGTCL
        +GL+W+VG G QI    DPWL     +TP     T    +VA+++N +  W    +  +F  +D + ILS+P       D +IW     G    + G   
Subjt:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLKGKRVAEILNENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGTCL

Query:  LVVQSGKEDLGKSIVKTYQPQSSETQVSHDNWLPPEENQWKLNVDA-----TWFDDADVGGVGWIIRDSHEGLRQTSENFKARPKLFDHELVVESDASEI
            + ++D G S   TY           ++W       WKL + +      W    +V  V   +   H          K + +   H L + S A E+
Subjt:  LVVQSGKEDLGKSIVKTYQPQSSETQVSHDNWLPPEENQWKLNVDA-----TWFDDADVGGVGWIIRDSHEGLRQTSENFKARPKLFDHELVVESDASEI

Query:  VKLINLDSEDLTDISLLIDE-IRDLAPTTNVAKF---------VYSPRSTNFLAHSLACAVAWNRDFVFFYFGPSQILRKGVGVGWIVFGPL--------
         K+ NL+       S   +E +  ++  T+  +F         ++  R+  F      C  A  +DF   Y    Q +     V      P         
Subjt:  VKLINLDSEDLTDISLLIDE-IRDLAPTTNVAKF---------VYSPRSTNFLAHSLACAVAWNRDFVFFYFGPSQILRKGVGVGWIVFGPL--------

Query:  -----------------LNADASWIEGEGIGGLGCVVRDSNGCLIGTGCKKINNMWSIKCLEA
                         LN DA++ +   I G+G V+RDS+G +     K I+  +  + +EA
Subjt:  -----------------LNADASWIEGEGIGGLGCVVRDSNGCLIGTGCKKINNMWSIKCLEA

XP_030508858.1 uncharacterized protein LOC115723499 [Cannabis sativa]4.0e-6535.98Show/hide
Query:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS
        NG       P RGIRQGD  SPYLF++CAEGLS LL  +E+  ++ G+K++    +V+HLFFADDS+ F +A ++ + +I+++L  Y  ASGQ IN +K 
Subjt:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS

Query:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIP--------
        +   + N            L +        YLGLP+ + R+K  +FS + D++W +L SWK +LFSVGGKE+L+KA+ QAIP Y MS FR+P        
Subjt:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIP--------

Query:  ----------RDPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFAEGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLK
                      WR+   P SL++RIL  +Y+   + L A      SL W+SIVWG+ L  +GL+W+VG+G +I    DPWL     +TP   + T  
Subjt:  ----------RDPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFAEGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLK

Query:  GKRVAEILNENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGTCLLVVQSGKEDLGKSIV
           VA++++++  W    +  +F  ++ + ILS+P       D +IW   T G    + G       +  +D G + V
Subjt:  GKRVAEILNENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGTCLLVVQSGKEDLGKSIV

TrEMBL top hitse value%identityAlignment
A0A2I0A354 Ribonuclease H protein1.0e-6637.19Show/hide
Query:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS
        NG   E+ TP RG+RQG   SPY F+LCAEG SALL        + GI  + +   + HL FADDSL F +A +   S IK+ L+ YE+ASGQ IN +KS
Subjt:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS

Query:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIPR-------
            + N++ D    I+  L +   +    YLGLP+ T +NK ++F+++K++VWK LQSWK+KLFS GG+EILIKA+AQA  ++TMS F++P+       
Subjt:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIPR-------

Query:  ------------------------------------------------DPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA
                                                          GWRI     SLMAR+L+ KY+    FL A+A  NSS +W+SI+WGR L  
Subjt:  ------------------------------------------------DPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA

Query:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPI-RVKETLKGKRVAEILNENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRG
         GL+W++GNG  + +  DPWL +   +  I      L   R+ E+L E+ SW  + I       D   IL MP  +   +D +IW  D KG    + G
Subjt:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPI-RVKETLKGKRVAEILNENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRG

A0A2N9IYC8 Reverse transcriptase domain-containing protein2.9e-6930.98Show/hide
Query:  CVGTF---IPYNGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYEL
        CV T    I  NG P     P+RG+RQG+  SPYLF+LCAEG  +L+ +E++   + G+ I+     ++HLFFADDSL F KAT  + + I+ IL  YE 
Subjt:  CVGTF---IPYNGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYEL

Query:  ASGQMINLNKSLCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWF
        ASGQ IN  K+    +K+       +I   L V+       YLGLP+   R K   F+++K+RVW  L+ WK KL S  G+EILIK++AQAIP Y MS F
Subjt:  ASGQMINLNKSLCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWF

Query:  RIP--------------------------------------------RDPG-----------WRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVW
        R+P                                            RD G           WR+  +P SL  ++ + KY+   S L+ +     S  W
Subjt:  RIP--------------------------------------------RDPG-----------WRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVW

Query:  KSIVWGRSLFAEGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLKGKRVAEIL--NENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGAD
        +SI+  R L  +GL W+VG GK I I  D WL+       I            E L  ++  SWK +++ + FLP +A  IL +P    +  D ++WGA 
Subjt:  KSIVWGRSLFAEGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLKGKRVAEIL--NENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGAD

Query:  TKGMQGGRRGTCLLVVQSGKEDLGKS------------------IVKTYQPQSSETQVSHDN-------WLPPEENQWKLNVDATWFDDADVGGVGWIIR
         +G+   R G  LL     +++ G S                  +++ ++ Q    Q S  +       W PPE+ ++K+N D   F+D +  G+G IIR
Subjt:  TKGMQGGRRGTCLLVVQSGKEDLGKS------------------IVKTYQPQSSETQVSHDN-------WLPPEENQWKLNVDATWFDDADVGGVGWIIR

Query:  DSHEG--------------LRQTSENFKARPKL-FDHEL-----VVESDASEIVKLINLDSEDLTDISLLIDEIRDLAPTTNVAKFVYSPRSTNFLAHSL
         +H+G                +  E   AR  + F  +L      +E D++ IV+ I L +   T    +I++IR +A      +F++  R  N +AH L
Subjt:  DSHEG--------------LRQTSENFKARPKL-FDHEL-----VVESDASEIVKLINLDSEDLTDISLLIDEIRDLAPTTNVAKFVYSPRSTNFLAHSL

Query:  ACAVAWNRDF
        A     N+ F
Subjt:  ACAVAWNRDF

A0A2P6SAP1 Putative RNA-directed DNA polymerase2.7e-6729.65Show/hide
Query:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS
        NG P+    P+RG+RQGDS SPYLF++CAE LS L+ + E + +I G++I +    ++HLFFADDS  FFKA  +E   +K++ + YE ASGQMIN +KS
Subjt:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS

Query:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIPR-------
            +KN+   K  E++A L V++ +    YLGLPT+ S +K   F+ L ++V K  Q W+AK  S  GKEI+IKA+AQ+IP Y MS F +P+       
Subjt:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIPR-------

Query:  ------------------------------------------------DPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA
                                                          GWRI ++P SL+AR+ + KY+    F+KA+A   SS  WKSI++GR L  
Subjt:  ------------------------------------------------DPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA

Query:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRV-KETLKGKRVAEILN-ENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGT
        +GL+ +VGNG QI +  D WL     + PI   KE L+  RV E+++ E + W   ++ + F+P +   I S+P       D  +W  D KGM   + G 
Subjt:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRV-KETLKGKRVAEILN-ENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGT

Query:  CLLVVQSGKEDLGKSIVKTYQPQSSETQVSHDNWLPPEENQ--WKLNVDATWFDDADVGGVGWIIRDSHEGLRQTS-ENFKARPKLF-----DHELVVES
         +  +  G  +                ++     +PP+     W+L              +  I+   HE  ++ +  +F+    +F     +H L +  
Subjt:  CLLVVQSGKEDLGKSIVKTYQPQSSETQVSHDNWLPPEENQ--WKLNVDATWFDDADVGGVGWIIRDSHEGLRQTS-ENFKARPKLF-----DHELVVES

Query:  DASEIVKLINLDSEDLTDISLLIDEIRDLAPTTNVAKFVYSPRSTNFLAH-----SLACAVAWNRDFVFFYFGPSQILRKGVGVGWIVF-----------
        D S I     L   DL  + ++   + +     NV   +       F  +     S    + WN   VF  +  +Q   K +G    V            
Subjt:  DASEIVKLINLDSEDLTDISLLIDEIRDLAPTTNVAKFVYSPRSTNFLAH-----SLACAVAWNRDFVFFYFGPSQILRKGVGVGWIVF-----------

Query:  --------GPL-LNADASWIEGEGIGGLGCVVRDSNGCLIGTGCKKINNMWSIKCLEADNC
                G L +N D S+    G GG+G V+RD  G  +    +   N+ S   +EA+ C
Subjt:  --------GPL-LNADASWIEGEGIGGLGCVVRDSNGCLIGTGCKKINNMWSIKCLEADNC

A0A803P8L6 Uncharacterized protein7.7e-7031.47Show/hide
Query:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS
        NG  Q +FTP RG+RQGD  SP+LF+LC+EGLS LL   E    I G++  +  H ++HL FADDSL F  A+ EES  +K++L  Y   SGQ INL KS
Subjt:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS

Query:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIPR-------
           +   ++ + A  ++A LGV    +   YLG+PT   +NK  +F +++DRV   LQ WK  LFS  GKEILIKAI QA+P Y MS FRI +       
Subjt:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIPR-------

Query:  ------------------------------------------------DPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA
                                                          GW+I  +P  LMA++L+  Y+  ++FL+AK     S +W+ IVWGR L  
Subjt:  ------------------------------------------------DPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA

Query:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLKGKRVAEILNENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGTCL
        +G +W +GNG  I I+EDPWL +   +         +G  +  +LN + SWK D +   F   D   +L +   N +  D I W     G      G  L
Subjt:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLKGKRVAEILNENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGTCL

Query:  LVVQSGKEDLGKSIVKTYQPQSSETQVSHDNWLPPEENQWKLNVDATWFDDADVGGVGWIIRDSHEGLRQTSEN----------------FKARPKLFD-
                     +     P     + S  +W PP    + +N DA+        G+G +IRD H G    +E                  ++  KL D 
Subjt:  LVVQSGKEDLGKSIVKTYQPQSSETQVSHDNWLPPEENQWKLNVDATWFDDADVGGVGWIIRDSHEGLRQTSEN----------------FKARPKLFD-

Query:  ---HELVVESDASEIVKLINLDSEDLTDISLLIDEIRDLAPTTNVAKFVYSPRSTNFLAHSLACAVAWNRDF
             + + SD   +   +N D+  +TD  L++ +        N+  F++SPR  N +A+   C  +W R F
Subjt:  ---HELVVESDASEIVKLINLDSEDLTDISLLIDEIRDLAPTTNVAKFVYSPRSTNFLAHSLACAVAWNRDF

A0A803PVI9 Uncharacterized protein1.0e-6635.75Show/hide
Query:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS
        NG  Q    P RGIRQGD  SPYLFILCAEGLS LL  EE   N+ G+K+     +V+HLFFADDSL   +A    +  I+  L  Y  ASGQ +NL+KS
Subjt:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLNKS

Query:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRI---------
        +   + N      A++   L +        YLGLP+ + R+K I+F+ +K+++WK L +W+ +LFS+GGKE+L+KA+AQ+IP Y MS FR+         
Subjt:  LCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRI---------

Query:  ---------------------------PRDPG-------------------WRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA
                                   P+  G                   WR+  +P SL++RIL+ +Y+S SSFL +      SL W+ IVWG+ L  
Subjt:  ---------------------------PRDPG-------------------WRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFA

Query:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLKGKRVAEILNENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGTCL
        +GL+WKVG+G+ I    DPW+     + P++   T   + V+  +  +  W  +++ D+FL SD E +L +P      +D++IW  +T G    + G  L
Subjt:  EGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLKGKRVAEILNENSSWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGTCL

SwissProt top hitse value%identityAlignment
P11369 LINE-1 retrotransposable element ORF2 protein1.5e-0626.7Show/hide
Query:  IPYNGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINL
        I  NG   E+     G RQG   SPYLF +  E L+  + +++ I    GI+I      V     ADD + +    K  +  +  ++ ++    G  IN 
Subjt:  IPYNGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINL

Query:  NKSLCMI-NKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIM---FSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIY
        NKS+  +  KN   +K    +    ++ +N    YLG+ T T   K +    F  LK  + + L+ WK    S  G+  ++K       IY
Subjt:  NKSLCMI-NKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIM---FSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIY

P92555 Uncharacterized mitochondrial protein AtMg012501.6e-1150Show/hide
Query:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDS
        NG PQ   TP+RG+RQGD  SPYLFILC E LS L  R +    + GI+++++   + HL FADD+
Subjt:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDS

P93295 Uncharacterized mitochondrial protein AtMg003103.1e-0729.35Show/hide
Query:  MSWFR--IPRDPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFAEGLKWKVGNGKQIYIDEDPWLMQDGKWTPI
        + WF   +     +RI   P +L++R+LR +Y+  SS ++       S  W+SI+ GR L + GL   +G+G    +  D W+M +    P+
Subjt:  MSWFR--IPRDPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFAEGLKWKVGNGKQIYIDEDPWLMQDGKWTPI

Arabidopsis top hitse value%identityAlignment
AT4G29090.1 Ribonuclease H-like superfamily protein8.8e-1025.13Show/hide
Query:  WRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFAEGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLKGK--------RVAE
        WR+   P SLMA++ + +Y+  S  L A      S VWKSI   + +  +G +  VGNG+ I I    WL        +R++     +        +V++
Subjt:  WRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFAEGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLKGK--------RVAE

Query:  ILNENS-SWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGTCLLVVQSGKEDLGKSIVKTYQPQSSETQVSHDNWLPPEENQWK
        +++E+   W++D+I   F   + + I  +      + D   W   + G            V+SG   L + I K   PQ    +VS  +  P  +  WK
Subjt:  ILNENS-SWKEDIIMDSFLPSDAEAILSMPKRNMDLNDEIIWGADTKGMQGGRRGTCLLVVQSGKEDLGKSIVKTYQPQSSETQVSHDNWLPPEENQWK

AT5G65005.1 Polynucleotidyl transferase, ribonuclease H-like superfamily protein5.5e-0424.59Show/hide
Query:  TCLLVVQSGKEDLGKSIVKTYQPQSSETQVSHD-NWLPPEENQWKLNVDATWFDDADVGGVGWIIRDSHEGL----------RQTSENFKARPKL-----
        T  + +   KE L  ++    Q  +     S +  W PP  ++ K N DA+  +   V G+GWI+R+S   +          R T+E  +    +     
Subjt:  TCLLVVQSGKEDLGKSIVKTYQPQSSETQVSHD-NWLPPEENQWKLNVDATWFDDADVGGVGWIIRDSHEGL----------RQTSENFKARPKL-----

Query:  ---FDHELVV-ESDASEIVKLINLDSEDLTDISLLIDEIRDLAPTTNVAKFVYSPRSTNFLAHSLA-CAVAWNRDFVFFYFGP
           F H+ V+ E D   I ++IN  S +   +   +D I+   P+    +F +  R  N  A  LA  A+  N  +  F+  P
Subjt:  ---FDHELVV-ESDASEIVKLINLDSEDLTDISLLIDEIRDLAPTTNVAKFVYSPRSTNFLAHSLA-CAVAWNRDFVFFYFGP

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein2.2e-0829.35Show/hide
Query:  MSWFR--IPRDPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFAEGLKWKVGNGKQIYIDEDPWLMQDGKWTPI
        + WF   +     +RI   P +L++R+LR +Y+  SS ++       S  W+SI+ GR L + GL   +G+G    +  D W+M +    P+
Subjt:  MSWFR--IPRDPGWRIARDPGSLMARILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFAEGLKWKVGNGKQIYIDEDPWLMQDGKWTPI

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)1.1e-1250Show/hide
Query:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDS
        NG PQ   TP+RG+RQGD  SPYLFILC E LS L  R +    + GI+++++   + HL FADD+
Subjt:  NGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCGCTTTTGCTGCGTGGGGACTTTTATTCCCTACAATGGCATCCCCCAGGAGAGTTTCACGCCCAATCGTGGTATCAGGCAAGGCGACTCTTCGTCGCCGTATTTGTT
CATTTTATGTGCGGAAGGCCTGTCGGCTCTTCTTACAAGGGAAGAATCCATCTCTAACATTTTCGGTATTAAAATTAATAGTCATTGCCATACTGTAGCACATCTCTTTT
TTGCAGATGATAGTCTGACCTTCTTCAAAGCCACGAAAGAGGAAAGCTCCAACATAAAGAAGATCCTGCAGACCTATGAGCTCGCGTCTGGTCAGATGATCAATCTCAAT
AAGTCTCTTTGTATGATCAACAAGAACTTAGCTGAGGATAAAGCTGCGGAGATTAGTGCGGAGCTTGGGGTTATTCAGTCCAACTCCATAGGGCACTATCTTGGTCTCCC
AACGCAGACAAGTAGAAATAAGGGGATCATGTTCAGTAGATTGAAGGATAGGGTTTGGAAGGTGTTGCAAAGTTGGAAAGCGAAGTTGTTTTCTGTAGGGGGGAAGGAGA
TTCTCATCAAAGCCATTGCCCAAGCCATACCTATATACACTATGAGCTGGTTTAGAATTCCTAGGGATCCGGGTTGGAGAATTGCTAGGGATCCGGGTAGTCTTATGGCT
CGGATTCTTAGGGGCAAATACTACAGTGGTAGCTCCTTTTTGAAAGCAAAGGCCAAAGGAAATTCATCTTTGGTGTGGAAAAGTATTGTGTGGGGTAGATCTTTATTTGC
TGAAGGGCTCAAATGGAAAGTGGGGAACGGGAAACAAATTTATATTGATGAGGATCCTTGGCTTATGCAGGATGGAAAATGGACTCCCATTCGTGTTAAGGAGACGCTTA
AGGGCAAAAGAGTAGCAGAGATTCTTAATGAGAACAGTTCTTGGAAAGAGGATATCATTATGGATTCTTTCCTTCCAAGCGATGCAGAGGCTATTCTATCTATGCCCAAG
CGGAACATGGACTTGAACGACGAAATAATTTGGGGTGCTGACACAAAAGGGATGCAAGGAGGAAGACGTGGAACATGTCTTTTGGTCGTGCAAAGTGGTAAGGAAGATTT
GGGCAAATCTATTGTTAAGACTTACCAGCCCCAATCGTCGGAGACCCAAGTGAGTCACGACAACTGGCTTCCGCCGGAAGAGAATCAATGGAAGTTGAACGTTGATGCGA
CTTGGTTTGACGACGCAGATGTCGGAGGGGTGGGGTGGATCATCCGCGACTCTCATGAGGGGCTTAGACAAACCTCTGAGAATTTTAAAGCTCGTCCAAAGCTCTTCGAT
CACGAGTTGGTGGTCGAGTCGGATGCTTCCGAAATTGTGAAGCTGATCAATCTAGATTCGGAGGACCTCACGGATATCTCGCTTCTGATAGACGAGATTCGTGATTTGGC
TCCGACTACAAATGTGGCGAAGTTCGTGTATAGTCCTAGATCCACCAATTTTTTGGCGCACTCTCTTGCGTGCGCAGTTGCTTGGAATAGGGATTTTGTTTTCTTCTATT
TTGGTCCTTCTCAGATCTTGAGAAAGGGTGTTGGCGTTGGGTGGATAGTTTTTGGCCCCCTGCTGAACGCTGATGCCTCATGGATCGAGGGAGAAGGTATAGGTGGGCTA
GGATGTGTCGTTCGTGACTCTAACGGATGTCTAATCGGAACGGGCTGCAAGAAGATTAACAACATGTGGAGCATCAAATGTTTGGAAGCTGATAACTGCCTAAAAGATTA
TGCTGCTGAGCGACTGGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAACAGAACTGCCACATCACACTCGTTAGCCATCTTCATGAACCGACTTCTGTTGAGTTAT
TTTCGGGATAA
mRNA sequenceShow/hide mRNA sequence
ATGCGCTTTTGCTGCGTGGGGACTTTTATTCCCTACAATGGCATCCCCCAGGAGAGTTTCACGCCCAATCGTGGTATCAGGCAAGGCGACTCTTCGTCGCCGTATTTGTT
CATTTTATGTGCGGAAGGCCTGTCGGCTCTTCTTACAAGGGAAGAATCCATCTCTAACATTTTCGGTATTAAAATTAATAGTCATTGCCATACTGTAGCACATCTCTTTT
TTGCAGATGATAGTCTGACCTTCTTCAAAGCCACGAAAGAGGAAAGCTCCAACATAAAGAAGATCCTGCAGACCTATGAGCTCGCGTCTGGTCAGATGATCAATCTCAAT
AAGTCTCTTTGTATGATCAACAAGAACTTAGCTGAGGATAAAGCTGCGGAGATTAGTGCGGAGCTTGGGGTTATTCAGTCCAACTCCATAGGGCACTATCTTGGTCTCCC
AACGCAGACAAGTAGAAATAAGGGGATCATGTTCAGTAGATTGAAGGATAGGGTTTGGAAGGTGTTGCAAAGTTGGAAAGCGAAGTTGTTTTCTGTAGGGGGGAAGGAGA
TTCTCATCAAAGCCATTGCCCAAGCCATACCTATATACACTATGAGCTGGTTTAGAATTCCTAGGGATCCGGGTTGGAGAATTGCTAGGGATCCGGGTAGTCTTATGGCT
CGGATTCTTAGGGGCAAATACTACAGTGGTAGCTCCTTTTTGAAAGCAAAGGCCAAAGGAAATTCATCTTTGGTGTGGAAAAGTATTGTGTGGGGTAGATCTTTATTTGC
TGAAGGGCTCAAATGGAAAGTGGGGAACGGGAAACAAATTTATATTGATGAGGATCCTTGGCTTATGCAGGATGGAAAATGGACTCCCATTCGTGTTAAGGAGACGCTTA
AGGGCAAAAGAGTAGCAGAGATTCTTAATGAGAACAGTTCTTGGAAAGAGGATATCATTATGGATTCTTTCCTTCCAAGCGATGCAGAGGCTATTCTATCTATGCCCAAG
CGGAACATGGACTTGAACGACGAAATAATTTGGGGTGCTGACACAAAAGGGATGCAAGGAGGAAGACGTGGAACATGTCTTTTGGTCGTGCAAAGTGGTAAGGAAGATTT
GGGCAAATCTATTGTTAAGACTTACCAGCCCCAATCGTCGGAGACCCAAGTGAGTCACGACAACTGGCTTCCGCCGGAAGAGAATCAATGGAAGTTGAACGTTGATGCGA
CTTGGTTTGACGACGCAGATGTCGGAGGGGTGGGGTGGATCATCCGCGACTCTCATGAGGGGCTTAGACAAACCTCTGAGAATTTTAAAGCTCGTCCAAAGCTCTTCGAT
CACGAGTTGGTGGTCGAGTCGGATGCTTCCGAAATTGTGAAGCTGATCAATCTAGATTCGGAGGACCTCACGGATATCTCGCTTCTGATAGACGAGATTCGTGATTTGGC
TCCGACTACAAATGTGGCGAAGTTCGTGTATAGTCCTAGATCCACCAATTTTTTGGCGCACTCTCTTGCGTGCGCAGTTGCTTGGAATAGGGATTTTGTTTTCTTCTATT
TTGGTCCTTCTCAGATCTTGAGAAAGGGTGTTGGCGTTGGGTGGATAGTTTTTGGCCCCCTGCTGAACGCTGATGCCTCATGGATCGAGGGAGAAGGTATAGGTGGGCTA
GGATGTGTCGTTCGTGACTCTAACGGATGTCTAATCGGAACGGGCTGCAAGAAGATTAACAACATGTGGAGCATCAAATGTTTGGAAGCTGATAACTGCCTAAAAGATTA
TGCTGCTGAGCGACTGGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAACAGAACTGCCACATCACACTCGTTAGCCATCTTCATGAACCGACTTCTGTTGAGTTAT
TTTCGGGATAA
Protein sequenceShow/hide protein sequence
MRFCCVGTFIPYNGIPQESFTPNRGIRQGDSSSPYLFILCAEGLSALLTREESISNIFGIKINSHCHTVAHLFFADDSLTFFKATKEESSNIKKILQTYELASGQMINLN
KSLCMINKNLAEDKAAEISAELGVIQSNSIGHYLGLPTQTSRNKGIMFSRLKDRVWKVLQSWKAKLFSVGGKEILIKAIAQAIPIYTMSWFRIPRDPGWRIARDPGSLMA
RILRGKYYSGSSFLKAKAKGNSSLVWKSIVWGRSLFAEGLKWKVGNGKQIYIDEDPWLMQDGKWTPIRVKETLKGKRVAEILNENSSWKEDIIMDSFLPSDAEAILSMPK
RNMDLNDEIIWGADTKGMQGGRRGTCLLVVQSGKEDLGKSIVKTYQPQSSETQVSHDNWLPPEENQWKLNVDATWFDDADVGGVGWIIRDSHEGLRQTSENFKARPKLFD
HELVVESDASEIVKLINLDSEDLTDISLLIDEIRDLAPTTNVAKFVYSPRSTNFLAHSLACAVAWNRDFVFFYFGPSQILRKGVGVGWIVFGPLLNADASWIEGEGIGGL
GCVVRDSNGCLIGTGCKKINNMWSIKCLEADNCLKDYAAERLEGANSVLQQNWEQNCHITLVSHLHEPTSVELFSG