; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0032889 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0032889
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr11:38539868..38543814
RNA-Seq ExpressionLag0032889
SyntenyLag0032889
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAB4268750.1 unnamed protein product [Prunus armeniaca]2.0e-15131.46Show/hide
Query:  MWVSSISFSLLSFSKNHIDGWISRDG--RRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNAILCQDEKDGG-----------------CA
        MW   ++ SL S S NHID  +   G   RWR TGFYG P      +SW LL RL      PWL  GDFN IL  DEK GG                 C 
Subjt:  MWVSSISFSLLSFSKNHIDGWISRDG--RRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNAILCQDEKDGG-----------------CA

Query:  DTE-----------RPGGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQL----------------------------DLRQLVGSSWGAGPGE
          +           R   E I  +LDR    T W   +P   V HL+ ++SDH  +                            +  + +   W      
Subjt:  DTE-----------RPGGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQL----------------------------DLRQLVGSSWGAGPGE

Query:  SSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQEEEL--------------------------
         S   P A+  K  R    ++GW R    + P +I S   K+   +    T  + +   +  A+L+ ++ + E+                          
Subjt:  SSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQEEEL--------------------------

Query:  -----NRIGGLEDDQGVWQQ-----EKTAV------------------------------IQALLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHW
             N I GLED+ G+WQ      E T V                               + LL  FT EEI  AL Q HP+KAPGPDG S  FY+ +W
Subjt:  -----NRIGGLEDDQGVWQQ-----EKTAV------------------------------IQALLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHW

Query:  DIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN------------------------NQSAFIPGRCVVDNAILGFECIHEL
        D++G+D++ + L  L+ G     IN T + LIPK+   + M   RPISL N                        +QSAF+PGR + DN+I+ FE +H +
Subjt:  DIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN------------------------NQSAFIPGRCVVDNAILGFECIHEL

Query:  RRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVER
         +R  G   +  LKLDMSKAYDRVEW FL  +ML +GF ++W+ +I+ C++SVS+SF LNG  VG V   RGLRQGDPLSPYLFLLCA+  S L+   E+
Subjt:  RRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVER

Query:  RSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQQYLGLPSYMSRN
           + G  + R  P++SHLFFA+DS +F KA +S    ++++   YE+ SGQ IN +KS V+FS N  +E Q+  +  L V R   H +YLGLP+++ R+
Subjt:  RSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQQYLGLPSYMSRN

Query:  RSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLGFRDMEL
        R     ++KD IW +IQGWK K  S AGKEVLLK + QA+P Y M+CF +P+ L +EI + MA +WW   E  R+IHWLSW +LC PK   GLGFR++  
Subjt:  RSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLGFRDMEL

Query:  FNQALLANQ-----------------------CCLL-----------W-----GRELLVHGCRWRIGNGRATPIYGSNWLSNEFSLQIQSAPVLPLASTV
        FN AL+A Q                       C +L           W      + +L  G RWRIG+G+   I+   WL    S ++ S PV    +T 
Subjt:  FNQALLANQ-----------------------CCLL-----------W-----GRELLVHGCRWRIGNGRATPIYGSNWLSNEFSLQIQSAPVLPLASTV

Query:  SELFF--ATGGWNEAMLRAIFNESDS------------------WKFFGEEWISTCSYTGYR--------------DRPGSSNSERMRAW---WSA----
                T  W   +L A F+  ++                  W F      S  S  GY                  G++N      W   W      
Subjt:  SELFF--ATGGWNEAMLRAIFNESDS------------------WKFFGEEWISTCSYTGYR--------------DRPGSSNSERMRAW---WSA----

Query:  -----LWR---GLSPFVLELR------------CGKKYVVGFQI-------------CSLHQSFSHFRFEEIIGAMKD---KLTGPDFEIVVIFWWSVWN
             LWR    + P  + L+            CG++   GF +             CS   S       ++    +D    L+ P+ E V+I  W++WN
Subjt:  -----LWR---GLSPFVLELR------------CGKKYVVGFQI-------------CSLHQSFSHFRFEEIIGAMKD---KLTGPDFEIVVIFWWSVWN

Query:  LRNNLSWGGQSDDRDLWAYSSDYLSAFHVGGRRF-------------PAGDCCVWRPPPNRKLKLNIDASVRPDTGEAGGGCVLRGSDG
         RN + W       D W   S+ +S    G  R              P+     W+ PP   +K+N+  + R  TG  G G V+R   G
Subjt:  LRNNLSWGGQSDDRDLWAYSSDYLSAFHVGGRRF-------------PAGDCCVWRPPPNRKLKLNIDASVRPDTGEAGGGCVLRGSDG

XP_010692477.1 PREDICTED: uncharacterized protein LOC104905588 [Beta vulgaris subsp. vulgaris]7.0e-15231.1Show/hide
Query:  MSAAKRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISRDGRRWRLTGFYGFPSADLHPQSWSLLSRLRGCADT--PWLIGGDFNAIL
        + A K  LGF N F V   GR+GGL L W   + FSL+SFS++HI G +    ++WR  G YG+   +    +WSLL  L  C DT  P L+GGDFN IL
Subjt:  MSAAKRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISRDGRRWRLTGFYGFPSADLHPQSWSLLSRLRGCADT--PWLIGGDFNAIL

Query:  CQDEKDGGCA------------------------------DTERPGGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQLDLRQ-----------
           EK+GG                                +  R     I E+LDR   +  W DLYP+ +  H    +SDH  + LR            
Subjt:  CQDEKDGGCA------------------------------DTERPGGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQLDLRQ-----------

Query:  ----------------LVGSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEV-LQE
                        +V  SW    GE        +  +     + +V W   K +N  ++I +A + +  A  +  +  +    V  E +L+E+  + 
Subjt:  ----------------LVGSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEV-LQE

Query:  EELNRIGGLEDDQGVWQQEKTAVIQ---------ALLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTM
        E    I    +   +  +   +VI+          LL+PF+K+EIL AL+Q HP KAPGPDG+   FY+  W I+G D+      +L+   SP  +N+T 
Subjt:  EELNRIGGLEDDQGVWQQEKTAVIQ---------ALLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTM

Query:  IVLIPKIKATRRMVDFRPISLCN------------------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAF
        I LIPK+K   +  +FRPI+LCN                        NQSAF+PGR + DNA++  E  H ++ R R       +KLDMSKAYDRVEW F
Subjt:  IVLIPKIKATRRMVDFRPISLCN------------------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAF

Query:  LPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIF
        L +++L +GF  +W+++I+  VSSV++SF +NG   G V  +RGLRQGDPLSPYLF++ A   S ++    +  Q+ G +  RS P ISHLFFA+DSL+F
Subjt:  LPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIF

Query:  FKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAG
         +A + E  +I ++  +YE ASGQ INYEKS V++S       +  ++ +L++ +   H++YLG+PS   R++      + DRIW ++QGWK K  S AG
Subjt:  FKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAG

Query:  KEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQCCLL---------------
        KEVLLKS++QAIP Y M  ++ P  +I++I   MARFWW  S+ +R+IHW +WDS+C  KC  G+GF+D+ +FN ALL  Q   L               
Subjt:  KEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQCCLL---------------

Query:  -----------------------W-GRELLVHGCRWRIGNGRATPIYGSNWLSNEFSLQIQSAPVLPLASTVSELFFATGGWNEAMLRAIFNESD-----
                               W  + LL  G  WR+GNG    ++   W+ +E    + S P   +      + F    W  ++L +  NE D     
Subjt:  -----------------------W-GRELLVHGCRWRIGNGRATPIYGSNWLSNEFSLQIQSAPVLPLASTVSELFFATGGWNEAMLRAIFNESD-----

Query:  -------------SWKFFGEEWISTCSYTGYRDRPGSSNSERMRAW---WSA---------LWRGLSPFVLELR--CGKKYVVGFQIC---------SLH
                     +W F  +   S    T Y    G +     +AW   WS          LWR L    L +R     +++    +C           H
Subjt:  -------------SWKFFGEEWISTCSYTGYRDRPGSSNSERMRAW---WSA---------LWRGLSPFVLELR--CGKKYVVGFQIC---------SLH

Query:  QSFSHFRFEEI-IGAMKDKLTGPDFEI----VVIFW---------------WSVWNLRNNLSWGGQSDDRD-LWAYSSDYLSAFHVGGRRF-------PA
          F   +  ++ + +    L   D  +    +++ W               W +W  RN   +  ++     L    S  +       RR          
Subjt:  QSFSHFRFEEI-IGAMKDKLTGPDFEI----VVIFW---------------WSVWNLRNNLSWGGQSDDRD-LWAYSSDYLSAFHVGGRRF-------PA

Query:  GDCCVWRPPPNRKLKLNIDASVRPDTGEAGGGCVLRGSDGEVFMAVCLSLQRYWSVDLAEGWAM
        G    W  PP   +KLN+DAS+  D G  G   + R SDG V  A    ++ YW+ ++AE  A+
Subjt:  GDCCVWRPPPNRKLKLNIDASVRPDTGEAGGGCVLRGSDGEVFMAVCLSLQRYWSVDLAEGWAM

XP_010693052.1 PREDICTED: uncharacterized protein LOC104906048 [Beta vulgaris subsp. vulgaris]1.5e-15436.41Show/hide
Query:  AAKRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISRDGRRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNAILCQDE
        A K  LGF N F V   GR+GGL + W   I FSL+S+S++HI G I    ++WR  G YG+   +    +WSL+  L G +  P L+GGDFN IL  +E
Subjt:  AAKRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISRDGRRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNAILCQDE

Query:  KDGGCADTERPG-------------------------------GETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQLDLRQ-------------L
        K+GG +D  R G                                  I E+LDR   +  W  LYPN +V H    +SDH  + LR               
Subjt:  KDGGCADTERPG-------------------------------GETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQLDLRQ-------------L

Query:  VGSSWGAGP-------GESSDITPLAIANKANRCMRSMVGWGRSKSENFPR---RICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQEEELNRIGG
          +SW   P           D T  ++  +     + +  W   +  NF +   R+ +A  ++Q      +  ++R   +  E +L+E+ +E   + +  
Subjt:  VGSSWGAGP-------GESSDITPLAIANKANRCMRSMVGWGRSKSENFPR---RICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQEEELNRIGG

Query:  LEDDQGVWQQEKTAVIQA---------LLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIK
        +  D  V   E    +           LL+PFTKEE+  AL Q HP KAPGPDG+   FY+  W IIG D+     ++L+    P  IN T I LIPK+K
Subjt:  LEDDQGVWQQEKTAVIQA---------LLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIK

Query:  ATRRMVDFRPISLCN------------------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRL
              +FRPI+LCN                        NQSAF+PGR + DNA++  E  H ++ R R       +KLDMSKAYDRVEW FL +++L +
Subjt:  ATRRMVDFRPISLCN------------------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRL

Query:  GFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEA
        GF  +W+++I+ CVSSVS+SF +NG   G VT +RGLRQGDPLSPYLF+L A   S ++    +  Q+ G +  RS P ISHLFFA+DSL+F +A + E 
Subjt:  GFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEA

Query:  LVIRELPVRYERASGQTINYEKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSI
         +I ++  RYE+ASGQ INYEKS V+FS       ++ ++ +L + +   H +YLG+PS   R+R+     + DRIW ++QGWK K  S AGKE+LLKS+
Subjt:  LVIRELPVRYERASGQTINYEKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSI

Query:  VQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQCCLL-----------------------
        +QAIP Y M  ++LP  +I++IH  MARFWW  S+  R+IHW SWDSLC  KC  G+GFRD+ +FN ALL  Q   L                       
Subjt:  VQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQCCLL-----------------------

Query:  ---------------W-GRELLVHGCRWRIGNGRATPIYGSNWLSNEFSLQIQSAPVLPLASTVSELFFATGGWNEAMLRAIFNESD
                       W  + LL  G  WRIGNG    I+   WL +E    I S     L      + F    W  +++ A+FN+ D
Subjt:  ---------------W-GRELLVHGCRWRIGNGRATPIYGSNWLSNEFSLQIQSAPVLPLASTVSELFFATGGWNEAMLRAIFNESD

XP_024172304.2 uncharacterized protein LOC112178381 [Rosa chinensis]3.5e-15130.55Show/hide
Query:  LGFDNCFCVDCH---------GRSGGLALMWVSSISFSLLSFSKNHIDGWIS--RDGRRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNA
        LG+ N F VDC           R+GGL L+W   I  +L +FS NHID  I    D  RWR TG YG    +L   +W+L++++      PWLIGGDFN 
Subjt:  LGFDNCFCVDCH---------GRSGGLALMWVSSISFSLLSFSKNHIDGWIS--RDGRRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNA

Query:  ILCQDEKDGG-------------CAD----------------TERPGGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDH-----------------
        IL   EK+GG             C +                  + GGE I  +LDR      W+DL+P   V HL  S+SDH                 
Subjt:  ILCQDEKDGG-------------CAD----------------TERPGGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDH-----------------

Query:  -----------RQLDLRQLVGSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQ
                    + +   +V   W +  G     T   I  +  +  +++  W   K  +    I     K+        ++   +  ++ E +L ++L 
Subjt:  -----------RQLDLRQLVGSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQ

Query:  EE-------------------------------ELNRIGGLEDDQGVWQQEKT-----------------------------------AVIQALLQPFTK
         E                               + N I GL ++ GVW  E +                                   A+   L++ F +
Subjt:  EE-------------------------------ELNRIGGLEDDQGVWQQEKT-----------------------------------AVIQALLQPFTK

Query:  EEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN-------------------
        EEIL AL Q HP KAPGPDG S  FY+ +W ++G D+I +    +N       +N T + LIPK+K    M   RPISLCN                   
Subjt:  EEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN-------------------

Query:  -----NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLS
              QSAF+PGR + DN++L FE  H L+RR  G   +  LKLDMSKAYDRVEW F+  +M  +GF Q WI  I+ CV++VS+SF LNGE  G +  +
Subjt:  -----NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLS

Query:  RGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEE
        RGLRQGD +SPYLFLLCA+GLS +L   E + ++ G  I    PSI+HLFFA+DS +F KA + E   ++E+   YE ASGQ +N++KS ++FS N    
Subjt:  RGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEE

Query:  AQQYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGS
         Q+ +++V  V R   H +YLGLP+ +S +++ A +F+ ++   +++ WK K  SVAGKEV++KS+VQ++P Y M+CF LP+ L +E+HR MA FWW  S
Subjt:  AQQYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGS

Query:  EEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQ---------------------------------------CCLLWGRELLVHGCRWRIGNGR
        E+ R+IHWL+WD +C+PK   GLGFR+ME FNQALLA Q                                         L+ G+ LL  G R+++G+G 
Subjt:  EEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQ---------------------------------------CCLLWGRELLVHGCRWRIGNGR

Query:  ATPIYGSNWLSNEFSLQIQSAPVLPLAS-TVSELFFA-TGGWNEAMLRAIF----------------NESDS--WKFFGEEWISTCSYTGYR--------
           ++   W+   +S +  S  +  L   TV++L    +  W    L  +F                N  D   W F      S  S  GY         
Subjt:  ATPIYGSNWLSNEFSLQIQSAPVLPLAS-TVSELFFA-TGGWNEAMLRAIF----------------NESDS--WKFFGEEWISTCSYTGYR--------

Query:  -DRPGSSNSERMRAWWSALWRG-LSPFV-------------LELRCGKKYVVGFQIC---------SLH---------------------QSFSHFRFEE
             +SNS+  +  W  +W   + P V              ++  G++  +  +IC         +LH                     ++ +    +E
Subjt:  -DRPGSSNSERMRAWWSALWRG-LSPFV-------------LELRCGKKYVVGFQIC---------SLH---------------------QSFSHFRFEE

Query:  IIGAMKDKLTGPDFEIVVIFWWSVWNLRNNLSW-GGQSDDRDLWAYSSDYLSAF-----HVGGRRFPAGDCCVWRPPPNRKLKLNIDASVRPDTGEAGGG
         +  M D L     +I  +  W++W+ RN L W GG  +      +S   LS +          + P G    W  PP  +LK+N+D + + + G  G G
Subjt:  IIGAMKDKLTGPDFEIVVIFWWSVWNLRNNLSW-GGQSDDRDLWAYSSDYLSAF-----HVGGRRFPAGDCCVWRPPPNRKLKLNIDASVRPDTGEAGGG

Query:  CVLRGSDG
         V+R   G
Subjt:  CVLRGSDG

XP_024195790.1 uncharacterized protein LOC112198938 [Rosa chinensis]5.4e-15233.01Show/hide
Query:  MSAAKRLLGFDNCFCVDC------HG----RSGGLALMWVSSISFSLLSFSKNHIDGWISR-DGRR-WRLTGFYGFPSADLHPQSWSLLSRLRGCADTPW
        M + +R LG+ N F V+C      HG    R+GGL L+W   I+ SL S+S +HID  ++   G + WR TG YG P       +W+L+  L G  + PW
Subjt:  MSAAKRLLGFDNCFCVDC------HG----RSGGLALMWVSSISFSLLSFSKNHIDGWISR-DGRR-WRLTGFYGFPSADLHPQSWSLLSRLRGCADTPW

Query:  LIGGDFNAILCQDEKDGGCADTERP-----------------------------GGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQLDLRQLV
        L+GGDFN IL   EK+GG    ER                               GE I  +LDR   N  W  ++P   V HL+ + SDH         
Subjt:  LIGGDFNAILCQDEKDGGCADTERP-----------------------------GGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQLDLRQLV

Query:  GSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQEEELNRIGGLEDDQGVWQQE
                       P+ +  +  R  R      + + E+F  R       V++     + S     ++ T       L E +L+ +  +  D     +E
Subjt:  GSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQEEELNRIGGLEDDQGVWQQE

Query:  KTAVIQALLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN-----
         T     L     +EE+  ALKQ HP+KAPGPDG S +FY+  WD++G D++++    LN       IN T + LIPK+K    +   RPISLCN     
Subjt:  KTAVIQALLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN-----

Query:  -------------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFS
                            QSAF+PGR + DN++L FE  H L+RR RG   +  LKLDMSKAYDRVEW FL  ++ +LGF   WI   + CV +VS+S
Subjt:  -------------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFS

Query:  FNLNGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINY
        F +NG+  G +  SRGLRQGD +SPYLFLLCA+ LS ++   ER   + G    R  P+ISHLFFA+DS IF KA  ++ L +R++  +YE+ SGQ INY
Subjt:  FNLNGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINY

Query:  EKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIK
        EKS ++FS N     Q  ++ VL V R   H +YLGLP+ +S ++  A +++ DR+  + QGW+ K  S AGKEVL+K++VQ+IP Y M+CF LP+ L  
Subjt:  EKSVVAFSPNTGEEAQQYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIK

Query:  EIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQ-----C----------------------------------CLLWGRE
        E+HR MARFWW     ER+IHW++WD LC PK   GLGFRDM LFN ALLA Q     C                                   ++ GR+
Subjt:  EIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQ-----C----------------------------------CLLWGRE

Query:  LLVHGCRWRIGNGRATPIYGSNWLS-----NEFSLQIQSAPVLPLASTVSELFFATGGWNEAMLRAI--------------------------FNESDSW
        LL  G R+++GNG    I+   W+        FS+ +Q A  L +   + E    TG W E +L  +                          F++   +
Subjt:  LLVHGCRWRIGNGRATPIYGSNWLS-----NEFSLQIQSAPVLPLASTVSELFFATGGWNEAMLRAI--------------------------FNESDSW

Query:  KFFGEEWISTCSYTGYRDRPGSSNSERMRAWWSALW----------------RGLSPF---------VLELRC------GKKYVVGFQICSLHQSF-SHF
               ++    T  R   GSS        W  LW                +G+ P          + ++RC       +  +  F+ C   ++F  H 
Subjt:  KFFGEEWISTCSYTGYRDRPGSSNSERMRAWWSALW----------------RGLSPF---------VLELRC------GKKYVVGFQICSLHQSF-SHF

Query:  RFEEIIGA------------MKDKLTGPDFEIVVIFWWSVWNLRNNLSW-GGQSDDRDLWAYSSDYLSAF-----HVGGRRFPAGDCCVWRPPPNRKLKL
          E   G             M D L G   E  ++  W VW  RNN+ W    S+  ++ A++  +L  +       GG+         W  PP  +LK+
Subjt:  RFEEIIGA------------MKDKLTGPDFEIVVIFWWSVWNLRNNLSW-GGQSDDRDLWAYSSDYLSAF-----HVGGRRFPAGDCCVWRPPPNRKLKL

Query:  NIDASVRPDTGEAGGGCVLRGSDG
        NID S   +TG  G G V+R  +G
Subjt:  NIDASVRPDTGEAGGGCVLRGSDG

TrEMBL top hitse value%identityAlignment
A0A2N9EWI8 Uncharacterized protein4.6e-15735.21Show/hide
Query:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI-SRDGRRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNAILCQDEKDG-
        LG   CF V+  G  GGLAL+W  S+   + S+SK+HID W+ S  G  WR TGFYG P       SW LL RL+G +D PWL+ GDFN I+  DEK G 
Subjt:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWI-SRDGRRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNAILCQDEKDG-

Query:  ----------------GCA-------------DTERPGGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDH--------------------------
                         C                 R   E + E+LDR      W DL+P   + H+ ++ SDH                          
Subjt:  ----------------GCA-------------DTERPGGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDH--------------------------

Query:  ----RQLDLRQLVGSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAI----TDLSTSDSRDLLVQTEAQLEEVLQEEE
            R+    + +  +W              ++ K  +C   ++ W ++     P+ I    +++        + ++  + R L      +L  +LQ+EE
Subjt:  ----RQLDLRQLVGSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAI----TDLSTSDSRDLLVQTEAQLEEVLQEEE

Query:  L-------------------------------NRIGGLEDDQGVWQQEKTAVI-----------------------------------QALLQPFTKEEI
        +                               N I G+ D Q VWQ+E+T +                                    Q LL+PFT+EE+
Subjt:  L-------------------------------NRIGGLEDDQGVWQQEKTAVI-----------------------------------QALLQPFTKEEI

Query:  LLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN----------------------
         +AL Q  P+KAPGPDG++  F++  W I+G+D+  + L  LN G    ++N T I LIPK+K+   M  FRPISLCN                      
Subjt:  LLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN----------------------

Query:  --NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGL
          +QSAF+PGR + DN ++ FE IH L+ +  G       KLDMSKAY+RVEW +L +IML+LGF ++W+ +I+ CV+SVS+S  +NG+  G V  SRGL
Subjt:  --NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGL

Query:  RQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEEAQQ
        RQGDPLSPYLFL+CA+GLS+LL   ER   I G  + R  P +SHLFFA+DSLIF +A +++   ++++   YERASGQ IN +K+ + FS N     + 
Subjt:  RQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEEAQQ

Query:  YISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEE
         I ++   S     ++YLGLP  + R++  A   +KDRIW ++QGWK KF S AGKE+L+K+++QAIP Y M+CF+LP  L  EI     RFWW     E
Subjt:  YISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEE

Query:  RRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQCCLL------------WGRELLVHGCRWRIGNGRATPIYGSNWLSNEFSLQIQSAPVLPL---A
        R+IHWLS   LC  K   G+GFRD++ FNQALLA Q   L              +++L  G RWR+GNG    I+   W+ +  + ++ S PV  L   A
Subjt:  RRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQCCLL------------WGRELLVHGCRWRIGNGRATPIYGSNWLSNEFSLQIQSAPVLPL---A

Query:  STVSELFFATGGWNEAMLRAIFNESD
        S  S +   T  WN  +L  IF   D
Subjt:  STVSELFFATGGWNEAMLRAIFNESD

A0A803PBD1 Uncharacterized protein1.1e-15531.2Show/hide
Query:  KRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISRDG-RRWRLTGFYGFPSADLHPQSWSLLSRLR-GCADTPWLIGGDFNAILCQDE
        K  LGFD  F V   G SGGL L+W S +  +++++S NHI  +IS D    +  T FYG P+     Q+W LLSR+   C   PW + GDFN  L  ++
Subjt:  KRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISRDG-RRWRLTGFYGFPSADLHPQSWSLLSRLR-GCADTPWLIGGDFNAILCQDE

Query:  KDG--------------------------GCADT---ERPGGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQLDL------------------
        K                            G A T      G     E+LD   GN+ W   +P+  +NHLD+  SDHR + L                  
Subjt:  KDG--------------------------GCADT---ERPGGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQLDL------------------

Query:  ------------RQLVGSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSA-ITDLSTSDSRDLLVQTEAQLEEVLQEEE
                      ++  +W + P  S+D + + +  ++N C   +  W       F  RI  A+Q ++ A  T+  +S+S       +A L+ +L +EE
Subjt:  ------------RQLVGSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSA-ITDLSTSDSRDLLVQTEAQLEEVLQEEE

Query:  L-------------------------------NRIGGLEDDQGVWQQEKTAVIQ----------------------ALLQP---------------FTKE
                                        N I  L  + G   Q+  +++                       AL  P               F+  
Subjt:  L-------------------------------NRIGGLEDDQGVWQQEKTAVIQ----------------------ALLQP---------------FTKE

Query:  EILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN--------------------
        E+  AL Q  P+KAPG DG +  FY+ +W ++G D+ ++ L VLN+       N T+I LIPK+K    + D+RPISLC                     
Subjt:  EILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN--------------------

Query:  ----NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSR
            NQSAF+ GR + DN ++  E +H +  + +G   W  +KLDM+KA+DRVEW FL ++M +  F  ++I +++ C+++ +F FN+NG+ +G VT SR
Subjt:  ----NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSR

Query:  GLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEEA
        G+RQGDPLSPYLFL+CA+GLS++L  VERR +  G RI RS P ISHL FA+DS++F K+       I+E   +Y+  SGQ +N++KS + FSPNT    
Subjt:  GLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEEA

Query:  QQYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSE
        Q  I   + +       +YLGLP    R++S +   +KDRIW  +  W GKF S  GKE+LLK+++QAIP Y M+ F+LP      + + MA FWW  ++
Subjt:  QQYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSE

Query:  EERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLA--------NQCCLL-------------------------------WGRELLVHGCRWRIGNGRA
         + ++HW SW SLC  K M GLGFR ++ FNQA+LA        NQ  LL                               WG+ LL  G   RIGNG  
Subjt:  EERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLA--------NQCCLL-------------------------------WGRELLVHGCRWRIGNGRA

Query:  TPIYGSNWLSNEFSLQIQSAPVLPLASTVSELFFATGGWNEAMLRAIFNES-----------DSWKFFGEEWISTCS-----YTGY--------RDRPGS
        T I+   W+ N    +I  + V P  + VS L    G WN  +LR  F +            D  K     W  + S      TGY        R +P S
Subjt:  TPIYGSNWLSNEFSLQIQSAPVLPLASTVSELFFATGGWNEAMLRAIFNES-----------DSWKFFGEEWISTCS-----YTGY--------RDRPGS

Query:  SNSERMRAWWSALWRGLSP-----FVLEL---------RCGKKYVVGFQICS----LHQSFSHFRFEEIIGAMKDKLTGPDFEIVVIFWWSVWNLRNNLS
        SN+E +  WW +LW    P     FV  L             ++ +   IC       +S  H  FE     MK K    +F + +   W  WN RN   
Subjt:  SNSERMRAWWSALWRGLSP-----FVLEL---------RCGKKYVVGFQICS----LHQSFSHFRFEEIIGAMKDKLTGPDFEIVVIFWWSVWNLRNNLS

Query:  WGGQ-SDDRDLWAYSSDYLSAFHVG-------GRRFPAGDCCVWRPPPNRKLKLNIDASVRPDTGEAGGGCVLRGSDGEVFMAVCLSLQRYWSVDLAEGW
        +  Q S    +   + DYL+ +           +    GD  VW PPP   LKLN DA+V       GGG ++R   G++  A   +         AEGW
Subjt:  WGGQ-SDDRDLWAYSSDYLSAFHVG-------GRRFPAGDCCVWRPPPNRKLKLNIDASVRPDTGEAGGGCVLRGSDGEVFMAVCLSLQRYWSVDLAEGW

Query:  AM
        A+
Subjt:  AM

A0A803PBM9 Uncharacterized protein1.8e-16134.29Show/hide
Query:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISR-DGRRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNAILCQDEKDGG
        LG+D CF V+ HG+SGGL L+W + +  +++SFS  HID +I + +G+ WR TGFYG P      +SW LL+R+      PW+IGGDFN IL   EK GG
Subjt:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISR-DGRRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNAILCQDEKDGG

Query:  -------------------CADTERPG----------GETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQLDLR---------------------
                             + E  G           E I+E+LDR  GN  W DL+P   VNHLD   SDH  L L                      
Subjt:  -------------------CADTERPG----------GETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQLDLR---------------------

Query:  ----------QLVGSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQEEE----
                  ++V  SW  G    S  T +A+ +K   C +++  W +S+     +R+    +K+   I   ST++     ++   Q   VL ++E    
Subjt:  ----------QLVGSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQEEE----

Query:  ----------------------------LNRIGGLEDDQGVWQQEKTAVIQA-------------------------------------LLQPFTKEEIL
                                     N I GL D    W      V Q                                      L  PFTKE++ 
Subjt:  ----------------------------LNRIGGLEDDQGVWQQEKTAVIQA-------------------------------------LLQPFTKEEIL

Query:  LALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN-----------------------
         A++  HP+KAPG DG+ G FY+ +W  IG ++ + CL +LN G     INDT+I LIPKI+   RM +FRPISLCN                       
Subjt:  LALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN-----------------------

Query:  -NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGLR
          QSAF+ GR + DNAI+GFE +H +++R  G  +   LKLDMSKAYDRVEW FL  +M  LG+ + WI+ I+ CV+SVSFS  +NGE++G+ T +RGLR
Subjt:  -NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGLR

Query:  QGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEEAQQY
        QGD LSPYLFL+C++GLS L+   ER   I G R G+    +SHLFFA+DS +F +  ++E   +  +  RY R SGQ IN EKS V+       +    
Subjt:  QGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEEAQQY

Query:  ISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEER
        ++  L VS    H +YLGLPS++ R +    + +KD++W +++ WK   FS AGKE+L+K+++QAIP Y+M+CFRLP+ LI  +H   A FWW  ++E +
Subjt:  ISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEER

Query:  RIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQCCLL---------------------------------------WGRELLVHGCRWRIGNGRATPI
        +IHW +W  LC PK   GLGFR +  FNQALLA Q   L                                       WGR+++V G RWR+GNGR   I
Subjt:  RIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQCCLL---------------------------------------WGRELLVHGCRWRIGNGRATPI

Query:  YGSNWLSNEFSLQIQSAPVLPLASTVSELFFATGGWNEAMLRAIFNESD-SWKF-----FGEEWISTCSY---------TGYRDR-------PGSSNSER
        +   W+             L   +T+ +L    G W   M+ A F++ D  W       F EE + T            +GYR            SN ++
Subjt:  YGSNWLSNEFSLQIQSAPVLPLASTVSELFFATGGWNEAMLRAIFNESD-SWKF-----FGEEWISTCSY---------TGYRDR-------PGSSNSER

Query:  MRAWWSALW
        ++AWW   W
Subjt:  MRAWWSALW

A0A803PF28 Uncharacterized protein1.5e-15530.97Show/hide
Query:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISRD-GRRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNAILCQDEKDGG
        LGF+ CF VD  G+SGGLAL+W    S  + S++ +HID  +  D G  WR TGFYG P      +SW LL RL+      W+ GGDFN I+ Q EK GG
Subjt:  LGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISRD-GRRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNAILCQDEKDGG

Query:  -------------------CADTERPGGE----------TIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQLDLRQLVGSS-------WGAG----
                            ++    GG+           I+E+LDR+  N  W   +    V+ L++  SDHR L L++ +G+        WG+     
Subjt:  -------------------CADTERPGGE----------TIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQLDLRQLVGSS-------WGAG----

Query:  ----------------PGESSDITPL-AIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQEEEL------
                         G+   + P+ A++   + C   +  W +++ +    R     +K++  I D +  D      Q E  L  V ++ E+      
Subjt:  ----------------PGESSDITPL-AIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQEEEL------

Query:  -------------------------NRIGGLEDDQGVWQQE-------------------------------------KTAVIQALLQPFTKEEILLALK
                                 N I GL D+   W+ +                                      T     LL+PF+ EE+ +A+ 
Subjt:  -------------------------NRIGGLEDDQGVWQQE-------------------------------------KTAVIQALLQPFTKEEILLALK

Query:  QTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN------------------------NQS
        Q +  KAPG DGL G F+   WDI+G ++I +CL +LN+      +N+T+I LIPK+     M +FRPISLCN                        NQS
Subjt:  QTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN------------------------NQS

Query:  AFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGLRQGDP
        AFI GR + DNAI+GFE +H +R+   G  K   +KLDMSKAYDRVEW FL  +M+ LG+  +WI  I+ CVSSVSFS  +NG   G     RGLRQGDP
Subjt:  AFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGLRQGDP

Query:  LSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEEAQQYISQV
        LSP+LFLLC++GLS L+   ER  +I G R G     +SHLFFA+DSLIF  A   E+  ++E+   YE  SGQ IN+EK+ +       E     ++  
Subjt:  LSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEEAQQYISQV

Query:  LSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHW
        L V+    H +YLG+P+++ +N+      ++D++  ++QGWK   FS AGKE+L+K+++QA+PCY M+CFR+ + ++ +I   +ARFWW  ++ + ++HW
Subjt:  LSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHW

Query:  LSWDSLCLPKCMSGLGFRDMELFNQALLANQ---------CCL------------------------------LWGRELLVHGCRWRIGNGRATPIYGSN
         +W  LC  K   G+GFR++E FNQA+LA Q         C L                              +WGRELL+ G RW +G+GR+  +    
Subjt:  LSWDSLCLPKCMSGLGFRDMELFNQALLANQ---------CCL------------------------------LWGRELLVHGCRWRIGNGRATPIYGSN

Query:  WLSNEFSLQIQSAPVLPLASTVSELFFATGGWNEAMLRAIFNESDSWKFFG-------EEWIS-------TCSY-TGYRDR---PGS---SNSERMRAWW
        WL    S  +++   +P   TVS L    G W    + + F++ D     G        +WI+       T S  +GY+ R   P +   SN   ++AWW
Subjt:  WLSNEFSLQIQSAPVLPLASTVSELFFATGGWNEAMLRAIFNESDSWKFFG-------EEWIS-------TCSY-TGYRDR---PGS---SNSERMRAWW

Query:  SALWRGL----------------SPFVLEL-RCGKKYVVGFQICSLHQSFSHFRFEEIIGAMKDKLTGPDFEIVVIFWWSVWNLRNNLSWGGQSDDRDLW
          +W  +                 P  +EL R G        +C+  +       E+I  A+ D +   DF   + F W VW  RN   +  +     +W
Subjt:  SALWRGL----------------SPFVLEL-RCGKKYVVGFQICSLHQSFSHFRFEEIIGAMKDKLTGPDFEIVVIFWWSVWNLRNNLSWGGQSDDRDLW

Query:  AYSSDYLSAFHVGGR-----RFPAGDCCVWRPPPNRKLKLNIDASVRPDTGEAGGGCVLRGSDGEVFMA
           +  L    +G       +        W PPP     +N DAS+           V+R S G + +A
Subjt:  AYSSDYLSAFHVGGR-----RFPAGDCCVWRPPPNRKLKLNIDASVRPDTGEAGGGCVLRGSDGEVFMA

A0A803PWX1 Uncharacterized protein4.1e-15831.17Show/hide
Query:  LLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISRDG-RRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNAILCQDEKDG
        +L FD+ + VD  G SGGL LMW   +   +L  S  HI   ++ +G   W LTGFYG P A L   SW LL  L+     PWL  GDFN I+   EK G
Subjt:  LLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISRDG-RRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNAILCQDEKDG

Query:  GCADTERP--------------------------------GGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQL--------------------
        G    +RP                                    I E+LDR   N  W   +    V  LD+  SDHR L                    
Subjt:  GCADTERP--------------------------------GGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQL--------------------

Query:  ------------DLRQLVGSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQEE
                    +  +++ + W    G      P++   K N+C +++  W + K       I +  +K+   +T        + +   E +L  +L+++
Subjt:  ------------DLRQLVGSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLSTSDSRDLLVQTEAQLEEVLQEE

Query:  E-------------------------------LNRIGGLEDDQGVWQQEKTAVIQ-------------------------------------ALLQPFTK
        E                                N I GL+D  GVWQ +K  V Q                                      L+  F+ 
Subjt:  E-------------------------------LNRIGGLEDDQGVWQQEKTAVIQ-------------------------------------ALLQPFTK

Query:  EEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN-------------------
        EE++ A+K  +P KAPG DGL   FY+  W  +  ++I  CL VLN+G     +NDT++ LIPK+   +++ +FRPISLCN                   
Subjt:  EEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN-------------------

Query:  -----NQSAFIPGRCVVDNAILGFECIHELRR-RMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTL
             +QSAF+ GR + DNAI+G+EC+H +R+ R R  TK A LKLDM+KAYDRVEW FL  +ML+LG+   W+  I+ C++SV FSF +NGE  G+VT 
Subjt:  -----NQSAFIPGRCVVDNAILGFECIHELRR-RMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTL

Query:  SRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGE
         RGLRQGDPLSP+LFLLCA+  S L+   E   Q+ G   GR   S+SHLFFA+DSL+F  A + E    +EL  +Y +ASGQ +N+ KS + F      
Subjt:  SRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGE

Query:  EAQQYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNG
          + +++ ++ V     + +YLGLPS++ R +    +F+K+++W +++GWKG FFS AGKEVL+K+++QAIP YTM+CFRLP+  I  IH   ARFWW  
Subjt:  EAQQYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNG

Query:  SEEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQC---------------------------------------CLLWGRELLVHGCRWRIGNG
        SE++ +IHW  WD LC  K   GLGFRD+ LFNQALLA Q                                         L+WG++++  G RWRIGNG
Subjt:  SEEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQC---------------------------------------CLLWGRELLVHGCRWRIGNG

Query:  RATPIYGSNWLSNEFSLQIQSAPVLPLASTVSEL------FFATGGWNEA----MLRAIFNESDS--------WKF------------FGEEWISTCSYT
         +  +    WL    + +I   P LP    V +L       FA  G+++       R + +++ +        WK                 WI T S  
Subjt:  RATPIYGSNWLSNEFSLQIQSAPVLPLASTVSEL------FFATGGWNEA----MLRAIFNESDS--------WKF------------FGEEWISTCSYT

Query:  GYRDRPGSSNSER---------MRAWWSA-----LWRGLSPFVLELRCGKKYVVGFQICSLHQSFSHFRFEEIIGAMKDKLTGPDFEIVVIFWWSVWNLR
         +R        +R           A W       +W G   +    R G++ V+ F                 +  +    T  +FE  +I  W++W +R
Subjt:  GYRDRPGSSNSER---------MRAWWSA-----LWRGLSPFVLELRCGKKYVVGFQICSLHQSFSHFRFEEIIGAMKDKLTGPDFEIVVIFWWSVWNLR

Query:  NNLSWGGQSDDRD-LWAYSSDYLSAF-------HVGGRRFPAGDCCVWRPPPNRKLKLNIDASVRPDTGEAGGGCVLRGSDGEVFMA
        N+++ GG       +  + S +L  F        VG RR  A     W PP     K+N+DA ++     AG   V+R  +G V +A
Subjt:  NNLSWGGQSDDRD-LWAYSSDYLSAF-------HVGGRRFPAGDCCVWRPPPNRKLKLNIDASVRPDTGEAGGGCVLRGSDGEVFMA

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.7e-2824.46Show/hide
Query:  IQALLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKI-KATRRMVDFRPISLCN--------
        +++L +P T  EI+  +      K+PGPDG +  FY+ + + +   +++   ++   G  P +  +  I+LIPK  + T +  +FRPISL N        
Subjt:  IQALLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKI-KATRRMVDFRPISLCN--------

Query:  ----------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNL
                        +Q  FIPG     N       I  + R          + +D  KA+D+++  F+ + + +LG    ++ +I       + +  L
Subjt:  ----------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNL

Query:  NGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKS
        NG+K+    L  G RQG PLSP LF +    L  L   + +  +I G ++G+    +S   FA+D +++ +     A  + +L   + + SG  IN +KS
Subjt:  NGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKS

Query:  VVAFSPNTGEEAQQYISQVLSVSRCPCHQQYLGL------PSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNC--FRLP
          AF  N   + +  I   L  +      +YLG+            N    LK +K+        WK    S  G+  ++K  +     Y  N    +LP
Subjt:  VVAFSPNTGEEAQQYISQVLSVSRCPCHQQYLGL------PSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNC--FRLP

Query:  RCLIKEIHRTMARFWWN
             E+ +T  +F WN
Subjt:  RCLIKEIHRTMARFWWN

P11369 LINE-1 retrotransposable element ORF2 protein1.3e-2824.56Show/hide
Query:  IQALLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLN-HGCSPGAINDTMIVLIPK-IKATRRMVDFRPISLCN-------
        +  L  P + +EI   +      K+PGPDG S  FY+   + + + I+      +   G  P +  +  I LIPK  K   ++ +FRPISL N       
Subjt:  IQALLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLN-HGCSPGAINDTMIVLIPK-IKATRRMVDFRPISLCN-------

Query:  -----------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFN
                         +Q  FIPG     N       IH + +          + LD  KA+D+++  F+ +++ R G    +++MI    S    +  
Subjt:  -----------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFN

Query:  LNGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEK
        +NGEK+  + L  G RQG PLSPYLF +    L  L   + ++ +I G +IG+    IS L  A+D +++    K+    +  L   +    G  IN  K
Subjt:  LNGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEK

Query:  SVVAFSPNTGEEAQQYISQVLSVSRCPCHQQYLG--LPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNC--FRLPRCL
        S +AF     ++A++ I +    S    + +YLG  L   +        K +K  I   ++ WK    S  G+  ++K  +     Y  N    ++P   
Subjt:  SVVAFSPNTGEEAQQYISQVLSVSRCPCHQQYLG--LPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNC--FRLPRCL

Query:  IKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALL
          E+   + +F WN  +       ++   L   +   G+   D++L+ +A++
Subjt:  IKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALL

P14381 Transposon TX1 uncharacterized 149 kDa protein3.6e-2624.28Show/hide
Query:  QALLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN----------
        + L  P T +E+  AL+    NK+PG DGL+  F++  WD +G D  +        G  P +    ++ L+PK    R + ++RP+SL +          
Subjt:  QALLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN----------

Query:  --------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNG
                      +QS  +PGR + DN  L  + +H  RR    +   A L LD  KA+DRV+  +L   +    F  Q++  +    +S      +N 
Subjt:  --------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNG

Query:  EKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKS--
             +   RG+RQG PLS  L+ L  +    LL     R ++TG  +      +    +A+D ++  + +  +    +E    Y  AS   IN+ KS  
Subjt:  EKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKS--

Query:  ------VVAFSPNTGEEAQ------QYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKG--KFFSVAGKEVLLKSIVQAIPCYTM
               V F P    +        +Y+   LS    P  Q ++ L               ++ +  ++  WKG  K  S+ G+ +++  +V +   Y +
Subjt:  ------VVAFSPNTGEEAQ------QYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKG--KFFSVAGKEVLLKSIVQAIPCYTM

Query:  NCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLG
         C    +  I +I R +  F W G       HW+S     LP    G G
Subjt:  NCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLG

P93295 Uncharacterized mitochondrial protein AtMg003101.7e-1532.89Show/hide
Query:  AIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPK-CMSGLGFRDMELFNQALLANQC---------------------------
        A+P Y M+CFRL + L K++   M  FWW+  E +R+I W++W  LC  K    GLGFRD+  FNQALLA Q                            
Subjt:  AIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPK-CMSGLGFRDMELFNQALLANQC---------------------------

Query:  ------------CLLWGRELLVHGCRWRIGNGRATPIYGSNWLSNEFSL
                     ++ GRELL  G    IG+G  T ++   W+ +E  L
Subjt:  ------------CLLWGRELLVHGCRWRIGNGRATPIYGSNWLSNEFSL

Q03278 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)1.2e-1327.72Show/hide
Query:  IQALLQPFTKEEI--LLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN-------
        I+ L +P + +EI  + A K+T    A GPDG++       W+ I   I      ++ HG  P    D+  VLIPK   T     FRP+S+ +       
Subjt:  IQALLQPFTKEEI--LLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDTMIVLIPKIKATRRMVDFRPISLCN-------

Query:  ---------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLN
                        Q AFI    V +N  L    I E R +++G+       LD+ KA+D VE   + + + R     +  + I+W   +      + 
Subjt:  ---------------NQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLN

Query:  GEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLI
          K   +  +RG+RQGDPLSP LF      +      + R  + TGF +G  +  I  L FA+D ++
Subjt:  GEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLI

Arabidopsis top hitse value%identityAlignment
AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.0e-1234.65Show/hide
Query:  QSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGLRQG
        Q++FIPGR   DN +   E +H +RR+ +G+  W  LKLD+ KAYDR+ W +L + ++  GF + W+  I    +  +F       +VG+   S+  R  
Subjt:  QSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNGEKVGQVTLSRGLRQG

Query:  D
        D
Subjt:  D

AT4G29090.1 Ribonuclease H-like superfamily protein1.1e-1729.35Show/hide
Query:  AIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQC----------------------------
        A+P YTM CF LP+ + K+I   +A FWW   +E + +HW +WD L   K   G+GF+D+E FN ALL  Q                             
Subjt:  AIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPKCMSGLGFRDMELFNQALLANQC----------------------------

Query:  ------CLLW-----GRELLVHGCRWRIGNGRATPIYGSNWLSNE---FSLQIQSAPVLPLAST-----VSELFFATG-GWNEAMLRAIFNESDSWKFFG
                +W      +E+L  G R  +GNG    I+   WL ++    +L++Q  P    AS      VS+L   +G  W + ++  +F E +  K  G
Subjt:  ------CLLW-----GRELLVHGCRWRIGNGRATPIYGSNWLSNE---FSLQIQSAPVLPLAST-----VSELFFATG-GWNEAMLRAIFNESDSWKFFG

Query:  E
        E
Subjt:  E

AT4G29090.1 Ribonuclease H-like superfamily protein5.6e-0629.75Show/hide
Query:  EIVVIFWWSVWNLRNNLSWGGQS-DDRDLWAYSSDYLSAFHVGGRRFPAG--------DCCVWRPPPNRKLKLNIDASVRPDTGEAGGGCVLRGSDGEVF
        ++V    W +W  RN L + G+  + +++   + D L  + +       G         C  WRPPP++ +K N DA+   D    G G VLR   GEV 
Subjt:  EIVVIFWWSVWNLRNNLSWGGQS-DDRDLWAYSSDYLSAFHVGGRRFPAG--------DCCVWRPPPNRKLKLNIDASVRPDTGEAGGGCVLRGSDGEVF

Query:  MAVCLSLQRYWSVDLAEGWAM
             +L +  SV  AE  AM
Subjt:  MAVCLSLQRYWSVDLAEGWAM

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein1.2e-1632.89Show/hide
Query:  AIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPK-CMSGLGFRDMELFNQALLANQC---------------------------
        A+P Y M+CFRL + L K++   M  FWW+  E +R+I W++W  LC  K    GLGFRD+  FNQALLA Q                            
Subjt:  AIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSWDSLCLPK-CMSGLGFRDMELFNQALLANQC---------------------------

Query:  ------------CLLWGRELLVHGCRWRIGNGRATPIYGSNWLSNEFSL
                     ++ GRELL  G    IG+G  T ++   W+ +E  L
Subjt:  ------------CLLWGRELLVHGCRWRIGNGRATPIYGSNWLSNEFSL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)7.3e-1450.72Show/hide
Query:  FNLNGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDS
        F +NG   G VT SRGLRQGDPLSPYLF+LC + LS L    + + ++ G R+  + P I+HL FA+D+
Subjt:  FNLNGEKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTGCTGCGAAGCGTTTGTTGGGGTTCGATAACTGCTTTTGTGTGGATTGTCATGGACGGAGTGGTGGGTTAGCCCTCATGTGGGTTTCCTCGATATCCTTCAGCCT
TCTTTCATTCTCCAAGAATCACATTGATGGATGGATCTCAAGGGATGGCCGTAGGTGGCGGCTCACTGGTTTCTATGGTTTCCCTTCCGCGGACTTGCATCCTCAGTCTT
GGTCTCTTCTATCTAGGCTGAGGGGCTGTGCTGATACACCGTGGCTGATTGGTGGGGATTTTAACGCCATCTTATGTCAGGATGAGAAGGATGGGGGCTGTGCTGATACA
GAACGACCGGGAGGTGAAACGATTTATGAGCAATTGGATCGGACCTTTGGAAACACACCTTGGACGGACCTCTATCCGAACTATATGGTTAACCATCTTGACTACAGCCG
GTCTGACCACAGGCAACTGGATTTGCGGCAGTTGGTTGGTAGTTCGTGGGGCGCAGGGCCTGGGGAGTCTAGTGATATAACGCCTTTGGCTATTGCCAATAAGGCTAATA
GATGTATGCGTTCGATGGTTGGTTGGGGTCGATCAAAGTCCGAGAATTTCCCGAGGCGCATCTGTAGTGCGAATCAAAAGGTTCAGTCGGCCATCACTGATCTTAGTACA
TCAGACTCTCGTGACTTACTTGTTCAGACTGAGGCTCAGTTGGAAGAGGTGTTGCAGGAGGAGGAGCTCAATCGCATTGGAGGGTTAGAGGATGATCAGGGAGTATGGCA
ACAGGAAAAGACTGCAGTTATTCAGGCACTATTGCAGCCATTTACCAAAGAGGAGATCCTGTTGGCTTTGAAGCAGACGCATCCTAATAAAGCCCCAGGTCCAGATGGGC
TGTCGGGAAGCTTTTACAAGAACCACTGGGACATTATTGGGCTAGATATCATTCAGAGTTGTTTGGCAGTTCTGAATCACGGATGTTCCCCAGGTGCCATTAATGACACT
ATGATTGTGCTCATCCCGAAAATCAAAGCGACCCGTCGGATGGTCGACTTTCGGCCTATCTCCCTTTGTAATAATCAGAGTGCATTTATTCCAGGTAGGTGTGTTGTGGA
CAATGCCATTCTGGGGTTTGAATGTATCCATGAGCTGCGGAGGAGAATGAGGGGGATGACCAAATGGGCTACGCTAAAACTGGACATGAGTAAAGCATATGATAGGGTGG
AATGGGCTTTTCTCCCGGAGATCATGCTACGATTGGGTTTTGCACAGCAATGGATTGATATGATCCTCTGGTGTGTCAGCTCGGTGTCCTTTTCTTTCAACCTGAATGGG
GAGAAGGTGGGGCAGGTGACACTGTCTAGGGGTCTCCGGCAGGGGGATCCCCTCTCCCCGTATCTGTTTCTATTATGTGCGAAAGGGTTGTCCAGTCTGTTGTGTGGAGT
TGAGCGGAGATCTCAGATTACTGGCTTTCGAATTGGGCGCTCTCGCCCATCGATTTCGCACCTCTTTTTTGCAAATGACAGCCTTATCTTTTTCAAGGCCATCAAGAGTG
AAGCGTTGGTCATTCGAGAGCTGCCGGTACGTTATGAGAGGGCGTCAGGGCAAACTATCAATTATGAGAAGTCCGTTGTTGCATTCAGCCCGAATACGGGGGAAGAGGCT
CAACAGTATATCAGCCAGGTTCTTTCTGTGTCTCGCTGCCCATGTCATCAGCAGTATCTAGGCTTACCTTCGTATATGTCACGTAACCGATCGGGGGCTTTGAAGTTTGT
AAAGGACCGTATTTGGTGCCAGATTCAGGGATGGAAGGGTAAATTCTTCTCGGTAGCGGGGAAGGAAGTTCTTCTTAAGTCCATAGTTCAGGCGATTCCTTGCTATACGA
TGAACTGTTTTCGGCTGCCTAGGTGCTTGATCAAGGAGATCCACAGGACTATGGCTAGATTTTGGTGGAATGGATCTGAAGAGGAGAGACGAATACATTGGCTGAGTTGG
GATTCTCTATGTCTTCCCAAGTGCATGAGTGGGTTGGGGTTCCGCGATATGGAACTTTTTAACCAAGCTCTGCTGGCTAATCAGTGTTGCTTGTTATGGGGGCGGGAGCT
TTTAGTTCATGGATGTCGCTGGAGGATTGGTAATGGGCGTGCTACGCCCATCTATGGCTCAAACTGGCTGTCGAATGAGTTTTCGCTTCAAATACAGTCAGCTCCAGTGC
TTCCCCTTGCTAGTACGGTGAGTGAGTTGTTCTTTGCGACTGGTGGATGGAATGAGGCTATGCTCAGAGCCATTTTCAATGAGTCTGATTCATGGAAATTTTTCGGTGAA
GAGTGGATATCGACTTGCTCATATACTGGCTACCGGGACCGACCTGGCTCCTCGAACTCCGAGAGAATGCGCGCATGGTGGTCCGCCCTTTGGAGAGGACTGTCTCCATT
TGTTCTGGAACTGCGCTGTGGTAAAAAATATGTGGTTGGGTTCCAAATTTGCTCCCTCCACCAATCCTTTTCCCATTTCAGGTTTGAGGAAATCATTGGGGCGATGAAGG
ACAAACTTACAGGGCCAGATTTTGAGATTGTGGTCATTTTTTGGTGGTCCGTGTGGAATCTTCGGAATAACCTGAGTTGGGGTGGCCAGTCAGACGACCGAGATCTCTGG
GCATATTCGAGTGATTACCTCAGTGCCTTCCATGTTGGTGGGAGGCGTTTCCCAGCAGGGGACTGCTGTGTATGGAGACCGCCCCCTAATAGGAAGCTGAAACTTAATAT
CGATGCTTCTGTAAGGCCAGATACAGGGGAAGCGGGGGGTGGTTGTGTGCTGCGTGGGTCTGATGGTGAGGTCTTTATGGCAGTCTGTTTGAGCTTACAGAGGTATTGGA
GTGTGGATTTGGCTGAGGGCTGGGCTATGTATAAAGGGGGAAACAAGGTGGCACATGTTCTGGCTAGCTTGGCCTTTTCATATGTTGATCGTGTTTGGCTTGAGGAGTGG
CCTAGCGAAATCGCGGACGTCCTGAGGAGTGATGTTGTTTCAGCTGCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTCTGCTGCGAAGCGTTTGTTGGGGTTCGATAACTGCTTTTGTGTGGATTGTCATGGACGGAGTGGTGGGTTAGCCCTCATGTGGGTTTCCTCGATATCCTTCAGCCT
TCTTTCATTCTCCAAGAATCACATTGATGGATGGATCTCAAGGGATGGCCGTAGGTGGCGGCTCACTGGTTTCTATGGTTTCCCTTCCGCGGACTTGCATCCTCAGTCTT
GGTCTCTTCTATCTAGGCTGAGGGGCTGTGCTGATACACCGTGGCTGATTGGTGGGGATTTTAACGCCATCTTATGTCAGGATGAGAAGGATGGGGGCTGTGCTGATACA
GAACGACCGGGAGGTGAAACGATTTATGAGCAATTGGATCGGACCTTTGGAAACACACCTTGGACGGACCTCTATCCGAACTATATGGTTAACCATCTTGACTACAGCCG
GTCTGACCACAGGCAACTGGATTTGCGGCAGTTGGTTGGTAGTTCGTGGGGCGCAGGGCCTGGGGAGTCTAGTGATATAACGCCTTTGGCTATTGCCAATAAGGCTAATA
GATGTATGCGTTCGATGGTTGGTTGGGGTCGATCAAAGTCCGAGAATTTCCCGAGGCGCATCTGTAGTGCGAATCAAAAGGTTCAGTCGGCCATCACTGATCTTAGTACA
TCAGACTCTCGTGACTTACTTGTTCAGACTGAGGCTCAGTTGGAAGAGGTGTTGCAGGAGGAGGAGCTCAATCGCATTGGAGGGTTAGAGGATGATCAGGGAGTATGGCA
ACAGGAAAAGACTGCAGTTATTCAGGCACTATTGCAGCCATTTACCAAAGAGGAGATCCTGTTGGCTTTGAAGCAGACGCATCCTAATAAAGCCCCAGGTCCAGATGGGC
TGTCGGGAAGCTTTTACAAGAACCACTGGGACATTATTGGGCTAGATATCATTCAGAGTTGTTTGGCAGTTCTGAATCACGGATGTTCCCCAGGTGCCATTAATGACACT
ATGATTGTGCTCATCCCGAAAATCAAAGCGACCCGTCGGATGGTCGACTTTCGGCCTATCTCCCTTTGTAATAATCAGAGTGCATTTATTCCAGGTAGGTGTGTTGTGGA
CAATGCCATTCTGGGGTTTGAATGTATCCATGAGCTGCGGAGGAGAATGAGGGGGATGACCAAATGGGCTACGCTAAAACTGGACATGAGTAAAGCATATGATAGGGTGG
AATGGGCTTTTCTCCCGGAGATCATGCTACGATTGGGTTTTGCACAGCAATGGATTGATATGATCCTCTGGTGTGTCAGCTCGGTGTCCTTTTCTTTCAACCTGAATGGG
GAGAAGGTGGGGCAGGTGACACTGTCTAGGGGTCTCCGGCAGGGGGATCCCCTCTCCCCGTATCTGTTTCTATTATGTGCGAAAGGGTTGTCCAGTCTGTTGTGTGGAGT
TGAGCGGAGATCTCAGATTACTGGCTTTCGAATTGGGCGCTCTCGCCCATCGATTTCGCACCTCTTTTTTGCAAATGACAGCCTTATCTTTTTCAAGGCCATCAAGAGTG
AAGCGTTGGTCATTCGAGAGCTGCCGGTACGTTATGAGAGGGCGTCAGGGCAAACTATCAATTATGAGAAGTCCGTTGTTGCATTCAGCCCGAATACGGGGGAAGAGGCT
CAACAGTATATCAGCCAGGTTCTTTCTGTGTCTCGCTGCCCATGTCATCAGCAGTATCTAGGCTTACCTTCGTATATGTCACGTAACCGATCGGGGGCTTTGAAGTTTGT
AAAGGACCGTATTTGGTGCCAGATTCAGGGATGGAAGGGTAAATTCTTCTCGGTAGCGGGGAAGGAAGTTCTTCTTAAGTCCATAGTTCAGGCGATTCCTTGCTATACGA
TGAACTGTTTTCGGCTGCCTAGGTGCTTGATCAAGGAGATCCACAGGACTATGGCTAGATTTTGGTGGAATGGATCTGAAGAGGAGAGACGAATACATTGGCTGAGTTGG
GATTCTCTATGTCTTCCCAAGTGCATGAGTGGGTTGGGGTTCCGCGATATGGAACTTTTTAACCAAGCTCTGCTGGCTAATCAGTGTTGCTTGTTATGGGGGCGGGAGCT
TTTAGTTCATGGATGTCGCTGGAGGATTGGTAATGGGCGTGCTACGCCCATCTATGGCTCAAACTGGCTGTCGAATGAGTTTTCGCTTCAAATACAGTCAGCTCCAGTGC
TTCCCCTTGCTAGTACGGTGAGTGAGTTGTTCTTTGCGACTGGTGGATGGAATGAGGCTATGCTCAGAGCCATTTTCAATGAGTCTGATTCATGGAAATTTTTCGGTGAA
GAGTGGATATCGACTTGCTCATATACTGGCTACCGGGACCGACCTGGCTCCTCGAACTCCGAGAGAATGCGCGCATGGTGGTCCGCCCTTTGGAGAGGACTGTCTCCATT
TGTTCTGGAACTGCGCTGTGGTAAAAAATATGTGGTTGGGTTCCAAATTTGCTCCCTCCACCAATCCTTTTCCCATTTCAGGTTTGAGGAAATCATTGGGGCGATGAAGG
ACAAACTTACAGGGCCAGATTTTGAGATTGTGGTCATTTTTTGGTGGTCCGTGTGGAATCTTCGGAATAACCTGAGTTGGGGTGGCCAGTCAGACGACCGAGATCTCTGG
GCATATTCGAGTGATTACCTCAGTGCCTTCCATGTTGGTGGGAGGCGTTTCCCAGCAGGGGACTGCTGTGTATGGAGACCGCCCCCTAATAGGAAGCTGAAACTTAATAT
CGATGCTTCTGTAAGGCCAGATACAGGGGAAGCGGGGGGTGGTTGTGTGCTGCGTGGGTCTGATGGTGAGGTCTTTATGGCAGTCTGTTTGAGCTTACAGAGGTATTGGA
GTGTGGATTTGGCTGAGGGCTGGGCTATGTATAAAGGGGGAAACAAGGTGGCACATGTTCTGGCTAGCTTGGCCTTTTCATATGTTGATCGTGTTTGGCTTGAGGAGTGG
CCTAGCGAAATCGCGGACGTCCTGAGGAGTGATGTTGTTTCAGCTGCATGA
Protein sequenceShow/hide protein sequence
MSAAKRLLGFDNCFCVDCHGRSGGLALMWVSSISFSLLSFSKNHIDGWISRDGRRWRLTGFYGFPSADLHPQSWSLLSRLRGCADTPWLIGGDFNAILCQDEKDGGCADT
ERPGGETIYEQLDRTFGNTPWTDLYPNYMVNHLDYSRSDHRQLDLRQLVGSSWGAGPGESSDITPLAIANKANRCMRSMVGWGRSKSENFPRRICSANQKVQSAITDLST
SDSRDLLVQTEAQLEEVLQEEELNRIGGLEDDQGVWQQEKTAVIQALLQPFTKEEILLALKQTHPNKAPGPDGLSGSFYKNHWDIIGLDIIQSCLAVLNHGCSPGAINDT
MIVLIPKIKATRRMVDFRPISLCNNQSAFIPGRCVVDNAILGFECIHELRRRMRGMTKWATLKLDMSKAYDRVEWAFLPEIMLRLGFAQQWIDMILWCVSSVSFSFNLNG
EKVGQVTLSRGLRQGDPLSPYLFLLCAKGLSSLLCGVERRSQITGFRIGRSRPSISHLFFANDSLIFFKAIKSEALVIRELPVRYERASGQTINYEKSVVAFSPNTGEEA
QQYISQVLSVSRCPCHQQYLGLPSYMSRNRSGALKFVKDRIWCQIQGWKGKFFSVAGKEVLLKSIVQAIPCYTMNCFRLPRCLIKEIHRTMARFWWNGSEEERRIHWLSW
DSLCLPKCMSGLGFRDMELFNQALLANQCCLLWGRELLVHGCRWRIGNGRATPIYGSNWLSNEFSLQIQSAPVLPLASTVSELFFATGGWNEAMLRAIFNESDSWKFFGE
EWISTCSYTGYRDRPGSSNSERMRAWWSALWRGLSPFVLELRCGKKYVVGFQICSLHQSFSHFRFEEIIGAMKDKLTGPDFEIVVIFWWSVWNLRNNLSWGGQSDDRDLW
AYSSDYLSAFHVGGRRFPAGDCCVWRPPPNRKLKLNIDASVRPDTGEAGGGCVLRGSDGEVFMAVCLSLQRYWSVDLAEGWAMYKGGNKVAHVLASLAFSYVDRVWLEEW
PSEIADVLRSDVVSAA