CuGenDBv2

Lag0021190 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0021190
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr7:5407655..5410092
RNA-Seq Expression: Lag0021190
Synteny: Lag0021190
Gene Ontology terms:
GO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036397 - Ribonuclease H superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology
GenBank top hits: e value, % identity, alignment
ONI01138.1 hypothetical protein PRUPE_6G123900 [Prunus persica]; e value 3.8e-136; identity 38.36%
Query:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------
        M+KAYDRVEWVFL  +MLK+GF+  WV  +  CIS+  FS    G   G ++P RGLRQG PLSP                                   
Subjt:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------

Query:  --------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQI
                      A   D  A+  +   YE V+GQ +N+ KS +S SP+        +  +L V V   H  YLGLP+   + R      +KD++W+ I
Subjt:  --------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQI

Query:  QGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQN
         GWK KL S  G+E+L+K+V+QAIP YSM+CFR+PK L  E++  MARFWW   K  RGIHWV W+ LCK K  GG+GFRD+E FNQALLAKQCWRI++ 
Subjt:  QGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQN

Query:  PSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIR
        P SL+AR+ + RY P   FL A VGT PSFIWRSL WG+ELL  GLRW+VG+G S+ VY   WLP    F++ S   L   TRV DL T+SGQWN  L++
Subjt:  PSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIR

Query:  QNFSPQEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLG--QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVR
          F  QEV  IL I L      D ++WHYE++G++SVKSGYRL   +   ++  PS+  + ++  +WK +W + IPNKIK FLWR   D LP    L  R
Subjt:  QNFSPQEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLG--QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVR

Query:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREVKDKGPRTSGWASRMGDELYLVVPAGDLGLW--------------
              +C  C R  ES LH  W C+  K V   S +G +    +  S  +L   +     + S      G   YL       GLW              
Subjt:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREVKDKGPRTSGWASRMGDELYLVVPAGDLGLW--------------

Query:  ---------------CWVRSVLAREDVRWSPPEA---GWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRS--PEMAEGWAAVKGTRLAV
                           ++L     R S P+A   GW       S    R   G+GVVVR+++G  M  A+ V+R   S      E  A ++G R A+
Subjt:  ---------------CWVRSVLAREDVRWSPPEA---GWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRS--PEMAEGWAAVKGTRLAV

Query:  EMGLGPLMLETDSSR-VASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA
        +MG    +LE D+   + S F  E  +     G L+ E+   + +     C +T R GN+VAH LA  A
Subjt:  EMGLGPLMLETDSSR-VASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA

VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]; e value 5.7e-140; identity 38.25%
Query:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------
        M+KAYDRVEWVFL ++MLK+GF+  WV  +  CIS+  FS    G   G ++P RGLRQG PLSP                                   
Subjt:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------

Query:  --------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQI
                      A      A+  +   YE VSGQ +N+ KS  S SP+        +  +L V V   H +YLGLP+   + R      +KD++W+ I
Subjt:  --------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQI

Query:  QGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQN
         GWK KL S  G+E+L+K+V+QAIP YSM+CFR+PK L  E++  MARFWW   K  RGIHWV W+ LCK K  GG+GFRD+E FNQALLAKQCWRI++ 
Subjt:  QGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQN

Query:  PSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIR
        P SL+AR+ + RY P   FL A VGT PSFIWRSL WG+ELL  GLRW+VGNG S+ VY   WLP    F++ S   L   T V DL T+SGQWN  L++
Subjt:  PSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIR

Query:  QNFSPQEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLG--QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVR
          F  QEV   L I L      D ++WHYE++G++SVKSGYRL   +   ++  PS   + ++  +WK +W + IPNKIK FLWR   D LP    L  R
Subjt:  QNFSPQEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLG--QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVR

Query:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREV----------------------------KDKGPRTSGWASRMGD
              +C  C R  ES LH  W C+  K V   S +G +    +  S  +L   +                            + K    +    RM  
Subjt:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREV----------------------------KDKGPRTSGWASRMGD

Query:  ELYLVVPAGDLGLWCWVRSVLAREDVR-WSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRS--PEMAEGWAAVKGTRLAVEMG
               A +L      R    +  +  W PP AG YK+NVD + +      G+GVVVR+++G  M  A+ V+R   S      E  A ++G R A++MG
Subjt:  ELYLVVPAGDLGLWCWVRSVLAREDVR-WSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRS--PEMAEGWAAVKGTRLAVEMG

Query:  LGPLMLETDSSR-VASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA
            +LE D+   + S    E  +     G L+ E+   + +     C++T R GN+VAH LA  A
Subjt:  LGPLMLETDSSR-VASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA

XP_022150918.1 uncharacterized protein LOC111018954 [Momordica charantia]; e value 3.5e-158; identity 42.24%
Query:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLC-ISSVRFSFNVNGIRCGG---------VVPSRGLR-QGDPLSPANESDATAVRGILDCYERVSGQTV
        MSKAYDRVEW FLE +MLKMGF   W    + C +  +         R G          ++ S  LR  G  +S A  +           Y + +GQ  
Subjt:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLC-ISSVRFSFNVNGIRCGG---------VVPSRGLR-QGDPLSPANESDATAVRGILDCYERVSGQTV

Query:  NFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKL
            ++I   P    R  + +  IL V +     QYLGLP+FMPR+R    ++IKDRVW+ +QGWK KLFS+GG+EVL+K+V QAIPCY+M+CFRLPK+L
Subjt:  NFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKL

Query:  ILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWG
        I E     ARFWWG  K D+ IHWV+W SL  PKC GGMGFRD+E+FN+ALLAKQCWRI+ +P+S+L+RVLKGRYF    F+ A +   PS+IWRS+LWG
Subjt:  ILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWG

Query:  RELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMT-ASGQWNEGLIRQNFSPQEVGLILSILVRAGA-EDKIVWHYEKSGLFSV
        R+LL+ GLRW++GNG+SV +YG NW+P+    ++ SS  L   +RV+ L+    G W   ++R  F+P E   ILSI +  GA ED+++W+YEK+G++SV
Subjt:  RELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMT-ASGQWNEGLIRQNFSPQEVGLILSILVRAGA-EDKIVWHYEKSGLFSV

Query:  KSGYRLG-QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFG
        +SGY++   +    Q PSSSS+E    WW G WKM IPNKIK+FLWRLCLDRLPT  NL  RG ++ N C  C R GE S+H+FW CKF +A+ + S+FG
Subjt:  KSGYRLG-QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFG

Query:  GLIHNVQAGSMFDLLREVKDKGPRTSG----------WASR----MGDELYLVVPAG-DLGLWCWVRSVLARE--------------DVRWSPPEAGWYK
         L       S F +LRE  +   +             W  R      D    V   G +L  W    ++  RE              ++ W PP+ G YK
Subjt:  GLIHNVQAGSMFDLLREVKDKGPRTSG----------WASR----MGDELYLVVPAG-DLGLWCWVRSVLARE--------------DVRWSPPEAGWYK

Query:  VNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLMLETDSSRVASFFQDEAGDDFSDVGALVAELRRDMP
        +N DASF      AGLG+++ +  G+VM +A+    +++S +MAE  AAV+G +LA E+G+ P +                 +D S+ G +V + +    
Subjt:  VNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLMLETDSSRVASFFQDEAGDDFSDVGALVAELRRDMP

Query:  DPSFFGCRFTRREGNEVAHQLA
                F +REGN+ AH LA
Subjt:  DPSFFGCRFTRREGNEVAHQLA

XP_023889222.1 LOW QUALITY PROTEIN: uncharacterized protein LOC112001275 [Quercus suber]; e value 4.5e-129; identity 35.8%
Query:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------
        MSKAYDRVEW+FLEKIM KMGF  +WV L+  CI++V +S  +NG     + PSRG+RQGDPLSP                                   
Subjt:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------

Query:  --------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQI
                      A+  +   V+ +L CYE+ SGQ +N  K+ + FS +T   +  Q+   L VQ      +YLGLP+ + +++  S  +IK+RVW ++
Subjt:  --------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQI

Query:  QGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQN
        QGWK +L S  GREVLLK+V+QAIP Y+M+CF+LP  L  EI   + +FWWG     R IHW  W SLCKPK  GGMGFRD++ FN+A+LAKQ WR++ N
Subjt:  QGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQN

Query:  PSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSG-TLAPETRVADLMTASGQ-WNEGL
          SL  R  K ++FP G  L A  G   SF W+S+L GR +++ G+ W+VGNG S+ +Y  NWLPD    ++ S         +V+ LM   GQ W++ +
Subjt:  PSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSG-TLAPETRVADLMTASGQ-WNEGL

Query:  IRQNFSPQEVGLILSILVRAG-AEDKIVWHYEKSGLFSVKSGYRLGQSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVR
        I  NF P E  +I +I +  G   D   W     G++SVKSGY+L     + + P SS        WKG+W + +PN++K  LWR   D LP+  NL  R
Subjt:  IRQNFSPQEVGLILSILVRAG-AEDKIVWHYEKSGLFSVKSGYRLGQSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVR

Query:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNV-QAGSMFDLLREVKDKG--------PRTSGWAS----RMGDE---LYLVVPAGDLG
           +   C  C    E+SLH  W C  ++ +  +  FG L++      S  D+L+   +K           +  W      R+G++   L ++       
Subjt:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNV-QAGSMFDLLREVKDKG--------PRTSGWAS----RMGDE---LYLVVPAGDLG

Query:  LWCWVRSVLAREDV-------RWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLMLET
        L  +++S L    V       +W PP + W K+N D +   +   AGLG ++R+  G VM + + +     S EM E  AA      A E+G   +++E 
Subjt:  LWCWVRSVLAREDV-------RWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLMLET

Query:  DSSRVASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA
        DS  V        G  FS +G +V +++             TRR+GN VAH LA LA
Subjt:  DSSRVASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA

XP_024956542.1 uncharacterized protein LOC112498908 [Citrus sinensis]; e value 9.1e-130; identity 35.05%
Query:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------
        +SKAYDR+EW FLE+IM ++GF+ +W+ LI  CISSV FS  +NG   G + P RGLRQG P+SP                                   
Subjt:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------

Query:  -------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQIQ
                     A+ +D   ++ ILDCY   SGQ  NF+KS +  S +      + +G I  + + + +  YLGLP+ + R R+   + IK +V  +I 
Subjt:  -------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQIQ

Query:  GWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQNP
         W+ K FS GG+EVL+K+ VQAIP ++M+ F++P  +  +I R +  FWWG  +  R IHW  W+ + + KC GGMGFRD   FNQALLAKQ WRI Q P
Subjt:  GWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQNP

Query:  SSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIRQ
         SL+ARVL+ RYF   +FL A +G+ PS+IWRS+LWGR+++  G RW++GNG+ V ++  NW+P    F+     T+  E  V++L+     W+E LI +
Subjt:  SSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIRQ

Query:  NFSPQEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLGQSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCD
        +F   +  +I  I L R   ED+++WH+ KSG ++VKSGY+          PSSS  ES+   W  +W + +P KI+IF+WR   + LP+ +NL  R   
Subjt:  NFSPQEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLGQSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCD

Query:  VLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREVKDKGPRTSG-------WASRMGDELYL----------VVPAGDLGLW
            C LC+ G E+  H    CK  K V   S F   I       +  LL  VK               WA       +L          VV   +  + 
Subjt:  VLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREVKDKGPRTSG-------WASRMGDELYL----------VVPAGDLGLW

Query:  CWVRSVLAREDVR-----------WSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLML
         + R V    DV            W+PP+ G+ K+N DA+   E+  AGLG V+RD +G+V  +A  V +   S   AE  A   G ++A +  +  +++
Subjt:  CWVRSVLAREDVR-----------WSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLML

Query:  ETDSSRVASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA
        E+DS  V S   +  G   S++  +V E+++         C +T R  N +AH L  +A
Subjt:  ETDSSRVASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA

TrEMBL top hits: e value, % identity, alignment
A0A251NPF0 Reverse transcriptase domain-containing protein; e value 1.8e-136; identity 38.36%
Query:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------
        M+KAYDRVEWVFL  +MLK+GF+  WV  +  CIS+  FS    G   G ++P RGLRQG PLSP                                   
Subjt:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------

Query:  --------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQI
                      A   D  A+  +   YE V+GQ +N+ KS +S SP+        +  +L V V   H  YLGLP+   + R      +KD++W+ I
Subjt:  --------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQI

Query:  QGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQN
         GWK KL S  G+E+L+K+V+QAIP YSM+CFR+PK L  E++  MARFWW   K  RGIHWV W+ LCK K  GG+GFRD+E FNQALLAKQCWRI++ 
Subjt:  QGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQN

Query:  PSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIR
        P SL+AR+ + RY P   FL A VGT PSFIWRSL WG+ELL  GLRW+VG+G S+ VY   WLP    F++ S   L   TRV DL T+SGQWN  L++
Subjt:  PSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIR

Query:  QNFSPQEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLG--QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVR
          F  QEV  IL I L      D ++WHYE++G++SVKSGYRL   +   ++  PS+  + ++  +WK +W + IPNKIK FLWR   D LP    L  R
Subjt:  QNFSPQEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLG--QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVR

Query:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREVKDKGPRTSGWASRMGDELYLVVPAGDLGLW--------------
              +C  C R  ES LH  W C+  K V   S +G +    +  S  +L   +     + S      G   YL       GLW              
Subjt:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREVKDKGPRTSGWASRMGDELYLVVPAGDLGLW--------------

Query:  ---------------CWVRSVLAREDVRWSPPEA---GWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRS--PEMAEGWAAVKGTRLAV
                           ++L     R S P+A   GW       S    R   G+GVVVR+++G  M  A+ V+R   S      E  A ++G R A+
Subjt:  ---------------CWVRSVLAREDVRWSPPEA---GWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRS--PEMAEGWAAVKGTRLAV

Query:  EMGLGPLMLETDSSR-VASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA
        +MG    +LE D+   + S F  E  +     G L+ E+   + +     C +T R GN+VAH LA  A
Subjt:  EMGLGPLMLETDSSR-VASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA

A0A5E4FZN9 PREDICTED: retrotransposon; e value 2.7e-140; identity 38.25%
Query:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------
        M+KAYDRVEWVFL ++MLK+GF+  WV  +  CIS+  FS    G   G ++P RGLRQG PLSP                                   
Subjt:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------

Query:  --------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQI
                      A      A+  +   YE VSGQ +N+ KS  S SP+        +  +L V V   H +YLGLP+   + R      +KD++W+ I
Subjt:  --------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQI

Query:  QGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQN
         GWK KL S  G+E+L+K+V+QAIP YSM+CFR+PK L  E++  MARFWW   K  RGIHWV W+ LCK K  GG+GFRD+E FNQALLAKQCWRI++ 
Subjt:  QGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQN

Query:  PSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIR
        P SL+AR+ + RY P   FL A VGT PSFIWRSL WG+ELL  GLRW+VGNG S+ VY   WLP    F++ S   L   T V DL T+SGQWN  L++
Subjt:  PSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIR

Query:  QNFSPQEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLG--QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVR
          F  QEV   L I L      D ++WHYE++G++SVKSGYRL   +   ++  PS   + ++  +WK +W + IPNKIK FLWR   D LP    L  R
Subjt:  QNFSPQEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLG--QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVR

Query:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREV----------------------------KDKGPRTSGWASRMGD
              +C  C R  ES LH  W C+  K V   S +G +    +  S  +L   +                            + K    +    RM  
Subjt:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREV----------------------------KDKGPRTSGWASRMGD

Query:  ELYLVVPAGDLGLWCWVRSVLAREDVR-WSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRS--PEMAEGWAAVKGTRLAVEMG
               A +L      R    +  +  W PP AG YK+NVD + +      G+GVVVR+++G  M  A+ V+R   S      E  A ++G R A++MG
Subjt:  ELYLVVPAGDLGLWCWVRSVLAREDVR-WSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRS--PEMAEGWAAVKGTRLAVEMG

Query:  LGPLMLETDSSR-VASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA
            +LE D+   + S    E  +     G L+ E+   + +     C++T R GN+VAH LA  A
Subjt:  LGPLMLETDSSR-VASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA

A0A6J1DAR4 uncharacterized protein LOC111018954; e value 1.7e-158; identity 42.24%
Query:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLC-ISSVRFSFNVNGIRCGG---------VVPSRGLR-QGDPLSPANESDATAVRGILDCYERVSGQTV
        MSKAYDRVEW FLE +MLKMGF   W    + C +  +         R G          ++ S  LR  G  +S A  +           Y + +GQ  
Subjt:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLC-ISSVRFSFNVNGIRCGG---------VVPSRGLR-QGDPLSPANESDATAVRGILDCYERVSGQTV

Query:  NFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKL
            ++I   P    R  + +  IL V +     QYLGLP+FMPR+R    ++IKDRVW+ +QGWK KLFS+GG+EVL+K+V QAIPCY+M+CFRLPK+L
Subjt:  NFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKL

Query:  ILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWG
        I E     ARFWWG  K D+ IHWV+W SL  PKC GGMGFRD+E+FN+ALLAKQCWRI+ +P+S+L+RVLKGRYF    F+ A +   PS+IWRS+LWG
Subjt:  ILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWG

Query:  RELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMT-ASGQWNEGLIRQNFSPQEVGLILSILVRAGA-EDKIVWHYEKSGLFSV
        R+LL+ GLRW++GNG+SV +YG NW+P+    ++ SS  L   +RV+ L+    G W   ++R  F+P E   ILSI +  GA ED+++W+YEK+G++SV
Subjt:  RELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMT-ASGQWNEGLIRQNFSPQEVGLILSILVRAGA-EDKIVWHYEKSGLFSV

Query:  KSGYRLG-QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFG
        +SGY++   +    Q PSSSS+E    WW G WKM IPNKIK+FLWRLCLDRLPT  NL  RG ++ N C  C R GE S+H+FW CKF +A+ + S+FG
Subjt:  KSGYRLG-QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFG

Query:  GLIHNVQAGSMFDLLREVKDKGPRTSG----------WASR----MGDELYLVVPAG-DLGLWCWVRSVLARE--------------DVRWSPPEAGWYK
         L       S F +LRE  +   +             W  R      D    V   G +L  W    ++  RE              ++ W PP+ G YK
Subjt:  GLIHNVQAGSMFDLLREVKDKGPRTSG----------WASR----MGDELYLVVPAG-DLGLWCWVRSVLARE--------------DVRWSPPEAGWYK

Query:  VNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLMLETDSSRVASFFQDEAGDDFSDVGALVAELRRDMP
        +N DASF      AGLG+++ +  G+VM +A+    +++S +MAE  AAV+G +LA E+G+ P +                 +D S+ G +V + +    
Subjt:  VNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLMLETDSSRVASFFQDEAGDDFSDVGALVAELRRDMP

Query:  DPSFFGCRFTRREGNEVAHQLA
                F +REGN+ AH LA
Subjt:  DPSFFGCRFTRREGNEVAHQLA

M5W5F3 Reverse transcriptase domain-containing protein (Fragment); e value 1.8e-136; identity 38.36%
Query:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------
        M+KAYDRVEWVFL  +MLK+GF+  WV  +  CIS+  FS    G   G ++P RGLRQG PLSP                                   
Subjt:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-----------------------------------

Query:  --------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQI
                      A   D  A+  +   YE V+GQ +N+ KS +S SP+        +  +L V V   H  YLGLP+   + R      +KD++W+ I
Subjt:  --------------ANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQI

Query:  QGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQN
         GWK KL S  G+E+L+K+V+QAIP YSM+CFR+PK L  E++  MARFWW   K  RGIHWV W+ LCK K  GG+GFRD+E FNQALLAKQCWRI++ 
Subjt:  QGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQN

Query:  PSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIR
        P SL+AR+ + RY P   FL A VGT PSFIWRSL WG+ELL  GLRW+VG+G S+ VY   WLP    F++ S   L   TRV DL T+SGQWN  L++
Subjt:  PSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIR

Query:  QNFSPQEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLG--QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVR
          F  QEV  IL I L      D ++WHYE++G++SVKSGYRL   +   ++  PS+  + ++  +WK +W + IPNKIK FLWR   D LP    L  R
Subjt:  QNFSPQEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLG--QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVR

Query:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREVKDKGPRTSGWASRMGDELYLVVPAGDLGLW--------------
              +C  C R  ES LH  W C+  K V   S +G +    +  S  +L   +     + S      G   YL       GLW              
Subjt:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREVKDKGPRTSGWASRMGDELYLVVPAGDLGLW--------------

Query:  ---------------CWVRSVLAREDVRWSPPEA---GWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRS--PEMAEGWAAVKGTRLAV
                           ++L     R S P+A   GW       S    R   G+GVVVR+++G  M  A+ V+R   S      E  A ++G R A+
Subjt:  ---------------CWVRSVLAREDVRWSPPEA---GWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRS--PEMAEGWAAVKGTRLAV

Query:  EMGLGPLMLETDSSR-VASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA
        +MG    +LE D+   + S F  E  +     G L+ E+   + +     C +T R GN+VAH LA  A
Subjt:  EMGLGPLMLETDSSR-VASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLAFLA

M5XK32 Reverse transcriptase domain-containing protein (Fragment); e value 1.5e-133; identity 38.95%
Query:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-------------------ANESDATAVRGILDCY
        MSKAYDRVEW FLEK+ML MGF   WV ++  C+++V +SF VNG     + P+RGLRQGDPLSP                   A +++   ++ I + Y
Subjt:  MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSP-------------------ANESDATAVRGILDCY

Query:  ERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQAIPCYSMN
        ER SG+ +N  KS ++FS +     ++++  +L V     H  YLGLP  + R++T    ++K+RVW+++QGW+ +  S+ G+EVLLK V Q+IP Y MN
Subjt:  ERVSGQTVNFDKSIISFSPSTDGRVKAQVGQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQAIPCYSMN

Query:  CFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLRANVGTRPSF
        CF LP+ L  EI + MARFWWG +  +R IHW+ W+ LCK K  GGMGFR ++ FN A+LAKQ WR++ NP SL +R+LK +YFP  +F  A +G+RPS 
Subjt:  CFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLRANVGTRPSF

Query:  IWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPE-TRVADLMTASG--QWNEGLIRQNFSPQEVGLILSILVRAGA-EDKIVW
        +W+S+   R++LEMG R+Q+G+G+SV ++G  W+P    F V +S     E T+V++L+   G  QW+   +   F P +V   + I +   A  D+IVW
Subjt:  IWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPE-TRVADLMTASG--QWNEGLIRQNFSPQEVGLILSILVRAGA-EDKIVW

Query:  HYEKSGLFSVKSGYRLG-QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNVCGLCRRGGESSLHVFWHCKFV
        +Y+K GLF+VKS YR+  + T   +  SSSSN      W+ +W   +P K+KIF WR+  D LPT  NL  +G D+ ++C  C    ES+LHV   C F 
Subjt:  HYEKSGLFSVKSGYRLG-QSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNVCGLCRRGGESSLHVFWHCKFV

Query:  KAVLMESEFGGLIHNVQAGSMFDLLREVKDKGPRTSGWASRMGDELYLVVPAGDLGLWCWVRSVLAREDVRWSPPEAGWYKVNVDASFRRERWQAGLGVV
         A    S      H     S  D++           G+A +   E    + A D       R    R+ VRW+ P +G  K N D +F     +  +GVV
Subjt:  KAVLMESEFGGLIHNVQAGSMFDLLREVKDKGPRTSGWASRMGDELYLVVPAGDLGLWCWVRSVLAREDVRWSPPEAGWYKVNVDASFRRERWQAGLGVV

Query:  VRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLMLETDSSRVASFFQDEAGDDFSDVGALVAE---LRRDMPDPSFFGCRFTRREGNE
         RD+ G  + + +     V S E AE  AA +G  LA+ +G    + E DS+ V S  +  AG D+S++G +V +   L++  P   F   +FT RE N 
Subjt:  VRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLMLETDSSRVASFFQDEAGDDFSDVGALVAE---LRRDMPDPSFFGCRFTRREGNE

Query:  VAHQLA
        V H+LA
Subjt:  VAHQLA

SwissProt top hits: e value, % identity, alignment
P0C2F6 Putative ribonuclease H protein At1g65750; e value 4.6e-44; identity 25.49%
Query:  LPSFMPRDRTSSLSFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGG
        +P    R    +   I +RV  ++ GW+ K  S  GR  L K+V+ ++P +SM+   LP+ ++  + +    F WG     +  H V W  +C PK  GG
Subjt:  LPSFMPRDRTSSLSFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGG

Query:  MGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLRANVGTRP----SFIWRSLLWG-RELLEMGLRWQVGNGESVSVYGVNW------LP
        +G R  +  N+AL++K  WR++Q  +SL   VL+ +Y   G+ +R +    P    S  WRS+  G R+++  G+ W  G+G+ +  +   W      L 
Subjt:  MGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLRANVGTRP----SFIWRSLLWG-RELLEMGLRWQVGNGESVSVYGVNW------LP

Query:  DDGNFRVRSSGTLAPETRVADLMTASGQWNEGLI---RQNFSPQEVGLILSILVRAGAEDKIVWHYEKSGLFSVKSGYRLGQSTWLAQFPSSSSNESAMG
         D   R     T+  +    DL      W+   I     N +  E+  ++  LV  GA D++ W + + G FSV+S Y +     + + P      +   
Subjt:  DDGNFRVRSSGTLAPETRVADLMTASGQWNEGLI---RQNFSPQEVGLILSILVRAGAEDKIVWHYEKSGLFSVKSGYRLGQSTWLAQFPSSSSNESAMG

Query:  WWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNVCGLCRRGGESSLHVFWHCK-----FVKAV--------LMESEFGGLIHNV--------
        ++  +WK+ +P ++K FLW +    + T +    R     NVC +C+ G ES LHV   C      +V+ V          +S F  L  N+        
Subjt:  WWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNVCGLCRRGGESSLHVFWHCK-----FVKAV--------LMESEFGGLIHNV--------

Query:  -----------------QAGSMFDLLREVKDKGPRTSGWASRMGDELYLVVPAGDLGLWCWVRSVLAREDVRWSPPEAGWYKVNVDASFRRERWQAGLGV
                         + G++F    + +D+      WA         V  A    +   +        + W  P  GW KVN D + R     A  G 
Subjt:  -----------------QAGSMFDLLREVKDKGPRTSGWASRMGDELYLVVPAGDLGLWCWVRSVLAREDVRWSPPEAGWYKVNVDASFRRERWQAGLGV

Query:  VVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLMLETDSSRVASFFQDEAGD
        V+RD +G      SL      +P+ AE W    G   A E  +  + LE DS  +  F +    D
Subjt:  VVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLMLETDSSRVASFFQDEAGD

P93295 Uncharacterized mitochondrial protein AtMg00310; e value 3.2e-37; identity 49.32%
Query:  AIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPK-CYGGMGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLR
        A+P Y+M+CFRL K L  +++ AM  FWW   +  R I WV+W+ LCK K   GG+GFRD+  FNQALLAKQ +RII  P +LL+R+L+ RYFP    + 
Subjt:  AIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPK-CYGGMGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLR

Query:  ANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDD
         +VGTRPS+ WRS++ GRELL  GL   +G+G    V+   W+ D+
Subjt:  ANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDD

Arabidopsis top hits: e value, % identity, alignment
AT1G43730.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein; e value 7.7e-10; identity 28.14%
Query:  LLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIRQNF
        LLA  L   +    +F   N  T  S+IWR L   RE+    +   VG+G +   +  NW            G L       DL+   G           
Subjt:  LLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQWNEGLIRQNF

Query:  SPQEVGLILSI--LVRAGAEDKIVWH---YEKSGLFSVKSGYRLGQSTWLAQFPSSSSNESAMGWWKGVW-KMFIPNKIKIFLWRLCLDRLPTIDNLGVR
         PQ VGL +    L+    +D  +W    +  S +FS          T LA  P +      + W+K VW K  +P K     W +  +RL T D L   
Subjt:  SPQEVGLILSI--LVRAGAEDKIVWH---YEKSGLFSVKSGYRLGQSTWLAQFPSSSSNESAMGWWKGVW-KMFIPNKIKIFLWRLCLDRLPTIDNLGVR

Query:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAV
        G  +  VC LC    ES  H+F+ C F  AV
Subjt:  GCDVLNVCGLCRRGGESSLHVFWHCKFVKAV

AT2G02650.1 Ribonuclease H-like superfamily protein; e value 1.6e-12; identity 21.9%
Query:  KGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAG--SMFD--LLREVKDKGPRT
        + +WK+ +  KIK FLWR     L T   L  R  D   +C  C    E+  H+ ++C + ++V   +    +I   Q G  S F+  L R ++    +T
Subjt:  KGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAG--SMFD--LLREVKDKGPRT

Query:  SG--------------WASR---------MGDELYLVVPAGDLGLWCWVRSVL---------------AREDVRWSPPEAGWYKVNVDASFRRERWQAGL
        +               W SR            +        D   W                       R+  +W+PP  GW K N D+ + +       
Subjt:  SG--------------WASR---------MGDELYLVVPAGDLGLWCWVRSVL---------------AREDVRWSPPEAGWYKVNVDASFRRERWQAGL

Query:  GVVVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLMLETDSSRVASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNE
        G  +R+ +G ++L  +   +       AE    +   ++    GL  +  E+DS  + +   +  G+D S +G L+ ++R  M    +    F  RE N 
Subjt:  GVVVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLMLETDSSRVASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNE

Query:  VAHQLA
         A  LA
Subjt:  VAHQLA

AT3G09510.1 Ribonuclease H-like superfamily protein; e value 1.1e-29; identity 24.09%
Query:  LKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQ---WNEGLIRQNFSP
        +K RYF     L A V  + S+ W SLL G  LL+ G R  +G+G+++ + G++ + D    R  ++     E  + +L    G    W++  I Q    
Subjt:  LKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLAPETRVADLMTASGQ---WNEGLIRQNFSP

Query:  QEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLGQSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNV
         + G I  I L ++   DKI+W+Y  +G ++V+SGY L         P+ +    ++     +W + I  K+K FLWR     L T + L  RG  +   
Subjt:  QEVGLILSI-LVRAGAEDKIVWHYEKSGLFSVKSGYRLGQSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNV

Query:  CGLCRRGGESSLHVFWHCKF------------VKAVLMESEF----GGLIHNVQAGSMFDLLREV------KDKGPRTSGWASRMGDELYLVVPAGDLGL
        C  C R  ES  H  + C F            ++  LM ++F      +++ VQ  +M D  + +      +    R +   ++  +     V +     
Subjt:  CGLCRRGGESSLHVFWHCKF------------VKAVLMESEF----GGLIHNVQAGSMFDLLREV------KDKGPRTSGWASRMGDELYLVVPAGDLGL

Query:  WCWVRSV------------LAREDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPL
          W+ +             +A   + W  P A + K N DA F  ++ +A  G ++R+  G  +   S+   H  +P  AE  A +   +     G   +
Subjt:  WCWVRSV------------LAREDVRWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPL

Query:  MLETDSSRVASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFT-------RREGNEVAHQLA
         +E D   + +      G  F       + L   + D SF+  +F        RR+GN++AH LA
Subjt:  MLETDSSRVASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFT-------RREGNEVAHQLA

AT4G29090.1 Ribonuclease H-like superfamily protein    e value: 1.4e-59    %identity: 29.32
Query:  AIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLRA
        A+P Y+M CF LPK +  +I   +A FWW  ++  +G+HW +W  L   K  GG+GF+D+E FN ALL KQ WR++  P SL+A+V K RYF   D L A
Subjt:  AIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPKCYGGMGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLRA

Query:  NVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWL---PDDGNFRV-----RSSGTLAPETRVADLMTASG-QWNEGLIRQNFSPQEVGLILS
         +G+RPSF+W+S+   +E+L  G R  VGNGE + ++   WL   P     R+     +   +++   +V+DL+  SG +W + +I   F   E  LI  
Subjt:  NVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWL---PDDGNFRV-----RSSGTLAPETRVADLMTASG-QWNEGLIRQNFSPQEVGLILS

Query:  ILVRAGAE---DKIVWHYEKSGLFSVKSGY-RLGQSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNVCGLCR
        +  R G     D   W Y  SG ++VKSGY  L Q       P   S  S    ++ +WK     KI+ FLW+   + LP    L  R     + C  C 
Subjt:  ILVRAGAE---DKIVWHYEKSGLFSVKSGY-RLGQSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLPTIDNLGVRGCDVLNVCGLCR

Query:  RGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREVKDKGPRTSGW--ASRM------------------GDEL----YLVVPAGDLGLW--
           E+  H+ + C F +     S     +    A S++  L  V + G     W  AS++                  G E      L     DL  W  
Subjt:  RGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREVKDKGPRTSGW--ASRM------------------GDEL----YLVVPAGDLGLW--

Query:  ------CWVRSVLAREDV-RWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPEMAE----GWAAVKGTRLAVEMGLGPLML
              C  +  + R    RW PP   W K N DA++ R+  + G+G V+R+  G V    +     ++S   AE     WA +  +R         ++ 
Subjt:  ------CWVRSVLAREDV-RWSPPEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPEMAE----GWAAVKGTRLAVEMGLGPLML

Query:  ETDSSRVASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLA
        E+DS  +     ++  + +  +   + +L+R +   +     F  REGN +A ++A
Subjt:  ETDSSRVASFFQDEAGDDFSDVGALVAELRRDMPDPSFFGCRFTRREGNEVAHQLA

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein    e value: 2.3e-38    %identity: 49.32
Query:  AIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPK-CYGGMGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLR
        A+P Y+M+CFRL K L  +++ AM  FWW   +  R I WV+W+ LCK K   GG+GFRD+  FNQALLAKQ +RII  P +LL+R+L+ RYFP    + 
Subjt:  AIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLCKPK-CYGGMGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLR

Query:  ANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDD
         +VGTRPS+ WRS++ GRELL  GL   +G+G    V+   W+ D+
Subjt:  ANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDD


Sequences
CDS sequence
ATGAGTAAGGCATATGATAGAGTCGAATGGGTTTTCTTGGAGAAGATTATGCTGAAAATGGGTTTTGCTCCGGAGTGGGTGGATCTGATTTCTCTTTGTATTTCTTCCGT
ACGTTTCTCTTTTAATGTGAATGGGATCAGGTGTGGGGGTGTGGTTCCGAGTAGAGGTTTGCGGCAGGGTGACCCATTATCCCCGGCCAACGAGAGTGATGCCACAGCGG
TTCGGGGCATTCTGGACTGTTATGAACGGGTGTCGGGTCAGACAGTGAATTTCGATAAGTCTATTATCTCTTTCAGTCCGAGTACGGATGGGAGGGTTAAGGCTCAGGTG
GGTCAGATTCTGCAGGTTCAGGTTACGGCATGGCACCGCCAATATCTAGGCCTACCTTCTTTTATGCCACGGGACAGAACGAGCTCGTTGAGTTTCATTAAGGATCGAGT
ATGGCAGCAGATTCAGGGGTGGAAGGGCAAACTATTCTCAGTTGGGGGTAGGGAGGTTCTTCTAAAGTCTGTTGTGCAGGCGATTCCGTGTTATTCGATGAATTGCTTCC
GTTTACCAAAGAAGTTGATTCTTGAGATCAGCAGAGCCATGGCCCGGTTTTGGTGGGGTGGGGAGAAGGTGGATCGAGGAATTCATTGGGTGAGTTGGAAATCCCTATGT
AAGCCTAAGTGCTATGGTGGGATGGGGTTTAGGGATATGGAGATTTTTAACCAAGCTTTGCTGGCAAAACAGTGTTGGAGAATTATCCAGAATCCATCTTCCCTCCTGGC
ACGTGTTCTGAAGGGCAGATATTTTCCTTTCGGAGATTTCTTGAGGGCAAATGTGGGGACGAGACCGTCTTTTATATGGAGGAGTTTATTGTGGGGAAGAGAGCTTTTGG
AGATGGGCTTGCGTTGGCAAGTGGGGAATGGAGAAAGTGTGTCGGTATATGGAGTGAATTGGCTTCCTGATGATGGTAATTTTAGAGTACGGTCATCAGGGACTTTGGCT
CCGGAGACTCGGGTTGCAGACCTGATGACGGCATCGGGGCAGTGGAATGAAGGGCTCATCCGACAGAATTTTAGCCCTCAGGAGGTCGGTCTAATTTTGTCAATTCTTGT
GCGGGCTGGGGCGGAGGATAAGATTGTTTGGCATTATGAGAAGTCGGGCCTGTTTTCGGTGAAAAGCGGATATCGGTTGGGGCAATCAACCTGGCTTGCGCAGTTTCCGT
CTTCCTCTTCGAACGAGTCGGCAATGGGTTGGTGGAAGGGGGTTTGGAAAATGTTTATTCCGAATAAGATCAAGATTTTTCTTTGGAGACTTTGTTTAGACCGCTTGCCC
ACGATTGATAATTTGGGTGTTCGGGGCTGTGACGTGTTGAACGTTTGTGGCCTCTGTAGGCGAGGTGGGGAGTCCAGTCTGCATGTCTTCTGGCATTGCAAGTTCGTGAA
GGCCGTGTTGATGGAGTCCGAGTTTGGAGGATTGATACATAATGTGCAGGCTGGGTCTATGTTTGATCTGCTTAGGGAAGTGAAGGACAAGGGGCCTAGAACCAGTGGTT
GGGCTAGTAGAATGGGCGACGAGTTATATCTCGTCGTTCCAGCAGGCGACCTCGGCCTGTGGTGTTGGGTGAGGAGTGTTCTTGCAAGGGAAGATGTGAGATGGAGCCCC
CCGGAGGCTGGGTGGTATAAGGTGAACGTAGATGCGTCCTTTAGGAGGGAGCGATGGCAGGCGGGTCTGGGAGTGGTGGTTCGGGATTCCTCTGGTCGGGTTATGTTGTC
GGCATCTTTGGTGCAGCGACATGTGCGAAGCCCGGAGATGGCTGAAGGATGGGCCGCAGTTAAGGGAACGAGACTGGCAGTGGAGATGGGTTTGGGCCCATTGATGTTGG
AGACTGACTCCAGTCGGGTGGCTAGCTTTTTCCAAGATGAGGCAGGGGATGACTTCTCAGATGTGGGTGCCCTGGTGGCTGAATTACGAAGGGACATGCCAGATCCTTCC
TTTTTCGGTTGCAGGTTCACTCGAAGGGAGGGAAATGAGGTGGCACACCAGTTGGCTTTTCTGGCAGGGAGGGACGAAGTGTCTAGGGTTACGACGCGCAGGCATGATGC
TGGTGAAGATCGTGAATGGATAGTGAACAAAGAGCCATCAAATTAG
mRNA sequence
ATGAGTAAGGCATATGATAGAGTCGAATGGGTTTTCTTGGAGAAGATTATGCTGAAAATGGGTTTTGCTCCGGAGTGGGTGGATCTGATTTCTCTTTGTATTTCTTCCGT
ACGTTTCTCTTTTAATGTGAATGGGATCAGGTGTGGGGGTGTGGTTCCGAGTAGAGGTTTGCGGCAGGGTGACCCATTATCCCCGGCCAACGAGAGTGATGCCACAGCGG
TTCGGGGCATTCTGGACTGTTATGAACGGGTGTCGGGTCAGACAGTGAATTTCGATAAGTCTATTATCTCTTTCAGTCCGAGTACGGATGGGAGGGTTAAGGCTCAGGTG
GGTCAGATTCTGCAGGTTCAGGTTACGGCATGGCACCGCCAATATCTAGGCCTACCTTCTTTTATGCCACGGGACAGAACGAGCTCGTTGAGTTTCATTAAGGATCGAGT
ATGGCAGCAGATTCAGGGGTGGAAGGGCAAACTATTCTCAGTTGGGGGTAGGGAGGTTCTTCTAAAGTCTGTTGTGCAGGCGATTCCGTGTTATTCGATGAATTGCTTCC
GTTTACCAAAGAAGTTGATTCTTGAGATCAGCAGAGCCATGGCCCGGTTTTGGTGGGGTGGGGAGAAGGTGGATCGAGGAATTCATTGGGTGAGTTGGAAATCCCTATGT
AAGCCTAAGTGCTATGGTGGGATGGGGTTTAGGGATATGGAGATTTTTAACCAAGCTTTGCTGGCAAAACAGTGTTGGAGAATTATCCAGAATCCATCTTCCCTCCTGGC
ACGTGTTCTGAAGGGCAGATATTTTCCTTTCGGAGATTTCTTGAGGGCAAATGTGGGGACGAGACCGTCTTTTATATGGAGGAGTTTATTGTGGGGAAGAGAGCTTTTGG
AGATGGGCTTGCGTTGGCAAGTGGGGAATGGAGAAAGTGTGTCGGTATATGGAGTGAATTGGCTTCCTGATGATGGTAATTTTAGAGTACGGTCATCAGGGACTTTGGCT
CCGGAGACTCGGGTTGCAGACCTGATGACGGCATCGGGGCAGTGGAATGAAGGGCTCATCCGACAGAATTTTAGCCCTCAGGAGGTCGGTCTAATTTTGTCAATTCTTGT
GCGGGCTGGGGCGGAGGATAAGATTGTTTGGCATTATGAGAAGTCGGGCCTGTTTTCGGTGAAAAGCGGATATCGGTTGGGGCAATCAACCTGGCTTGCGCAGTTTCCGT
CTTCCTCTTCGAACGAGTCGGCAATGGGTTGGTGGAAGGGGGTTTGGAAAATGTTTATTCCGAATAAGATCAAGATTTTTCTTTGGAGACTTTGTTTAGACCGCTTGCCC
ACGATTGATAATTTGGGTGTTCGGGGCTGTGACGTGTTGAACGTTTGTGGCCTCTGTAGGCGAGGTGGGGAGTCCAGTCTGCATGTCTTCTGGCATTGCAAGTTCGTGAA
GGCCGTGTTGATGGAGTCCGAGTTTGGAGGATTGATACATAATGTGCAGGCTGGGTCTATGTTTGATCTGCTTAGGGAAGTGAAGGACAAGGGGCCTAGAACCAGTGGTT
GGGCTAGTAGAATGGGCGACGAGTTATATCTCGTCGTTCCAGCAGGCGACCTCGGCCTGTGGTGTTGGGTGAGGAGTGTTCTTGCAAGGGAAGATGTGAGATGGAGCCCC
CCGGAGGCTGGGTGGTATAAGGTGAACGTAGATGCGTCCTTTAGGAGGGAGCGATGGCAGGCGGGTCTGGGAGTGGTGGTTCGGGATTCCTCTGGTCGGGTTATGTTGTC
GGCATCTTTGGTGCAGCGACATGTGCGAAGCCCGGAGATGGCTGAAGGATGGGCCGCAGTTAAGGGAACGAGACTGGCAGTGGAGATGGGTTTGGGCCCATTGATGTTGG
AGACTGACTCCAGTCGGGTGGCTAGCTTTTTCCAAGATGAGGCAGGGGATGACTTCTCAGATGTGGGTGCCCTGGTGGCTGAATTACGAAGGGACATGCCAGATCCTTCC
TTTTTCGGTTGCAGGTTCACTCGAAGGGAGGGAAATGAGGTGGCACACCAGTTGGCTTTTCTGGCAGGGAGGGACGAAGTGTCTAGGGTTACGACGCGCAGGCATGATGC
TGGTGAAGATCGTGAATGGATAGTGAACAAAGAGCCATCAAATTAG
Protein sequence
MSKAYDRVEWVFLEKIMLKMGFAPEWVDLISLCISSVRFSFNVNGIRCGGVVPSRGLRQGDPLSPANESDATAVRGILDCYERVSGQTVNFDKSIISFSPSTDGRVKAQV
GQILQVQVTAWHRQYLGLPSFMPRDRTSSLSFIKDRVWQQIQGWKGKLFSVGGREVLLKSVVQAIPCYSMNCFRLPKKLILEISRAMARFWWGGEKVDRGIHWVSWKSLC
KPKCYGGMGFRDMEIFNQALLAKQCWRIIQNPSSLLARVLKGRYFPFGDFLRANVGTRPSFIWRSLLWGRELLEMGLRWQVGNGESVSVYGVNWLPDDGNFRVRSSGTLA
PETRVADLMTASGQWNEGLIRQNFSPQEVGLILSILVRAGAEDKIVWHYEKSGLFSVKSGYRLGQSTWLAQFPSSSSNESAMGWWKGVWKMFIPNKIKIFLWRLCLDRLP
TIDNLGVRGCDVLNVCGLCRRGGESSLHVFWHCKFVKAVLMESEFGGLIHNVQAGSMFDLLREVKDKGPRTSGWASRMGDELYLVVPAGDLGLWCWVRSVLAREDVRWSP
PEAGWYKVNVDASFRRERWQAGLGVVVRDSSGRVMLSASLVQRHVRSPEMAEGWAAVKGTRLAVEMGLGPLMLETDSSRVASFFQDEAGDDFSDVGALVAELRRDMPDPS
FFGCRFTRREGNEVAHQLAFLAGRDEVSRVTTRRHDAGEDREWIVNKEPSN