; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0022875 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0022875
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr7:40006443..40010542
RNA-Seq ExpressionLag0022875
SyntenyLag0022875
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_023871998.1 uncharacterized protein LOC111984613 [Quercus suber]7.3e-17132.74Show/hide
Query:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASIQENGWS--WRFTGFYGNPPCAQHHESW
        L   V +  P LIFLMETK+     D I+  L+   G+ VPS   SGGL+LLW  +++V+++SYS+ HIDA + +N  S  WRFTGFYGNP  ++  ESW
Subjt:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASIQENGWS--WRFTGFYGNPPCAQHHESW

Query:  KLMKRLCDQLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLA
         L+KRL     +PWV  GDFNE++                               GY G  FTW     N G + ERLDR L++     K    ++ H A
Subjt:  KLMKRLCDQLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLA

Query:  LISSDHRPILADWKEEPPDSMSRRRLR--RFEEVWTKYEDCKEIVKQVWSLKTESSANSIIIKTM-ECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLS
          SSDH  +L   K+ P  S  +R  +  RFE +W K E C ++V   W     S + S +   + EC   L  W     N S+ G + +K   +Q+ L 
Subjt:  LISSDHRPILADWKEEPPDSMSRRRLR--RFEEVWTKYEDCKEIVKQVWSLKTESSANSIIIKTM-ECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLS

Query:  SGEGQDTTEMKRKE-----RELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRR----------------------------------------
          E Q  + +  +E      EL  LLE +E+ W QR+R  WLK GD N  +FH KAS+R                                         
Subjt:  SGEGQDTTEMKRKE-----RELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRR----------------------------------------

Query:  -----------KINKDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKD
                   K+    N++L   F   E+   +K+M PT A GPDG+  IFYQ YW  V + + +  L FLN      + N+T+IVLIPKV DPK + D
Subjt:  -----------KINKDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKD

Query:  FRPISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWI
        FRP SLC++ YK+ +K +A+RLKQ+L  ++  +QSAFV  RLITDN L+  E +H +  +RKGK G +++KLDMSKAYDRVEW  + QIM K+GF +RW+
Subjt:  FRPISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWI

Query:  HLVMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEMNNQV---------------------------------------
         L+M CV SV + V +NG P+   +P+RGLRQGDPLSPYLFL CA+GL  + ++A    ++                                       
Subjt:  HLVMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEMNNQV---------------------------------------

Query:  -VQTDSLGQ-------------------------------------YLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNY
         V  DS GQ                                     YLGLPS   RSK   F  ++++V + L GWK +L S AG+E LIK+VA+A+P Y
Subjt:  -VQTDSLGQ-------------------------------------YLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNY

Query:  VMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG-----------------------------------------HFLKAGVGNN
         M+CFK P S+C+EL SM ++FWWG    +RK+ W SW+KLC+ K  GG                                         +F +A +G N
Subjt:  VMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG-----------------------------------------HFLKAGVGNN

Query:  PSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIP---REGVCTPLPLKRNEDQLTVAQLINPLGDWNESLVYELFQVNDAKDILNIPLRTRNKDDE
        PSY WR+I+  + + K+G +W++G+G +++  +D W+P    + + +P       D    A +     +WNE LV ++    +A  +L IPL      D 
Subjt:  PSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIP---REGVCTPLPLKRNEDQLTVAQLINPLGDWNESLVYELFQVNDAKDILNIPLRTRNKDDE

Query:  IIWQGDPKGVFTVKSAYRLRMQNFI-----------------------------------CLRHLLPASQQKKLSGKHIGKHQFHLKSEAKPESLQHLLW
         IW  +PKG FTV+ AY+  M+  I                                     R +LP   ++ L  +H+           + E+  HL W
Subjt:  IIWQGDPKGVFTVKSAYRLRMQNFI-----------------------------------CLRHLLPASQQKKLSGKHIGKHQFHLKSEAKPESLQHLLW

Query:  ECKRTKGKWSKF-FSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLH
         C++    WS      P ++     + W   ++ +++W+  R    + D + +++ +CW IW  RN   H
Subjt:  ECKRTKGKWSKF-FSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLH

XP_023890148.1 uncharacterized protein LOC112002224 [Quercus suber]4.3e-17933.09Show/hide
Query:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASIQENGWS--WRFTGFYGNPPCAQHHESW
        L   V +  P LIFLMETKM  +  D I+  +    G+ VPS G SGGL LLW  DI V ++++S+ HIDA I     S  WR TGFYGNP  ++  ESW
Subjt:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASIQENGWS--WRFTGFYGNPPCAQHHESW

Query:  KLMKRLCDQLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLA
         L+ RL +   +PW+  GDFNE+++                              GY G KFTWC    + G + ERLDR L++ A        ++ H+A
Subjt:  KLMKRLCDQLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLA

Query:  LISSDHRPILADWKEEPP--DSMSRRRLRRFEEVWTKYEDCKEIVKQVWS---LKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDF
          +SDH  ++   K+ PP      R++L RFE +W   E C  +V++ W+    +   S  S  ++  EC + L  W+   + G +   I+  ++ +QD 
Subjt:  LISSDHRPILADWKEEPP--DSMSRRRLRRFEEVWTKYEDCKEIVKQVWS---LKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDF

Query:  LSSGEGQD-TTEMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRR--------KINKDQNNMLTSSFTREEIYEVVKRMHPTKALGP
         S  +      E++   REL  LL+ +E  W QR    WLK GD N  +FH KAS+          K++   N+ LT  F  EE+   +K+M PT A GP
Subjt:  LSSGEGQD-TTEMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRR--------KINKDQNNMLTSSFTREEIYEVVKRMHPTKALGP

Query:  DGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITD
        DG+  +FYQ+YW  V   I E  L FLN        N+T+IVLIPKV DPK ++DFRPISLC++ YK+ +K LANRLK  L  ++  +QSAFV  RLITD
Subjt:  DGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITD

Query:  NALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICA
        N L+  E +  +  +RKGK G +++KLDMSKAYDRVEW  + QIM K+GFC+RW++L+M C+ +V++ + +NG P+    P+RGLRQGDP+SPYLFLICA
Subjt:  NALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICA

Query:  KGLLRLLNKAEMNNQV----------------------------------------VQTDSLGQ------------------------------------
        +GL  LL++A  N ++                                        V  +S GQ                                    
Subjt:  KGLLRLLNKAEMNNQV----------------------------------------VQTDSLGQ------------------------------------

Query:  -YLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINK
         YLGLPS   RSK   F  ++++V + + GWK +L S AG+E LIK+VAQA+P Y M+CFK P ++C++L  + ++FWWG    +RK+ W SWEKLC  K
Subjt:  -YLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINK

Query:  HLG-----------------------------------------GHFLKAGVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREG---V
          G                                         G F+ A +G +PS+ WR+I+  + + ++G  W++G+G N+    D WIP      +
Subjt:  HLG-----------------------------------------GHFLKAGVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREG---V

Query:  CTPLPLKRNEDQLTVAQLINPLGDWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQGDPKGVFTVKSAYRL--------------------------
         TP  L  ++ +  V  LI   G+W+ +L+ ++F  +DA+ IL+IPL      D+ IW G   G FTV SAY+L                          
Subjt:  CTPLPLKRNEDQLTVAQLINPLGDWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQGDPKGVFTVKSAYRL--------------------------

Query:  ---------RMQNFICLRHLLPASQQKKLSGKHIGKHQFHLKSEAKPESLQHLLWECKRTKGKW--SKFFSIPNELYAYDRRSWTTMDYCELLWRDGRIY
                 R   +   + +L  + +  L  +HI             E+  H+ W C++ K  W  SK F  P  +      +W   ++ +++W   +  
Subjt:  ---------RMQNFICLRHLLPASQQKKLSGKHIGKHQFHLKSEAKPESLQHLLWECKRTKGKW--SKFFSIPNELYAYDRRSWTTMDYCELLWRDGRIY

Query:  KLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQIIKEIENITRDVETQR--HEEVKTNRRSSANHPGTDQTEASRLLKWLKPPEGAWKLN--NA
          D   + ++ +ICW+IW  RN++    +H        QI+++     R+VE  R  HEEV          P  D+ E    + W  P EG +K N   A
Subjt:  KLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQIIKEIENITRDVETQR--HEEVKTNRRSSANHPGTDQTEASRLLKWLKPPEGAWKLN--NA

Query:  SWCEEKQMGSVGWILRRWDGSIIA
        ++   +  G +G ++R   G ++A
Subjt:  SWCEEKQMGSVGWILRRWDGSIIA

XP_030498122.1 uncharacterized protein LOC115713779 [Cannabis sativa]8.4e-16730.73Show/hide
Query:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASIQ-ENGWSWRFTGFYGNPPCAQHHESWK
        LR  V    P ++F+METK++    D  R+QL F  GIEVP  G  GGL+LLW + + +++ +YS  HID  ++  +G S  FTGFYG+P  +Q H +W 
Subjt:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASIQ-ENGWSWRFTGFYGNPPCAQHHESWK

Query:  LMKRLCD-QLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLA
        L++R  D     PW + GDFNEII                                + G + TW N    +G + ERLD   IN    +      + HL 
Subjt:  LMKRLCD-QLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLA

Query:  LISSDHRPILA--DWKEEPPDSMSRRRLRRFEEVWTKYEDCKEIVKQVWSLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDF--L
           SDHR + A    K +   ++  +   RFE++W + E C +I+   W+    S    +     +C + L QW    + G++R  I    K +      
Subjt:  LISSDHRPILA--DWKEEPPDSMSRRRLRRFEEVWTKYEDCKEIVKQVWSLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDF--L

Query:  SSGEGQDTTEMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKINK----------------------------------------
        SS   +    +K  E+ L++LL  +E YW QRAR  W+K GDSN K+FH +A++R   N+                                        
Subjt:  SSGEGQDTTEMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKINK----------------------------------------

Query:  -----------DQN--NMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFR
                   D+N  +++++ FT  E+Y  +  M   K+ G DG+  +F+  YW++V + + E  L+ LN   + +  N T I LIPKV  P+ +  +R
Subjt:  -----------DQN--NMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFR

Query:  PISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHL
        PISLC+++YK+++K++  RL+  L  +IS  QSAF+  RLITDN L+ FE +H++ +R++G +G  +IKLDMSKA+DRVEW YI QIM KMGF    ++L
Subjt:  PISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHL

Query:  VMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEMN--------------------------------------------
        ++ C++SV++  LLNG    D  P+RG+RQGDPLSPYLFLICA+GL RLL   E N                                            
Subjt:  VMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEMN--------------------------------------------

Query:  -----NQVVQTDSL----------------------------GQYLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVM
              Q V  D                               +YLGLPS + R K  +F+ I D++W+ L  WK +LFS  G+E L+K+V QAIP Y M
Subjt:  -----NQVVQTDSL----------------------------GQYLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVM

Query:  NCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINK-----------------------------------------HLGGHFLKAGVGNNPS
        +CF+ P +LC+++ SM A FWWGS  S   IHW++W  LC  K                                          L G  L AG+GN PS
Subjt:  NCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINK-----------------------------------------HLGGHFLKAGVGNNPS

Query:  YTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLPLKRNEDQLTVAQLINPLGDWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQG
         TWR+I+WG+ L  +G RW++G+G  +   TDPW+P     TP      +  L VA LIN    W+ S V   F   D   IL+IPL    K D +IW G
Subjt:  YTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLPLKRNEDQLTVAQLINPLGDWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQG

Query:  DPKGVFTVKSAY--------------------------------RLRMQNFICLRHLLPASQQKKLSGKHIGKHQFHLKSEAKPESLQHLLWECKRTKGK
           G ++VKS Y                                +LR+  +    ++LP + +  L+ +HI    +    + + E++ H L+ C R K  
Subjt:  DPKGVFTVKSAY--------------------------------RLRMQNFICLRHLLPASQQKKLSGKHIGKHQFHLKSEAKPESLQHLLWECKRTKGK

Query:  WSKFFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQIIKEIENITRDVETQRHEEVKTNRRS
        W     I N    +   + ++ +   LL+  G    L+ ++    + +CW IW  RN   H      P  +       ++ I +         V     +
Subjt:  WSKFFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQIIKEIENITRDVETQRHEEVKTNRRS

Query:  SANHPGTDQTEASRLLKWLKPPEGAWKLN-NASWCEEKQMGSVGWILRRWDGSIIA
         A    T  + A  +  W  PP+G +KLN +A++ +  ++  +G +LR   G I A
Subjt:  SANHPGTDQTEASRLLKWLKPPEGAWKLN-NASWCEEKQMGSVGWILRRWDGSIIA

XP_030508852.1 uncharacterized protein LOC115723496 [Cannabis sativa]2.4e-16930.63Show/hide
Query:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASIQEN-GWSWRFTGFYGNPPCAQHHESWK
        LR  V    P L+ L+E+K+K    D +R  L+F  GIEVP  GL GGL+LLW + ++V++ ++S  HID  I+ N G S+ FTGFYG+P  +Q H +W 
Subjt:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASIQEN-GWSWRFTGFYGNPPCAQHHESWK

Query:  LMKRLCDQLEM-PWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLA
        ++KR  D   + PW++ GDFNEI+                                + G + TW +    +G + ERLD   +N           V HL 
Subjt:  LMKRLCDQLEM-PWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLA

Query:  LISSDHRPI--LADWKEEPPDSMSRRRLRRFEEVWTKYEDCKEIVKQVWSLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSS
           SDHR I  L     E   ++  +   RFE++W + + C  ++   WS ++ +    +        ++L  W    + G ++  I   +K +    +S
Subjt:  LISSDHRPI--LADWKEEPPDSMSRRRLRRFEEVWTKYEDCKEIVKQVWSLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSS

Query:  --GEGQDTTEMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKINK-----------------------------------DQNNM
           +      ++  E+ L+ LL  +E YW QR+R  WL+ GDSN K+FH  A++RRK N+                                   DQ ++
Subjt:  --GEGQDTTEMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKINK-----------------------------------DQNNM

Query:  ------------------LTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFR
                          +++ FT  ++Y+ +K M   K+ G DG+  +FY  YW +V   +    L  LN   + S  N T + LIPKV  P  +  +R
Subjt:  ------------------LTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFR

Query:  PISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHL
        PISLC+++YK+++KA+  RLK  L  +IS  QSAF+  RLITDN L+ FE +H++ +R++G +G  +IKLDMSKA+DRVEW ++ Q+M KMGF    + L
Subjt:  PISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHL

Query:  VMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEM---------------------------------------------
        ++ C++SV++  LLNGT +    P+RG+RQGDPLSPYLFLICA+GL RLL   E+                                             
Subjt:  VMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEM---------------------------------------------

Query:  ------------------------NNQVVQTDSLG--------QYLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVM
                                + Q+   D LG        QYLGLPS + ++KK++F  I D++W+ L  WK  LFS  G+E L+K+V QAIP Y M
Subjt:  ------------------------NNQVVQTDSLG--------QYLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVM

Query:  NCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLG-----------------------------------------GHFLKAGVGNNPS
        +CF+ P++LC+++ SM A+FWWGS  + + IHW++W  LC  K  G                                         G++L AG+G+NPS
Subjt:  NCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLG-----------------------------------------GHFLKAGVGNNPS

Query:  YTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLPLKRNEDQLTVAQLINPLGDWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQG
         TWR+++WG+ L  +G RW++GSG  +   TD W+P      P   K  +  L VA LI     W+   +   F   D   +L+IPL     DD +IW  
Subjt:  YTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLPLKRNEDQLTVAQLINPLGDWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQG

Query:  DPKGVFTVKSAYRLRM------------------QNFICLRHLLPASQQ--------------KKLSGKHIGKHQFHLKSEAKPESLQHLLWECKRTKGK
           GV+ VKS Y   +                   NF  L+  LP   +               +L  +HI    +     +  E++ H L+ C R K  
Subjt:  DPKGVFTVKSAYRLRM------------------QNFICLRHLLPASQQ--------------KKLSGKHIGKHQFHLKSEAKPESLQHLLWECKRTKGK

Query:  WS-KFFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQIIKEIENITRDVETQRHEEVKTNRR
        W    FSI  +    +R S  T D   LL        L    +   +++CW IW  RN + H N    P      +     +   + +  R +  K    
Subjt:  WS-KFFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQIIKEIENITRDVETQRHEEVKTNRR

Query:  SSANHPGTDQTEASRLLKWLKPPEGAWKLN-NASWCEEKQMGSVGWILRRWDGSIIA
        S A  P    +E     KW  PP G  KLN +A+  +E+    +G +LR  DG I+A
Subjt:  SSANHPGTDQTEASRLLKWLKPPEGAWKLN-NASWCEEKQMGSVGWILRRWDGSIIA

XP_030923330.1 uncharacterized protein LOC115950239 [Quercus lobata]1.1e-16931.6Show/hide
Query:  GGLLLLWGTDINVTLRSYSKGHIDASI-QENGWSWRFTGFYGNPPCAQHHESWKLMKRLCDQLEMPWVLGGDFNEIIE----------------------
        GGL  LW  D+ + + +++  H+ A + +E+G+ W  TGFYG P   Q   SW+L+K L   +  PWV+ GDFN  +                       
Subjt:  GGLLLLWGTDINVTLRSYSKGHIDASI-QENGWSWRFTGFYGNPPCAQHHESWKLMKRLCDQLEMPWVLGGDFNEIIE----------------------

Query:  --------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLALISSDHRPILADWKEEPPDSMSRRRLRRFEEVWTKYEDCKEIVKQVW
                G+ G  +TW N          RLDR + N    ++    +V HL+  +SDH P+L   +          R  +FEE W   ++C  ++++ W
Subjt:  --------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLALISSDHRPILADWKEEPPDSMSRRRLRRFEEVWTKYEDCKEIVKQVW

Query:  SL--KTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSSGEGQDTTEMKRKE-----RELENLLEDDEIYWKQRAREEWLKWGDSN
                   ++  K   C   L  W       SI    +   KEIQ  L      + TE  + E     +++++LL+  EIYW QR+R  WL+ GD N
Subjt:  SL--KTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSSGEGQDTTEMKRKE-----RELENLLEDDEIYWKQRAREEWLKWGDSN

Query:  NKWFHIKASNRR---------------------------------------------------KINKDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGI
         K+FH KAS RR                                                   K+ +D    L++ FT EE+   + +M PTKA GPDG+
Subjt:  NKWFHIKASNRR---------------------------------------------------KINKDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGI

Query:  QAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNAL
         A+FYQK+W +V + +    L FLN    + EIN T IVLIPKV +P+ M +FRPISLC++IYK+I+K LANRLKQVL  IIS +QSAFVPGRLITDN L
Subjt:  QAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNAL

Query:  LGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGL
        + +E +H + +R+KGK+G +++KLD+SKAYDRVEW +++ IM KMGF   WI  VM CV + +F +L+NG P E   P+RG+RQGDP+SPYLFL+CA+GL
Subjt:  LGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGL

Query:  LRLLNKAEMNNQVV-----------------------------------------------------------------------------QTDSLGQYL
          LLNKAE+N  +                                                                              + D   +YL
Subjt:  LRLLNKAEMNNQVV-----------------------------------------------------------------------------QTDSLGQYL

Query:  GLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLG
        GLP+   R+K   F+ ++DRVW+ LQGWKG L S AG+E LIK+VAQAIP Y M+ F+ P+ LC+EL ++CA+FWWG   ++RKIHW+SW+KL   K  G
Subjt:  GLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLG

Query:  G-----------------------------------------HFLKAGVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPL-P
        G                                          FL+A    N S+ WR+++  + + + GY W++G+G ++ AV D W+P       L  
Subjt:  G-----------------------------------------HFLKAGVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPL-P

Query:  LKRNEDQLTVAQLINP-LGDWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQGDPKGVFTVKSAY-----------------------------RLR
        ++R+  ++ VA+LINP    WN   +  +F  ++A+ I  IPL  R+  D I W   P+G+F+VKSAY                             +LR
Subjt:  LKRNEDQLTVAQLINP-LGDWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQGDPKGVFTVKSAY-----------------------------RLR

Query:  MQNFI------CLRHLLPASQQKKLSGKHIGKHQFHLKSEAKPESLQHLLWECKRTKGKWSKFFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDR
        + N +          +LP +    L+ + I           + ES  H LW+C   +  W+             +      D  +L+  +  + +L++D 
Subjt:  MQNFI------CLRHLLPASQQKKLSGKHIGKHQFHLKSEAKPESLQHLLWECKRTKGKWSKFFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDR

Query:  IRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQIIKEIENITRDVETQRHEEVKTNRRSSANHPGTDQTEASRLLKWLKPPEGAWKLN-NASWCEEKQMG
        +       W +WC RN +LH  +   P +LN                 R EE  T  R++ N     +T+ S    W  PP G +KLN +A+   +    
Subjt:  IRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQIIKEIENITRDVETQRHEEVKTNRRSSANHPGTDQTEASRLLKWLKPPEGAWKLN-NASWCEEKQMG

Query:  SVGWILRRWDGSIIA
          G I+R   G ++A
Subjt:  SVGWILRRWDGSIIA

TrEMBL top hitse value%identityAlignment
A0A2N9H997 Uncharacterized protein8.7e-17833.48Show/hide
Query:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDAS-IQENGWSWRFTGFYGNPPCAQHHESWK
        L   V++  P+++FLMET++     +++R +L       V      GGL LLW  D+++ ++SYS  HIDA  +QE+G  WR TGFYGNP  A  H SW 
Subjt:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDAS-IQENGWSWRFTGFYGNPPCAQHHESWK

Query:  LMKRLCDQLEMPWVLGGDFNEIIEGYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLALISSDHRPILAD-------WKEEPPDSMSRR
        L++RL      PW+L GDFNEI  G+ GP+FTW N   +   +  RLDR +  P  +       V HL +++SDH  +L D       WK++      +R
Subjt:  LMKRLCDQLEMPWVLGGDFNEIIEGYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLALISSDHRPILAD-------WKEEPPDSMSRR

Query:  RLRRFEEVWTKYEDCKEIVKQVW-SLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSSGEGQ-DTTEMKRKERELENLLEDDE
        RL RF+  W +   C E + + W   +  +    ++ K  +C  +L QWS+     + +  I RK+K+++          D   +    REL  L+E +E
Subjt:  RLRRFEEVWTKYEDCKEIVKQVW-SLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSSGEGQ-DTTEMKRKERELENLLEDDE

Query:  IYWKQRAREEWLKWGDSNNKWFHIKASNRRKIN---------------------------------------------------KDQNNMLTSSFTREEI
         +W+QR+R  WL+ GD N ++FH  AS R+K N                                                    + N  L   F+ +E+
Subjt:  IYWKQRAREEWLKWGDSNNKWFHIKASNRRKIN---------------------------------------------------KDQNNMLTSSFTREEI

Query:  YEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLCSIIYKVIAKALANRLKQVLHCII
         + + +MHP+KA GPDG+ A+F+QKYWD+V  D+ E  L F+     +  IN T IVLIPKV  P +M  FRPISLC+++YK+I+K L NR+K +L  +I
Subjt:  YEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLCSIIYKVIAKALANRLKQVLHCII

Query:  SPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCVESVNFQVLLNGTPREDFSPNRGL
        S  QSAFVPGR+ITDN ++ FE +H + ++R+GKE  +++KLDMSKAYDRVEW Y+R I+ K+GF +RW+ L++ CV +V + V++NG  +    P RGL
Subjt:  SPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCVESVNFQVLLNGTPREDFSPNRGL

Query:  RQGDPLSPYLFLI------C------AKGLLRLLNKAE-----------------------MNNQVVQ------TDSLGQYLGLPSQNARSKKEIFNHIR
        RQGDPLSPYLFLI      C         LL+LL+  E                           ++Q      T+   +YLGLP    ++KK+ FN ++
Subjt:  RQGDPLSPYLFLI------C------AKGLLRLLNKAE-----------------------MNNQVVQ------TDSLGQYLGLPSQNARSKKEIFNHIR

Query:  DRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGGHFLK------------A
        DRVW+ LQGWK +L S AGRE LIK+V QAIP Y M+ F+FP+ LC+EL+S+   FWWG     RKIHW S ++L   K  GG   +             
Subjt:  DRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGGHFLK------------A

Query:  GVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPL-PLKRNEDQLTVAQLIN-PLGDWNESLVYELFQVNDAKDILNIPLRTRN
         +  N SYTWR+I   + + K G R+++G+G  +    DPW+        L P++  +   +V  LIN    +W   L+ ++F  ++A+ I  IPL  R 
Subjt:  GVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPL-PLKRNEDQLTVAQLIN-PLGDWNESLVYELFQVNDAKDILNIPLRTRN

Query:  KDDEIIWQGDPKGVFTVKSAYRL-------------------------------RMQNFI--CLRHLLPASQQKKLSGKHIGKHQFHLKSEAKPESLQHL
          D IIW G  KG FTVKSAYR+                               +++NFI    R++LP   Q KL  K I         E + E+  H+
Subjt:  KDDEIIWQGDPKGVFTVKSAYRL-------------------------------RMQNFI--CLRHLLPASQQKKLSGKHIGKHQFHLKSEAKPESLQHL

Query:  LWECKRTKGKWSK-FFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENL-NQQIIKEIENITRDVET
        LW C+  +  WS     +P+ +        T +D+ E       + +L    +       W +W  RN +L + Q+    ++  +  I  IE +    ++
Subjt:  LWECKRTKGKWSK-FFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENL-NQQIIKEIENITRDVET

Query:  QRHEEVKTNRRSSANHPGTDQTEASRLLKWLKPPEGAWKLNNASWCEEKQMGS---VGWILRRWDGSII-ATGF
        + H  + +  R                 +W+ P  G++K++ A  C+     +    G+++R W+G ++ A+GF
Subjt:  QRHEEVKTNRRSSANHPGTDQTEASRLLKWLKPPEGAWKLNNASWCEEKQMGS---VGWILRRWDGSII-ATGF

A0A2N9HB41 Uncharacterized protein3.1e-17532.22Show/hide
Query:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDAS-IQENGWSWRFTGFYGNPPCAQHHESWK
        L   V+   P+L+FLMET++     +++R +L       V    L GGL L W   +++ ++SYS  HIDA  IQ++G  WRFTGFYG+P  A  H SW 
Subjt:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDAS-IQENGWSWRFTGFYGNPPCAQHHESWK

Query:  LMKRLCDQLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLAL
        L++RL     +PW+L GDFNEI+                               G+ GP+FTW NN  ++  +  RLDR + +   RN      V H+ +
Subjt:  LMKRLCDQLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLAL

Query:  ISSDHRPILADWKE-EPPDSMSRRRLRRFEEVWTKYEDCKEIVKQVWSLKTESSANSII-IKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDF-LSS
          SDH  +L +      P    + R  RF+  W   E C+E++ + WS     +    +  K  +C  +L +WS+     + +  I  K++++Q+    S
Subjt:  ISSDHRPILADWKE-EPPDSMSRRRLRRFEEVWTKYEDCKEIVKQVWSLKTESSANSII-IKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDF-LSS

Query:  GEGQDTTEMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKIN-------------------------------------------
         E     E+    REL  L+  +EI+W+QR+R  WLK GDSN+++FH  AS RRK N                                           
Subjt:  GEGQDTTEMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKIN-------------------------------------------

Query:  --------KDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISL
                 + N  L + F+ EE+   + ++ P+KA GPDG+ A+F+Q+YW++V   + +  L  LN    +  +N T IVLIPK+  P +M  FRPISL
Subjt:  --------KDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISL

Query:  CSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHC
        C++IYK+I+K L NR+K +L  IIS  QSAFVPGR+ITDN ++ FE +H + +++ G+ G ++ KLDMSKAYDRVEW Y+R I+ K+GF +RW+ L+M C
Subjt:  CSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHC

Query:  VESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEMNNQV---------------------------------------------
        V SV + V++NG  +    P RGLRQGDPLSPYLFLICA+GL  L+ KAE +N +                                             
Subjt:  VESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEMNNQV---------------------------------------------

Query:  --------------------------------VQTDSLGQYLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFK
                                          T    +YLGLP    R+KK  FN ++DRV R LQGWK +L S AGRE LIK+V QA+P Y M+CFK
Subjt:  --------------------------------VQTDSLGQYLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFK

Query:  FPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG-----------------------------------------HFLKAGVGNNPSYTWR
         P+ LC+E+++M  KFWWG    +RK+HW + ++L   K  GG                                          FL+A +  NPS+ WR
Subjt:  FPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG-----------------------------------------HFLKAGVGNNPSYTWR

Query:  NIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLPLKR-NEDQLTVAQLIN-PLGDWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQGDP
        +I   + +   G  W++GSG N+    D W+        L   R  E   TV++LIN   G WN  L+ ++F   DA+ I  +PL +R   D++IW G+ 
Subjt:  NIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLPLKR-NEDQLTVAQLIN-PLGDWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQGDP

Query:  KGVFTVKSAYRLRMQ-------------------------------NFI--CLRHLLPASQQKKLSGKHIGKHQFHLKSEAKPESLQHLLWECKRTKGKW
         GV++VKSAY+L MQ                               NFI    R++LP   Q KL  K I         E +PE+  H+LW+C+  +  W
Subjt:  KGVFTVKSAYRLRMQ-------------------------------NFI--CLRHLLPASQQKKLSGKHIGKHQFHLKSEAKPESLQHLLWECKRTKGKW

Query:  SKFFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENL-NQQIIKEIENI-TRDVETQRHEEVKTNRR
        S   S+P  + A    + +  D  E   RD     L    +   + + W +W  RNE++    H N E++ N+     +E +   D E+Q          
Subjt:  SKFFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENL-NQQIIKEIENI-TRDVETQRHEEVKTNRR

Query:  SSANHPGTDQTEASRLLKWLKPPEGAWKLNNA-SWCEEKQMGSVGWILRRWDGSIIA
             P +D+       +W  P  G +KL+ A  + ++     VG +LR  DG ++A
Subjt:  SSANHPGTDQTEASRLLKWLKPPEGAWKLNNA-SWCEEKQMGSVGWILRRWDGSIIA

A0A2N9IPS8 Reverse transcriptase domain-containing protein6.5e-18132.16Show/hide
Query:  VQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASI--QENGWSWRFTGFYGNPPCAQHHESWKLMK
        ++   P ++FL ET++   G + +R  ++F+    VP  G  GGL +LW   ++V L +YS+ HIDA I  +E G  +R TGFYGNP   +  ESW L+K
Subjt:  VQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASI--QENGWSWRFTGFYGNPPCAQHHESWKLMK

Query:  RLCDQLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLALISS
         L      PW+  GDFNEI++                              GY G  +TW      N  +  RLDR + + +     +   VSHLA+ +S
Subjt:  RLCDQLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLALISS

Query:  DHRPILADWKEEPPDSMSRRRLRRFEEVWTKYEDCKEIVKQVW-SLKTESSANSIIIKTME-CLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSSGEGQ
        DH PIL D  + P     +++L RFE +W K E C+E++   W    TE S   ++++ M+ C   L  WSR  + GS+  +I RK +++Q  ++     
Subjt:  DHRPILADWKEEPPDSMSRRRLRRFEEVWTKYEDCKEIVKQVW-SLKTESSANSIIIKTME-CLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSSGEGQ

Query:  DTTEMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKIN-----------------------------------------------
         +T +   + +L  LLE +EI+W+QR+R  W+  GD N K+FH + + RR+ N                                               
Subjt:  DTTEMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKIN-----------------------------------------------

Query:  ------KDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLCS
                 N+ L + FT++E+   +K+M+PTKA GPDG+ AIFYQ YWD+V  ++ +  L  L+    + +IN T+I LIPKV +P+++ DFRPISLC+
Subjt:  ------KDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLCS

Query:  IIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCVE
        +IYK+++K LANRLK+VL  +IS +QSAFVPGRLITDN L+ FE +H++  +RKGK+G +++KLDMSKAYDRVEW ++  IM  MGF K WI L+M C+ 
Subjt:  IIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCVE

Query:  SVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEMNN-------------------------------------------------
        SV++ VL+NG     F+ +RG+RQGD LSPYLFLICA+GL  LL KA ++                                                  
Subjt:  SVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEMNN-------------------------------------------------

Query:  ----------------------------QVVQTDSLGQYLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKFP
                                    QV +  S  +YLGLPS   RSK   F  I+ RVWR + GWK +  S AGRE LIK+VAQ+IP Y M+CFK P
Subjt:  ----------------------------QVVQTDSLGQYLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKFP

Query:  ISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLG-----------------------------------------GHFLKAGVGNNPSYTWRNI
         SLCN+LN+M + FWWG  D  +K HW  W KLC +K  G                                         G F+ A +GN PSY WR+I
Subjt:  ISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLG-----------------------------------------GHFLKAGVGNNPSYTWRNI

Query:  IWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLPLKRNED-QLTVAQLINPLGD-WNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQGDPKG
           R + + G +W IG G +V    DPW+P       +  +   D + TV+ LIN     WN  ++  LF   +A+ I  IPL  R K D + W     G
Subjt:  IWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLPLKRNED-QLTVAQLINPLGD-WNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQGDPKG

Query:  VFTVKSAYRLRMQ--------------------NFICLRHLLPASQ-------------QKKLSGKHIGKHQFHLKSEAKPESLQHLLWECKRTKGKWSK
        +FTVKSAY L+++                     F+    L P  +              + L  +H+  +          ES  H LW C      W++
Subjt:  VFTVKSAYRLRMQ--------------------NFICLRHLLPASQ-------------QKKLSGKHIGKHQFHLKSEAKPESLQHLLWECKRTKGKWSK

Query:  FFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQIIKEIENITRDVETQRHEEVKTNRRSSAN
              +L  +DR      ++C+L+       +L  + ++    + + IW  RN ++++    NP  + Q+  K ++     + +          R   +
Subjt:  FFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQIIKEIENITRDVETQRHEEVKTNRRSSAN

Query:  HPGTDQTEASRLLKWLKPPEGAWKLNNASWCEEK-QMGSVGWILRRWDGSIIA
          G  Q E SR   W+ PP   +K+N  ++ +   ++ + G ++R ++G ++A
Subjt:  HPGTDQTEASRLLKWLKPPEGAWKLNNASWCEEK-QMGSVGWILRRWDGSIIA

A0A2N9J6I3 Uncharacterized protein2.4e-17532.21Show/hide
Query:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASIQEN-GWSWRFTGFYGNPPCAQHHESWK
        L + V+   PQ++FLMET++     ++IR +L  +    V      GGL LLW   +++ ++SYSK HID  +  + G SWR TGFYG+P  A  H SW+
Subjt:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASIQEN-GWSWRFTGFYGNPPCAQHHESWK

Query:  LMKRLCDQLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLAL
        L++RL       W++ GDFNEI                                G+ G +FTW NN      + ERLDR +      +      + H   
Subjt:  LMKRLCDQLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLAL

Query:  ISSDHRPILADWKEEPPDSMSRRRLR-RFEEVWTKYEDCKEIVKQVWSLK-TESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLS-S
          SDH  +L +          R++ R  FE  W + E C+E++ Q W ++   +    ++ K  +C   L  WS+      +   I+ K+  +++    +
Subjt:  ISSDHRPILADWKEEPPDSMSRRRLR-RFEEVWTKYEDCKEIVKQVWSLK-TESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLS-S

Query:  GEGQDTTEMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKIN-----KDQNNM--------------------------------
        G   +  E +   R+L  LL  +EIYW+QR+R  WL+ GD N  +FH  A+ R+K N     +D NN+                                
Subjt:  GEGQDTTEMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKIN-----KDQNNM--------------------------------

Query:  --------------LTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISL
                      L   FTREE+   + +M P+KA GPDG+ A+F+QK+W +V  D+ +  L FLN    +  +N T+I LIPKV  P+ M  FRPISL
Subjt:  --------------LTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISL

Query:  CSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHC
        C+++YK+I+K L NR+K +L  ++S SQSAFVPGR+I+DN ++ FE +H + ++R GK   +++KLDMSKAYDRVEW Y++++M K+GF  RW+ L+M C
Subjt:  CSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHC

Query:  VESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEMNNQVVQ-------------------------------------------
        V SV++ +L+NG P+    P+RGLRQGDPLSPYLFLICA+GL  LL KAE    V+Q                                           
Subjt:  VESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEMNNQVVQ-------------------------------------------

Query:  TDSLGQYLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEK
        T    +YLGLP    RSKK+ F+ I+DR+WR LQGWK +  S AG+  LIK+V QAIP Y M+CFKFP  LC E++SM  +FWWG  ++ RKIHW S +K
Subjt:  TDSLGQYLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEK

Query:  LCINKHLGG-----------------------------------------HFLKAGVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPRE
        LC  K  GG                                          F++A +  N SY WR+I   +N+ + G RW++GSG ++    D WIP  
Subjt:  LCINKHLGG-----------------------------------------HFLKAGVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPRE

Query:  GVCTPL-PLKRNEDQLTVAQLINPLG-DWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQGDPKGVFTVKSAYRLRMQNFICLRHLLPASQQKKLSG
             + P++  ++  +V  LIN     WN  L+ E+F   D + I  IPL  R   D +IW G  KG+FTVKSAY +       L H   A +    S 
Subjt:  GVCTPL-PLKRNEDQLTVAQLINPLG-DWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQGDPKGVFTVKSAYRLRMQNFICLRHLLPASQQKKLSG

Query:  KHIGKHQFHLKSEAKPESLQHLLWE-CKRTKGKWSKFFSIPNELYAYDRRSW-----TTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHK
        + +     +L S      ++   W  C+      +K F     L A    SW      T+D+  +LW       L    +   +   W +W  RNE++ +
Subjt:  KHIGKHQFHLKSEAKPESLQHLLWE-CKRTKGKWSKFFSIPNELYAYDRRSW-----TTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHK

Query:  NQHPNPENLNQQIIKEIENITRDVETQRHEEVKTNRRSSANHPGTDQTEASRLLKWLKPPEGAWKLNNA-SWCEEKQMGSVGWILRRWDGSIIATGFQVV
        ++  N  ++    ++  EN    +E    E +  + R           E     +W+ P    +KLN A  W   ++   +G ++R  +G ++A     +
Subjt:  NQHPNPENLNQQIIKEIENITRDVETQRHEEVKTNRRSSANHPGTDQTEASRLLKWLKPPEGAWKLNNA-SWCEEKQMGSVGWILRRWDGSIIATGFQVV

Query:  NLLTRKEVDVTEISLFI
        +++   E+D     L +
Subjt:  NLLTRKEVDVTEISLFI

A0A7N2LIH6 Uncharacterized protein1.6e-18432.86Show/hide
Query:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASIQ--ENGWSWRFTGFYGNPPCAQHHESW
        L   V+   P L+FL+ETK         + +L F  GI VPS G SGGL LLW    ++  +S S  HID  +    +G  WR TGFYG+P   + + SW
Subjt:  LRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASIQ--ENGWSWRFTGFYGNPPCAQHHESW

Query:  KLMKRLCDQLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLA
        KL++ L  Q EMPW++ GDFNEI+                               G+ GP+FTWCN    +     RLDR + N A        KV H++
Subjt:  KLMKRLCDQLEMPWVLGGDFNEIIE------------------------------GYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLA

Query:  LISSDHRPILADWKEEPPDSMSRRRLRRFEEVWTKYEDCKEIVKQVWSLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSSGE
        + +SDH  +LA +  +  +    ++   FEE+WT+ E+CKEIV+  W    E SA  +  +   C   L QW++ ++ G++   I +K+  +Q   S   
Subjt:  LISSDHRPILADWKEEPPDSMSRRRLRRFEEVWTKYEDCKEIVKQVWSLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSSGE

Query:  GQDTT-EMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKINK-------------------------------------------
          +T  E++  ++E+  L   +E+ WKQR+R  WL++GD N+K+FH  AS RR+ N+                                           
Subjt:  GQDTT-EMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKINK-------------------------------------------

Query:  --------DQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLC
                + N+ L   F   E+++ +++MHPTKA GPDG+  IFYQKYWD+V + +    L+ LN      +INKTYI LIPK  +P+ + +FRPISLC
Subjt:  --------DQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLC

Query:  SIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCV
        ++IYK+I+K LANRLK+VLH +I  +QSAFVPGR+ITDN ++ FE +H++  RRKGKEG+++IKLDMSKAYDRVEW Y+  +M KMGF  RWI L+M CV
Subjt:  SIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCV

Query:  ESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEMNNQV----------------------------------------VQTDSL
         SV+F VL+NG P+  F+P+RGLRQGDP+SPYLFL+C +GL  ++ K E    +                                        V  +  
Subjt:  ESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNKAEMNNQV----------------------------------------VQTDSL

Query:  GQ-------------------------------------YLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKF
        GQ                                     YLGLP    R+KK+ FN I+D+V R + GWKG+L S AGRE LIK+VAQA P Y MN FK 
Subjt:  GQ-------------------------------------YLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKF

Query:  PISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG-----------------------------------------HFLKAGVGNNPSYTWRN
        P SLC ELNSM   FWWG    ++K+ W SW+ LC  K  GG                                          F++A +G  PSY WR+
Subjt:  PISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG-----------------------------------------HFLKAGVGNNPSYTWRN

Query:  IIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLPLKRNEDQ-LTVAQLIN-PLGDWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQGDPK
        I+  +N+ K+G RW +G G ++      W+P       +  +    Q   VA LI+   G+W  +LV + F  ++A++IL+IPL + N  D ++W   P 
Subjt:  IIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLPLKRNEDQ-LTVAQLIN-PLGDWNESLVYELFQVNDAKDILNIPLRTRNKDDEIIWQGDPK

Query:  GVFTVKSAYRLRMQNFI----------------------------C---LRHLLPASQQKKLSGKHIGKHQFHLKSE-----AKPESLQHLLWECKRTKG
        G FTVKSAYR   +  +                            C   ++H L  + +  L  K    H+  +K +      + E+  H LW C   K 
Subjt:  GVFTVKSAYRLRMQNFI----------------------------C---LRHLLPASQQKKLSGKHIGKHQFHLKSE-----AKPESLQHLLWECKRTKG

Query:  KWSKF-FSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQIIKEIENITRDVETQRHEEVKTNR
         W    F+I N  +         +++ +++W    +    +       I+ W +W  RN V H             + K+ ++IT +    R EEV+T  
Subjt:  KWSKF-FSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQIIKEIENITRDVETQRHEEVKTNR

Query:  RSSANHPGTDQTEASRLLKWLKPPEGAWKLN-NASWCEEKQMGSVGWILRRWDGSII
         +    P        +  +W  P +  +K+N +A+   E+    +G ++R   G I+
Subjt:  RSSANHPGTDQTEASRLLKWLKPPEGAWKLN-NASWCEEKQMGSVGWILRRWDGSII

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.0e-2325.98Show/hide
Query:  RRLRRFEEVWTKYEDCKEIVKQVWSLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAI----SRKEKEIQDFLSSGEGQDTTEMKRKERELENLLE
        + L + E+  +K    +EI K    LK   +      KT++ +N    W     N   R        ++EK   D + + +G  TT+      E++  + 
Subjt:  RRLRRFEEVWTKYEDCKEIVKQVWSLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAI----SRKEKEIQDFLSSGEGQDTTEMKRKERELENLLE

Query:  DDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKINKDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLN--ERG
        +   Y+K     +     + +     +      ++N+++   L    T  EI  ++  +   K+ GPDG  A FYQ+Y    + ++  F LK     E+ 
Subjt:  DDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKINKDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLN--ERG

Query:  NI--SEINKTYIVLIPKV-NDPKSMKDFRPISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLD
         I  +   +  I+LIPK   D    ++FRPISL +I  K++ K LANR++Q +  +I   Q  F+PG     N       I  + +R K K  +I I +D
Subjt:  NI--SEINKTYIVLIPKV-NDPKSMKDFRPISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLD

Query:  MSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNK
          KA+D+++  ++ + + K+G    ++ ++    +     ++LNG   E F    G RQG PLSP LF I  + L R + +
Subjt:  MSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNK

P0C2F6 Putative ribonuclease H protein At1g657502.8e-1625Show/hide
Query:  LPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG
        +P    R  K+ F  I +RV   + GW+ +  S AGR TL K+V  ++P + M+    P S+ N L+ +   F WGS   K+K H   W K+C  K  GG
Subjt:  LPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG

Query:  HFLKAGVGNNP-------------------------------------------SYTWRNIIWG-RNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPL
          ++A    N                                            S TWR+I  G R++   G  W  G G  +   TD W+  + +   L
Subjt:  HFLKAGVGNNP-------------------------------------------SYTWRNIIWG-RNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPL

Query:  PLKRNE-----DQLTVAQLINPLGDWNESLVYELFQVNDAKDILNIPL-RTRNKDDEIIWQGDPKGVFTVKSAYRL
         L   E     D +    L  P   W+ + +      N   ++  + L       D + W+    G F+V+SAY +
Subjt:  PLKRNE-----DQLTVAQLINPLGDWNESLVYELFQVNDAKDILNIPL-RTRNKDDEIIWQGDPKGVFTVKSAYRL

P11369 LINE-1 retrotransposable element ORF2 protein3.6e-2723.08Show/hide
Query:  SWKLMKRLCDQLEMPWVLGGDFNEIIEGYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFK--VSHLALISSDHRPILADWKEEPPDSMSRRRL
        +WKL   L +   +   +  +  + +E       T+ N       +W+ +  FL    +    +  K   +H + +++  + +      E  ++ S +R 
Subjt:  SWKLMKRLCDQLEMPWVLGGDFNEIIEGYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFK--VSHLALISSDHRPILADWKEEPPDSMSRRRL

Query:  RRFEEVWTKYEDCKEIVKQVWSLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSSGEGQDTTEMKRKERELENLLEDDEIYWK
        RR E +  + E     + QV + +T    N       E +N++D+   R   G        ++K + + + + +G  TT+      E++N +      + 
Subjt:  RRFEEVWTKYEDCKEIVKQVWSLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSSGEGQDTTEMKRKERELENLLEDDEIYWK

Query:  QRAREEWLKWGDSNNKWFHIKASNRRKINKDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTY
        +R     L+  D  +K+  +      K+N+DQ + L S  + +EI  V+  +   K+ GPDG  A FYQ + + +   + +   K   E    +   +  
Subjt:  QRAREEWLKWGDSNNKWFHIKASNRRKINKDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTY

Query:  IVLIPK-VNDPKSMKDFRPISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWT
        I LIPK   DP  +++FRPISL +I  K++ K LANR+++ +  II P Q  F+PG     N       IH + ++ K K  +I I LD  KA+D+++  
Subjt:  IVLIPK-VNDPKSMKDFRPISLCSIIYKVIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWT

Query:  YIRQIMGKMGFCKRWIHLVMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNK------AEMNNQVVQ----TDSLGQYLGLP
        ++ +++ + G    +++++          + +NG   E      G RQG PLSPYLF I  + L R + +       ++  + V+     D +  Y+  P
Subjt:  YIRQIMGKMGFCKRWIHLVMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNK------AEMNNQVVQ----TDSLGQYLGLP

Query:  SQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIP-NYVMNCFKF-PISLCNELNSMCAK
          + R    + N   + V   +   K   F     +   K + +  P + V N  K+  ++L  E+  +  K
Subjt:  SQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIP-NYVMNCFKF-PISLCNELNSMCAK

P14381 Transposon TX1 uncharacterized 149 kDa protein6.7e-2630.58Show/hide
Query:  INKDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLCSIIYK
        +++ +   L +  T +E+ + ++ M   K+ G DG+   F+Q +WD +  D      +   +        +  + L+PK  D + +K++RP+SL S  YK
Subjt:  INKDQNNMLTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLCSIIYK

Query:  VIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCVESVNF
        ++AKA++ RLK VL  +I P QS  VPGR I DN  L  + +H   +RR G   +  + LD  KA+DRV+  Y+   +    F  +++  +     S   
Subjt:  VIAKALANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCVESVNF

Query:  QVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNK
         V +N +     +  RG+RQG PLS  L+ +  +  L LL K
Subjt:  QVLLNGTPREDFSPNRGLRQGDPLSPYLFLICAKGLLRLLNK

P93295 Uncharacterized mitochondrial protein AtMg003106.3e-1632.68Show/hide
Query:  AIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG------------------------------------------HFLK
        A+P Y M+CF+    LC +L S   +FWW S ++KRKI W +W+KLC +K   G                                            ++
Subjt:  AIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG------------------------------------------HFLK

Query:  AGVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLP
          VG  PSY WR+II GR L  +G    IG GI+     D WI  E   TPLP
Subjt:  AGVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLP

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.9e-0822.73Show/hide
Query:  GPKFTWCNNHVNNGTIWERLDRFLIN----PAMRNKCNYFKVSHLALISSDHRPILADWKEEPPDSMSRRRLRRFEEVWT----------KYEDCKEIVK
        G  +TW +NH ++  I  +LDR + N     +  +    F++S +    SDH P +   +  P    S++  R F  + T           +E+   +  
Subjt:  GPKFTWCNNHVNNGTIWERLDRFLIN----PAMRNKCNYFKVSHLALISSDHRPILADWKEEPPDSMSRRRLRRFEEVWT----------KYEDCKEIVK

Query:  QVWSLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSSGEGQDTTEMKRKERELENLLEDD--------EIYWKQRAREEWLKW
         ++SL     A     K  + LNR              G I  K KE  D L S + Q  T        +E++            E +++Q++R +WL+ 
Subjt:  QVWSLKTESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSSGEGQDTTEMKRKERELENLLEDD--------EIYWKQRAREEWLKW

Query:  GDSNNKWFH-IKASNRRK-------------------------------INKDQNNMLTSSFTR-----------------------EEIYEVVKRMHPT
        GD+N ++FH +  +N+ K                               +  D + +   S  R                       +EI   V  M   
Subjt:  GDSNNKWFH-IKASNRRK-------------------------------INKDQNNMLTSSFTR-----------------------EEIYEVVKRMHPT

Query:  KALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLCSIIYKVI
        KA GPD   A F+ + W +V++       +F      +   N T I LIPKV     +  FRP+S C+++YK+I
Subjt:  KALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLCSIIYKVI

AT3G09510.1 Ribonuclease H-like superfamily protein1.2e-0921.79Show/hide
Query:  LKAGVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVA----VTDPWIPREGVCTPLPLKRNEDQLTVAQLINPLGD---WNESLVYELFQVNDAKDIL
        L A V    SY W +++ G  L K+G R  IG G N+      + D   PR     PL  +    ++T+  L    G    W++S + +    +D   I 
Subjt:  LKAGVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVA----VTDPWIPREGVCTPLPLKRNEDQLTVAQLINPLGD---WNESLVYELFQVNDAKDIL

Query:  NIPLRTRNKDDEIIWQGDPKGVFTVKSAYRL--------------------------------RMQNFI--CLRHLLPASQQKKLSGKHIGKHQFHLKSE
         I L    K D+IIW  +  G +TV+S Y L                                ++++F+   L   L  +++    G  I       +  
Subjt:  NIPLRTRNKDDEIIWQGDPKGVFTVKSAYRL--------------------------------RMQNFI--CLRHLLPASQQKKLSGKHIGKHQFHLKSE

Query:  AKPESLQHLLWECKRTKGKW--SKFFSIPNELYAYD-----------RRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNP
         + ES+ H L+ C      W  S    I N+L + D            +  T  D+ +LL                 + + W+IW  RN V+      +P
Subjt:  AKPESLQHLLWECKRTKGKW--SKFFSIPNELYAYD-----------RRSWTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNP

Query:  ENLNQQIIKEIENITRDVETQRHEEVKTNRRSSANHPGTDQTEASRLLKWLKPPEGAWKLN-NASWCEEKQMGSVGWILRRWDGSIIATG
                          + + H+ +    +S    P   +  A   ++W  PP    K N +A +  +K   + GWI+R   G+ I+ G
Subjt:  ENLNQQIIKEIENITRDVETQRHEEVKTNRRSSANHPGTDQTEASRLLKWLKPPEGAWKLN-NASWCEEKQMGSVGWILRRWDGSIIATG

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.4e-1339.76Show/hide
Query:  LANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWI
        +  RLK ++  +I P+Q++F+PGR+ TDN +   E +H+ M R+KG +G + +KLD+ KAYDR+ W Y+   +   GF + W+
Subjt:  LANRLKQVLHCIISPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWI

AT4G29090.1 Ribonuclease H-like superfamily protein3.3e-2822.55Show/hide
Query:  AIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG------------------------------------HF-----LKA
        A+P Y M CF  P ++C ++ S+ A FWW +    + +HW++W+ L   K  GG                                    +F     L A
Subjt:  AIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG------------------------------------HF-----LKA

Query:  GVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLPLKRNEDQ--------LTVAQLINPLG-DWNESLVYELFQVNDAKDILN
         +G+ PS+ W++I   + + +QG R  +G+G +++     W+  +     L ++R   Q        L V+ LI+  G +W + ++  LF   + K I  
Subjt:  GVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLPLKRNEDQ--------LTVAQLINPLG-DWNESLVYELFQVNDAKDILN

Query:  IPLRTRNKDDEIIWQGDPKGVFTVKSAYRL---------------------------------RMQNFI--CLRHLLPASQQKKLSGKHIGKHQFHLKSE
        +    R   D   W     G +TVKS Y +                                 ++Q+F+  CL + LP +    L+ +H+ K    ++  
Subjt:  IPLRTRNKDDEIIWQGDPKGVFTVKSAYRL---------------------------------RMQNFI--CLRHLLPASQQKKLSGKHIGKHQFHLKSE

Query:  AKPESLQHLLWECKRTKGKWSKFFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLD------EDRIRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQI
        +  E++ HLL++C   +  W+   SIP  L       W    Y  L W    ++ L       E   +    + W++W  RNE++ + +  N     Q++
Subjt:  AKPESLQHLLWECKRTKGKWSKFFSIPNELYAYDRRSWTTMDYCELLWRDGRIYKLD------EDRIRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQI

Query:  IKEIENITRDVETQRHEEVKTNRRSSANHPGTDQTEASRLLKWLKPPEGAWKLN-NASWCEEKQMGSVGWILRRWDGSI
        ++  E+   D+E  R   ++T   S    P  +++   R   W  PP    K N +A+W  + +   +GW+LR   G +
Subjt:  IKEIENITRDVETQRHEEVKTNRRSSANHPGTDQTEASRLLKWLKPPEGAWKLN-NASWCEEKQMGSVGWILRRWDGSI

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein4.5e-1732.68Show/hide
Query:  AIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG------------------------------------------HFLK
        A+P Y M+CF+    LC +L S   +FWW S ++KRKI W +W+KLC +K   G                                            ++
Subjt:  AIPNYVMNCFKFPISLCNELNSMCAKFWWGSGDSKRKIHWRSWEKLCINKHLGG------------------------------------------HFLK

Query:  AGVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLP
          VG  PSY WR+II GR L  +G    IG GI+     D WI  E   TPLP
Subjt:  AGVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTCGGCATACCGTCCAGACGCATAGGCCCCAACTGATTTTCTTAATGGAAACCAAAATGAAAAACAATGGAAGTGACTATATTAGGCGACAGTTACAGTTTGAGTT
TGGTATTGAAGTCCCTAGCACAGGACTAAGCGGCGGTTTGCTTCTTCTGTGGGGGACAGATATAAACGTGACCTTGCGCTCTTATTCGAAAGGGCATATTGATGCTAGTA
TCCAAGAGAATGGTTGGTCCTGGCGGTTCACAGGCTTCTATGGTAATCCACCTTGTGCTCAGCACCATGAATCATGGAAGTTAATGAAGCGTCTATGTGACCAACTCGAG
ATGCCTTGGGTTCTGGGAGGAGATTTTAATGAGATTATAGAGGGTTATTCTGGTCCTAAGTTTACTTGGTGTAATAACCATGTAAACAATGGAACTATCTGGGAGAGGCT
AGACCGGTTCCTCATTAACCCGGCAATGCGAAACAAGTGCAACTACTTCAAAGTCTCACATTTAGCCCTTATTTCCTCTGATCACAGACCAATATTGGCTGATTGGAAGG
AAGAGCCTCCTGACAGCATGTCTCGGAGACGCCTAAGACGGTTTGAGGAGGTATGGACAAAGTATGAGGATTGCAAGGAGATTGTTAAGCAGGTTTGGAGCCTGAAAACA
GAGAGTTCTGCCAACAGTATTATTATTAAAACAATGGAATGTCTGAATCGTTTAGACCAGTGGAGTAGACGAAACTATAATGGCTCAATCAGAGGGGCCATATCGAGAAA
GGAGAAAGAAATCCAAGACTTTTTGTCTTCAGGCGAGGGTCAAGACACTACAGAGATGAAGAGGAAAGAAAGAGAGCTTGAAAACCTGCTAGAAGATGACGAGATCTACT
GGAAGCAGCGAGCTAGGGAGGAATGGCTAAAGTGGGGTGACAGTAACAATAAATGGTTCCATATTAAGGCTAGTAACAGAAGGAAGATAAATAAGGACCAGAACAACATG
CTCACATCCTCTTTTACTAGGGAGGAAATATATGAGGTGGTCAAGAGAATGCACCCTACAAAAGCTCTAGGACCAGACGGGATTCAAGCTATCTTCTACCAGAAATACTG
GGATTTGGTGAGGAACGATATAGGTGAGTTTTGTTTAAAATTTCTGAATGAAAGAGGGAACATTTCTGAGATCAACAAGACCTACATTGTGTTAATTCCAAAAGTTAACG
ACCCCAAGTCTATGAAAGACTTCAGGCCCATAAGCTTATGCTCGATAATCTATAAAGTGATCGCCAAGGCCCTTGCAAACAGGTTGAAACAGGTTTTACATTGCATCATA
TCTCCAAGCCAGTCTGCGTTTGTCCCTGGCAGACTCATAACTGATAATGCACTATTAGGTTTTGAATGTATCCATGCGGTTATGAGCAGAAGGAAAGGTAAGGAGGGTAT
TATATCCATCAAGCTGGACATGAGTAAAGCTTATGATCGGGTTGAATGGACCTACATTCGTCAAATTATGGGCAAAATGGGCTTTTGCAAGAGATGGATACATCTGGTTA
TGCATTGTGTTGAATCAGTCAATTTCCAGGTGCTCCTAAATGGGACGCCCCGCGAAGACTTCTCCCCAAATCGAGGACTTAGACAAGGGGATCCGTTGTCCCCATACTTA
TTTCTAATTTGTGCTAAAGGTCTGTTAAGACTCCTTAACAAAGCAGAGATGAACAACCAGGTTGTCCAAACAGACAGTCTAGGCCAGTACCTTGGCCTCCCATCTCAAAA
TGCTCGTAGCAAGAAAGAGATTTTTAATCACATTAGAGATAGAGTTTGGAGAGCGTTGCAAGGATGGAAAGGGAGATTATTCTCAACAGCTGGCAGGGAGACCCTGATCA
AATCTGTGGCCCAAGCCATTCCAAACTACGTGATGAACTGTTTTAAATTTCCTATATCTCTGTGTAATGAGTTAAATTCGATGTGTGCCAAGTTCTGGTGGGGATCAGGG
GACTCTAAAAGGAAGATTCACTGGAGAAGCTGGGAGAAATTGTGCATCAACAAACATCTGGGTGGGCACTTTTTGAAAGCTGGTGTGGGAAATAACCCATCCTACACATG
GAGAAACATCATTTGGGGAAGAAATCTCTTCAAACAAGGATACAGATGGCAAATTGGCAGCGGTATAAACGTGGTGGCTGTGACTGACCCATGGATTCCAAGGGAAGGTG
TCTGTACACCTCTTCCTCTCAAAAGGAATGAGGATCAACTGACAGTAGCCCAACTTATCAATCCCCTTGGAGATTGGAACGAAAGCTTGGTTTATGAGTTGTTCCAAGTC
AATGACGCGAAGGACATTTTAAATATTCCACTCAGAACGAGAAACAAAGATGACGAGATTATATGGCAAGGAGACCCAAAAGGAGTTTTCACAGTGAAGAGTGCTTATAG
ATTGAGGATGCAGAACTTCATTTGTCTGAGGCATCTACTTCCAGCCAGTCAACAAAAGAAACTCTCTGGAAAACATATTGGAAAACACCAATTCCACCTAAAATCAGAGG
CAAAACCAGAAAGCCTACAACATCTACTTTGGGAATGCAAGAGAACCAAAGGTAAGTGGTCTAAGTTTTTTTCTATCCCTAATGAATTATATGCCTATGACAGGAGGAGC
TGGACGACGATGGACTACTGTGAGCTACTATGGAGGGATGGCAGAATTTACAAGCTGGACGAAGACAGAATTAGAAGAAGCATGATTATTTGTTGGAAAATCTGGTGCTA
TAGGAATGAAGTTTTACACAAGAATCAACATCCTAATCCAGAGAACCTTAACCAGCAGATAATCAAAGAAATTGAGAATATTACCAGGGATGTGGAGACACAGAGACACG
AAGAAGTGAAGACGAACCGGCGATCTTCTGCGAACCACCCCGGCACAGACCAGACTGAGGCGAGTCGGCTGCTCAAATGGCTGAAGCCTCCGGAAGGCGCCTGGAAACTC
AACAATGCCTCCTGGTGCGAAGAGAAGCAGATGGGCAGCGTCGGCTGGATCCTTAGACGTTGGGATGGGTCGATTATTGCAACAGGATTCCAGGTGGTAAACTTACTCAC
AAGGAAGGAGGTGGATGTCACAGAAATCTCGTTGTTCATAGATGAAGCTCAACAAATAATTCGCCTGAAGAAGATTCACTCGATCCATCACGTGGCAAGATGCCACAACC
ACATGGCCCATCAACTTGCCCGTTTAGCTTGGTCCAACAATCAATCTGAAATTTTTAAGGGGCCTATTTCCTGTATGGCTTTTAAATATGAACACTTTGGACACTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTCGGCATACCGTCCAGACGCATAGGCCCCAACTGATTTTCTTAATGGAAACCAAAATGAAAAACAATGGAAGTGACTATATTAGGCGACAGTTACAGTTTGAGTT
TGGTATTGAAGTCCCTAGCACAGGACTAAGCGGCGGTTTGCTTCTTCTGTGGGGGACAGATATAAACGTGACCTTGCGCTCTTATTCGAAAGGGCATATTGATGCTAGTA
TCCAAGAGAATGGTTGGTCCTGGCGGTTCACAGGCTTCTATGGTAATCCACCTTGTGCTCAGCACCATGAATCATGGAAGTTAATGAAGCGTCTATGTGACCAACTCGAG
ATGCCTTGGGTTCTGGGAGGAGATTTTAATGAGATTATAGAGGGTTATTCTGGTCCTAAGTTTACTTGGTGTAATAACCATGTAAACAATGGAACTATCTGGGAGAGGCT
AGACCGGTTCCTCATTAACCCGGCAATGCGAAACAAGTGCAACTACTTCAAAGTCTCACATTTAGCCCTTATTTCCTCTGATCACAGACCAATATTGGCTGATTGGAAGG
AAGAGCCTCCTGACAGCATGTCTCGGAGACGCCTAAGACGGTTTGAGGAGGTATGGACAAAGTATGAGGATTGCAAGGAGATTGTTAAGCAGGTTTGGAGCCTGAAAACA
GAGAGTTCTGCCAACAGTATTATTATTAAAACAATGGAATGTCTGAATCGTTTAGACCAGTGGAGTAGACGAAACTATAATGGCTCAATCAGAGGGGCCATATCGAGAAA
GGAGAAAGAAATCCAAGACTTTTTGTCTTCAGGCGAGGGTCAAGACACTACAGAGATGAAGAGGAAAGAAAGAGAGCTTGAAAACCTGCTAGAAGATGACGAGATCTACT
GGAAGCAGCGAGCTAGGGAGGAATGGCTAAAGTGGGGTGACAGTAACAATAAATGGTTCCATATTAAGGCTAGTAACAGAAGGAAGATAAATAAGGACCAGAACAACATG
CTCACATCCTCTTTTACTAGGGAGGAAATATATGAGGTGGTCAAGAGAATGCACCCTACAAAAGCTCTAGGACCAGACGGGATTCAAGCTATCTTCTACCAGAAATACTG
GGATTTGGTGAGGAACGATATAGGTGAGTTTTGTTTAAAATTTCTGAATGAAAGAGGGAACATTTCTGAGATCAACAAGACCTACATTGTGTTAATTCCAAAAGTTAACG
ACCCCAAGTCTATGAAAGACTTCAGGCCCATAAGCTTATGCTCGATAATCTATAAAGTGATCGCCAAGGCCCTTGCAAACAGGTTGAAACAGGTTTTACATTGCATCATA
TCTCCAAGCCAGTCTGCGTTTGTCCCTGGCAGACTCATAACTGATAATGCACTATTAGGTTTTGAATGTATCCATGCGGTTATGAGCAGAAGGAAAGGTAAGGAGGGTAT
TATATCCATCAAGCTGGACATGAGTAAAGCTTATGATCGGGTTGAATGGACCTACATTCGTCAAATTATGGGCAAAATGGGCTTTTGCAAGAGATGGATACATCTGGTTA
TGCATTGTGTTGAATCAGTCAATTTCCAGGTGCTCCTAAATGGGACGCCCCGCGAAGACTTCTCCCCAAATCGAGGACTTAGACAAGGGGATCCGTTGTCCCCATACTTA
TTTCTAATTTGTGCTAAAGGTCTGTTAAGACTCCTTAACAAAGCAGAGATGAACAACCAGGTTGTCCAAACAGACAGTCTAGGCCAGTACCTTGGCCTCCCATCTCAAAA
TGCTCGTAGCAAGAAAGAGATTTTTAATCACATTAGAGATAGAGTTTGGAGAGCGTTGCAAGGATGGAAAGGGAGATTATTCTCAACAGCTGGCAGGGAGACCCTGATCA
AATCTGTGGCCCAAGCCATTCCAAACTACGTGATGAACTGTTTTAAATTTCCTATATCTCTGTGTAATGAGTTAAATTCGATGTGTGCCAAGTTCTGGTGGGGATCAGGG
GACTCTAAAAGGAAGATTCACTGGAGAAGCTGGGAGAAATTGTGCATCAACAAACATCTGGGTGGGCACTTTTTGAAAGCTGGTGTGGGAAATAACCCATCCTACACATG
GAGAAACATCATTTGGGGAAGAAATCTCTTCAAACAAGGATACAGATGGCAAATTGGCAGCGGTATAAACGTGGTGGCTGTGACTGACCCATGGATTCCAAGGGAAGGTG
TCTGTACACCTCTTCCTCTCAAAAGGAATGAGGATCAACTGACAGTAGCCCAACTTATCAATCCCCTTGGAGATTGGAACGAAAGCTTGGTTTATGAGTTGTTCCAAGTC
AATGACGCGAAGGACATTTTAAATATTCCACTCAGAACGAGAAACAAAGATGACGAGATTATATGGCAAGGAGACCCAAAAGGAGTTTTCACAGTGAAGAGTGCTTATAG
ATTGAGGATGCAGAACTTCATTTGTCTGAGGCATCTACTTCCAGCCAGTCAACAAAAGAAACTCTCTGGAAAACATATTGGAAAACACCAATTCCACCTAAAATCAGAGG
CAAAACCAGAAAGCCTACAACATCTACTTTGGGAATGCAAGAGAACCAAAGGTAAGTGGTCTAAGTTTTTTTCTATCCCTAATGAATTATATGCCTATGACAGGAGGAGC
TGGACGACGATGGACTACTGTGAGCTACTATGGAGGGATGGCAGAATTTACAAGCTGGACGAAGACAGAATTAGAAGAAGCATGATTATTTGTTGGAAAATCTGGTGCTA
TAGGAATGAAGTTTTACACAAGAATCAACATCCTAATCCAGAGAACCTTAACCAGCAGATAATCAAAGAAATTGAGAATATTACCAGGGATGTGGAGACACAGAGACACG
AAGAAGTGAAGACGAACCGGCGATCTTCTGCGAACCACCCCGGCACAGACCAGACTGAGGCGAGTCGGCTGCTCAAATGGCTGAAGCCTCCGGAAGGCGCCTGGAAACTC
AACAATGCCTCCTGGTGCGAAGAGAAGCAGATGGGCAGCGTCGGCTGGATCCTTAGACGTTGGGATGGGTCGATTATTGCAACAGGATTCCAGGTGGTAAACTTACTCAC
AAGGAAGGAGGTGGATGTCACAGAAATCTCGTTGTTCATAGATGAAGCTCAACAAATAATTCGCCTGAAGAAGATTCACTCGATCCATCACGTGGCAAGATGCCACAACC
ACATGGCCCATCAACTTGCCCGTTTAGCTTGGTCCAACAATCAATCTGAAATTTTTAAGGGGCCTATTTCCTGTATGGCTTTTAAATATGAACACTTTGGACACTGA
Protein sequenceShow/hide protein sequence
MLRHTVQTHRPQLIFLMETKMKNNGSDYIRRQLQFEFGIEVPSTGLSGGLLLLWGTDINVTLRSYSKGHIDASIQENGWSWRFTGFYGNPPCAQHHESWKLMKRLCDQLE
MPWVLGGDFNEIIEGYSGPKFTWCNNHVNNGTIWERLDRFLINPAMRNKCNYFKVSHLALISSDHRPILADWKEEPPDSMSRRRLRRFEEVWTKYEDCKEIVKQVWSLKT
ESSANSIIIKTMECLNRLDQWSRRNYNGSIRGAISRKEKEIQDFLSSGEGQDTTEMKRKERELENLLEDDEIYWKQRAREEWLKWGDSNNKWFHIKASNRRKINKDQNNM
LTSSFTREEIYEVVKRMHPTKALGPDGIQAIFYQKYWDLVRNDIGEFCLKFLNERGNISEINKTYIVLIPKVNDPKSMKDFRPISLCSIIYKVIAKALANRLKQVLHCII
SPSQSAFVPGRLITDNALLGFECIHAVMSRRKGKEGIISIKLDMSKAYDRVEWTYIRQIMGKMGFCKRWIHLVMHCVESVNFQVLLNGTPREDFSPNRGLRQGDPLSPYL
FLICAKGLLRLLNKAEMNNQVVQTDSLGQYLGLPSQNARSKKEIFNHIRDRVWRALQGWKGRLFSTAGRETLIKSVAQAIPNYVMNCFKFPISLCNELNSMCAKFWWGSG
DSKRKIHWRSWEKLCINKHLGGHFLKAGVGNNPSYTWRNIIWGRNLFKQGYRWQIGSGINVVAVTDPWIPREGVCTPLPLKRNEDQLTVAQLINPLGDWNESLVYELFQV
NDAKDILNIPLRTRNKDDEIIWQGDPKGVFTVKSAYRLRMQNFICLRHLLPASQQKKLSGKHIGKHQFHLKSEAKPESLQHLLWECKRTKGKWSKFFSIPNELYAYDRRS
WTTMDYCELLWRDGRIYKLDEDRIRRSMIICWKIWCYRNEVLHKNQHPNPENLNQQIIKEIENITRDVETQRHEEVKTNRRSSANHPGTDQTEASRLLKWLKPPEGAWKL
NNASWCEEKQMGSVGWILRRWDGSIIATGFQVVNLLTRKEVDVTEISLFIDEAQQIIRLKKIHSIHHVARCHNHMAHQLARLAWSNNQSEIFKGPISCMAFKYEHFGH