CuGenDBv2

Clc04G02525 (gene) of Watermelon (cordophanus) v2 genome

Gene ID: Clc04G02525
Organism: Citrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: ClcChr04:8234657..8237765
RNA-Seq Expression: Clc04G02525
Synteny: Clc04G02525
Gene Ontology terms: NA
InterPro domains: IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits    e value    % identity    Alignment
KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]    6.8e-86    26.92
Query:  MWDDLRHTVTSSLGKEFSILANVLMSNGFSWWVMIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGF
        MW+D   ++ S     FS+   V  +NG  WW+  IYG AK K+R  F EE+  L + C P WILGGDFNVIRW+ E +  N A  +M++FNS +++   
Subjt:  MWDDLRHTVTSSLGKEFSILANVLMSNGFSWWVMIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGF

Query:  IEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYPLILENSQLCWGPS--------LSDSTIFNKNVKDWWDSTSQVGFPG
        I+ PL+N K+TWSNLRA A LSRLDRFL+T QWE+ F  H ++ L + TSDH+P++LE+S + WGPS        L D   + KN++ WW +TSQ G+ G
Subjt:  IEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYPLILENSQLCWGPS--------LSDSTIFNKNVKDWWDSTSQVGFPG

Query:  YAFNRTLKQLSSIIKNWQARQKKVTNEEKK--------SIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTV
        Y+F R LKQL+ IIK W  R KK  NE  K         IDKLE++ + ++I   +RT+LK DL+     EAQ WAQ+ K +W+ + DEN +FF K+CT 
Subjt:  YAFNRTLKQLSSIIKNWQARQKKVTNEEKK--------SIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTV

Query:  RQRRSFIYEILDENGNSCITNSS----------------------IENI--CP--------------------------------------------YAT
        RQ++  I +I++ +G +C+ +S                       IEN+  CP                                            ++ 
Subjt:  RQRRSFIYEILDENGNSCITNSS----------------------IENI--CP--------------------------------------------YAT

Query:  FQRTIC-------------------------------------------------------------------------------------------WQT
         ++ IC                                                                                           W++
Subjt:  FQRTIC-------------------------------------------------------------------------------------------WQT

Query:  -----------------------------------------------------------------------------------------------KRSVN
                                                                                                       KR +N
Subjt:  -----------------------------------------------------------------------------------------------KRSVN

Query:  -----------------------------------IIKTFEVAIKLKINMNKSTLSAVNVSRTGVEEIVKRWGLTHHFLPISYLGVPFGGKPHSKAFWES
                                           I+  FE A  L IN++KST+  +NV     + I   WG++   LP SYLG+P GG+P S  FW++
Subjt:  -----------------------------------IIKTFEVAIKLKINMNKSTLSAVNVSRTGVEEIVKRWGLTHHFLPISYLGVPFGGKPHSKAFWES

Query:  VYKNIEK----------------------------------------------HWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLC
        V + I+K                                               W NFL    + G++  L+ W  I S K KGG  I+++  TNF+LLC
Subjt:  VYKNIEK----------------------------------------------HWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLC

Query:  KWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGLDWFKTNLS
        KWLW+F  E + LWKRLI++KY    +   P+  ++ S  SPW ++ + + WF  N+S
Subjt:  KWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGLDWFKTNLS

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]    2.7e-90    28.41
Query:  RQFVENEDELENKINEEDFKKELIAWLRENNLKLT-----------------------------------PANELK---TKANGRCDGIILMWDDLRHTV
        + +   ++E E   + E FKK+L++WL++N LKL+                                   P+N +      A+G   GI+++WD   H++
Subjt:  RQFVENEDELENKINEEDFKKELIAWLRENNLKLT-----------------------------------PANELK---TKANGRCDGIILMWDDLRHTV

Query:  TSSLGKEFSILANVLMSNGFSWWVMIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGFIEHPLTNGK
         S     FS+ AN L++N  SWW+  +YG  K ++R  F  E+ +L +  S  WILGGD NVIR   E ++   + +N +  N+ +++   I+ PLTN +
Subjt:  TSSLGKEFSILANVLMSNGFSWWVMIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGFIEHPLTNGK

Query:  FTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYPLILE--NSQLCWGP--------SLSDSTIFNKNVKDWWDSTSQVGFPGYAFNRTL
        FTWSNLR     SR+DRFLY   WE+ F+ H TRTL + TSDH+PL+ E  N +L WGP        +LSD   F +N+  WW+++ Q G+PG++F + L
Subjt:  FTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYPLILE--NSQLCWGP--------SLSDSTIFNKNVKDWWDSTSQVGFPGYAFNRTL

Query:  KQLSSIIKNWQARQ-------KKVTNEEKKSIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFIY
        K L++ IK WQ  +       K+    E  SIDK E    L+  +S RR +LK DL+  + KE+Q+W QR K LWL++ DEN +FF ++C+ RQ+RSFI+
Subjt:  KQLSSIIKNWQARQ-------KKVTNEEKKSIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFIY

Query:  EILDENGNSCITNSS-----------------------IENI--------------CPY----------------------------------ATFQRTI
        EI DE G+   TN+S                       IEN+               P+                                   T   TI
Subjt:  EILDENGNSCITNSS-----------------------IENI--------------CPY----------------------------------ATFQRTI

Query:  C-------------------------WQTKR---------------------------------------------------------------------
                                  W+ K+                                                                     
Subjt:  C-------------------------WQTKR---------------------------------------------------------------------

Query:  -------------------------------SVN------------------------------IIKTFEVAIKLKINMNKSTLSAVNVSRTGVEEIVKR
                                       S+N                               +  FE A  LKIN+ KS L  VNVS    +E    
Subjt:  -------------------------------SVN------------------------------IIKTFEVAIKLKINMNKSTLSAVNVSRTGVEEIVKR

Query:  WGLTHHFLPISYLGVPFGGKPHSKAFWESV----------------------------------------------YKNIEKHWGNFLRKDKNGGYSTHL
        WG++ H LP+SYLGVP GG P S  FW +V                                               KNIEK W  FL K  NG   +HL
Subjt:  WGLTHHFLPISYLGVPFGGKPHSKAFWESV----------------------------------------------YKNIEKHWGNFLRKDKNGGYSTHL

Query:  VNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGLDWFKTNLS
        +NW  ++  K +GG  I+ L  TN +LL KWLWR+  E N LW+RLI  KYK  +  +IP+     ++K+PW SI+   DWFK+N S
Subjt:  VNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGLDWFKTNLS

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]    9.2e-83    27.75
Query:  NRNNRQFVENEDELENKINEEDFKKELIAWLRENNLKL----------TPANELKTKANGRCDGIILMWDDLRHTVTSSLGKEFSILANVLMSNGFSWWV
        +   R +   +++ E   N E FK +L+ WL+EN LKL          T  N L ++      GI+++WD   H++ S    +FS+ AN    N  SWW+
Subjt:  NRNNRQFVENEDELENKINEEDFKKELIAWLRENNLKL----------TPANELKTKANGRCDGIILMWDDLRHTVTSSLGKEFSILANVLMSNGFSWWV

Query:  MIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGFIEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQW
          +YG  K ++R    E++ +L++  S  WI+GGD NV+R   E +A   + ++    N  +++   I+ PLTN ++TWSNLR     SRLDRFLY  +W
Subjt:  MIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGFIEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQW

Query:  EDRFTIHFTRTLAKITSDHYPLILEN--SQLCWGP--------SLSDSTIFNKNVKDWWDSTSQVGFPGYAFNRTLKQLSSIIKNWQARQ-------KKV
        E  F  H TRTL + TSDH+PL+ E+  S L WGP        +L+D   F +N++ WW+ + Q G PG+ F + LK L+++IK WQ  +       K+ 
Subjt:  EDRFTIHFTRTLAKITSDHYPLILEN--SQLCWGP--------SLSDSTIFNKNVKDWWDSTSQVGFPGYAFNRTLKQLSSIIKNWQARQ-------KKV

Query:  TNEEKKSIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFIYEILDENGNSCITNSS---------
           E  SIDK E    LS  +S RR +LK +LN  + KE+Q+W QR K LWL++ DEN AFF ++C+ RQ+R+ I+EI DE G+   TN++         
Subjt:  TNEEKKSIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFIYEILDENGNSCITNSS---------

Query:  --------------IENI---------------------------------------CPYATFQ------------------------------------
                      IEN+                                        P + F+                                    
Subjt:  --------------IENI---------------------------------------CPYATFQ------------------------------------

Query:  -----------RTICWQTKRSVNIIKTFEVAIKLKIN-----------MNKSTLSAV-----------------------------NVSRTGVEEIVK--
                   R I   T     I KT    +KL +             N+    A+                             N++   ++ ++K  
Subjt:  -----------RTICWQTKRSVNIIKTFEVAIKLKIN-----------MNKSTLSAV-----------------------------NVSRTGVEEIVK--

Query:  -------RW-------------------------------------------------------------GLTHHFLPISYLGVPFGGKPHSKAFW----
               +W                                                             G+  H LP++YLGVP GG P S  FW    
Subjt:  -------RW-------------------------------------------------------------GLTHHFLPISYLGVPFGGKPHSKAFW----

Query:  ------------------------------------------ESVYKNIEKHWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKW
                                                   S YKNIEK W NFL K   G   +HL+NW+ +T  K +GG  I+ L+ TN +LL KW
Subjt:  ------------------------------------------ESVYKNIEKHWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKW

Query:  LWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGLDWFKTN
        LWR++ E N LW+RLI  KYK  +  ++P+     S+K+PW SI+  +DWFK+N
Subjt:  LWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGLDWFKTN

TYK06777.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]    1.1e-99    30.57
Query:  MWDDLRHTVTSSLGKEFSILANVLMSNGFS---WWVMIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNS
        MWDDLR  VT  +   FS+  N+   +G S   WW+  IYG +  ++R  F  E+  L N CSP W+L GDFNV+R+ +E SA N +K++M+ FN  +  
Subjt:  MWDDLRHTVTSSLGKEFSILANVLMSNGFS---WWVMIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNS

Query:  LGFIEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYPLILENSQLCWGPS-------LSDSTIFNKNVKDWWDSTSQVGF
           I+ PL+N KFTWSNLR H VLSR+DRFLYT  WE+ FT H+++TL+++TSDH+P++LE+S + WGPS             F  N+ +WW +  Q G 
Subjt:  LGFIEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYPLILENSQLCWGPS-------LSDSTIFNKNVKDWWDSTSQVGF

Query:  PGYAFNRTLKQLSSIIKNWQARQKKVTNEEKK-------SIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCT
        PG++F R LKQLS+II+N Q + K  ++E+K        SID+LE++ NLS+  S RRT LK D+ +   KEAQ W Q+ K LW+ + DEN +FF K+C+
Subjt:  PGYAFNRTLKQLSSIIKNWQARQKKVTNEEKK-------SIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCT

Query:  VRQRRSFIYEILDENGNSCITNSSIE-------------------------NICPYATFQRTIC------------------------------------
         RQRRS I  I   +G  C TN SI                          N  P +T Q  I                                     
Subjt:  VRQRRSFIYEILDENGNSCITNSSIE-------------------------NICPYATFQRTIC------------------------------------

Query:  ------------------------------------------------------------------------WQTKR-----------------------
                                                                                W+ K+                       
Subjt:  ------------------------------------------------------------------------WQTKR-----------------------

Query:  ------------------------------------------------------------------SV--------------------------------
                                                                          SV                                
Subjt:  ------------------------------------------------------------------SV--------------------------------

Query:  -------NIIKTFEVAIKLKINMNKSTLSAVNVSRTGVEEIVKRWGLTHHFLPISYLGVPFGGKPHSKAFWESV--------------------------
               NII  F++A  L IN+NKST+S +NV  +  E+I  +WG++  FLPI+YLGVP GGK  +KAFW++V                          
Subjt:  -------NIIKTFEVAIKLKINMNKSTLSAVNVSRTGVEEIVKRWGLTHHFLPISYLGVPFGGKPHSKAFWESV--------------------------

Query:  --------------------YKNIEKHWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYV
                             KNIEK W NFL K+    +  HLVNWA ITS K KGG  I+ L+DTNF+LL KWLWR+  ED+ LWK++I AKY+    
Subjt:  --------------------YKNIEKHWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYV

Query:  VEIPTTSRYCSTKSPWMSIVKGLDWFKTNLS
         +IP    + S++SPW SI KGL+WF+ ++S
Subjt:  VEIPTTSRYCSTKSPWMSIVKGLDWFKTNLS

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]    1.2e-82    26.34
Query:  FKKELIAWLRENNLKLTP--ANELKTKAN------------------GRCDGIILMWDDLRHTVTSSLGKEFSILANVLMSNGFSWWVMIIYGHAKIKDR
        FK++L+ WL+EN LKL+P   N++ + +                   G   GI+++WDD    V       +SI  N+L +NG +WW+  +YG  K  DR
Subjt:  FKKELIAWLRENNLKLTP--ANELKTKAN------------------GRCDGIILMWDDLRHTVTSSLGKEFSILANVLMSNGFSWWVMIIYGHAKIKDR

Query:  NKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGFIEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTL
         K   E+  L + C PNW++ GDFN++RWE E +A +L K NM  FN+ ++    I+ P  N  FTWSNLR +   SRLDRFL +  WE+ F +H +RTL
Subjt:  NKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGFIEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTL

Query:  AKITSDHYPLILENSQLCWGP---SLSDSTI----FNKNVKDWWDSTSQVGFPGYAFNRTLKQLSSIIKNWQARQKKVTNEEKKS-------IDKLESQQ
         +  SDH+P++LE+ Q+ WGP    L++S++    F KN  +WW+S+ Q GFPGYAF ++L  LS  IK WQ  +  + +  KK+       IDKLE Q 
Subjt:  AKITSDHYPLILENSQLCWGP---SLSDSTI----FNKNVKDWWDSTSQVGFPGYAFNRTLKQLSSIIKNWQARQKKVTNEEKKS-------IDKLESQQ

Query:  NLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFIYEILDENGNSC-----------------------------
         +S     +R SLK DL S    +AQ W QR +  W    DEN ++F ++CT+ QR++ I  I D  G S                              
Subjt:  NLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFIYEILDENGNSC-----------------------------

Query:  ---------------------------------------------------------------------ITNSSIENI----------CPYATFQRTICW
                                                                             I N+++ N           C   +  R I  
Subjt:  ---------------------------------------------------------------------ITNSSIENI----------CPYATFQRTICW

Query:  QTK-------------------------------RSVN--------IIKT--------------------------------------------------
         T                                R +N        +I T                                                  
Subjt:  QTK-------------------------------RSVN--------IIKT--------------------------------------------------

Query:  --------------------------------------------------------------------------------------------FEVAIKLK
                                                                                                    FE A  L 
Subjt:  --------------------------------------------------------------------------------------------FEVAIKLK

Query:  INMNKSTLSAVNVSRTGVEEIVKRWGLTHHFLPISYLGVPFGGKPHSKAFWE----------------------------------------------SV
         N +KST+S +N+S    ++I   +G    FLP++YLGVP GG P S++FW+                                              SV
Subjt:  INMNKSTLSAVNVSRTGVEEIVKRWGLTHHFLPISYLGVPFGGKPHSKAFWE----------------------------------------------SV

Query:  YKNIEKHWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIV
        YK IEKHW +FL        + HL+NW   TS K  GG  I+ L+DTN +LLCKWLWR+H E N LWK+ I AKY  N+  +IP   R  S  SPW +I 
Subjt:  YKNIEKHWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIV

Query:  KGLDWFKTNLS
        K  DW+++ +S
Subjt:  KGLDWFKTNLS

TrEMBL top hits    e value    % identity    Alignment
A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein    3.3e-86    26.92
Query:  MWDDLRHTVTSSLGKEFSILANVLMSNGFSWWVMIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGF
        MW+D   ++ S     FS+   V  +NG  WW+  IYG AK K+R  F EE+  L + C P WILGGDFNVIRW+ E +  N A  +M++FNS +++   
Subjt:  MWDDLRHTVTSSLGKEFSILANVLMSNGFSWWVMIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGF

Query:  IEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYPLILENSQLCWGPS--------LSDSTIFNKNVKDWWDSTSQVGFPG
        I+ PL+N K+TWSNLRA A LSRLDRFL+T QWE+ F  H ++ L + TSDH+P++LE+S + WGPS        L D   + KN++ WW +TSQ G+ G
Subjt:  IEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYPLILENSQLCWGPS--------LSDSTIFNKNVKDWWDSTSQVGFPG

Query:  YAFNRTLKQLSSIIKNWQARQKKVTNEEKK--------SIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTV
        Y+F R LKQL+ IIK W  R KK  NE  K         IDKLE++ + ++I   +RT+LK DL+     EAQ WAQ+ K +W+ + DEN +FF K+CT 
Subjt:  YAFNRTLKQLSSIIKNWQARQKKVTNEEKK--------SIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTV

Query:  RQRRSFIYEILDENGNSCITNSS----------------------IENI--CP--------------------------------------------YAT
        RQ++  I +I++ +G +C+ +S                       IEN+  CP                                            ++ 
Subjt:  RQRRSFIYEILDENGNSCITNSS----------------------IENI--CP--------------------------------------------YAT

Query:  FQRTIC-------------------------------------------------------------------------------------------WQT
         ++ IC                                                                                           W++
Subjt:  FQRTIC-------------------------------------------------------------------------------------------WQT

Query:  -----------------------------------------------------------------------------------------------KRSVN
                                                                                                       KR +N
Subjt:  -----------------------------------------------------------------------------------------------KRSVN

Query:  -----------------------------------IIKTFEVAIKLKINMNKSTLSAVNVSRTGVEEIVKRWGLTHHFLPISYLGVPFGGKPHSKAFWES
                                           I+  FE A  L IN++KST+  +NV     + I   WG++   LP SYLG+P GG+P S  FW++
Subjt:  -----------------------------------IIKTFEVAIKLKINMNKSTLSAVNVSRTGVEEIVKRWGLTHHFLPISYLGVPFGGKPHSKAFWES

Query:  VYKNIEK----------------------------------------------HWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLC
        V + I+K                                               W NFL    + G++  L+ W  I S K KGG  I+++  TNF+LLC
Subjt:  VYKNIEK----------------------------------------------HWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLC

Query:  KWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGLDWFKTNLS
        KWLW+F  E + LWKRLI++KY    +   P+  ++ S  SPW ++ + + WF  N+S
Subjt:  KWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGLDWFKTNLS

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein    4.4e-83    27.75
Query:  NRNNRQFVENEDELENKINEEDFKKELIAWLRENNLKL----------TPANELKTKANGRCDGIILMWDDLRHTVTSSLGKEFSILANVLMSNGFSWWV
        +   R +   +++ E   N E FK +L+ WL+EN LKL          T  N L ++      GI+++WD   H++ S    +FS+ AN    N  SWW+
Subjt:  NRNNRQFVENEDELENKINEEDFKKELIAWLRENNLKL----------TPANELKTKANGRCDGIILMWDDLRHTVTSSLGKEFSILANVLMSNGFSWWV

Query:  MIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGFIEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQW
          +YG  K ++R    E++ +L++  S  WI+GGD NV+R   E +A   + ++    N  +++   I+ PLTN ++TWSNLR     SRLDRFLY  +W
Subjt:  MIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGFIEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQW

Query:  EDRFTIHFTRTLAKITSDHYPLILEN--SQLCWGP--------SLSDSTIFNKNVKDWWDSTSQVGFPGYAFNRTLKQLSSIIKNWQARQ-------KKV
        E  F  H TRTL + TSDH+PL+ E+  S L WGP        +L+D   F +N++ WW+ + Q G PG+ F + LK L+++IK WQ  +       K+ 
Subjt:  EDRFTIHFTRTLAKITSDHYPLILEN--SQLCWGP--------SLSDSTIFNKNVKDWWDSTSQVGFPGYAFNRTLKQLSSIIKNWQARQ-------KKV

Query:  TNEEKKSIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFIYEILDENGNSCITNSS---------
           E  SIDK E    LS  +S RR +LK +LN  + KE+Q+W QR K LWL++ DEN AFF ++C+ RQ+R+ I+EI DE G+   TN++         
Subjt:  TNEEKKSIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFIYEILDENGNSCITNSS---------

Query:  --------------IENI---------------------------------------CPYATFQ------------------------------------
                      IEN+                                        P + F+                                    
Subjt:  --------------IENI---------------------------------------CPYATFQ------------------------------------

Query:  -----------RTICWQTKRSVNIIKTFEVAIKLKIN-----------MNKSTLSAV-----------------------------NVSRTGVEEIVK--
                   R I   T     I KT    +KL +             N+    A+                             N++   ++ ++K  
Subjt:  -----------RTICWQTKRSVNIIKTFEVAIKLKIN-----------MNKSTLSAV-----------------------------NVSRTGVEEIVK--

Query:  -------RW-------------------------------------------------------------GLTHHFLPISYLGVPFGGKPHSKAFW----
               +W                                                             G+  H LP++YLGVP GG P S  FW    
Subjt:  -------RW-------------------------------------------------------------GLTHHFLPISYLGVPFGGKPHSKAFW----

Query:  ------------------------------------------ESVYKNIEKHWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKW
                                                   S YKNIEK W NFL K   G   +HL+NW+ +T  K +GG  I+ L+ TN +LL KW
Subjt:  ------------------------------------------ESVYKNIEKHWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKW

Query:  LWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGLDWFKTN
        LWR++ E N LW+RLI  KYK  +  ++P+     S+K+PW SI+  +DWFK+N
Subjt:  LWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGLDWFKTN

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein    1.3e-90    28.41
Query:  RQFVENEDELENKINEEDFKKELIAWLRENNLKLT-----------------------------------PANELK---TKANGRCDGIILMWDDLRHTV
        + +   ++E E   + E FKK+L++WL++N LKL+                                   P+N +      A+G   GI+++WD   H++
Subjt:  RQFVENEDELENKINEEDFKKELIAWLRENNLKLT-----------------------------------PANELK---TKANGRCDGIILMWDDLRHTV

Query:  TSSLGKEFSILANVLMSNGFSWWVMIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGFIEHPLTNGK
         S     FS+ AN L++N  SWW+  +YG  K ++R  F  E+ +L +  S  WILGGD NVIR   E ++   + +N +  N+ +++   I+ PLTN +
Subjt:  TSSLGKEFSILANVLMSNGFSWWVMIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGFIEHPLTNGK

Query:  FTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYPLILE--NSQLCWGP--------SLSDSTIFNKNVKDWWDSTSQVGFPGYAFNRTL
        FTWSNLR     SR+DRFLY   WE+ F+ H TRTL + TSDH+PL+ E  N +L WGP        +LSD   F +N+  WW+++ Q G+PG++F + L
Subjt:  FTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYPLILE--NSQLCWGP--------SLSDSTIFNKNVKDWWDSTSQVGFPGYAFNRTL

Query:  KQLSSIIKNWQARQ-------KKVTNEEKKSIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFIY
        K L++ IK WQ  +       K+    E  SIDK E    L+  +S RR +LK DL+  + KE+Q+W QR K LWL++ DEN +FF ++C+ RQ+RSFI+
Subjt:  KQLSSIIKNWQARQ-------KKVTNEEKKSIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFIY

Query:  EILDENGNSCITNSS-----------------------IENI--------------CPY----------------------------------ATFQRTI
        EI DE G+   TN+S                       IEN+               P+                                   T   TI
Subjt:  EILDENGNSCITNSS-----------------------IENI--------------CPY----------------------------------ATFQRTI

Query:  C-------------------------WQTKR---------------------------------------------------------------------
                                  W+ K+                                                                     
Subjt:  C-------------------------WQTKR---------------------------------------------------------------------

Query:  -------------------------------SVN------------------------------IIKTFEVAIKLKINMNKSTLSAVNVSRTGVEEIVKR
                                       S+N                               +  FE A  LKIN+ KS L  VNVS    +E    
Subjt:  -------------------------------SVN------------------------------IIKTFEVAIKLKINMNKSTLSAVNVSRTGVEEIVKR

Query:  WGLTHHFLPISYLGVPFGGKPHSKAFWESV----------------------------------------------YKNIEKHWGNFLRKDKNGGYSTHL
        WG++ H LP+SYLGVP GG P S  FW +V                                               KNIEK W  FL K  NG   +HL
Subjt:  WGLTHHFLPISYLGVPFGGKPHSKAFWESV----------------------------------------------YKNIEKHWGNFLRKDKNGGYSTHL

Query:  VNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGLDWFKTNLS
        +NW  ++  K +GG  I+ L  TN +LL KWLWR+  E N LW+RLI  KYK  +  +IP+     ++K+PW SI+   DWFK+N S
Subjt:  VNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGLDWFKTNLS

A0A5D3C4J1 LINE-1 retrotransposable element ORF2 protein    5.2e-100    30.57
Query:  MWDDLRHTVTSSLGKEFSILANVLMSNGFS---WWVMIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNS
        MWDDLR  VT  +   FS+  N+   +G S   WW+  IYG +  ++R  F  E+  L N CSP W+L GDFNV+R+ +E SA N +K++M+ FN  +  
Subjt:  MWDDLRHTVTSSLGKEFSILANVLMSNGFS---WWVMIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNS

Query:  LGFIEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYPLILENSQLCWGPS-------LSDSTIFNKNVKDWWDSTSQVGF
           I+ PL+N KFTWSNLR H VLSR+DRFLYT  WE+ FT H+++TL+++TSDH+P++LE+S + WGPS             F  N+ +WW +  Q G 
Subjt:  LGFIEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYPLILENSQLCWGPS-------LSDSTIFNKNVKDWWDSTSQVGF

Query:  PGYAFNRTLKQLSSIIKNWQARQKKVTNEEKK-------SIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCT
        PG++F R LKQLS+II+N Q + K  ++E+K        SID+LE++ NLS+  S RRT LK D+ +   KEAQ W Q+ K LW+ + DEN +FF K+C+
Subjt:  PGYAFNRTLKQLSSIIKNWQARQKKVTNEEKK-------SIDKLESQQNLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCT

Query:  VRQRRSFIYEILDENGNSCITNSSIE-------------------------NICPYATFQRTIC------------------------------------
         RQRRS I  I   +G  C TN SI                          N  P +T Q  I                                     
Subjt:  VRQRRSFIYEILDENGNSCITNSSIE-------------------------NICPYATFQRTIC------------------------------------

Query:  ------------------------------------------------------------------------WQTKR-----------------------
                                                                                W+ K+                       
Subjt:  ------------------------------------------------------------------------WQTKR-----------------------

Query:  ------------------------------------------------------------------SV--------------------------------
                                                                          SV                                
Subjt:  ------------------------------------------------------------------SV--------------------------------

Query:  -------NIIKTFEVAIKLKINMNKSTLSAVNVSRTGVEEIVKRWGLTHHFLPISYLGVPFGGKPHSKAFWESV--------------------------
               NII  F++A  L IN+NKST+S +NV  +  E+I  +WG++  FLPI+YLGVP GGK  +KAFW++V                          
Subjt:  -------NIIKTFEVAIKLKINMNKSTLSAVNVSRTGVEEIVKRWGLTHHFLPISYLGVPFGGKPHSKAFWESV--------------------------

Query:  --------------------YKNIEKHWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYV
                             KNIEK W NFL K+    +  HLVNWA ITS K KGG  I+ L+DTNF+LL KWLWR+  ED+ LWK++I AKY+    
Subjt:  --------------------YKNIEKHWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYV

Query:  VEIPTTSRYCSTKSPWMSIVKGLDWFKTNLS
         +IP    + S++SPW SI KGL+WF+ ++S
Subjt:  VEIPTTSRYCSTKSPWMSIVKGLDWFKTNLS

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein    5.8e-83    26.34
Query:  FKKELIAWLRENNLKLTP--ANELKTKAN------------------GRCDGIILMWDDLRHTVTSSLGKEFSILANVLMSNGFSWWVMIIYGHAKIKDR
        FK++L+ WL+EN LKL+P   N++ + +                   G   GI+++WDD    V       +SI  N+L +NG +WW+  +YG  K  DR
Subjt:  FKKELIAWLRENNLKLTP--ANELKTKAN------------------GRCDGIILMWDDLRHTVTSSLGKEFSILANVLMSNGFSWWVMIIYGHAKIKDR

Query:  NKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGFIEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTL
         K   E+  L + C PNW++ GDFN++RWE E +A +L K NM  FN+ ++    I+ P  N  FTWSNLR +   SRLDRFL +  WE+ F +H +RTL
Subjt:  NKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGFIEHPLTNGKFTWSNLRAHAVLSRLDRFLYTPQWEDRFTIHFTRTL

Query:  AKITSDHYPLILENSQLCWGP---SLSDSTI----FNKNVKDWWDSTSQVGFPGYAFNRTLKQLSSIIKNWQARQKKVTNEEKKS-------IDKLESQQ
         +  SDH+P++LE+ Q+ WGP    L++S++    F KN  +WW+S+ Q GFPGYAF ++L  LS  IK WQ  +  + +  KK+       IDKLE Q 
Subjt:  AKITSDHYPLILENSQLCWGP---SLSDSTI----FNKNVKDWWDSTSQVGFPGYAFNRTLKQLSSIIKNWQARQKKVTNEEKKS-------IDKLESQQ

Query:  NLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFIYEILDENGNSC-----------------------------
         +S     +R SLK DL S    +AQ W QR +  W    DEN ++F ++CT+ QR++ I  I D  G S                              
Subjt:  NLSDIDSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFIYEILDENGNSC-----------------------------

Query:  ---------------------------------------------------------------------ITNSSIENI----------CPYATFQRTICW
                                                                             I N+++ N           C   +  R I  
Subjt:  ---------------------------------------------------------------------ITNSSIENI----------CPYATFQRTICW

Query:  QTK-------------------------------RSVN--------IIKT--------------------------------------------------
         T                                R +N        +I T                                                  
Subjt:  QTK-------------------------------RSVN--------IIKT--------------------------------------------------

Query:  --------------------------------------------------------------------------------------------FEVAIKLK
                                                                                                    FE A  L 
Subjt:  --------------------------------------------------------------------------------------------FEVAIKLK

Query:  INMNKSTLSAVNVSRTGVEEIVKRWGLTHHFLPISYLGVPFGGKPHSKAFWE----------------------------------------------SV
         N +KST+S +N+S    ++I   +G    FLP++YLGVP GG P S++FW+                                              SV
Subjt:  INMNKSTLSAVNVSRTGVEEIVKRWGLTHHFLPISYLGVPFGGKPHSKAFWE----------------------------------------------SV

Query:  YKNIEKHWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIV
        YK IEKHW +FL        + HL+NW   TS K  GG  I+ L+DTN +LLCKWLWR+H E N LWK+ I AKY  N+  +IP   R  S  SPW +I 
Subjt:  YKNIEKHWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIV

Query:  KGLDWFKTNLS
        K  DW+++ +S
Subjt:  KGLDWFKTNLS

SwissProt top hits    e value    % identity    Alignment
P0C2F6 Putative ribonuclease H protein At1g65750    6.1e-05    31.25
Query:  WGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGL
        WG+   K K      HLV W+ + S K +GG  +   +  N +L+ K  WR  +E N LW  ++  KY    + +        S  S W SI  GL
Subjt:  WGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFHEEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGL

Arabidopsis top hits    e value    % identity    Alignment
AT1G43760.1 DNAse I-like superfamily protein    6.1e-08    25.2
Query:  ILGGDFNVIRWENE---ISATNLAKYNMKKFNSVMNSLGFIEHPLTNGKFTWSNLR-AHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYP--LIL
        IL GDF+ I   ++   +  T++    +++F + +     ++ P     +TWSN +  + ++ +LDR +    W   F            SDH P  +IL
Subjt:  ILGGDFNVIRWENE---ISATNLAKYNMKKFNSVMNSLGFIEHPLTNGKFTWSNLR-AHAVLSRLDRFLYTPQWEDRFTIHFTRTLAKITSDHYP--LIL

Query:  EN----SQLC--WGPSLSDSTIFNKNVKDWWDSTSQVGFPGYAFNRTLKQLSSIIK--NWQ--ARQKKVTNEEKKSIDKLESQQNLSDIDSARRTS--LK
        EN    S+ C  +   LS    F  ++   W+    VG   ++    LK      K  N Q     +  T E   S++ ++SQ   +  DS  R     +
Subjt:  EN----SQLC--WGPSLSDSTIFNKNVKDWWDSTSQVGFPGYAFNRTLKQLSSIIK--NWQ--ARQKKVTNEEKKSIDKLESQQNLSDIDSARRTS--LK

Query:  IDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFI
           N  A     ++ Q+ +  WLQD D N  FF KV    Q ++ I
Subjt:  IDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFI


Sequences
CDS sequence
ATGGACTTAGACCCTTTATCTCCTCTATTAGACACCTTGATCTCAAGCCCAGACAACCATATGGCCTCAGATTCATTACCATTTCCTTCACAATTAGACATACTTATGGA
CAACCGCAACAATAGACAGTTCGTGGAAAATGAAGATGAATTGGAGAATAAGATCAATGAAGAAGACTTCAAAAAAGAGCTCATTGCTTGGCTCAGAGAAAACAATCTCA
AATTGACTCCAGCCAACGAACTCAAAACCAAAGCCAATGGCAGATGCGATGGCATCATTCTCATGTGGGATGATCTTCGACACACAGTAACAAGTTCATTGGGAAAGGAA
TTTTCTATCTTAGCTAATGTCTTAATGTCTAATGGCTTCAGCTGGTGGGTCATGATTATATATGGGCATGCCAAAATAAAAGATAGAAACAAATTTCTAGAGGAGATCTC
TAGTCTTAACAACACGTGCTCGCCGAATTGGATATTGGGAGGCGACTTCAATGTCATTAGGTGGGAAAATGAGATATCAGCCACTAACCTAGCTAAATACAACATGAAGA
AGTTTAACTCAGTTATGAATAGTCTTGGCTTCATTGAACATCCCCTTACAAATGGCAAGTTTACATGGTCAAATCTCAGAGCTCACGCTGTTTTATCAAGATTGGACCGA
TTTCTATATACTCCTCAATGGGAAGATAGATTCACCATTCATTTTACTAGAACATTGGCTAAAATTACCTCAGACCACTATCCTCTCATTCTTGAGAATTCACAACTTTG
TTGGGGCCCTAGCCTTTCCGATTCAACAATCTTTAACAAAAATGTAAAAGATTGGTGGGACTCTACTAGTCAAGTTGGATTCCCAGGTTATGCTTTTAACAGGACACTCA
AACAACTCTCAAGTATAATAAAAAATTGGCAGGCAAGACAGAAAAAGGTGACAAATGAAGAGAAGAAATCGATTGATAAGCTTGAAAGCCAACAAAACCTTAGCGATATC
GATAGTGCTCGAAGAACTTCGTTGAAAATTGATCTCAACTCCGAGGCTACTAAAGAAGCTCAATATTGGGCTCAAAGATATAAAAGCCTTTGGCTACAAGATGAAGACGA
AAATTTAGCTTTTTTCGACAAAGTCTGTACGGTTAGACAGCGCAGAAGTTTTATTTATGAAATACTTGATGAAAATGGTAACAGTTGCATCACCAATAGCTCTATAGAAA
ATATTTGTCCTTATGCAACATTTCAAAGAACTATATGTTGGCAAACCAAAAGATCAGTGAACATCATCAAAACCTTCGAAGTTGCAATCAAGCTCAAGATCAACATGAAC
AAGTCCACACTATCAGCCGTAAATGTCAGTAGAACCGGAGTTGAGGAGATAGTAAAAAGATGGGGCTTAACTCACCATTTTCTCCCTATAAGCTACTTGGGAGTCCCTTT
TGGAGGCAAGCCACATTCCAAAGCTTTTTGGGAATCGGTTTACAAAAACATTGAAAAGCACTGGGGAAACTTCCTTCGGAAAGACAAAAATGGTGGTTATAGTACACACC
TCGTTAATTGGGCTACGATAACTTCCCTAAAAAACAAAGGTGGTTTCGACATAAACAATTTGGAAGACACTAACTTCTCCCTGCTCTGCAAATGGCTTTGGAGATTCCAT
GAAGAGGACAACCGTCTCTGGAAGAGACTGATCCTTGCTAAATACAAGCACAATTATGTAGTGGAAATTCCCACAACAAGTAGATACTGTAGCACTAAATCCCCTTGGAT
GTCTATTGTCAAAGGTCTCGACTGGTTCAAAACCAACTTATCTTGA
mRNA sequence
ATGGACTTAGACCCTTTATCTCCTCTATTAGACACCTTGATCTCAAGCCCAGACAACCATATGGCCTCAGATTCATTACCATTTCCTTCACAATTAGACATACTTATGGA
CAACCGCAACAATAGACAGTTCGTGGAAAATGAAGATGAATTGGAGAATAAGATCAATGAAGAAGACTTCAAAAAAGAGCTCATTGCTTGGCTCAGAGAAAACAATCTCA
AATTGACTCCAGCCAACGAACTCAAAACCAAAGCCAATGGCAGATGCGATGGCATCATTCTCATGTGGGATGATCTTCGACACACAGTAACAAGTTCATTGGGAAAGGAA
TTTTCTATCTTAGCTAATGTCTTAATGTCTAATGGCTTCAGCTGGTGGGTCATGATTATATATGGGCATGCCAAAATAAAAGATAGAAACAAATTTCTAGAGGAGATCTC
TAGTCTTAACAACACGTGCTCGCCGAATTGGATATTGGGAGGCGACTTCAATGTCATTAGGTGGGAAAATGAGATATCAGCCACTAACCTAGCTAAATACAACATGAAGA
AGTTTAACTCAGTTATGAATAGTCTTGGCTTCATTGAACATCCCCTTACAAATGGCAAGTTTACATGGTCAAATCTCAGAGCTCACGCTGTTTTATCAAGATTGGACCGA
TTTCTATATACTCCTCAATGGGAAGATAGATTCACCATTCATTTTACTAGAACATTGGCTAAAATTACCTCAGACCACTATCCTCTCATTCTTGAGAATTCACAACTTTG
TTGGGGCCCTAGCCTTTCCGATTCAACAATCTTTAACAAAAATGTAAAAGATTGGTGGGACTCTACTAGTCAAGTTGGATTCCCAGGTTATGCTTTTAACAGGACACTCA
AACAACTCTCAAGTATAATAAAAAATTGGCAGGCAAGACAGAAAAAGGTGACAAATGAAGAGAAGAAATCGATTGATAAGCTTGAAAGCCAACAAAACCTTAGCGATATC
GATAGTGCTCGAAGAACTTCGTTGAAAATTGATCTCAACTCCGAGGCTACTAAAGAAGCTCAATATTGGGCTCAAAGATATAAAAGCCTTTGGCTACAAGATGAAGACGA
AAATTTAGCTTTTTTCGACAAAGTCTGTACGGTTAGACAGCGCAGAAGTTTTATTTATGAAATACTTGATGAAAATGGTAACAGTTGCATCACCAATAGCTCTATAGAAA
ATATTTGTCCTTATGCAACATTTCAAAGAACTATATGTTGGCAAACCAAAAGATCAGTGAACATCATCAAAACCTTCGAAGTTGCAATCAAGCTCAAGATCAACATGAAC
AAGTCCACACTATCAGCCGTAAATGTCAGTAGAACCGGAGTTGAGGAGATAGTAAAAAGATGGGGCTTAACTCACCATTTTCTCCCTATAAGCTACTTGGGAGTCCCTTT
TGGAGGCAAGCCACATTCCAAAGCTTTTTGGGAATCGGTTTACAAAAACATTGAAAAGCACTGGGGAAACTTCCTTCGGAAAGACAAAAATGGTGGTTATAGTACACACC
TCGTTAATTGGGCTACGATAACTTCCCTAAAAAACAAAGGTGGTTTCGACATAAACAATTTGGAAGACACTAACTTCTCCCTGCTCTGCAAATGGCTTTGGAGATTCCAT
GAAGAGGACAACCGTCTCTGGAAGAGACTGATCCTTGCTAAATACAAGCACAATTATGTAGTGGAAATTCCCACAACAAGTAGATACTGTAGCACTAAATCCCCTTGGAT
GTCTATTGTCAAAGGTCTCGACTGGTTCAAAACCAACTTATCTTGA
Protein sequence
MDLDPLSPLLDTLISSPDNHMASDSLPFPSQLDILMDNRNNRQFVENEDELENKINEEDFKKELIAWLRENNLKLTPANELKTKANGRCDGIILMWDDLRHTVTSSLGKE
FSILANVLMSNGFSWWVMIIYGHAKIKDRNKFLEEISSLNNTCSPNWILGGDFNVIRWENEISATNLAKYNMKKFNSVMNSLGFIEHPLTNGKFTWSNLRAHAVLSRLDR
FLYTPQWEDRFTIHFTRTLAKITSDHYPLILENSQLCWGPSLSDSTIFNKNVKDWWDSTSQVGFPGYAFNRTLKQLSSIIKNWQARQKKVTNEEKKSIDKLESQQNLSDI
DSARRTSLKIDLNSEATKEAQYWAQRYKSLWLQDEDENLAFFDKVCTVRQRRSFIYEILDENGNSCITNSSIENICPYATFQRTICWQTKRSVNIIKTFEVAIKLKINMN
KSTLSAVNVSRTGVEEIVKRWGLTHHFLPISYLGVPFGGKPHSKAFWESVYKNIEKHWGNFLRKDKNGGYSTHLVNWATITSLKNKGGFDINNLEDTNFSLLCKWLWRFH
EEDNRLWKRLILAKYKHNYVVEIPTTSRYCSTKSPWMSIVKGLDWFKTNLS