; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025197 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025197
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationchr10:9687816..9692272
RNA-Seq ExpressionLag0025197
SyntenyLag0025197
Gene Ontology termsGO:0003824 - catalytic activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039309.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]1.4e-7121.43Show/hide
Query:  RKCVAERKLLSIDFSKERNKRLVKISEKNHDLRFEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSNNRGVFVEIAKVASSGRRDNL
        R C  ERK   +   K        ++E      F + +  + + WI  ++  L+ TP S +FF +       IW++K  N +G   EI +V    R+  +
Subjt:  RKCVAERKLLSIDFSKERNKRLVKISEKNHDLRFEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSNNRGVFVEIAKVASSGRRDNL

Query:  IIPRGEDSKGGFSLDEVVKRSNKDGIPSSSKAMSLPKDQRK-------DFIRRE--RVVSHNRVSFKK-------ATDCRR------------ICWDQTL
        ++P G +     S   ++  + K  + + ++ + LP+   +       D+ +R   + VS  R S          ++D  +            +  + T+
Subjt:  IIPRGEDSKGGFSLDEVVKRSNKDGIPSSSKAMSLPKDQRK-------DFIRRE--RVVSHNRVSFKK-------ATDCRR------------ICWDQTL

Query:  VVTRRCFHDEWGKIYDTLCNAFQKNLIINPFHPDKALLKCLEADFACIVAHNKGWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDT
        V+ RR FHD+W KI   L    +++   N FH +K L+       A ++  NKGW+ +G ++++FE W    H    ++PSYGGW   R IPL  W + T
Subjt:  VVTRRCFHDEWGKIYDTLCNAFQKNLIINPFHPDKALLKCLEADFACIVAHNKGWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDT

Query:  FKAIGEAYGGYIECDDKCLSLVGCMEAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNREELCSVDK
        F+ IG+A GG I+  ++  +    +EA +K+R NY GF+PA + I  ++G+  V QVVT+ + + L  R V +HG F  +AA  F  +D N +    +  
Subjt:  FKAIGEAYGGYIECDDKCLSLVGCMEAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNREELCSVDK

Query:  NRKEDQKDRCQ-INGVTKKSNPDSASNKRDGSVVEREKGKNIVCKKGMAEVKVGPSGLNISPNE------NAVTNKKGDGRGVEPTRV--------IGYQ
          +    D    I+G  K  +P+  S  +    V  +  K       + E  V  + L+ + N+      + ++N     +G +   +        I Y+
Subjt:  NRKEDQKDRCQ-INGVTKKSNPDSASNKRDGSVVEREKGKNIVCKKGMAEVKVGPSGLNISPNE------NAVTNKKGDGRGVEPTRV--------IGYQ

Query:  WKEK----GRAQKEISFAPKAEVTTY--VKKGLISSTTEERKQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDN
         K K      + K   F P +    +   KK  +S     +K+  T   + ++  +   G+ +++   +      +   +  + T  L ++  L   +  
Subjt:  WKEK----GRAQKEISFAPKAEVTTY--VKKGLISSTTEERKQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDN

Query:  GDQYHPPKDPLALVPSTPQEVMIEATPIRSEEADKGGITKPM---------ERSEEVLGNTDMGQRLDIPILK-ETVEILWQNGLCIRPIPKKLGGGSSK
         D  H   D   ++  T  EV+ E   ++  + +K   +  +          R        D  +  +    K + V  L +NGL +  I     G ++ 
Subjt:  GDQYHPPKDPLALVPSTPQEVMIEATPIRSEEADKGGITKPM---------ERSEEVLGNTDMGQRLDIPILK-ETVEILWQNGLCIRPIPKKLGGGSSK

Query:  GKNNKNPMKREIKGLISSWEKPMEKAYSSSQGQIESKLASVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWRELFDVYG
             + +     G++  W+       S  +G+      S N +     W                        W + +Y P + +ER  +W +L +++ 
Subjt:  GKNNKNPMKREIKGLISSWEKPMEKAYSSSQGQIESKLASVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWRELFDVYG

Query:  LCNGVWCIGGDFSVIRRADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTSDHFPILL
        L +  W IGGD +V+R  +E ++    S S    + ++ +  L +  ++N +YTWS+ R  P FS++DRF  +S+   LF     + L R TSDHFP++ 
Subjt:  LCNGVWCIGGDFSVIRRADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTSDHFPILL

Query:  Q--AGNFSWGPSPFRFENMWLSHPNFAKLIEEWWKDPVDEG-------------------------KTLRTKSSSIIS----------------------
        +       WGP+PFR  ++ L+ P F + +E WW+  V  G                         ++L +   +II                       
Subjt:  Q--AGNFSWGPSPFRFENMWLSHPNFAKLIEEWWKDPVDEG-------------------------KTLRTKSSSIIS----------------------

Query:  ---------------------------SDGEQ-----------------------------------------------RCT-------LDGVEWQPISL
                                    +G++                                               RC+       ++ +EW PI  
Subjt:  ---------------------------SDGEQ-----------------------------------------------RCT-------LDGVEWQPISL

Query:  TQAQWLERPFEREEIWKAVTVLGAQKT--------------------------LEFFENGIINRRTNETYICLIPKKKVAA-------------------
        +    L  PF  EEI   +      K                            +FFE G+IN+  N TYI LI KKK  +                   
Subjt:  TQAQWLERPFEREEIWKAVTVLGAQKT--------------------------LEFFENGIINRRTNETYICLIPKKKVAA-------------------

Query:  --------------------------KIMDAIMLAVESVDDYRARKEK----------------------------------------------------
                                  +I DAI++A E++D ++ +K K                                                    
Subjt:  --------------------------KIMDAIMLAVESVDDYRARKEK----------------------------------------------------

Query:  -------------TRDPLSPFLFTLVADSLSRFIRVWSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGI
                       DPLS FLF +  D LSR +    + G IKG  +   +  +SHI F DD LLF + N  F+ N +     +EKASGL IN+ K+ I
Subjt:  -------------TRDPLSPFLFTLVADSLSRFIRVWSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGI

Query:  VGINMEEGEVVRMA
        V +N+     +  A
Subjt:  VGINMEEGEVVRMA

KAA0044449.1 hypothetical protein E6C27_scaffold46G001820 [Cucumis melo var. makuwa]1.8e-8427.8Show/hide
Query:  RKCVAERKLLSIDFSKERNKRLVKISEKNHDLRFEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSNNRGVFVEIAKVASSGRRDNL
        R C  E+K   +   K   +  + I+E      F + +    + W+  +   LL TP + +FF +    +  +W+Q + N RG   EI +V   GR+  +
Subjt:  RKCVAERKLLSIDFSKERNKRLVKISEKNHDLRFEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSNNRGVFVEIAKVASSGRRDNL

Query:  IIPRGEDSKGGFSLDEVV---KRSNKDGIPSSSKAMSLPKDQRKDFIRRE---------------RVVSHNRVSFKKATDCRRICWDQTLVVTRRCFHDE
        ++P G D  G    ++++   K S+K   P+        +D+ K+ I++                  VS +  S    +DC +     T  + RRCFHD+
Subjt:  IIPRGEDSKGGFSLDEVV---KRSNKDGIPSSSKAMSLPKDQRKDFIRRE---------------RVVSHNRVSFKKATDCRRICWDQTLVVTRRCFHDE

Query:  WGKIYDTLCNAFQKN---LIINPFHPDKALLKCLEADFACIVAHNKGWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFKAIGEA
        W KI D L +   K        PFH DKALL   + + A ++  N GW+ +G F +KFE W+   H    V+PSYGGW R R IPL  W ++TF  IGEA
Subjt:  WGKIYDTLCNAFQKN---LIINPFHPDKALLKCLEADFACIVAHNKGWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFKAIGEA

Query:  YGGYIECDDKCLSLVGCMEAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNREELC-----------
        YGG+I+   + ++ +   EA+IKV+ NY GF+PA I I  E+G   + Q VT+   + L  R   IHG FT  AA  F   +   E+             
Subjt:  YGGYIECDDKCLSLVGCMEAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNREELC-----------

Query:  ----SVDKNRKEDQKDR-------------------CQINGVTKKSNPDSASNKRD--GSVVEREKGKNIVCKKGMAEVKVGPSGLNISPNENAVTNK--
            S  K  K+   DR                       G +  SN      K+D    + E++KGK I C  G    K+ P       + ++  N+  
Subjt:  ----SVDKNRKEDQKDR-------------------CQINGVTKKSNPDSASNKRD--GSVVEREKGKNIVCKKGMAEVKVGPSGLNISPNENAVTNK--

Query:  -----KGDGRGVEPTRVIGYQWKEKGRAQKEISFAPKA-EVTTYVKKGLISSTTEERKQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQ
                 + ++    + Y   E    +   S   K   V +   +   S T+ ++ + + D  + E+VVDL + S LS++    DF   E  +   + 
Subjt:  -----KGDGRGVEPTRVIGYQWKEKGRAQKEISFAPKA-EVTTYVKKGLISSTTEERKQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQ

Query:  TEALE-DIAALFEGEDNGDQYHPPKDPLALVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTDMGQRLDIPILKETVEILWQNGLCIRPIPKK
        T   E DI                KD LA + +   E          E+  K  I +  E  E             +   ++  + L +N L       +
Subjt:  TEALE-DIAALFEGEDNGDQYHPPKDPLALVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTDMGQRLDIPILKETVEILWQNGLCIRPIPKK

Query:  LGGGSSKGKNNKNPMKREIKGLISSWEKPMEKAYSSSQGQIESKLASVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWR
        L    +   N+                   ++  S   G     + +V+ ++I   +S       S+  +  +G    +  W SA+Y P++ K R L W 
Subjt:  LGGGSSKGKNNKNPMKREIKGLISSWEKPMEKAYSSSQGQIESKLASVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWR

Query:  ELFDVYGLCNGVWCIGGDFSVIRRADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTS
        EL ++  +C   W +GGDF+VIR  +E S+    S SMKRF+ ++ +CNL +  ++NAK+TWS+ R     S++DRF  S+   ++F     K L R TS
Subjt:  ELFDVYGLCNGVWCIGGDFSVIRRADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTS

Query:  DHFPILLQAGNFSWGPSPFRFENMWLSHPNFAKLIEEWWKDPVDEG
        DHFPI+L++ + SWGPSPFRF N +L  P++ K IE WW +    G
Subjt:  DHFPILLQAGNFSWGPSPFRFENMWLSHPNFAKLIEEWWKDPVDEG

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]7.0e-7623.1Show/hide
Query:  FEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSN--NRGVFVEIAKVASSGRRDNLIIPRGEDSKGGFS-LDEVVKRSN--------
        F + +    + WI +   DLL T  ++ FF +   E+  +W++K  N     +  EI ++ + GR+ ++++P G DS G  S L  +  RS+        
Subjt:  FEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSN--NRGVFVEIAKVASSGRRDNLIIPRGEDSKGGFS-LDEVVKRSN--------

Query:  ---KDGIPSSSKAMSLPKD-QRKDF--IRRERVVSHNRVSFKKATD---CRRI-------------CWDQTLVVTRRCFHDEWGKIYDTLCNAFQKNLII
           K+ + + S + S   D  RK +  +  +     N+  +K  +D    RR               +++T+++TRRCFHD+W +I  +L    +     
Subjt:  ---KDGIPSSSKAMSLPKD-QRKDF--IRRERVVSHNRVSFKKATD---CRRI-------------CWDQTLVVTRRCFHDEWGKIYDTLCNAFQKNLII

Query:  NPFHPDKALLKCLEADFACIVAHNK---GWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCM
         PF  DKA+L  L  D A ++  NK   GWS +GN+ +KFE W+  LH   +V+PSYGGW+R R IPL  W  +TF+ IG A GG+++   + + +   +
Subjt:  NPFHPDKALLKCLEADFACIVAHNK---GWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCM

Query:  EAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNRE------------ELCSVDKNRKEDQKDRCQIN
        +A IKVR NY GF+PA I I    G   +   V   + + L  R V +HG F ++AA  F   +   E            E      +      D+  I+
Subjt:  EAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNRE------------ELCSVDKNRKEDQKDRCQIN

Query:  GVTKKSNPDSASNKRDG-----SVVEREKGKNIVC------------KKGMAEVKV---GPSGLNISPNENAVTNKKGDGRGVEPTRVIGYQWKEKGRAQ
          T+    +S+ ++ D      S   +EKGK I+              K ++  KV    P G+  S + N   N KG    +     I  Q++++   +
Subjt:  GVTKKSNPDSASNKRDG-----SVVEREKGKNIVC------------KKGMAEVKV---GPSGLNISPNENAVTNKKGDGRGVEPTRVIGYQWKEKGRAQ

Query:  KEISFAPKAEVTTYVKKGLISSTTEER-KQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDN-GDQYHPPKDPLA
        ++     K ++T  +KK    ST + +   +ET    ++M + ++ G                           +  + ++ + E+N G      + P  
Subjt:  KEISFAPKAEVTTYVKKGLISSTTEER-KQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDN-GDQYHPPKDPLA

Query:  LVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTD--MGQRLDIPILKETVEILWQNGLCIRPIPKKLGGGSSKGKNNKNPMKREIKGLISSWE
           ST        T    E AD+    K   RS     + D   G  L+I    +   ++W     ++  PK           N  P       ++S  +
Subjt:  LVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTD--MGQRLDIPILKETVEILWQNGLCIRPIPKKLGGGSSKGKNNKNPMKREIKGLISSWE

Query:  KPMEKAYSSSQGQIESKLA---SVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWRELFDVYGLCNGVWCIGGDFSVIRR
        + M+ A     G     L      N  +      +  I    LN+ G          W ++VY P +Y +R  LW EL  +  LC   W I GDF+++R 
Subjt:  KPMEKAYSSSQGQIESKLA---SVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWRELFDVYGLCNGVWCIGGDFSVIRR

Query:  ADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTSDHFPILLQAGNFSWGPSPFRFENM
          E ++     R+M  F+ ++    L +    N  +TWS+ R  P +S++DRF +S    + F     + L+R  SDHFPILL++    WGP PFR  N 
Subjt:  ADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTSDHFPILLQAGNFSWGPSPFRFENM

Query:  WLSHPNFAKLIEEWWK------------------------------------------------------------------------------------
         L    F K    WW                                                                                     
Subjt:  WLSHPNFAKLIEEWWK------------------------------------------------------------------------------------

Query:  -----------------------------------DPV--------DEGKTLRTKSSSIISSDGEQRCTLDGVEWQPISLTQAQWLERPFEREEIWKAVT
                                           DP         D  +T  +   +I + +  +   +D + W PIS      L +PF+  EI   + 
Subjt:  -----------------------------------DPV--------DEGKTLRTKSSSIISSDGEQRCTLDGVEWQPISLTQAQWLERPFEREEIWKAVT

Query:  VLGAQKT--------------------------LEFFENGIINRRTNETYICLIPKK-------------------KVAAK-------------------
            +K                            +F + GI+N   N T+I LI KK                   K+ AK                   
Subjt:  VLGAQKT--------------------------LEFFENGIINRRTNETYICLIPKK-------------------KVAAK-------------------

Query:  -------IMDAIMLAVESVDDYRARKEK-----------------------------------------------------------------TRDPLSP
               I DAI++A E++D ++ RK K                                                                   DPLSP
Subjt:  -------IMDAIMLAVESVDDYRARKEK-----------------------------------------------------------------TRDPLSP

Query:  FLFTLVADSLSRFIRVWSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGIVGINMEEGEVVRMADLLGCK
        F+F L  D LSR +    +KG IKG +       +SH+ F DD L+F + N +++ N Q     +EKASGLT N  K+ I  IN+  G   ++A   G +
Subjt:  FLFTLVADSLSRFIRVWSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGIVGINMEEGEVVRMADLLGCK

Query:  VEGLPFSYLGV
         + LP +YLGV
Subjt:  VEGLPFSYLGV

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.2e-8523.73Show/hide
Query:  RKCVAERKLLSIDFSKERNKRLVKISEKNHDLRFEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSNNRGVFVEIAKVASSGRRDNL
        R C  ERK   +   K        ++E      F + +  + + WI  ++  L+ TP + +FF +T      IW++K  N++G   EI +V    R+  +
Subjt:  RKCVAERKLLSIDFSKERNKRLVKISEKNHDLRFEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSNNRGVFVEIAKVASSGRRDNL

Query:  IIPRGEDSKGGFSL-------DEVVKRSNKDGIPSSSK--AMSLPKDQRKDFIR------RERVVSHNRVSFKKA----TDCRRIC-------WDQTLVV
        ++P G D  G  S         EV  ++    +P +S    +S P D  K          R    S +  S+  +    +     C        + T+V+
Subjt:  IIPRGEDSKGGFSL-------DEVVKRSNKDGIPSSSK--AMSLPKDQRKDFIR------RERVVSHNRVSFKKA----TDCRRIC-------WDQTLVV

Query:  TRRCFHDEWGKIYDTLCNAFQKNLIINPFHPDKALLKCLEADFACIVAHNKGWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFK
         RR FHD+W KI   L    +++   N FH +KAL+       A ++  NKGWS +G +S++FE W+   H    ++PSYGGW   R IPL  W + TF+
Subjt:  TRRCFHDEWGKIYDTLCNAFQKNLIINPFHPDKALLKCLEADFACIVAHNKGWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFK

Query:  AIGEAYGGYIECDDKCLSLVGCMEAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNREELCSVDKNR
         IG+A  G I+  ++  S    +EA IKVR NY GF+PA + I   +G+    QVVT+ + + L  R V +HG F  +AA  F  +D N E      +  
Subjt:  AIGEAYGGYIECDDKCLSLVGCMEAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNREELCSVDKNR

Query:  KEDQKDRCQINGVTKKSN-PDSASNKRDGSVVEREKGKNIVCKKGMAEVKVGPSGLNISPNENAVTNKKG-DGRGVEPTRVIGYQWKEKGRAQKEISFAP
        +    D    +   +KS+ PD  S  +    V  +  +N      + E  V  S L+ + N++ +    G    GV           +KG+ + +I   P
Subjt:  KEDQKDRCQINGVTKKSN-PDSASNKRDGSVVEREKGKNIVCKKGMAEVKVGPSGLNISPNENAVTNKKG-DGRGVEPTRVIGYQWKEKGRAQKEISFAP

Query:  KAEVTTYVKKGLISSTTEERK-------------QQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDNGDQYHPPK
         + +     K  +S  +   K                 +  +++  V  E       SS + + +    QN+    T+ ++ +A   +    G       
Subjt:  KAEVTTYVKKGLISSTTEERK-------------QQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDNGDQYHPPK

Query:  DPL-ALVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTDMGQRLDIPILKETVEILWQNGLCIRPIPKKLGGGSSKGKNNKNPMKREIKGLIS
          L AL P+   E         S+ A+   IT     + EV+  T     + +P+ + +      N    + + K+      K +  K+P     K  + 
Subjt:  DPL-ALVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTDMGQRLDIPILKETVEILWQNGLCIRPIPKKLGGGSSKGKNNKNPMKREIKGLIS

Query:  SWEKPMEKAYSSSQ-------------GQIESKLASVNRNIIKQLWSSRFIGWTSLNSEGASGGIL----------------------------DSICWA
        SW K      S+                Q+ S L   N+ IIK LW S  I W + N+ G+SGGIL                            +S  W 
Subjt:  SWEKPMEKAYSSSQ-------------GQIESKLASVNRNIIKQLWSSRFIGWTSLNSEGASGGIL----------------------------DSICWA

Query:  SAVYDPSRYKERDLLWRELFDVYGLCNGVWCIGGDFSVIRRADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKC
        + +Y P + +ER   W EL ++  L +  W +GGD +VIR  +E +S    S + +  + ++ +  L +  ++N ++TWS+ R  P FS+IDRF  +S  
Subjt:  SAVYDPSRYKERDLLWRELFDVYGLCNGVWCIGGDFSVIRRADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKC

Query:  LDLFKNVNVKWLQRVTSDHFPILLQAGN--FSWGPSPFRFENMWLSHPNFAKLIEEWWKDPV--------------------------------------
         +LF     + L R TSDHFP++ +  N   SWGP PFR  ++ LS P F + +  WW++ +                                      
Subjt:  LDLFKNVNVKWLQRVTSDHFPILLQAGN--FSWGPSPFRFENMWLSHPNFAKLIEEWWKDPV--------------------------------------

Query:  ------------------------------------------------------------------------------DEGKTLRTKSSSII--------
                                                                                      DE  +++  ++SI         
Subjt:  ------------------------------------------------------------------------------DEGKTLRTKSSSII--------

Query:  ----SSDGEQRCTLDGVEWQPISLTQAQWLERPFEREEIWKAVTVLGAQKT-------LEFFENGIINRRTNETYICLIPKKKVA----AKIMDAIMLAV
            SS       ++ ++W PI+ ++   L  PF   EI   +     +KT       + FF++  +      T    I   ++A     +I DAI++A 
Subjt:  ----SSDGEQRCTLDGVEWQPISLTQAQWLERPFEREEIWKAVTVLGAQKT-------LEFFENGIINRRTNETYICLIPKKKVA----AKIMDAIMLAV

Query:  ESVDDYRARKEK-----------------------------------------------------------------TRDPLSPFLFTLVADSLSRFIRV
        E+VD ++ +K K                                                                   DPLSPFLF +  D LSR +  
Subjt:  ESVDDYRARKEK-----------------------------------------------------------------TRDPLSPFLFTLVADSLSRFIRV

Query:  WSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGIVGINMEEGEVVRMADLLGCKVEGLPFSYLGV
          + G IKG ++      +SHI F DD LLF + N  F+KN +     +E+ASGL IN+ K+ +V +N+        A   G     LP SYLGV
Subjt:  WSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGIVGINMEEGEVVRMADLLGCKVEGLPFSYLGV

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]9.2e-7623.1Show/hide
Query:  FEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSN--NRGVFVEIAKVASSGRRDNLIIPRGEDSKGGFS-LDEVVKRSN--------
        F + +    + WI +   DLL T  ++ FF +   E+  +W++K  N     +  EI ++ + GR+ ++++P G DS G  S L  +  RS+        
Subjt:  FEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSN--NRGVFVEIAKVASSGRRDNLIIPRGEDSKGGFS-LDEVVKRSN--------

Query:  ---KDGIPSSSKAMSLPKD-QRKDF--IRRERVVSHNRVSFKKATD---CRRI-------------CWDQTLVVTRRCFHDEWGKIYDTLCNAFQKNLII
           K+ + + S + S   D  RK +  +  +     N+  +K  +D    RR               +++T+++TRRCFHD+W +I  +L    +     
Subjt:  ---KDGIPSSSKAMSLPKD-QRKDF--IRRERVVSHNRVSFKKATD---CRRI-------------CWDQTLVVTRRCFHDEWGKIYDTLCNAFQKNLII

Query:  NPFHPDKALLKCLEADFACIVAHNK---GWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCM
         PF  DKA+L  L  D A ++  NK   GWS +GN+ +KFE W+  LH   +V+PSYGGW+R R IPL  W  +TF+ IG A GG+++   + + +   +
Subjt:  NPFHPDKALLKCLEADFACIVAHNK---GWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCM

Query:  EAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNRE------------ELCSVDKNRKEDQKDRCQIN
        +A IKVR NY GF+PA I I    G   +   V   + + L  R V +HG F ++AA  F   +   E            E      +      D+  I+
Subjt:  EAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNRE------------ELCSVDKNRKEDQKDRCQIN

Query:  GVTKKSNPDSASNKRDG-----SVVEREKGKNIVC------------KKGMAEVKV---GPSGLNISPNENAVTNKKGDGRGVEPTRVIGYQWKEKGRAQ
          T+    +S+ ++ D      S   +EKGK I+              K ++  KV    P G+  S + N   N KG    +     I  Q++++   +
Subjt:  GVTKKSNPDSASNKRDG-----SVVEREKGKNIVC------------KKGMAEVKV---GPSGLNISPNENAVTNKKGDGRGVEPTRVIGYQWKEKGRAQ

Query:  KEISFAPKAEVTTYVKKGLISSTTEER-KQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDN-GDQYHPPKDPLA
        ++     K ++T  +KK    ST + +   +ET    ++M + ++ G                           +  + ++ + E+N G      + P  
Subjt:  KEISFAPKAEVTTYVKKGLISSTTEER-KQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDN-GDQYHPPKDPLA

Query:  LVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTD--MGQRLDIPILKETVEILWQNGLCIRPIPKKLGGGSSKGKNNKNPMKREIKGLISSWE
           ST        T    E AD+    K   RS     + D   G  L+I    +   ++W     ++  PK           N  P       ++S  +
Subjt:  LVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTD--MGQRLDIPILKETVEILWQNGLCIRPIPKKLGGGSSKGKNNKNPMKREIKGLISSWE

Query:  KPMEKAYSSSQGQIESKLA---SVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWRELFDVYGLCNGVWCIGGDFSVIRR
        + M+ A     G     L      N  +      +  I    LN+ G          W ++VY P +Y +R  LW EL  +  LC   W I GDF+++R 
Subjt:  KPMEKAYSSSQGQIESKLA---SVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWRELFDVYGLCNGVWCIGGDFSVIRR

Query:  ADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTSDHFPILLQAGNFSWGPSPFRFENM
          E ++     R+M  F+ ++    L +    N  +TWS+ R  P +S++DRF +S    + F     + L+R  SDHFPILL++    WGP PFR  N 
Subjt:  ADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTSDHFPILLQAGNFSWGPSPFRFENM

Query:  WLSHPNFAKLIEEWWK------------------------------------------------------------------------------------
         L    F K    WW                                                                                     
Subjt:  WLSHPNFAKLIEEWWK------------------------------------------------------------------------------------

Query:  -----------------------------------DPV--------DEGKTLRTKSSSIISSDGEQRCTLDGVEWQPISLTQAQWLERPFEREEIWKAVT
                                           DP         D  +T  +   +I + +  +   +D + W PIS      L +PF+  EI   + 
Subjt:  -----------------------------------DPV--------DEGKTLRTKSSSIISSDGEQRCTLDGVEWQPISLTQAQWLERPFEREEIWKAVT

Query:  VLGAQKT--------------------------LEFFENGIINRRTNETYICLIPKK-------------------KVAAK-------------------
            +K                            +F + GI+N   N T+I LI KK                   K+ AK                   
Subjt:  VLGAQKT--------------------------LEFFENGIINRRTNETYICLIPKK-------------------KVAAK-------------------

Query:  -------IMDAIMLAVESVDDYRARKEK-----------------------------------------------------------------TRDPLSP
               I DAI++A E +D ++ RK K                                                                   DPLSP
Subjt:  -------IMDAIMLAVESVDDYRARKEK-----------------------------------------------------------------TRDPLSP

Query:  FLFTLVADSLSRFIRVWSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGIVGINMEEGEVVRMADLLGCK
        F+F L  D LSR +    +KG IKG +       +SH+ F DD L+F + N +++ N Q     +EKASGLT N  K+ I  IN+  G   ++A   G +
Subjt:  FLFTLVADSLSRFIRVWSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGIVGINMEEGEVVRMADLLGCK

Query:  VEGLPFSYLGV
         + LP +YLGV
Subjt:  VEGLPFSYLGV

TrEMBL top hitse value%identityAlignment
A0A5A7TDG1 LINE-1 retrotransposable element ORF2 protein6.6e-7221.43Show/hide
Query:  RKCVAERKLLSIDFSKERNKRLVKISEKNHDLRFEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSNNRGVFVEIAKVASSGRRDNL
        R C  ERK   +   K        ++E      F + +  + + WI  ++  L+ TP S +FF +       IW++K  N +G   EI +V    R+  +
Subjt:  RKCVAERKLLSIDFSKERNKRLVKISEKNHDLRFEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSNNRGVFVEIAKVASSGRRDNL

Query:  IIPRGEDSKGGFSLDEVVKRSNKDGIPSSSKAMSLPKDQRK-------DFIRRE--RVVSHNRVSFKK-------ATDCRR------------ICWDQTL
        ++P G +     S   ++  + K  + + ++ + LP+   +       D+ +R   + VS  R S          ++D  +            +  + T+
Subjt:  IIPRGEDSKGGFSLDEVVKRSNKDGIPSSSKAMSLPKDQRK-------DFIRRE--RVVSHNRVSFKK-------ATDCRR------------ICWDQTL

Query:  VVTRRCFHDEWGKIYDTLCNAFQKNLIINPFHPDKALLKCLEADFACIVAHNKGWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDT
        V+ RR FHD+W KI   L    +++   N FH +K L+       A ++  NKGW+ +G ++++FE W    H    ++PSYGGW   R IPL  W + T
Subjt:  VVTRRCFHDEWGKIYDTLCNAFQKNLIINPFHPDKALLKCLEADFACIVAHNKGWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDT

Query:  FKAIGEAYGGYIECDDKCLSLVGCMEAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNREELCSVDK
        F+ IG+A GG I+  ++  +    +EA +K+R NY GF+PA + I  ++G+  V QVVT+ + + L  R V +HG F  +AA  F  +D N +    +  
Subjt:  FKAIGEAYGGYIECDDKCLSLVGCMEAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNREELCSVDK

Query:  NRKEDQKDRCQ-INGVTKKSNPDSASNKRDGSVVEREKGKNIVCKKGMAEVKVGPSGLNISPNE------NAVTNKKGDGRGVEPTRV--------IGYQ
          +    D    I+G  K  +P+  S  +    V  +  K       + E  V  + L+ + N+      + ++N     +G +   +        I Y+
Subjt:  NRKEDQKDRCQ-INGVTKKSNPDSASNKRDGSVVEREKGKNIVCKKGMAEVKVGPSGLNISPNE------NAVTNKKGDGRGVEPTRV--------IGYQ

Query:  WKEK----GRAQKEISFAPKAEVTTY--VKKGLISSTTEERKQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDN
         K K      + K   F P +    +   KK  +S     +K+  T   + ++  +   G+ +++   +      +   +  + T  L ++  L   +  
Subjt:  WKEK----GRAQKEISFAPKAEVTTY--VKKGLISSTTEERKQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDN

Query:  GDQYHPPKDPLALVPSTPQEVMIEATPIRSEEADKGGITKPM---------ERSEEVLGNTDMGQRLDIPILK-ETVEILWQNGLCIRPIPKKLGGGSSK
         D  H   D   ++  T  EV+ E   ++  + +K   +  +          R        D  +  +    K + V  L +NGL +  I     G ++ 
Subjt:  GDQYHPPKDPLALVPSTPQEVMIEATPIRSEEADKGGITKPM---------ERSEEVLGNTDMGQRLDIPILK-ETVEILWQNGLCIRPIPKKLGGGSSK

Query:  GKNNKNPMKREIKGLISSWEKPMEKAYSSSQGQIESKLASVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWRELFDVYG
             + +     G++  W+       S  +G+      S N +     W                        W + +Y P + +ER  +W +L +++ 
Subjt:  GKNNKNPMKREIKGLISSWEKPMEKAYSSSQGQIESKLASVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWRELFDVYG

Query:  LCNGVWCIGGDFSVIRRADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTSDHFPILL
        L +  W IGGD +V+R  +E ++    S S    + ++ +  L +  ++N +YTWS+ R  P FS++DRF  +S+   LF     + L R TSDHFP++ 
Subjt:  LCNGVWCIGGDFSVIRRADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTSDHFPILL

Query:  Q--AGNFSWGPSPFRFENMWLSHPNFAKLIEEWWKDPVDEG-------------------------KTLRTKSSSIIS----------------------
        +       WGP+PFR  ++ L+ P F + +E WW+  V  G                         ++L +   +II                       
Subjt:  Q--AGNFSWGPSPFRFENMWLSHPNFAKLIEEWWKDPVDEG-------------------------KTLRTKSSSIIS----------------------

Query:  ---------------------------SDGEQ-----------------------------------------------RCT-------LDGVEWQPISL
                                    +G++                                               RC+       ++ +EW PI  
Subjt:  ---------------------------SDGEQ-----------------------------------------------RCT-------LDGVEWQPISL

Query:  TQAQWLERPFEREEIWKAVTVLGAQKT--------------------------LEFFENGIINRRTNETYICLIPKKKVAA-------------------
        +    L  PF  EEI   +      K                            +FFE G+IN+  N TYI LI KKK  +                   
Subjt:  TQAQWLERPFEREEIWKAVTVLGAQKT--------------------------LEFFENGIINRRTNETYICLIPKKKVAA-------------------

Query:  --------------------------KIMDAIMLAVESVDDYRARKEK----------------------------------------------------
                                  +I DAI++A E++D ++ +K K                                                    
Subjt:  --------------------------KIMDAIMLAVESVDDYRARKEK----------------------------------------------------

Query:  -------------TRDPLSPFLFTLVADSLSRFIRVWSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGI
                       DPLS FLF +  D LSR +    + G IKG  +   +  +SHI F DD LLF + N  F+ N +     +EKASGL IN+ K+ I
Subjt:  -------------TRDPLSPFLFTLVADSLSRFIRVWSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGI

Query:  VGINMEEGEVVRMA
        V +N+     +  A
Subjt:  VGINMEEGEVVRMA

A0A5A7TTA1 DUF4283 domain-containing protein8.9e-8527.8Show/hide
Query:  RKCVAERKLLSIDFSKERNKRLVKISEKNHDLRFEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSNNRGVFVEIAKVASSGRRDNL
        R C  E+K   +   K   +  + I+E      F + +    + W+  +   LL TP + +FF +    +  +W+Q + N RG   EI +V   GR+  +
Subjt:  RKCVAERKLLSIDFSKERNKRLVKISEKNHDLRFEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSNNRGVFVEIAKVASSGRRDNL

Query:  IIPRGEDSKGGFSLDEVV---KRSNKDGIPSSSKAMSLPKDQRKDFIRRE---------------RVVSHNRVSFKKATDCRRICWDQTLVVTRRCFHDE
        ++P G D  G    ++++   K S+K   P+        +D+ K+ I++                  VS +  S    +DC +     T  + RRCFHD+
Subjt:  IIPRGEDSKGGFSLDEVV---KRSNKDGIPSSSKAMSLPKDQRKDFIRRE---------------RVVSHNRVSFKKATDCRRICWDQTLVVTRRCFHDE

Query:  WGKIYDTLCNAFQKN---LIINPFHPDKALLKCLEADFACIVAHNKGWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFKAIGEA
        W KI D L +   K        PFH DKALL   + + A ++  N GW+ +G F +KFE W+   H    V+PSYGGW R R IPL  W ++TF  IGEA
Subjt:  WGKIYDTLCNAFQKN---LIINPFHPDKALLKCLEADFACIVAHNKGWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFKAIGEA

Query:  YGGYIECDDKCLSLVGCMEAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNREELC-----------
        YGG+I+   + ++ +   EA+IKV+ NY GF+PA I I  E+G   + Q VT+   + L  R   IHG FT  AA  F   +   E+             
Subjt:  YGGYIECDDKCLSLVGCMEAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNREELC-----------

Query:  ----SVDKNRKEDQKDR-------------------CQINGVTKKSNPDSASNKRD--GSVVEREKGKNIVCKKGMAEVKVGPSGLNISPNENAVTNK--
            S  K  K+   DR                       G +  SN      K+D    + E++KGK I C  G    K+ P       + ++  N+  
Subjt:  ----SVDKNRKEDQKDR-------------------CQINGVTKKSNPDSASNKRD--GSVVEREKGKNIVCKKGMAEVKVGPSGLNISPNENAVTNK--

Query:  -----KGDGRGVEPTRVIGYQWKEKGRAQKEISFAPKA-EVTTYVKKGLISSTTEERKQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQ
                 + ++    + Y   E    +   S   K   V +   +   S T+ ++ + + D  + E+VVDL + S LS++    DF   E  +   + 
Subjt:  -----KGDGRGVEPTRVIGYQWKEKGRAQKEISFAPKA-EVTTYVKKGLISSTTEERKQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQ

Query:  TEALE-DIAALFEGEDNGDQYHPPKDPLALVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTDMGQRLDIPILKETVEILWQNGLCIRPIPKK
        T   E DI                KD LA + +   E          E+  K  I +  E  E             +   ++  + L +N L       +
Subjt:  TEALE-DIAALFEGEDNGDQYHPPKDPLALVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTDMGQRLDIPILKETVEILWQNGLCIRPIPKK

Query:  LGGGSSKGKNNKNPMKREIKGLISSWEKPMEKAYSSSQGQIESKLASVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWR
        L    +   N+                   ++  S   G     + +V+ ++I   +S       S+  +  +G    +  W SA+Y P++ K R L W 
Subjt:  LGGGSSKGKNNKNPMKREIKGLISSWEKPMEKAYSSSQGQIESKLASVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWR

Query:  ELFDVYGLCNGVWCIGGDFSVIRRADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTS
        EL ++  +C   W +GGDF+VIR  +E S+    S SMKRF+ ++ +CNL +  ++NAK+TWS+ R     S++DRF  S+   ++F     K L R TS
Subjt:  ELFDVYGLCNGVWCIGGDFSVIRRADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTS

Query:  DHFPILLQAGNFSWGPSPFRFENMWLSHPNFAKLIEEWWKDPVDEG
        DHFPI+L++ + SWGPSPFRF N +L  P++ K IE WW +    G
Subjt:  DHFPILLQAGNFSWGPSPFRFENMWLSHPNFAKLIEEWWKDPVDEG

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein3.4e-7623.1Show/hide
Query:  FEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSN--NRGVFVEIAKVASSGRRDNLIIPRGEDSKGGFS-LDEVVKRSN--------
        F + +    + WI +   DLL T  ++ FF +   E+  +W++K  N     +  EI ++ + GR+ ++++P G DS G  S L  +  RS+        
Subjt:  FEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSN--NRGVFVEIAKVASSGRRDNLIIPRGEDSKGGFS-LDEVVKRSN--------

Query:  ---KDGIPSSSKAMSLPKD-QRKDF--IRRERVVSHNRVSFKKATD---CRRI-------------CWDQTLVVTRRCFHDEWGKIYDTLCNAFQKNLII
           K+ + + S + S   D  RK +  +  +     N+  +K  +D    RR               +++T+++TRRCFHD+W +I  +L    +     
Subjt:  ---KDGIPSSSKAMSLPKD-QRKDF--IRRERVVSHNRVSFKKATD---CRRI-------------CWDQTLVVTRRCFHDEWGKIYDTLCNAFQKNLII

Query:  NPFHPDKALLKCLEADFACIVAHNK---GWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCM
         PF  DKA+L  L  D A ++  NK   GWS +GN+ +KFE W+  LH   +V+PSYGGW+R R IPL  W  +TF+ IG A GG+++   + + +   +
Subjt:  NPFHPDKALLKCLEADFACIVAHNK---GWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCM

Query:  EAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNRE------------ELCSVDKNRKEDQKDRCQIN
        +A IKVR NY GF+PA I I    G   +   V   + + L  R V +HG F ++AA  F   +   E            E      +      D+  I+
Subjt:  EAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNRE------------ELCSVDKNRKEDQKDRCQIN

Query:  GVTKKSNPDSASNKRDG-----SVVEREKGKNIVC------------KKGMAEVKV---GPSGLNISPNENAVTNKKGDGRGVEPTRVIGYQWKEKGRAQ
          T+    +S+ ++ D      S   +EKGK I+              K ++  KV    P G+  S + N   N KG    +     I  Q++++   +
Subjt:  GVTKKSNPDSASNKRDG-----SVVEREKGKNIVC------------KKGMAEVKV---GPSGLNISPNENAVTNKKGDGRGVEPTRVIGYQWKEKGRAQ

Query:  KEISFAPKAEVTTYVKKGLISSTTEER-KQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDN-GDQYHPPKDPLA
        ++     K ++T  +KK    ST + +   +ET    ++M + ++ G                           +  + ++ + E+N G      + P  
Subjt:  KEISFAPKAEVTTYVKKGLISSTTEER-KQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDN-GDQYHPPKDPLA

Query:  LVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTD--MGQRLDIPILKETVEILWQNGLCIRPIPKKLGGGSSKGKNNKNPMKREIKGLISSWE
           ST        T    E AD+    K   RS     + D   G  L+I    +   ++W     ++  PK           N  P       ++S  +
Subjt:  LVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTD--MGQRLDIPILKETVEILWQNGLCIRPIPKKLGGGSSKGKNNKNPMKREIKGLISSWE

Query:  KPMEKAYSSSQGQIESKLA---SVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWRELFDVYGLCNGVWCIGGDFSVIRR
        + M+ A     G     L      N  +      +  I    LN+ G          W ++VY P +Y +R  LW EL  +  LC   W I GDF+++R 
Subjt:  KPMEKAYSSSQGQIESKLA---SVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWRELFDVYGLCNGVWCIGGDFSVIRR

Query:  ADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTSDHFPILLQAGNFSWGPSPFRFENM
          E ++     R+M  F+ ++    L +    N  +TWS+ R  P +S++DRF +S    + F     + L+R  SDHFPILL++    WGP PFR  N 
Subjt:  ADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTSDHFPILLQAGNFSWGPSPFRFENM

Query:  WLSHPNFAKLIEEWWK------------------------------------------------------------------------------------
         L    F K    WW                                                                                     
Subjt:  WLSHPNFAKLIEEWWK------------------------------------------------------------------------------------

Query:  -----------------------------------DPV--------DEGKTLRTKSSSIISSDGEQRCTLDGVEWQPISLTQAQWLERPFEREEIWKAVT
                                           DP         D  +T  +   +I + +  +   +D + W PIS      L +PF+  EI   + 
Subjt:  -----------------------------------DPV--------DEGKTLRTKSSSIISSDGEQRCTLDGVEWQPISLTQAQWLERPFEREEIWKAVT

Query:  VLGAQKT--------------------------LEFFENGIINRRTNETYICLIPKK-------------------KVAAK-------------------
            +K                            +F + GI+N   N T+I LI KK                   K+ AK                   
Subjt:  VLGAQKT--------------------------LEFFENGIINRRTNETYICLIPKK-------------------KVAAK-------------------

Query:  -------IMDAIMLAVESVDDYRARKEK-----------------------------------------------------------------TRDPLSP
               I DAI++A E++D ++ RK K                                                                   DPLSP
Subjt:  -------IMDAIMLAVESVDDYRARKEK-----------------------------------------------------------------TRDPLSP

Query:  FLFTLVADSLSRFIRVWSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGIVGINMEEGEVVRMADLLGCK
        F+F L  D LSR +    +KG IKG +       +SH+ F DD L+F + N +++ N Q     +EKASGLT N  K+ I  IN+  G   ++A   G +
Subjt:  FLFTLVADSLSRFIRVWSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGIVGINMEEGEVVRMADLLGCK

Query:  VEGLPFSYLGV
         + LP +YLGV
Subjt:  VEGLPFSYLGV

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein1.1e-8523.73Show/hide
Query:  RKCVAERKLLSIDFSKERNKRLVKISEKNHDLRFEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSNNRGVFVEIAKVASSGRRDNL
        R C  ERK   +   K        ++E      F + +  + + WI  ++  L+ TP + +FF +T      IW++K  N++G   EI +V    R+  +
Subjt:  RKCVAERKLLSIDFSKERNKRLVKISEKNHDLRFEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSNNRGVFVEIAKVASSGRRDNL

Query:  IIPRGEDSKGGFSL-------DEVVKRSNKDGIPSSSK--AMSLPKDQRKDFIR------RERVVSHNRVSFKKA----TDCRRIC-------WDQTLVV
        ++P G D  G  S         EV  ++    +P +S    +S P D  K          R    S +  S+  +    +     C        + T+V+
Subjt:  IIPRGEDSKGGFSL-------DEVVKRSNKDGIPSSSK--AMSLPKDQRKDFIR------RERVVSHNRVSFKKA----TDCRRIC-------WDQTLVV

Query:  TRRCFHDEWGKIYDTLCNAFQKNLIINPFHPDKALLKCLEADFACIVAHNKGWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFK
         RR FHD+W KI   L    +++   N FH +KAL+       A ++  NKGWS +G +S++FE W+   H    ++PSYGGW   R IPL  W + TF+
Subjt:  TRRCFHDEWGKIYDTLCNAFQKNLIINPFHPDKALLKCLEADFACIVAHNKGWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFK

Query:  AIGEAYGGYIECDDKCLSLVGCMEAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNREELCSVDKNR
         IG+A  G I+  ++  S    +EA IKVR NY GF+PA + I   +G+    QVVT+ + + L  R V +HG F  +AA  F  +D N E      +  
Subjt:  AIGEAYGGYIECDDKCLSLVGCMEAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNREELCSVDKNR

Query:  KEDQKDRCQINGVTKKSN-PDSASNKRDGSVVEREKGKNIVCKKGMAEVKVGPSGLNISPNENAVTNKKG-DGRGVEPTRVIGYQWKEKGRAQKEISFAP
        +    D    +   +KS+ PD  S  +    V  +  +N      + E  V  S L+ + N++ +    G    GV           +KG+ + +I   P
Subjt:  KEDQKDRCQINGVTKKSN-PDSASNKRDGSVVEREKGKNIVCKKGMAEVKVGPSGLNISPNENAVTNKKG-DGRGVEPTRVIGYQWKEKGRAQKEISFAP

Query:  KAEVTTYVKKGLISSTTEERK-------------QQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDNGDQYHPPK
         + +     K  +S  +   K                 +  +++  V  E       SS + + +    QN+    T+ ++ +A   +    G       
Subjt:  KAEVTTYVKKGLISSTTEERK-------------QQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDNGDQYHPPK

Query:  DPL-ALVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTDMGQRLDIPILKETVEILWQNGLCIRPIPKKLGGGSSKGKNNKNPMKREIKGLIS
          L AL P+   E         S+ A+   IT     + EV+  T     + +P+ + +      N    + + K+      K +  K+P     K  + 
Subjt:  DPL-ALVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTDMGQRLDIPILKETVEILWQNGLCIRPIPKKLGGGSSKGKNNKNPMKREIKGLIS

Query:  SWEKPMEKAYSSSQ-------------GQIESKLASVNRNIIKQLWSSRFIGWTSLNSEGASGGIL----------------------------DSICWA
        SW K      S+                Q+ S L   N+ IIK LW S  I W + N+ G+SGGIL                            +S  W 
Subjt:  SWEKPMEKAYSSSQ-------------GQIESKLASVNRNIIKQLWSSRFIGWTSLNSEGASGGIL----------------------------DSICWA

Query:  SAVYDPSRYKERDLLWRELFDVYGLCNGVWCIGGDFSVIRRADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKC
        + +Y P + +ER   W EL ++  L +  W +GGD +VIR  +E +S    S + +  + ++ +  L +  ++N ++TWS+ R  P FS+IDRF  +S  
Subjt:  SAVYDPSRYKERDLLWRELFDVYGLCNGVWCIGGDFSVIRRADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKC

Query:  LDLFKNVNVKWLQRVTSDHFPILLQAGN--FSWGPSPFRFENMWLSHPNFAKLIEEWWKDPV--------------------------------------
         +LF     + L R TSDHFP++ +  N   SWGP PFR  ++ LS P F + +  WW++ +                                      
Subjt:  LDLFKNVNVKWLQRVTSDHFPILLQAGN--FSWGPSPFRFENMWLSHPNFAKLIEEWWKDPV--------------------------------------

Query:  ------------------------------------------------------------------------------DEGKTLRTKSSSII--------
                                                                                      DE  +++  ++SI         
Subjt:  ------------------------------------------------------------------------------DEGKTLRTKSSSII--------

Query:  ----SSDGEQRCTLDGVEWQPISLTQAQWLERPFEREEIWKAVTVLGAQKT-------LEFFENGIINRRTNETYICLIPKKKVA----AKIMDAIMLAV
            SS       ++ ++W PI+ ++   L  PF   EI   +     +KT       + FF++  +      T    I   ++A     +I DAI++A 
Subjt:  ----SSDGEQRCTLDGVEWQPISLTQAQWLERPFEREEIWKAVTVLGAQKT-------LEFFENGIINRRTNETYICLIPKKKVA----AKIMDAIMLAV

Query:  ESVDDYRARKEK-----------------------------------------------------------------TRDPLSPFLFTLVADSLSRFIRV
        E+VD ++ +K K                                                                   DPLSPFLF +  D LSR +  
Subjt:  ESVDDYRARKEK-----------------------------------------------------------------TRDPLSPFLFTLVADSLSRFIRV

Query:  WSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGIVGINMEEGEVVRMADLLGCKVEGLPFSYLGV
          + G IKG ++      +SHI F DD LLF + N  F+KN +     +E+ASGL IN+ K+ +V +N+        A   G     LP SYLGV
Subjt:  WSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGIVGINMEEGEVVRMADLLGCKVEGLPFSYLGV

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein4.4e-7623.1Show/hide
Query:  FEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSN--NRGVFVEIAKVASSGRRDNLIIPRGEDSKGGFS-LDEVVKRSN--------
        F + +    + WI +   DLL T  ++ FF +   E+  +W++K  N     +  EI ++ + GR+ ++++P G DS G  S L  +  RS+        
Subjt:  FEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSN--NRGVFVEIAKVASSGRRDNLIIPRGEDSKGGFS-LDEVVKRSN--------

Query:  ---KDGIPSSSKAMSLPKD-QRKDF--IRRERVVSHNRVSFKKATD---CRRI-------------CWDQTLVVTRRCFHDEWGKIYDTLCNAFQKNLII
           K+ + + S + S   D  RK +  +  +     N+  +K  +D    RR               +++T+++TRRCFHD+W +I  +L    +     
Subjt:  ---KDGIPSSSKAMSLPKD-QRKDF--IRRERVVSHNRVSFKKATD---CRRI-------------CWDQTLVVTRRCFHDEWGKIYDTLCNAFQKNLII

Query:  NPFHPDKALLKCLEADFACIVAHNK---GWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCM
         PF  DKA+L  L  D A ++  NK   GWS +GN+ +KFE W+  LH   +V+PSYGGW+R R IPL  W  +TF+ IG A GG+++   + + +   +
Subjt:  NPFHPDKALLKCLEADFACIVAHNK---GWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCM

Query:  EAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNRE------------ELCSVDKNRKEDQKDRCQIN
        +A IKVR NY GF+PA I I    G   +   V   + + L  R V +HG F ++AA  F   +   E            E      +      D+  I+
Subjt:  EAVIKVRNNYCGFIPAEIDIIQEDGSVAVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNRE------------ELCSVDKNRKEDQKDRCQIN

Query:  GVTKKSNPDSASNKRDG-----SVVEREKGKNIVC------------KKGMAEVKV---GPSGLNISPNENAVTNKKGDGRGVEPTRVIGYQWKEKGRAQ
          T+    +S+ ++ D      S   +EKGK I+              K ++  KV    P G+  S + N   N KG    +     I  Q++++   +
Subjt:  GVTKKSNPDSASNKRDG-----SVVEREKGKNIVC------------KKGMAEVKV---GPSGLNISPNENAVTNKKGDGRGVEPTRVIGYQWKEKGRAQ

Query:  KEISFAPKAEVTTYVKKGLISSTTEER-KQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDN-GDQYHPPKDPLA
        ++     K ++T  +KK    ST + +   +ET    ++M + ++ G                           +  + ++ + E+N G      + P  
Subjt:  KEISFAPKAEVTTYVKKGLISSTTEER-KQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAALFEGEDN-GDQYHPPKDPLA

Query:  LVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTD--MGQRLDIPILKETVEILWQNGLCIRPIPKKLGGGSSKGKNNKNPMKREIKGLISSWE
           ST        T    E AD+    K   RS     + D   G  L+I    +   ++W     ++  PK           N  P       ++S  +
Subjt:  LVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTD--MGQRLDIPILKETVEILWQNGLCIRPIPKKLGGGSSKGKNNKNPMKREIKGLISSWE

Query:  KPMEKAYSSSQGQIESKLA---SVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWRELFDVYGLCNGVWCIGGDFSVIRR
        + M+ A     G     L      N  +      +  I    LN+ G          W ++VY P +Y +R  LW EL  +  LC   W I GDF+++R 
Subjt:  KPMEKAYSSSQGQIESKLA---SVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWRELFDVYGLCNGVWCIGGDFSVIRR

Query:  ADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTSDHFPILLQAGNFSWGPSPFRFENM
          E ++     R+M  F+ ++    L +    N  +TWS+ R  P +S++DRF +S    + F     + L+R  SDHFPILL++    WGP PFR  N 
Subjt:  ADERSSCGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTSDHFPILLQAGNFSWGPSPFRFENM

Query:  WLSHPNFAKLIEEWWK------------------------------------------------------------------------------------
         L    F K    WW                                                                                     
Subjt:  WLSHPNFAKLIEEWWK------------------------------------------------------------------------------------

Query:  -----------------------------------DPV--------DEGKTLRTKSSSIISSDGEQRCTLDGVEWQPISLTQAQWLERPFEREEIWKAVT
                                           DP         D  +T  +   +I + +  +   +D + W PIS      L +PF+  EI   + 
Subjt:  -----------------------------------DPV--------DEGKTLRTKSSSIISSDGEQRCTLDGVEWQPISLTQAQWLERPFEREEIWKAVT

Query:  VLGAQKT--------------------------LEFFENGIINRRTNETYICLIPKK-------------------KVAAK-------------------
            +K                            +F + GI+N   N T+I LI KK                   K+ AK                   
Subjt:  VLGAQKT--------------------------LEFFENGIINRRTNETYICLIPKK-------------------KVAAK-------------------

Query:  -------IMDAIMLAVESVDDYRARKEK-----------------------------------------------------------------TRDPLSP
               I DAI++A E +D ++ RK K                                                                   DPLSP
Subjt:  -------IMDAIMLAVESVDDYRARKEK-----------------------------------------------------------------TRDPLSP

Query:  FLFTLVADSLSRFIRVWSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGIVGINMEEGEVVRMADLLGCK
        F+F L  D LSR +    +KG IKG +       +SH+ F DD L+F + N +++ N Q     +EKASGLT N  K+ I  IN+  G   ++A   G +
Subjt:  FLFTLVADSLSRFIRVWSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGIVGINMEEGEVVRMADLLGCK

Query:  VEGLPFSYLGV
         + LP +YLGV
Subjt:  VEGLPFSYLGV

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGACGTCAAGAACAACTCGCAAGTGTGTAGCGGAAAGGAAGCTGCTCTCCATTGATTTTAGTAAGGAAAGAAACAAACGTCTGGTGAAGATCTCCGAGAAAAACCATGA
CCTGCGGTTTGAAGTAGTGTTGGAAAGTAAATACGTTCTTTGGATTTTTGACTCCATTGACGACCTGCTCATCACCCCTATTAGCCAAAAATTCTTCCGGAAGACAACAT
GTGAGAACGGTTTCATTTGGCTTCAGAAAGTGTCGAACAACAGGGGTGTGTTTGTGGAAATAGCGAAAGTTGCCTCATCTGGAAGAAGAGATAACCTGATCATTCCCCGT
GGGGAAGATTCCAAGGGAGGCTTTTCTCTTGATGAGGTTGTCAAAAGGAGTAACAAGGATGGCATCCCAAGTTCTTCAAAAGCCATGTCCCTCCCAAAAGACCAACGAAA
GGATTTTATCAGAAGGGAGCGGGTAGTAAGTCATAACCGGGTTTCCTTTAAAAAAGCGACAGACTGCAGACGCATTTGTTGGGACCAAACTTTGGTAGTCACCCGAAGAT
GTTTTCACGATGAATGGGGTAAAATCTATGATACGTTATGCAATGCTTTTCAGAAAAATTTGATTATTAACCCTTTTCATCCAGATAAGGCTCTCTTAAAGTGTCTAGAG
GCTGATTTTGCTTGTATAGTGGCTCATAATAAAGGCTGGTCGGTGATAGGGAATTTCTCTTTGAAGTTTGAATTGTGGAACTACGAGCTACACGGAAGGATTAATGTGGT
CCCGTCCTATGGTGGTTGGATTAGACTTCGGAACATTCCCCTGCAGAACTGGTGTGTTGACACATTTAAAGCGATTGGGGAAGCTTATGGCGGGTATATTGAATGCGATG
ATAAATGCCTTTCGTTAGTAGGATGTATGGAAGCTGTTATTAAAGTTAGAAATAACTACTGTGGTTTTATTCCAGCCGAAATCGATATCATTCAGGAAGATGGGTCTGTC
GCGGTCGCTCAAGTAGTCACGTATGAAGATCCCCAACTTCTGGAAAGCAGAAGAGTTTACATCCATGGAGGTTTTACCAGTGAAGCAGCTAGGGTTTTCTTTGTGGAAGA
CGATAACAGAGAAGAGCTTTGTTCAGTGGATAAAAACCGAAAAGAGGACCAAAAGGATAGGTGTCAGATAAATGGGGTTACAAAAAAGTCAAATCCTGACAGTGCTAGCA
ACAAAAGGGATGGCTCTGTTGTCGAGAGAGAAAAAGGAAAAAATATTGTCTGTAAAAAAGGTATGGCAGAGGTCAAAGTGGGGCCCAGTGGTCTGAATATCAGTCCCAAT
GAAAATGCGGTTACCAATAAAAAAGGAGATGGGAGGGGAGTGGAGCCCACGCGCGTGATAGGCTATCAGTGGAAAGAAAAAGGCAGAGCTCAAAAAGAGATATCTTTTGC
CCCTAAAGCTGAAGTTACAACTTACGTAAAGAAAGGCTTAATTAGTTCCACGACAGAGGAAAGAAAACAACAGGAAACAGATGGCAGGAAAGAGGAGATGGTAGTCGACC
TTGAAAATGGTTCGTTTTTAAGTGAGTCAAGCTGGGAGGATGACTTCCAGGGTTATGAAGGTCAAAATGAGGCGTGGAATCAGACTGAGGCCCTAGAAGATATAGCTGCC
TTGTTTGAAGGAGAAGATAACGGTGATCAGTATCACCCCCCGAAGGACCCCTTGGCCTTAGTTCCGAGTACTCCCCAAGAAGTTATGATTGAGGCAACTCCAATCAGATC
TGAAGAAGCTGATAAAGGAGGGATTACAAAGCCAATGGAGAGATCCGAGGAAGTTTTGGGGAATACTGACATGGGTCAAAGATTGGATATTCCAATTTTAAAGGAAACCG
TTGAAATCTTGTGGCAAAATGGGCTTTGTATAAGGCCAATTCCTAAGAAGTTAGGAGGTGGTAGCTCCAAGGGGAAAAACAATAAGAATCCGATGAAAAGGGAGATAAAA
GGGCTTATAAGTTCTTGGGAAAAACCGATGGAGAAAGCTTATTCTTCATCACAAGGCCAGATCGAATCTAAATTAGCTTCGGTTAACAGAAATATTATCAAGCAGTTGTG
GAGTTCTCGATTCATAGGTTGGACCTCCCTCAATTCAGAGGGAGCTTCGGGAGGGATCTTGGACTCTATTTGCTGGGCCTCTGCGGTGTACGACCCTAGTAGATACAAGG
AAAGAGACTTGTTGTGGCGTGAATTGTTCGATGTGTATGGTTTGTGTAATGGTGTGTGGTGTATAGGAGGGGACTTCAGTGTGATTAGGAGAGCTGATGAACGTTCTTCA
TGCGGTAGAAGGTCCCGTAGTATGAAAAGGTTTGACAGATGGGTTGAAGATTGCAACTTGACGGAAGTTGTGATGTCCAATGCTAAGTATACTTGGTCAGATAAAAGGGA
AGCCCCTGTTTTCTCCAAAATTGACAGGTTCTTTGTGTCCTCTAAGTGTCTGGATCTCTTTAAGAATGTTAATGTTAAATGGTTGCAAAGGGTTACTTCCGATCATTTCC
CGATTTTGTTGCAGGCTGGGAATTTCTCTTGGGGTCCAAGTCCTTTCCGGTTTGAGAATATGTGGCTTTCCCACCCAAATTTTGCCAAGCTTATCGAGGAGTGGTGGAAG
GATCCGGTTGATGAAGGGAAGACATTGAGAACGAAATCATCAAGTATTATCAGCAGTGACGGTGAGCAGAGGTGTACCCTAGATGGAGTTGAATGGCAGCCTATTTCGTT
GACCCAAGCTCAATGGTTAGAAAGGCCGTTTGAGAGGGAGGAAATTTGGAAAGCTGTGACAGTTCTTGGAGCTCAAAAAACCCTGGAGTTTTTTGAGAATGGGATTATCA
ATAGAAGAACCAATGAAACGTACATATGTTTGATCCCCAAGAAGAAAGTGGCTGCTAAGATTATGGATGCCATTATGCTAGCTGTCGAGTCTGTGGATGATTACCGAGCT
AGAAAAGAAAAGACAAGGGATCCCCTGTCTCCCTTCCTCTTCACTCTTGTGGCCGACTCCTTGAGCCGTTTCATCAGAGTGTGGAGTGCTAAAGGCTTCATTAAAGGCTT
CACAATAGGAAAGGAGGAGGTGGAGGTGTCTCATATCCAGTTTGTAGATGACACGCTGTTGTTTTGTGATGGTAATACTCAGTTTGTGAAGAATTGGCAGACTTTTTCAA
AGTTCTATGAAAAAGCCTCTGGGCTTACTATTAATATGCAAAAGACAGGCATTGTTGGCATAAACATGGAGGAAGGGGAAGTGGTCCGAATGGCTGATCTTCTAGGCTGC
AAAGTAGAAGGCCTTCCTTTTAGCTACTTGGGTGTTACGTGGGTGGTAAACCAAGGGTGGCTTCTTTTGGGACCCGGTGCTGGAAACTATTCGAGCTAA
mRNA sequenceShow/hide mRNA sequence
ATGACGTCAAGAACAACTCGCAAGTGTGTAGCGGAAAGGAAGCTGCTCTCCATTGATTTTAGTAAGGAAAGAAACAAACGTCTGGTGAAGATCTCCGAGAAAAACCATGA
CCTGCGGTTTGAAGTAGTGTTGGAAAGTAAATACGTTCTTTGGATTTTTGACTCCATTGACGACCTGCTCATCACCCCTATTAGCCAAAAATTCTTCCGGAAGACAACAT
GTGAGAACGGTTTCATTTGGCTTCAGAAAGTGTCGAACAACAGGGGTGTGTTTGTGGAAATAGCGAAAGTTGCCTCATCTGGAAGAAGAGATAACCTGATCATTCCCCGT
GGGGAAGATTCCAAGGGAGGCTTTTCTCTTGATGAGGTTGTCAAAAGGAGTAACAAGGATGGCATCCCAAGTTCTTCAAAAGCCATGTCCCTCCCAAAAGACCAACGAAA
GGATTTTATCAGAAGGGAGCGGGTAGTAAGTCATAACCGGGTTTCCTTTAAAAAAGCGACAGACTGCAGACGCATTTGTTGGGACCAAACTTTGGTAGTCACCCGAAGAT
GTTTTCACGATGAATGGGGTAAAATCTATGATACGTTATGCAATGCTTTTCAGAAAAATTTGATTATTAACCCTTTTCATCCAGATAAGGCTCTCTTAAAGTGTCTAGAG
GCTGATTTTGCTTGTATAGTGGCTCATAATAAAGGCTGGTCGGTGATAGGGAATTTCTCTTTGAAGTTTGAATTGTGGAACTACGAGCTACACGGAAGGATTAATGTGGT
CCCGTCCTATGGTGGTTGGATTAGACTTCGGAACATTCCCCTGCAGAACTGGTGTGTTGACACATTTAAAGCGATTGGGGAAGCTTATGGCGGGTATATTGAATGCGATG
ATAAATGCCTTTCGTTAGTAGGATGTATGGAAGCTGTTATTAAAGTTAGAAATAACTACTGTGGTTTTATTCCAGCCGAAATCGATATCATTCAGGAAGATGGGTCTGTC
GCGGTCGCTCAAGTAGTCACGTATGAAGATCCCCAACTTCTGGAAAGCAGAAGAGTTTACATCCATGGAGGTTTTACCAGTGAAGCAGCTAGGGTTTTCTTTGTGGAAGA
CGATAACAGAGAAGAGCTTTGTTCAGTGGATAAAAACCGAAAAGAGGACCAAAAGGATAGGTGTCAGATAAATGGGGTTACAAAAAAGTCAAATCCTGACAGTGCTAGCA
ACAAAAGGGATGGCTCTGTTGTCGAGAGAGAAAAAGGAAAAAATATTGTCTGTAAAAAAGGTATGGCAGAGGTCAAAGTGGGGCCCAGTGGTCTGAATATCAGTCCCAAT
GAAAATGCGGTTACCAATAAAAAAGGAGATGGGAGGGGAGTGGAGCCCACGCGCGTGATAGGCTATCAGTGGAAAGAAAAAGGCAGAGCTCAAAAAGAGATATCTTTTGC
CCCTAAAGCTGAAGTTACAACTTACGTAAAGAAAGGCTTAATTAGTTCCACGACAGAGGAAAGAAAACAACAGGAAACAGATGGCAGGAAAGAGGAGATGGTAGTCGACC
TTGAAAATGGTTCGTTTTTAAGTGAGTCAAGCTGGGAGGATGACTTCCAGGGTTATGAAGGTCAAAATGAGGCGTGGAATCAGACTGAGGCCCTAGAAGATATAGCTGCC
TTGTTTGAAGGAGAAGATAACGGTGATCAGTATCACCCCCCGAAGGACCCCTTGGCCTTAGTTCCGAGTACTCCCCAAGAAGTTATGATTGAGGCAACTCCAATCAGATC
TGAAGAAGCTGATAAAGGAGGGATTACAAAGCCAATGGAGAGATCCGAGGAAGTTTTGGGGAATACTGACATGGGTCAAAGATTGGATATTCCAATTTTAAAGGAAACCG
TTGAAATCTTGTGGCAAAATGGGCTTTGTATAAGGCCAATTCCTAAGAAGTTAGGAGGTGGTAGCTCCAAGGGGAAAAACAATAAGAATCCGATGAAAAGGGAGATAAAA
GGGCTTATAAGTTCTTGGGAAAAACCGATGGAGAAAGCTTATTCTTCATCACAAGGCCAGATCGAATCTAAATTAGCTTCGGTTAACAGAAATATTATCAAGCAGTTGTG
GAGTTCTCGATTCATAGGTTGGACCTCCCTCAATTCAGAGGGAGCTTCGGGAGGGATCTTGGACTCTATTTGCTGGGCCTCTGCGGTGTACGACCCTAGTAGATACAAGG
AAAGAGACTTGTTGTGGCGTGAATTGTTCGATGTGTATGGTTTGTGTAATGGTGTGTGGTGTATAGGAGGGGACTTCAGTGTGATTAGGAGAGCTGATGAACGTTCTTCA
TGCGGTAGAAGGTCCCGTAGTATGAAAAGGTTTGACAGATGGGTTGAAGATTGCAACTTGACGGAAGTTGTGATGTCCAATGCTAAGTATACTTGGTCAGATAAAAGGGA
AGCCCCTGTTTTCTCCAAAATTGACAGGTTCTTTGTGTCCTCTAAGTGTCTGGATCTCTTTAAGAATGTTAATGTTAAATGGTTGCAAAGGGTTACTTCCGATCATTTCC
CGATTTTGTTGCAGGCTGGGAATTTCTCTTGGGGTCCAAGTCCTTTCCGGTTTGAGAATATGTGGCTTTCCCACCCAAATTTTGCCAAGCTTATCGAGGAGTGGTGGAAG
GATCCGGTTGATGAAGGGAAGACATTGAGAACGAAATCATCAAGTATTATCAGCAGTGACGGTGAGCAGAGGTGTACCCTAGATGGAGTTGAATGGCAGCCTATTTCGTT
GACCCAAGCTCAATGGTTAGAAAGGCCGTTTGAGAGGGAGGAAATTTGGAAAGCTGTGACAGTTCTTGGAGCTCAAAAAACCCTGGAGTTTTTTGAGAATGGGATTATCA
ATAGAAGAACCAATGAAACGTACATATGTTTGATCCCCAAGAAGAAAGTGGCTGCTAAGATTATGGATGCCATTATGCTAGCTGTCGAGTCTGTGGATGATTACCGAGCT
AGAAAAGAAAAGACAAGGGATCCCCTGTCTCCCTTCCTCTTCACTCTTGTGGCCGACTCCTTGAGCCGTTTCATCAGAGTGTGGAGTGCTAAAGGCTTCATTAAAGGCTT
CACAATAGGAAAGGAGGAGGTGGAGGTGTCTCATATCCAGTTTGTAGATGACACGCTGTTGTTTTGTGATGGTAATACTCAGTTTGTGAAGAATTGGCAGACTTTTTCAA
AGTTCTATGAAAAAGCCTCTGGGCTTACTATTAATATGCAAAAGACAGGCATTGTTGGCATAAACATGGAGGAAGGGGAAGTGGTCCGAATGGCTGATCTTCTAGGCTGC
AAAGTAGAAGGCCTTCCTTTTAGCTACTTGGGTGTTACGTGGGTGGTAAACCAAGGGTGGCTTCTTTTGGGACCCGGTGCTGGAAACTATTCGAGCTAA
Protein sequenceShow/hide protein sequence
MTSRTTRKCVAERKLLSIDFSKERNKRLVKISEKNHDLRFEVVLESKYVLWIFDSIDDLLITPISQKFFRKTTCENGFIWLQKVSNNRGVFVEIAKVASSGRRDNLIIPR
GEDSKGGFSLDEVVKRSNKDGIPSSSKAMSLPKDQRKDFIRRERVVSHNRVSFKKATDCRRICWDQTLVVTRRCFHDEWGKIYDTLCNAFQKNLIINPFHPDKALLKCLE
ADFACIVAHNKGWSVIGNFSLKFELWNYELHGRINVVPSYGGWIRLRNIPLQNWCVDTFKAIGEAYGGYIECDDKCLSLVGCMEAVIKVRNNYCGFIPAEIDIIQEDGSV
AVAQVVTYEDPQLLESRRVYIHGGFTSEAARVFFVEDDNREELCSVDKNRKEDQKDRCQINGVTKKSNPDSASNKRDGSVVEREKGKNIVCKKGMAEVKVGPSGLNISPN
ENAVTNKKGDGRGVEPTRVIGYQWKEKGRAQKEISFAPKAEVTTYVKKGLISSTTEERKQQETDGRKEEMVVDLENGSFLSESSWEDDFQGYEGQNEAWNQTEALEDIAA
LFEGEDNGDQYHPPKDPLALVPSTPQEVMIEATPIRSEEADKGGITKPMERSEEVLGNTDMGQRLDIPILKETVEILWQNGLCIRPIPKKLGGGSSKGKNNKNPMKREIK
GLISSWEKPMEKAYSSSQGQIESKLASVNRNIIKQLWSSRFIGWTSLNSEGASGGILDSICWASAVYDPSRYKERDLLWRELFDVYGLCNGVWCIGGDFSVIRRADERSS
CGRRSRSMKRFDRWVEDCNLTEVVMSNAKYTWSDKREAPVFSKIDRFFVSSKCLDLFKNVNVKWLQRVTSDHFPILLQAGNFSWGPSPFRFENMWLSHPNFAKLIEEWWK
DPVDEGKTLRTKSSSIISSDGEQRCTLDGVEWQPISLTQAQWLERPFEREEIWKAVTVLGAQKTLEFFENGIINRRTNETYICLIPKKKVAAKIMDAIMLAVESVDDYRA
RKEKTRDPLSPFLFTLVADSLSRFIRVWSAKGFIKGFTIGKEEVEVSHIQFVDDTLLFCDGNTQFVKNWQTFSKFYEKASGLTINMQKTGIVGINMEEGEVVRMADLLGC
KVEGLPFSYLGVTWVVNQGWLLLGPGAGNYSS