CuGenDBv2

Lag0034842 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0034842
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: LINE-1 retrotransposable element ORF2 protein
Genome location: chr3:11480988..11486292
RNA-Seq Expression: Lag0034842
Synteny: Lag0034842
Gene Ontology terms: NA
InterPro domains: IPR000477 - Reverse transcriptase domain
                  IPR025558 - Domain of unknown function DUF4283


Homology
GenBank top hits (e value, % identity, alignment)
KAA0046762.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 4.5e-170; % identity: 39.55)
Query:  HHSRRLDRTTSNHFPILLE--NLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAH
        H +R L R+TS+HFP++ E  N  L WGP PFR ++  + +  F   +  WW+++ QDG PGYSFI+RLK L+  IK W+   +++  + K S+  E+  
Subjt:  HHSRRLDRTTSNHFPILLE--NLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAH

Query:  IDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVSSEGISLGKDYQLEK-------------
        ID  E   PL +    +RLAL+ADL+++  +E +F  Q  K LW+ +GDEN++FFH+IC+AR++RNFI E+   EG+    +  +               
Subjt:  IDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVSSEGISLGKDYQLEK-------------

Query:  -----------------------------ESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLR
                                     E EI   + SL   K+ G DG  + FFKS W  LK  IMDIF DF+D+G+IN+N+N TY+ALIPK+     
Subjt:  -----------------------------ESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLR

Query:  LSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWR
          D+RPISLTT +Y+I+AKTL+ R+K++LP+TI+EN  AFV + QITDAIL+ANE VDFW   K KG+I+KLDIEKAFD +NWDFID +L  K FPI WR
Subjt:  LSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWR

Query:  KWIKACISSVSYFVLLNGRPR---------------------------------------------------------DDILLFIRDDDSMLDNLFYILK
        KWI+ CIS+V+Y +++NGRP+                                                         DDILLFI D+D  L+NL   L 
Subjt:  KWIKACISSVSYFVLLNGRPR---------------------------------------------------------DDILLFIRDDDSMLDNLFYILK

Query:  SFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPL----------KLRSGIRSW------RSGRLTLIKSALSFIPNYML
         F+++SGL IN  KS+L  +NV   +A + A+ WG     LP+SYLG  LG NP           K++  + +W      + GRLTLIKS LS +P Y L
Subjt:  SFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPL----------KLRSGIRSW------RSGRLTLIKSALSFIPNYML

Query:  SVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNALWKRLLMAKFSPQNWVAIPSQAKFSS
        SVF+AP   CK I+K  R+FLW  N + E  +L+NW  V       GLGI +  V+N AL  KWLWR+  E  ALW+RL+  K+       IPS    S+
Subjt:  SVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNALWKRLLMAKFSPQNWVAIPSQAKFSS

Query:  SRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSLPRPSNA
        S++PW SI    + F  N SW+L +G++I FW+  WS  G L  + PR +ALS    + V +AW++ +  W    RR L   E + W+   + LP+P   
Subjt:  SRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSLPRPSNA

Query:  RGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALANLWSA
         GS    W  +SK  FS+ASA+ L   +    S     K L  +W +
Subjt:  RGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALANLWSA

KAA0057507.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 3.0e-219; % identity: 31.12)
Query:  LKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGS--IAEIFRIDGRGNKSCVMVPDGYDKSGWISFLSMLTFKE
        L E   +KSFSM +  +++ W++  FK LL T  T H+F ++R  D C+WV KT N+  +   AEIFRID +G K  ++VP+G D  GW SFL+++TF+ 
Subjt:  LKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGS--IAEIFRIDGRGNKSCVMVPDGYDKSGWISFLSMLTFKE

Query:  Q---KATQSGHAREYSNRHSTPDSPSSNSSKKSYVEIVKSPSKDD-------IVSSSGQKDLSSSKSKP----DNPGEVD-------FDFEWDYI-----
            K  +S   +E  +  S   S  S+SS+KSY +++   S+DD           S  +  SS   KP     N  E         F  +W+ I     
Subjt:  Q---KATQSGHAREYSNRHSTPDSPSSNSSKKSYVEIVKSPSKDD-------IVSSSGQKDLSSSKSKP----DNPGEVD-------FDFEWDYI-----

Query:  -------------------------AHLLCKTK---GWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFRGIPLHLWNMKTFTQVGDVCGGFVDVSKN
                                 A LLC  K   GW TVGN+ VKFE WD  +H+   ++PSYGGW++FRGIPLHLWN  TF  +G  CGGF+DV+K 
Subjt:  -------------------------AHLLCKTK---GWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFRGIPLHLWNMKTFTQVGDVCGGFVDVSKN

Query:  STRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFDAKSESFVFRGNEACTVQDVNVGSDSIIV
        + +   L +A IKV+ N+ GF+P ++ I D++G  F +  V P + +WLV RN +VHG+F  +AA E+D+ +  +E++ + G +A   +      D  I 
Subjt:  STRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFDAKSESFVFRGNEACTVQDVNVGSDSIIV

Query:  EKTPNAPAADRSFKRPSPTATSMRDKGKKICTSSEEDSQLDATSRKRDSHVSDKRTPKVDRAK-PYLKPNRMKGIQINE---------------------
            ++ +     K+ + + +      +++    +E  +       +D     KR+ ++   K  +L P    GIQ N                      
Subjt:  EKTPNAPAADRSFKRPSPTATSMRDKGKKICTSSEEDSQLDATSRKRDSHVSDKRTPKVDRAK-PYLKPNRMKGIQINE---------------------

Query:  PKVYRPKVTLMT-----VQKGDQYGAPI-----------NDEFMLTVDLGYLSPISDVPISS--------PEQTPSPTIELHEETPSKIAQDSLKMLLQP
         K + P+    T     ++K  Q                + +  L+VD+G +SP+  +  S           QTP    +  +   +K    +L + ++ 
Subjt:  PKVYRPKVTLMT-----VQKGDQYGAPI-----------NDEFMLTVDLGYLSPISDVPISS--------PEQTPSPTIELHEETPSKIAQDSLKMLLQP

Query:  NAQDSGSASSGDSQNNGKQENETQARTKERSE-DQTFKRQLNKWLIENKFCLVPT---------------------------------------------
         A  + SAS   ++ N K      A+T    E D+ FK +L  WL EN+  L P                                              
Subjt:  NAQDSGSASSGDSQNNGKQENETQARTKERSE-DQTFKRQLNKWLIENKFCLVPT---------------------------------------------

Query:  ------------------------------KY--------------------------------------------------------------------
                                      KY                                                                    
Subjt:  ------------------------------KY--------------------------------------------------------------------

Query:  -----------------------------HHSRRLDRTTSNHFPILLENLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLS
                                     H SR L+R  S+HFPILLE+  + WGP PFR +N  ++++ F     +WWNS+ Q GFPGY+FI+ L  LS
Subjt:  -----------------------------HHSRRLDRTTSNHFPILLENLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLS

Query:  AKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVS
          IK W+   V+     K +L  EI  ID LE QG +  +  QKR++L++DL  + + + +   Q  +  W   GDEN ++FH+IC+  +R+N I  +  
Subjt:  AKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVS

Query:  SEGISLGKDYQLEK-----------------------------------------ESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDF
          G SL     + +                                         ESEI   + S    K+ G DG T+ F+K  W  LK  ++++F DF
Subjt:  SEGISLGKDYQLEK-----------------------------------------ESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDF

Query:  FDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDI
           GI+N NVN T++ALI K+    + SDYRPISLTT LY+I+AK LA R+KS LP TIAEN  AF+   QI DAIL+ANE +D W   K KG+++KLDI
Subjt:  FDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDI

Query:  EKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPR-----------------------------------------------------
        EKAFDKI+W FID +L+ K FP  WRKWIKACIS+V Y +LLNG P+                                                     
Subjt:  EKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPR-----------------------------------------------------

Query:  ----DDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPLK----------LRSGIRSW
            DD+L+F+ D++  L+NL   L  F+++SGL  N +KS++S +N+   +  Q+A+ +G     LP++YLG  LG NP            +   +  W
Subjt:  ----DDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPLK----------LRSGIRSW

Query:  ------RSGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNA
              + GRLTL+K++LS +P Y LS FKAP S+ K+I+K  RDFLW  +  K++ +L+NWN   +P +  GLGI K K +N AL  KWLWR+  E N+
Subjt:  ------RSGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNA

Query:  LWKRLLMAKFSPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFF
        LWK+ + AK++  +   IP   + SS+ SPW +I K ++ +    SW   DG+ + FWH KW N+ PL   IPR YALSN  S  V E WD  +  WN  
Subjt:  LWKRLLMAKFSPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFF

Query:  PRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALANLW
        PRR L   E  TW +   SLPR  N RG     WN +    ++VASA+ + + E   P  +   K L +LW
Subjt:  PRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALANLW

TYJ99315.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 6.2e-204; % identity: 30.81)
Query:  FRHLPRSCVIKKKKFVLSFESRT-GSSFILKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGSIAEIFRIDGRG
        F+ LPRSC +++K+FVL  +  +  + + L E G++K+FS+ +    ++W++ T K+L+ATP TN +F + R  +  IW+ KT N +G  AEIFR+D + 
Subjt:  FRHLPRSCVIKKKKFVLSFESRT-GSSFILKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGSIAEIFRIDGRG

Query:  NKSCVMVPDGYDKSGWISFLSMLTFKEQKATQSGHAREYSNRHSTPD---SPSSNSSKKSYVEIV-------KSPSKDDIVSSSGQKDLSSSKSKPDNPG
         KSC++VP+G DKSGW+SFLSM+T    K       R      ++PD   SP  +  K+SY + V        S S D   SS      SSS S  D+P 
Subjt:  NKSCVMVPDGYDKSGWISFLSMLTFKEQKATQSGHAREYSNRHSTPD---SPSSNSSKKSYVEIV-------KSPSKDDIVSSSGQKDLSSSKSKPDNPG

Query:  -----------EVDFDFEWDYI-------------------------------AHLLCKTKGWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFRGIP
                      F  +W  I                               A+LLC+ KGW TVG + V+FE+W P  HA PKL+PSYGGW  FRGIP
Subjt:  -----------EVDFDFEWDYI-------------------------------AHLLCKTKGWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFRGIP

Query:  LHLWNMKTFTQVGDVCGGFVDVSKNSTRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFDAKS
        LHLWNM TF Q+G  C G + V++ +    +L EA IKV+ N+ GF+P  VRI D++G +F +++VT  +GKWL+ RN ++HGTF R+AA  +D+F+ +S
Subjt:  LHLWNMKTFTQVGDVCGGFVDVSKNSTRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFDAKS

Query:  ESFVFRGNEACTVQDVNVGSDSIIVEKTPNAPAA--------DRSFKRPS-------------PTATSMR-------------DKGK-KICTSSEEDSQL
        E F F G+EA +   ++  SD      TP+ P+A        DR+   PS              TA   +             DKGK K+    + +S L
Subjt:  ESFVFRGNEACTVQDVNVGSDSIIVEKTPNAPAA--------DRSFKRPS-------------PTATSMR-------------DKGK-KICTSSEEDSQL

Query:  DATSRKR------------------------DSHVSDKRTPKVDR----------AKPYLKPNRMKGIQINEPKVYRPKVTLMTVQKGDQYGAPINDEFM
        +    KR                         S  S ++  KV R           +P  K N+ KG+ I +P         + +   D+  A       
Subjt:  DATSRKR------------------------DSHVSDKRTPKVDR----------AKPYLKPNRMKGIQINEPKVYRPKVTLMTVQKGDQYGAPINDEFM

Query:  LTVDLGYLSPISDVPISSPEQTPSPTIELHEETPSKIAQDSLKMLLQPNAQDSGSASSGDSQNNGKQENETQARTKERSE----DQTFKRQLNKWL----
        LTVDLG L P  D   S  +   S   E+ + T +++  ++ +M +  N   + S+ +   +     + +   R KE  E     + FK+QL  WL    
Subjt:  LTVDLGYLSPISDVPISSPEQTPSPTIELHEETPSKIAQDSLKMLLQPNAQDSGSASSGDSQNNGKQENETQARTKERSE----DQTFKRQLNKWL----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IENKFCLVPT---------------------------------KYH
                                                              I N   + P                                    H
Subjt:  ------------------------------------------------------IENKFCLVPT---------------------------------KYH

Query:  HSRRLDRTTSNHFPILLE--NLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAHI
         +R L R+TS+HFP++ E  N  LSWGP PFR ++  + +  F   +  WW ++ Q G+PG+SFI+RLK L+  IK W+   + ++   K ++  E+  I
Subjt:  HSRRLDRTTSNHFPILLE--NLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAHI

Query:  DALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVSSEGISLGKDYQLEKESEIYQNLKSLGCN
        D  E   PL +    +RLAL+ADL+++  +E +F  Q  K LW+ +GDEN++FFH+ICS+R++R+FI E+   EG     +  +   S  +    S    
Subjt:  DALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVSSEGISLGKDYQLEKESEIYQNLKSLGCN

Query:  KSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSS
         S+  D L +E     W  +  S        F  G I   +N       P         D  PIS             +  +K+TLP+TI+ N  AFV +
Subjt:  KSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSS

Query:  HQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPR----------------------
         QITDAIL+ANE VD+W   K KG+I+KLDIEKAFD +N DFID++L  K FP  WRKWI+ CIS+V+Y V++NGRP+                      
Subjt:  HQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPR----------------------

Query:  -----------------------------------DDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPI
                                           DDILLFI D+D  L NL   L  F+++SGL IN  KS+L  VNV   +A + A+ WG     LP+
Subjt:  -----------------------------------DDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPI

Query:  SYLGALLGINPL----------KLRSGIRSW------RSGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPL
        SYLG  LG NP           K++  + +W      + GRLTLIKS LS +P Y LSVF+AP   CK I+K+ R FLW  N   E  +L+NW  V+   
Subjt:  SYLGALLGINPL----------KLRSGIRSW------RSGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPL

Query:  DSSGLGIFKTKVSNNALQFKWLWRFFQEDNALWKRLLMAKFSPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLH
        +  GLGI +  V+N AL  KWLWR+  E NALW+RL+  K+  +    IPS    S+S++PW SI    + F  N SW+L +G++I FW+  WS  G L 
Subjt:  DSSGLGIFKTKVSNNALQFKWLWRFFQEDNALWKRLLMAKFSPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLH

Query:  HSIPRFYALSNAISLKVAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALAN
         + PR +AL+    + V +AW++ +  WN   RR L   E   W    + LP P + RGS    W  +S   FS+ASA++L   +          K L  
Subjt:  HSIPRFYALSNAISLKVAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALAN

Query:  LWSA
        +W +
Subjt:  LWSA

TYK00493.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 4.4e-218; % identity: 32.28)
Query:  MASFRHLPRSCVIKKKKFVLSFES-RTGSSFILKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGSIAEIFRID
        MA F+ LPRSC I++K+FVL  +     + + L E G++K+FS+ +    ++W++ T K+L+ TP +N +F + R  ++CIW+ KT N +G  AEIFR+D
Subjt:  MASFRHLPRSCVIKKKKFVLSFES-RTGSSFILKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGSIAEIFRID

Query:  GRGNKSCVMVPDGYDKSGWISFLSMLTFKEQKATQSGHAREYSNRHSTPD---SPSSNSSKKSYVEIV-------KSPSKDDIVSSSGQKDLSSSKSKPD
         +  KSC++VP+G +KS W+SFLSM+T    K       R      S+P+   SP  +  K+SY + V        S S D   SS   +  SS  S  D
Subjt:  GRGNKSCVMVPDGYDKSGWISFLSMLTFKEQKATQSGHAREYSNRHSTPD---SPSSNSSKKSYVEIV-------KSPSKDDIVSSSGQKDLSSSKSKPD

Query:  NPGEV-----------DFDFEWDYI-------------------------------AHLLCKTKGWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFR
        +P  V            F  +W  I                               A+LLC+ KGW TVG + V+FE+W P  HA PKL+PSYGGW  FR
Subjt:  NPGEV-----------DFDFEWDYI-------------------------------AHLLCKTKGWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFR

Query:  GIPLHLWNMKTFTQVGDVCGGFVDVSKNSTRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFD
        GIPLHLWNM TF Q+G  CGG + V++ +    +L EA +K++ N+ GF+P  V+I D +G +F +++VT  +GKWL+ RN ++HGTF R+AA  +D+F+
Subjt:  GIPLHLWNMKTFTQVGDVCGGFVDVSKNSTRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFD

Query:  AKSESFVFRGNEACTVQDVNVGSDSIIVEKTPNAPAADRS-FKRPSPTATSMRDKGKKICTSSEEDSQLDATSRKRDSHV---------SDKRTPKVD--
          SE F+F G EA +   +N  S S     +P  P+A +S   +P+  ATS     +++      D+ L AT+ K    +          DK   KVD  
Subjt:  AKSESFVFRGNEACTVQDVNVGSDSIIVEKTPNAPAADRS-FKRPSPTATSMRDKGKKICTSSEEDSQLDATSRKRDSHV---------SDKRTPKVD--

Query:  ----RAKPYLKPNRMKGIQ--INEPKVYRP---------------------KVTLMTVQ--------KGDQYGAPI----------NDEFMLTVDLGYLS
             A  + KP R        N+   + P                     K    T+Q        KG+    P+               LTVDLG L 
Subjt:  ----RAKPYLKPNRMKGIQ--INEPKVYRP---------------------KVTLMTVQ--------KGDQYGAPI----------NDEFMLTVDLGYLS

Query:  PISDVPISSPEQTPSPTIELHEETPSKIAQDSLKM-LLQPNAQDSGSASSGDSQNNGKQENETQARTKERSED---QTFKRQLNKWLIEN----------
        P+ D   S  +   S   E+ + T +++  ++ ++ +  P   +S    +   Q +  +      + +++ +D   + FK QL  WL EN          
Subjt:  PISDVPISSPEQTPSPTIELHEETPSKIAQDSLKM-LLQPNAQDSGSASSGDSQNNGKQENETQARTKERSED---QTFKRQLNKWLIEN----------

Query:  --------------------------------------KFCLV-----------------PTK-------------YHH---------------------
                                              KF L                  P K              HH                     
Subjt:  --------------------------------------KFCLV-----------------PTK-------------YHH---------------------

Query:  ---------------------------------------------------------------SRRLDRTTSNHFPILLEN--LALSWGPSPFRFDNYLI
                                                                       +R L R TS+HFP++ E+    L WGP+PFR ++  +
Subjt:  ---------------------------------------------------------------SRRLDRTTSNHFPILLEN--LALSWGPSPFRFDNYLI

Query:  KERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQC
         +  F   ++ WW  + Q+G PG+ FI+RLK L+  IK W+     ++ + K ++  E+  ID  E   PL      +RLAL+A+LN +  +E +F  Q 
Subjt:  KERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQC

Query:  YKNLWINQGDENTNFFHKICSARKRRNFISELVSSEG--------ISLG------------------------------KDYQL----EKESEIYQNLKS
         K LW+ +GDEN+ FFH+ICS+R++RN I E+   EG        ISL                                D+ L      E EI   +KS
Subjt:  YKNLWINQGDENTNFFHKICSARKRRNFISELVSSEG--------ISLG------------------------------KDYQL----EKESEIYQNLKS

Query:  LGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFA
           NK+ G DG  + FFKS W  LK  I+DIF DFF++G+IN+N+N TY+ALI K+       D+RPISLTT +Y+ +AKTL+ R+K TLP TI+ N  A
Subjt:  LGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFA

Query:  FVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPRDDILL--FIRDDDSMLDN
        F+ + QITDAIL+ANE +D+W   K KG+I+KLDIEKAFD +NW+FID +L    +P +WRKWI+ CIS+V+Y +++NG+P+  I     +R  D +   
Subjt:  FVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPRDDILL--FIRDDDSMLDN

Query:  LFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPL----------KLRSGIRSW------RSGRLTLIKSALSF
        LF I           +++    LS +   G      A K G     LP++YLG  LG NP           +++  + +W      + GRLTLIKS LS 
Subjt:  LFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPL----------KLRSGIRSW------RSGRLTLIKSALSF

Query:  IPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNALWKRLLMAKFSPQNWVAIPS
        +P Y LSVF+AP S  K I+K+ R+FLW  +   +  +L+NW+ V  P +  GLGI + +V+N AL  KWLWR++ E N+LW+RL+  K+  ++   +PS
Subjt:  IPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNALWKRLLMAKFSPQNWVAIPS

Query:  QAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSL
            SSS++PW SI    + F  N  W+L +G++I FW+  WS  G L  + PR +ALS      + + W+S+N  W    RR L   E STW    ++L
Subjt:  QAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSL

Query:  PRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKA-LANL
        P     RG     W  +SK  FS+ASA+       QP  +  NP+  L NL
Subjt:  PRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKA-LANL

TYK08190.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] (e value: 1.0e-219; % identity: 31.18)
Query:  LKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGS--IAEIFRIDGRGNKSCVMVPDGYDKSGWISFLSMLTFKE
        L E   +KSFSM +  +++ W++  FK LL T  T H+F ++R  D C+WV KT N+  +   AEIFRID +G K  ++VP+G D  GW SFL+++TF+ 
Subjt:  LKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGS--IAEIFRIDGRGNKSCVMVPDGYDKSGWISFLSMLTFKE

Query:  Q---KATQSGHAREYSNRHSTPDSPSSNSSKKSYVEIVKSPSKDD-------IVSSSGQKDLSSSKSKP----DNPGEVD-------FDFEWDYI-----
            K  +S   +E  +  S   S  S+SS+KSY +++   S+DD           S  +  SS   KP     N  E         F  +W+ I     
Subjt:  Q---KATQSGHAREYSNRHSTPDSPSSNSSKKSYVEIVKSPSKDD-------IVSSSGQKDLSSSKSKP----DNPGEVD-------FDFEWDYI-----

Query:  -------------------------AHLLCKTK---GWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFRGIPLHLWNMKTFTQVGDVCGGFVDVSKN
                                 A LLC  K   GW TVGN+ VKFE WD  +H+   ++PSYGGW++FRGIPLHLWN  TF  +G  CGGF+DV+K 
Subjt:  -------------------------AHLLCKTK---GWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFRGIPLHLWNMKTFTQVGDVCGGFVDVSKN

Query:  STRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFDAKSESFVFRGNEACTVQDVNVGSDSIIV
        + +   L +A IKV+ N+ GF+P ++ I D++G  F +  V P + +WLV RN +VHG+F  +AA E+D+ +  +E++ + G +A   +      D  I 
Subjt:  STRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFDAKSESFVFRGNEACTVQDVNVGSDSIIV

Query:  EKTPNAPAADRSFKRPSPTATSMRDKGKKICTSSEEDSQLDATSRKRDSHVSDKRTPKVDRAK-PYLKPNRMKGIQINE---------------------
            ++ +     K+ + + +      +++    +E  +       +D     KR+ ++   K  +L P    GIQ N                      
Subjt:  EKTPNAPAADRSFKRPSPTATSMRDKGKKICTSSEEDSQLDATSRKRDSHVSDKRTPKVDRAK-PYLKPNRMKGIQINE---------------------

Query:  PKVYRPKVTLMT-----VQKGDQYGAPI-----------NDEFMLTVDLGYLSPISDVPISS--------PEQTPSPTIELHEETPSKIAQDSLKMLLQP
         K + P+    T     ++K  Q                + +  L+VD+G +SP+  +  S           QTP    +  +   +K    +L + ++ 
Subjt:  PKVYRPKVTLMT-----VQKGDQYGAPI-----------NDEFMLTVDLGYLSPISDVPISS--------PEQTPSPTIELHEETPSKIAQDSLKMLLQP

Query:  NAQDSGSASSGDSQNNGKQENETQARTKERSE-DQTFKRQLNKWLIENKFCLVPT---------------------------------------------
         A  + SAS   ++ N K      A+T    E D+ FK +L  WL EN+  L P                                              
Subjt:  NAQDSGSASSGDSQNNGKQENETQARTKERSE-DQTFKRQLNKWLIENKFCLVPT---------------------------------------------

Query:  ------------------------------KY--------------------------------------------------------------------
                                      KY                                                                    
Subjt:  ------------------------------KY--------------------------------------------------------------------

Query:  -----------------------------HHSRRLDRTTSNHFPILLENLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLS
                                     H SR L+R  S+HFPILLE+  + WGP PFR +N  ++++ F     +WWNS+ Q GFPGY+FI+ L  LS
Subjt:  -----------------------------HHSRRLDRTTSNHFPILLENLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLS

Query:  AKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVS
          IK W+   V+     K +L  EI  ID LE QG +  +  QKR++L++DL  + + + +   Q  +  W   GDEN ++FH+IC+  +R+N I  +  
Subjt:  AKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVS

Query:  SEGISLGKDYQLEK-----------------------------------------ESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDF
          G SL     + +                                         ESEI   + S    K+ G DG T+ F+K  W  LK  ++++F DF
Subjt:  SEGISLGKDYQLEK-----------------------------------------ESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDF

Query:  FDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDI
           GI+N NVN T++ALI K+    + SDYRPISLTT LY+I+AK LA R+KS LP TIAEN  AF+   QI DAIL+ANEV+D W   K KG+++KLDI
Subjt:  FDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDI

Query:  EKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPR-----------------------------------------------------
        EKAFDKI+W FID +L+ K FP  WRKWIKACIS+V Y +LLNG P+                                                     
Subjt:  EKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPR-----------------------------------------------------

Query:  ----DDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPLK----------LRSGIRSW
            DD+L+F+ D++  L+NL   L  F+++SGL  N +KS++S +N+   +  Q+A+ +G     LP++YLG  LG NP            +   +  W
Subjt:  ----DDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPLK----------LRSGIRSW

Query:  ------RSGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNA
              + GRLTL+K++LS +P Y LS FKAP S+ K+I+K  RDFLW  +  K++ +L+NWN   +P +  GLGI K K +N AL  KWLWR+  E N+
Subjt:  ------RSGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNA

Query:  LWKRLLMAKFSPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFF
        LWK+ + AK++  +   IP   + SS+ SPW +I K ++ +    SW   DG+ + FWH KW N+ PL   IPR YALSN  S  V E WD  +  WN  
Subjt:  LWKRLLMAKFSPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFF

Query:  PRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALANLW
        PRR L   E  TW +   SLPR  N RG     WN +    ++VASA+ + + E   P  +   K L +LW
Subjt:  PRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALANLW

TrEMBL top hits (e value, % identity, alignment)
A0A5A7TTK1 LINE-1 retrotransposable element ORF2 protein (e value: 2.2e-170; % identity: 39.55)
Query:  HHSRRLDRTTSNHFPILLE--NLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAH
        H +R L R+TS+HFP++ E  N  L WGP PFR ++  + +  F   +  WW+++ QDG PGYSFI+RLK L+  IK W+   +++  + K S+  E+  
Subjt:  HHSRRLDRTTSNHFPILLE--NLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAH

Query:  IDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVSSEGISLGKDYQLEK-------------
        ID  E   PL +    +RLAL+ADL+++  +E +F  Q  K LW+ +GDEN++FFH+IC+AR++RNFI E+   EG+    +  +               
Subjt:  IDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVSSEGISLGKDYQLEK-------------

Query:  -----------------------------ESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLR
                                     E EI   + SL   K+ G DG  + FFKS W  LK  IMDIF DF+D+G+IN+N+N TY+ALIPK+     
Subjt:  -----------------------------ESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLR

Query:  LSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWR
          D+RPISLTT +Y+I+AKTL+ R+K++LP+TI+EN  AFV + QITDAIL+ANE VDFW   K KG+I+KLDIEKAFD +NWDFID +L  K FPI WR
Subjt:  LSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWR

Query:  KWIKACISSVSYFVLLNGRPR---------------------------------------------------------DDILLFIRDDDSMLDNLFYILK
        KWI+ CIS+V+Y +++NGRP+                                                         DDILLFI D+D  L+NL   L 
Subjt:  KWIKACISSVSYFVLLNGRPR---------------------------------------------------------DDILLFIRDDDSMLDNLFYILK

Query:  SFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPL----------KLRSGIRSW------RSGRLTLIKSALSFIPNYML
         F+++SGL IN  KS+L  +NV   +A + A+ WG     LP+SYLG  LG NP           K++  + +W      + GRLTLIKS LS +P Y L
Subjt:  SFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPL----------KLRSGIRSW------RSGRLTLIKSALSFIPNYML

Query:  SVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNALWKRLLMAKFSPQNWVAIPSQAKFSS
        SVF+AP   CK I+K  R+FLW  N + E  +L+NW  V       GLGI +  V+N AL  KWLWR+  E  ALW+RL+  K+       IPS    S+
Subjt:  SVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNALWKRLLMAKFSPQNWVAIPSQAKFSS

Query:  SRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSLPRPSNA
        S++PW SI    + F  N SW+L +G++I FW+  WS  G L  + PR +ALS    + V +AW++ +  W    RR L   E + W+   + LP+P   
Subjt:  SRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSLPRPSNA

Query:  RGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALANLWSA
         GS    W  +SK  FS+ASA+ L   +    S     K L  +W +
Subjt:  RGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALANLWSA

A0A5A7US62 LINE-1 retrotransposable element ORF2 protein (e value: 1.5e-219; % identity: 31.12)
Query:  LKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGS--IAEIFRIDGRGNKSCVMVPDGYDKSGWISFLSMLTFKE
        L E   +KSFSM +  +++ W++  FK LL T  T H+F ++R  D C+WV KT N+  +   AEIFRID +G K  ++VP+G D  GW SFL+++TF+ 
Subjt:  LKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGS--IAEIFRIDGRGNKSCVMVPDGYDKSGWISFLSMLTFKE

Query:  Q---KATQSGHAREYSNRHSTPDSPSSNSSKKSYVEIVKSPSKDD-------IVSSSGQKDLSSSKSKP----DNPGEVD-------FDFEWDYI-----
            K  +S   +E  +  S   S  S+SS+KSY +++   S+DD           S  +  SS   KP     N  E         F  +W+ I     
Subjt:  Q---KATQSGHAREYSNRHSTPDSPSSNSSKKSYVEIVKSPSKDD-------IVSSSGQKDLSSSKSKP----DNPGEVD-------FDFEWDYI-----

Query:  -------------------------AHLLCKTK---GWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFRGIPLHLWNMKTFTQVGDVCGGFVDVSKN
                                 A LLC  K   GW TVGN+ VKFE WD  +H+   ++PSYGGW++FRGIPLHLWN  TF  +G  CGGF+DV+K 
Subjt:  -------------------------AHLLCKTK---GWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFRGIPLHLWNMKTFTQVGDVCGGFVDVSKN

Query:  STRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFDAKSESFVFRGNEACTVQDVNVGSDSIIV
        + +   L +A IKV+ N+ GF+P ++ I D++G  F +  V P + +WLV RN +VHG+F  +AA E+D+ +  +E++ + G +A   +      D  I 
Subjt:  STRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFDAKSESFVFRGNEACTVQDVNVGSDSIIV

Query:  EKTPNAPAADRSFKRPSPTATSMRDKGKKICTSSEEDSQLDATSRKRDSHVSDKRTPKVDRAK-PYLKPNRMKGIQINE---------------------
            ++ +     K+ + + +      +++    +E  +       +D     KR+ ++   K  +L P    GIQ N                      
Subjt:  EKTPNAPAADRSFKRPSPTATSMRDKGKKICTSSEEDSQLDATSRKRDSHVSDKRTPKVDRAK-PYLKPNRMKGIQINE---------------------

Query:  PKVYRPKVTLMT-----VQKGDQYGAPI-----------NDEFMLTVDLGYLSPISDVPISS--------PEQTPSPTIELHEETPSKIAQDSLKMLLQP
         K + P+    T     ++K  Q                + +  L+VD+G +SP+  +  S           QTP    +  +   +K    +L + ++ 
Subjt:  PKVYRPKVTLMT-----VQKGDQYGAPI-----------NDEFMLTVDLGYLSPISDVPISS--------PEQTPSPTIELHEETPSKIAQDSLKMLLQP

Query:  NAQDSGSASSGDSQNNGKQENETQARTKERSE-DQTFKRQLNKWLIENKFCLVPT---------------------------------------------
         A  + SAS   ++ N K      A+T    E D+ FK +L  WL EN+  L P                                              
Subjt:  NAQDSGSASSGDSQNNGKQENETQARTKERSE-DQTFKRQLNKWLIENKFCLVPT---------------------------------------------

Query:  ------------------------------KY--------------------------------------------------------------------
                                      KY                                                                    
Subjt:  ------------------------------KY--------------------------------------------------------------------

Query:  -----------------------------HHSRRLDRTTSNHFPILLENLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLS
                                     H SR L+R  S+HFPILLE+  + WGP PFR +N  ++++ F     +WWNS+ Q GFPGY+FI+ L  LS
Subjt:  -----------------------------HHSRRLDRTTSNHFPILLENLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLS

Query:  AKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVS
          IK W+   V+     K +L  EI  ID LE QG +  +  QKR++L++DL  + + + +   Q  +  W   GDEN ++FH+IC+  +R+N I  +  
Subjt:  AKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVS

Query:  SEGISLGKDYQLEK-----------------------------------------ESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDF
          G SL     + +                                         ESEI   + S    K+ G DG T+ F+K  W  LK  ++++F DF
Subjt:  SEGISLGKDYQLEK-----------------------------------------ESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDF

Query:  FDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDI
           GI+N NVN T++ALI K+    + SDYRPISLTT LY+I+AK LA R+KS LP TIAEN  AF+   QI DAIL+ANE +D W   K KG+++KLDI
Subjt:  FDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDI

Query:  EKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPR-----------------------------------------------------
        EKAFDKI+W FID +L+ K FP  WRKWIKACIS+V Y +LLNG P+                                                     
Subjt:  EKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPR-----------------------------------------------------

Query:  ----DDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPLK----------LRSGIRSW
            DD+L+F+ D++  L+NL   L  F+++SGL  N +KS++S +N+   +  Q+A+ +G     LP++YLG  LG NP            +   +  W
Subjt:  ----DDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPLK----------LRSGIRSW

Query:  ------RSGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNA
              + GRLTL+K++LS +P Y LS FKAP S+ K+I+K  RDFLW  +  K++ +L+NWN   +P +  GLGI K K +N AL  KWLWR+  E N+
Subjt:  ------RSGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNA

Query:  LWKRLLMAKFSPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFF
        LWK+ + AK++  +   IP   + SS+ SPW +I K ++ +    SW   DG+ + FWH KW N+ PL   IPR YALSN  S  V E WD  +  WN  
Subjt:  LWKRLLMAKFSPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFF

Query:  PRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALANLW
        PRR L   E  TW +   SLPR  N RG     WN +    ++VASA+ + + E   P  +   K L +LW
Subjt:  PRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALANLW

A0A5D3BL61 LINE-1 retrotransposable element ORF2 protein (e value: 2.1e-218, %identity: 32.28)
Query:  MASFRHLPRSCVIKKKKFVLSFES-RTGSSFILKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGSIAEIFRID
        MA F+ LPRSC I++K+FVL  +     + + L E G++K+FS+ +    ++W++ T K+L+ TP +N +F + R  ++CIW+ KT N +G  AEIFR+D
Subjt:  MASFRHLPRSCVIKKKKFVLSFES-RTGSSFILKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGSIAEIFRID

Query:  GRGNKSCVMVPDGYDKSGWISFLSMLTFKEQKATQSGHAREYSNRHSTPD---SPSSNSSKKSYVEIV-------KSPSKDDIVSSSGQKDLSSSKSKPD
         +  KSC++VP+G +KS W+SFLSM+T    K       R      S+P+   SP  +  K+SY + V        S S D   SS   +  SS  S  D
Subjt:  GRGNKSCVMVPDGYDKSGWISFLSMLTFKEQKATQSGHAREYSNRHSTPD---SPSSNSSKKSYVEIV-------KSPSKDDIVSSSGQKDLSSSKSKPD

Query:  NPGEV-----------DFDFEWDYI-------------------------------AHLLCKTKGWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFR
        +P  V            F  +W  I                               A+LLC+ KGW TVG + V+FE+W P  HA PKL+PSYGGW  FR
Subjt:  NPGEV-----------DFDFEWDYI-------------------------------AHLLCKTKGWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFR

Query:  GIPLHLWNMKTFTQVGDVCGGFVDVSKNSTRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFD
        GIPLHLWNM TF Q+G  CGG + V++ +    +L EA +K++ N+ GF+P  V+I D +G +F +++VT  +GKWL+ RN ++HGTF R+AA  +D+F+
Subjt:  GIPLHLWNMKTFTQVGDVCGGFVDVSKNSTRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFD

Query:  AKSESFVFRGNEACTVQDVNVGSDSIIVEKTPNAPAADRS-FKRPSPTATSMRDKGKKICTSSEEDSQLDATSRKRDSHV---------SDKRTPKVD--
          SE F+F G EA +   +N  S S     +P  P+A +S   +P+  ATS     +++      D+ L AT+ K    +          DK   KVD  
Subjt:  AKSESFVFRGNEACTVQDVNVGSDSIIVEKTPNAPAADRS-FKRPSPTATSMRDKGKKICTSSEEDSQLDATSRKRDSHV---------SDKRTPKVD--

Query:  ----RAKPYLKPNRMKGIQ--INEPKVYRP---------------------KVTLMTVQ--------KGDQYGAPI----------NDEFMLTVDLGYLS
             A  + KP R        N+   + P                     K    T+Q        KG+    P+               LTVDLG L 
Subjt:  ----RAKPYLKPNRMKGIQ--INEPKVYRP---------------------KVTLMTVQ--------KGDQYGAPI----------NDEFMLTVDLGYLS

Query:  PISDVPISSPEQTPSPTIELHEETPSKIAQDSLKM-LLQPNAQDSGSASSGDSQNNGKQENETQARTKERSED---QTFKRQLNKWLIEN----------
        P+ D   S  +   S   E+ + T +++  ++ ++ +  P   +S    +   Q +  +      + +++ +D   + FK QL  WL EN          
Subjt:  PISDVPISSPEQTPSPTIELHEETPSKIAQDSLKM-LLQPNAQDSGSASSGDSQNNGKQENETQARTKERSED---QTFKRQLNKWLIEN----------

Query:  --------------------------------------KFCLV-----------------PTK-------------YHH---------------------
                                              KF L                  P K              HH                     
Subjt:  --------------------------------------KFCLV-----------------PTK-------------YHH---------------------

Query:  ---------------------------------------------------------------SRRLDRTTSNHFPILLEN--LALSWGPSPFRFDNYLI
                                                                       +R L R TS+HFP++ E+    L WGP+PFR ++  +
Subjt:  ---------------------------------------------------------------SRRLDRTTSNHFPILLEN--LALSWGPSPFRFDNYLI

Query:  KERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQC
         +  F   ++ WW  + Q+G PG+ FI+RLK L+  IK W+     ++ + K ++  E+  ID  E   PL      +RLAL+A+LN +  +E +F  Q 
Subjt:  KERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQC

Query:  YKNLWINQGDENTNFFHKICSARKRRNFISELVSSEG--------ISLG------------------------------KDYQL----EKESEIYQNLKS
         K LW+ +GDEN+ FFH+ICS+R++RN I E+   EG        ISL                                D+ L      E EI   +KS
Subjt:  YKNLWINQGDENTNFFHKICSARKRRNFISELVSSEG--------ISLG------------------------------KDYQL----EKESEIYQNLKS

Query:  LGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFA
           NK+ G DG  + FFKS W  LK  I+DIF DFF++G+IN+N+N TY+ALI K+       D+RPISLTT +Y+ +AKTL+ R+K TLP TI+ N  A
Subjt:  LGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFA

Query:  FVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPRDDILL--FIRDDDSMLDN
        F+ + QITDAIL+ANE +D+W   K KG+I+KLDIEKAFD +NW+FID +L    +P +WRKWI+ CIS+V+Y +++NG+P+  I     +R  D +   
Subjt:  FVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPRDDILL--FIRDDDSMLDN

Query:  LFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPL----------KLRSGIRSW------RSGRLTLIKSALSF
        LF I           +++    LS +   G      A K G     LP++YLG  LG NP           +++  + +W      + GRLTLIKS LS 
Subjt:  LFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPL----------KLRSGIRSW------RSGRLTLIKSALSF

Query:  IPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNALWKRLLMAKFSPQNWVAIPS
        +P Y LSVF+AP S  K I+K+ R+FLW  +   +  +L+NW+ V  P +  GLGI + +V+N AL  KWLWR++ E N+LW+RL+  K+  ++   +PS
Subjt:  IPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNALWKRLLMAKFSPQNWVAIPS

Query:  QAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSL
            SSS++PW SI    + F  N  W+L +G++I FW+  WS  G L  + PR +ALS      + + W+S+N  W    RR L   E STW    ++L
Subjt:  QAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSL

Query:  PRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKA-LANL
        P     RG     W  +SK  FS+ASA+       QP  +  NP+  L NL
Subjt:  PRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKA-LANL

A0A5D3BLV7 LINE-1 retrotransposable element ORF2 protein (e value: 3.0e-204, %identity: 30.81)
Query:  FRHLPRSCVIKKKKFVLSFESRT-GSSFILKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGSIAEIFRIDGRG
        F+ LPRSC +++K+FVL  +  +  + + L E G++K+FS+ +    ++W++ T K+L+ATP TN +F + R  +  IW+ KT N +G  AEIFR+D + 
Subjt:  FRHLPRSCVIKKKKFVLSFESRT-GSSFILKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGSIAEIFRIDGRG

Query:  NKSCVMVPDGYDKSGWISFLSMLTFKEQKATQSGHAREYSNRHSTPD---SPSSNSSKKSYVEIV-------KSPSKDDIVSSSGQKDLSSSKSKPDNPG
         KSC++VP+G DKSGW+SFLSM+T    K       R      ++PD   SP  +  K+SY + V        S S D   SS      SSS S  D+P 
Subjt:  NKSCVMVPDGYDKSGWISFLSMLTFKEQKATQSGHAREYSNRHSTPD---SPSSNSSKKSYVEIV-------KSPSKDDIVSSSGQKDLSSSKSKPDNPG

Query:  -----------EVDFDFEWDYI-------------------------------AHLLCKTKGWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFRGIP
                      F  +W  I                               A+LLC+ KGW TVG + V+FE+W P  HA PKL+PSYGGW  FRGIP
Subjt:  -----------EVDFDFEWDYI-------------------------------AHLLCKTKGWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFRGIP

Query:  LHLWNMKTFTQVGDVCGGFVDVSKNSTRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFDAKS
        LHLWNM TF Q+G  C G + V++ +    +L EA IKV+ N+ GF+P  VRI D++G +F +++VT  +GKWL+ RN ++HGTF R+AA  +D+F+ +S
Subjt:  LHLWNMKTFTQVGDVCGGFVDVSKNSTRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFDAKS

Query:  ESFVFRGNEACTVQDVNVGSDSIIVEKTPNAPAA--------DRSFKRPS-------------PTATSMR-------------DKGK-KICTSSEEDSQL
        E F F G+EA +   ++  SD      TP+ P+A        DR+   PS              TA   +             DKGK K+    + +S L
Subjt:  ESFVFRGNEACTVQDVNVGSDSIIVEKTPNAPAA--------DRSFKRPS-------------PTATSMR-------------DKGK-KICTSSEEDSQL

Query:  DATSRKR------------------------DSHVSDKRTPKVDR----------AKPYLKPNRMKGIQINEPKVYRPKVTLMTVQKGDQYGAPINDEFM
        +    KR                         S  S ++  KV R           +P  K N+ KG+ I +P         + +   D+  A       
Subjt:  DATSRKR------------------------DSHVSDKRTPKVDR----------AKPYLKPNRMKGIQINEPKVYRPKVTLMTVQKGDQYGAPINDEFM

Query:  LTVDLGYLSPISDVPISSPEQTPSPTIELHEETPSKIAQDSLKMLLQPNAQDSGSASSGDSQNNGKQENETQARTKERSE----DQTFKRQLNKWL----
        LTVDLG L P  D   S  +   S   E+ + T +++  ++ +M +  N   + S+ +   +     + +   R KE  E     + FK+QL  WL    
Subjt:  LTVDLGYLSPISDVPISSPEQTPSPTIELHEETPSKIAQDSLKMLLQPNAQDSGSASSGDSQNNGKQENETQARTKERSE----DQTFKRQLNKWL----

Query:  ----------------------------------------------------------------------------------------------------
                                                                                                            
Subjt:  ----------------------------------------------------------------------------------------------------

Query:  ------------------------------------------------------IENKFCLVPT---------------------------------KYH
                                                              I N   + P                                    H
Subjt:  ------------------------------------------------------IENKFCLVPT---------------------------------KYH

Query:  HSRRLDRTTSNHFPILLE--NLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAHI
         +R L R+TS+HFP++ E  N  LSWGP PFR ++  + +  F   +  WW ++ Q G+PG+SFI+RLK L+  IK W+   + ++   K ++  E+  I
Subjt:  HSRRLDRTTSNHFPILLE--NLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAHI

Query:  DALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVSSEGISLGKDYQLEKESEIYQNLKSLGCN
        D  E   PL +    +RLAL+ADL+++  +E +F  Q  K LW+ +GDEN++FFH+ICS+R++R+FI E+   EG     +  +   S  +    S    
Subjt:  DALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVSSEGISLGKDYQLEKESEIYQNLKSLGCN

Query:  KSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSS
         S+  D L +E     W  +  S        F  G I   +N       P         D  PIS             +  +K+TLP+TI+ N  AFV +
Subjt:  KSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSS

Query:  HQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPR----------------------
         QITDAIL+ANE VD+W   K KG+I+KLDIEKAFD +N DFID++L  K FP  WRKWI+ CIS+V+Y V++NGRP+                      
Subjt:  HQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPR----------------------

Query:  -----------------------------------DDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPI
                                           DDILLFI D+D  L NL   L  F+++SGL IN  KS+L  VNV   +A + A+ WG     LP+
Subjt:  -----------------------------------DDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPI

Query:  SYLGALLGINPL----------KLRSGIRSW------RSGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPL
        SYLG  LG NP           K++  + +W      + GRLTLIKS LS +P Y LSVF+AP   CK I+K+ R FLW  N   E  +L+NW  V+   
Subjt:  SYLGALLGINPL----------KLRSGIRSW------RSGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPL

Query:  DSSGLGIFKTKVSNNALQFKWLWRFFQEDNALWKRLLMAKFSPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLH
        +  GLGI +  V+N AL  KWLWR+  E NALW+RL+  K+  +    IPS    S+S++PW SI    + F  N SW+L +G++I FW+  WS  G L 
Subjt:  DSSGLGIFKTKVSNNALQFKWLWRFFQEDNALWKRLLMAKFSPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLH

Query:  HSIPRFYALSNAISLKVAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALAN
         + PR +AL+    + V +AW++ +  WN   RR L   E   W    + LP P + RGS    W  +S   FS+ASA++L   +          K L  
Subjt:  HSIPRFYALSNAISLKVAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALAN

Query:  LWSA
        +W +
Subjt:  LWSA

A0A5D3CA17 LINE-1 retrotransposable element ORF2 protein (e value: 5.1e-220, %identity: 31.18)
Query:  LKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGS--IAEIFRIDGRGNKSCVMVPDGYDKSGWISFLSMLTFKE
        L E   +KSFSM +  +++ W++  FK LL T  T H+F ++R  D C+WV KT N+  +   AEIFRID +G K  ++VP+G D  GW SFL+++TF+ 
Subjt:  LKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGS--IAEIFRIDGRGNKSCVMVPDGYDKSGWISFLSMLTFKE

Query:  Q---KATQSGHAREYSNRHSTPDSPSSNSSKKSYVEIVKSPSKDD-------IVSSSGQKDLSSSKSKP----DNPGEVD-------FDFEWDYI-----
            K  +S   +E  +  S   S  S+SS+KSY +++   S+DD           S  +  SS   KP     N  E         F  +W+ I     
Subjt:  Q---KATQSGHAREYSNRHSTPDSPSSNSSKKSYVEIVKSPSKDD-------IVSSSGQKDLSSSKSKP----DNPGEVD-------FDFEWDYI-----

Query:  -------------------------AHLLCKTK---GWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFRGIPLHLWNMKTFTQVGDVCGGFVDVSKN
                                 A LLC  K   GW TVGN+ VKFE WD  +H+   ++PSYGGW++FRGIPLHLWN  TF  +G  CGGF+DV+K 
Subjt:  -------------------------AHLLCKTK---GWVTVGNFYVKFERWDPEIHAVPKLVPSYGGWVKFRGIPLHLWNMKTFTQVGDVCGGFVDVSKN

Query:  STRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFDAKSESFVFRGNEACTVQDVNVGSDSIIV
        + +   L +A IKV+ N+ GF+P ++ I D++G  F +  V P + +WLV RN +VHG+F  +AA E+D+ +  +E++ + G +A   +      D  I 
Subjt:  STRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPKVHGTFTREAALEYDEFDAKSESFVFRGNEACTVQDVNVGSDSIIV

Query:  EKTPNAPAADRSFKRPSPTATSMRDKGKKICTSSEEDSQLDATSRKRDSHVSDKRTPKVDRAK-PYLKPNRMKGIQINE---------------------
            ++ +     K+ + + +      +++    +E  +       +D     KR+ ++   K  +L P    GIQ N                      
Subjt:  EKTPNAPAADRSFKRPSPTATSMRDKGKKICTSSEEDSQLDATSRKRDSHVSDKRTPKVDRAK-PYLKPNRMKGIQINE---------------------

Query:  PKVYRPKVTLMT-----VQKGDQYGAPI-----------NDEFMLTVDLGYLSPISDVPISS--------PEQTPSPTIELHEETPSKIAQDSLKMLLQP
         K + P+    T     ++K  Q                + +  L+VD+G +SP+  +  S           QTP    +  +   +K    +L + ++ 
Subjt:  PKVYRPKVTLMT-----VQKGDQYGAPI-----------NDEFMLTVDLGYLSPISDVPISS--------PEQTPSPTIELHEETPSKIAQDSLKMLLQP

Query:  NAQDSGSASSGDSQNNGKQENETQARTKERSE-DQTFKRQLNKWLIENKFCLVPT---------------------------------------------
         A  + SAS   ++ N K      A+T    E D+ FK +L  WL EN+  L P                                              
Subjt:  NAQDSGSASSGDSQNNGKQENETQARTKERSE-DQTFKRQLNKWLIENKFCLVPT---------------------------------------------

Query:  ------------------------------KY--------------------------------------------------------------------
                                      KY                                                                    
Subjt:  ------------------------------KY--------------------------------------------------------------------

Query:  -----------------------------HHSRRLDRTTSNHFPILLENLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLS
                                     H SR L+R  S+HFPILLE+  + WGP PFR +N  ++++ F     +WWNS+ Q GFPGY+FI+ L  LS
Subjt:  -----------------------------HHSRRLDRTTSNHFPILLENLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLS

Query:  AKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVS
          IK W+   V+     K +L  EI  ID LE QG +  +  QKR++L++DL  + + + +   Q  +  W   GDEN ++FH+IC+  +R+N I  +  
Subjt:  AKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVS

Query:  SEGISLGKDYQLEK-----------------------------------------ESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDF
          G SL     + +                                         ESEI   + S    K+ G DG T+ F+K  W  LK  ++++F DF
Subjt:  SEGISLGKDYQLEK-----------------------------------------ESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDF

Query:  FDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDI
           GI+N NVN T++ALI K+    + SDYRPISLTT LY+I+AK LA R+KS LP TIAEN  AF+   QI DAIL+ANEV+D W   K KG+++KLDI
Subjt:  FDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDI

Query:  EKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPR-----------------------------------------------------
        EKAFDKI+W FID +L+ K FP  WRKWIKACIS+V Y +LLNG P+                                                     
Subjt:  EKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPR-----------------------------------------------------

Query:  ----DDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPLK----------LRSGIRSW
            DD+L+F+ D++  L+NL   L  F+++SGL  N +KS++S +N+   +  Q+A+ +G     LP++YLG  LG NP            +   +  W
Subjt:  ----DDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPLK----------LRSGIRSW

Query:  ------RSGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNA
              + GRLTL+K++LS +P Y LS FKAP S+ K+I+K  RDFLW  +  K++ +L+NWN   +P +  GLGI K K +N AL  KWLWR+  E N+
Subjt:  ------RSGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNA

Query:  LWKRLLMAKFSPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFF
        LWK+ + AK++  +   IP   + SS+ SPW +I K ++ +    SW   DG+ + FWH KW N+ PL   IPR YALSN  S  V E WD  +  WN  
Subjt:  LWKRLLMAKFSPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLKVAEAWDSSNLSWNFF

Query:  PRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALANLW
        PRR L   E  TW +   SLPR  N RG     WN +    ++VASA+ + + E   P  +   K L +LW
Subjt:  PRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALANLW

SwissProt top hits (columns: e value, %identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 1.5e-14, %identity: 25.16)
Query:  SEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAH-SLRLSDYRPISLTTVLYRILAKTLAERIKSTL
        SEI   + SL   KS G DG T EF++     L P ++ +F      GI+  +  E  + LIPK    + +  ++RPISL  +  +IL K LA RI+  +
Subjt:  SEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAH-SLRLSDYRPISLTTVLYRILAKTLAERIKSTL

Query:  PSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGY-IIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRP--------
           I  +   F+   Q    I  +  V+     +K K + II +D EKAFDKI   F+   L+  G    + K I+A     +  ++LNG+         
Subjt:  PSTIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGY-IIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRP--------

Query:  ---------------------------------------------RDDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAA
                                                      DD+++++ +      NL  ++ +F + SG  IN  KS     N      SQ+  
Subjt:  ---------------------------------------------RDDILLFIRDDDSMLDNLFYILKSFKQSSGLNINFNKSSLSSVNVEGSKASQVAA

Query:  KWGCPYLPLPISYLGALL
        +         I YLG  L
Subjt:  KWGCPYLPLPISYLGALL

P0C2F6 Putative ribonuclease H protein At1g65750 (e value: 4.2e-22, %identity: 30.15)
Query:  KLRSGIRSWR------SGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWL
        ++ S +  WR      +GRLTL K+ LS +P + +S    PQSI  ++D++ R FLW     K+  +LV W+ V +P    GLG+   K  N AL  K  
Subjt:  KLRSGIRSWR------SGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLNLVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWL

Query:  WRFFQEDNALWKRLLMAKF-----SPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNS-SWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLK
        WR  QE N+LW  +L  K+         W+ IP      S  S W SIA      + +   W   DG +IRFW D+W +  PL   +      ++  ++ 
Subjt:  WRFFQEDNALWKRLLMAKF-----SPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNS-SWELRDGNKIRFWHDKWSNSGPLHHSIPRFYALSNAISLK

Query:  VAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQP
          + W      W+ F +     T  +     +  L   + AR  D L W  +  G FSV SA  +   ++ P
Subjt:  VAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQP

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 1.6e-13, %identity: 30.96)
Query:  EIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPK-RAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLP
        EI   + SL   KS G DG + EF+++    L P +  +F+     G +  +  E  + LIPK +    ++ ++RPISL  +  +IL K LA RI+  + 
Subjt:  EIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPK-RAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLP

Query:  STIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGY-IIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPRDDILL
        + I  +   F+   Q    I  +  V+ +    K K + II LD EKAFDKI   F+  +L   G    +   IKA  S     + +NG   + I L
Subjt:  STIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGY-IIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPRDDILL

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.8e-17, %identity: 28.49)
Query:  EIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPS
        E+ Q L+ +  NKS GLDGLT+EFF+  W +L P    +  + F +G +  +     ++L+PK+     + ++RP+SL +  Y+I+AK ++ R+KS L  
Subjt:  EIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPS

Query:  TIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLN
         I  +    V    I D + +  +++ F   +      + LD EKAFD+++  ++   L    F   +  ++K   +S    V +N
Subjt:  TIAENLFAFVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLN

Q03278 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment) (e value: 4.7e-05, %identity: 26.88)
Query:  SSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSH
        ++G DG+T     ++W S+   I  +FN     G   R   ++   LIPK   ++  + +RP+S+ +V  R   + LA RI       +     AF+ + 
Subjt:  SSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFAFVSSH

Query:  QITDAILVANEVVDFWTCSKTKG-YIIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWI
         + +   + + ++      K KG YI  LD++KAFD +    I   L  K  P+  R +I
Subjt:  QITDAILVANEVVDFWTCSKTKG-YIIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWI

Arabidopsis top hits (columns: e value, %identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 1.0e-15, %identity: 25.48)
Query:  SNHFP--ILLENLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQ---
        S+H P  I+LENL        FR+ ++L     FL  +   W      G   +S    LK   A  K  K+L        +      +  +++++ Q   
Subjt:  SNHFP--ILLENLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIKSWKILYVDAIKTRKSSLATEIAHIDALEHQ---

Query:  GPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFI--------------------------------SELVSSE
         P D S+F+     R   N   +    F RQ  +  W+  GD NT FFHK+  A + +N I                                S++++ +
Subjt:  GPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFI--------------------------------SELVSSE

Query:  GISLGKDYQ--------------LEKESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSD
         +   KD                L  + EI   + ++  NK+ G D  T EFF  SW  +K S +    +FF  G + +  N T + LIPK     +LS 
Subjt:  GISLGKDYQ--------------LEKESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSD

Query:  YRPISLTTVLYRIL
        +RP+S  TV+Y+I+
Subjt:  YRPISLTTVLYRIL

AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 2.6e-06, %identity: 34.57)
Query:  LAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSK-TKGY-IIKLDIEKAFDKINWDFIDSILSFKGFPITW
        + ER+K  + + I     +F+     TD I+   E V      K  KG+ ++KLD+EKA+D+I WD+++  L   GFP  W
Subjt:  LAERIKSTLPSTIAENLFAFVSSHQITDAILVANEVVDFWTCSK-TKGY-IIKLDIEKAFDKINWDFIDSILSFKGFPITW


Sequences
CDS sequence
ATGGCTTCCTTCCGTCACCTACCGAGATCATGTGTCATCAAAAAAAAGAAATTTGTTTTGTCCTTCGAAAGTCGGACAGGCTCTAGCTTTATTCTAAAGGAAAAGGGGTC
CTACAAGTCCTTTTCGATGGTCCTACATCAAGAGTCAGTAGAATGGCTAAAGGTAACTTTCAAAACACTTTTAGCAACTCCTCGTACTAATCACTATTTTCAGCAAAAGA
GGTTCAGAGACTACTGCATTTGGGTTGAAAAAACAACAAATCAAAGGGGAAGTATAGCAGAAATTTTCCGTATAGATGGTCGAGGCAACAAAAGTTGTGTCATGGTCCCT
GACGGGTACGACAAGAGTGGTTGGATCTCTTTCTTGTCGATGCTTACTTTTAAGGAACAAAAGGCTACTCAATCAGGACATGCAAGGGAATATTCCAACAGGCATAGTAC
ACCTGACTCTCCAAGCTCAAACTCTTCAAAAAAATCTTATGTAGAGATTGTAAAGAGTCCATCGAAAGATGACATTGTGTCCTCAAGCGGTCAAAAGGACCTGTCTTCAT
CCAAAAGCAAACCTGATAATCCAGGTGAGGTTGATTTTGACTTCGAGTGGGATTATATTGCTCACCTCCTCTGCAAAACCAAGGGATGGGTAACAGTGGGCAACTTTTAT
GTGAAGTTTGAAAGATGGGATCCGGAGATTCACGCAGTTCCAAAGCTTGTTCCGAGCTACGGTGGATGGGTTAAGTTTAGAGGAATCCCTTTGCATTTATGGAATATGAA
AACTTTTACTCAGGTGGGAGATGTCTGCGGTGGATTTGTTGATGTATCAAAGAACTCCACTCGAAAGCTTGATCTGTACGAGGCAGTCATCAAGGTAAAAGATAATTTCT
GCGGTTTCATCCCAACAACAGTTCGGATCGCAGATGACAAAGGGGGCCAATTTTCGATCAGAATAGTTACGCCAGAAAAAGGAAAATGGCTAGTTTGTCGAAACCCTAAA
GTTCATGGAACGTTTACTCGTGAGGCGGCTTTGGAATATGACGAATTCGACGCCAAATCAGAATCATTTGTGTTTAGGGGAAATGAGGCATGCACAGTGCAGGACGTAAA
TGTTGGATCAGATTCCATAATTGTCGAAAAGACGCCCAACGCTCCAGCCGCTGATCGTTCCTTCAAGCGACCTTCTCCAACTGCCACGTCTATGAGAGATAAAGGGAAGA
AAATCTGCACGAGTTCTGAGGAAGATAGTCAGCTAGATGCCACATCAAGGAAAAGGGACTCACATGTGTCGGATAAGCGTACGCCAAAGGTGGATCGAGCCAAGCCCTAC
TTAAAGCCCAATCGGATGAAAGGAATTCAGATCAACGAACCCAAAGTCTATAGGCCCAAGGTTACTCTAATGACAGTCCAGAAAGGAGATCAATATGGGGCCCCAATTAA
CGATGAATTCATGCTAACAGTTGACCTGGGCTACCTTTCCCCAATATCAGACGTCCCTATTTCAAGCCCAGAACAGACACCATCCCCAACGATAGAGTTGCATGAAGAAA
CTCCATCAAAGATAGCTCAAGACAGTCTGAAGATGCTTCTCCAACCCAATGCTCAAGACTCAGGAAGTGCCTCTAGCGGGGACTCACAGAACAATGGAAAACAAGAAAAT
GAGACCCAAGCCCGAACCAAAGAAAGATCTGAAGATCAGACCTTCAAAAGGCAGCTGAACAAATGGCTAATAGAGAATAAATTTTGCCTTGTCCCCACAAAATATCACCA
CTCTAGAAGGTTGGATCGCACCACGTCAAATCACTTCCCTATCCTTCTTGAGAATCTGGCCTTGTCATGGGGCCCTTCTCCTTTTAGATTTGACAATTACCTTATTAAGG
AAAGACCCTTTCTTAGTCAAATTGACTCTTGGTGGAACTCCACCTATCAAGATGGTTTCCCTGGCTACTCCTTTATTAGAAGACTCAAGCAACTTTCAGCAAAAATAAAA
TCATGGAAAATTCTGTATGTGGATGCTATAAAGACTAGAAAGAGCTCCCTTGCAACTGAGATAGCGCATATTGATGCTCTGGAGCATCAAGGCCCCCTAGACGAGAGTAT
GTTTCAGAAACGTCTGGCTCTTCGTGCAGATTTAAATCAAGTAGTTAGTCAAGAGCTGAGATTCTTAAGGCAATGCTACAAAAACCTCTGGATTAATCAAGGAGATGAAA
ACACCAATTTCTTCCATAAAATTTGTTCAGCTCGTAAGAGAAGAAATTTTATCTCGGAGTTGGTTTCCTCGGAAGGCATAAGTCTCGGAAAGGACTATCAACTAGAAAAA
GAGAGCGAAATTTATCAAAATCTTAAATCCTTGGGATGCAATAAATCATCGGGTCTAGACGGCCTCACAGTCGAATTCTTCAAAAGTTCGTGGACCTCTCTCAAGCCTTC
TATTATGGACATATTCAATGACTTCTTCGATAGGGGCATTATTAACAGAAACGTCAATGAGACCTACGTCGCCTTGATCCCTAAACGAGCCCACTCCCTCAGGCTATCAG
ATTATAGGCCAATTAGCTTAACCACGGTCCTATACCGCATCCTTGCTAAGACTCTAGCTGAAAGGATTAAAAGCACTCTCCCATCAACCATCGCGGAGAATCTGTTTGCC
TTTGTCAGCAGCCATCAAATCACAGATGCTATCCTCGTGGCAAATGAGGTTGTTGATTTTTGGACTTGCTCAAAGACTAAGGGTTATATTATAAAGCTTGACATCGAAAA
AGCGTTTGATAAGATCAACTGGGATTTTATTGATAGTATTCTCTCTTTCAAAGGCTTCCCGATCACGTGGCGAAAGTGGATCAAGGCATGTATATCTTCAGTCTCCTACT
TCGTTTTGCTAAATGGAAGGCCGAGAGATGACATTCTTCTGTTCATCAGGGATGATGATTCCATGCTCGATAATCTCTTTTATATTCTTAAATCTTTCAAGCAGTCGTCT
GGCCTCAACATCAACTTTAACAAATCTTCTCTCTCTAGTGTCAATGTGGAAGGCTCTAAGGCTTCCCAGGTAGCTGCCAAATGGGGATGCCCTTATCTGCCGCTCCCTAT
CTCTTATTTGGGTGCCCTCTTGGGGATAAACCCTCTAAAGCTTCGTTCTGGAATCCGGTCGTGGAGAAGTGGCAGGTTAACCCTCATCAAATCTGCCCTTAGCTTCATCC
CGAACTACATGCTTTCTGTCTTCAAAGCTCCCCAATCCATTTGCAAAAAGATCGATAAGATAATTAGAGATTTCCTTTGGTCGGACAATCGAGCGAAAGAGAGCCTCAAT
TTGGTCAATTGGAACACGGTAGCTGCCCCTCTTGACTCTAGTGGGCTTGGTATCTTCAAGACTAAGGTATCCAACAACGCTCTTCAATTCAAATGGCTTTGGAGATTTTT
CCAGGAAGACAATGCCCTGTGGAAGCGTCTTCTCATGGCGAAATTCTCGCCCCAGAACTGGGTGGCTATCCCTTCACAAGCAAAGTTCTCTAGCTCGAGGTCCCCGTGGC
TCTCCATTGCAAAACAAAGGAACAAATTTATAGACAATTCTTCTTGGGAGCTTAGGGATGGCAACAAAATTCGCTTCTGGCACGATAAATGGTCAAATTCGGGACCTTTG
CACCACTCAATCCCCCGCTTTTATGCCCTTTCTAATGCCATCTCCTTAAAGGTGGCAGAAGCATGGGATAGTTCCAACCTTTCTTGGAATTTCTTTCCGAGAAGAGCCTT
GTTGGCCACAGAAACCTCAACATGGTCTGCTTTTTCGGATAGTCTTCCCAGGCCTTCAAATGCTCGTGGCTCAGATCTGTTAAGGTGGAATCATAACTCTAAGGGAGTTT
TTTCTGTTGCATCTGCTCGATTACTCTTTTGGTCCGAAGATCAGCCTCCATCAGCTTCCTTGAATCCCAAAGCCCTGGCCAATCTTTGGAGTGCGGATCAACACAACAGA
CAGATTGCAATCAATTTTCTCCAACATGGCTTTAAGTCCAAGCTGTTGTGTGCTATGTTCAACAACTATTAA
mRNA sequence
ATGGCTTCCTTCCGTCACCTACCGAGATCATGTGTCATCAAAAAAAAGAAATTTGTTTTGTCCTTCGAAAGTCGGACAGGCTCTAGCTTTATTCTAAAGGAAAAGGGGTC
CTACAAGTCCTTTTCGATGGTCCTACATCAAGAGTCAGTAGAATGGCTAAAGGTAACTTTCAAAACACTTTTAGCAACTCCTCGTACTAATCACTATTTTCAGCAAAAGA
GGTTCAGAGACTACTGCATTTGGGTTGAAAAAACAACAAATCAAAGGGGAAGTATAGCAGAAATTTTCCGTATAGATGGTCGAGGCAACAAAAGTTGTGTCATGGTCCCT
GACGGGTACGACAAGAGTGGTTGGATCTCTTTCTTGTCGATGCTTACTTTTAAGGAACAAAAGGCTACTCAATCAGGACATGCAAGGGAATATTCCAACAGGCATAGTAC
ACCTGACTCTCCAAGCTCAAACTCTTCAAAAAAATCTTATGTAGAGATTGTAAAGAGTCCATCGAAAGATGACATTGTGTCCTCAAGCGGTCAAAAGGACCTGTCTTCAT
CCAAAAGCAAACCTGATAATCCAGGTGAGGTTGATTTTGACTTCGAGTGGGATTATATTGCTCACCTCCTCTGCAAAACCAAGGGATGGGTAACAGTGGGCAACTTTTAT
GTGAAGTTTGAAAGATGGGATCCGGAGATTCACGCAGTTCCAAAGCTTGTTCCGAGCTACGGTGGATGGGTTAAGTTTAGAGGAATCCCTTTGCATTTATGGAATATGAA
AACTTTTACTCAGGTGGGAGATGTCTGCGGTGGATTTGTTGATGTATCAAAGAACTCCACTCGAAAGCTTGATCTGTACGAGGCAGTCATCAAGGTAAAAGATAATTTCT
GCGGTTTCATCCCAACAACAGTTCGGATCGCAGATGACAAAGGGGGCCAATTTTCGATCAGAATAGTTACGCCAGAAAAAGGAAAATGGCTAGTTTGTCGAAACCCTAAA
GTTCATGGAACGTTTACTCGTGAGGCGGCTTTGGAATATGACGAATTCGACGCCAAATCAGAATCATTTGTGTTTAGGGGAAATGAGGCATGCACAGTGCAGGACGTAAA
TGTTGGATCAGATTCCATAATTGTCGAAAAGACGCCCAACGCTCCAGCCGCTGATCGTTCCTTCAAGCGACCTTCTCCAACTGCCACGTCTATGAGAGATAAAGGGAAGA
AAATCTGCACGAGTTCTGAGGAAGATAGTCAGCTAGATGCCACATCAAGGAAAAGGGACTCACATGTGTCGGATAAGCGTACGCCAAAGGTGGATCGAGCCAAGCCCTAC
TTAAAGCCCAATCGGATGAAAGGAATTCAGATCAACGAACCCAAAGTCTATAGGCCCAAGGTTACTCTAATGACAGTCCAGAAAGGAGATCAATATGGGGCCCCAATTAA
CGATGAATTCATGCTAACAGTTGACCTGGGCTACCTTTCCCCAATATCAGACGTCCCTATTTCAAGCCCAGAACAGACACCATCCCCAACGATAGAGTTGCATGAAGAAA
CTCCATCAAAGATAGCTCAAGACAGTCTGAAGATGCTTCTCCAACCCAATGCTCAAGACTCAGGAAGTGCCTCTAGCGGGGACTCACAGAACAATGGAAAACAAGAAAAT
GAGACCCAAGCCCGAACCAAAGAAAGATCTGAAGATCAGACCTTCAAAAGGCAGCTGAACAAATGGCTAATAGAGAATAAATTTTGCCTTGTCCCCACAAAATATCACCA
CTCTAGAAGGTTGGATCGCACCACGTCAAATCACTTCCCTATCCTTCTTGAGAATCTGGCCTTGTCATGGGGCCCTTCTCCTTTTAGATTTGACAATTACCTTATTAAGG
AAAGACCCTTTCTTAGTCAAATTGACTCTTGGTGGAACTCCACCTATCAAGATGGTTTCCCTGGCTACTCCTTTATTAGAAGACTCAAGCAACTTTCAGCAAAAATAAAA
TCATGGAAAATTCTGTATGTGGATGCTATAAAGACTAGAAAGAGCTCCCTTGCAACTGAGATAGCGCATATTGATGCTCTGGAGCATCAAGGCCCCCTAGACGAGAGTAT
GTTTCAGAAACGTCTGGCTCTTCGTGCAGATTTAAATCAAGTAGTTAGTCAAGAGCTGAGATTCTTAAGGCAATGCTACAAAAACCTCTGGATTAATCAAGGAGATGAAA
ACACCAATTTCTTCCATAAAATTTGTTCAGCTCGTAAGAGAAGAAATTTTATCTCGGAGTTGGTTTCCTCGGAAGGCATAAGTCTCGGAAAGGACTATCAACTAGAAAAA
GAGAGCGAAATTTATCAAAATCTTAAATCCTTGGGATGCAATAAATCATCGGGTCTAGACGGCCTCACAGTCGAATTCTTCAAAAGTTCGTGGACCTCTCTCAAGCCTTC
TATTATGGACATATTCAATGACTTCTTCGATAGGGGCATTATTAACAGAAACGTCAATGAGACCTACGTCGCCTTGATCCCTAAACGAGCCCACTCCCTCAGGCTATCAG
ATTATAGGCCAATTAGCTTAACCACGGTCCTATACCGCATCCTTGCTAAGACTCTAGCTGAAAGGATTAAAAGCACTCTCCCATCAACCATCGCGGAGAATCTGTTTGCC
TTTGTCAGCAGCCATCAAATCACAGATGCTATCCTCGTGGCAAATGAGGTTGTTGATTTTTGGACTTGCTCAAAGACTAAGGGTTATATTATAAAGCTTGACATCGAAAA
AGCGTTTGATAAGATCAACTGGGATTTTATTGATAGTATTCTCTCTTTCAAAGGCTTCCCGATCACGTGGCGAAAGTGGATCAAGGCATGTATATCTTCAGTCTCCTACT
TCGTTTTGCTAAATGGAAGGCCGAGAGATGACATTCTTCTGTTCATCAGGGATGATGATTCCATGCTCGATAATCTCTTTTATATTCTTAAATCTTTCAAGCAGTCGTCT
GGCCTCAACATCAACTTTAACAAATCTTCTCTCTCTAGTGTCAATGTGGAAGGCTCTAAGGCTTCCCAGGTAGCTGCCAAATGGGGATGCCCTTATCTGCCGCTCCCTAT
CTCTTATTTGGGTGCCCTCTTGGGGATAAACCCTCTAAAGCTTCGTTCTGGAATCCGGTCGTGGAGAAGTGGCAGGTTAACCCTCATCAAATCTGCCCTTAGCTTCATCC
CGAACTACATGCTTTCTGTCTTCAAAGCTCCCCAATCCATTTGCAAAAAGATCGATAAGATAATTAGAGATTTCCTTTGGTCGGACAATCGAGCGAAAGAGAGCCTCAAT
TTGGTCAATTGGAACACGGTAGCTGCCCCTCTTGACTCTAGTGGGCTTGGTATCTTCAAGACTAAGGTATCCAACAACGCTCTTCAATTCAAATGGCTTTGGAGATTTTT
CCAGGAAGACAATGCCCTGTGGAAGCGTCTTCTCATGGCGAAATTCTCGCCCCAGAACTGGGTGGCTATCCCTTCACAAGCAAAGTTCTCTAGCTCGAGGTCCCCGTGGC
TCTCCATTGCAAAACAAAGGAACAAATTTATAGACAATTCTTCTTGGGAGCTTAGGGATGGCAACAAAATTCGCTTCTGGCACGATAAATGGTCAAATTCGGGACCTTTG
CACCACTCAATCCCCCGCTTTTATGCCCTTTCTAATGCCATCTCCTTAAAGGTGGCAGAAGCATGGGATAGTTCCAACCTTTCTTGGAATTTCTTTCCGAGAAGAGCCTT
GTTGGCCACAGAAACCTCAACATGGTCTGCTTTTTCGGATAGTCTTCCCAGGCCTTCAAATGCTCGTGGCTCAGATCTGTTAAGGTGGAATCATAACTCTAAGGGAGTTT
TTTCTGTTGCATCTGCTCGATTACTCTTTTGGTCCGAAGATCAGCCTCCATCAGCTTCCTTGAATCCCAAAGCCCTGGCCAATCTTTGGAGTGCGGATCAACACAACAGA
CAGATTGCAATCAATTTTCTCCAACATGGCTTTAAGTCCAAGCTGTTGTGTGCTATGTTCAACAACTATTAA
Protein sequence
MASFRHLPRSCVIKKKKFVLSFESRTGSSFILKEKGSYKSFSMVLHQESVEWLKVTFKTLLATPRTNHYFQQKRFRDYCIWVEKTTNQRGSIAEIFRIDGRGNKSCVMVP
DGYDKSGWISFLSMLTFKEQKATQSGHAREYSNRHSTPDSPSSNSSKKSYVEIVKSPSKDDIVSSSGQKDLSSSKSKPDNPGEVDFDFEWDYIAHLLCKTKGWVTVGNFY
VKFERWDPEIHAVPKLVPSYGGWVKFRGIPLHLWNMKTFTQVGDVCGGFVDVSKNSTRKLDLYEAVIKVKDNFCGFIPTTVRIADDKGGQFSIRIVTPEKGKWLVCRNPK
VHGTFTREAALEYDEFDAKSESFVFRGNEACTVQDVNVGSDSIIVEKTPNAPAADRSFKRPSPTATSMRDKGKKICTSSEEDSQLDATSRKRDSHVSDKRTPKVDRAKPY
LKPNRMKGIQINEPKVYRPKVTLMTVQKGDQYGAPINDEFMLTVDLGYLSPISDVPISSPEQTPSPTIELHEETPSKIAQDSLKMLLQPNAQDSGSASSGDSQNNGKQEN
ETQARTKERSEDQTFKRQLNKWLIENKFCLVPTKYHHSRRLDRTTSNHFPILLENLALSWGPSPFRFDNYLIKERPFLSQIDSWWNSTYQDGFPGYSFIRRLKQLSAKIK
SWKILYVDAIKTRKSSLATEIAHIDALEHQGPLDESMFQKRLALRADLNQVVSQELRFLRQCYKNLWINQGDENTNFFHKICSARKRRNFISELVSSEGISLGKDYQLEK
ESEIYQNLKSLGCNKSSGLDGLTVEFFKSSWTSLKPSIMDIFNDFFDRGIINRNVNETYVALIPKRAHSLRLSDYRPISLTTVLYRILAKTLAERIKSTLPSTIAENLFA
FVSSHQITDAILVANEVVDFWTCSKTKGYIIKLDIEKAFDKINWDFIDSILSFKGFPITWRKWIKACISSVSYFVLLNGRPRDDILLFIRDDDSMLDNLFYILKSFKQSS
GLNINFNKSSLSSVNVEGSKASQVAAKWGCPYLPLPISYLGALLGINPLKLRSGIRSWRSGRLTLIKSALSFIPNYMLSVFKAPQSICKKIDKIIRDFLWSDNRAKESLN
LVNWNTVAAPLDSSGLGIFKTKVSNNALQFKWLWRFFQEDNALWKRLLMAKFSPQNWVAIPSQAKFSSSRSPWLSIAKQRNKFIDNSSWELRDGNKIRFWHDKWSNSGPL
HHSIPRFYALSNAISLKVAEAWDSSNLSWNFFPRRALLATETSTWSAFSDSLPRPSNARGSDLLRWNHNSKGVFSVASARLLFWSEDQPPSASLNPKALANLWSADQHNR
QIAINFLQHGFKSKLLCAMFNNY