; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G11480 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G11480
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionLINE-1 retrotransposable element ORF2 protein
Genome locationChr7:9729866..9733009
RNA-Seq ExpressionCSPI07G11480
SyntenyCSPI07G11480
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0039770.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]4.7e-19340.48Show/hide
Query:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV
        M  FN FI+  + IDPPL+NAKFTWSNLR  P+LSR+D FLYT NWE LF  H+SK L RVTSDHF I LE + + WG SPF+  N  LKE  FK N+  
Subjt:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV

Query:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE
        WWKN  Q GHPG+SFM +LKQLS  I+N  +  K  ++E+    + E++ ID LE + N++     RR  +K D+    FKEAQIW QK KRLW  EGDE
Subjt:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE

Query:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT--------MLSTL--SPKINLQVQ----------------------------
        N++F+HKICSARQRRS IS+I++  GV CS++  I K  +DHF   +         ++  L  SP    Q Q                            
Subjt:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT--------MLSTL--SPKINLQVQ----------------------------

Query:  -----------------MALLRSF-------------------SKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAI
                         + + R F                    KEK     DYRPISLTT +YKLIAKVIAER K  LP  ++ENQ+AF +GRQI DAI
Subjt:  -----------------MALLRSF-------------------SKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAI

Query:  LIANEAVDFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLST
        L+ANEA+D+W+ KK++GFV+KLDIEK FDK+NW+FIDFML+KK +P KWR WI ACISSVQYSI+IN RPRGKI+P+RGIRQGD ISPFIFVLAMDY+S 
Subjt:  LIANEAVDFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLST

Query:  LLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPL
        LLN + ++  IKGV   G  NLTH+LFADDILLF+EDD+ +I N++    LF+LASGL+INLNKSTISPIN DA RT  +A++WGIS  F+PI YLGVPL
Subjt:  LLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPL

Query:  GGKPSSRNFWAN-----------------SAAPKL--------------------------------------------------------------LGC
        GGK  ++ FW N                 S   K+                                                              LG 
Subjt:  GGKPSSRNFWAN-----------------SAAPKL--------------------------------------------------------------LGC

Query:  LSLKGSTGSF---------------------HRSNGTLKGATPC--------------------------------------------------------
          LK +  +                       +     KG  PC                                                        
Subjt:  LSLKGSTGSF---------------------HRSNGTLKGATPC--------------------------------------------------------

Query:  ------PSSIVDG-----MILDLCPRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIK--LARPLNN----QAEVNYND-------
               SSI D      M  DL PRR LR  E  LW ++K SL  S    G D P W LN+NG +TVAS+K  L +P  N    Q++  + +       
Subjt:  ------PSSIVDG-----MILDLCPRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIK--LARPLNN----QAEVNYND-------

Query:  ---------------------GRRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVA
                              +RLP    +PSWC +CK  +EDR HLF LCP +  +W+ +   L   +  ++   LC  +   K KTKK  I  +  A
Subjt:  ---------------------GRRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVA

Query:  ATLWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL
        + LWNIW ERN RIF GKEK V  +W+DI+A   L
Subjt:  ATLWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]5.8e-19139.01Show/hide
Query:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV
        M  FN+FIS C+ IDPPL+NAK+TWSNLRAQ  LSRLD FL+T  WE +F  H SK+L R TSDHF I LE +++ WG SPFRFTN++LK+  +K+NIE 
Subjt:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV

Query:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE
        WW NT+Q G+ GYSFM RLKQL+  IK W +  K  NE   +  + E++ ID LE + + T +H  +R ++K DL Q+   EAQIWAQKCKR+W  EGDE
Subjt:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE

Query:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT-------MLSTLS--PKINLQVQM-----------ALLRSFSKEKS-----Y
        NS+F+HKIC+ARQ++  IS I    G  C +D DI    I HF   +T        +  L   P  N+  ++             L+SF+K K+     Y
Subjt:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT-------MLSTLS--PKINLQVQM-----------ALLRSFSKEKS-----Y

Query:  VL------------------------------------------------LDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAIL
         +                                                 D+RPISLTT +YKLIAK +A+R K  LP+ ISE+Q+AF +GRQIT+AIL
Subjt:  VL------------------------------------------------LDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAIL

Query:  IANEAVDFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTL
        IANEA+DFW+ KK +GFV+KLDIEK FDK+NW+FIDF+L+KK++ QKWRK I +CISSVQYSILIN RPRG+IKP+RGIRQGD +SPFIFVLAMDYLS L
Subjt:  IANEAVDFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTL

Query:  LNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPLG
        LN++  +  I GV F+   NLTHILFADDIL+F+ED D  + N++    LFE ASGLNINL+KSTI PIN    R   +A  WGIS   +P  YLG+PLG
Subjt:  LNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPLG

Query:  GKPSSRNFWAN----------------------------------------------------------------------------SAAPKLLGCLSLK
        G+PSS NFW N                                                                              +PK  G L + 
Subjt:  GKPSSRNFWAN----------------------------------------------------------------------------SAAPKLLGCLSLK

Query:  G-----------------------------------------STGSFHRSNGTLKGATPCPS--------SIVDGMILDL--------------CPR---
                                                  S G F  +N   K  T C S         + DG  +                 PR   
Subjt:  G-----------------------------------------STGSFHRSNGTLKGATPCPS--------SIVDGMILDL--------------CPR---

Query:  -------------------------RPLRSVEEALWGDMKASLPS-LPAFGFDIPRWNLNNNGSFTVASIKLARPLNNQAEVNYNDG-------------
                                 RPLR  EE LW ++KASLP+ LP  G   P WNLN+N  F  AS+K A      +  N++               
Subjt:  -------------------------RPLRSVEEALWGDMKASLPS-LPAFGFDIPRWNLNNNGSFTVASIKLARPLNNQAEVNYNDG-------------

Query:  --------------------RRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVAAT
                            +RLP W + P+WC +C  ++ED NHLF  CP+S +LW K + +L+    P +   L + I     + +K  I+ +  A  
Subjt:  --------------------RRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVAAT

Query:  LWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL
        LW IW ERN RIFK +EK    +W+D  A   L
Subjt:  LWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL

KAA0039966.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.0e-18847.2Show/hide
Query:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV
        M  FN FIS  + IDPPL+NAKFTWSNLR QP+LSR+D FLYT NWE LF  H+SK L RVTSDHF IALE + + WG SPF+F N  LKE  FK+N+ +
Subjt:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV

Query:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE
        WWKN  Q GHPG+SFM +LKQLS  I++  K  K  N+EE +  + E+++ID LE + N +     RR  +K D+    FKEAQIW QK KRLW  EGDE
Subjt:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE

Query:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT--------MLSTL--SPKINLQVQ-----------MALLRSFSKEKSYVLLD
        N++F+HKICSARQRRS IS+I++  GV CS++  I K  +DHF   +         ++  L  SP    Q Q            A L +FS  KS     
Subjt:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT--------MLSTL--SPKINLQVQ-----------MALLRSFSKEKSYVLLD

Query:  YRPISLTTGLYKLIAKVIAE----------------------RRKLVLPEIISENQLAF-RGRQITDAILIANEAVDFWKKKKVKGFVVKLDIEKTFDKI
          P   T   YK    V+ E                         L+  + ++ENQ+ F +GRQI DAIL+ANEA+D+W+ KK++GFV+KLDIEK FDK+
Subjt:  YRPISLTTGLYKLIAKVIAE----------------------RRKLVLPEIISENQLAF-RGRQITDAILIANEAVDFWKKKKVKGFVVKLDIEKTFDKI

Query:  NWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKHNLTHILFADDI
        NW+FIDFML+KK +P +WRKWI ACISSVQYSI+IN RPRGKI+P+RGIRQGD ISPFIFVLAMDY+S LLN + ++  IKGV   G  NLTH+LFADDI
Subjt:  NWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKHNLTHILFADDI

Query:  LLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPLGGKPSSRNFWAN---SAAPKLLG---CLSLK
        LLF+EDD+ +I N++    LF+LASGL+INLNKSTISPIN  A RT  +A++WGIS  F+PI YLGVPLGGK ++++FW N       KL      +  K
Subjt:  LLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPLGGKPSSRNFWAN---SAAPKLLG---CLSLK

Query:  GST---GSFHRSNGT-----LKGATPCPSSIVDG-----MILDLCPRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIKLARPLNN
        G+    GS +  + T       G +   SSI D      M  DL PRR +R  E  LW ++K SL  S    G D P W LN++G +TVAS+K A    +
Subjt:  GST---GSFHRSNGT-----LKGATPCPSSIVDG-----MILDLCPRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIKLARPLNN

Query:  QAEVNYNDGRRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVAATLWNIWNERNRR
        Q+ ++            K  W T        +  +F +           E +  R         LC      K KTKK  I  +  A+ LWNIW ERN R
Subjt:  QAEVNYNDGRRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVAATLWNIWNERNRR

Query:  IFKGKEKLVGSVWKDIQATTDL
        IF GKEK V  +W+DI+A   L
Subjt:  IFKGKEKLVGSVWKDIQATTDL

TYK06777.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa]2.3e-19541.44Show/hide
Query:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV
        M  FN FI+  + IDPPL+NAKFTWSNLR  P+LSR+D FLYT NWE LF  H+SK L RVTSDHF I LE + + WG SPF+  N  LKE  FK NI  
Subjt:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV

Query:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE
        WWKN  Q GHPG+SFM +LKQLS  I+N  +  K  ++E+    + E++ ID LE + N++     RR  +K D+    FKEAQIW QK KRLW  EGDE
Subjt:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE

Query:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRF--------------------------------------------------TML
        N++F+HKICSARQRRS IS+I++A GV CS++  I K  +DHF   +                                                  T  
Subjt:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRF--------------------------------------------------TML

Query:  STLSPKINLQVQMALLRSFSKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAILIANEAVDFWKKKKVKGFVVKLDI
         T+S  IN+   +AL+    KEK     DYRPISLTT +YKLIAKVIAER K  LP  ++ENQ+AF + RQI DAIL+ANEA+D+W+ KK++GFV+KLDI
Subjt:  STLSPKINLQVQMALLRSFSKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAILIANEAVDFWKKKKVKGFVVKLDI

Query:  EKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKHNLTH
        EK FDK+NW+FIDFML+KK +P KWR WI ACISSVQYSI+IN RPRGKI+P+RGIRQGD ISPFIFVLAMDY+S LLN + ++  IKGV   G  NLTH
Subjt:  EKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKHNLTH

Query:  ILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPLGGKPSSRNFWAN------------
        +LFADDILLF+EDD+ +I N++    LF+LASGL+INLNKSTISPIN DA RT  +A++WGIS  F+PI YLGVPLGGK +++ FW N            
Subjt:  ILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPLGGKPSSRNFWAN------------

Query:  -----SAAPKL--------------------------------------------------------------LGCLSLKGSTGSF--------------
             S   K+                                                              LG   LK +  +               
Subjt:  -----SAAPKL--------------------------------------------------------------LGCLSLKGSTGSF--------------

Query:  -------HRSNGTLKGATPC--------------------------------------------------------------PSSIVDG-----MILDLC
                +     KG  PC                                                               SSI D      M  DL 
Subjt:  -------HRSNGTLKGATPC--------------------------------------------------------------PSSIVDG-----MILDLC

Query:  PRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIK--LARP----LNNQAEVNYND----------------------------GRR
        PRR LR  E  LW ++K S+  S    G D P W LN+NG +TVAS+K  L +P    L+ Q++  + +                             +R
Subjt:  PRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIK--LARP----LNNQAEVNYND----------------------------GRR

Query:  LPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVAATLWNIWNERNRRIFKGKEKLVGS
        LP    +PSWC +CK  +EDR HLF LCP +  +W+ +   L+  +  ++   LC  +   K KTKK  I  +  A+ LWNIW ERN RIF GKEK V  
Subjt:  LPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVAATLWNIWNERNRRIFKGKEKLVGS

Query:  VWKDIQATTDL
        +W+DI+A   L
Subjt:  VWKDIQATTDL

XP_016902461.1 PREDICTED: LINE-1 retrotransposable element ORF2 protein [Cucumis melo]4.7e-19340.48Show/hide
Query:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV
        M  FN FI+  + IDPPL+NAKFTWSNLR  P+LSR+D FLYT NWE LF  H+SK L RVTSDHF I LE + + WG SPF+  N  LKE  FK N+  
Subjt:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV

Query:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE
        WWKN  Q GHPG+SFM +LKQLS  I+N  +  K  ++E+    + E++ ID LE + N++     RR  +K D+    FKEAQIW QK KRLW  EGDE
Subjt:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE

Query:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT--------MLSTL--SPKINLQVQ----------------------------
        N++F+HKICSARQRRS IS+I++  GV CS++  I K  +DHF   +         ++  L  SP    Q Q                            
Subjt:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT--------MLSTL--SPKINLQVQ----------------------------

Query:  -----------------MALLRSF-------------------SKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAI
                         + + R F                    KEK     DYRPISLTT +YKLIAKVIAER K  LP  ++ENQ+AF +GRQI DAI
Subjt:  -----------------MALLRSF-------------------SKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAI

Query:  LIANEAVDFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLST
        L+ANEA+D+W+ KK++GFV+KLDIEK FDK+NW+FIDFML+KK +P KWR WI ACISSVQYSI+IN RPRGKI+P+RGIRQGD ISPFIFVLAMDY+S 
Subjt:  LIANEAVDFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLST

Query:  LLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPL
        LLN + ++  IKGV   G  NLTH+LFADDILLF+EDD+ +I N++    LF+LASGL+INLNKSTISPIN DA RT  +A++WGIS  F+PI YLGVPL
Subjt:  LLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPL

Query:  GGKPSSRNFWAN-----------------SAAPKL--------------------------------------------------------------LGC
        GGK  ++ FW N                 S   K+                                                              LG 
Subjt:  GGKPSSRNFWAN-----------------SAAPKL--------------------------------------------------------------LGC

Query:  LSLKGSTGSF---------------------HRSNGTLKGATPC--------------------------------------------------------
          LK +  +                       +     KG  PC                                                        
Subjt:  LSLKGSTGSF---------------------HRSNGTLKGATPC--------------------------------------------------------

Query:  ------PSSIVDG-----MILDLCPRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIK--LARPLNN----QAEVNYND-------
               SSI D      M  DL PRR LR  E  LW ++K SL  S    G D P W LN+NG +TVAS+K  L +P  N    Q++  + +       
Subjt:  ------PSSIVDG-----MILDLCPRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIK--LARPLNN----QAEVNYND-------

Query:  ---------------------GRRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVA
                              +RLP    +PSWC +CK  +EDR HLF LCP +  +W+ +   L   +  ++   LC  +   K KTKK  I  +  A
Subjt:  ---------------------GRRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVA

Query:  ATLWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL
        + LWNIW ERN RIF GKEK V  +W+DI+A   L
Subjt:  ATLWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL

TrEMBL top hitse value%identityAlignment
A0A1S4E2K5 LINE-1 retrotransposable element ORF2 protein2.3e-19340.48Show/hide
Query:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV
        M  FN FI+  + IDPPL+NAKFTWSNLR  P+LSR+D FLYT NWE LF  H+SK L RVTSDHF I LE + + WG SPF+  N  LKE  FK N+  
Subjt:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV

Query:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE
        WWKN  Q GHPG+SFM +LKQLS  I+N  +  K  ++E+    + E++ ID LE + N++     RR  +K D+    FKEAQIW QK KRLW  EGDE
Subjt:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE

Query:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT--------MLSTL--SPKINLQVQ----------------------------
        N++F+HKICSARQRRS IS+I++  GV CS++  I K  +DHF   +         ++  L  SP    Q Q                            
Subjt:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT--------MLSTL--SPKINLQVQ----------------------------

Query:  -----------------MALLRSF-------------------SKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAI
                         + + R F                    KEK     DYRPISLTT +YKLIAKVIAER K  LP  ++ENQ+AF +GRQI DAI
Subjt:  -----------------MALLRSF-------------------SKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAI

Query:  LIANEAVDFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLST
        L+ANEA+D+W+ KK++GFV+KLDIEK FDK+NW+FIDFML+KK +P KWR WI ACISSVQYSI+IN RPRGKI+P+RGIRQGD ISPFIFVLAMDY+S 
Subjt:  LIANEAVDFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLST

Query:  LLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPL
        LLN + ++  IKGV   G  NLTH+LFADDILLF+EDD+ +I N++    LF+LASGL+INLNKSTISPIN DA RT  +A++WGIS  F+PI YLGVPL
Subjt:  LLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPL

Query:  GGKPSSRNFWAN-----------------SAAPKL--------------------------------------------------------------LGC
        GGK  ++ FW N                 S   K+                                                              LG 
Subjt:  GGKPSSRNFWAN-----------------SAAPKL--------------------------------------------------------------LGC

Query:  LSLKGSTGSF---------------------HRSNGTLKGATPC--------------------------------------------------------
          LK +  +                       +     KG  PC                                                        
Subjt:  LSLKGSTGSF---------------------HRSNGTLKGATPC--------------------------------------------------------

Query:  ------PSSIVDG-----MILDLCPRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIK--LARPLNN----QAEVNYND-------
               SSI D      M  DL PRR LR  E  LW ++K SL  S    G D P W LN+NG +TVAS+K  L +P  N    Q++  + +       
Subjt:  ------PSSIVDG-----MILDLCPRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIK--LARPLNN----QAEVNYND-------

Query:  ---------------------GRRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVA
                              +RLP    +PSWC +CK  +EDR HLF LCP +  +W+ +   L   +  ++   LC  +   K KTKK  I  +  A
Subjt:  ---------------------GRRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVA

Query:  ATLWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL
        + LWNIW ERN RIF GKEK V  +W+DI+A   L
Subjt:  ATLWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL

A0A5A7T9I7 LINE-1 retrotransposable element ORF2 protein2.8e-19139.01Show/hide
Query:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV
        M  FN+FIS C+ IDPPL+NAK+TWSNLRAQ  LSRLD FL+T  WE +F  H SK+L R TSDHF I LE +++ WG SPFRFTN++LK+  +K+NIE 
Subjt:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV

Query:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE
        WW NT+Q G+ GYSFM RLKQL+  IK W +  K  NE   +  + E++ ID LE + + T +H  +R ++K DL Q+   EAQIWAQKCKR+W  EGDE
Subjt:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE

Query:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT-------MLSTLS--PKINLQVQM-----------ALLRSFSKEKS-----Y
        NS+F+HKIC+ARQ++  IS I    G  C +D DI    I HF   +T        +  L   P  N+  ++             L+SF+K K+     Y
Subjt:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT-------MLSTLS--PKINLQVQM-----------ALLRSFSKEKS-----Y

Query:  VL------------------------------------------------LDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAIL
         +                                                 D+RPISLTT +YKLIAK +A+R K  LP+ ISE+Q+AF +GRQIT+AIL
Subjt:  VL------------------------------------------------LDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAIL

Query:  IANEAVDFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTL
        IANEA+DFW+ KK +GFV+KLDIEK FDK+NW+FIDF+L+KK++ QKWRK I +CISSVQYSILIN RPRG+IKP+RGIRQGD +SPFIFVLAMDYLS L
Subjt:  IANEAVDFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTL

Query:  LNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPLG
        LN++  +  I GV F+   NLTHILFADDIL+F+ED D  + N++    LFE ASGLNINL+KSTI PIN    R   +A  WGIS   +P  YLG+PLG
Subjt:  LNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPLG

Query:  GKPSSRNFWAN----------------------------------------------------------------------------SAAPKLLGCLSLK
        G+PSS NFW N                                                                              +PK  G L + 
Subjt:  GKPSSRNFWAN----------------------------------------------------------------------------SAAPKLLGCLSLK

Query:  G-----------------------------------------STGSFHRSNGTLKGATPCPS--------SIVDGMILDL--------------CPR---
                                                  S G F  +N   K  T C S         + DG  +                 PR   
Subjt:  G-----------------------------------------STGSFHRSNGTLKGATPCPS--------SIVDGMILDL--------------CPR---

Query:  -------------------------RPLRSVEEALWGDMKASLPS-LPAFGFDIPRWNLNNNGSFTVASIKLARPLNNQAEVNYNDG-------------
                                 RPLR  EE LW ++KASLP+ LP  G   P WNLN+N  F  AS+K A      +  N++               
Subjt:  -------------------------RPLRSVEEALWGDMKASLPS-LPAFGFDIPRWNLNNNGSFTVASIKLARPLNNQAEVNYNDG-------------

Query:  --------------------RRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVAAT
                            +RLP W + P+WC +C  ++ED NHLF  CP+S +LW K + +L+    P +   L + I     + +K  I+ +  A  
Subjt:  --------------------RRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVAAT

Query:  LWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL
        LW IW ERN RIFK +EK    +W+D  A   L
Subjt:  LWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL

A0A5D3C4J1 LINE-1 retrotransposable element ORF2 protein1.1e-19541.44Show/hide
Query:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV
        M  FN FI+  + IDPPL+NAKFTWSNLR  P+LSR+D FLYT NWE LF  H+SK L RVTSDHF I LE + + WG SPF+  N  LKE  FK NI  
Subjt:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV

Query:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE
        WWKN  Q GHPG+SFM +LKQLS  I+N  +  K  ++E+    + E++ ID LE + N++     RR  +K D+    FKEAQIW QK KRLW  EGDE
Subjt:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE

Query:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRF--------------------------------------------------TML
        N++F+HKICSARQRRS IS+I++A GV CS++  I K  +DHF   +                                                  T  
Subjt:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRF--------------------------------------------------TML

Query:  STLSPKINLQVQMALLRSFSKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAILIANEAVDFWKKKKVKGFVVKLDI
         T+S  IN+   +AL+    KEK     DYRPISLTT +YKLIAKVIAER K  LP  ++ENQ+AF + RQI DAIL+ANEA+D+W+ KK++GFV+KLDI
Subjt:  STLSPKINLQVQMALLRSFSKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAILIANEAVDFWKKKKVKGFVVKLDI

Query:  EKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKHNLTH
        EK FDK+NW+FIDFML+KK +P KWR WI ACISSVQYSI+IN RPRGKI+P+RGIRQGD ISPFIFVLAMDY+S LLN + ++  IKGV   G  NLTH
Subjt:  EKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKHNLTH

Query:  ILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPLGGKPSSRNFWAN------------
        +LFADDILLF+EDD+ +I N++    LF+LASGL+INLNKSTISPIN DA RT  +A++WGIS  F+PI YLGVPLGGK +++ FW N            
Subjt:  ILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPLGGKPSSRNFWAN------------

Query:  -----SAAPKL--------------------------------------------------------------LGCLSLKGSTGSF--------------
             S   K+                                                              LG   LK +  +               
Subjt:  -----SAAPKL--------------------------------------------------------------LGCLSLKGSTGSF--------------

Query:  -------HRSNGTLKGATPC--------------------------------------------------------------PSSIVDG-----MILDLC
                +     KG  PC                                                               SSI D      M  DL 
Subjt:  -------HRSNGTLKGATPC--------------------------------------------------------------PSSIVDG-----MILDLC

Query:  PRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIK--LARP----LNNQAEVNYND----------------------------GRR
        PRR LR  E  LW ++K S+  S    G D P W LN+NG +TVAS+K  L +P    L+ Q++  + +                             +R
Subjt:  PRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIK--LARP----LNNQAEVNYND----------------------------GRR

Query:  LPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVAATLWNIWNERNRRIFKGKEKLVGS
        LP    +PSWC +CK  +EDR HLF LCP +  +W+ +   L+  +  ++   LC  +   K KTKK  I  +  A+ LWNIW ERN RIF GKEK V  
Subjt:  LPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVAATLWNIWNERNRRIFKGKEKLVGS

Query:  VWKDIQATTDL
        +W+DI+A   L
Subjt:  VWKDIQATTDL

A0A5D3DLM2 LINE-1 retrotransposable element ORF2 protein9.9e-18947.18Show/hide
Query:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV
        M  FN FIS  + IDPPL+NAKFTWSNLR QP+LSR+D FLYT NWE LF  H+SK L RVTSDHF IALE + + WG SPF+F N  LKE  FK+N+ +
Subjt:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV

Query:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE
        WWKN  Q GHPG+SFM +LKQLS  I++  K  K  N+EE +  + E+++ID LE + N +     RR  +K D+    FKEAQIW QK KRLW  EGDE
Subjt:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE

Query:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT--------MLSTL--SPKINLQVQ-----------MALLRSFSKEKSYVLLD
        N++F+HKICSARQRRS IS+I++  GV CS++  I K  +DHF   +         ++  L  SP    Q Q            A L +FS  KS     
Subjt:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT--------MLSTL--SPKINLQVQ-----------MALLRSFSKEKSYVLLD

Query:  YRPISLTTGLYK---------------------LIAK--------VIAERRKLVLP----EIISENQLAF-RGRQITDAILIANEAVDFWKKKKVKGFVV
          P   T   YK                     +I K        +IA++ K   P     I++ENQ+AF +GRQI DAIL+ANEA+D+W+ KK++GFV+
Subjt:  YRPISLTTGLYK---------------------LIAK--------VIAERRKLVLP----EIISENQLAF-RGRQITDAILIANEAVDFWKKKKVKGFVV

Query:  KLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKH
        KLDIEK FDK+NW+FIDFML+KK +P +WRKWI ACISSVQYSI+IN RPRGKI+P+RGIRQGD ISPFIFVLAMDY+S LLN + ++  IKGV   G  
Subjt:  KLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKH

Query:  NLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPLGGKPSSRNFWAN---SAAPK
        NLTH+LFADDILLF+EDD+ +I N++    LF+LASGL+INLNKSTISPIN  A RT  +A++WGIS  F+PI YLGVPLGGK ++++FW N       K
Subjt:  NLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPLGGKPSSRNFWAN---SAAPK

Query:  LLG---CLSLKGST---GSFHRSNGT-----LKGATPCPSSIVDG-----MILDLCPRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTV
        L      +  KG+    GS +  + T       G +   SSI D      M  DL PRR +R  E  LW ++K SL  S    G D P W LN++G +TV
Subjt:  LLG---CLSLKGST---GSFHRSNGT-----LKGATPCPSSIVDG-----MILDLCPRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTV

Query:  ASIKLARPLNNQAEVNYNDGRRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVAAT
        AS+K A    +Q+ ++            K  W T        +  +F +           E +  R         LC      K KTKK  I  +  A+ 
Subjt:  ASIKLARPLNNQAEVNYNDGRRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVAAT

Query:  LWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL
        LWNIW ERN RIF GKEK V  +W+DI+A   L
Subjt:  LWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL

A0A5D3DM72 LINE-1 retrotransposable element ORF2 protein2.3e-19340.48Show/hide
Query:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV
        M  FN FI+  + IDPPL+NAKFTWSNLR  P+LSR+D FLYT NWE LF  H+SK L RVTSDHF I LE + + WG SPF+  N  LKE  FK N+  
Subjt:  MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEV

Query:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE
        WWKN  Q GHPG+SFM +LKQLS  I+N  +  K  ++E+    + E++ ID LE + N++     RR  +K D+    FKEAQIW QK KRLW  EGDE
Subjt:  WWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDE

Query:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT--------MLSTL--SPKINLQVQ----------------------------
        N++F+HKICSARQRRS IS+I++  GV CS++  I K  +DHF   +         ++  L  SP    Q Q                            
Subjt:  NSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHFRRRFT--------MLSTL--SPKINLQVQ----------------------------

Query:  -----------------MALLRSF-------------------SKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAI
                         + + R F                    KEK     DYRPISLTT +YKLIAKVIAER K  LP  ++ENQ+AF +GRQI DAI
Subjt:  -----------------MALLRSF-------------------SKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAI

Query:  LIANEAVDFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLST
        L+ANEA+D+W+ KK++GFV+KLDIEK FDK+NW+FIDFML+KK +P KWR WI ACISSVQYSI+IN RPRGKI+P+RGIRQGD ISPFIFVLAMDY+S 
Subjt:  LIANEAVDFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLST

Query:  LLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPL
        LLN + ++  IKGV   G  NLTH+LFADDILLF+EDD+ +I N++    LF+LASGL+INLNKSTISPIN DA RT  +A++WGIS  F+PI YLGVPL
Subjt:  LLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPL

Query:  GGKPSSRNFWAN-----------------SAAPKL--------------------------------------------------------------LGC
        GGK  ++ FW N                 S   K+                                                              LG 
Subjt:  GGKPSSRNFWAN-----------------SAAPKL--------------------------------------------------------------LGC

Query:  LSLKGSTGSF---------------------HRSNGTLKGATPC--------------------------------------------------------
          LK +  +                       +     KG  PC                                                        
Subjt:  LSLKGSTGSF---------------------HRSNGTLKGATPC--------------------------------------------------------

Query:  ------PSSIVDG-----MILDLCPRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIK--LARPLNN----QAEVNYND-------
               SSI D      M  DL PRR LR  E  LW ++K SL  S    G D P W LN+NG +TVAS+K  L +P  N    Q++  + +       
Subjt:  ------PSSIVDG-----MILDLCPRRPLRSVEEALWGDMKASL-PSLPAFGFDIPRWNLNNNGSFTVASIK--LARPLNN----QAEVNYND-------

Query:  ---------------------GRRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVA
                              +RLP    +PSWC +CK  +EDR HLF LCP +  +W+ +   L   +  ++   LC  +   K KTKK  I  +  A
Subjt:  ---------------------GRRLPTWNIKPSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVA

Query:  ATLWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL
        + LWNIW ERN RIF GKEK V  +W+DI+A   L
Subjt:  ATLWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein7.4e-1627.71Show/hide
Query:  DYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAILIANEAVDFWKKKKVKGFV-VKLDIEKTFDKINWKFIDFMLLKKDFPQKWRK
        ++RPISL     K++ K++A R +  + ++I  +Q+ F  G Q    I  +   +    + K K  V + +D EK FDKI   F+   L K      + K
Subjt:  DYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAILIANEAVDFWKKKKVKGFV-VKLDIEKTFDKINWKFIDFMLLKKDFPQKWRK

Query:  WIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRL
         I A       +I++N +         G RQG  +SP +F + ++ L+  +    ++  IKG+   GK  +   LFADD++++LE+   +  N+      
Subjt:  WIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRL

Query:  FELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPL
        F   SG  IN+ KS     N + Q  + +  +   +I    I+YLG+ L
Subjt:  FELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPL

P11369 LINE-1 retrotransposable element ORF2 protein8.2e-1528.11Show/hide
Query:  DYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAILIANEAVDFWKKKKVKG-FVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRK
        ++RPISL     K++ K++A R +  +  II  +Q+ F  G Q    I  +   + +  K K K   ++ LD EK FDKI   F+  +L +      +  
Subjt:  DYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAF-RGRQITDAILIANEAVDFWKKKKVKG-FVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRK

Query:  WIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRL
         I+A  S    +I +N      I    G RQG  +SP++F + ++ L+  +    +Q  IKG+   GK  +   L ADD+++++ D   +   +      
Subjt:  WIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRL

Query:  FELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPL
        F    G  IN NKS       + Q    +      SI    I+YLGV L
Subjt:  FELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPL

P14381 Transposon TX1 uncharacterized 149 kDa protein1.1e-1124.6Show/hide
Query:  DIEKTLIDHFRRRFTMLSTLSPKINLQVQMALLRSFSKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQ-LAFRGRQITDAILIANEAV
        D  + L + F++    LS     ++L  +   LR        ++ ++RP+SL +  YK++AK I+ R K VL E+I  +Q     GR I D + +  + +
Subjt:  DIEKTLIDHFRRRFTMLSTLSPKINLQVQMALLRSFSKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQ-LAFRGRQITDAILIANEAV

Query:  DFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEK
         F ++  +    + LD EK FD+++ +++   L    F  ++  +++   +S +  + IN      +   RG+RQG  +S  ++ LA++    LL     
Subjt:  DFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEK

Query:  QNHIKGVSFNGKHNLTHIL--FADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKST---ISPINTDAQRTNCVATKWGISINFIPIQYLGVPLGG
        +  + G+    + ++  +L  +ADD++L +  D   ++  +    ++  AS   IN +KS+      +  D          W   I    I+YLGV L  
Subjt:  QNHIKGVSFNGKHNLTHIL--FADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKST---ISPINTDAQRTNCVATKWGISINFIPIQYLGVPLGG

Query:  K--PSSRNF
        +  P S+NF
Subjt:  K--PSSRNF

P92555 Uncharacterized mitochondrial protein AtMg012505.2e-0940.3Show/hide
Query:  LINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSF-NGKHNLTHILFADD
        +IN  P+G + P+RG+RQGD +SP++F+L  + LS L    ++Q  + G+   N    + H+LFADD
Subjt:  LINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSF-NGKHNLTHILFADD

Q03274 Retrovirus-related Pol polyprotein from type-1 retrotransposable element R2 (Fragment)6.1e-1027.19Show/hide
Query:  DYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAFRGRQITDAILIANEAVDFW---KKKKVKGF-VVKLDIEKTFDKINWKFIDFMLLKKDFPQKW
        ++RPI++ + L +L+ +++A+R      E   E   A +G    D  L+ +  +D +   ++++ K + VV LD+ K FD ++   I   L +    +  
Subjt:  DYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAFRGRQITDAILIANEAVDFW---KKKKVKGF-VVKLDIEKTFDKINWKFIDFMLLKKDFPQKW

Query:  RKWIEACISSVQYSILINDRPR-GKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFS
          +I   +S    +I +    +  KI   RG++QGD +SPF+F   +D    LL  ++    I G    G+  +  + FADD+LL LED+D  +     +
Subjt:  RKWIEACISSVQYSILINDRPR-GKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSFNGKHNLTHILFADDILLFLEDDDKTIDNMRFS

Query:  FRLFELASGLNINLNKS
           F    G+++N  KS
Subjt:  FRLFELASGLNINLNKS

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein2.6e-0821.29Show/hide
Query:  MNNFNTFISGCDFIDPPLTNAKFTWSNLR-AQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSL-KWGRSPFRFTNSFLKEVSFKQNI
        +  F   +   D +D P     +TWSN +   PI+ +LD  +   +W   F    +       SDH    +   +L K  +  FR+ +      +F  ++
Subjt:  MNNFNTFISGCDFIDPPLTNAKFTWSNLR-AQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSL-KWGRSPFRFTNSFLKEVSFKQNI

Query:  EVWWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEV----DSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLW
         V W+    VG   +S    LK      K  ++    + + + +  L+ LE I +  +    DS     H+ R+   K +    A +    + QK +  W
Subjt:  EVWWKNTAQVGHPGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEV----DSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLW

Query:  NLEGDENSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHF
          +GD N+ F+HK+  A Q ++ I  +     V   +   +++ ++ ++
Subjt:  NLEGDENSAFYHKICSARQRRSFISSISTAQGVLCSSDVDIEKTLIDHF

AT1G45063.1 copper ion binding;electron carriers1.9e-0627.56Show/hide
Query:  TVASIKLARPLNNQAEVNYNDGRRLPTWNIK-PSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKG-------KTKKQ
        T  ++    PL        +   R  +W I+ PS C LC A +E R H+F  CPFS ++W           +  N+      ++K          + KK 
Subjt:  TVASIKLARPLNNQAEVNYNDGRRLPTWNIK-PSWCTLCKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKG-------KTKKQ

Query:  TISLHLV-AATLWNIWNERNRRIFKGK
           L L   A++++IW ERN R+   K
Subjt:  TISLHLV-AATLWNIWNERNRRIFKGK

AT4G20520.1 RNA binding;RNA-directed DNA polymerases5.0e-0738.27Show/hide
Query:  IAERRKLVLPEIISENQLAF-RGRQITDAILIANEAVDFWKKKK-VKGF-VVKLDIEKTFDKINWKFIDFMLLKKDFPQKW
        + ER K ++  +I   Q +F  GR  TD I+   EAV   ++KK VKG+ ++KLD+EK +D+I W +++  L+   FP+ W
Subjt:  IAERRKLVLPEIISENQLAF-RGRQITDAILIANEAVDFWKKKK-VKGF-VVKLDIEKTFDKINWKFIDFMLLKKDFPQKW

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)3.7e-1040.3Show/hide
Query:  LINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSF-NGKHNLTHILFADD
        +IN  P+G + P+RG+RQGD +SP++F+L  + LS L    ++Q  + G+   N    + H+LFADD
Subjt:  LINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVSF-NGKHNLTHILFADD


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAATAATTTCAATACTTTCATCTCGGGATGTGATTTCATTGATCCTCCCTTAACAAATGCCAAGTTTACTTGGTCAAATCTCAGAGCTCAGCCCATTCTTTCAAGACT
CGACATATTTCTATACACTCCAAATTGGGAAATTCTATTTGAACCGCACTTCTCCAAAATGCTCCCTAGAGTAACATCAGACCATTTTTCCATTGCTCTCGAGTTCAACA
GTCTAAAGTGGGGCCGCTCTCCTTTCAGATTCACAAACTCCTTTCTCAAAGAAGTCTCTTTCAAACAAAACATTGAGGTATGGTGGAAAAACACTGCTCAAGTTGGTCAC
CCGGGTTACTCCTTTATGGGGAGGCTCAAACAACTTTCTTATACAATAAAAAATTGGAGTAAAAGCCTAAAAGAATCTAACGAAGAAGAAATACAGACTTTATTGAATGA
GCTTGAGCACATAGACAATTTAGAAGTTGATAGCAATATTACCAATTTGCACATTACTCGCAGAGCATCCATCAAAACTGATCTGCGTCAAATGGCCTTTAAAGAAGCTC
AAATATGGGCTCAAAAATGCAAACGGTTATGGAACCTGGAAGGGGATGAAAACTCTGCTTTTTATCATAAAATATGCTCTGCTAGACAAAGAAGAAGCTTCATCTCAAGC
ATTTCTACCGCACAAGGAGTTCTTTGTAGTTCTGATGTGGATATAGAGAAAACTCTCATTGATCATTTCAGGAGGAGATTTACAATGCTCTCAACACTTTCTCCAAAAAT
AAATCTTCAGGTCCAGATGGCTTTACTACGAAGTTTCTCAAAGGAGAAAAGTTATGTCCTTTTGGATTACAGACCCATAAGCCTCACAACTGGTCTATACAAGCTCATTG
CTAAAGTAATTGCCGAAAGACGAAAATTAGTTCTGCCTGAAATTATCTCAGAGAATCAGTTAGCTTTCAGAGGGAGGCAGATAACTGATGCCATTTTGATTGCAAATGAA
GCTGTGGACTTCTGGAAAAAGAAAAAAGTCAAAGGCTTTGTTGTCAAGCTTGACATAGAGAAGACTTTTGATAAAATTAATTGGAAATTCATTGACTTTATGCTACTTAA
GAAAGACTTTCCCCAAAAATGGCGTAAATGGATTGAAGCTTGTATCTCAAGCGTTCAATACTCCATCCTTATCAATGACAGACCTAGAGGAAAAATTAAACCCAATAGAG
GTATTCGACAAGGAGATCATATCTCTCCTTTTATTTTTGTTCTCGCTATGGATTATCTCAGTACACTTCTCAATCACATGGAGAAACAAAACCATATTAAAGGAGTGAGT
TTCAATGGGAAACACAATCTCACACATATCCTTTTTGCAGATGATATCCTACTTTTTCTGGAGGATGATGACAAAACTATTGATAACATGAGATTTTCTTTTCGTCTTTT
TGAATTGGCCTCAGGTCTCAACATCAACCTCAACAAATCTACCATTTCACCTATTAATACGGATGCACAGAGAACAAATTGTGTGGCGACAAAATGGGGTATCTCTATAA
ATTTTATTCCCATTCAATATTTGGGGGTGCCATTGGGAGGTAAACCAAGTTCTAGAAATTTCTGGGCAAATTCAGCAGCTCCAAAGCTCCTAGGATGTTTATCATTAAAA
GGGTCGACTGGGTCCTTCCACAGGTCAAATGGAACATTAAAAGGGGCGACTCCCTGTCCTTCTAGCATAGTTGATGGCATGATCTTAGATCTTTGTCCTCGAAGACCGTT
GAGAAGTGTTGAAGAAGCTCTTTGGGGCGATATGAAAGCTTCCCTCCCCTCCCTACCTGCATTTGGTTTTGACATTCCTCGTTGGAATTTGAACAACAATGGCAGCTTTA
CGGTGGCCTCCATTAAACTTGCGAGACCTCTCAATAATCAAGCAGAGGTTAATTATAATGATGGGAGGAGACTCCCAACTTGGAATATAAAACCTTCTTGGTGTACTCTA
TGCAAAGCTGCTGAGGAAGACAGAAATCACTTGTTCTCCCTTTGTCCTTTCTCTTCGAAGCTTTGGCAAAAAGTTGAAGTTATCTTGGATAGACCCCTGTACCCAATTAA
TTCCACTTATTTGTGCAAAGAAATCTACAAAACAAAGGGTAAAACAAAAAAACAAACTATATCACTGCACTTGGTGGCTGCAACCCTTTGGAATATATGGAACGAAAGAA
ACAGAAGAATTTTCAAGGGGAAAGAAAAACTAGTTGGTTCTGTGTGGAAGGACATTCAAGCTACGACTGATCTCTGA
mRNA sequenceShow/hide mRNA sequence
ATGAATAATTTCAATACTTTCATCTCGGGATGTGATTTCATTGATCCTCCCTTAACAAATGCCAAGTTTACTTGGTCAAATCTCAGAGCTCAGCCCATTCTTTCAAGACT
CGACATATTTCTATACACTCCAAATTGGGAAATTCTATTTGAACCGCACTTCTCCAAAATGCTCCCTAGAGTAACATCAGACCATTTTTCCATTGCTCTCGAGTTCAACA
GTCTAAAGTGGGGCCGCTCTCCTTTCAGATTCACAAACTCCTTTCTCAAAGAAGTCTCTTTCAAACAAAACATTGAGGTATGGTGGAAAAACACTGCTCAAGTTGGTCAC
CCGGGTTACTCCTTTATGGGGAGGCTCAAACAACTTTCTTATACAATAAAAAATTGGAGTAAAAGCCTAAAAGAATCTAACGAAGAAGAAATACAGACTTTATTGAATGA
GCTTGAGCACATAGACAATTTAGAAGTTGATAGCAATATTACCAATTTGCACATTACTCGCAGAGCATCCATCAAAACTGATCTGCGTCAAATGGCCTTTAAAGAAGCTC
AAATATGGGCTCAAAAATGCAAACGGTTATGGAACCTGGAAGGGGATGAAAACTCTGCTTTTTATCATAAAATATGCTCTGCTAGACAAAGAAGAAGCTTCATCTCAAGC
ATTTCTACCGCACAAGGAGTTCTTTGTAGTTCTGATGTGGATATAGAGAAAACTCTCATTGATCATTTCAGGAGGAGATTTACAATGCTCTCAACACTTTCTCCAAAAAT
AAATCTTCAGGTCCAGATGGCTTTACTACGAAGTTTCTCAAAGGAGAAAAGTTATGTCCTTTTGGATTACAGACCCATAAGCCTCACAACTGGTCTATACAAGCTCATTG
CTAAAGTAATTGCCGAAAGACGAAAATTAGTTCTGCCTGAAATTATCTCAGAGAATCAGTTAGCTTTCAGAGGGAGGCAGATAACTGATGCCATTTTGATTGCAAATGAA
GCTGTGGACTTCTGGAAAAAGAAAAAAGTCAAAGGCTTTGTTGTCAAGCTTGACATAGAGAAGACTTTTGATAAAATTAATTGGAAATTCATTGACTTTATGCTACTTAA
GAAAGACTTTCCCCAAAAATGGCGTAAATGGATTGAAGCTTGTATCTCAAGCGTTCAATACTCCATCCTTATCAATGACAGACCTAGAGGAAAAATTAAACCCAATAGAG
GTATTCGACAAGGAGATCATATCTCTCCTTTTATTTTTGTTCTCGCTATGGATTATCTCAGTACACTTCTCAATCACATGGAGAAACAAAACCATATTAAAGGAGTGAGT
TTCAATGGGAAACACAATCTCACACATATCCTTTTTGCAGATGATATCCTACTTTTTCTGGAGGATGATGACAAAACTATTGATAACATGAGATTTTCTTTTCGTCTTTT
TGAATTGGCCTCAGGTCTCAACATCAACCTCAACAAATCTACCATTTCACCTATTAATACGGATGCACAGAGAACAAATTGTGTGGCGACAAAATGGGGTATCTCTATAA
ATTTTATTCCCATTCAATATTTGGGGGTGCCATTGGGAGGTAAACCAAGTTCTAGAAATTTCTGGGCAAATTCAGCAGCTCCAAAGCTCCTAGGATGTTTATCATTAAAA
GGGTCGACTGGGTCCTTCCACAGGTCAAATGGAACATTAAAAGGGGCGACTCCCTGTCCTTCTAGCATAGTTGATGGCATGATCTTAGATCTTTGTCCTCGAAGACCGTT
GAGAAGTGTTGAAGAAGCTCTTTGGGGCGATATGAAAGCTTCCCTCCCCTCCCTACCTGCATTTGGTTTTGACATTCCTCGTTGGAATTTGAACAACAATGGCAGCTTTA
CGGTGGCCTCCATTAAACTTGCGAGACCTCTCAATAATCAAGCAGAGGTTAATTATAATGATGGGAGGAGACTCCCAACTTGGAATATAAAACCTTCTTGGTGTACTCTA
TGCAAAGCTGCTGAGGAAGACAGAAATCACTTGTTCTCCCTTTGTCCTTTCTCTTCGAAGCTTTGGCAAAAAGTTGAAGTTATCTTGGATAGACCCCTGTACCCAATTAA
TTCCACTTATTTGTGCAAAGAAATCTACAAAACAAAGGGTAAAACAAAAAAACAAACTATATCACTGCACTTGGTGGCTGCAACCCTTTGGAATATATGGAACGAAAGAA
ACAGAAGAATTTTCAAGGGGAAAGAAAAACTAGTTGGTTCTGTGTGGAAGGACATTCAAGCTACGACTGATCTCTGA
Protein sequenceShow/hide protein sequence
MNNFNTFISGCDFIDPPLTNAKFTWSNLRAQPILSRLDIFLYTPNWEILFEPHFSKMLPRVTSDHFSIALEFNSLKWGRSPFRFTNSFLKEVSFKQNIEVWWKNTAQVGH
PGYSFMGRLKQLSYTIKNWSKSLKESNEEEIQTLLNELEHIDNLEVDSNITNLHITRRASIKTDLRQMAFKEAQIWAQKCKRLWNLEGDENSAFYHKICSARQRRSFISS
ISTAQGVLCSSDVDIEKTLIDHFRRRFTMLSTLSPKINLQVQMALLRSFSKEKSYVLLDYRPISLTTGLYKLIAKVIAERRKLVLPEIISENQLAFRGRQITDAILIANE
AVDFWKKKKVKGFVVKLDIEKTFDKINWKFIDFMLLKKDFPQKWRKWIEACISSVQYSILINDRPRGKIKPNRGIRQGDHISPFIFVLAMDYLSTLLNHMEKQNHIKGVS
FNGKHNLTHILFADDILLFLEDDDKTIDNMRFSFRLFELASGLNINLNKSTISPINTDAQRTNCVATKWGISINFIPIQYLGVPLGGKPSSRNFWANSAAPKLLGCLSLK
GSTGSFHRSNGTLKGATPCPSSIVDGMILDLCPRRPLRSVEEALWGDMKASLPSLPAFGFDIPRWNLNNNGSFTVASIKLARPLNNQAEVNYNDGRRLPTWNIKPSWCTL
CKAAEEDRNHLFSLCPFSSKLWQKVEVILDRPLYPINSTYLCKEIYKTKGKTKKQTISLHLVAATLWNIWNERNRRIFKGKEKLVGSVWKDIQATTDL