CuGenDBv2

Lcy01g006590 (gene) of Sponge gourd (P93075) v1 genome

Gene ID: Lcy01g006590
Organism: Luffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
Description: Transposon TX1 uncharacterized 149 kDa protein
Genome location: Chr01:10284204..10287761
RNA-Seq Expression: Lcy01g006590
Synteny: Lcy01g006590
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains: IPR005135 - Endonuclease/exonuclease/phosphatase
                  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology
GenBank top hits    e value    % identity    Alignment
CAN68838.1 hypothetical protein VITISV_030956 [Vitis vinifera]    2.7e-143    35.04
Query:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------
        IISWN RG+GS KKR ++K FL ++ P +V+ QETK    DR+   SVW++RN  W AL A G SGGI I+W+       EV+ G               
Subjt:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------

Query:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS
             VYGPN+S  RK LW EL D+  L  P W +GGDFN+ R SSEK   +  T  M+ F+ FI    L D+PL +  +TWS+ Q NP    +DRFL S
Subjt:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS

Query:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA
        +     F  +    L R TSDH+PI L     KWGP+PFRF N WL H    +    WW      GW GH FM+KL+ +K +LK+WN+ +F    + K  
Subjt:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA

Query:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY
        +   L   D+ E+ G L+ + + +R+I K ++  +  +EEI WRQK ++KW  EGD NS FFH +    R +  I E+ + +G+ + N + I++E L ++
Subjt:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY

Query:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK
        + L+T     +   +  D SPIS E    LESPFTE EI +A+  +  +K PGPDGFT    +  W ++KED+ +VF +F R+GIIN S N ++I L+PK
Subjt:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK

Query:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------
        K  ++ I                  VL+ R+++VL  TI   Q  FV+ RQI+DA+LI NE++DE  R   E                            
Subjt:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------

Query:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN
                                                                                             DDTI FSS  +  + 
Subjt:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN

Query:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA
         L + +L F   SGL +N  K+   G+ + Q H + LA    CK   WPI YLG PL GNP++  FW PVIE+I +RL  W   ++S GGR TLIQ+ L 
Subjt:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA

Query:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN
        ++P Y+LS+      V   IE++ R++LW G    +  HL+N
Subjt:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN

RVW12714.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]    1.5e-146    37.5
Query:  RGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEGVYGPNSSNERKFLWQELHDLQ
        +G+GS KKR ++K FL+++ P +V++QETK    DR++  SVWS RN  W AL A G SGGI I+W+    R  EVV  VYGPN+S  RK  W EL D+ 
Subjt:  RGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEGVYGPNSSNERKFLWQELHDLQ

Query:  ALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLISDNISIKFISAQARKLERSTSDHFPIC
         L  P W +GGDFN+ R SSEK   +  T  M+ F++FI    L D PL +  YTWS+ Q NP    +DRFL S+     F  +    L R TSDH+PI 
Subjt:  ALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLISDNISIKFISAQARKLERSTSDHFPIC

Query:  LSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSALSRELSAIDNNEENGQLTEQDINRRS
        L     KWGP+PFRF N WL H +  +    WW+     GW GH FM+KL+ +K +LK WN+ +F    + K  +   L+  D+ E+ G L+ + + +R+
Subjt:  LSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSALSRELSAIDNNEENGQLTEQDINRRS

Query:  IIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFYKGLFTEKDNRAPLPQIDDLSPISDEQ
          K ++  +  +EEI WRQK ++KW  EGD NS FFH +    R +  I E+ + SG  + N + I++E L +++ L+      +   +  D SPI  E 
Subjt:  IIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFYKGLFTEKDNRAPLPQIDDLSPISDEQ

Query:  NAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPKKIGAKTI------------------V
         + LESPFTE EIY+A+  +  +K PGPDGFT    +  W+++KED+ RVF +F R+GIIN S N ++I LIPKK  ++ I                  V
Subjt:  NAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPKKIGAKTI------------------V

Query:  LSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE------------------------------------------------------
        L+ RL+ VL  TI   Q  FV+ RQI+DA+LI NE++DE  R   E                                                      
Subjt:  LSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE------------------------------------------------------

Query:  ---------------------------DDTILFSSPDKNHINNLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLN
                                   DDTI FSS  +  +  L S +L F   SGL +N  K+   G+ + Q H + LA    CK   WPI YLG PL 
Subjt:  ---------------------------DDTILFSSPDKNHINNLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLN

Query:  GNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLANLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN
        GNP++  FW PVIE+I +RL  W   ++S GGR TLIQ+ L ++P Y+LS+      V   IE++ R +LW G    +  HL+N
Subjt:  GNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLANLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]    9.1e-152    36.15
Query:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------
        I+SWN RG+GS KKR +++ FL+T+NP IV+LQETK  ++DR+   SVW  + + W AL A G SGGI ILW+ S     E V G               
Subjt:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------

Query:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS
             VYGP +   RK  W EL DL  L  P W +GGDFN+ R  SEK   T  T  MR F++FI  +GL D PL N  +TWS+ Q +P    +DRFL S
Subjt:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS

Query:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA
              F  +    L R TSDH PICL     KWGP+PFRF N WL H +  +    WW   + +GW GH FM+KLK +K +LK WN  TF   +E K  
Subjt:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA

Query:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY
        +  +LS ID  E+ G L    +  R++ + ++ ++  KEE+ WRQK ++KW  EGD NS FFH + T  R +  I  ++S  G ++ N +DI +E ++F+
Subjt:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY

Query:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK
          L+++    +   +  D  PIS E   +L+ PFTE E+ RAV  +   K PGPDGFT    ++ W+++KED+ RVF +F  NG+IN S N T+I L+PK
Subjt:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK

Query:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------
        K  +  I                  VLS RL+KVL  TI+  Q  FV+ R I+DA+LI NE++DE  R   E                            
Subjt:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------

Query:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN
                                                                                             DDTI FS     H+ 
Subjt:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN

Query:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA
        NL   +L F + SGL IN +K+   G+   Q+  +SLAS F C++  WP++YLG PL GNP+++ FW PV+E+I +RL  W   ++S GGR TLIQ+ L+
Subjt:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA

Query:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLL
        ++P Y+LS+      +   IEK+ RN+LW G    +  HL+
Subjt:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLL

RVW65579.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]    3.5e-143    35.63
Query:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------
        IISWN RG+GS KKR ++K FL+++ P +V++QETK    DR++  SVWS RN  W AL A G SGGI I+W+    R  EVV G               
Subjt:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------

Query:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS
             VYGPN+S  RK  W EL D+  L  P W +GGDFN+ R SSEK   +  T  M+ F++FI    L D PL +  YTWS+ Q NP    +DRFL S
Subjt:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS

Query:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA
        +     F  +    L R TSDH+PI L     KWGP+PFRF N WL H    +    WW+     GW GH FM+KL+ +K +LK WN+ +F    + K  
Subjt:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA

Query:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY
        +   L+  D+ E+ G L+++ + +R+  K ++  +  +EEI WRQK ++KW  +GD NS FFH +    R +  I E+ + SG  + N + I++E L ++
Subjt:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY

Query:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK
        + L+      +   +  D SPI  E  + LESPFTE EIY+A+  +  +K PGPDGFT    +  W+++KED+ RVF +F R+GIIN S N ++I L+PK
Subjt:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK

Query:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------
        K  ++ I                  VL+ RL+ VL  TI   Q  FV+ RQI+DA+LI NE++DE  R   E                            
Subjt:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------

Query:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN
                                                                                             DDTI FSS  +  + 
Subjt:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN

Query:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA
         L S +L F   SGL +N  K+   G+ I Q H + LA    CK   WPI YLG PL GNP++  FW PVIE+I +RL  W   ++S GGR TLIQ+ L 
Subjt:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA

Query:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN
        ++P Y+LS+      V   IE++ R +LW G    +  HL+N
Subjt:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]    2.7e-143    35.04
Query:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------
        IISWN RG+GS KKR ++K FL ++ P +V+ QETK    DR+   SVW++RN  W AL A G SGGI I+W+       EV+ G               
Subjt:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------

Query:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS
             VYGPN+S  RK LW EL D+  L  P W +GGDFN+ R SSEK   +  T  M+ F+ FI    L D+PL +  +TWS+ Q NP    +DRFL S
Subjt:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS

Query:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA
        +     F  +    L R TSDH+PI L     KWGP+PFRF N WL H    +    WW      GW GH FM+KL+ +K +LK+WN+ +F    + K  
Subjt:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA

Query:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY
        +   L   D+ E+ G L+ + + +R+I K ++  +  +EEI WRQK ++KW  EGD NS FFH +    R +  I E+ + +G+ + N + I++E L ++
Subjt:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY

Query:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK
        + L+T     +   +  D SPIS E    LESPFTE EI +A+  +  +K PGPDGFT    +  W ++KED+ +VF +F R+GIIN S N ++I L+PK
Subjt:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK

Query:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------
        K  ++ I                  VL+ R+++VL  TI   Q  FV+ RQI+DA+LI NE++DE  R   E                            
Subjt:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------

Query:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN
                                                                                             DDTI FSS  +  + 
Subjt:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN

Query:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA
         L + +L F   SGL +N  K+   G+ + Q H + LA    CK   WPI YLG PL GNP++  FW PVIE+I +RL  W   ++S GGR TLIQ+ L 
Subjt:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA

Query:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN
        ++P Y+LS+      V   IE++ R++LW G    +  HL+N
Subjt:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN

TrEMBL top hits    e value    % identity    Alignment
A0A438BP29 Transposon TX1 uncharacterized 149 kDa protein    7.3e-147    37.5
Query:  RGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEGVYGPNSSNERKFLWQELHDLQ
        +G+GS KKR ++K FL+++ P +V++QETK    DR++  SVWS RN  W AL A G SGGI I+W+    R  EVV  VYGPN+S  RK  W EL D+ 
Subjt:  RGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEGVYGPNSSNERKFLWQELHDLQ

Query:  ALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLISDNISIKFISAQARKLERSTSDHFPIC
         L  P W +GGDFN+ R SSEK   +  T  M+ F++FI    L D PL +  YTWS+ Q NP    +DRFL S+     F  +    L R TSDH+PI 
Subjt:  ALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLISDNISIKFISAQARKLERSTSDHFPIC

Query:  LSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSALSRELSAIDNNEENGQLTEQDINRRS
        L     KWGP+PFRF N WL H +  +    WW+     GW GH FM+KL+ +K +LK WN+ +F    + K  +   L+  D+ E+ G L+ + + +R+
Subjt:  LSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSALSRELSAIDNNEENGQLTEQDINRRS

Query:  IIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFYKGLFTEKDNRAPLPQIDDLSPISDEQ
          K ++  +  +EEI WRQK ++KW  EGD NS FFH +    R +  I E+ + SG  + N + I++E L +++ L+      +   +  D SPI  E 
Subjt:  IIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFYKGLFTEKDNRAPLPQIDDLSPISDEQ

Query:  NAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPKKIGAKTI------------------V
         + LESPFTE EIY+A+  +  +K PGPDGFT    +  W+++KED+ RVF +F R+GIIN S N ++I LIPKK  ++ I                  V
Subjt:  NAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPKKIGAKTI------------------V

Query:  LSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE------------------------------------------------------
        L+ RL+ VL  TI   Q  FV+ RQI+DA+LI NE++DE  R   E                                                      
Subjt:  LSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE------------------------------------------------------

Query:  ---------------------------DDTILFSSPDKNHINNLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLN
                                   DDTI FSS  +  +  L S +L F   SGL +N  K+   G+ + Q H + LA    CK   WPI YLG PL 
Subjt:  ---------------------------DDTILFSSPDKNHINNLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLN

Query:  GNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLANLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN
        GNP++  FW PVIE+I +RL  W   ++S GGR TLIQ+ L ++P Y+LS+      V   IE++ R +LW G    +  HL+N
Subjt:  GNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLANLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN

A0A438FWU5 LINE-1 retrotransposable element ORF2 protein    4.4e-152    36.15
Query:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------
        I+SWN RG+GS KKR +++ FL+T+NP IV+LQETK  ++DR+   SVW  + + W AL A G SGGI ILW+ S     E V G               
Subjt:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------

Query:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS
             VYGP +   RK  W EL DL  L  P W +GGDFN+ R  SEK   T  T  MR F++FI  +GL D PL N  +TWS+ Q +P    +DRFL S
Subjt:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS

Query:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA
              F  +    L R TSDH PICL     KWGP+PFRF N WL H +  +    WW   + +GW GH FM+KLK +K +LK WN  TF   +E K  
Subjt:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA

Query:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY
        +  +LS ID  E+ G L    +  R++ + ++ ++  KEE+ WRQK ++KW  EGD NS FFH + T  R +  I  ++S  G ++ N +DI +E ++F+
Subjt:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY

Query:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK
          L+++    +   +  D  PIS E   +L+ PFTE E+ RAV  +   K PGPDGFT    ++ W+++KED+ RVF +F  NG+IN S N T+I L+PK
Subjt:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK

Query:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------
        K  +  I                  VLS RL+KVL  TI+  Q  FV+ R I+DA+LI NE++DE  R   E                            
Subjt:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------

Query:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN
                                                                                             DDTI FS     H+ 
Subjt:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN

Query:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA
        NL   +L F + SGL IN +K+   G+   Q+  +SLAS F C++  WP++YLG PL GNP+++ FW PV+E+I +RL  W   ++S GGR TLIQ+ L+
Subjt:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA

Query:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLL
        ++P Y+LS+      +   IEK+ RN+LW G    +  HL+
Subjt:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLL

A0A438G038 Transposon TX1 uncharacterized 149 kDa protein    1.7e-143    35.63
Query:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------
        IISWN RG+GS KKR ++K FL+++ P +V++QETK    DR++  SVWS RN  W AL A G SGGI I+W+    R  EVV G               
Subjt:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------

Query:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS
             VYGPN+S  RK  W EL D+  L  P W +GGDFN+ R SSEK   +  T  M+ F++FI    L D PL +  YTWS+ Q NP    +DRFL S
Subjt:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS

Query:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA
        +     F  +    L R TSDH+PI L     KWGP+PFRF N WL H    +    WW+     GW GH FM+KL+ +K +LK WN+ +F    + K  
Subjt:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA

Query:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY
        +   L+  D+ E+ G L+++ + +R+  K ++  +  +EEI WRQK ++KW  +GD NS FFH +    R +  I E+ + SG  + N + I++E L ++
Subjt:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY

Query:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK
        + L+      +   +  D SPI  E  + LESPFTE EIY+A+  +  +K PGPDGFT    +  W+++KED+ RVF +F R+GIIN S N ++I L+PK
Subjt:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK

Query:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------
        K  ++ I                  VL+ RL+ VL  TI   Q  FV+ RQI+DA+LI NE++DE  R   E                            
Subjt:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------

Query:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN
                                                                                             DDTI FSS  +  + 
Subjt:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN

Query:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA
         L S +L F   SGL +N  K+   G+ I Q H + LA    CK   WPI YLG PL GNP++  FW PVIE+I +RL  W   ++S GGR TLIQ+ L 
Subjt:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA

Query:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN
        ++P Y+LS+      V   IE++ R +LW G    +  HL+N
Subjt:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN

A0A438GDE7 LINE-1 retrotransposable element ORF2 protein    1.3e-143    35.04
Query:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------
        IISWN RG+GS KKR ++K FL ++ P +V+ QETK    DR+   SVW++RN  W AL A G SGGI I+W+       EV+ G               
Subjt:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------

Query:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS
             VYGPN+S  RK LW EL D+  L  P W +GGDFN+ R SSEK   +  T  M+ F+ FI    L D+PL +  +TWS+ Q NP    +DRFL S
Subjt:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS

Query:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA
        +     F  +    L R TSDH+PI L     KWGP+PFRF N WL H    +    WW      GW GH FM+KL+ +K +LK+WN+ +F    + K  
Subjt:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA

Query:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY
        +   L   D+ E+ G L+ + + +R+I K ++  +  +EEI WRQK ++KW  EGD NS FFH +    R +  I E+ + +G+ + N + I++E L ++
Subjt:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY

Query:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK
        + L+T     +   +  D SPIS E    LESPFTE EI +A+  +  +K PGPDGFT    +  W ++KED+ +VF +F R+GIIN S N ++I L+PK
Subjt:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK

Query:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------
        K  ++ I                  VL+ R+++VL  TI   Q  FV+ RQI+DA+LI NE++DE  R   E                            
Subjt:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------

Query:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN
                                                                                             DDTI FSS  +  + 
Subjt:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN

Query:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA
         L + +L F   SGL +N  K+   G+ + Q H + LA    CK   WPI YLG PL GNP++  FW PVIE+I +RL  W   ++S GGR TLIQ+ L 
Subjt:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA

Query:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN
        ++P Y+LS+      V   IE++ R++LW G    +  HL+N
Subjt:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN

A5CAA2 Reverse transcriptase domain-containing protein    1.3e-143    35.04
Query:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------
        IISWN RG+GS KKR ++K FL ++ P +V+ QETK    DR+   SVW++RN  W AL A G SGGI I+W+       EV+ G               
Subjt:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEG---------------

Query:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS
             VYGPN+S  RK LW EL D+  L  P W +GGDFN+ R SSEK   +  T  M+ F+ FI    L D+PL +  +TWS+ Q NP    +DRFL S
Subjt:  -----VYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLIS

Query:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA
        +     F  +    L R TSDH+PI L     KWGP+PFRF N WL H    +    WW      GW GH FM+KL+ +K +LK+WN+ +F    + K  
Subjt:  DNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSA

Query:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY
        +   L   D+ E+ G L+ + + +R+I K ++  +  +EEI WRQK ++KW  EGD NS FFH +    R +  I E+ + +G+ + N + I++E L ++
Subjt:  LSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFY

Query:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK
        + L+T     +   +  D SPIS E    LESPFTE EI +A+  +  +K PGPDGFT    +  W ++KED+ +VF +F R+GIIN S N ++I L+PK
Subjt:  KGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPK

Query:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------
        K  ++ I                  VL+ R+++VL  TI   Q  FV+ RQI+DA+LI NE++DE  R   E                            
Subjt:  KIGAKTI------------------VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE----------------------------

Query:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN
                                                                                             DDTI FSS  +  + 
Subjt:  -------------------------------------------------------------------------------------DDTILFSSPDKNHIN

Query:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA
         L + +L F   SGL +N  K+   G+ + Q H + LA    CK   WPI YLG PL GNP++  FW PVIE+I +RL  W   ++S GGR TLIQ+ L 
Subjt:  NLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQATLA

Query:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN
        ++P Y+LS+      V   IE++ R++LW G    +  HL+N
Subjt:  NLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLN

SwissProt top hits    e value    % identity    Alignment
O00370 LINE-1 retrotransposable element ORF2 protein    1.0e-20    23.77
Query:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALH-AYG--PSGGIAILWNE-SSFRVLEV--------------
        I++ NV G+ S  KR  +  ++ +++P++  +QET L   D    K         W  ++ A G     G+AIL ++ + F+  ++              
Subjt:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALH-AYG--PSGGIAILWNE-SSFRVLEV--------------

Query:  -------VEGVYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDI-----PLSNGKYTWSSFQPNPT
               +  +Y PN+   R F+ Q L DLQ     + ++ GDFN      ++ST        ++ N  +  T L DI     P S  +YT+ S  P+ T
Subjt:  -------VEGVYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDI-----PLSNGKYTWSSFQPNPT

Query:  MTLIDRFLISDNISIKFISAQARKLERST---SDHFPICLSLGKEKWGPSPFR-------FINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLK
         + ID       +  K + ++ ++ E  T   SDH  I L L  +    S           +N +  H ++   +  ++ +N  +           K + 
Subjt:  MTLIDRFLISDNISIKFISAQARKLERST---SDHFPICLSLGKEKWGPSPFR-------FINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLK

Query:  R-ELKLWNQQTFNTQRELKSALSRELSAIDNNEENGQLTEQDINRR---SIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAF--FHHIVTANRRKNA
        R +    N      +R     L+ +L  ++  E+    T    +RR   + I+A++  I T++ +    + +  WF E  +N        ++   R KN 
Subjt:  R-ELKLWNQQTFNTQRELKSALSRELSAIDNNEENGQLTEQDINRR---SIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAF--FHHIVTANRRKNA

Query:  IFEVLSASGRSILNDDDIEQEFLDFYKGLFTEK-DNRAPLPQIDD---LSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILK
        I  + +  G    +  +I+    ++YK L+  K +N   +    D   L  ++ E+   L  P T SEI   ++ + T K+PGPDGFTAEF ++    L 
Subjt:  IFEVLSASGRSILNDDDIEQEFLDFYKGLFTEK-DNRAPLPQIDD---LSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILK

Query:  EDIKRVFNDFFRNGIINASLNETYICLIPK
          + ++F    + GI+  S  E  I LIPK
Subjt:  EDIKRVFNDFFRNGIINASLNETYICLIPK

P08548 LINE-1 reverse transcriptase homolog    4.9e-15    22.59
Query:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDR-KMAKSVWSSRNIAWTALHAYGPSGGIAILWNES-SFRVLEV----------------
        I S NV G+    KR  +  ++    P I  +QE+ L   D+ ++    WSS   A    +      GIAIL+ ++  F+  ++                
Subjt:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDR-KMAKSVWSSRNIAWTALHAYGPSGGIAILWNES-SFRVLEV----------------

Query:  -----VEGVYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDI----PLSNGKYTWSSFQPNPTMTL
             +  +Y PN  N  +F+ + L D+  L     I+ GDFN      ++S+    +  +   N  I+   L DI      +  +YT+ S   + T + 
Subjt:  -----VEGVYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDI----PLSNGKYTWSSFQPNPTMTL

Query:  IDRFLISDNISIKFISAQARKLERSTSDHFPICLSLGKEK--------WGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKL
        ID  L   +   KF   +   +    SDH  I + L   +        W  +     + W+   ++ + +  +   N+ Q           K + R  K 
Subjt:  IDRFLISDNISIKFISAQARKLERSTSDHFPICLSLGKEK--------WGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKL

Query:  WNQQTF--NTQRELKSALSRELSAIDNNEENGQLTEQDINRRSI--IKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAF--FHHIVTANRRKNAIFEVL
           Q F   T+RE  + L   L  ++  E +     +   R+ I  I+A++  I  K  I    K K  WF E  +N       ++    R K+ I  + 
Subjt:  WNQQTF--NTQRELKSALSRELSAIDNNEENGQLTEQDINRRSI--IKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAF--FHHIVTANRRKNAIFEVL

Query:  SASGRSILNDDDIEQEFLDFYKGLFTEKDNRAPLPQIDD------LSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDI
        + +     +  +I++   ++YK L++ K     L +ID       L  +S ++   L  P + SEI   + ++   K+PGPDGFT+EF +     L   +
Subjt:  SASGRSILNDDDIEQEFLDFYKGLFTEKDNRAPLPQIDD------LSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDI

Query:  KRVFNDFFRNGIINASLNETYICLIPK-----------------KIGAKTI--VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE
          +F +  + GI+  +  E  I LIPK                  I AK +  +L+ R+++ +   I   Q  F+   Q    +  +  +I   N+ K +
Subjt:  KRVFNDFFRNGIINASLNETYICLIPK-----------------KIGAKTI--VLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKRE

Query:  DDTILFSSPDK
        D  IL    +K
Subjt:  DDTILFSSPDK

P11369 LINE-1 retrotransposable element ORF2 protein    2.9e-15    23.92
Query:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAW-TALHAYG--PSGGIAILWN--------------ESSFRVLE--
        +IS N+ G+ S  KR  +  +L  ++P    LQET L   DR         R   W T   A G     G+AIL +              E  F +++  
Subjt:  IISWNVRGMGSWKKRALIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAW-TALHAYG--PSGGIAILWN--------------ESSFRVLE--

Query:  ------VVEGVYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDI-----PLSNGKYTWSSFQPNPT
               +  +Y PN +    F+   L  L+A   P+ I+ GDFN    S ++S          K  + ++   L DI     P + G YT+ S  P+ T
Subjt:  ------VVEGVYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNITRWSSEKSTNTTPTCGMRKFNKFIETTGLQDI-----PLSNGKYTWSSFQPNPT

Query:  MTLIDRFLISDNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPF-------RFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKREL
         + ID  +       ++ + +   +    SDH  + L          P          +N  L  + + + +  +   N  +          +K   R  
Subjt:  MTLIDRFLISDNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPF-------RFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKREL

Query:  KLWNQQTFNTQREL--KSALSRELSAIDNNEENGQLTEQDINRRSIIK--ADMMNISTKEEIIWRQKCKLKWFIEGDVNSAF--FHHIVTANRRKNAIFE
        KL        +RE    S+L+  L A++  E N   + +   R+ IIK   ++  + T+   I R      WF E  +N        +   +R K  I +
Subjt:  KLWNQQTFNTQREL--KSALSRELSAIDNNEENGQLTEQDINRRSIIK--ADMMNISTKEEIIWRQKCKLKWFIEGDVNSAF--FHHIVTANRRKNAIFE

Query:  VLSASGRSILNDDDIEQEFLDFYKGLFTEK-DNRAPLPQIDD---LSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDI
        + +  G    + ++I+     FYK L++ K +N   + +  D   +  ++ +Q   L SP +  EI   ++ + T K+PGPDGF+AEF    +   KED+
Subjt:  VLSASGRSILNDDDIEQEFLDFYKGLFTEK-DNRAPLPQIDD---LSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDI

Query:  KRVFNDFFR----NGIINASLNETYICLIPK
          + +  F      G +  S  E  I LIPK
Subjt:  KRVFNDFFR----NGIINASLNETYICLIPK

Arabidopsis top hits    e value    %identity    Alignment
AT1G43760.1 DNAse I-like superfamily protein    2.2e-31    29.09
Query:  ILGGDFNITRWSSEKST---NTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQ-PNPTMTLIDRFLISDNISIKFISAQARKLERSTSDHFPICLSL
        IL GDF+    +S+  +    + P  G+ +F   +  + L DIP     YTWS+ Q  NP +  +DR + + +    F SA A       SDH P  + L
Subjt:  ILGGDFNITRWSSEKST---NTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQ-PNPTMTLIDRFLISDNISIKFISAQARKLERSTSDHFPICLSL

Query:  -GKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTF-NTQRELKSALSRELSAIDN----NEENGQLTEQDIN
            K     FR+ +   +H   L  +   W      G       + LK  K+  KL N+Q F N Q + K AL   L +I +    N  +     + + 
Subjt:  -GKEKWGPSPFRFINGWLSHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTF-NTQRELKSALSRELSAIDN----NEENGQLTEQDIN

Query:  RRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFYKGLF-TEKDNRAP--LPQIDDLS
        R+          +   E  +RQK ++KW  +GD N+ FFH ++ AN+ KN I  +       + N   +++  + +Y  L  ++ D   P  + +I D+ 
Subjt:  RRSIIKADMMNISTKEEIIWRQKCKLKWFIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFYKGLF-TEKDNRAP--LPQIDDLS

Query:  PI--SDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPKKIG
        P   +D   + L +  ++ EI  AV  +  NK PGPD FTAEF  +SW ++K+       +FFR G +    N T I LIPK  G
Subjt:  PI--SDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDGFTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPKKIG


Sequences
CDS sequence
CAAAGGAAATTGAAAAAGGGGATAATCTCCAAATATCGTACAGTTGCACCCTCCAAGGAGCCCCTATTGAGGCTTGTCACATAGAGGATTATCAGACGCAATCAG
ATTGGGCCCCACACCTTCTAGAAGATCACGTGATTCCCCAATCCATAGTTGCCACCACCATCGGTTCCAAAAAACCCTTTCCCCCCAAAATCTTTAGCCATACTC
CCCAATCAGAGGCCCACTCAAATCATCAGCCCCCACGTCAAGCCCCAAAAAGCCCACTTTATCCAGATGGCCCACCCCAAACCTGCACCAATCAACCCTTTTGCC
CTAATCCCGAAGCCCCTACAGACAAAAACCCCTTGACAAACTCCCCACCAACCAGCCCAATCCCATTCATTAATAACCAACATGGCCGGAAAAAGCCCATCATCA
TTGATAATAAGGAAACCTTCCTTCTCACGGGCACTGTACATTTGACTGGATCGACTGACCACCAATCAGATTCAGAAGGAGCTCTTTCCTTCCCATGCTCTCCGA
ATTTGGATGAATCTCCCCCTACTCACCCACAAAAGGCTACCAACCAAGCGGTTTCTCCTCCAACTATATCCTACTTATTCGAATCTTCCGAAGATCAGGTCCTTG
ATATTGAAAGCCCCATTCCGTTGATGATTGAGGAGCCCTTTGTGGCAGTTAATCAGCAAAACCTTGAGATGGAGACCACAGCCTTGGTCGACATCAATGTAGAAG
AATTGACAGGAGATGAATCCGATTCCAAGACCCACTTCCAGCAAAGAGATGCAGCTGCATATCTTCCCTTTCTTTTTCCTTGGCTCGCGGAACATGGCATGTGCA
TCATGCCAATGCCAAACAGAAATAAGCTATCTACTGCAGCAAAAAAGAAGACTCAATGGATAATATCCTGGAATGTTAGGGGTATGGGCTCATGGAAGAAAAGAG
CCCTCATCAAAGTTTTCCTCACAACAAAAAATCCAGCCATTGTTATCCTTCAAGAAACAAAGCTTCACTCTTTTGATAGAAAGATGGCCAAATCAGTTTGGAGTT
CGAGAAACATTGCCTGGACAGCCCTCCACGCCTATGGTCCATCGGGGGGTATTGCCATTCTGTGGAATGAGTCTTCCTTCAGGGTTTTGGAGGTTGTTGAAGGAG
TGTATGGTCCAAATTCTTCAAATGAGAGAAAATTCCTTTGGCAAGAACTTCATGACTTGCAGGCCCTTTGTCTTCCAAATTGGATACTAGGGGGCGATTTCAACA
TTACTAGATGGTCTTCGGAGAAATCGACCAACACAACCCCCACTTGCGGTATGAGGAAATTTAATAAATTCATAGAAACAACAGGCTTACAGGATATTCCGCTCT
CCAACGGAAAATACACATGGTCTAGTTTCCAGCCAAATCCCACTATGACTCTTATTGATAGATTCCTCATATCCGACAATATATCTATCAAATTTATATCAGCAC
AGGCGAGAAAGCTAGAGAGGAGCACATCAGACCACTTTCCTATTTGTCTATCTTTGGGTAAAGAAAAATGGGGCCCATCCCCCTTTCGTTTTATCAACGGATGGT
TATCTCATAAAGATCTCTTGCAGATGGTGGATCACTGGTGGAATTCGAATTCGTTACAAGGCTGGCCGGGTCATGGATTTATGCAAAAGTTGAAAGGGTTGAAAA
GGGAGCTTAAGCTCTGGAATCAGCAAACCTTTAACACACAGAGGGAACTGAAATCTGCTTTAAGCCGTGAGCTATCTGCTATTGACAACAACGAGGAAAATGGCC
AACTAACCGAACAAGATATCAATAGAAGATCGATTATCAAAGCTGATATGATGAATATATCGACCAAGGAAGAAATTATCTGGAGGCAAAAATGCAAACTCAAAT
GGTTTATTGAAGGAGATGTTAATTCAGCCTTTTTCCACCACATTGTTACAGCGAACAGAAGAAAGAACGCCATTTTTGAAGTCCTTTCAGCATCAGGACGAAGTA
TCCTGAATGATGATGATATTGAACAAGAATTTCTTGATTTTTACAAAGGACTCTTCACCGAGAAAGATAATCGCGCTCCTCTCCCACAAATTGACGATTTGAGCC
CCATCTCTGATGAACAAAATGCCTTTCTTGAGTCCCCCTTTACCGAATCAGAAATTTATAGAGCTGTATCTGACATTGGTACCAACAAGACTCCTGGTCCCGATG
GATTCACTGCTGAATTCCTTAAAAAATCCTGGAACATCCTCAAAGAAGACATTAAAAGGGTGTTCAACGATTTTTTTAGGAATGGAATCATAAATGCTAGTCTTA
ATGAGACATATATTTGTCTTATCCCCAAGAAGATCGGAGCAAAAACAATAGTTCTATCAGAAAGACTGAAGAAAGTCCTCCCCCACACGATCACTGGTTTCCAAT
CGACTTTTGTTAAAGATAGGCAAATCATGGATGCTCTTCTTATAACAAATGAGCTTATTGACGAATGGAATAGAAAGAAAAGGGAAGATGACACCATATTATTCT
CCTCGCCAGATAAGAACCACATCAACAACCTTTTCAGCACTATTCTATCGTTTGAAGAGGCATCAGGTCTGAATATAAATTGCCAGAAGACTGAATTTTTGGGCC
TAGGTATCAACCAACAGCACGCTGCCTCCTTAGCCTCTTCCTTCGGGTGTAAGTTGGGATCGTGGCCGATTACATATCTTGGTTTTCCTCTTAATGGCAACCCCC
GTTCGTTGAATTTCTGGTCTCCAGTGATTGAAAAGATTGAAAAAAGATTGCACAATTGGGGTTCCAACCACATATCCAAAGGAGGCCGTCACACTCTTATACAAG
CTACTCTAGCAAACCTCCCAGTCTACTACCTCTCCATTTCCCTTGCCCACATGAAAGTCACAAAAGCTATTGAAAAACTATTCCGAAACTACCTATGGAGAGGGA
CTCACGGAAATAGAGGCAGGCACCTTCTAAATGTTGGCTTTTTGGCCCTTAATCTCAAATTAATTAATTTTAATTAA
mRNA sequence
CAAAGGAAATTGAAAAAGGGGATAATCTCCAAATATCGTACAGTTGCACCCTCCAAGGAGCCCCTATTGAGGCTTGTCACATAGAGGATTATCAGACGCAATCAG
ATTGGGCCCCACACCTTCTAGAAGATCACGTGATTCCCCAATCCATAGTTGCCACCACCATCGGTTCCAAAAAACCCTTTCCCCCCAAAATCTTTAGCCATACTC
CCCAATCAGAGGCCCACTCAAATCATCAGCCCCCACGTCAAGCCCCAAAAAGCCCACTTTATCCAGATGGCCCACCCCAAACCTGCACCAATCAACCCTTTTGCC
CTAATCCCGAAGCCCCTACAGACAAAAACCCCTTGACAAACTCCCCACCAACCAGCCCAATCCCATTCATTAATAACCAACATGGCCGGAAAAAGCCCATCATCA
TTGATAATAAGGAAACCTTCCTTCTCACGGGCACTGTACATTTGACTGGATCGACTGACCACCAATCAGATTCAGAAGGAGCTCTTTCCTTCCCATGCTCTCCGA
ATTTGGATGAATCTCCCCCTACTCACCCACAAAAGGCTACCAACCAAGCGGTTTCTCCTCCAACTATATCCTACTTATTCGAATCTTCCGAAGATCAGGTCCTTG
ATATTGAAAGCCCCATTCCGTTGATGATTGAGGAGCCCTTTGTGGCAGTTAATCAGCAAAACCTTGAGATGGAGACCACAGCCTTGGTCGACATCAATGTAGAAG
AATTGACAGGAGATGAATCCGATTCCAAGACCCACTTCCAGCAAAGAGATGCAGCTGCATATCTTCCCTTTCTTTTTCCTTGGCTCGCGGAACATGGCATGTGCA
TCATGCCAATGCCAAACAGAAATAAGCTATCTACTGCAGCAAAAAAGAAGACTCAATGGATAATATCCTGGAATGTTAGGGGTATGGGCTCATGGAAGAAAAGAG
CCCTCATCAAAGTTTTCCTCACAACAAAAAATCCAGCCATTGTTATCCTTCAAGAAACAAAGCTTCACTCTTTTGATAGAAAGATGGCCAAATCAGTTTGGAGTT
CGAGAAACATTGCCTGGACAGCCCTCCACGCCTATGGTCCATCGGGGGGTATTGCCATTCTGTGGAATGAGTCTTCCTTCAGGGTTTTGGAGGTTGTTGAAGGAG
TGTATGGTCCAAATTCTTCAAATGAGAGAAAATTCCTTTGGCAAGAACTTCATGACTTGCAGGCCCTTTGTCTTCCAAATTGGATACTAGGGGGCGATTTCAACA
TTACTAGATGGTCTTCGGAGAAATCGACCAACACAACCCCCACTTGCGGTATGAGGAAATTTAATAAATTCATAGAAACAACAGGCTTACAGGATATTCCGCTCT
CCAACGGAAAATACACATGGTCTAGTTTCCAGCCAAATCCCACTATGACTCTTATTGATAGATTCCTCATATCCGACAATATATCTATCAAATTTATATCAGCAC
AGGCGAGAAAGCTAGAGAGGAGCACATCAGACCACTTTCCTATTTGTCTATCTTTGGGTAAAGAAAAATGGGGCCCATCCCCCTTTCGTTTTATCAACGGATGGT
TATCTCATAAAGATCTCTTGCAGATGGTGGATCACTGGTGGAATTCGAATTCGTTACAAGGCTGGCCGGGTCATGGATTTATGCAAAAGTTGAAAGGGTTGAAAA
GGGAGCTTAAGCTCTGGAATCAGCAAACCTTTAACACACAGAGGGAACTGAAATCTGCTTTAAGCCGTGAGCTATCTGCTATTGACAACAACGAGGAAAATGGCC
AACTAACCGAACAAGATATCAATAGAAGATCGATTATCAAAGCTGATATGATGAATATATCGACCAAGGAAGAAATTATCTGGAGGCAAAAATGCAAACTCAAAT
GGTTTATTGAAGGAGATGTTAATTCAGCCTTTTTCCACCACATTGTTACAGCGAACAGAAGAAAGAACGCCATTTTTGAAGTCCTTTCAGCATCAGGACGAAGTA
TCCTGAATGATGATGATATTGAACAAGAATTTCTTGATTTTTACAAAGGACTCTTCACCGAGAAAGATAATCGCGCTCCTCTCCCACAAATTGACGATTTGAGCC
CCATCTCTGATGAACAAAATGCCTTTCTTGAGTCCCCCTTTACCGAATCAGAAATTTATAGAGCTGTATCTGACATTGGTACCAACAAGACTCCTGGTCCCGATG
GATTCACTGCTGAATTCCTTAAAAAATCCTGGAACATCCTCAAAGAAGACATTAAAAGGGTGTTCAACGATTTTTTTAGGAATGGAATCATAAATGCTAGTCTTA
ATGAGACATATATTTGTCTTATCCCCAAGAAGATCGGAGCAAAAACAATAGTTCTATCAGAAAGACTGAAGAAAGTCCTCCCCCACACGATCACTGGTTTCCAAT
CGACTTTTGTTAAAGATAGGCAAATCATGGATGCTCTTCTTATAACAAATGAGCTTATTGACGAATGGAATAGAAAGAAAAGGGAAGATGACACCATATTATTCT
CCTCGCCAGATAAGAACCACATCAACAACCTTTTCAGCACTATTCTATCGTTTGAAGAGGCATCAGGTCTGAATATAAATTGCCAGAAGACTGAATTTTTGGGCC
TAGGTATCAACCAACAGCACGCTGCCTCCTTAGCCTCTTCCTTCGGGTGTAAGTTGGGATCGTGGCCGATTACATATCTTGGTTTTCCTCTTAATGGCAACCCCC
GTTCGTTGAATTTCTGGTCTCCAGTGATTGAAAAGATTGAAAAAAGATTGCACAATTGGGGTTCCAACCACATATCCAAAGGAGGCCGTCACACTCTTATACAAG
CTACTCTAGCAAACCTCCCAGTCTACTACCTCTCCATTTCCCTTGCCCACATGAAAGTCACAAAAGCTATTGAAAAACTATTCCGAAACTACCTATGGAGAGGGA
CTCACGGAAATAGAGGCAGGCACCTTCTAAATGTTGGCTTTTTGGCCCTTAATCTCAAATTAATTAATTTTAATTAA
Protein sequence
KEIEKGDNLQISYSCTLQGAPIEACHIEDYQTQSDWAPHLLEDHVIPQSIVATTIGSKKPFPPKIFSHTPQSEAHSNHQPPRQAPKSPLYPDGPPQTCTNQPFCP
NPEAPTDKNPLTNSPPTSPIPFINNQHGRKKPIIIDNKETFLLTGTVHLTGSTDHQSDSEGALSFPCSPNLDESPPTHPQKATNQAVSPPTISYLFESSEDQVLD
IESPIPLMIEEPFVAVNQQNLEMETTALVDINVEELTGDESDSKTHFQQRDAAAYLPFLFPWLAEHGMCIMPMPNRNKLSTAAKKKTQWIISWNVRGMGSWKKRA
LIKVFLTTKNPAIVILQETKLHSFDRKMAKSVWSSRNIAWTALHAYGPSGGIAILWNESSFRVLEVVEGVYGPNSSNERKFLWQELHDLQALCLPNWILGGDFNI
TRWSSEKSTNTTPTCGMRKFNKFIETTGLQDIPLSNGKYTWSSFQPNPTMTLIDRFLISDNISIKFISAQARKLERSTSDHFPICLSLGKEKWGPSPFRFINGWL
SHKDLLQMVDHWWNSNSLQGWPGHGFMQKLKGLKRELKLWNQQTFNTQRELKSALSRELSAIDNNEENGQLTEQDINRRSIIKADMMNISTKEEIIWRQKCKLKW
FIEGDVNSAFFHHIVTANRRKNAIFEVLSASGRSILNDDDIEQEFLDFYKGLFTEKDNRAPLPQIDDLSPISDEQNAFLESPFTESEIYRAVSDIGTNKTPGPDG
FTAEFLKKSWNILKEDIKRVFNDFFRNGIINASLNETYICLIPKKIGAKTIVLSERLKKVLPHTITGFQSTFVKDRQIMDALLITNELIDEWNRKKREDDTILFS
SPDKNHINNLFSTILSFEEASGLNINCQKTEFLGLGINQQHAASLASSFGCKLGSWPITYLGFPLNGNPRSLNFWSPVIEKIEKRLHNWGSNHISKGGRHTLIQA
TLANLPVYYLSISLAHMKVTKAIEKLFRNYLWRGTHGNRGRHLLNVGFLALNLKLINFN