; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0001987 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0001987
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr4:37943513..37948541
RNA-Seq ExpressionLag0001987
SyntenyLag0001987
Gene Ontology termsNA
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR026960 - Reverse transcriptase zinc-binding domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
CAN62318.1 hypothetical protein VITISV_026527 [Vitis vinifera]3.2e-20036.93Show/hide
Query:  MKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADG
        MK ++WNVRGLGS  KR ++K  ++ +NP +V++QETK  + +R  + S+W+  +  W +L  +GASGGILI+W   +   +E + G FSIS+   +   
Subjt:  MKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADG

Query:  FSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFL
           W+SAVYGPN   L  DFW EL D+ GL    W +GGDFNV R S EK    S++ +M+ F+ +I   +L+D PL+N  F+WS+  E      LDRFL
Subjt:  FSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFL

Query:  ITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQRSMTSQLP
         + +    F   +   + R TSDH+P+ +      WGP PFRFEN WLQ ++FK     WWS     GW G  FM +L+ +K  L+ WNK       +  
Subjt:  ITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQRSMTSQLP

Query:  TLIL-ELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIA
          IL +L   D+IE         +  R   + ++E L  +E I W Q+ K++W+++GD N+ F+H++   ++ +  + E+ +   + L +A  I  E + 
Subjt:  TLIL-ELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIA

Query:  FYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLI
        +++ LYT  +   +    LDWS I+ + A+ L+  FTEEE+  A+  L   K+ GP+GFT   F+  W +IK D++ +  EF+ SG+IN + N ++I LI
Subjt:  FYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLI

Query:  PKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKA
        PKK  +K +SD+RPISLI   YKIIA+VL  RLR VL  TI   Q AFV  RQIL A LIANE++D+   S ++GVV K+D EKA+D V WDFLD VL+ 
Subjt:  PKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKA

Query:  KGFGLKWRNWIRGCISSVNYSIIINGRPR--------VQSGKVIS--------------------------HPVGISSVRLTHLQFADDTLLFSIFYLKA
        KGF  +WR W+ GC+SSV+Y+I++NG  +        ++ G  +S                            VG +  R++HLQFADDT+ FS    + 
Subjt:  KGFGLKWRNWIRGCISSVNYSIIINGRPR--------VQSGKVIS--------------------------HPVGISSVRLTHLQFADDTLLFSIFYLKA

Query:  LENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQAT
        L+ L +L+ +F   SGL +N  KS   GINL+   +  L     CK   WP  YL +PLGGNPK   FW PV+E+I  +L GW+  ++S GGR TLIQ+ 
Subjt:  LENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQAT

Query:  LSSIPIYIFRCLS----SRSRLPKFWRSLL-------EIFSGKDLEEMVRVLGN---GHNIYFWHDCWLPCGIIAEAYPRLFRLTTRSTAMVDQVWNSSN
        L+ +P Y            +++ +  R  L       +  S  ++   V+   N   G   YF  +      ++ E  PRLFR+       +  V   S 
Subjt:  LSSIPIYIFRCLS----SRSRLPKFWRSLL-------EIFSGKDLEEMVRVLGN---GHNIYFWHDCWLPCGIIAEAYPRLFRLTTRSTAMVDQVWNSSN

Query:  A-AWDLGLRRNLNDLEVIEWTSLSHILSSVTIR-SVNDSWCWPLDHSKSFSVKSLMSDMLTDCDSTPSNFYLE-LWKDVYPKKIKIFLWELSHGGINTAD
           W+L  RRNL+D E+ +   L   L  + +  SV D+  WPL  S  FSVKS     L+    +P NF  + +W    P K+K F+W ++H  +NT D
Subjt:  A-AWDLGLRRNLNDLEVIEWTSLSHILSSVTIR-SVNDSWCWPLDHSKSFSVKSLMSDMLTDCDSTPSNFYLE-LWKDVYPKKIKIFLWELSHGGINTAD

Query:  RLQRRMPHLTLSPSCCVMCYSAPENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGILDYVFMG
         LQ R P+  LSP  C++C    E+  HLF+HC      W  +        + P  IY ++   F G
Subjt:  RLQRRMPHLTLSPSCCVMCYSAPENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGILDYVFMG

RVW16209.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]2.9e-20135.46Show/hide
Query:  MKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADG
        MK ++WNVRGLGS  KR +IK  ++ +NP +V++QETK  + +R  + S+W+  +  W +L ++GASGGILI+W       +E + G FS+S+   +   
Subjt:  MKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADG

Query:  FSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFL
           W+SAVYGPN   L  DFW EL D+ GL    W +GGDFNV R S EK    S++ +M+ F+ +I+  +L+D PL+N  F+WS+  E      LDRFL
Subjt:  FSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFL

Query:  ITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQRSMTSQLP
         + +    F   +   + R TSDH+P+ +      WGP PFRFEN WLQ ++FK     WWS     GW G  FM +L+ +K  L+ WNK          
Subjt:  ITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQRSMTSQLP

Query:  TLILELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIAF
                               E +   + ++E L  +E I W Q+ K++W+++GD N+ F+H++   R+ +  I E+ +   + L +A  I +E + +
Subjt:  TLILELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIAF

Query:  YKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLIP
        ++ LYT  +   +    LDWSPI+ + A+ LE  FTEEE+  A+  L   K+PGP+GFT   F+  W +IK D++ +  EF+ SG+IN + N ++I LIP
Subjt:  YKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLIP

Query:  KKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKAK
        KK  +K +SD+RPISLI   YKIIA+VLS RLR VL  TI   Q AFV  RQILDA LIANE++D+   S ++GVV K+D EKA+D V WDFLD +L+ K
Subjt:  KKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKAK

Query:  GFGLKWRNWIRGCISSVNYSIIINGRPR--------VQSGKVIS--------------------------HPVGISSVRLTHLQFADDTLLFSIFYLKAL
        GF  +WR W+ GC+SSV+++I++NG  +        ++ G  +S                            VG +  R++HLQFADD + FS    + L
Subjt:  GFGLKWRNWIRGCISSVNYSIIINGRPR--------VQSGKVIS--------------------------HPVGISSVRLTHLQFADDTLLFSIFYLKAL

Query:  ENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQATL
        + L +L+ +F    GL +N  KS   GINL+   +  L     CK   WP  YLG+PLGGNPK+  FW PV+E+I  +L GW+ A++S GGR TLIQ+ L
Subjt:  ENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQATL

Query:  SSIPIYIFRCL------------------------SSRSRL--------PK---------------------FWR-----------SLLEIF----SGKD
        + +P Y                               R  L        PK                      WR            +L I+    +G D
Subjt:  SSIPIYIFRCL------------------------SSRSRL--------PK---------------------FWR-----------SLLEIF----SGKD

Query:  LEEMVR-----------------------VLGNGHNIYFWHDCWLPCGIIAEAYPRLFRLTTRSTAMVDQVWNSSNA-AWDLGLRRNLNDLEVIEWTSLS
           +VR                       V GNG  I FW D W     +   YPRLFR+       +  V   S    W+L  RRNL+D E+ +   L 
Subjt:  LEEMVR-----------------------VLGNGHNIYFWHDCWLPCGIIAEAYPRLFRLTTRSTAMVDQVWNSSNA-AWDLGLRRNLNDLEVIEWTSLS

Query:  HILSSVTIR-SVNDSWCWPLDHSKSFSVKSL---MSDMLTDCDSTPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYSA
          L  + +  SV D+  WPL  S  FSVKS    +S       + PS F   +W    P K+K F+W ++H  +NT D LQ R P+  LSP  C++C   
Subjt:  HILSSVTIR-SVNDSWCWPLDHSKSFSVKSL---MSDMLTDCDSTPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYSA

Query:  PENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGILDYVFMG
         E+  HLF+HC      W  +        + P +IY ++   F G
Subjt:  PENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGILDYVFMG

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]8.7e-20635.88Show/hide
Query:  MKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADG
        MK L+WN RGLGS +KR ++++ +  QNP +V+LQETK  + +R  + S+W    + W +L + GASGGI+I+W        E + G FS+++     + 
Subjt:  MKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADG

Query:  FSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFL
         SFWL++VYGP       DFW EL DL GL   +W +GGDFNV R   EK  +  ++ NM+ F+ +I    L+D PL+N  F+WS+         LDRFL
Subjt:  FSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFL

Query:  ITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQRSMTSQLP
         +++    F       + R TSDH P+ L    + WGP PFRFEN WL    FK     WW +   +GW G  FM KLK +K  L+ WN        +  
Subjt:  ITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQRSMTSQLP

Query:  TLIL-ELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIA
         LIL +L ++D IE         +  R L R ++E++  KE +QW Q+ +++W+++GD N+ FFHR+   R+ +  I  ++S    +L +  DI +E + 
Subjt:  TLIL-ELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIA

Query:  FYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLI
        F+  LY+K     +    +DW PI+G+    L++ FTEEEV  AV  L   K+PGP+GFT   ++  W +IK D+M +  EF+ +GVIN + N T+I L+
Subjt:  FYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLI

Query:  PKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKA
        PKK  +  +SDYRPISL+   YKIIA+VLS RLR VL  TI+++Q AFV  R ILDA LIANEV+D+   S ++G+V K+D EKA+D VDW FLD VL+ 
Subjt:  PKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKA

Query:  KGFGLKWRNWIRGCISSVNYSIIINGRPR----------------------------------VQSGKVISHPVGISSVRLTHLQFADDTLLFSIFYLKA
        KGF  KWR WIRGC+SS +++I++NG  +                                   ++G      VG    R++ LQFADDT+ FS   ++ 
Subjt:  KGFGLKWRNWIRGCISSVNYSIIINGRPR----------------------------------VQSGKVISHPVGISSVRLTHLQFADDTLLFSIFYLKA

Query:  LENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQAT
        L+NL  ++ +F   SGL IN  KS   GIN     +  L + F C+   WP +YLG+PLGGNPK + FW PV+E+I ++L GWK A++S GGR TLIQ+ 
Subjt:  LENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQAT

Query:  LSSIPIYIFRCL----SSRSRLPKFWRSLL---------------EIFS-------------------------------------------------GK
        LS IP Y         S  S++ K  R+ L               E+ S                                                 G 
Subjt:  LSSIPIYIFRCL----SSRSRLPKFWRSLL---------------EIFS-------------------------------------------------GK

Query:  DLEEMVR-----------------------VLGNGHNIYFWHDCWLPCGIIAEAYPRLFR-LTTRSTAMVDQVWNSSNAAWDLGLRRNLNDLEVIEWTSL
        D   +VR                       V+GNG  I FW D W     +   +  L+R ++ ++  + + + NS   AW+L  RRNL D E+     L
Subjt:  DLEEMVR-----------------------VLGNGHNIYFWHDCWLPCGIIAEAYPRLFR-LTTRSTAMVDQVWNSSNAAWDLGLRRNLNDLEVIEWTSL

Query:  SHILSSVT-IRSVNDSWCWPLDHSKSFSVKSL---MSDMLTDCDSTPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYS
           LSSV    S+ DS  W L  S  F+VKS    +S +       P+ F   LW    P K+K   W ++HG +NT D+LQ R P+ +L P  C++C  
Subjt:  SHILSSVT-IRSVNDSWCWPLDHSKSFSVKSL---MSDMLTDCDSTPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYS

Query:  APENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGILDYVF--MGHPFHGK
          E+  HLF+HC      W  +    G   + P +   +L   F  +G+   GK
Subjt:  APENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGILDYVF--MGHPFHGK

RVW67743.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]5.3e-20335.56Show/hide
Query:  KFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGAS-----GGILIMWSELDFVVKETIQGLFSISIHI-
        K L+WN RGLG  +KR  ++  +  QNP +  LQETK    +R  + S+W    + W +L + GA+     GGI+I+W  + F   E + G FS+++ + 
Subjt:  KFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGAS-----GGILIMWSELDFVVKETIQGLFSISIHI-

Query:  FMADGFSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSL
        F  +GF FWL++VYGPN+     DFW EL DL GL   +W +GGDFNV R   EK  D  ++ NM+ F+ +I    L+D PL+N  F+WS+         
Subjt:  FMADGFSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSL

Query:  LDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWN-KEQRS
        LDRFL +++    F   +   + R TSDH P+ L      WGP PFRFEN WL    FK     WW +  ++GW G  FM KLK +K  L+ WN +    
Subjt:  LDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWN-KEQRS

Query:  MTSQLPTLILELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIE
        +  +   ++ +L ++D IE      L  +  R L R+++E+L  KE +QW Q+ +++W+++GD N+ FFHR+   R+ +  I  ++ R+  +L +   I 
Subjt:  MTSQLPTLILELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIE

Query:  KEFIAFYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNET
        +E + F+  LY+K     ++   +DW+PI+ + A+ L++ F+EEE+   V  L   K+P P+GFT   ++  W +IK D+M +  EF+  GVIN + N T
Subjt:  KEFIAFYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNET

Query:  YICLIPKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLD
        +I ++PKK     +SDYRPISL+   YKIIA+VLS RLR VL  TI  +Q AFV  RQILDA LIANEV+D+   S ++GVV K+D EKA+D VDW FLD
Subjt:  YICLIPKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLD

Query:  AVLKAKGFGLKWRNWIRGCISSVNYSIIINGRPR----------------------------------VQSGKVISHPVGISSVRLTHLQFADDTLLFSI
         VL+ KGF  KWR+W+RGC+SS +++I++NG  +                                   ++G      +G    R++ LQFADDT+ FS 
Subjt:  AVLKAKGFGLKWRNWIRGCISSVNYSIIINGRPR----------------------------------VQSGKVISHPVGISSVRLTHLQFADDTLLFSI

Query:  FYLKALENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHT
          L  L+NL  +  +F   SGL IN  K+   GIN     +  L     C+   W  +YLG+PLGGNPK + FW PV+E+I ++L GWK A++S GGR T
Subjt:  FYLKALENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHT

Query:  LIQATLSSIPIYIF------------------------------------------RCLSSRS----------------------------RLPKFWRSL
        LIQ+ LS IP Y                                            R +  RS                            R P  W+++
Subjt:  LIQATLSSIPIYIF------------------------------------------RCLSSRS----------------------------RLPKFWRSL

Query:  LEIFSGKDLEEMVR-VLGNGHNIYFWHDCWLPCGIIAEAYPRLFR-LTTRSTAMVDQVWNSSNAAWDLGLRRNLNDLEVIEWTSLSHILSSVTIR-SVND
         ++F  ++    VR V+GNG  I FW D W     +   +  L+R ++ ++  +++ + N    +W+   RRNL D E+     L   L SV +  S +D
Subjt:  LEIFSGKDLEEMVR-VLGNGHNIYFWHDCWLPCGIIAEAYPRLFR-LTTRSTAMVDQVWNSSNAAWDLGLRRNLNDLEVIEWTSLSHILSSVTIR-SVND

Query:  SWCWPLDHSKSFSVKSLMSDMLTDCDS---TPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYSAPENPCHLFVHCLYA
        S  W L  S SFSVKS    +L D +     P+ F   LW    P K+K   W ++HG +NT D+LQ R P+  L P  C++C    E+  HLF+HC   
Subjt:  SWCWPLDHSKSFSVKSLMSDMLTDCDS---TPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYSAPENPCHLFVHCLYA

Query:  SKYWLIMKNAFGWSLILPNNIYGILDYVF--MGHPFHGK
           W  + N  G   + P +I  +L   F  +G+   GK
Subjt:  SKYWLIMKNAFGWSLILPNNIYGILDYVF--MGHPFHGK

RVW93866.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]7.1e-20836.65Show/hide
Query:  MKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADG
        MK ++WNVRGLGS  KR +IK  ++ +NP +V++QETK  + +R  + S+W+  +  W +L ++GASGGILI+W       +E + G FS+S+   +   
Subjt:  MKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADG

Query:  FSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFL
           W+S VYGPN   L  DFW EL D+ GL    W +GGDFNV R S EK    S++ +M+ F+ +I+  +L+D PL+N  F+WS+  E      LDRFL
Subjt:  FSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFL

Query:  ITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQ-RSMTSQL
         + +    F   +   + R TSDH+P+ +      WGP PFRFEN WLQ ++FK     WWS     GW G+ FM +L+ +K  L+ W K     +  + 
Subjt:  ITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQ-RSMTSQL

Query:  PTLILELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIA
         +++ +L   D+IE         I  R   + ++E L  +E I W Q+ K++W+++GD N+ F+H++   R+ +  I E+ +   + L +A  I +E + 
Subjt:  PTLILELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIA

Query:  FYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLI
        +++ LYT  +   +    LDWSPI+ + A+ LE  FTEEE+  A+  L   K+ GP+GFT   F+  W +IK D++ +  EF+ SG+IN + N ++I LI
Subjt:  FYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLI

Query:  PKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKA
        PKK  +K +SD+RPISLI   YKII +VLS RLR VL  TI   Q AFV  RQILDA LIANE++D+   S ++GVV K+D EKA+D V WDFLD VL+ 
Subjt:  PKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKA

Query:  KGFGLKWRNWIRGCISSVNYSIIINGRPR--------VQSGKVIS--------------------------HPVGISSVRLTHLQFADDTLLFSIFYLKA
        KGF  +WR W+ GC+SSV+++I++NG  +        ++ G  +S                            VG +  R++HLQFADDT+ FS    + 
Subjt:  KGFGLKWRNWIRGCISSVNYSIIINGRPR--------VQSGKVIS--------------------------HPVGISSVRLTHLQFADDTLLFSIFYLKA

Query:  LENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQAT
        L+ L +L+ +F   SGL +N  KS   GINL+   +  L     CK   WP  YLG+PLGGNPK   FW PV+E+I  +L GW+ A++S  GR TLIQ+ 
Subjt:  LENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQAT

Query:  LSSIPIYIFRCL------------------------SSRSRLPKF------------------WRSLLEIFSGKDLEEMVRVLGNGHNIYFWHDCWLPCG
        L+ +P Y                               R  L ++                  W+++ ++F    L     V+GNG  I FW D W    
Subjt:  LSSIPIYIFRCL------------------------SSRSRLPKF------------------WRSLLEIFSGKDLEEMVRVLGNGHNIYFWHDCWLPCG

Query:  IIAEAYPRLFRLTTRSTAMVDQVWNSSNA-AWDLGLRRNLNDLEVIEWTSLSHILSSVTIR-SVNDSWCWPLDHSKSFSVKSL---MSDMLTDCDSTPSN
         +   YPRLFR+       +  V   S    W+L  RRNL+D E+ +   L   L  + +  SV D+  WPL  S  FSVKS    +S       + PS 
Subjt:  IIAEAYPRLFRLTTRSTAMVDQVWNSSNA-AWDLGLRRNLNDLEVIEWTSLSHILSSVTIR-SVNDSWCWPLDHSKSFSVKSL---MSDMLTDCDSTPSN

Query:  FYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYSAPENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGILDYVFMG
        F   +W    P K+K F+W ++H  +NT D LQ R P+  LSP  C++C    E+  HLF+HC      W  +        + P +IY ++   F G
Subjt:  FYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYSAPENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGILDYVFMG

TrEMBL top hitse value%identityAlignment
A0A438FWU5 LINE-1 retrotransposable element ORF2 protein4.2e-20635.88Show/hide
Query:  MKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADG
        MK L+WN RGLGS +KR ++++ +  QNP +V+LQETK  + +R  + S+W    + W +L + GASGGI+I+W        E + G FS+++     + 
Subjt:  MKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADG

Query:  FSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFL
         SFWL++VYGP       DFW EL DL GL   +W +GGDFNV R   EK  +  ++ NM+ F+ +I    L+D PL+N  F+WS+         LDRFL
Subjt:  FSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFL

Query:  ITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQRSMTSQLP
         +++    F       + R TSDH P+ L    + WGP PFRFEN WL    FK     WW +   +GW G  FM KLK +K  L+ WN        +  
Subjt:  ITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQRSMTSQLP

Query:  TLIL-ELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIA
         LIL +L ++D IE         +  R L R ++E++  KE +QW Q+ +++W+++GD N+ FFHR+   R+ +  I  ++S    +L +  DI +E + 
Subjt:  TLIL-ELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIA

Query:  FYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLI
        F+  LY+K     +    +DW PI+G+    L++ FTEEEV  AV  L   K+PGP+GFT   ++  W +IK D+M +  EF+ +GVIN + N T+I L+
Subjt:  FYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLI

Query:  PKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKA
        PKK  +  +SDYRPISL+   YKIIA+VLS RLR VL  TI+++Q AFV  R ILDA LIANEV+D+   S ++G+V K+D EKA+D VDW FLD VL+ 
Subjt:  PKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKA

Query:  KGFGLKWRNWIRGCISSVNYSIIINGRPR----------------------------------VQSGKVISHPVGISSVRLTHLQFADDTLLFSIFYLKA
        KGF  KWR WIRGC+SS +++I++NG  +                                   ++G      VG    R++ LQFADDT+ FS   ++ 
Subjt:  KGFGLKWRNWIRGCISSVNYSIIINGRPR----------------------------------VQSGKVISHPVGISSVRLTHLQFADDTLLFSIFYLKA

Query:  LENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQAT
        L+NL  ++ +F   SGL IN  KS   GIN     +  L + F C+   WP +YLG+PLGGNPK + FW PV+E+I ++L GWK A++S GGR TLIQ+ 
Subjt:  LENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQAT

Query:  LSSIPIYIFRCL----SSRSRLPKFWRSLL---------------EIFS-------------------------------------------------GK
        LS IP Y         S  S++ K  R+ L               E+ S                                                 G 
Subjt:  LSSIPIYIFRCL----SSRSRLPKFWRSLL---------------EIFS-------------------------------------------------GK

Query:  DLEEMVR-----------------------VLGNGHNIYFWHDCWLPCGIIAEAYPRLFR-LTTRSTAMVDQVWNSSNAAWDLGLRRNLNDLEVIEWTSL
        D   +VR                       V+GNG  I FW D W     +   +  L+R ++ ++  + + + NS   AW+L  RRNL D E+     L
Subjt:  DLEEMVR-----------------------VLGNGHNIYFWHDCWLPCGIIAEAYPRLFR-LTTRSTAMVDQVWNSSNAAWDLGLRRNLNDLEVIEWTSL

Query:  SHILSSVT-IRSVNDSWCWPLDHSKSFSVKSL---MSDMLTDCDSTPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYS
           LSSV    S+ DS  W L  S  F+VKS    +S +       P+ F   LW    P K+K   W ++HG +NT D+LQ R P+ +L P  C++C  
Subjt:  SHILSSVT-IRSVNDSWCWPLDHSKSFSVKSL---MSDMLTDCDSTPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYS

Query:  APENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGILDYVF--MGHPFHGK
          E+  HLF+HC      W  +    G   + P +   +L   F  +G+   GK
Subjt:  APENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGILDYVF--MGHPFHGK

A0A438IAX3 Transposon TX1 uncharacterized 149 kDa protein3.4e-20836.65Show/hide
Query:  MKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADG
        MK ++WNVRGLGS  KR +IK  ++ +NP +V++QETK  + +R  + S+W+  +  W +L ++GASGGILI+W       +E + G FS+S+   +   
Subjt:  MKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADG

Query:  FSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFL
           W+S VYGPN   L  DFW EL D+ GL    W +GGDFNV R S EK    S++ +M+ F+ +I+  +L+D PL+N  F+WS+  E      LDRFL
Subjt:  FSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFL

Query:  ITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQ-RSMTSQL
         + +    F   +   + R TSDH+P+ +      WGP PFRFEN WLQ ++FK     WWS     GW G+ FM +L+ +K  L+ W K     +  + 
Subjt:  ITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQ-RSMTSQL

Query:  PTLILELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIA
         +++ +L   D+IE         I  R   + ++E L  +E I W Q+ K++W+++GD N+ F+H++   R+ +  I E+ +   + L +A  I +E + 
Subjt:  PTLILELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIA

Query:  FYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLI
        +++ LYT  +   +    LDWSPI+ + A+ LE  FTEEE+  A+  L   K+ GP+GFT   F+  W +IK D++ +  EF+ SG+IN + N ++I LI
Subjt:  FYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLI

Query:  PKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKA
        PKK  +K +SD+RPISLI   YKII +VLS RLR VL  TI   Q AFV  RQILDA LIANE++D+   S ++GVV K+D EKA+D V WDFLD VL+ 
Subjt:  PKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKA

Query:  KGFGLKWRNWIRGCISSVNYSIIINGRPR--------VQSGKVIS--------------------------HPVGISSVRLTHLQFADDTLLFSIFYLKA
        KGF  +WR W+ GC+SSV+++I++NG  +        ++ G  +S                            VG +  R++HLQFADDT+ FS    + 
Subjt:  KGFGLKWRNWIRGCISSVNYSIIINGRPR--------VQSGKVIS--------------------------HPVGISSVRLTHLQFADDTLLFSIFYLKA

Query:  LENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQAT
        L+ L +L+ +F   SGL +N  KS   GINL+   +  L     CK   WP  YLG+PLGGNPK   FW PV+E+I  +L GW+ A++S  GR TLIQ+ 
Subjt:  LENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQAT

Query:  LSSIPIYIFRCL------------------------SSRSRLPKF------------------WRSLLEIFSGKDLEEMVRVLGNGHNIYFWHDCWLPCG
        L+ +P Y                               R  L ++                  W+++ ++F    L     V+GNG  I FW D W    
Subjt:  LSSIPIYIFRCL------------------------SSRSRLPKF------------------WRSLLEIFSGKDLEEMVRVLGNGHNIYFWHDCWLPCG

Query:  IIAEAYPRLFRLTTRSTAMVDQVWNSSNA-AWDLGLRRNLNDLEVIEWTSLSHILSSVTIR-SVNDSWCWPLDHSKSFSVKSL---MSDMLTDCDSTPSN
         +   YPRLFR+       +  V   S    W+L  RRNL+D E+ +   L   L  + +  SV D+  WPL  S  FSVKS    +S       + PS 
Subjt:  IIAEAYPRLFRLTTRSTAMVDQVWNSSNA-AWDLGLRRNLNDLEVIEWTSLSHILSSVTIR-SVNDSWCWPLDHSKSFSVKSL---MSDMLTDCDSTPSN

Query:  FYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYSAPENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGILDYVFMG
        F   +W    P K+K F+W ++H  +NT D LQ R P+  LSP  C++C    E+  HLF+HC      W  +        + P +IY ++   F G
Subjt:  FYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYSAPENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGILDYVFMG

M5VS59 Reverse transcriptase domain-containing protein (Fragment)4.1e-20937.87Show/hide
Query:  RKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADGFSFWLSAVYGPNRC
        RKR L+K+ +++  P +V+L ETK  +++R ++  +W S    W    S G SGGI ++W+     V +++ G FS+SI I    G  +WLS +YGP R 
Subjt:  RKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADGFSFWLSAVYGPNRC

Query:  ELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFLITADCLKKFGSPML
             FW+EL DL G  GDKW LGGDFNV R+S EKS++  ++++M+ FN +I    L D  L N  F+WS+  E      LDRFL++      F     
Subjt:  ELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFLITADCLKKFGSPML

Query:  RRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQ-RSMTSQLPTLILELRKLDSIE
        + + R+TSDH P+ L    + WGP PFRFEN WL    F   ++ WW ++ + GW G+ FM +LK LK  L+ W+KE+   +   L      L  LD  E
Subjt:  RRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQ-RSMTSQLPTLILELRKLDSIE

Query:  DSD-IFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIAFYKALYTKDSNLR
         ++ +  L   E   LL  +I +L  KE ++W QR K++W ++GD NT FFHR+    +++N I ++   D   +    +IE+E I F+K LY+ + N+ 
Subjt:  DSD-IFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIAFYKALYTKDSNLR

Query:  FLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLIPKKLDAKSVSDYR
        +    L+W PI+   A  LE+ F  EEV  AV   G  KSPGP+GF+  FF+  W ++K D+M ++++F+ SG++N   NET+ICLIPKK ++  V+D R
Subjt:  FLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLIPKKLDAKSVSDYR

Query:  PISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIRG
        PISL+   YK+I++VL++RLR VL +TI+++Q AFV  RQILDA L+ANEV+++    K+KG+V K+D EKA+D V+W+F+D VL  KGFG+KWR WI G
Subjt:  PISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIRG

Query:  CISSVNYSIIINGRPR--------VQSGKVIS------------------------HPV--GISSVRLTHLQFADDTLLFSIFYLKALE----NLFNLIK
        C+ SVN+SI+ING+PR        ++ G  +S                        H +  G   V ++HLQFADDT    IF L   E    NL  L+K
Subjt:  CISSVNYSIIINGRPR--------VQSGKVIS------------------------HPV--GISSVRLTHLQFADDTLLFSIFYLKALE----NLFNLIK

Query:  IFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQATLSSIPIYIF
        +F   SG+ IN  KS  LGIN     ++ +  ++GC+ G WP  YLG+PLGGNP+ ++FW PV++K+EK+L  WK A +SKGGR TLIQA LSSIP Y  
Subjt:  IFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQATLSSIPIYIF

Query:  -------------------------------------RCLSSRS----------------RLPKFWRSLLEIFS---------------GKDLEEMVRV-
                                             R   S+                 R    WR  LE  S               G D +++ +V 
Subjt:  -------------------------------------RCLSSRS----------------RLPKFWRSLLEIFS---------------GKDLEEMVRV-

Query:  ----------------------LGNGHNIYFWHDCWLPCGIIAEAYPRLFRLTTRSTAMVDQVWNS--SNAAWDLGLRRNLNDLEVIEWTSLSHILSSVT
                              +GNG  I FW D WL  GI+ + +PRL  L+ R    +    N+      WD   RRNL++ E+ E   L  IL +V 
Subjt:  ----------------------LGNGHNIYFWHDCWLPCGIIAEAYPRLFRLTTRSTAMVDQVWNS--SNAAWDLGLRRNLNDLEVIEWTSLSHILSSVT

Query:  I-RSVNDSWCWPLDHSKSFSVKSLMSDML-TDCDSTPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYSAPENPCHLFV
        +  S  D   W ++   SFS KS  S +L T  D  P   +  +WK   P KI+ F+W  ++G INT D +QRR P + LSPS CV+C    EN  HLF+
Subjt:  I-RSVNDSWCWPLDHSKSFSVKSLMSDML-TDCDSTPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYSAPENPCHLFV

Query:  HCLYASKYWLIMKNAFG
        HC Y+ + W  M  A G
Subjt:  HCLYASKYWLIMKNAFG

M5WJ76 Reverse transcriptase domain-containing protein (Fragment)1.0e-20737.01Show/hide
Query:  PKKKSCSTTRKKS----KQMKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVV
        P KK+ +  R +       MK ++WN+RGLGS RKR L+K+ +++  P +V+L ETK   ++R ++  +W S    W    S G SGGI ++W+     V
Subjt:  PKKKSCSTTRKKS----KQMKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVV

Query:  KETIQGLFSISIHIFMADGFSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGC
         +++ G FS+SI I    G  +WLS +YGP R      FW+EL DL G  GD W LGGDFNV R+S EKS++  ++++M+ FN +I    L D  L N  
Subjt:  KETIQGLFSISIHIFMADGFSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGC

Query:  FSWSSFGEMNYLSLLDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGL
        F+WS+  E      LDRFL++    + F     + + R+TSDH P+ L    + WGP PFRFEN WL    FK  ++ WW ++ + GW G+ FM +LK L
Subjt:  FSWSSFGEMNYLSLLDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGL

Query:  KIALRFWNKEQ-RSMTSQLPTLILELRKLDSIEDSD-IFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEV
        K  L+ W+KE+   +   L      L  LD  E ++ +  L   E   LL  +I +L  KE ++W QR K++W + GD NT FFHR+    +++N I ++
Subjt:  KIALRFWNKEQ-RSMTSQLPTLILELRKLDSIEDSD-IFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEV

Query:  LSRDNVSLISAGDIEKEFIAFYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIR
           D   +    +IE+E I F+K LY+ + N                                AV   G  KSPGP+GF+  FF+  W ++K D+M +++
Subjt:  LSRDNVSLISAGDIEKEFIAFYKALYTKDSNLRFLPTNLDWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIR

Query:  EFYNSGVINAALNETYICLIPKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKL
        +F+ SG++N   NET+ICLIPKK ++  V+DYRPISL+   YK+I++VL++ LR VL +TI+++Q AFV  RQILDA L+ANEV+++    K+KG+V K+
Subjt:  EFYNSGVINAALNETYICLIPKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKL

Query:  DLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIRGCISSVNYSIIINGRPR--------VQSGKVIS------------------------HPV--GISSVR
        D EKA+D V+W+F+D V+  KGFG+KWR WI GC+ SVN+SI+ING+PR        ++ G  +S                        H +  G   V 
Subjt:  DLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIRGCISSVNYSIIINGRPR--------VQSGKVIS------------------------HPV--GISSVR

Query:  LTHLQFADDTLLFSIFYLKALE----NLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKI
        ++HLQFADDT    IF L   E    NL  L+K+F   SG+ IN  KS  LGIN     ++ +  ++GC+ G WP  YLG+PLGGNP+ ++FW PV+EK+
Subjt:  LTHLQFADDTLLFSIFYLKALE----NLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKI

Query:  EKKLHGWKYAFISKGGRHTLIQATLSSIPIYIFRCL------------------------------------------------SSRSRLPK-----FWR
        EK+L  WK A +SKGGR TLIQA LSSIP Y                                                     S R R+        WR
Subjt:  EKKLHGWKYAFISKGGRHTLIQATLSSIPIYIFRCL------------------------------------------------SSRSRLPK-----FWR

Query:  SLLEIFS---------------GKDLEEMVR-----------VLGNGHNIYFWHDCWLPCGIIAEAYPRLFRLTTRSTAMVDQVWNSS--NAAWDLGLRR
          LE  S               G    E+ +            +GNG  I FW D WL  GI+ + +PRL  L+ R    +    N+      WD   RR
Subjt:  SLLEIFS---------------GKDLEEMVR-----------VLGNGHNIYFWHDCWLPCGIIAEAYPRLFRLTTRSTAMVDQVWNSS--NAAWDLGLRR

Query:  NLNDLEVIEWTSLSHILSSVTI-RSVNDSWCWPLDHSKSFSVKSLMSDML-TDCDSTPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLT
        NL++ E+ E   L  IL +V +  S  D   W ++   SFS KS  S +L T  D  P   +  +WK   P KI+ F+W  ++G INT D +QRR P + 
Subjt:  NLNDLEVIEWTSLSHILSSVTI-RSVNDSWCWPLDHSKSFSVKSLMSDML-TDCDSTPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLT

Query:  LSPSCCVMCYSAPENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGIL
        LSPS CV+C    EN  HLF+HC Y+ + W  M  A G   ++P   + +L
Subjt:  LSPSCCVMCYSAPENPCHLFVHCLYASKYWLIMKNAFGWSLILPNNIYGIL

M5WPQ5 Reverse transcriptase domain-containing protein4.8e-21038.53Show/hide
Query:  KQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADGFSFWLSAVYGPNRCELYGDF
        K+ +++  P +V+L ETK   ++R ++  +W S    W    S G SGGI ++W+     V +++ G FS+SI I    G  +WLS +YGP R      F
Subjt:  KQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFMADGFSFWLSAVYGPNRCELYGDF

Query:  WQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFLITADCLKKFGSPMLRRIDRL
        W+EL DL G  GDKW LGGDFNV R+S EKS++  ++++M+ FN +I    L D  L N  F+WS+  E      LDRFL++    + F     + + R+
Subjt:  WQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFLITADCLKKFGSPMLRRIDRL

Query:  TSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQ-RSMTSQLPTLILELRKLDSIEDSD-IF
        TSDH P+ L    + WGP PFRFEN WL    FK  ++ WW ++ + GW G+ FM +LK LK  L+ W+KE+   +   L      L  LD  E ++ + 
Subjt:  TSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQ-RSMTSQLPTLILELRKLDSIEDSD-IF

Query:  FLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIAFYKALYTKDSNLRFLPTNL
         L   E   LL  +I +L  +E ++W QR K++W ++GD NT FFHR+    +++N I ++   D   +    +IE+E I F+K LY+++ N+ +    L
Subjt:  FLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIAFYKALYTKDSNLRFLPTNL

Query:  DWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLIPKKLDAKSVSDYRPISLIP
        +W PI+   A  LE+ F  EEV  AV   G  KSPGP+GF+  FF+  W ++K D+M ++++F+ SG++N   NET+ICLIPKK ++  V+DYRPISL+ 
Subjt:  DWSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLIPKKLDAKSVSDYRPISLIP

Query:  CAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIRGCISSVN
          YK+I++VL++RLR VL +TI+++Q AFV  RQILDA L+ANEV+++    K+KG+V K+D EKA+D V+W+F+D V+  KGFG+KWR WI GC+ SVN
Subjt:  CAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIRGCISSVN

Query:  YSIIINGRPR--------VQSGKVISHPVGISSVRLTHLQFADDTLLFSIFYLKALE----NLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKT
        +SI+ING+PR        ++ G  +S  +    + ++HLQFADDT    IF L   E    NL  L+K+F   SG+ IN  KS  LGIN     ++ +  
Subjt:  YSIIINGRPR--------VQSGKVISHPVGISSVRLTHLQFADDTLLFSIFYLKALE----NLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKT

Query:  TFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQATLSSIPIYIF-----------------------------RC-
        ++GC+ G WP  YLG+PLGGNP+ ++FW PV+EK+EK+L  WK A +SKGGR TLIQA LSSIP Y                               +C 
Subjt:  TFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQATLSSIPIYIF-----------------------------RC-

Query:  --------------------LSSRS---RLPKFWRSLLEIFS---------------GKDLEEMVRV-----------------------LGNGHNIYFW
                            L  R+   R    WR  LE  S               G D + + +V                       +GNG  I FW
Subjt:  --------------------LSSRS---RLPKFWRSLLEIFS---------------GKDLEEMVRV-----------------------LGNGHNIYFW

Query:  HDCWLPCGIIAEAYPRLFRLTTRSTAMVDQVWNSS--NAAWDLGLRRNLNDLEVIEWTSLSHILSSVTI-RSVNDSWCWPLDHSKSFSVKSLMSDML-TD
         D WL  GI+ + +PRL  L+ R    +    N+      WD   RRNL++ E+ E   L  IL +V +  S  D   W ++   SFS KS  S +L T 
Subjt:  HDCWLPCGIIAEAYPRLFRLTTRSTAMVDQVWNSS--NAAWDLGLRRNLNDLEVIEWTSLSHILSSVTI-RSVNDSWCWPLDHSKSFSVKSLMSDML-TD

Query:  CDSTPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYSAPENPCHLFVHCLYASKYWLIMKNAFG
         D  P   +  +WK   P KI+ F+W  ++G INT D +QRR P + LSPS CV+C    EN  HLF+HC Y+ + W  M  A G
Subjt:  CDSTPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYSAPENPCHLFVHCLYASKYWLIMKNAFG

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.6e-3723.31Show/hide
Query:  SKQMKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSL-DSNG---ASGGILIMWSELDF---VVKETIQGLFS
        +  +  LT NV GL S  KR  +   I+ Q+PS+  +QET  +  + + +K        GW  +  +NG    +G  +++  + DF    +K   +G   
Subjt:  SKQMKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSL-DSNG---ASGGILIMWSELDF---VVKETIQGLFS

Query:  ISIHIFMADGF----SFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDI--PLQNGCFSW
           H  M  G        +  +Y PN         Q L DL        ++ GDFN      ++S  Q ++++ +  N  +    L+DI   L      +
Subjt:  ISIHIFMADGF----SFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDI--PLQNGCFSW

Query:  SSFGEMNY-LSLLDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIF--------GDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQG------W
        + F   ++  S +D  + +   L K     +  I    SDH  + L             +W        + W+  +  KA ++ ++  N  +       W
Subjt:  SSFGEMNY-LSLLDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIF--------GDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQG------W

Query:  PGFGFMNKLKGLKIALRFWNKEQRSMTSQLPTLILELRKLDSIEDSDIFFLAHIEARR-----LLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFH
          F  +   +G  IAL  + ++Q    S++ TL  +L++L+  E +      H +A R      +R +++ +  ++ +Q     +  + ++ ++      
Subjt:  PGFGFMNKLKGLKIALRFWNKEQRSMTSQLPTLILELRKLDSIEDSDIFFLAHIEARR-----LLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFH

Query:  RILAARKRKNMITEVLSRDNVSLISAGDIEKEFIAFYKALY-TKDSNLRFLPTNLD---WSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTA
        R++  ++ KN I  + +          +I+     +YK LY  K  NL  + T LD      +N +    L +  T  E+   + SL   KSPGP+GFTA
Subjt:  RILAARKRKNMITEVLSRDNVSLISAGDIEKEFIAFYKALY-TKDSNLRFLPTNLD---WSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTA

Query:  EFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLIPKK-LDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLI
        EF++     +   ++ + +     G++  +  E  I LIPK   D     ++RPISL+    KI+ ++L+NR++  +   I  +Q+ F+   Q       
Subjt:  EFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLIPKK-LDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLI

Query:  ANEVIDDWNISKKKG-VVLKLDLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIRGCISSVNYSIIING-------------------------------RP
        +  VI   N +K K  V++ +D EKAFDK+   F+   L   G    +   IR        +II+NG                               R 
Subjt:  ANEVIDDWNISKKKG-VVLKLDLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIRGCISSVNYSIIING-------------------------------RP

Query:  RVQSGKVISHPVGISSVRLTHLQFADDTLLFSIFYLKALENLFNLIKIFELASGLNINFGKSE-FLGIN---LEDHQMDWLKTTFGCKQGNWPTTYLGVP
          Q  ++    +G   V+L+   FADD +++    + + +NL  LI  F   SG  IN  KS+ FL  N    E   M  L  T   K+      YLG+ 
Subjt:  RVQSGKVISHPVGISSVRLTHLQFADDTLLFSIFYLKALENLFNLIKIFELASGLNINFGKSE-FLGIN---LEDHQMDWLKTTFGCKQGNWPTTYLGVP

Query:  LGGNPKNV--SFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQATLSSIPIYIFRCLSSRSRLPKFWRSLLE
        L  + K++    ++P++++I++  + WK    S  GR  +++  ++ +P  I+R  +   +LP  + + LE
Subjt:  LGGNPKNV--SFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQATLSSIPIYIFRCLSSRSRLPKFWRSLLE

P08548 LINE-1 reverse transcriptase homolog6.6e-3122.55Show/hide
Query:  SKQMKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSL-DSNG--ASGGILIMWSEL----DFVVKETIQGLFS
        SK +   + NV GL    KR  +   IQ+  P +  +QE+  +      +K  +     GW+S+  +NG     GI I++++        +++   G F 
Subjt:  SKQMKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSL-DSNG--ASGGILIMWSEL----DFVVKETIQGLFS

Query:  ISIHIFMADGFSFWLSAVYGPNRCELYGDFWQE-LHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDI-----PLQNGCFSW
                D  S  +  +Y PN       F +E L D++ L     I+ GDFN      ++S  + +S+ +   N  I    L DI     P +   +++
Subjt:  ISIHIFMADGFSFWLSAVYGPNRCELYGDFWQE-LHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDI-----PLQNGCFSW

Query:  SSFGEMNYLSLLDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIFGD--------ISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMN
         S     Y S +D  L     L KF    +  I  + SDH+ + +   +         +W       +++W+ D   K  +  +  QN  Q        +
Subjt:  SSFGEMNYLSLLDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIFGD--------ISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMN

Query:  K----LKGLKIALR-FWNKEQRSMTSQLPTLILELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARK
             L+G  IAL+ F  K +R    ++  L+  L++L+  E S+       E  + +R ++  +  K  IQ   + K  + ++ ++       +   ++
Subjt:  K----LKGLKIALR-FWNKEQRSMTSQLPTLILELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARK

Query:  RKNMITEVLSRDNVSLISAGDIEKEFIAFYKALYT-KDSNLRFLPTNLD--WSPINGDMAVD-LEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSW
         K++I+ + + ++       +I+K    +YK LY+ K  NL+ +   L+    P      V+ L +  +  E+   +++L   KSPGP+GFT+EF++   
Subjt:  RKNMITEVLSRDNVSLISAGDIEKEFIAFYKALYT-KDSNLRFLPTNLD--WSPINGDMAVD-LEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSW

Query:  HIIKSDIMTMIREFYNSGVINAALNETYICLIPKK-LDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDD
          +   ++ + +     G++     E  I LIPK   D     +YRPISL+    KI+ ++L+NR++  +   I  +Q+ F+   Q       +  VI  
Subjt:  HIIKSDIMTMIREFYNSGVINAALNETYICLIPKK-LDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDD

Query:  WN-ISKKKGVVLKLDLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIRGCISSVNYSIIING--------RPRVQSGKVIS---------------------
         N +  K  ++L +D EKAFD +   F+   LK  G    +   I    S    +II+NG        R   + G  +S                     
Subjt:  WN-ISKKKGVVLKLDLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIRGCISSVNYSIIING--------RPRVQSGKVIS---------------------

Query:  HPVGISSVRLTHLQFADDTLLFSIFYLKALENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNV--SFW
          + I S  +    FADD +++      +   L  +IK +   SG  IN  KS        +     +K +           YLGV L  + K++    +
Subjt:  HPVGISSVRLTHLQFADDTLLFSIFYLKALENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNV--SFW

Query:  QPVIEKIEKKLHGWKYAFISKGGRHTLIQATLSSIPIYIFRCLSSRSRLPKFWRSLLEIFSGKDLEEMV
        + + ++I + ++ WK    S  GR  +++ ++    IY F  +  ++ L  F          KDLE+++
Subjt:  QPVIEKIEKKLHGWKYAFISKGGRHTLIQATLSSIPIYIFRCLSSRSRLPKFWRSLLEIFSGKDLEEMV

P11369 LINE-1 retrotransposable element ORF2 protein8.9e-3624.04Show/hide
Query:  TTRKKSKQMKF--LTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGW-TSLDSNG---ASGGILIMWSELDF---VVKE
        TT+ K     F  ++ N+ GL S  KR  +   + +Q+P+   LQET     +R+ ++        GW T   +NG    +G  +++  ++DF   V+K+
Subjt:  TTRKKSKQMKF--LTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGW-TSLDSNG---ASGGILIMWSELDF---VVKE

Query:  TIQGLFSISIHIFMADGFSFWLSAVYGPN-RCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDI-----PL
          +G F +     + +  S  +  +Y PN R   +      L  L        I+ GDFN    S ++S  Q ++R+       +    L DI     P 
Subjt:  TIQGLFSISIHIFMADGFSFWLSAVYGPN-RCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDI-----PL

Query:  QNGCFSWSSFGEMNYLSLLDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIF-GDISWGPCPF--RFENSWLQDSSFKALVES------WWSQNPLQG
          G   +S+       S +D  +     L ++ +  +  +  + SDH+ L LIF  +I+ G   F  +  N+ L D+  K  ++        +++N    
Subjt:  QNGCFSWSSFGEMNYLSLLDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIF-GDISWGPCPF--RFENSWLQDSSFKALVES------WWSQNPLQG

Query:  WPGFGFMNK--LKGLKIALRFWNKEQRSM-TSQLPTLILELRKLDSIEDSDIFFLAHIEARR----LLREQIENLTGKEHIQWHQRCKIQWLQQGDENTN
        +P      K  L+G  IAL    K++ +  TS L T +  L K ++             +RR     LR +I  +  +  IQ   + +  + ++ ++   
Subjt:  WPGFGFMNK--LKGLKIALRFWNKEQRSM-TSQLPTLILELRKLDSIEDSDIFFLAHIEARR----LLREQIENLTGKEHIQWHQRCKIQWLQQGDENTN

Query:  FFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIAFYKALY-TKDSNLRFLPTNLD---WSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNG
           R+    + K +I ++ +          +I+    +FYK LY TK  NL  +   LD      +N D    L    + +E+   + SL   KSPGP+G
Subjt:  FFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIAFYKALY-TKDSNLRFLPTNLD---WSPINGDMAVDLEKDFTEEEVYCAVKSLGVSKSPGPNG

Query:  FTAEFFKHSWHIIKSDIMTMIREFYN----SGVINAALNETYICLIPK-KLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQ
        F+AEF++      K D++ ++ + ++     G +  +  E  I LIPK + D   + ++RPISL+    KI+ ++L+NR++  +   I  +Q+ F+   Q
Subjt:  FTAEFFKHSWHIIKSDIMTMIREFYN----SGVINAALNETYICLIPK-KLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQ

Query:  ILDASLIANEVIDDWN-ISKKKGVVLKLDLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIRGCISSVNYSIIING--------------------------
               +  VI   N +  K  +++ LD EKAFDK+   F+  VL+  G    + N I+   S    +I +NG                          
Subjt:  ILDASLIANEVIDDWN-ISKKKGVVLKLDLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIRGCISSVNYSIIING--------------------------

Query:  -----RPRVQSGKVISHPVGISSVRLTHLQFADDTLLFSIFYLKALENLFNLIKIFELASGLNINFGKS-EFLGINLEDHQMDWLKTT-FGCKQGNWPTT
             R   Q  ++    +G   V+++ L  ADD +++      +   L NLI  F    G  IN  KS  FL    +  + +  +TT F     N    
Subjt:  -----RPRVQSGKVISHPVGISSVRLTHLQFADDTLLFSIFYLKALENLFNLIKIFELASGLNINFGKS-EFLGINLEDHQMDWLKTT-FGCKQGNWPTT

Query:  YLGVPLGGNPKNV--SFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQATLSSIPIYIFRCL
        YLGV L    K++    ++ + ++I++ L  WK    S  GR  +++  +    IY F  +
Subjt:  YLGVPLGGNPKNV--SFWQPVIEKIEKKLHGWKYAFISKGGRHTLIQATLSSIPIYIFRCL

P14381 Transposon TX1 uncharacterized 149 kDa protein4.0e-3624.39Show/hide
Query:  TWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSS----SHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSIS---IHIFM
        T N  G  +  +   +   ++Q   S+  LQET ++          W      +H+ WTS        G++ ++S  D    E +     I    +H+ +
Subjt:  TWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSS----SHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSIS---IHIFM

Query:  AD-GFSFWLSAVYGPNRCELYGDFWQELHD-LAGLGGDK-WILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNG----CFSWSSFGEM
         + G ++ L  VY P        F++ L   +  +  D+  I+GGDFN T  + +++  +    +       IA + L+D+  +       F++    + 
Subjt:  AD-GFSFWLSAVYGPNRCELYGDFWQELHD-LAGLGGDK-WILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNG----CFSWSSFGEM

Query:  NY-LSLLDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGP--CPFRFENSWLQDSSFKALVESWWSQNPLQGWPG----FGFMNKLKGL-K
        +   S +DR  I++  + +  S  +R      SDH  ++L        P    + F NS L+D  F   V   W     +GW      F  +N+   + K
Subjt:  NY-LSLLDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDISWGP--CPFRFENSWLQDSSFKALVESWWSQNPLQGWPG----FGFMNKLKGL-K

Query:  IALRFWNKE-QRSMTSQ-------LPTLILELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKN
        + L+   +E  +S++ Q       L   +L+L +  S  +       ++E +  LR    N+  ++      R ++Q L   D  + FF+ +   +  + 
Subjt:  IALRFWNKE-QRSMTSQ-------LPTLILELRKLDSIEDSDIFFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKN

Query:  MITEVLSRDNVSLISAGDIEKEFIAFYKALYTKDSNLRFLPTNL-DWSPINGDMAVD-LEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKS
         IT + + D   L     I     +FY+ L++ D         L D  P+  +   + LE   T +E+  A++ +  +KSPG +G T EFF+  W  +  
Subjt:  MITEVLSRDNVSLISAGDIEKEFIAFYKALYTKDSNLRFLPTNL-DWSPINGDMAVD-LEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKS

Query:  DIMTMIREFYNSGVINAALNETYICLIPKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKK
        D   ++ E +  G +  +     + L+PKK D + + ++RP+SL+   YKI+A+ +S RL+ VL   I  +Q   V  R I D   +  +++   + +++
Subjt:  DIMTMIREFYNSGVINAALNETYICLIPKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKK

Query:  KGV---VLKLDLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIR------GCISSVNYSI---IINGRPRVQ----SGKVISHPVG----ISSVRLTHL---
         G+    L LD EKAFD+VD  +L   L+A  FG ++  +++       C+  +N+S+   +  GR   Q    SG++ S  +     +   RLT L   
Subjt:  KGV---VLKLDLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIR------GCISSVNYSI---IINGRPRVQ----SGKVISHPVG----ISSVRLTHL---

Query:  ---------QFADDTLLFSIFYLKALENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPT---TYLGVPLGGNPKNVSFWQPV
                  +ADD +L +   L  LE      +++  AS   IN+ KS   G+     ++D+L   F  +  +W +    YLGV L      VS  Q  
Subjt:  ---------QFADDTLLFSIFYLKALENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPT---TYLGVPLGGNPKNVSFWQPV

Query:  IEKIE---KKLHGWK--YAFISKGGRHTLIQATLSSIPIYIFRCLS-SRSRLPKFWRSLLE
        IE  E    +L  WK     +S  GR  +I   ++S   Y   CLS ++  + K  R LL+
Subjt:  IEKIE---KKLHGWK--YAFISKGGRHTLIQATLSSIPIYIFRCLS-SRSRLPKFWRSLLE

Arabidopsis top hitse value%identityAlignment
AT1G43760.1 DNAse I-like superfamily protein1.6e-3226.18Show/hide
Query:  RNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMN-YLSLLDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDI-SWGPCPFRFENSWLQDSSFKA
        R ++ F   +    L+DIP +   ++WS+  + N  +  LDR +   D    F S +        SDH P  +I  ++       FR+ +      +F  
Subjt:  RNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMN-YLSLLDRFLITADCLKKFGSPMLRRIDRLTSDHYPLALIFGDI-SWGPCPFRFENSWLQDSSFKA

Query:  LVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKE-----QRSMTSQLPTLILELRKLDSIEDSDIFFLAHIEARR--LLREQIENLTGKEHIQWHQRC
         +   W +    G   F     LK  K   +  N++     Q      L +L     +L +     +F + H+  ++       +E+        + Q+ 
Subjt:  LVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKE-----QRSMTSQLPTLILELRKLDSIEDSDIFFLAHIEARR--LLREQIENLTGKEHIQWHQRC

Query:  KIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIAFYKALYTKDSNLRFLPTNL----DWSPI--NGDMAVDLEKDFTEEEVYC
        +I+WLQ GD NT FFH+++ A + KN+I  +   D+V + +   +++  +A+Y  L   DS++   P ++    D  P   N  +A  L    +++E+  
Subjt:  KIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIAFYKALYTKDSNLRFLPTNL----DWSPI--NGDMAVDLEKDFTEEEVYC

Query:  AVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLIPKKLDAKSVSDYRPISLIPCAYKII
        AV ++  +K+PGP+ FTAEFF  SW ++K   +  ++EF+ +G +    N T I LIPK      +S +RP+S     YKII
Subjt:  AVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLIPKKLDAKSVSDYRPISLIPCAYKII

AT4G20520.1 RNA binding;RNA-directed DNA polymerases1.3e-0535Show/hide
Query:  RLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGV----VLKLDLEKAFDKVDWDFLDAVLKAKGFGLKW
        RL+ ++ + I   Q +F+  R   D  +   E +   ++ +KKGV    +LKLDLEKA+D++ WD+L+  L + GF   W
Subjt:  RLRFVLPHTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGV----VLKLDLEKAFDKVDWDFLDAVLKAKGFGLKW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTCCCCAAACCTCTTGACGGAGTAGAGTACTCCGTTCAAATTCAAGGCATTGACAACCACCGACAGAACGCCATGGCGGCCGGACCTCCAAGCGGCGCCACCAACCT
AGAATCTACGACATCGAATTTGGAAGATTCTAAGAAAATTGAGGAAATTACGGATGCGTTGAAATCACCCCTAGATCGAAGACCGACAAACTTGAAGATATGCTATAAAG
GCTCCACCTCAGATAAACCAATGACCTTATTGGGATCTGATATTGAGGATTATCTCTCCAGTCCCACCCAAGCCCACTTCCCTTTGGCCCAAAACTCCACCCAAGATCCT
TCAGCCCAATTTCCTCTTCATATTACCTATACTGAACCAATTTATCCCTCCCTCGACCCAAACCAACCCCCACTGGCTATTTTACCACCTTTATCTCCTTCAAATATTAT
CCCACCCAATCCAACTTCTCCAAACCCAAATTTGACAGTGGCCTCTCCACCAAAGACTTTTAATGTTGATCCAAGCCCAAGTCCCCAAATTCAAGAAAACCCACCACCCA
ATTCTCACCCACCTCTTTCTCTCCCTTCCTGCCTCAGTCAATTAGCCCCTCTCCTCAATGAGCATGGTTTATGCATCATGGCAATACCACCTTTTCCAAGACCACCTAAA
AAGAAATCGTGTTCTACCACTCGGAAAAAATCGAAGCAAATGAAGTTTTTGACATGGAATGTGCGTGGATTGGGTTCATGGCGGAAACGGGCTTTGATTAAGCAAACTAT
CCAGCAGCAAAATCCAAGTTTGGTTCTTTTGCAAGAAACAAAATCATCCAGTATCAATCGAAATATTATTAAATCAATTTGGAGTTCCTCTCATATTGGATGGACATCAT
TGGACTCTAATGGAGCTTCGGGAGGAATTCTTATTATGTGGAGTGAACTTGATTTTGTCGTCAAGGAGACTATCCAAGGTCTTTTTTCTATCTCTATTCACATTTTTATG
GCGGATGGTTTCTCCTTTTGGCTTTCAGCAGTTTATGGTCCTAATAGGTGTGAATTATATGGAGATTTCTGGCAAGAGCTTCATGATTTGGCGGGCTTAGGAGGAGATAA
ATGGATTCTTGGTGGAGATTTTAATGTTACCCGTTGGTCCTGGGAAAAGTCTCATGATCAATCGATTTCTAGAAATATGAAAACTTTCAATAGATGGATTGCTACATATC
AGCTTATGGACATCCCATTGCAAAATGGATGCTTTTCTTGGTCTAGTTTTGGTGAAATGAACTATCTTTCTCTGTTGGATCGTTTCTTAATTACTGCGGATTGTCTTAAA
AAATTTGGCTCTCCTATGCTGCGTAGAATTGATCGTCTAACTTCAGACCATTATCCATTAGCTTTAATCTTTGGAGACATTTCTTGGGGTCCTTGCCCTTTTCGGTTTGA
AAACTCTTGGTTGCAAGACAGTTCATTCAAGGCTTTGGTTGAGAGTTGGTGGTCACAAAATCCTTTGCAAGGCTGGCCGGGTTTTGGCTTTATGAATAAGCTTAAGGGGC
TAAAAATTGCTCTTCGCTTCTGGAATAAAGAGCAACGGTCTATGACATCTCAACTTCCTACCCTTATTTTAGAATTGAGAAAACTCGACAGTATTGAGGATAGTGATATT
TTCTTTTTGGCCCATATAGAAGCAAGACGTTTACTTCGAGAGCAGATAGAAAATTTGACAGGAAAAGAGCATATTCAGTGGCATCAACGTTGTAAAATTCAGTGGCTTCA
ACAGGGTGATGAAAATACAAATTTCTTCCATCGTATTTTAGCAGCTCGTAAAAGGAAAAACATGATTACTGAGGTTCTATCTCGTGACAATGTTAGCTTAATCTCAGCTG
GGGATATTGAGAAGGAGTTTATTGCTTTTTATAAAGCGCTTTATACCAAAGACAGTAATTTGCGTTTCCTCCCTACAAACTTAGATTGGAGTCCTATTAATGGGGATATG
GCTGTTGATTTAGAAAAAGATTTTACAGAGGAGGAAGTCTACTGTGCGGTCAAATCCTTGGGTGTAAGTAAATCTCCAGGTCCTAACGGTTTTACAGCTGAATTCTTTAA
GCATTCTTGGCATATTATTAAATCTGATATTATGACCATGATTAGAGAATTTTATAACTCAGGTGTTATTAACGCAGCCTTAAATGAAACCTATATTTGCTTAATCCCAA
AGAAACTAGATGCTAAATCAGTGTCAGACTATCGTCCTATTAGTCTTATCCCCTGTGCCTACAAGATTATTGCTAGAGTCTTGTCTAACAGATTAAGGTTTGTTCTGCCT
CATACTATTGCTGAAAATCAAATGGCCTTTGTTGCCAATAGACAAATCCTGGATGCTTCTCTAATTGCCAATGAAGTGATTGATGACTGGAATATCTCTAAAAAGAAAGG
TGTGGTTCTTAAATTGGACCTGGAAAAGGCTTTTGATAAAGTGGATTGGGATTTTCTAGATGCGGTTTTGAAGGCTAAAGGTTTCGGCTTAAAATGGAGGAATTGGATTC
GAGGTTGTATCTCTTCAGTTAACTACTCGATAATTATCAATGGAAGACCTCGAGTGCAGTCCGGCAAAGTTATATCTCATCCGGTTGGTATCTCTTCTGTTAGATTGACT
CATTTGCAATTTGCTGATGATACATTATTGTTCTCCATTTTTTATTTGAAAGCTCTTGAGAATTTGTTTAATCTTATCAAAATCTTTGAATTGGCTTCTGGACTCAATAT
CAACTTTGGTAAAAGTGAATTTTTGGGGATCAATCTTGAAGACCATCAAATGGATTGGTTGAAGACAACTTTTGGATGTAAACAGGGTAATTGGCCTACAACATATCTTG
GGGTTCCCTTAGGTGGTAATCCAAAGAATGTGTCCTTTTGGCAACCAGTGATTGAAAAAATTGAGAAAAAGCTTCATGGTTGGAAATATGCATTTATCTCGAAAGGTGGT
AGACATACGCTCATTCAAGCTACTCTCTCAAGTATACCTATTTATATCTTTCGTTGTTTAAGTTCCCGATCAAGGTTGCCAAAATTTTGGAGAAGCTTATTAGAGATTTT
TTCTGGGAAGGATCTCGAGGAAATGGTGAGAGTTCTTGGAAATGGTCATAATATTTATTTCTGGCATGACTGTTGGCTACCATGTGGAATTATTGCAGAAGCATATCCAA
GACTATTTCGATTAACTACCAGATCTACTGCTATGGTGGACCAAGTCTGGAATAGTTCCAATGCGGCTTGGGATTTGGGACTTAGACGTAATCTAAATGATTTGGAGGTA
ATTGAATGGACTAGTCTCTCTCATATTCTTTCATCTGTTACGATTAGATCAGTGAATGACTCATGGTGTTGGCCTCTTGATCATTCTAAGAGTTTTTCTGTCAAATCTCT
TATGTCTGATATGCTTACTGATTGTGATTCCACACCATCCAATTTTTACTTGGAGCTTTGGAAAGATGTCTATCCTAAGAAAATCAAAATTTTTCTATGGGAGCTTAGCC
ATGGAGGTATTAATACGGCAGATCGATTGCAAAGGAGAATGCCTCATCTGACTCTCTCTCCATCTTGTTGTGTGATGTGTTATTCTGCCCCAGAAAATCCTTGTCATCTT
TTTGTTCACTGCCTTTATGCCTCTAAATACTGGTTAATCATGAAGAATGCCTTTGGTTGGTCCTTGATCTTGCCCAACAATATTTATGGAATTCTCGACTATGTGTTTAT
GGGACACCCATTTCATGGCAAGCTTTTTGAAGACAACCATGAACACCAACTCCAAATAAAGAACCCTCTCCCTCTTCGTATTGAAGAACCAGTTTACAGCGGTTCTATTC
TTCCCATTGACAAGGAAGACAAAGCTCTTATTGATATAAACGTGGAGGAAGAAGAAACCGATGACTTAACAATAGACAACGAAAGCATCGATCCAGCGGTTTATCTCCCT
ATTTTATTTCCTTGGCTTGTCGAACATGGCATGTGCATCATGCCAATGCCAAGTAGGCATAAGCTATCCAATGCCACCAAGAAGTTAAATGGGTCAAGGAATTACAGAAT
CTCCACACCAACGTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTCCCCAAACCTCTTGACGGAGTAGAGTACTCCGTTCAAATTCAAGGCATTGACAACCACCGACAGAACGCCATGGCGGCCGGACCTCCAAGCGGCGCCACCAACCT
AGAATCTACGACATCGAATTTGGAAGATTCTAAGAAAATTGAGGAAATTACGGATGCGTTGAAATCACCCCTAGATCGAAGACCGACAAACTTGAAGATATGCTATAAAG
GCTCCACCTCAGATAAACCAATGACCTTATTGGGATCTGATATTGAGGATTATCTCTCCAGTCCCACCCAAGCCCACTTCCCTTTGGCCCAAAACTCCACCCAAGATCCT
TCAGCCCAATTTCCTCTTCATATTACCTATACTGAACCAATTTATCCCTCCCTCGACCCAAACCAACCCCCACTGGCTATTTTACCACCTTTATCTCCTTCAAATATTAT
CCCACCCAATCCAACTTCTCCAAACCCAAATTTGACAGTGGCCTCTCCACCAAAGACTTTTAATGTTGATCCAAGCCCAAGTCCCCAAATTCAAGAAAACCCACCACCCA
ATTCTCACCCACCTCTTTCTCTCCCTTCCTGCCTCAGTCAATTAGCCCCTCTCCTCAATGAGCATGGTTTATGCATCATGGCAATACCACCTTTTCCAAGACCACCTAAA
AAGAAATCGTGTTCTACCACTCGGAAAAAATCGAAGCAAATGAAGTTTTTGACATGGAATGTGCGTGGATTGGGTTCATGGCGGAAACGGGCTTTGATTAAGCAAACTAT
CCAGCAGCAAAATCCAAGTTTGGTTCTTTTGCAAGAAACAAAATCATCCAGTATCAATCGAAATATTATTAAATCAATTTGGAGTTCCTCTCATATTGGATGGACATCAT
TGGACTCTAATGGAGCTTCGGGAGGAATTCTTATTATGTGGAGTGAACTTGATTTTGTCGTCAAGGAGACTATCCAAGGTCTTTTTTCTATCTCTATTCACATTTTTATG
GCGGATGGTTTCTCCTTTTGGCTTTCAGCAGTTTATGGTCCTAATAGGTGTGAATTATATGGAGATTTCTGGCAAGAGCTTCATGATTTGGCGGGCTTAGGAGGAGATAA
ATGGATTCTTGGTGGAGATTTTAATGTTACCCGTTGGTCCTGGGAAAAGTCTCATGATCAATCGATTTCTAGAAATATGAAAACTTTCAATAGATGGATTGCTACATATC
AGCTTATGGACATCCCATTGCAAAATGGATGCTTTTCTTGGTCTAGTTTTGGTGAAATGAACTATCTTTCTCTGTTGGATCGTTTCTTAATTACTGCGGATTGTCTTAAA
AAATTTGGCTCTCCTATGCTGCGTAGAATTGATCGTCTAACTTCAGACCATTATCCATTAGCTTTAATCTTTGGAGACATTTCTTGGGGTCCTTGCCCTTTTCGGTTTGA
AAACTCTTGGTTGCAAGACAGTTCATTCAAGGCTTTGGTTGAGAGTTGGTGGTCACAAAATCCTTTGCAAGGCTGGCCGGGTTTTGGCTTTATGAATAAGCTTAAGGGGC
TAAAAATTGCTCTTCGCTTCTGGAATAAAGAGCAACGGTCTATGACATCTCAACTTCCTACCCTTATTTTAGAATTGAGAAAACTCGACAGTATTGAGGATAGTGATATT
TTCTTTTTGGCCCATATAGAAGCAAGACGTTTACTTCGAGAGCAGATAGAAAATTTGACAGGAAAAGAGCATATTCAGTGGCATCAACGTTGTAAAATTCAGTGGCTTCA
ACAGGGTGATGAAAATACAAATTTCTTCCATCGTATTTTAGCAGCTCGTAAAAGGAAAAACATGATTACTGAGGTTCTATCTCGTGACAATGTTAGCTTAATCTCAGCTG
GGGATATTGAGAAGGAGTTTATTGCTTTTTATAAAGCGCTTTATACCAAAGACAGTAATTTGCGTTTCCTCCCTACAAACTTAGATTGGAGTCCTATTAATGGGGATATG
GCTGTTGATTTAGAAAAAGATTTTACAGAGGAGGAAGTCTACTGTGCGGTCAAATCCTTGGGTGTAAGTAAATCTCCAGGTCCTAACGGTTTTACAGCTGAATTCTTTAA
GCATTCTTGGCATATTATTAAATCTGATATTATGACCATGATTAGAGAATTTTATAACTCAGGTGTTATTAACGCAGCCTTAAATGAAACCTATATTTGCTTAATCCCAA
AGAAACTAGATGCTAAATCAGTGTCAGACTATCGTCCTATTAGTCTTATCCCCTGTGCCTACAAGATTATTGCTAGAGTCTTGTCTAACAGATTAAGGTTTGTTCTGCCT
CATACTATTGCTGAAAATCAAATGGCCTTTGTTGCCAATAGACAAATCCTGGATGCTTCTCTAATTGCCAATGAAGTGATTGATGACTGGAATATCTCTAAAAAGAAAGG
TGTGGTTCTTAAATTGGACCTGGAAAAGGCTTTTGATAAAGTGGATTGGGATTTTCTAGATGCGGTTTTGAAGGCTAAAGGTTTCGGCTTAAAATGGAGGAATTGGATTC
GAGGTTGTATCTCTTCAGTTAACTACTCGATAATTATCAATGGAAGACCTCGAGTGCAGTCCGGCAAAGTTATATCTCATCCGGTTGGTATCTCTTCTGTTAGATTGACT
CATTTGCAATTTGCTGATGATACATTATTGTTCTCCATTTTTTATTTGAAAGCTCTTGAGAATTTGTTTAATCTTATCAAAATCTTTGAATTGGCTTCTGGACTCAATAT
CAACTTTGGTAAAAGTGAATTTTTGGGGATCAATCTTGAAGACCATCAAATGGATTGGTTGAAGACAACTTTTGGATGTAAACAGGGTAATTGGCCTACAACATATCTTG
GGGTTCCCTTAGGTGGTAATCCAAAGAATGTGTCCTTTTGGCAACCAGTGATTGAAAAAATTGAGAAAAAGCTTCATGGTTGGAAATATGCATTTATCTCGAAAGGTGGT
AGACATACGCTCATTCAAGCTACTCTCTCAAGTATACCTATTTATATCTTTCGTTGTTTAAGTTCCCGATCAAGGTTGCCAAAATTTTGGAGAAGCTTATTAGAGATTTT
TTCTGGGAAGGATCTCGAGGAAATGGTGAGAGTTCTTGGAAATGGTCATAATATTTATTTCTGGCATGACTGTTGGCTACCATGTGGAATTATTGCAGAAGCATATCCAA
GACTATTTCGATTAACTACCAGATCTACTGCTATGGTGGACCAAGTCTGGAATAGTTCCAATGCGGCTTGGGATTTGGGACTTAGACGTAATCTAAATGATTTGGAGGTA
ATTGAATGGACTAGTCTCTCTCATATTCTTTCATCTGTTACGATTAGATCAGTGAATGACTCATGGTGTTGGCCTCTTGATCATTCTAAGAGTTTTTCTGTCAAATCTCT
TATGTCTGATATGCTTACTGATTGTGATTCCACACCATCCAATTTTTACTTGGAGCTTTGGAAAGATGTCTATCCTAAGAAAATCAAAATTTTTCTATGGGAGCTTAGCC
ATGGAGGTATTAATACGGCAGATCGATTGCAAAGGAGAATGCCTCATCTGACTCTCTCTCCATCTTGTTGTGTGATGTGTTATTCTGCCCCAGAAAATCCTTGTCATCTT
TTTGTTCACTGCCTTTATGCCTCTAAATACTGGTTAATCATGAAGAATGCCTTTGGTTGGTCCTTGATCTTGCCCAACAATATTTATGGAATTCTCGACTATGTGTTTAT
GGGACACCCATTTCATGGCAAGCTTTTTGAAGACAACCATGAACACCAACTCCAAATAAAGAACCCTCTCCCTCTTCGTATTGAAGAACCAGTTTACAGCGGTTCTATTC
TTCCCATTGACAAGGAAGACAAAGCTCTTATTGATATAAACGTGGAGGAAGAAGAAACCGATGACTTAACAATAGACAACGAAAGCATCGATCCAGCGGTTTATCTCCCT
ATTTTATTTCCTTGGCTTGTCGAACATGGCATGTGCATCATGCCAATGCCAAGTAGGCATAAGCTATCCAATGCCACCAAGAAGTTAAATGGGTCAAGGAATTACAGAAT
CTCCACACCAACGTGA
Protein sequenceShow/hide protein sequence
MLPKPLDGVEYSVQIQGIDNHRQNAMAAGPPSGATNLESTTSNLEDSKKIEEITDALKSPLDRRPTNLKICYKGSTSDKPMTLLGSDIEDYLSSPTQAHFPLAQNSTQDP
SAQFPLHITYTEPIYPSLDPNQPPLAILPPLSPSNIIPPNPTSPNPNLTVASPPKTFNVDPSPSPQIQENPPPNSHPPLSLPSCLSQLAPLLNEHGLCIMAIPPFPRPPK
KKSCSTTRKKSKQMKFLTWNVRGLGSWRKRALIKQTIQQQNPSLVLLQETKSSSINRNIIKSIWSSSHIGWTSLDSNGASGGILIMWSELDFVVKETIQGLFSISIHIFM
ADGFSFWLSAVYGPNRCELYGDFWQELHDLAGLGGDKWILGGDFNVTRWSWEKSHDQSISRNMKTFNRWIATYQLMDIPLQNGCFSWSSFGEMNYLSLLDRFLITADCLK
KFGSPMLRRIDRLTSDHYPLALIFGDISWGPCPFRFENSWLQDSSFKALVESWWSQNPLQGWPGFGFMNKLKGLKIALRFWNKEQRSMTSQLPTLILELRKLDSIEDSDI
FFLAHIEARRLLREQIENLTGKEHIQWHQRCKIQWLQQGDENTNFFHRILAARKRKNMITEVLSRDNVSLISAGDIEKEFIAFYKALYTKDSNLRFLPTNLDWSPINGDM
AVDLEKDFTEEEVYCAVKSLGVSKSPGPNGFTAEFFKHSWHIIKSDIMTMIREFYNSGVINAALNETYICLIPKKLDAKSVSDYRPISLIPCAYKIIARVLSNRLRFVLP
HTIAENQMAFVANRQILDASLIANEVIDDWNISKKKGVVLKLDLEKAFDKVDWDFLDAVLKAKGFGLKWRNWIRGCISSVNYSIIINGRPRVQSGKVISHPVGISSVRLT
HLQFADDTLLFSIFYLKALENLFNLIKIFELASGLNINFGKSEFLGINLEDHQMDWLKTTFGCKQGNWPTTYLGVPLGGNPKNVSFWQPVIEKIEKKLHGWKYAFISKGG
RHTLIQATLSSIPIYIFRCLSSRSRLPKFWRSLLEIFSGKDLEEMVRVLGNGHNIYFWHDCWLPCGIIAEAYPRLFRLTTRSTAMVDQVWNSSNAAWDLGLRRNLNDLEV
IEWTSLSHILSSVTIRSVNDSWCWPLDHSKSFSVKSLMSDMLTDCDSTPSNFYLELWKDVYPKKIKIFLWELSHGGINTADRLQRRMPHLTLSPSCCVMCYSAPENPCHL
FVHCLYASKYWLIMKNAFGWSLILPNNIYGILDYVFMGHPFHGKLFEDNHEHQLQIKNPLPLRIEEPVYSGSILPIDKEDKALIDINVEEEETDDLTIDNESIDPAVYLP
ILFPWLVEHGMCIMPMPSRHKLSNATKKLNGSRNYRISTPT