CuGenDBv2

Lag0040426 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0040426
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr13:4655981..4674573
RNA-Seq Expression: Lag0040426
Synteny: Lag0040426
Gene Ontology terms:
  GO:0003824 - catalytic activity (molecular function)
  GO:0005515 - protein binding (molecular function)
InterPro domains:
  IPR000477 - Reverse transcriptase domain
  IPR005135 - Endonuclease/exonuclease/phosphatase
  IPR006652 - Kelch repeat type 1
  IPR015915 - Kelch-type beta propeller
  IPR026960 - Reverse transcriptase zinc-binding domain
  IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
  IPR043502 - DNA/RNA polymerase superfamily


Homology

GenBank top hits (columns: e value, %identity, alignment)
CAN65484.1 hypothetical protein VITISV_029474 [Vitis vinifera] | e value: 7.9e-222 | %identity: 38.58
Query:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW
        ETKK    RR + S+W++    WAAL + GASGGI+I+W     + +E + G FS+SI   +    S WLS +YGP   A R DFW EL D+AGL    W
Subjt:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW

Query:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS
         VGGDF V R + EK      T +M+ F+ +I D +L+D+PL++  FTWS++  N     LDRF  +N+       +    L R TSDH+PI L      
Subjt:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS

Query:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELA-QITSMVDEIKTLDRLEEADCLTLEQRTKRHQLRESIQ
        WGP PFRFEN WL    FK     WW      GW GH  M+KL+ +K +L+ WN +   EL+ +   ++ ++   D LE+   L+ E   +R   +  ++
Subjt:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELA-QITSMVDEIKTLDRLEEADCLTLEQRTKRHQLRESIQ

Query:  DISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERV
        ++   E I+W Q+ ++KW+KEGD N++FFH++   R+ +  I E+ + +G+ +  ++ I++E + +++ LYT  +   +    LDWSPI+   A  LE  
Subjt:  DISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERV

Query:  FSEE-------------------------------VLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKR
        F+EE                               +  D++ +F +F+  GIIN + N ++I L+PKK  S+ + D+RPISLI+  YKIIA+VL+ R++ 
Subjt:  FSEE-------------------------------VLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKR

Query:  VLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPT
        VL  TI   Q AFV+ RQILDA L+ANE++D+ R SG++GV  K+D EKA+D V WDFL  V+++KGFG +WRKW+ GC+SS ++++++NG  +G +  +
Subjt:  VLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPT

Query:  RGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIED
        RGLRQGDPLSPFLF +V+D LSR++        +    +G +   ++HLQFADDT+ FS+     +  L N + VF   SGL +N  KS + GIN     
Subjt:  RGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIED

Query:  LESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQ
        L   A +  CK   WP  YLGLPLGGNPK   FW  VIE+I  +L  W+ A +S GGR TLIQ+ L++MP Y+LSLF IP+ V   ++++ RDF W G  
Subjt:  LESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQ

Query:  MNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWS-SPSVRGGSKSPWRYISSTIVLLTSRIQKRVG
             H +NW     P+  GGLG G +  RN ALL KW+WRY  E ++LW Q+I++ Y     S+ W  +  VR   + PW+ I+      +   +  VG
Subjt:  MNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWS-SPSVRGGSKSPWRYISSTIVLLTSRIQKRVG

Query:  NGQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIATFWNVENS-AWDLGLRRNLNEEEILEWATLSHQLSSVILRNN-RDSWLWPLDPSKSFTVRSLM
        NG    FW D W   + L   YP+L R+ + ++  I++        +W+   RRNL++ EI +   L   L  + + ++  D   W L PS  FTV+S  
Subjt:  NGQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIATFWNVENS-AWDLGLRRNLNEEEILEWATLSHQLSSVILRNN-RDSWLWPLDPSKSFTVRSLM

Query:  TDLLLSSRPSSNNLY--YVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSDGG
          L LS    S  ++    +W    P K+K F+W ++H  +NT D LQ R P+++LSP  C +C   G
Subjt:  TDLLLSSRPSSNNLY--YVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSDGG

KAA0039950.1 LINE-1 retrotransposable element ORF2 protein [Cucumis melo var. makuwa] | e value: 1.9e-223 | %identity: 38.36
Query:  MWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCWIVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQL
        MW+D +++I    +G FS+SI V    G  +WLS IYGP  R  R  FW+EL  L  +    WI+GGDF V RW  E +   P   +MR FN +I +  L
Subjt:  MWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCWIVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQL

Query:  LDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDISWGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGH
        +D PL N  +TWS++    +L+ LDRF  T+             L R TSDH+PI L    ISWGP PFRF N++L   D+K  +E WW  T   G+ G+
Subjt:  LDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDISWGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGH

Query:  GLMQKLKGLKYELRSWNLSQK-KELAQITSMVDEIKTLDRLEEADCLTLEQRTKRHQLRESIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARK
          M++LK L   +++W   +K K  A   + + EI  +D+LE     T   R KR  L+  +  I+  EA  W Q+CK  W+ EGDEN+ FFH+I  AR+
Subjt:  GLMQKLKGLKYELRSWNLSQK-KELAQITSMVDEIKTLDRLEEADCLTLEQRTKRHQLRESIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARK

Query:  RKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERVFSEEVL-------------------------------
        +K  I+++++  G + L   +I   FI  ++++YT + N +    NLDW PI+   +  L++ F+E  +                               
Subjt:  RKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERVFSEEVL-------------------------------

Query:  HDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKRVLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSG
         +I +IF DF+   IIN  +NET I LI KK   +   D+RPISL +  YK+IA+ L++RLK+ LP TI+ +Q+AFV+ RQI +A L+ANE +D WR   
Subjt:  HDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKRVLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSG

Query:  KKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPTRGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTH
        ++G  IKLD+EKAFDK++W F+  V+  K + +KWRK I  CISS  YSI+INGRPRG+I P+RG+RQGDPLSPF+F++  D LSRL+++ A   KI   
Subjt:  KKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPTRGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTH

Query:  PIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIEDLESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIV
             +  L H+ FADD L+F       ++NL   + +FE ASGLNIN  KS +  IN   +  +S A  +G   G  P++YLG+PLGG P + +FW  V
Subjt:  PIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIEDLESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIV

Query:  IEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAK
        ++KIQ KL +WKY+ +SKGGR TLI +TL ++PIY +S+F +P  +   ++  +R+F W G+     +  I W     P+  GGLGI ++   N ALL K
Subjt:  IEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAK

Query:  WIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPS--VRGGSKSPWRYISSTIVLLTSRIQKRVGNGQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAI
        W+W++L E++ LW++LI++KY       + S PS      + SPW+ ++  I      I  +V +G++  FW D W  +  L+   P+L+ LS+ +  ++
Subjt:  WIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPS--VRGGSKSPWRYISSTIVLLTSRIQKRVGNGQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAI

Query:  ATFWNVENSAWDLGLRRNLNEEEILEWATLSHQLSSVILRNNRDSWLWPLDPSKSF---TVRSLMTDLLLSSRPSSNNLYYVIWKDAYPKKIKIFLWELS
          FWN  ++ W L + R L + E   W  +   L + +        LW L+ +  F   +V+  + +  +S      NLY  +WK  +PKK K F+W L 
Subjt:  ATFWNVENSAWDLGLRRNLNEEEILEWATLSHQLSSVILRNNRDSWLWPLDPSKSF---TVRSLMTDLLLSSRPSSNNLYYVIWKDAYPKKIKIFLWELS

Query:  HGCINTADRLQRRMPHRSLSPSWCVMCSDGGGG--------PILGDLWALKGLIEEENETP
        HGCINTADRLQ+R+P+ +LSP+WC MC+             P    LW+    +   N TP
Subjt:  HGCINTADRLQRRMPHRSLSPSWCVMCSDGGGG--------PILGDLWALKGLIEEENETP

RVW16209.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera] | e value: 7.9e-222 | %identity: 38.5
Query:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW
        ETKK    RR + S+W+     W AL + GASGGI+I+W     + +E + G FS+S+   +      W+S +YGP   + R DFW EL D+ GL    W
Subjt:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW

Query:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS
         VGGDF V R + EK     +T +MR F+ +I + +LLD PL+N  FTWS+I E+     LDRF  +N+  L         L R TSDH+PI ++     
Subjt:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS

Query:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMVDEIKTLDRLEEADCLTLEQRTKRHQLRESIQD
        WGP PFRFEN WL   +FK     WW+    IGW GH  M++L+ +K +L+ WN S   EL                             + + +  +++
Subjt:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMVDEIKTLDRLEEADCLTLEQRTKRHQLRESIQD

Query:  ISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERVF
        +   E I+W Q+ K+KW+KEGD N+KF+H++   R+ +  I E+ +  G+ L  A+ I +E + +++ LYT      +    LDWSPI+E  A  LE  F
Subjt:  ISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERVF

Query:  SEE-------------------------------VLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKRV
        +EE                               +  D++ +FA+F+  GIIN + N ++I LIPKK  SK + D+RPISLI+  YKIIA+VLS RL+ V
Subjt:  SEE-------------------------------VLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKRV

Query:  LPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPTR
        L  TI   Q AFV+ RQILDA L+ANE++D+ R SG++GV  K+D EKA+D V WDFL  +++ KGF  +WRKW+ GC+SS +++I++NG  +G +  +R
Subjt:  LPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPTR

Query:  GLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIEDL
        GLRQGDPLSPFLF +V+D LSR++        +    +G +   ++HLQFADD + FS      L  L + + VF    GL +N  KS + GIN     L
Subjt:  GLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIEDL

Query:  ESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQM
           A +  CK   WP  YLGLPLGGNPK+  FW  V+E+I  +L  W+ A +S GGR TLIQ+ L+++P Y+LSLF +P+ V   ++++ RDF W G   
Subjt:  ESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQM

Query:  NGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPS-VRGGSKSPWRYISSTIVLLTSRIQKRVGN
            H + W     P+ +GGLG+GN+ +RN ALL KW+WRY  E ++LW Q+I++ Y     S+ W + + VR   + PW+ I+      +   +   GN
Subjt:  NGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPS-VRGGSKSPWRYISSTIVLLTSRIQKRVGN

Query:  GQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIATFWNVENS-AWDLGLRRNLNEEEILEWATLSHQLSSVILRNN-RDSWLWPLDPSKSFTVRSLMT
        G    FW D W   + L T YP+L+R+   ++ +I++         W+L  RRNL++ EI +   L   L  + L  +  D+ LWPL  S  F+V+S   
Subjt:  GQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIATFWNVENS-AWDLGLRRNLNEEEILEWATLSHQLSSVILRNN-RDSWLWPLDPSKSFTVRSLMT

Query:  DLLLSSRPSSNNLYYVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSDGG
         L  SS  S N     +W    P K+K F+W ++H  +NT D LQ R P+++LSP  C++C   G
Subjt:  DLLLSSRPSSNNLYYVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSDGG

RVW64408.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | e value: 1.2e-230 | %identity: 39.06
Query:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW
        ETK+ T  RR + S+W    + WAAL + GASGGI+I+W        E + G FS+++     E  SFWL+ +YGP +   R DFW EL DL GL    W
Subjt:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW

Query:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS
         VGGDF V R   EK  +  +T NMR F+ +I +  L+D PL+N  FTWS++  +     LDRF  +++       +    L R TSDH PI L    + 
Subjt:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS

Query:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMV-DEIKTLDRLEEADCLTLEQRTKRHQLRESIQ
        WGP PFRFEN WL   +FK     WW      GW GH  M+KLK +K +L+ WN+    +L +   ++  ++  +D +E+   L  +   +R   R  ++
Subjt:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMV-DEIKTLDRLEEADCLTLEQRTKRHQLRESIQ

Query:  DISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERV
        D+   E + W Q+ ++KW+KEGD N+KFFHR+   R+ +  I  ++S  G +L   ++I +E ++F+ NLY+K     +    +DW PI+      L+R 
Subjt:  DISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERV

Query:  FSEE-------------------------------VLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKR
        F+EE                               +  D++ +F +F+ +G+IN + N T+I L+PKK  S  + DYRPISL++  YKIIA+VLS RL++
Subjt:  FSEE-------------------------------VLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKR

Query:  VLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPT
        VL  TI+ +Q AFVE R ILDA L+ANE++D+ R SG++G+  K+D EKA+D VDW FL  V++ KGF +KWR WI GC+SS++++I++NG  +G +  +
Subjt:  VLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPT

Query:  RGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIED
        RGLRQGDPLSPFLF +V+D LSR++      G      +G     ++ LQFADDT+ FS      L NL   + VF   SGL IN  KS + GIN+  E 
Subjt:  RGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIED

Query:  LESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQ
        L S A +F C++  WP +YLGLPLGGNPK + FW  V+E+I  +L  WK A +S GGR TLIQ+ LS++P Y+LSLF IP+ +   ++K+ R+F W G+ 
Subjt:  LESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQ

Query:  MNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPSVRGGSKSPWRYISSTIVLLTSRIQKRVGN
             H + W+    P+ +GGLG G +  RN ALL KW+WR+  E + LW + ++   Y T  +   ++  VR   + PW+ I+      +  ++  VGN
Subjt:  MNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPSVRGGSKSPWRYISSTIVLLTSRIQKRVGN

Query:  GQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIA-TFWNVENSAWDLGLRRNLNEEEILEWATLSHQLSSV-ILRNNRDSWLWPLDPSKSFTVRSLMT
        G+   FW D W  ++ L + +  LYR+ S ++  ++    N    AW+L  RRNL + EI     L   LSSV    +  DS  W L  S  FTV+S   
Subjt:  GQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIA-TFWNVENSAWDLGLRRNLNEEEILEWATLSHQLSSV-ILRNNRDSWLWPLDPSKSFTVRSLMT

Query:  DLLLSSRPSSNNLYYVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSDGG
         L   S P        +W    P K+K   W ++HG +NT D+LQ R P++SL P WC++C   G
Subjt:  DLLLSSRPSSNNLYYVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSDGG

RVW70235.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera] | e value: 5.7e-220 | %identity: 38.48
Query:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW
        ETKK    RR + S+W++    WAAL + GASGGI+I+W     + +E + G FS+SI   +    S WLS +YGP + A R D W EL D+AGL    W
Subjt:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW

Query:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS
         VGGDF V R + EK     +T +M+ F+ +I D +L+D+PL++  FTWS++  N     LDRF  +N+       +    L R TSDH+PI L      
Subjt:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS

Query:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELA-QITSMVDEIKTLDRLEEADCLTLEQRTKRHQLRESIQ
        WGP PFRFEN WL    FK     WW      GW GH  M+KL+ +K +L+ WN +   EL+ +   ++  +   D LE+   L+ E   +R   +  ++
Subjt:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELA-QITSMVDEIKTLDRLEEADCLTLEQRTKRHQLRESIQ

Query:  DISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERV
        ++   E I+W Q+ ++KW+KEGD N+KFFH++   R+ +  I E+ + +G  +  ++ I++E + +++ LYT  +   +    LDWSPI+   A  LE  
Subjt:  DISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERV

Query:  FSEE-------------------------------VLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKR
        F+EE                               +  D++ +F +F+  GIIN + N ++I L+PKK  S+ + D+RPISLI+  YKIIA+VL+ R++ 
Subjt:  FSEE-------------------------------VLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKR

Query:  VLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPT
        VL  TI   Q AFV+ RQILDA L+ANE++D+ R SG++GV  K+D EKA+D V WDFL  VM++KGFG +WRKW+ GC+SS ++++++NG  +G +  +
Subjt:  VLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPT

Query:  RGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIED
        RGLRQGDPLSPFLF +V+D LSR++        +    +G +   ++HLQFADDT+ FS+     +  L N + VF   SGL +N  KS + GIN     
Subjt:  RGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIED

Query:  LESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQ
        L   A +  CK   WP  YLGLPLGGNPK   FW  VIE+I  +L  W+ A +S GGR TLIQ+ L++MP Y+LSLF IP+ V   ++++ RDF W G  
Subjt:  LESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQ

Query:  MNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWS-SPSVRGGSKSPWRYISSTIVLLTSRIQKRVG
             H +NW     P+  GGLG G +  RN ALL KW+WRY  E ++LW Q+I++ Y     S+ W  +  VR   + PW+ I+      +   +  VG
Subjt:  MNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWS-SPSVRGGSKSPWRYISSTIVLLTSRIQKRVG

Query:  NGQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIAT-FWNVENSAWDLGLRRNLNEEEILEWATLSHQLSSVILRNN-RDSWLWPLDPSKSFTVRSLM
        NG    FW D W   + L   YP+L R+ + ++  I++   +    +W+   RRNL++ EI +   L      + + ++  D   W L  S  FTV+S  
Subjt:  NGQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIAT-FWNVENSAWDLGLRRNLNEEEILEWATLSHQLSSVILRNN-RDSWLWPLDPSKSFTVRSLM

Query:  TDLLLSSRPSSNNLY--YVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSDGG
          L LS    S  ++    +W    P K+K F+W ++H  +NT D LQ R P+++LSP  C +C   G
Subjt:  TDLLLSSRPSSNNLY--YVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSDGG

TrEMBL top hits (columns: e value, %identity, alignment)
A0A438FWU5 LINE-1 retrotransposable element ORF2 protein | e value: 5.9e-231 | %identity: 39.06
Query:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW
        ETK+ T  RR + S+W    + WAAL + GASGGI+I+W        E + G FS+++     E  SFWL+ +YGP +   R DFW EL DL GL    W
Subjt:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW

Query:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS
         VGGDF V R   EK  +  +T NMR F+ +I +  L+D PL+N  FTWS++  +     LDRF  +++       +    L R TSDH PI L    + 
Subjt:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS

Query:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMV-DEIKTLDRLEEADCLTLEQRTKRHQLRESIQ
        WGP PFRFEN WL   +FK     WW      GW GH  M+KLK +K +L+ WN+    +L +   ++  ++  +D +E+   L  +   +R   R  ++
Subjt:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMV-DEIKTLDRLEEADCLTLEQRTKRHQLRESIQ

Query:  DISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERV
        D+   E + W Q+ ++KW+KEGD N+KFFHR+   R+ +  I  ++S  G +L   ++I +E ++F+ NLY+K     +    +DW PI+      L+R 
Subjt:  DISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERV

Query:  FSEE-------------------------------VLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKR
        F+EE                               +  D++ +F +F+ +G+IN + N T+I L+PKK  S  + DYRPISL++  YKIIA+VLS RL++
Subjt:  FSEE-------------------------------VLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKR

Query:  VLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPT
        VL  TI+ +Q AFVE R ILDA L+ANE++D+ R SG++G+  K+D EKA+D VDW FL  V++ KGF +KWR WI GC+SS++++I++NG  +G +  +
Subjt:  VLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPT

Query:  RGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIED
        RGLRQGDPLSPFLF +V+D LSR++      G      +G     ++ LQFADDT+ FS      L NL   + VF   SGL IN  KS + GIN+  E 
Subjt:  RGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIED

Query:  LESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQ
        L S A +F C++  WP +YLGLPLGGNPK + FW  V+E+I  +L  WK A +S GGR TLIQ+ LS++P Y+LSLF IP+ +   ++K+ R+F W G+ 
Subjt:  LESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQ

Query:  MNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPSVRGGSKSPWRYISSTIVLLTSRIQKRVGN
             H + W+    P+ +GGLG G +  RN ALL KW+WR+  E + LW + ++   Y T  +   ++  VR   + PW+ I+      +  ++  VGN
Subjt:  MNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPSVRGGSKSPWRYISSTIVLLTSRIQKRVGN

Query:  GQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIA-TFWNVENSAWDLGLRRNLNEEEILEWATLSHQLSSV-ILRNNRDSWLWPLDPSKSFTVRSLMT
        G+   FW D W  ++ L + +  LYR+ S ++  ++    N    AW+L  RRNL + EI     L   LSSV    +  DS  W L  S  FTV+S   
Subjt:  GQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIA-TFWNVENSAWDLGLRRNLNEEEILEWATLSHQLSSV-ILRNNRDSWLWPLDPSKSFTVRSLMT

Query:  DLLLSSRPSSNNLYYVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSDGG
         L   S P        +W    P K+K   W ++HG +NT D+LQ R P++SL P WC++C   G
Subjt:  DLLLSSRPSSNNLYYVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSDGG

M5VS59 Reverse transcriptase domain-containing protein (Fragment) | e value: 3.8e-238 | %identity: 41.23
Query:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW
        ETKK T  R+L+  +W S F  W    S+G SGGI ++W+    ++ +++ G FS+SI +    G  +WLS IYGP  + +R  FW+EL DL G  G+ W
Subjt:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW

Query:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS
         +GGDF V R++ EKSN+  +T +MR FN +I++  L D  L N  FTWS++ EN     LDRF  +          +   L R+TSDH PI L+   + 
Subjt:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS

Query:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMVDE----IKTLDRLEEADCLTLEQRTKRHQLRE
        WGP PFRFEN WLN  DF   ++ WW    + GW G+  M +LK LK +L+ W+   K+E   +   + E    +  LD+ E  + L    R++R  L  
Subjt:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMVDE----IKTLDRLEEADCLTLEQRTKRHQLRE

Query:  SIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADL
         I D++  E + W QR K+KW +EGD NTKFFHR+    +++N I ++   D   +     IE+E I F++ LY+ + N+ +    L+W PI++ +A  L
Subjt:  SIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADL

Query:  ERVFS-------------------------------EEVLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNR
        ER F                                E V  D++ +  DF+  GI+N   NET+ICLIPKK +S  V D RPISL++  YK+I++VL++R
Subjt:  ERVFS-------------------------------EEVLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNR

Query:  LKRVLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKI
        L+ VL +TI+ +Q AFV+ RQILDA L+ANE++++ R   +KG+  K+D EKA+D V+W+F+  V+  KGFG KWR WI GC+ S N+SI+ING+PRGK 
Subjt:  LKRVLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKI

Query:  IPTRGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSS
          +RGLRQGDPLSPFLF +VSD LSR+I     +  +     G     ++HLQFADDT+           NL   +K+F   SG+ IN  KS +LGIN S
Subjt:  IPTRGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSS

Query:  IEDLESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWE
         E L + AG +GC++G WP  YLGLPLGGNP+ ++FW  V++K++ +L+ WK A +SKGGR TLIQA LS++P YY+SLF +P  V   ++++ R+F WE
Subjt:  IEDLESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWE

Query:  GSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPSV-RGGSKSPWRYISSTIVLLTSRIQK
        G +     H + W+     +  GGLGIG+L++RNEAL AKW+WR+  E NSLW ++I +KY    DS+ W +  + +   ++PWR IS          + 
Subjt:  GSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPSV-RGGSKSPWRYISSTIVLLTSRIQK

Query:  RVGNGQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIATFWN--VENSAWDLGLRRNLNEEEILEWATLSHQLSSVILRNNR-DSWLWPLDPSKSFTV
         VGNG+   FW D WL    L  ++P+L  LS +++ +IA F N  V    WD   RRNL+E EI E   L   L +V L  +R D   W ++   SF+ 
Subjt:  RVGNGQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIATFWN--VENSAWDLGLRRNLNEEEILEWATLSHQLSSVILRNNR-DSWLWPLDPSKSFTV

Query:  RSLMTDLLLSSR----PSSNNLYYVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSD
        +S  + LL ++R    P S+     IWK   P KI+ F+W  ++G INT D +QRR P   LSPSWCV+C +
Subjt:  RSLMTDLLLSSR----PSSNNLYYVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSD

M5WJ76 Reverse transcriptase domain-containing protein (Fragment) | e value: 7.4e-234 | %identity: 41.63
Query:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW
        ETKK    R+L+  +W S F  W    S+G SGGI ++W+    ++ +++ G FS+SI +    G  +WLS IYGP  + +R  FW+EL DL G  G+ W
Subjt:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW

Query:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS
         +GGDF V R++ EKSN+  +T +MR FN +I++  L D  L N  FTWS++ EN     LDRF  +          +   L R+TSDH PI L+   + 
Subjt:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS

Query:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMVDE----IKTLDRLEEADCLTLEQRTKRHQLRE
        WGP PFRFEN WLN  DFK  ++ WW    + GW G+  M +LK LK +L+ W+   K+E   +   + E    +  LD+ E  + L    R++R  L  
Subjt:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMVDE----IKTLDRLEEADCLTLEQRTKRHQLRE

Query:  SIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADL
         I D++  E + W QR K+KW ++GD NTKFFHR+    +++N I ++   D   +     IE+E I F++ LY+ + N        D SP  +  +   
Subjt:  SIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADL

Query:  ERVFSEEVLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKRVLPSTIAPNQLAFVEDRQILDASLMANE
         +   E V  D++ +  DF+  GI+N   NET+ICLIPKK +S  V DYRPISL++  YK+I++VL++ L+ VL +TI+ +Q AFV+ RQILDA L+ANE
Subjt:  ERVFSEEVLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKRVLPSTIAPNQLAFVEDRQILDASLMANE

Query:  LIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPTRGLRQGDPLSPFLFIMVSDCLSRLISHG
        ++++ R   +KG+  K+D EKA+D V+W+F+  VM  KGFG KWR WI GC+ S N+SI+ING+PRGK   +RGLRQGDPLSPFLF +VSD LSRLI   
Subjt:  LIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPTRGLRQGDPLSPFLFIMVSDCLSRLISHG

Query:  AYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIEDLESAAGIFGCKIGSWPSNYLGLPLGGNP
          +  +     G     ++HLQFADDT+           NL   +K+F   SG+ IN  KS +LGIN S + L + AG +GC++G WP  YLGLPLGGNP
Subjt:  AYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIEDLESAAGIFGCKIGSWPSNYLGLPLGGNP

Query:  KNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQMNGGVHNINWKTTQLPQLMGGLGIGNLK
        + ++FW  V+EK++ +L+ WK A +SKGGR TLIQA LS++P YY+SLF +P  V   ++++ R+F WEG       H + W+     +  GGLGIG+L+
Subjt:  KNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQMNGGVHNINWKTTQLPQLMGGLGIGNLK

Query:  QRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPSVRGGSKSPWRYISSTIVLLTSRIQKRVGNGQNTDFWHDQWLNSEKLATIYPKLYRLS
        +R EAL AKW+WR+  E NSLW ++I +KY    +              +PWR IS          +  VGNG+   FW D WL    L  ++P+L  LS
Subjt:  QRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPSVRGGSKSPWRYISSTIVLLTSRIQKRVGNGQNTDFWHDQWLNSEKLATIYPKLYRLS

Query:  SKQHDAIATFWN--VENSAWDLGLRRNLNEEEILEWATLSHQLSSVILRNNR-DSWLWPLDPSKSFTVRSLMTDLLLSSR----PSSNNLYYVIWKDAYP
         +++ +IA F N  V    WD   RRNL+E EI E   L   L +V L  +R D   W ++   SF+ +S  + LL ++R    P S+     IWK   P
Subjt:  SKQHDAIATFWN--VENSAWDLGLRRNLNEEEILEWATLSHQLSSVILRNNR-DSWLWPLDPSKSFTVRSLMTDLLLSSR----PSSNNLYYVIWKDAYP

Query:  KKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSD
         KI+ F+W  ++G INT D +QRR P   LSPSWCV+C +
Subjt:  KKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSD

M5WPQ5 Reverse transcriptase domain-containing protein | e value: 2.2e-233 | %identity: 40.76
Query:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW
        ETKK    R+L+  +W S F  W    S+G SGGI ++W+    ++ +++ G FS+SI +    G  +WLS IYGP  + +R  FW+EL DL G  G+ W
Subjt:  ETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCW

Query:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS
         +GGDF V R++ EKSN+  +T +MR FN +I++  L D  L N  FTWS++ EN     LDRF  +          +   L R+TSDH PI L+   + 
Subjt:  IVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDIS

Query:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMVDE----IKTLDRLEEADCLTLEQRTKRHQLRE
        WGP PFRFEN WLN  DFK  ++ WW    + GW G+  M +LK LK +L+ W+   K+E   +   + E    +  LD+ E  + L    R++R  L  
Subjt:  WGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMVDE----IKTLDRLEEADCLTLEQRTKRHQLRE

Query:  SIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADL
         I D++  E + W QR K+KW +EGD NTKFFHR+    +++N I ++   D   +     IE+E I F++ LY+++ N+ +    L+W PI++ +A  L
Subjt:  SIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADL

Query:  ERVFS-------------------------------EEVLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNR
        ER F                                E V  D++ +  DF+  GI+N   NET+ICLIPKK +S  V DYRPISL++  YK+I++VL++R
Subjt:  ERVFS-------------------------------EEVLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNR

Query:  LKRVLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKI
        L+ VL +TI+ +Q AFV+ RQILDA L+ANE++++ R   +KG+  K+D EKA+D V+W+F+  VM  KGFG KWR WI GC+ S N+SI+ING+PRGK 
Subjt:  LKRVLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKI

Query:  IPTRGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSS
          +RGLRQGDPLSPFLF +V +                          ++HLQFADDT+           NL   +K+F   SG+ IN  KS +LGIN S
Subjt:  IPTRGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSS

Query:  IEDLESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWE
         E L + AG +GC++G WP  YLGLPLGGNP+ ++FW  V+EK++ +L+ WK A +SKGGR TLIQA LS++P YY+SLF +P  V   ++++ R+F WE
Subjt:  IEDLESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWE

Query:  GSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPSV-RGGSKSPWRYISSTIVLLTSRIQK
        G +     H + W+     +  GGLGIG+L++RNEAL AKW+WR+  E NSLW ++I +KY    DS+ W +  + +   ++PWR IS          + 
Subjt:  GSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPSV-RGGSKSPWRYISSTIVLLTSRIQK

Query:  RVGNGQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIATFWN--VENSAWDLGLRRNLNEEEILEWATLSHQLSSVILRNNR-DSWLWPLDPSKSFTV
         VGNG+   FW D WL    L  ++P+L  LS +++ +IA F N  V    WD   RRNL+E E+ E   L   L +V L  +R D   W ++   SF+ 
Subjt:  RVGNGQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIATFWN--VENSAWDLGLRRNLNEEEILEWATLSHQLSSVILRNNR-DSWLWPLDPSKSFTV

Query:  RSLMTDLLLSSR----PSSNNLYYVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSD
        +S  + LL ++R    P S+     IWK   P KI+ F+W  ++G INT D +QRR P   LSPSWCV+C +
Subjt:  RSLMTDLLLSSR----PSSNNLYYVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSD

M5XHS0 Reverse transcriptase domain-containing protein (Fragment) | e value: 2.0e-226 | % identity: 39.96
Query:  TTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCWIVGGD
        T  R+L+  +W S F  W    S+G SGGI ++W+    ++ +++ G FS+SI +    G  +WLS IYGP  + +R  FW+EL DL G  G+ W +GGD
Subjt:  TTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCWIVGGD

Query:  FKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDISWGPCP
        F V R++ EKSN+  +T +MR FN +I++  L D  L N  FTWS++ EN     LDRF  +          +   L R+TSDH PI L+   + WGP P
Subjt:  FKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDISWGPCP

Query:  FRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMVDEIKTLDRLEEADCLTLEQRTKRHQLRESIQDISAME
        FRFEN WLN  DFK  ++ WW    ++GW G+  M   +      R    ++ + L            LD+ E  + L    R++R  L   I D++  E
Subjt:  FRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMVDEIKTLDRLEEADCLTLEQRTKRHQLRESIQDISAME

Query:  AIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERVFS----
         + W QR K+KW +EGD NTKFFHR+ +  +++N I ++   D   +     IE+E I F++ LY+ + N+ +    L+W PI++ +A  LER F     
Subjt:  AIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPTNLDWSPINESQAADLERVFS----

Query:  ---------------------------EEVLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKRVLPSTI
                                   E V  D++ +  DF+  GI+N   NET+ICLIPKK +S  V DYRPISL++  YK+I++VL +RL+ VL +TI
Subjt:  ---------------------------EEVLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKRVLPSTI

Query:  APNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPTRGLRQG
        + +Q AFV+ RQILDA L+ANE++++ R   +KG+  K+D EKA+D V+W+F+  V+  KGFG KWR WI GC+ S N+SI+ING+PRGK   +RGLRQG
Subjt:  APNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPTRGLRQG

Query:  DPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIEDLESAAG
        DPLSPFLF +VSD LSR+I     +  +     G     ++HLQFADDT+           NL   +K+F   SG+ IN  KS +LGIN S + L + AG
Subjt:  DPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIEDLESAAG

Query:  IFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQMNGGVH
         +GC++G WP  YLGLPLGGNP+ ++FW  V++K++ +L+ WK A +SKGGR TLIQA LS++P YY+SLF +P  V   ++++ R+F WEG +     H
Subjt:  IFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQMNGGVH

Query:  NINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPSV-RGGSKSPWRYISSTIVLLTSRIQKRVGNGQNTD
         + W+     +  GGLGIG+L++RNEAL AKW+WR+  E NSLW ++I +KY    DS+ W +  + +   ++PWR IS          +  VGNG+   
Subjt:  NINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYYFTGDSSLWSSPSV-RGGSKSPWRYISSTIVLLTSRIQKRVGNGQNTD

Query:  FWHDQWLNSEKLATIYPKLYRLSSKQHDAIATFWNVENSAWDLGLRRNLNEEEILEWATLSHQLSSVILRNNR-DSWLWPLDPSKSFTVRSLMTDLLLSS
        FW D WL    L  ++P+L  LS ++                   +RNL+E EI E   L   L +V L  +R D   W ++   SF+ +S  + LL ++
Subjt:  FWHDQWLNSEKLATIYPKLYRLSSKQHDAIATFWNVENSAWDLGLRRNLNEEEILEWATLSHQLSSVILRNNR-DSWLWPLDPSKSFTVRSLMTDLLLSS

Query:  R----PSSNNLYYVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSD
        R    P S+     IWK   P KI+ F+W  ++G INT D +QRR P   LSPSWCV+C +
Subjt:  R----PSSNNLYYVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSD
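
The % identity column can be related to the alignment blocks above through the middle line of each block. The sketch below assumes the usual BLAST pairwise convention: a letter on the middle line marks an identical residue, `+` marks a conservative substitution, and a space marks a mismatch or gap. The function name `identity_from_midline` is hypothetical and not part of this database. It scores one segment only, so it will not necessarily reproduce the value reported for the full alignment.

```python
def identity_from_midline(query: str, midline: str) -> float:
    """Return percent identity for one aligned segment.

    `query` is the aligned query string (gaps as '-'), and `midline` is the
    match line printed beneath it. Both must have the "Query:" style prefix
    already stripped. Assumption: letters on the midline mark identities.
    """
    if not query:
        raise ValueError("empty alignment segment")
    # Trailing spaces on the midline are often lost in text dumps; restore them.
    midline = midline.ljust(len(query))
    if len(midline) != len(query):
        raise ValueError("midline is longer than the query segment")
    # Count alignment columns whose midline character is a residue letter.
    identical = sum(1 for m in midline if m.isalpha())
    return 100.0 * identical / len(query)


# First nine columns of one block from the M5XHS0 alignment above.
print(f"{identity_from_midline('FRFENSWLN', 'FRFEN WLN'):.2f}")  # 88.89
```

To estimate identity for a whole hit, the counts of identical columns and total columns would be summed over all blocks before dividing, rather than averaging per-block percentages.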

SwissProt top hits | e value | % identity | Alignment
O00370 LINE-1 retrotransposable element ORF2 protein | e value: 4.4e-34 | % identity: 21.76
Query:  IYGPTDRAQRADFWQELHDLAGLGGNCWIVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDI-----PLQNGWFTWSSIGENRSLTLLDRFFTT
        IY P   A R    Q L DL     +  ++ GDF       ++S  Q +  + +  N  +    L+DI     P    +  +S+   + + + +D    +
Subjt:  IYGPTDRAQRADFWQELHDLAGLGGNCWIVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDI-----PLQNGWFTWSSIGENRSLTLLDRFFTT

Query:  NDCLLKMGAAQLTRLERVTSDHYPISLNF--------GDISWGPCPFRFENSWLNCK---DFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLS
           L K    ++  +    SDH  I L             +W        + W++ +   + K   E+  N+               +G    L ++   
Subjt:  NDCLLKMGAAQLTRLERVTSDHYPISLNF--------GDISWGPCPFRFENSWLNCK---DFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLS

Query:  QKKELAQITSMVDEIKTLDRLEEADCLTLEQRTKRHQLRESIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQ
        +K+E ++I ++  ++K L++ E+       +R +  ++R  +++I   + +      +  + +  ++  +   R++  ++ KN I  + +  G       
Subjt:  QKKELAQITSMVDEIKTLDRLEEADCLTLEQRTKRHQLRESIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQ

Query:  EIEKEFIDFYQNLY-TKDNNLRFLPTNLDW----------------------------------SPINESQAADLERVFSEEVLHDILNIFADFYHHGII
        EI+    ++Y++LY  K  NL  + T LD                                   SP  +   A+  + + EE++  +L +F      GI+
Subjt:  EIEKEFIDFYQNLY-TKDNNLRFLPTNLDW----------------------------------SPINESQAADLERVFSEEVLHDILNIFADFYHHGII

Query:  NAALNETYICLIPKK-LDSKVVYDYRPISLISCAYKIIARVLSNRLKRVLPSTIAPNQLAFVEDRQ-ILDASLMANELIDDWRCSGKKGVTIKLDLEKAF
          +  E  I LIPK   D+    ++RPISL++   KI+ ++L+NR+++ +   I  +Q+ F+   Q   +     N +    R   K  V I +D EKAF
Subjt:  NAALNETYICLIPKK-LDSKVVYDYRPISLISCAYKIIARVLSNRLKRVLPSTIAPNQLAFVEDRQ-ILDASLMANELIDDWRCSGKKGVTIKLDLEKAF

Query:  DKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPTRGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQF
        DK+   F+   +   G    + K I         +II+NG+         G RQG PLSP LF +V + L+R I     +  I    +G     L+   F
Subjt:  DKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPTRGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQF

Query:  ADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIEDLESAAGIFGCKIGSWPSNYLGLPLGGNPKNV--SFWQIVIEKIQHKLRSWK
        ADD +++      +  NL   I  F   SG  IN  KS+    N++ +      G     I S    YLG+ L  + K++    ++ ++++I+     WK
Subjt:  ADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIEDLESAAGIFGCKIGSWPSNYLGLPLGGNPKNV--SFWQIVIEKIQHKLRSWK

Query:  YALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLT----LDKIFRDFFWEGSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAK--WIWRYL
            S  GR  +++  +    IY  +   IP K+ +T    L+K    F W   +       ++ K        GG+ + + K   +A + K  W W Y 
Subjt:  YALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLT----LDKIFRDFFWEGSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAK--WIWRYL

Query:  HEENSLWRQ
        + +   W +
Subjt:  HEENSLWRQ

P0C2F6 Putative ribonuclease H protein At1g65750 | e value: 2.1e-31 | % identity: 28.27
Query:  VIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLA
        ++E++  ++  W+   +S  GR TL +A LS+MP++ +S   +P  ++  LD++ R F W  +      H + W     P+  GGLG+   K  N AL++
Subjt:  VIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLA

Query:  KWIWRYLHEENSLWRQLIVAKYYFTGD--SSLWSSPSVRGGSKSPWRYISSTIVLLTSR-IQKRVGNGQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHD
        K  WR L E+NSLW  L++ K Y  G+   S W  P  +G   S WR I+  +  + S  +    G+GQ   FW D+W++ + L  +     R +     
Subjt:  KWIWRYLHEENSLWRQLIVAKYYFTGD--SSLWSSPSVRGGSKSPWRYISSTIVLLTSR-IQKRVGNGQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHD

Query:  AIATFWNVENSAWDLGLRRNLNEEEILEWATLSH--QLSSVIL---RNNRDSWLWPLDPSKSFTVRSLMTDLLLSS--RPSSNNLYYVIWKDAYPKKIKI
             W +    WD          +I  + T +   +L +V+L      RD   W       F+VRS    L +    RP+  + +  +WK   P+++K 
Subjt:  AIATFWNVENSAWDLGLRRNLNEEEILEWATLSH--QLSSVIL---RNNRDSWLWPLDPSKSFTVRSLMTDLLLSS--RPSSNNLYYVIWKDAYPKKIKI

Query:  FLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSDG
        FLW + +  + T +   RR  H S S + C +C  G
Subjt:  FLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSDG

P11369 LINE-1 retrotransposable element ORF2 protein | e value: 2.4e-32 | % identity: 23.54
Query:  GWAAL---NSIGASGGIIIMWSDP-DY---TIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQE-LHDLAGLGGNCWIVGGDFKVTRWTW
        GW  +   N +    G+ I+ SD  D+    IK+   G F L     + E  S  +  IY P  RA  A F ++ L  L        I+ GDF     + 
Subjt:  GWAAL---NSIGASGGIIIMWSDP-DY---TIKETIRGLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQE-LHDLAGLGGNCWIVGGDFKVTRWTW

Query:  EKSNDQPITNNMRMFNRWIEDHQLLDI-----PLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNF-GDISWGPCPF-
        ++S  Q +  +       ++   L DI     P   G+  +S+   + + + +D        L +    ++  +  + SDH+ + L F  +I+ G   F 
Subjt:  EKSNDQPITNNMRMFNRWIEDHQLLDI-----PLQNGWFTWSSIGENRSLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNF-GDISWGPCPF-

Query:  -RFENSWLN--------CKDFKSVLESWWNRTPLIGWPGHGLMQKLKG-LKYELRSWNLSQKK-ELAQITSMVDEIKTLDRLEEADCLTLEQRTKRHQLR
         +  N+ LN         K+ K  LE  +N      +P   L   +K  L+ +L + + S+KK E A  +S+   +K L++ +EA+     +R +  +LR
Subjt:  -RFENSWLN--------CKDFKSVLESWWNRTPLIGWPGHGLMQKLKG-LKYELRSWNLSQKK-ELAQITSMVDEIKTLDRLEEADCLTLEQRTKRHQLR

Query:  ESIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLY-TKDNNLRFLPTNLDW---------
          I  +     I    + +  + ++ ++  K   R+    + K  I ++ +  G      +EI+     FY+ LY TK  NL  +   LD          
Subjt:  ESIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLY-TKDNNLRFLPTNLDW---------

Query:  ------SPINESQ-------------------AADLERVFSEEVLHDILNIFADFYHHGIINAALNETYICLIPK-KLDSKVVYDYRPISLISCAYKIIA
              SPI+  +                   +A+  + F E+++  +  +F      G +  +  E  I LIPK + D   + ++RPISL++   KI+ 
Subjt:  ------SPINESQ-------------------AADLERVFSEEVLHDILNIFADFYHHGIINAALNETYICLIPK-KLDSKVVYDYRPISLISCAYKIIA

Query:  RVLSNRLKRVLPSTIAPNQLAFVED-------RQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSAN
        ++L+NR++  + + I P+Q+ F+         R+ ++     N+L D      K  + I LD EKAFDK+   F+  V++  G    +   I    S   
Subjt:  RVLSNRLKRVLPSTIAPNQLAFVED-------RQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSAN

Query:  YSIIINGRPRGKIIPTRGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNI
         +I +NG     I    G RQG PLSP+LF +V + L+R I     +  I    IG     ++ L  ADD +++ +    +   L N I  F    G  I
Subjt:  YSIIINGRPRGKIIPTRGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNI

Query:  NFGKSELLGINSSIEDLESAAGIFGCKIGSWPSNYLGLPLGGNPKNV--SFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSL--FHIP
        N  KS       + +  +         I +    YLG+ L    K++    ++ + ++I+  LR WK    S  GR  +++  +    IY  +     IP
Subjt:  NFGKSELLGINSSIEDLESAAGIFGCKIGSWPSNYLGLPLGGNPKNV--SFWQIVIEKIQHKLRSWKYALISKGGRHTLIQATLSNMPIYYLSL--FHIP

Query:  SKVVLTLDKIFRDFFWEGSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAK--WIWRYLHEENSLWRQL
        ++    L+     F W   +       +  K T      GG+ + +LK    A++ K  W W Y   +   W ++
Subjt:  SKVVLTLDKIFRDFFWEGSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAK--WIWRYLHEENSLWRQL

P14381 Transposon TX1 uncharacterized 149 kDa protein | e value: 1.1e-40 | % identity: 23.85
Query:  KKTTTTRRLIKSIWSSSFIGWAALNSI-GASGGIIIMWSD---PDYTIKETIRGLFSLSIHVCMAE-GFSFWLSIIYGPTDRAQRADFWQELHDLAGL--
        ++T TT  L ++ W+  + G    N +   S G++ ++SD   P+     ++  +    +H+ + E G ++ L  +Y PT   +RA F++ L        
Subjt:  KKTTTTRRLIKSIWSSSFIGWAALNSI-GASGGIIIMWSD---PDYTIKETIRGLFSLSIHVCMAE-GFSFWLSIIYGPTDRAQRADFWQELHDLAGL--

Query:  GGNCWIVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNG----WFTWSSIGENR-SLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHY
             I+GGDF  T    +++  +   ++  +    I    L+D+  +       FT+  + +   S + +DR + ++  L+    +   RL    SDH 
Subjt:  GGNCWIVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNG----WFTWSSIGENR-SLTLLDRFFTTNDCLLKMGAAQLTRLERVTSDHY

Query:  PISLNFGDISWGP--CPFRFENSWLNCKDF-KSVLESW--WNR-----TPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMVDEIKTLD-RLEE
         +SL        P    + F NS L  + F KSV ++W  W         L  W   G +  LK L  E       Q+   A+I ++  E+  L+ RL  
Subjt:  PISLNFGDISWGP--CPFRFENSWLNCKDF-KSVLESW--WNR-----TPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMVDEIKTLD-RLEE

Query:  ADCLTLEQRTKRHQLRESIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKD------
        ++   L+   +  + +E+++++   +A     R +++ L + D  ++FF+ +   +  +  IT + + DG  L   + I      FYQNL++ D      
Subjt:  ADCLTLEQRTKRHQLRESIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKD------

Query:  -----------------------------NNLRFLPTNLDWSPINESQAADLERVFSEEVLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYD
                                       LR +P N   SP  +    +  + F + +  D   +  + +  G +  +     + L+PKK D +++ +
Subjt:  -----------------------------NNLRFLPTNLDWSPINESQAADLERVFSEEVLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYD

Query:  YRPISLISCAYKIIARVLSNRLKRVLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWI
        +RP+SL+S  YKI+A+ +S RLK VL   I P+Q   V  R I D   +  +L+   R +G     + LD EKAFD+VD  +L   ++   FG ++  ++
Subjt:  YRPISLISCAYKIIARVLSNRLKRVLPSTIAPNQLAFVEDRQILDASLMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWI

Query:  FGCISSANYSIIINGRPRGKIIPTRGLRQGDPLSPFLFIMVSD---CLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTI
            +SA   + IN      +   RG+RQG PLS  L+ +  +   CL R        G +L  P       +    +ADD +L +  D   L       
Subjt:  FGCISSANYSIIINGRPRGKIIPTRGLRQGDPLSPFLFIMVSD---CLSRLISHGAYLGKILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTI

Query:  KVFELASGLNINFGKSE-LLGINSSIEDLESAAGIFGCKIGSWPS---NYLGLPLGGNPKNVSFWQIVIEK-IQHKLRSWK--YALISKGGRHTLIQATL
        +V+  AS   IN+ KS  LL  +  ++ L  A      +  SW S    YLG+ L      VS   I +E+ +  +L  WK    ++S  GR  +I   +
Subjt:  KVFELASGLNINFGKSE-LLGINSSIEDLESAAGIFGCKIGSWPS---NYLGLPLGGNPKNVSFWQIVIEK-IQHKLRSWK--YALISKGGRHTLIQATL

Query:  SNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYY
        ++   Y L       + +  + +   DF W G       H ++   + LP   GG G+  ++ +      + I RYL+ + S     + + +Y
Subjt:  SNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQLIVAKYY

Q10AZ7 Protein GLUTELIN PRECURSOR ACCUMULATION 3 | e value: 9.4e-93 | % identity: 56.35
Query:  GGGGPILGDLWALKGLIEEENETPGWTQLKLPGQGPSPRCGHTITSGGHYLLLFGGHGTGGWLSRYDVYHNDCIVLDRVTAQWKRLPTGNEAPSARAYHS
        GG GPI+GDLWALKG+ EE+NETPGWTQLKLPGQ PSPRCGH++TSGG YLLLFGGHGTGGWLSRYDVY+N+CI+LDRV+ QWK L T NE P  RAYHS
Subjt:  GGGGPILGDLWALKGLIEEENETPGWTQLKLPGQGPSPRCGHTITSGGHYLLLFGGHGTGGWLSRYDVYHNDCIVLDRVTAQWKRLPTGNEAPSARAYHS

Query:  MNCIGSRYLLFGGFDGKSTFGDLWWLVTEEDPIVKRLFSASPNDLPQNKDLTSLKEDYNSAHEDSHGRHDAVVELGKSLGISISFSNPGVPVVGEMEDKE
        M CIGSR+LLFGGFDGK+TFGDLWWLV E DPI KR     PN    +K         +SA ++S      +++L K LGIS+S        V E+ DKE
Subjt:  MNCIGSRYLLFGGFDGKSTFGDLWWLVTEEDPIVKRLFSASPNDLPQNKDLTSLKEDYNSAHEDSHGRHDAVVELGKSLGISISFSNPGVPVVGEMEDKE

Query:  FLNLAYSLSEVKPSISGQITHIEATQALRNHWRNSNPRLIPLKELEPLLRDYQRLITHNYFTNDGPHSEFISTSLPGKDAYQFYHINNFRQLRMDDIPKL
         + L+  L    P    Q   I   QALR+HW +     I L+EL PLLRDYQRLI   Y  N         TS   K+ ++F+H+ N  +LRMDDIP L
Subjt:  FLNLAYSLSEVKPSISGQITHIEATQALRNHWRNSNPRLIPLKELEPLLRDYQRLITHNYFTNDGPHSEFISTSLPGKDAYQFYHINNFRQLRMDDIPKL

Query:  LAEYKRL
        L EY +L
Subjt:  LAEYKRL

Q10AZ7 Protein GLUTELIN PRECURSOR ACCUMULATION 3 | e value: 2.8e-89 | % identity: 76.47
Query:  LPSQVNGHSAVSIGNSKVVVFGGLVDKKFLSDIAVYDIENKLWFQPECTGNGSKEQVGPSPRAFHIAVAIDCHMFVFGGRSGSKRMGDFWVLDTDIWQWS
        +P+  +GH+AVSIG SKVVVFGG  DK+FLSDIAVYD+EN++W+ PEC G+GS  Q GPSPRAFH+A+ IDC+MF+FGGRSG KR+GDFW+LDTDIWQWS
Subjt:  LPSQVNGHSAVSIGNSKVVVFGGLVDKKFLSDIAVYDIENKLWFQPECTGNGSKEQVGPSPRAFHIAVAIDCHMFVFGGRSGSKRMGDFWVLDTDIWQWS

Query:  ELTSFGDLPSPRDFAAASSFGNRKIVMYGGWDGKKWLSDVYVLDTMSLEWTELSVAGSLPPPRCGHTATMLEKRLLVYGGRVTPPPL
        ELT FGDLPSPR+FAAAS+ GNRKIVMYGGWDGKKWLSDVY++DTMSLEWTELSV GS+PPPRCGH+ATM+EKRLLV+GGR    P+
Subjt:  ELTSFGDLPSPRDFAAASSFGNRKIVMYGGWDGKKWLSDVYVLDTMSLEWTELSVAGSLPPPRCGHTATMLEKRLLVYGGRVTPPPL

Arabidopsis top hits | e value | % identity | Alignment
AT2G36360.1 Galactose oxidase/kelch repeat superfamily protein | e value: 1.7e-97 | % identity: 54.33
Query:  CINTADRLQRRMPHRSLSPSWCVMCSDGGGGPILGDLWALKGLIEEENETPGWTQLKLPGQGPSPRCGHTITSGGHYLLLFGGHGTGGWLSRYDVYHNDC
        C +TA  +++R+          V    GGGGPI+GDLWALKGLI+EE ETPGWTQLKLPGQ PS RCGHT+TSGGHYLLLFGGHGTGGWLSRYDVY+ND 
Subjt:  CINTADRLQRRMPHRSLSPSWCVMCSDGGGGPILGDLWALKGLIEEENETPGWTQLKLPGQGPSPRCGHTITSGGHYLLLFGGHGTGGWLSRYDVYHNDC

Query:  IVLDRVTAQWKRLPTGNEAPSARAYHSMNCIGSRYLLFGGFDGKSTFGDLWWLVTEEDPIVKRLFSASPNDLPQNKDLTSLKEDYNSAHEDSHGRHDAVV
        I+LDRVTAQWKRLP GNE P  RAYH+M CIG+R+LL GGFDGK TFGDLWWLV E+DPI KR      + +PQ  +   +KE      ++      ++V
Subjt:  IVLDRVTAQWKRLPTGNEAPSARAYHSMNCIGSRYLLFGGFDGKSTFGDLWWLVTEEDPIVKRLFSASPNDLPQNKDLTSLKEDYNSAHEDSHGRHDAVV

Query:  ELGKSLGISISFSNPGVPVVGEMEDKEFLNLAYSLSEVKPSISGQITHIE-ATQALRNHWRNSNPRLIPLKELEPLLRDYQRLITHNYFTNDGPHSEFIS
        +L + +GIS+S S   + +  E ED+EF+ L   L E    +  + + I+ A QALR HW+ S PR + LKEL  LLRDYQRL+T  + T     S    
Subjt:  ELGKSLGISISFSNPGVPVVGEMEDKEFLNLAYSLSEVKPSISGQITHIE-ATQALRNHWRNSNPRLIPLKELEPLLRDYQRLITHNYFTNDGPHSEFIS

Query:  TSLPGKDAYQFYHINNFRQLRMDDIPKLLAEYKRL
          LPG   + FYHI +  +LR++DI KLL EYK L
Subjt:  TSLPGKDAYQFYHINNFRQLRMDDIPKLLAEYKRL

AT2G36360.1 Galactose oxidase/kelch repeat superfamily protein | e value: 9.0e-91 | % identity: 80.43
Query:  SGLPSQV-NGHSAVSIGNSKVVVFGGLVDKKFLSDIAVYDIENKLWFQPECTGNGSKEQVGPSPRAFHIAVAIDCHMFVFGGRSGSKRMGDFWVLDTDIW
        SG P Q  +GH+AV++G S VVVFGGLVDKKFLSDI VYDIENKLWF+PECTG+ S+ QVGP+PRAFH+A+ IDCHMF+FGGRSG KR+GDFWVLDTDIW
Subjt:  SGLPSQV-NGHSAVSIGNSKVVVFGGLVDKKFLSDIAVYDIENKLWFQPECTGNGSKEQVGPSPRAFHIAVAIDCHMFVFGGRSGSKRMGDFWVLDTDIW

Query:  QWSELTSFGDLPSPRDFAAASSFGNRKIVMYGGWDGKKWLSDVYVLDTMSLEWTELSVAGSLPPPRCGHTATMLEKRLLVYGGR
        QWSELTSFGDLP+PRDFAAA++ G++KIV+ GGWDGKKWLSDVYV+DTMSLEW ELSV+GSLPPPRCGHTATM+EKRLLV+GGR
Subjt:  QWSELTSFGDLPSPRDFAAASSFGNRKIVMYGGWDGKKWLSDVYVLDTMSLEWTELSVAGSLPPPRCGHTATMLEKRLLVYGGR

AT2G36360.2 Galactose oxidase/kelch repeat superfamily protein | e value: 1.7e-97 | % identity: 54.33
Query:  CINTADRLQRRMPHRSLSPSWCVMCSDGGGGPILGDLWALKGLIEEENETPGWTQLKLPGQGPSPRCGHTITSGGHYLLLFGGHGTGGWLSRYDVYHNDC
        C +TA  +++R+          V    GGGGPI+GDLWALKGLI+EE ETPGWTQLKLPGQ PS RCGHT+TSGGHYLLLFGGHGTGGWLSRYDVY+ND 
Subjt:  CINTADRLQRRMPHRSLSPSWCVMCSDGGGGPILGDLWALKGLIEEENETPGWTQLKLPGQGPSPRCGHTITSGGHYLLLFGGHGTGGWLSRYDVYHNDC

Query:  IVLDRVTAQWKRLPTGNEAPSARAYHSMNCIGSRYLLFGGFDGKSTFGDLWWLVTEEDPIVKRLFSASPNDLPQNKDLTSLKEDYNSAHEDSHGRHDAVV
        I+LDRVTAQWKRLP GNE P  RAYH+M CIG+R+LL GGFDGK TFGDLWWLV E+DPI KR      + +PQ  +   +KE      ++      ++V
Subjt:  IVLDRVTAQWKRLPTGNEAPSARAYHSMNCIGSRYLLFGGFDGKSTFGDLWWLVTEEDPIVKRLFSASPNDLPQNKDLTSLKEDYNSAHEDSHGRHDAVV

Query:  ELGKSLGISISFSNPGVPVVGEMEDKEFLNLAYSLSEVKPSISGQITHIE-ATQALRNHWRNSNPRLIPLKELEPLLRDYQRLITHNYFTNDGPHSEFIS
        +L + +GIS+S S   + +  E ED+EF+ L   L E    +  + + I+ A QALR HW+ S PR + LKEL  LLRDYQRL+T  + T     S    
Subjt:  ELGKSLGISISFSNPGVPVVGEMEDKEFLNLAYSLSEVKPSISGQITHIE-ATQALRNHWRNSNPRLIPLKELEPLLRDYQRLITHNYFTNDGPHSEFIS

Query:  TSLPGKDAYQFYHINNFRQLRMDDIPKLLAEYKRL
          LPG   + FYHI +  +LR++DI KLL EYK L
Subjt:  TSLPGKDAYQFYHINNFRQLRMDDIPKLLAEYKRL

AT2G36360.2 Galactose oxidase/kelch repeat superfamily protein | e value: 9.0e-91 | % identity: 80.43
Query:  SGLPSQV-NGHSAVSIGNSKVVVFGGLVDKKFLSDIAVYDIENKLWFQPECTGNGSKEQVGPSPRAFHIAVAIDCHMFVFGGRSGSKRMGDFWVLDTDIW
        SG P Q  +GH+AV++G S VVVFGGLVDKKFLSDI VYDIENKLWF+PECTG+ S+ QVGP+PRAFH+A+ IDCHMF+FGGRSG KR+GDFWVLDTDIW
Subjt:  SGLPSQV-NGHSAVSIGNSKVVVFGGLVDKKFLSDIAVYDIENKLWFQPECTGNGSKEQVGPSPRAFHIAVAIDCHMFVFGGRSGSKRMGDFWVLDTDIW

Query:  QWSELTSFGDLPSPRDFAAASSFGNRKIVMYGGWDGKKWLSDVYVLDTMSLEWTELSVAGSLPPPRCGHTATMLEKRLLVYGGR
        QWSELTSFGDLP+PRDFAAA++ G++KIV+ GGWDGKKWLSDVYV+DTMSLEW ELSV+GSLPPPRCGHTATM+EKRLLV+GGR
Subjt:  QWSELTSFGDLPSPRDFAAASSFGNRKIVMYGGWDGKKWLSDVYVLDTMSLEWTELSVAGSLPPPRCGHTATMLEKRLLVYGGR

AT2G36360.3 Galactose oxidase/kelch repeat superfamily protein | e value: 2.5e-96 | % identity: 53.87
Query:  CINTADRLQRRMPHRSLSPSWCVMCSDGGGGPILGDLWALKGLIEEENETPGWTQLKLPGQGPSPRCGHTITSGGHYLLLFGGHGTGGWLSRYDVYHNDC
        C +TA  +++R+          V    GGGGPI+GDLWALKGLI+EE ETPGWTQLKLPGQ PS RCGHT+TSGGHYLLLFGGHGTGGWLSRYDVY+ND 
Subjt:  CINTADRLQRRMPHRSLSPSWCVMCSDGGGGPILGDLWALKGLIEEENETPGWTQLKLPGQGPSPRCGHTITSGGHYLLLFGGHGTGGWLSRYDVYHNDC

Query:  IVLDRVTAQWKRLPTGNEAPSARAYHSMNCIGSRYLLFGGFDGKSTFGDLWWLVTEEDPIVKRLFSASPNDLPQNKDLTSLKEDYNSAHEDSHGRHD-AV
        I+LDRVTAQWKRLP GNE P  RAYH+M CIG+R+LL GGFDGK TFGDLWWLV E+DPI KR       + P+ K+     +          G+   ++
Subjt:  IVLDRVTAQWKRLPTGNEAPSARAYHSMNCIGSRYLLFGGFDGKSTFGDLWWLVTEEDPIVKRLFSASPNDLPQNKDLTSLKEDYNSAHEDSHGRHD-AV

Query:  VELGKSLGISISFSNPGVPVVGEMEDKEFLNLAYSLSEVKPSISGQITHIE-ATQALRNHWRNSNPRLIPLKELEPLLRDYQRLITHNYFTNDGPHSEFI
        V+L + +GIS+S S   + +  E ED+EF+ L   L E    +  + + I+ A QALR HW+ S PR + LKEL  LLRDYQRL+T  + T     S   
Subjt:  VELGKSLGISISFSNPGVPVVGEMEDKEFLNLAYSLSEVKPSISGQITHIE-ATQALRNHWRNSNPRLIPLKELEPLLRDYQRLITHNYFTNDGPHSEFI

Query:  STSLPGKDAYQFYHINNFRQLRMDDIPKLLAEYKRL
           LPG   + FYHI +  +LR++DI KLL EYK L
Subjt:  STSLPGKDAYQFYHINNFRQLRMDDIPKLLAEYKRL

AT2G36360.3 Galactose oxidase/kelch repeat superfamily protein | e value: 9.0e-91 | % identity: 80.43
Query:  SGLPSQV-NGHSAVSIGNSKVVVFGGLVDKKFLSDIAVYDIENKLWFQPECTGNGSKEQVGPSPRAFHIAVAIDCHMFVFGGRSGSKRMGDFWVLDTDIW
        SG P Q  +GH+AV++G S VVVFGGLVDKKFLSDI VYDIENKLWF+PECTG+ S+ QVGP+PRAFH+A+ IDCHMF+FGGRSG KR+GDFWVLDTDIW
Subjt:  SGLPSQV-NGHSAVSIGNSKVVVFGGLVDKKFLSDIAVYDIENKLWFQPECTGNGSKEQVGPSPRAFHIAVAIDCHMFVFGGRSGSKRMGDFWVLDTDIW

Query:  QWSELTSFGDLPSPRDFAAASSFGNRKIVMYGGWDGKKWLSDVYVLDTMSLEWTELSVAGSLPPPRCGHTATMLEKRLLVYGGR
        QWSELTSFGDLP+PRDFAAA++ G++KIV+ GGWDGKKWLSDVYV+DTMSLEW ELSV+GSLPPPRCGHTATM+EKRLLV+GGR
Subjt:  QWSELTSFGDLPSPRDFAAASSFGNRKIVMYGGWDGKKWLSDVYVLDTMSLEWTELSVAGSLPPPRCGHTATMLEKRLLVYGGR

AT2G36360.4 Galactose oxidase/kelch repeat superfamily protein | e value: 2.5e-96 | % identity: 53.87
Query:  CINTADRLQRRMPHRSLSPSWCVMCSDGGGGPILGDLWALKGLIEEENETPGWTQLKLPGQGPSPRCGHTITSGGHYLLLFGGHGTGGWLSRYDVYHNDC
        C +TA  +++R+          V    GGGGPI+GDLWALKGLI+EE ETPGWTQLKLPGQ PS RCGHT+TSGGHYLLLFGGHGTGGWLSRYDVY+ND 
Subjt:  CINTADRLQRRMPHRSLSPSWCVMCSDGGGGPILGDLWALKGLIEEENETPGWTQLKLPGQGPSPRCGHTITSGGHYLLLFGGHGTGGWLSRYDVYHNDC

Query:  IVLDRVTAQWKRLPTGNEAPSARAYHSMNCIGSRYLLFGGFDGKSTFGDLWWLVTEEDPIVKRLFSASPNDLPQNKDLTSLKEDYNSAHEDSHGRHD-AV
        I+LDRVTAQWKRLP GNE P  RAYH+M CIG+R+LL GGFDGK TFGDLWWLV E+DPI KR       + P+ K+     +          G+   ++
Subjt:  IVLDRVTAQWKRLPTGNEAPSARAYHSMNCIGSRYLLFGGFDGKSTFGDLWWLVTEEDPIVKRLFSASPNDLPQNKDLTSLKEDYNSAHEDSHGRHD-AV

Query:  VELGKSLGISISFSNPGVPVVGEMEDKEFLNLAYSLSEVKPSISGQITHIE-ATQALRNHWRNSNPRLIPLKELEPLLRDYQRLITHNYFTNDGPHSEFI
        V+L + +GIS+S S   + +  E ED+EF+ L   L E    +  + + I+ A QALR HW+ S PR + LKEL  LLRDYQRL+T  + T     S   
Subjt:  VELGKSLGISISFSNPGVPVVGEMEDKEFLNLAYSLSEVKPSISGQITHIE-ATQALRNHWRNSNPRLIPLKELEPLLRDYQRLITHNYFTNDGPHSEFI

Query:  STSLPGKDAYQFYHINNFRQLRMDDIPKLLAEYKRL
           LPG   + FYHI +  +LR++DI KLL EYK L
Subjt:  STSLPGKDAYQFYHINNFRQLRMDDIPKLLAEYKRL

AT2G36360.4 Galactose oxidase/kelch repeat superfamily protein | e value: 1.4e-88 | % identity: 77.08
Query:  SGLPSQV-NGHSAVSIGNSKVVVFGGLVDKKFLSDIAVYDIENKLWFQPECTGNGSKEQVGPSPRAFHIAVAIDCHMFVFGGRSGSK--------RMGDF
        SG P Q  +GH+AV++G S VVVFGGLVDKKFLSDI VYDIENKLWF+PECTG+ S+ QVGP+PRAFH+A+ IDCHMF+FGGRSG K        R+GDF
Subjt:  SGLPSQV-NGHSAVSIGNSKVVVFGGLVDKKFLSDIAVYDIENKLWFQPECTGNGSKEQVGPSPRAFHIAVAIDCHMFVFGGRSGSK--------RMGDF

Query:  WVLDTDIWQWSELTSFGDLPSPRDFAAASSFGNRKIVMYGGWDGKKWLSDVYVLDTMSLEWTELSVAGSLPPPRCGHTATMLEKRLLVYGGR
        WVLDTDIWQWSELTSFGDLP+PRDFAAA++ G++KIV+ GGWDGKKWLSDVYV+DTMSLEW ELSV+GSLPPPRCGHTATM+EKRLLV+GGR
Subjt:  WVLDTDIWQWSELTSFGDLPSPRDFAAASSFGNRKIVMYGGWDGKKWLSDVYVLDTMSLEWTELSVAGSLPPPRCGHTATMLEKRLLVYGGR

AT2G36360.5 Galactose oxidase/kelch repeat superfamily protein | e value: 2.5e-96 | % identity: 53.87
Query:  CINTADRLQRRMPHRSLSPSWCVMCSDGGGGPILGDLWALKGLIEEENETPGWTQLKLPGQGPSPRCGHTITSGGHYLLLFGGHGTGGWLSRYDVYHNDC
        C +TA  +++R+          V    GGGGPI+GDLWALKGLI+EE ETPGWTQLKLPGQ PS RCGHT+TSGGHYLLLFGGHGTGGWLSRYDVY+ND 
Subjt:  CINTADRLQRRMPHRSLSPSWCVMCSDGGGGPILGDLWALKGLIEEENETPGWTQLKLPGQGPSPRCGHTITSGGHYLLLFGGHGTGGWLSRYDVYHNDC

Query:  IVLDRVTAQWKRLPTGNEAPSARAYHSMNCIGSRYLLFGGFDGKSTFGDLWWLVTEEDPIVKRLFSASPNDLPQNKDLTSLKEDYNSAHEDSHGRHD-AV
        I+LDRVTAQWKRLP GNE P  RAYH+M CIG+R+LL GGFDGK TFGDLWWLV E+DPI KR       + P+ K+     +          G+   ++
Subjt:  IVLDRVTAQWKRLPTGNEAPSARAYHSMNCIGSRYLLFGGFDGKSTFGDLWWLVTEEDPIVKRLFSASPNDLPQNKDLTSLKEDYNSAHEDSHGRHD-AV

Query:  VELGKSLGISISFSNPGVPVVGEMEDKEFLNLAYSLSEVKPSISGQITHIE-ATQALRNHWRNSNPRLIPLKELEPLLRDYQRLITHNYFTNDGPHSEFI
        V+L + +GIS+S S   + +  E ED+EF+ L   L E    +  + + I+ A QALR HW+ S PR + LKEL  LLRDYQRL+T  + T     S   
Subjt:  VELGKSLGISISFSNPGVPVVGEMEDKEFLNLAYSLSEVKPSISGQITHIE-ATQALRNHWRNSNPRLIPLKELEPLLRDYQRLITHNYFTNDGPHSEFI

Query:  STSLPGKDAYQFYHINNFRQLRMDDIPKLLAEYKRL
           LPG   + FYHI +  +LR++DI KLL EYK L
Subjt:  STSLPGKDAYQFYHINNFRQLRMDDIPKLLAEYKRL

AT2G36360.5 Galactose oxidase/kelch repeat superfamily protein | e value: 1.9e-88 | % identity: 76.68
Query:  SGLPSQV-NGHSAVSIGNSKVVVFGGLVDKKFLSDIAVYDIENKLWFQPECTGNGSKEQVGPSPRAFHIAVAIDCHMFVFGGRSGSKRMGDFWVLDT---
        SG P Q  +GH+AV++G S VVVFGGLVDKKFLSDI VYDIENKLWF+PECTG+ S+ QVGP+PRAFH+A+ IDCHMF+FGGRSG KR+GDFWVLDT   
Subjt:  SGLPSQV-NGHSAVSIGNSKVVVFGGLVDKKFLSDIAVYDIENKLWFQPECTGNGSKEQVGPSPRAFHIAVAIDCHMFVFGGRSGSKRMGDFWVLDT---

Query:  ------DIWQWSELTSFGDLPSPRDFAAASSFGNRKIVMYGGWDGKKWLSDVYVLDTMSLEWTELSVAGSLPPPRCGHTATMLEKRLLVYGGR
              DIWQWSELTSFGDLP+PRDFAAA++ G++KIV+ GGWDGKKWLSDVYV+DTMSLEW ELSV+GSLPPPRCGHTATM+EKRLLV+GGR
Subjt:  ------DIWQWSELTSFGDLPSPRDFAAASSFGNRKIVMYGGWDGKKWLSDVYVLDTMSLEWTELSVAGSLPPPRCGHTATMLEKRLLVYGGR


Sequences
CDS sequence
ATGTTGACCAAGTCCTCTGGTCTGCCGTCTCAGGTGAATGGCCACTCTGCGGTTAGCATCGGGAATTCGAAGGTCGTCGTGTTTGGTGGCCTCGTCGACAAGAAATTTCT
GAGCGATATCGCTGTTTATGACATCGAAAATAAATTATGGTTTCAGCCAGAATGCACTGGCAATGGCTCAAAAGAACAAGTTGGTCCAAGTCCACGAGCTTTTCACATTG
CTGTTGCAATTGACTGCCACATGTTTGTTTTTGGTGGACGTTCAGGTAGCAAAAGGATGGGTGATTTCTGGGTTCTAGACACTGATATCTGGCAATGGTCAGAGCTAACT
AGTTTCGGTGACTTGCCTTCACCAAGGGACTTTGCTGCAGCATCTTCTTTTGGGAACCGTAAAATTGTCATGTATGGGGGCTGGGATGGTAAAAAGTGGCTGTCAGATGT
ATATGTCCTAGACACAATGTCACTTGAATGGACTGAACTTTCAGTTGCTGGATCACTGCCTCCTCCAAGATGTGGCCATACAGCAACTATGCTTGAGAAAAGGTTGCTTG
TCTATGGTGGAAGAGTCACGCCTCCCCCTTTACCATCCTCTGTTTTATCGCCAATATGCCCAAGCAGCCCATCACAGTTTCGCGATTTACCTCAAGCATTACGGCACATT
GCTCCAATACTACAAGATCATGGTCTCTGTATTATGGCAATCCCACCCCCACCACCAACCACTAAAAAGAAATATGCTCCAAAAGGGAAGAAACCGGGTTGTAGAGAGTT
AAAGAATCTTACTTCCAATGTGAACTATGATAAAACAGCCACTTTGGCATTAATGGCGGGCTCTCCGGAAACTAAGAAGACTACAACTACCAGACGGTTAATTAAATCTA
TATGGAGCTCATCCTTCATTGGTTGGGCAGCCCTTAACTCCATAGGAGCTTCGGGCGGGATCATTATCATGTGGAGCGATCCGGATTACACCATTAAGGAGACTATTCGA
GGTCTTTTCTCACTCTCTATTCATGTTTGTATGGCTGAAGGTTTTTCTTTTTGGTTATCGATTATCTATGGCCCAACAGATCGAGCTCAAAGGGCGGATTTTTGGCAAGA
GCTTCATGACTTGGCTGGTTTGGGAGGGAATTGTTGGATTGTTGGGGGAGACTTTAAAGTTACTCGTTGGACCTGGGAGAAATCTAACGACCAGCCTATTACCAACAATA
TGCGAATGTTCAATAGATGGATTGAGGATCATCAGTTGTTGGATATTCCTCTACAAAATGGGTGGTTTACTTGGTCTAGCATAGGTGAAAATCGGTCTCTTACTCTATTG
GACAGATTTTTCACCACTAATGATTGTCTTTTGAAAATGGGGGCAGCTCAATTAACGAGACTTGAACGAGTTACCTCAGACCATTATCCAATTTCTTTGAATTTTGGAGA
TATCTCATGGGGCCCTTGTCCCTTCCGGTTCGAAAATTCATGGTTAAATTGTAAAGATTTCAAATCGGTTTTGGAATCTTGGTGGAACAGAACTCCCCTCATAGGTTGGC
CTGGCCATGGGTTGATGCAGAAACTTAAAGGATTAAAATATGAACTTCGCTCATGGAACCTTTCCCAAAAGAAGGAGTTAGCTCAAATAACTTCCATGGTGGATGAAATA
AAGACTCTGGACAGACTTGAAGAGGCTGATTGTTTAACGTTAGAACAACGAACAAAAAGGCATCAGTTACGAGAATCGATTCAAGACATTTCAGCTATGGAAGCTATTTA
CTGGCATCAAAGGTGCAAGCTAAAATGGCTAAAAGAAGGGGACGAAAATACTAAATTTTTTCATCGCATTATGGCTGCTCGAAAAAGAAAGAATTCAATTACTGAGGTTC
TGTCCAGGGATGGTATTAGTTTGCTTACTGCCCAAGAAATTGAGAAGGAATTTATTGATTTTTATCAGAATTTGTATACCAAAGACAACAACTTGAGATTTCTCCCTACC
AATCTTGACTGGAGTCCAATCAATGAGTCTCAAGCTGCGGATTTAGAACGAGTTTTTTCAGAAGAAGTTCTGCATGACATCCTGAACATTTTTGCTGATTTTTATCATCA
TGGAATTATCAATGCAGCCTTGAATGAGACATACATCTGCCTTATCCCAAAAAAGTTGGACTCAAAAGTAGTATATGACTATCGTCCAATAAGCCTTATTTCTTGTGCTT
ATAAAATAATTGCTCGAGTTCTATCAAATAGGCTGAAAAGAGTTCTTCCTTCTACAATTGCTCCAAATCAATTGGCTTTTGTTGAAGATCGACAAATTCTAGATGCTTCC
TTGATGGCTAATGAGTTAATAGATGATTGGCGTTGTTCGGGCAAGAAGGGGGTGACTATCAAATTGGATCTTGAAAAAGCTTTTGACAAGGTTGATTGGGATTTTTTACA
TGCAGTTATGAAAATAAAAGGTTTTGGCAAAAAATGGCGAAAATGGATATTTGGTTGCATTTCTAGTGCAAACTATTCGATTATTATAAATGGAAGGCCTAGAGGTAAGA
TCATTCCAACAAGGGGTCTACGTCAAGGTGATCCGCTTTCCCCTTTTCTATTTATCATGGTTTCTGATTGCCTCTCTCGTTTAATATCACATGGAGCTTACTTAGGTAAA
ATCTTGACTCACCCTATCGGTGTTTCATCTTTTTGCCTAAATCATCTTCAATTCGCGGACGATACTTTATTATTCTCCACCATGGATCCTACTGCTTTGACTAATCTATT
TAATACTATCAAAGTTTTTGAGCTAGCCTCCGGCCTGAATATTAATTTTGGGAAAAGTGAGCTTCTTGGTATCAATTCTTCTATAGAAGATCTGGAATCAGCTGCTGGAA
TTTTTGGGTGCAAAATTGGTTCCTGGCCATCTAACTATTTAGGTCTTCCTTTGGGTGGTAATCCAAAAAATGTATCTTTTTGGCAGATTGTTATAGAGAAGATTCAACAC
AAGTTACGAAGCTGGAAATATGCATTGATTTCTAAGGGTGGTCGCCATACTCTTATTCAGGCTACTTTATCCAACATGCCTATTTATTATCTCTCTTTATTTCATATTCC
TTCAAAAGTGGTTCTCACCCTGGATAAAATTTTTAGGGATTTTTTTTGGGAAGGTTCTCAAATGAATGGTGGTGTGCACAATATTAATTGGAAGACCACTCAACTTCCCC
AACTTATGGGAGGTCTTGGGATTGGAAATCTAAAGCAGAGAAATGAGGCATTGTTAGCAAAATGGATTTGGCGTTATCTTCATGAGGAGAATTCTCTTTGGCGTCAGCTT
ATTGTGGCTAAATATTACTTCACAGGTGATTCTAGTTTGTGGTCTTCTCCTTCTGTGAGAGGTGGTTCTAAGTCTCCGTGGAGGTATATTAGCTCAACGATTGTTTTACT
CACATCTCGAATACAGAAGCGAGTGGGTAATGGGCAGAACACAGACTTTTGGCACGATCAATGGCTTAATAGTGAGAAATTAGCCACTATTTATCCCAAATTATATAGAC
TATCTTCCAAGCAACATGATGCAATTGCTACTTTCTGGAATGTTGAAAATTCAGCTTGGGACCTCGGCCTTAGGAGGAATCTTAATGAGGAAGAAATTTTGGAATGGGCT
ACTCTGTCCCATCAACTATCCTCGGTTATCTTGAGGAATAATCGAGACTCTTGGTTGTGGCCACTTGACCCTTCAAAGTCATTCACAGTTCGTTCTTTAATGACTGATTT
GCTTCTTTCTAGCAGACCATCATCAAATAATCTTTATTATGTGATATGGAAAGATGCTTACCCTAAAAAAATAAAAATATTTTTGTGGGAGCTTAGTCATGGATGTATCA
ATACTGCTGACCGTCTTCAAAGAAGGATGCCTCACCGTTCTCTATCTCCATCTTGGTGTGTCATGTGTTCTGATGGAGGTGGTGGGCCAATACTGGGTGATTTATGGGCT
TTGAAAGGACTCATTGAAGAAGAGAATGAAACCCCTGGATGGACCCAGTTGAAGCTTCCAGGTCAAGGTCCTTCTCCCCGTTGTGGACATACCATTACATCGGGTGGACA
TTATCTATTGTTATTTGGAGGGCATGGGACTGGTGGTTGGCTCAGTCGCTATGATGTTTACCACAATGATTGCATTGTGTTAGACAGGGTGACTGCTCAGTGGAAACGGT
TGCCTACTGGAAATGAAGCGCCTTCAGCACGGGCATACCATTCAATGAACTGTATTGGATCACGTTATTTGTTATTTGGCGGCTTTGATGGGAAATCAACCTTTGGCGAT
CTATGGTGGTTAGTTACTGAAGAGGACCCAATTGTAAAGAGGTTGTTTTCCGCATCACCCAATGATCTCCCTCAAAATAAGGATTTGACATCGTTGAAGGAAGATTATAA
TTCCGCACACGAGGATTCTCATGGGAGGCACGATGCAGTCGTAGAGTTAGGGAAAAGTTTGGGGATTAGTATCTCATTCTCTAATCCTGGAGTTCCTGTTGTAGGCGAGA
TGGAGGACAAAGAGTTCCTTAATCTAGCATATAGTTTAAGTGAAGTTAAACCTTCCATCTCCGGACAGATCACACATATTGAGGCCACCCAGGCACTTCGGAACCATTGG
AGGAACTCTAATCCTAGGTTAATTCCACTGAAAGAGCTTGAGCCCTTACTTCGTGATTACCAGCGCTTGATCACCCATAATTATTTTACAAATGATGGACCTCATTCAGA
ATTCATTAGTACCAGTCTTCCTGGAAAAGATGCTTATCAATTTTACCATATTAACAACTTTCGTCAGTTACGTATGGACGATATTCCTAAGCTTCTGGCAGAGTACAAAC
GGCTTCGTCTACCTGATTGA
mRNA sequence
ATGTTGACCAAGTCCTCTGGTCTGCCGTCTCAGGTGAATGGCCACTCTGCGGTTAGCATCGGGAATTCGAAGGTCGTCGTGTTTGGTGGCCTCGTCGACAAGAAATTTCT
GAGCGATATCGCTGTTTATGACATCGAAAATAAATTATGGTTTCAGCCAGAATGCACTGGCAATGGCTCAAAAGAACAAGTTGGTCCAAGTCCACGAGCTTTTCACATTG
CTGTTGCAATTGACTGCCACATGTTTGTTTTTGGTGGACGTTCAGGTAGCAAAAGGATGGGTGATTTCTGGGTTCTAGACACTGATATCTGGCAATGGTCAGAGCTAACT
AGTTTCGGTGACTTGCCTTCACCAAGGGACTTTGCTGCAGCATCTTCTTTTGGGAACCGTAAAATTGTCATGTATGGGGGCTGGGATGGTAAAAAGTGGCTGTCAGATGT
ATATGTCCTAGACACAATGTCACTTGAATGGACTGAACTTTCAGTTGCTGGATCACTGCCTCCTCCAAGATGTGGCCATACAGCAACTATGCTTGAGAAAAGGTTGCTTG
TCTATGGTGGAAGAGTCACGCCTCCCCCTTTACCATCCTCTGTTTTATCGCCAATATGCCCAAGCAGCCCATCACAGTTTCGCGATTTACCTCAAGCATTACGGCACATT
GCTCCAATACTACAAGATCATGGTCTCTGTATTATGGCAATCCCACCCCCACCACCAACCACTAAAAAGAAATATGCTCCAAAAGGGAAGAAACCGGGTTGTAGAGAGTT
AAAGAATCTTACTTCCAATGTGAACTATGATAAAACAGCCACTTTGGCATTAATGGCGGGCTCTCCGGAAACTAAGAAGACTACAACTACCAGACGGTTAATTAAATCTA
TATGGAGCTCATCCTTCATTGGTTGGGCAGCCCTTAACTCCATAGGAGCTTCGGGCGGGATCATTATCATGTGGAGCGATCCGGATTACACCATTAAGGAGACTATTCGA
GGTCTTTTCTCACTCTCTATTCATGTTTGTATGGCTGAAGGTTTTTCTTTTTGGTTATCGATTATCTATGGCCCAACAGATCGAGCTCAAAGGGCGGATTTTTGGCAAGA
GCTTCATGACTTGGCTGGTTTGGGAGGGAATTGTTGGATTGTTGGGGGAGACTTTAAAGTTACTCGTTGGACCTGGGAGAAATCTAACGACCAGCCTATTACCAACAATA
TGCGAATGTTCAATAGATGGATTGAGGATCATCAGTTGTTGGATATTCCTCTACAAAATGGGTGGTTTACTTGGTCTAGCATAGGTGAAAATCGGTCTCTTACTCTATTG
GACAGATTTTTCACCACTAATGATTGTCTTTTGAAAATGGGGGCAGCTCAATTAACGAGACTTGAACGAGTTACCTCAGACCATTATCCAATTTCTTTGAATTTTGGAGA
TATCTCATGGGGCCCTTGTCCCTTCCGGTTCGAAAATTCATGGTTAAATTGTAAAGATTTCAAATCGGTTTTGGAATCTTGGTGGAACAGAACTCCCCTCATAGGTTGGC
CTGGCCATGGGTTGATGCAGAAACTTAAAGGATTAAAATATGAACTTCGCTCATGGAACCTTTCCCAAAAGAAGGAGTTAGCTCAAATAACTTCCATGGTGGATGAAATA
AAGACTCTGGACAGACTTGAAGAGGCTGATTGTTTAACGTTAGAACAACGAACAAAAAGGCATCAGTTACGAGAATCGATTCAAGACATTTCAGCTATGGAAGCTATTTA
CTGGCATCAAAGGTGCAAGCTAAAATGGCTAAAAGAAGGGGACGAAAATACTAAATTTTTTCATCGCATTATGGCTGCTCGAAAAAGAAAGAATTCAATTACTGAGGTTC
TGTCCAGGGATGGTATTAGTTTGCTTACTGCCCAAGAAATTGAGAAGGAATTTATTGATTTTTATCAGAATTTGTATACCAAAGACAACAACTTGAGATTTCTCCCTACC
AATCTTGACTGGAGTCCAATCAATGAGTCTCAAGCTGCGGATTTAGAACGAGTTTTTTCAGAAGAAGTTCTGCATGACATCCTGAACATTTTTGCTGATTTTTATCATCA
TGGAATTATCAATGCAGCCTTGAATGAGACATACATCTGCCTTATCCCAAAAAAGTTGGACTCAAAAGTAGTATATGACTATCGTCCAATAAGCCTTATTTCTTGTGCTT
ATAAAATAATTGCTCGAGTTCTATCAAATAGGCTGAAAAGAGTTCTTCCTTCTACAATTGCTCCAAATCAATTGGCTTTTGTTGAAGATCGACAAATTCTAGATGCTTCC
TTGATGGCTAATGAGTTAATAGATGATTGGCGTTGTTCGGGCAAGAAGGGGGTGACTATCAAATTGGATCTTGAAAAAGCTTTTGACAAGGTTGATTGGGATTTTTTACA
TGCAGTTATGAAAATAAAAGGTTTTGGCAAAAAATGGCGAAAATGGATATTTGGTTGCATTTCTAGTGCAAACTATTCGATTATTATAAATGGAAGGCCTAGAGGTAAGA
TCATTCCAACAAGGGGTCTACGTCAAGGTGATCCGCTTTCCCCTTTTCTATTTATCATGGTTTCTGATTGCCTCTCTCGTTTAATATCACATGGAGCTTACTTAGGTAAA
ATCTTGACTCACCCTATCGGTGTTTCATCTTTTTGCCTAAATCATCTTCAATTCGCGGACGATACTTTATTATTCTCCACCATGGATCCTACTGCTTTGACTAATCTATT
TAATACTATCAAAGTTTTTGAGCTAGCCTCCGGCCTGAATATTAATTTTGGGAAAAGTGAGCTTCTTGGTATCAATTCTTCTATAGAAGATCTGGAATCAGCTGCTGGAA
TTTTTGGGTGCAAAATTGGTTCCTGGCCATCTAACTATTTAGGTCTTCCTTTGGGTGGTAATCCAAAAAATGTATCTTTTTGGCAGATTGTTATAGAGAAGATTCAACAC
AAGTTACGAAGCTGGAAATATGCATTGATTTCTAAGGGTGGTCGCCATACTCTTATTCAGGCTACTTTATCCAACATGCCTATTTATTATCTCTCTTTATTTCATATTCC
TTCAAAAGTGGTTCTCACCCTGGATAAAATTTTTAGGGATTTTTTTTGGGAAGGTTCTCAAATGAATGGTGGTGTGCACAATATTAATTGGAAGACCACTCAACTTCCCC
AACTTATGGGAGGTCTTGGGATTGGAAATCTAAAGCAGAGAAATGAGGCATTGTTAGCAAAATGGATTTGGCGTTATCTTCATGAGGAGAATTCTCTTTGGCGTCAGCTT
ATTGTGGCTAAATATTACTTCACAGGTGATTCTAGTTTGTGGTCTTCTCCTTCTGTGAGAGGTGGTTCTAAGTCTCCGTGGAGGTATATTAGCTCAACGATTGTTTTACT
CACATCTCGAATACAGAAGCGAGTGGGTAATGGGCAGAACACAGACTTTTGGCACGATCAATGGCTTAATAGTGAGAAATTAGCCACTATTTATCCCAAATTATATAGAC
TATCTTCCAAGCAACATGATGCAATTGCTACTTTCTGGAATGTTGAAAATTCAGCTTGGGACCTCGGCCTTAGGAGGAATCTTAATGAGGAAGAAATTTTGGAATGGGCT
ACTCTGTCCCATCAACTATCCTCGGTTATCTTGAGGAATAATCGAGACTCTTGGTTGTGGCCACTTGACCCTTCAAAGTCATTCACAGTTCGTTCTTTAATGACTGATTT
GCTTCTTTCTAGCAGACCATCATCAAATAATCTTTATTATGTGATATGGAAAGATGCTTACCCTAAAAAAATAAAAATATTTTTGTGGGAGCTTAGTCATGGATGTATCA
ATACTGCTGACCGTCTTCAAAGAAGGATGCCTCACCGTTCTCTATCTCCATCTTGGTGTGTCATGTGTTCTGATGGAGGTGGTGGGCCAATACTGGGTGATTTATGGGCT
TTGAAAGGACTCATTGAAGAAGAGAATGAAACCCCTGGATGGACCCAGTTGAAGCTTCCAGGTCAAGGTCCTTCTCCCCGTTGTGGACATACCATTACATCGGGTGGACA
TTATCTATTGTTATTTGGAGGGCATGGGACTGGTGGTTGGCTCAGTCGCTATGATGTTTACCACAATGATTGCATTGTGTTAGACAGGGTGACTGCTCAGTGGAAACGGT
TGCCTACTGGAAATGAAGCGCCTTCAGCACGGGCATACCATTCAATGAACTGTATTGGATCACGTTATTTGTTATTTGGCGGCTTTGATGGGAAATCAACCTTTGGCGAT
CTATGGTGGTTAGTTACTGAAGAGGACCCAATTGTAAAGAGGTTGTTTTCCGCATCACCCAATGATCTCCCTCAAAATAAGGATTTGACATCGTTGAAGGAAGATTATAA
TTCCGCACACGAGGATTCTCATGGGAGGCACGATGCAGTCGTAGAGTTAGGGAAAAGTTTGGGGATTAGTATCTCATTCTCTAATCCTGGAGTTCCTGTTGTAGGCGAGA
TGGAGGACAAAGAGTTCCTTAATCTAGCATATAGTTTAAGTGAAGTTAAACCTTCCATCTCCGGACAGATCACACATATTGAGGCCACCCAGGCACTTCGGAACCATTGG
AGGAACTCTAATCCTAGGTTAATTCCACTGAAAGAGCTTGAGCCCTTACTTCGTGATTACCAGCGCTTGATCACCCATAATTATTTTACAAATGATGGACCTCATTCAGA
ATTCATTAGTACCAGTCTTCCTGGAAAAGATGCTTATCAATTTTACCATATTAACAACTTTCGTCAGTTACGTATGGACGATATTCCTAAGCTTCTGGCAGAGTACAAAC
GGCTTCGTCTACCTGATTGA
Protein sequence
MLTKSSGLPSQVNGHSAVSIGNSKVVVFGGLVDKKFLSDIAVYDIENKLWFQPECTGNGSKEQVGPSPRAFHIAVAIDCHMFVFGGRSGSKRMGDFWVLDTDIWQWSELT
SFGDLPSPRDFAAASSFGNRKIVMYGGWDGKKWLSDVYVLDTMSLEWTELSVAGSLPPPRCGHTATMLEKRLLVYGGRVTPPPLPSSVLSPICPSSPSQFRDLPQALRHI
APILQDHGLCIMAIPPPPPTTKKKYAPKGKKPGCRELKNLTSNVNYDKTATLALMAGSPETKKTTTTRRLIKSIWSSSFIGWAALNSIGASGGIIIMWSDPDYTIKETIR
GLFSLSIHVCMAEGFSFWLSIIYGPTDRAQRADFWQELHDLAGLGGNCWIVGGDFKVTRWTWEKSNDQPITNNMRMFNRWIEDHQLLDIPLQNGWFTWSSIGENRSLTLL
DRFFTTNDCLLKMGAAQLTRLERVTSDHYPISLNFGDISWGPCPFRFENSWLNCKDFKSVLESWWNRTPLIGWPGHGLMQKLKGLKYELRSWNLSQKKELAQITSMVDEI
KTLDRLEEADCLTLEQRTKRHQLRESIQDISAMEAIYWHQRCKLKWLKEGDENTKFFHRIMAARKRKNSITEVLSRDGISLLTAQEIEKEFIDFYQNLYTKDNNLRFLPT
NLDWSPINESQAADLERVFSEEVLHDILNIFADFYHHGIINAALNETYICLIPKKLDSKVVYDYRPISLISCAYKIIARVLSNRLKRVLPSTIAPNQLAFVEDRQILDAS
LMANELIDDWRCSGKKGVTIKLDLEKAFDKVDWDFLHAVMKIKGFGKKWRKWIFGCISSANYSIIINGRPRGKIIPTRGLRQGDPLSPFLFIMVSDCLSRLISHGAYLGK
ILTHPIGVSSFCLNHLQFADDTLLFSTMDPTALTNLFNTIKVFELASGLNINFGKSELLGINSSIEDLESAAGIFGCKIGSWPSNYLGLPLGGNPKNVSFWQIVIEKIQH
KLRSWKYALISKGGRHTLIQATLSNMPIYYLSLFHIPSKVVLTLDKIFRDFFWEGSQMNGGVHNINWKTTQLPQLMGGLGIGNLKQRNEALLAKWIWRYLHEENSLWRQL
IVAKYYFTGDSSLWSSPSVRGGSKSPWRYISSTIVLLTSRIQKRVGNGQNTDFWHDQWLNSEKLATIYPKLYRLSSKQHDAIATFWNVENSAWDLGLRRNLNEEEILEWA
TLSHQLSSVILRNNRDSWLWPLDPSKSFTVRSLMTDLLLSSRPSSNNLYYVIWKDAYPKKIKIFLWELSHGCINTADRLQRRMPHRSLSPSWCVMCSDGGGGPILGDLWA
LKGLIEEENETPGWTQLKLPGQGPSPRCGHTITSGGHYLLLFGGHGTGGWLSRYDVYHNDCIVLDRVTAQWKRLPTGNEAPSARAYHSMNCIGSRYLLFGGFDGKSTFGD
LWWLVTEEDPIVKRLFSASPNDLPQNKDLTSLKEDYNSAHEDSHGRHDAVVELGKSLGISISFSNPGVPVVGEMEDKEFLNLAYSLSEVKPSISGQITHIEATQALRNHW
RNSNPRLIPLKELEPLLRDYQRLITHNYFTNDGPHSEFISTSLPGKDAYQFYHINNFRQLRMDDIPKLLAEYKRLRLPD