; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0028538 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0028538
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionReverse transcriptase domain-containing protein
Genome locationchr8:24623046..24631773
RNA-Seq ExpressionLag0028538
SyntenyLag0028538
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0004523 - RNA-DNA hybrid ribonuclease activity (molecular function)
InterPro domainsIPR000477 - Reverse transcriptase domain
IPR002156 - Ribonuclease H domain
IPR012337 - Ribonuclease H-like superfamily
IPR036397 - Ribonuclease H superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily
IPR044730 - Ribonuclease H-like domain, plant type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
VVA32947.1 PREDICTED: retrotransposon [Prunus dulcis]9.7e-15631.14Show/hide
Query:  NYSIHHIDADIV-WNGTRWWFTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRDLGFDGGMFT
        +YS+ HI   I   N ++++ T FYGHP+T  R H+WE+LRRL +     W+V GDFNEIL+  +K GG  R  RQ+N+FK A++DC L    F G  FT
Subjt:  NYSIHHIDADIV-WNGTRWWFTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRDLGFDGGMFT

Query:  WCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGLMLSNVVGSWDAYGPTSSLLH-----------------------------QDL
        W  R   G  V  RLDR +AN      +      +L    SDH PI  ++       +A    S+  H                               L
Subjt:  WCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGLMLSNVVGSWDAYGPTSSLLH-----------------------------QDL

Query:  RKCAFEL-----------------------GCRGR---------------------RQNMELRQ-------------NINKEYFMNLFSSFKPSFEDLNR
          CA EL                         +GR                     +Q +  RQ             ++  +YF  LFSS     + + R
Subjt:  RKCAFEL-----------------------GCRGR---------------------RQNMELRQ-------------NINKEYFMNLFSSFKPSFEDLNR

Query:  VLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------------------------------VKDWNHTIITLIPKVRNTRLVIDYRPI
        +L+ +   +T+ MN  L + FT+EE+E  +    PTKAPG D                               V+++NHT+I LIPKV+    V ++RPI
Subjt:  VLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------------------------------VKDWNHTIITLIPKVRNTRLVIDYRPI

Query:  SLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIM
        SLC   YK+I K IANRL +VL  +I E+QS F+P R I DN++   E +H ++  +  +    ALKLDM+KAYDRVEW +L  +M++LGF   WV  +M
Subjt:  SLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIM

Query:  GRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGR-DLAGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYE
          IST TFS+   G   G   P R +RQG PLSPYLFL+C EG S L   A  R DL GV +AR    ++HL FADDS++F+KA        +T+   YE
Subjt:  GRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGR-DLAGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYE

Query:  RASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGK------------------------GGGREILIKSVAQAIPTYAMSC
          SGQ +NY+KS    SPN  +     +  +L + VV     YLGLP+   +G+                          G+EIL+K+V QAIPTY+MSC
Subjt:  RASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGK------------------------GGGREILIKSVAQAIPTYAMSC

Query:  FRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYF
        FR+PKG    ++ + A+FWW    + + IH  +W+ LCK K  GGL FRDLE FNQALLAKQ WR++  P S VA I + RY P+ P L+     N S+ 
Subjt:  FRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYF

Query:  WKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMMVVNFISNSLHWNVPMLKQYLDPMDIDVISCLSINESA-PDRWIWHYDR
        W+   WG ELL KG+R  +GNG SI++++D WLP PS FK+        S +V +  ++S  WNVP+LK      ++D    + +   A  D  IWHY+R
Subjt:  WKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMMVVNFISNSLHWNVPMLKQYLDPMDIDVISCLSINESA-PDRWIWHYDR

Query:  HEKSQRQPTMRYFGAREQRRCNAPGIR----TSNWTHVVIPREPPRLHHRARNT--HHLQSLKMLGEKKGMGINTTTVLEKRGEKSGSVGFGENERETRW
        +     +   R     + +    P +R    +  W  +   + P ++           L   ++L  +K   I  T +      K+ SV          W
Subjt:  HEKSQRQPTMRYFGAREQRRCNAPGIR----TSNWTHVVIPREPPRLHHRARNT--HHLQSLKMLGEKKGMGINTTTVLEKRGEKSGSVGFGENERETRW

Query:  TVARRGKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNKPIPLVEGC
         +    KE+  W     G   EE                        W +  +      ++    GE  E+       W +WN RNS I           
Subjt:  TVARRGKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNKPIPLVEGC

Query:  CAWIFDYLAEVTSVKNCSSKRRQQVDEVRSLFQG-----NDDMVIHVDATCDLREHRASVGVAIRNSGGTLIAALHSPLVPFVSILCAEALAILEGLRLA
           +     E ++  N S     +    ++   G          I+VD      +    VGV +RN+ G  +AA    +         E +A +EGLR A
Subjt:  CAWIFDYLAEVTSVKNCSSKRRQQVDEVRSLFQG-----NDDMVIHVDATCDLREHRASVGVAIRNSGGTLIAALHSPLVPFVSILCAEALAILEGLRLA

Query:  DHMNLKRVKIFSNSLSLV-MMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHTLARQSLSDGSML-WLSTFPSWLSEIVSNEL
          M      +  ++   +  +L  E    ID   +I +V   +  FR+V  ++  R+ N +AHTLA+ +      + W+   P WL  ++  ++
Subjt:  DHMNLKRVKIFSNSLSLV-MMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHTLARQSLSDGSML-WLSTFPSWLSEIVSNEL

XP_022158377.1 uncharacterized protein LOC111024874 [Momordica charantia]3.7e-17934.85Show/hide
Query:  FTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRDLGFDGGMFTWCNRQDVGVQVSLRLDRFLA
        FT FYGHP  H R  TWELLRR+ N D SPWL+GGD N ILW+ E +     D  Q+  F+  +D CSL D+GF GG+FTWCN +  G Q+  RLDRFL 
Subjt:  FTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRDLGFDGGMFTWCNRQDVGVQVSLRLDRFLA

Query:  NLSCSSIFPVCRALNLDWEKSDHR----PIGLMLSNVVGSW--------------------DAYGPTSSLLHQDLRKCAFELGCRGRRQNMELRQNINKE
        N + + +FP        W  + H           S+ +  W                    DAY     L    +     +L      + +  +Q   ++
Subjt:  NLSCSSIFPVCRALNLDWEKSDHR----PIGLMLSNVVGSW--------------------DAYGPTSSLLHQDLRKCAFELGCRGRRQNMELRQNINKE

Query:  YFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGP-------------------------------DVKDWNHTII
        +     +       D+  +++ I  ++T+++N  L   +TKEE+E+AI+   PTKA GP                               D+K WN T I
Subjt:  YFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGP-------------------------------DVKDWNHTII

Query:  TLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYL
         LIPK++  R + D+RPISLCNVSYKII+K I NRL +V+  +I ++QS F+P R+I+DN+IIGHE LH + + ++     AALKLD+SKA+DRVEW YL
Subjt:  TLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYL

Query:  HHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAH--GRDLAGVSIARTCQKISHLFFADDSLIF
          +M ++GF+E W+  I+  IST  FSI +NG   G F+PSR IRQGDPLSPYLFLLC EGLS L +  +  GR L G+        I+HL FADDSLIF
Subjt:  HHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAH--GRDLAGVSIARTCQKISHLFFADDSLIF

Query:  LKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGKGGGREILIKSVAQAIPTYAMSCFRLPK
        L+++  E    + ++ +Y RASGQC+N++KS + FSPNV  +  +YL  IL +K+V+H G YLGLPS F+R +G                          
Subjt:  LKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGKGGGREILIKSVAQAIPTYAMSCFRLPK

Query:  GWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFV
                           E +K+H  +W  +C PKE GGLNFRDLEGFNQAL+AK VWR + +PN  V+ ++K +Y+ +   L  S+ + SSYFWKGF+
Subjt:  GWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFV

Query:  WGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMMVVNFISNSLHWNVPMLKQYLDPMDIDVISCLSINE-SAPDRWIWHYD-RHEKS
        WG +LL KG+RL +GNG +IK FSDPWLPRP+TFK            V +FI+   +W+V  +       D D+I  + I+  +  D W+WHYD R   S
Subjt:  WGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMMVVNFISNSLHWNVPMLKQYLDPMDIDVISCLSINE-SAPDRWIWHYD-RHEKS

Query:  QRQPTMRYFGAREQRRCNAPGIRTSNWTHV---VIPREPPRLHHRARNTHHLQSLKMLGEKKGMG-INTTTVLEKRGEKSGSVGFGENERETRWTVARRG
         R     Y   +      +   R + W  +    +P +      R+ + H   +  +L   +G+G +   T+   R E      F            +R 
Subjt:  QRQPTMRYFGAREQRRCNAPGIRTSNWTHV---VIPREPPRLHHRARNTHHLQSLKMLGEKKGMG-INTTTVLEKRGEKSGSVGFGENERETRWTVARRG

Query:  KEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNKPIPLVEGCCAWIFD
        +++ R                C + E          +N    EL  W+         +  E K+        W IWNDRNS+IH K +  VE  C W+  
Subjt:  KEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNKPIPLVEGCCAWIFD

Query:  YLAEVTSVKNCSSKRRQQVDEVRSLFQ-----GNDDMVIHVDATCDLREHRASVGVAIRNSGGTLIAALHSPLVPF-VSILCAEALAILEGLRLADHMNL
        +L   +  +  +   R Q +  R + Q      +  + ++ DA C  R    S G  IR+S  +L+AA  S  VPF +S L AE   ILEGL+ A   N 
Subjt:  YLAEVTSVKNCSSKRRQQVDEVRSLFQ-----GNDDMVIHVDATCDLREHRASVGVAIRNSGGTLIAALHSPLVPF-VSILCAEALAILEGLRLADHMNL

Query:  KRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHTLARQSLSDGS--MLWLSTFPSWLSEIVSNE
          +++ S+SL  + +++ E     D  + + +++    CF  ++  + +R  N  AH LA+  ++  S    WL  FP+WL ++V  +
Subjt:  KRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHTLARQSLSDGS--MLWLSTFPSWLSEIVSNE

XP_030479133.1 uncharacterized protein LOC115696372 [Cannabis sativa]1.1e-15630.58Show/hide
Query:  KDNVDVSIRNYSIHHIDADIVW-NGTRWWFTVFYGHPETHLRKHTWELLRRLHN-NDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLR
        KD VDV++ + + +H D  +++ +G RW     YG PE + +KHTW L+RRL + +   PWL+ GD NEI   + K+ GPLR    +  F+  +D C L 
Subjt:  KDNVDVSIRNYSIHHIDADIVW-NGTRWWFTVFYGHPETHLRKHTWELLRRLHN-NDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLR

Query:  DLGFDGGMFTWCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGL------------------------------------------
         L   G  FTW   +  G  +  R+D    N             +LD+  SDHR +                                            
Subjt:  DLGFDGGMFTWCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGL------------------------------------------

Query:  -----------------MLSNVVGSWDAYGPTSSLLHQDLRKCAFELGCRGRRQNMELRQ----------------NINKEYFMNLFSSFKPSFEDLNRV
                         +L+N    W        L   D     F      R  N ++++                ++  +YF  +F++       L+ V
Subjt:  -----------------MLSNVVGSWDAYGPTSSLLHQDLRKCAFELGCRGRRQNMELRQ----------------NINKEYFMNLFSSFKPSFEDLNRV

Query:  LDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------------VKD------------------WNHTIITLIPKVRNTRLVIDYRPIS
        L +I   +++  N FL + FT+ +V  A+K     K+PG D             V D                  +N T++TLIPK++  + + D+RPIS
Subjt:  LDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------------VKD------------------WNHTIITLIPKVRNTRLVIDYRPIS

Query:  LCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMG
        LCNV YKII+K++A RL  VLH +I E+QS F+  R ITDN+++  E +H L++ +     +AALK DMSKA+DRVEW ++  VM ++GF+  W+ LIM 
Subjt:  LCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMG

Query:  RISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTL--FSAAHGRDLAGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYE
         + T  FS  INGE  G   P R +RQGDPLSPYLFL+C EGLS L  +    GR L G++++R    ISHLFFADDSL+F +A     G  K  +  Y 
Subjt:  RISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTL--FSAAHGRDLAGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYE

Query:  RASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGKG------------------------GGREILIKSVAQAIPTYAMSC
        RASGQ LN  KS+M FSPN           IL M +     AYLGLP+   R K                         GG+E+L+K+V Q+IPTYAMSC
Subjt:  RASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGKG------------------------GGREILIKSVAQAIPTYAMSC

Query:  FRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYF
        FRLP      + ++ AKFWWGS+ ++KKIH K+W+ LCK K  GG+ FR    FNQALLAKQ WR+  +P S ++ ++KG Y+     +       SS  
Subjt:  FRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYF

Query:  WKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFK-VTFTGPSKTSMMVVNFISNSLHWNVPMLKQYLDPMDIDVI-----SCLSINESAPDRWI
        W+G VWG ELL KG+RL +G G +I+   D W+P    FK   +TG       V ++I+ +  WN+  L+      D+D I     S L +N    DRWI
Subjt:  WKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFK-VTFTGPSKTSMMVVNFISNSLHWNVPMLKQYLDPMDIDVI-----SCLSINESAPDRWI

Query:  WHYD---RHEKSQRQPTMRYFGAREQRRC-NAPGIRTSNWTHVVIPREPPRLHHRARNTHHLQSLKMLGEKKGMGINTTTVLEKRGEKSGSVGFGENERE
        WHY+    +  S         G  +   C +       ++  + +P +      +   +  +   K L  +K +   T ++ +   E  G   F     +
Subjt:  WHYD---RHEKSQRQPTMRYFGAREQRRC-NAPGIRTSNWTHVVIPREPPRLHHRARNTHHLQSLKMLGEKKGMGINTTTVLEKRGEKSGSVGFGENERE

Query:  TRWTVARRGKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNK----P
          W  +                                 G S +  N    + G +     ++      EK          W IW+DRN+ IH K    P
Subjt:  TRWTVARRGKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNK----P

Query:  IPLVEGCCAWIFDYLAEVTSVKNCSSKRRQQVDEVRSLFQGNDDMVIHVDATCDLREHRASVGVAIRNSGGTLIAALHSPLVPFVSILCAEALAILEGLR
        + +     A++  Y +  ++V   +S R  Q           +   ++VDA  D    +  +GV +RNS G + AAL +P +        EA A+  GL 
Subjt:  IPLVEGCCAWIFDYLAEVTSVKNCSSKRRQQVDEVRSLFQGNDDMVIHVDATCDLREHRASVGVAIRNSGGTLIAALHSPLVPFVSILCAEALAILEGLR

Query:  LADHMNLKRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHTLARQSLS-DGSMLWLSTFPSWLSEIVSNEL
         A    L    + ++ L LV  L  + +       ++ DV+  +  F +  V ++ RN N  AH LAR +L  D   +WL   PS +  +V N++
Subjt:  LADHMNLKRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHTLARQSLS-DGSMLWLSTFPSWLSEIVSNEL

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa]4.8e-15530.22Show/hide
Query:  KDNVDVSIRNYSIHHIDADIVW-NGTRWWFTVFYGHPETHLRKHTWELLRRLHN-NDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLR
        +D VDV++ + + ++ D  I++ +G RW F+  YG PE   +KHTW+L+RRL + +   PWL+ GD NEI  ++ KNGGPLRD  Q+  F+  +D C L 
Subjt:  KDNVDVSIRNYSIHHIDADIVW-NGTRWWFTVFYGHPETHLRKHTWELLRRLHN-NDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLR

Query:  DLGFDGGMFTWCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGLML-----------------------------SNVVGSWDAYG
        ++   G  FTW   +     +  RLD    N      F   +  +LD+  SDHR +   +                               +  SW    
Subjt:  DLGFDGGMFTWCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGLML-----------------------------SNVVGSWDAYG

Query:  PT--SSLLHQDLRKCAFELGCRGRRQNMELRQNIN-----------------------------------------------------------------
         T  +S L   LR CA  L     R+  +++Q+I                                                                  
Subjt:  PT--SSLLHQDLRKCAFELGCRGRRQNMELRQNIN-----------------------------------------------------------------

Query:  -----------------------------KEYFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD---------
                                      +YF  LF++       L+ VL +I   ++ + N FL + FT  EV  A+K     K+PG D         
Subjt:  -----------------------------KEYFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD---------

Query:  ----------------------VKDWNHTIITLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLH
                               + +N T+ITLIPK++  + + D+RPISLCNV+YKII+K++A R   VLH +I E+QS F+  R ITDN+++  E +H
Subjt:  ----------------------VKDWNHTIITLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLH

Query:  FLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAA
         L++       +AALKLDMSKA+DRVEW +L  VM ++GF    + LIM  + T +FS  INGE +G   P R +RQGDPLSPYLFL+C EGLS L    
Subjt:  FLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAA

Query:  H--GRDLAGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSF
           GR L G++++R    I+HL FADDSL+F +A     G  K  +  Y RASGQ LN  KS+M FSPN  +        IL M + A   +YLGLP+  
Subjt:  H--GRDLAGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSF

Query:  SRGKG------------------------GGREILIKSVAQAIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRD
         R K                         GG+E+L+K+V QAIPTYAMSCFRL   +  ++ ++ A+FWWGS+ ++KKIH K WK LC  K  GGL FR 
Subjt:  SRGKG------------------------GGREILIKSVAQAIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRD

Query:  LEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFK-VTFTGPSKT
           FNQA LAKQ WR+   PNS ++ ++KGRY+     +       SS  W+G VWG ELL KG+ + +G+G  +    D W+P    FK + FTG    
Subjt:  LEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFK-VTFTGPSKT

Query:  SMMVVNFISNSLHWNVPMLKQYLDPMDIDVISCLSIN-ESAPDRWIWHYDRHEKSQRQPTMRYFGAREQRRCNAPGIRTSNWTHVV----IPREPPRLHH
        S +V ++I+++  W++ +L     P DID I  + ++  S  DRW WHYD       +       + E +  ++       W  +     +P +      
Subjt:  SMMVVNFISNSLHWNVPMLKQYLDPMDIDVISCLSIN-ESAPDRWIWHYDRHEKSQRQPTMRYFGAREQRRCNAPGIRTSNWTHVV----IPREPPRLHH

Query:  RARNTHHLQSLKMLGEKKGMGINTTTVLEKRGEKSGSVGFGENERETRWTVARRGKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELG
        R  N+  L   + L  +K +   T ++  +  E  G   F     ++ W       +   ++L                     D        DG + L 
Subjt:  RARNTHHLQSLKMLGEKKGMGINTTTVLEKRGEKSGSVGFGENERETRWTVARRGKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELG

Query:  GWARGAVTVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNKPIPLVEGCCAWIFDYLAEVTSVKNCSSKR----RQQVDEVRSLFQGNDDMVIHVDATCD
               T+    + EK   T      W IW+DRN+ IH K +       +    YLA   SVK+ ++            V+ +     ++ ++VDA  D
Subjt:  GWARGAVTVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNKPIPLVEGCCAWIFDYLAEVTSVKNCSSKR----RQQVDEVRSLFQGNDDMVIHVDATCD

Query:  LREHRASVGVAIRNSGGTLIAALHSPLVPFVSILCAEALAILEGLRLADHMNLKRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEY
           ++  +GV IR+S G +IAA+  P+V        EA A+  GL+ A  + L+   + ++ L LV  L+ + +       ++ D+   +  F +  + +
Subjt:  LREHRASVGVAIRNSGGTLIAALHSPLVPFVSILCAEALAILEGLRLADHMNLKRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEY

Query:  VNRNYNFLAHTLARQSLS-DGSMLWLSTFPSWLSEIVSNE
        V R+ N  AH LA+Q+L  D   +W+   PS +  +V N+
Subjt:  VNRNYNFLAHTLARQSLS-DGSMLWLSTFPSWLSEIVSNE

XP_042962672.1 uncharacterized protein LOC122296942 [Carya illinoinensis]1.1e-15431.34Show/hide
Query:  KDNVDVSIRNYSIHHIDADIV-WNGTRWWFTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRD
        KD++++ I NYS HHI A I   +G  W  T  YGH ++  R   W LL+ L      PW+V GDFNEIL   EK GG +R   Q+ +F+E + DC LRD
Subjt:  KDNVDVSIRNYSIHHIDADIV-WNGTRWWFTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRD

Query:  LGFDGGMFTWCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGL------------MLSNVVGSWDAYGPTSSLLHQ----------
        LG+ G  FTW NR+     V  RLDRFLAN     +FP  R  +     SDH P+ L             L      W      SS++ +          
Subjt:  LGFDGGMFTWCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGL------------MLSNVVGSWDAYGPTSSLLHQ----------

Query:  ------DLRKCAFELG--------------------------------------------------------------------CRG----------RRQ
               +  CA ELG                                                                    C            RR+
Subjt:  ------DLRKCAFELG--------------------------------------------------------------------CRG----------RRQ

Query:  N--MELRQN------------INKEYFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------VKDW----
        N  M+L+              +  EYF  LF++      D+  VL  +  +VT +MN  L K +  EEVE+A+K   P+KAPGPD        K W    
Subjt:  N--MELRQN------------INKEYFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------VKDW----

Query:  --------------------NHTIITLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHR
                            NHT ITLIPK  +   V D+RPISLCNV YKI++KVIANRL SVL DII  SQS F+PGR I+DN++I +E LHFL+N R
Subjt:  --------------------NHTIITLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHR

Query:  TCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRD-L
          +  + +LKLDMSKAYDRV+W +L  +M  LGF +  + LIM  + T +FS+ +NG   G   PSR +RQGDPLSPYLFLLC EGL +L      R+ +
Subjt:  TCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRD-L

Query:  AGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGK---
         G+ I R   +I+HL FADDS+IF KA  +     ++++  YERASGQC+N  K+ M FS NV  D  R +  +           YLG P    R K   
Subjt:  AGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGK---

Query:  ---------------------GGGREILIKSVAQAIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQA
                              GGRE+LIK+VA +IPTYAMSCF  PK     +  + A+FWWG      KIH  RW+ LC  K  GG+ FRDL  FN A
Subjt:  ---------------------GGGREILIKSVAQAIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQA

Query:  LLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMMVVNFI
        LLAKQ WR++ N +S    + K +Y+PN    +     NSSY WKG    ++ L+KG R  +GNG+++++F DPW+P           PS ++++     
Subjt:  LLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMMVVNFI

Query:  SNSLHWNVPMLKQYLDPMDIDVISCLSINESAPDRWIWHYDRHEKSQRQPTMRYFGAREQRRCNAPGIRTSN---WT---HVVIPREPPRLHHRARNTHH
                          ++ V S +  +  A D   W ++++     +   RY    +Q         +S    W    H+ +P++      RA     
Subjt:  SNSLHWNVPMLKQYLDPMDIDVISCLSINESAPDRWIWHYDRHEKSQRQPTMRYFGAREQRRCNAPGIRTSN---WT---HVVIPREPPRLHHRARNTHH

Query:  LQSLKMLGEKKGMGINTTTVLEKRGEKSGSVGFGENERETRWTVARRGKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAV
        L +   L +K  +   T  +  +  E +    F  +E  + W V     +  +  L                                 W+L   AR   
Subjt:  LQSLKMLGEKKGMGINTTTVLEKRGEKSGSVGFGENERETRWTVARRGKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAV

Query:  TVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNKPIPLVEGCCAWIFDYLAEVTSVKNCSSKRRQQVDEVRSLFQGNDDMVIHVDATCDLREHRASVGVA
         VR  D        +     W +W  RN  I+      +            E   V+      ++    VR     ND + +++D         A +GV 
Subjt:  TVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNKPIPLVEGCCAWIFDYLAEVTSVKNCSSKRRQQVDEVRSLFQGNDDMVIHVDATCDLREHRASVGVA

Query:  IRNSGGTLIAALHSPLVPFVSILCAEALAILEGLRLADHMNLKRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHT
        +R+  G +I A         S    EA+A+L GL+L     + ++ + ++ L LV  L + +    D+  I+ D+ R M  F+ V V +VNR  N +AH 
Subjt:  IRNSGGTLIAALHSPLVPFVSILCAEALAILEGLRLADHMNLKRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHT

Query:  LARQS-LSDGSMLWLSTFPSWLSE
        LAR + L D   +W    PS++S+
Subjt:  LARQS-LSDGSMLWLSTFPSWLSE

TrEMBL top hitse value%identityAlignment
A0A2N9ELB0 Uncharacterized protein7.8e-15936.37Show/hide
Query:  NVDVSIRNYSIHHIDADIVW-NGTRWWFTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRDLG
        +++V I++YS HHIDAD+++ +G  W  T FYGHPE  LR H+W LLR LH     PWLV GDFNEI   DEK G   R   Q+  F+E++ DCSLRDLG
Subjt:  NVDVSIRNYSIHHIDADIVW-NGTRWWFTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRDLG

Query:  FDGGMFTWCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDH------------RPIGLMLSN----------------VVGSWDA--YGPT
        + G  FTW NR++ G  V +RLDR +AN +  S+FP  + L++    SDH             PIG                     +  +W++   G  
Subjt:  FDGGMFTWCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDH------------RPIGLMLSN----------------VVGSWDA--YGPT

Query:  SSLLHQDLRKCAFEL----------------------------------------------------------------------------GCRGRRQN-
          ++ Q +++C  +L                                                                             C  +RQ  
Subjt:  SSLLHQDLRKCAFEL----------------------------------------------------------------------------GCRGRRQN-

Query:  ---MELR-------------QNINKEYFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------VKDW---
           + LR              +I  +YF NLF+S  P  E ++ V+ S++  V+ DMN  L + ++ EE+  A+    P+KAPGPD        K W   
Subjt:  ---MELR-------------QNINKEYFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------VKDW---

Query:  ---------------------NHTIITLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNH
                             N T I LIPKV+N   + ++RPISLCNV YKI++KV+ NR+  +L  +I +SQS F+PGR ITDN+I+  E LH+L+N 
Subjt:  ---------------------NHTIITLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNH

Query:  RTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRD-
        R+      A KLDMSKAYDRVEWDYL  ++++LGF+ +WV LIM  +++A++S+ +NGEA G+ KPSR +RQGDPLSPYLFL+C EGLS+L   A  RD 
Subjt:  RTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRD-

Query:  -LAGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGK-
           GV+I R   +ISHLFFADDS+IF +A   +  V + ++  YE+ASGQ +N  K+ + FS N        + S+           YLGLP    R K 
Subjt:  -LAGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGK-

Query:  -----------------------GGGREILIKSVAQAIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFN
                                 GRE LIK+V QAIPTYAMSCF+ P G  A + S+  +FWWG     +KIH  R   L +PK+ GG+ FRDL  FN
Subjt:  -----------------------GGGREILIKSVAQAIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFN

Query:  QALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMMVVN
        +ALLA+Q WR++ +P S V+  +K +Y+P+   LD    NN+SY W+       +L+ G+R  +G G SIK++ D WLP P+++KV       +    V+
Subjt:  QALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMMVVN

Query:  FI--SNSLHWNVPMLKQYLDPMDIDVISCLSINESAP-DRWIW
         +   +   WN  +L+Q   P D+++I  + +++  P DR IW
Subjt:  FI--SNSLHWNVPMLKQYLDPMDIDVISCLSINESAP-DRWIW

A0A2N9F345 Uncharacterized protein5.0e-15831.26Show/hide
Query:  MVVPKGLVND--TLGETSVNGRKLANPPTVPKGNSKAATCKKRARVGFIPKGLDPNEAHELTKRKEGPDVEQAGLKRLKFLDDDDMEAGSAD--------
        +V P  +V+   T+   S+N R L NP TV                         NE H+L  RK+GP++         FL +  +E  S +        
Subjt:  MVVPKGLVND--TLGETSVNGRKLANPPTVPKGNSKAATCKKRARVGFIPKGLDPNEAHELTKRKEGPDVEQAGLKRLKFLDDDDMEAGSAD--------

Query:  -SPARTNERILFVGMSV--KDNVDVSIRNYSIHHIDADIVW-NGTRWWFTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLR
              +   L  G+++    +VDV I++YS HHI A++V  +G +W  T FYGHPET LR  +W LLR L      PW+V GDFNEI+  DE  G   R
Subjt:  -SPARTNERILFVGMSV--KDNVDVSIRNYSIHHIDADIVW-NGTRWWFTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLR

Query:  DFRQLNDFKEAIDDCSLRDLGFDGGMFTWCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGLMLSN--------------------
        +  Q+  F+EA+ DC L D+GF G   TW N ++    V  RLDR +AN    S+FP+    +L    SDH  +GL+++                     
Subjt:  DFRQLNDFKEAIDDCSLRDLGFDGGMFTWCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGLMLSN--------------------

Query:  ------------VVGSWDAY--GPTSSLLHQDLRKCAFEL------------------------------------GCRGRRQNMELR-----------Q
                    + G+WD +  G     + + ++ C   L                                      +  RQ + +            +
Subjt:  ------------VVGSWDAY--GPTSSLLHQDLRKCAFEL------------------------------------GCRGRRQNMELR-----------Q

Query:  NINKEYFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD---VKDWNHTIITLIPKVRNTRLVIDYRPISLCNV
         +  +YF  +FSS  P  + ++ V+  +   VT  MN+ L K F  EEV+ A+    PTKAPGPD   +   N T I LIPKV     +  +RPISLCNV
Subjt:  NINKEYFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD---VKDWNHTIITLIPKVRNTRLVIDYRPISLCNV

Query:  SYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRIST
         YKI +KV+ NR+ S+L  +I ESQS F+PGR ITDN+II  ET+H+L+N R       A+KLDMSKAYDRVEWDYL  +M++LGFH NWV LIM  ++T
Subjt:  SYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRIST

Query:  ATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRDL-AGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYERASGQ
        AT++I +NGE  G+ KP R +RQGDPLSPYLFLLC EGLS L   A    L  G+SI R   +ISHLFFADDS+IF +A + E      ++  Y  ASGQ
Subjt:  ATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRDL-AGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYERASGQ

Query:  CLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGK------------------------GGGREILIKSVAQAIPTYAMSCFRLPK
         +N  K+ + FS N+ Q T   + S+           YLGLP    R K                          GRE+LIK+V QAIP YAM CF+ P 
Subjt:  CLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGK------------------------GGGREILIKSVAQAIPTYAMSCFRLPK

Query:  GWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFV
        G  A +S++  +FWWG     +KIH      L K K  GG+  RDL+ FN+ALLA+Q W ++  P+S +  ++K +Y+PN   L++   +N+SY W+   
Subjt:  GWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFV

Query:  WGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVT--FTGPSKTSMMVVNFISNSLHWNVPMLKQYLDPMDIDVISCLSINESAP-DRWIWHYDRH--
           E+L  GMR  +G G  IK++ D WLP PST+KVT   TG    + +      +S+ WN+P+L       D++ I  + +++  P D  IW   +   
Subjt:  WGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVT--FTGPSKTSMMVVNFISNSLHWNVPMLKQYLDPMDIDVISCLSINESAP-DRWIWHYDRH--

Query:  --EKSQRQPTMRYFGAREQRRCNAPGIRTSNWTHVVIPREPPRLHHRARNTHHLQSLKMLGEKKGMGINTTTVLEKRGEKSGSVGFGENERETRWTVARR
           KS  +    +    E    ++       W+ +     PP++            + +    KG+    T + +K    + S  +   E ET       
Subjt:  --EKSQRQPTMRYFGAREQRRCNAPGIRTSNWTHVVIPREPPRLHHRARNTHHLQSLKMLGEKKGMGINTTTVLEKRGEKSGSVGFGENERETRWTVARR

Query:  GKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNK-PIPLVEGC---C
         K+   W    A R  +E    C     +                  +    +T  A    E    T     AWAIW  RN ++ N+   P+ E C    
Subjt:  GKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNK-PIPLVEGC---C

Query:  AWIFDYLAEVTSVKNCSSKRRQQVDEVRSLFQGNDDMVIHVDATCDLRE--HRASVGVAIRNSGGTLIAALHSPLVPFVSILCAEALAILEGLRLADHMN
            D+L E  S+   S      + E++  ++  + M   ++ +C L    ++  +GV +R+S G + A++          L     A+L  L  A  + 
Subjt:  AWIFDYLAEVTSVKNCSSKRRQQVDEVRSLFQGNDDMVIHVDATCDLRE--HRASVGVAIRNSGGTLIAALHSPLVPFVSILCAEALAILEGLRLADHMN

Query:  LKRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHTLARQSLSDG-SMLWLSTFPSWLSEIV
        L+ +++      L  +L+K  +    +  +I D+      F  ++  ++    N  A  LA ++LS     +WL   P+ ++  V
Subjt:  LKRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHTLARQSLSDG-SMLWLSTFPSWLSEIV

A0A2N9GDB5 Reverse transcriptase domain-containing protein6.6e-15838.04Show/hide
Query:  KDNVDVSIRNYSIHHIDADIVWN-GTRWWFTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRD
        K  + + + +YS  HIDA +  N    W FT FYG PETH R  +W LLRRL++    PW   GDFNE++  +EK G   R  RQ+  F++ +D+C L D
Subjt:  KDNVDVSIRNYSIHHIDADIVWN-GTRWWFTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRD

Query:  LGFDGGMFTWCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGLMLSNVV------------------------GSWDAYGP-----
        LGF G  FTW N +  G     RLDR +A      +FP  R  +L+   SDH+PI +    VV                         SW    P     
Subjt:  LGFDGGMFTWCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGLMLSNVV------------------------GSWDAYGP-----

Query:  ------TSSLLHQD------LRKCAFEL---------------------------GCRG---RRQN--MELRQNINK-------------EYFMNLFSSF
               +S+L +D      L+K    L                            CR    +R+N    LR +  +             +++ +LF + 
Subjt:  ------TSSLLHQD------LRKCAFEL---------------------------GCRG---RRQN--MELRQNINK-------------EYFMNLFSSF

Query:  KPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------------------------------VKDWNHTIITLIPKVRNT
         P  + +  V++ I R VT +MN  L K FT  EVE+A+K   P KAPGPD                               +K  NHT ITLIPKV+N 
Subjt:  KPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------------------------------VKDWNHTIITLIPKVRNT

Query:  RLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGF
          V+++RPISLCNV YKII+K++ANRL  +L  I+ ESQS FIPGR ITDN+++  ETLH +Q+ +T KT   ALKLDMSKAYDRVEW +L  VM ++GF
Subjt:  RLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGF

Query:  HENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLF-SAAHGRDLAGVSIARTCQKISHLFFADDSLIFLKAIAMEFGV
        HE WV ++M  IST ++SI +NGE  G+ KPSR +RQGDPLSPYLFLLC EGL +L     +   L GVSI+R   KI+HLFFADDSL+F KA   +   
Subjt:  HENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLF-SAAHGRDLAGVSIARTCQKISHLFFADDSLIFLKAIAMEFGV

Query:  FKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGK------------------------GGGREILIKSVAQ
         ++I+  YE+ASGQ +N  K+ + FS +      R +  +L++  +     YLGLPS   R K                          GRE+LIKSVAQ
Subjt:  FKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGK------------------------GGGREILIKSVAQ

Query:  AIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDI
        AIP+YAMSCFRLP   I  +  +  +FWWG   +  K+H   W+ LCK K  GG+  RDL  FN+ALLAKQVWR++ NP+S    + K +Y+P    L+ 
Subjt:  AIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDI

Query:  SSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMM-VVNFISNSLH-WNVPMLKQYLDPMDIDVISCLSI-NES
         S    SY WK  +   +L+ KG    +G+G +I ++ D WLP      +T      TS+  V++ I   L  W   ++K+   P +  VI  + + +  
Subjt:  SSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMM-VVNFISNSLH-WNVPMLKQYLDPMDIDVISCLSI-NES

Query:  APDRWIW
          D  IW
Subjt:  APDRWIW

A0A2N9IYW9 Reverse transcriptase domain-containing protein1.3e-15830.58Show/hide
Query:  GNSKAATCKKRARVGFIPKGLDPNEAHELTKRKE-------GPDVEQAGLKRLKFLDDDDMEAG--SADSPARTNERILFVGMSV--KDNVDVSIRNYSI
        G S++   KKRAR      G+D +  H  T+++        G D E +   + + LD DD   G  S ++  +        G+++    +V V+I +YS 
Subjt:  GNSKAATCKKRARVGFIPKGLDPNEAHELTKRKE-------GPDVEQAGLKRLKFLDDDDMEAG--SADSPARTNERILFVGMSV--KDNVDVSIRNYSI

Query:  HHIDADIV-WNGTRWWFTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRDLGFDGGMFTWCNR
        HHI+A+IV  +G  W FT FYG+PET LR  +W LL+ LH+    PW+V GDFNEI   DE+ G   R+  Q+  F+EA+ DCSL D+GF G +FTW NR
Subjt:  HHIDADIV-WNGTRWWFTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRDLGFDGGMFTWCNR

Query:  QDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGLMLSN--------------------------------VVGSWD-------AYGPTSS
        ++    V +RLDR +A  +   +FP     +L    SDH  +GL+L                                  +  +W+        Y     
Subjt:  QDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGLMLSN--------------------------------VVGSWD-------AYGPTSS

Query:  LLH------------------------QDLRKC------AFELG-----------------------------------------CRGRRQNMELRQNIN
        + H                        Q LR+        +E G                                         C  +R+     Q I 
Subjt:  LLH------------------------QDLRKC------AFELG-----------------------------------------CRGRRQNMELRQNIN

Query:  K-----------------EYFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------VKDW----------
                          +YF  +F+S  P   D+  V+  +   VTT MN  L K F++EEV+ A+    P+KAPGPD        K W          
Subjt:  K-----------------EYFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------VKDW----------

Query:  --------------NHTIITLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNY
                      N T I LIPKV +   +  +RPISLCNV YKI +KV+ NR+ ++L  +I +SQS F+PGR ITDN+I+  ET+H+L+N RT     
Subjt:  --------------NHTIITLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNY

Query:  AALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRD--LAGVSI
         A KLDMSKAYDRVEW+YL  +M++LGFHE WV LIM  ++TAT++I +NGE  G+ KPSR +RQGDPLSPYLFLLC EGLS L   A  RD  + GVSI
Subjt:  AALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRD--LAGVSI

Query:  ARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGK--------
         R   +ISHLFFADDS++F +A   E G  + I+  Y  ASGQ +N AK+ + FS N  Q     + S+    +      YLGLP    R K        
Subjt:  ARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGK--------

Query:  ----------------GGGREILIKSVAQAIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQ
                          GRE+LIK+V QAIPTYAMSCF+ P G  A +S++  +FWWG     +KIH      L +PK  GG+ FRDL+ FN ALLAKQ
Subjt:  ----------------GGGREILIKSVAQAIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQ

Query:  VWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMMVVN--FISNS
         WR++  P+S V  ++K +Y+P    L+ +   N+S+ W+       +L +G+R  +GNG  IK++ D WLP P T+K+            V+     N 
Subjt:  VWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMMVVN--FISNS

Query:  LHWNVPMLKQYLDPMDIDVISCLSINESAP-DRWIWHYDR----HEKSQRQPTMRYFGAREQRRCNAPGIRTSNWTHVVIPREPPRLHHRARNTHHLQSL
        + WN  ++ Q   P +++VI  + +++  P D  IW   +      KS  +  +      E     +    +  W  +      P++            +
Subjt:  LHWNVPMLKQYLDPMDIDVISCLSINESAP-DRWIWHYDR----HEKSQRQPTMRYFGAREQRRCNAPGIRTSNWTHVVIPREPPRLHHRARNTHHLQSL

Query:  KMLGEKKGMGINTTTVLEKRGEKSGSVGFGENERETRWTVARRGKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRA
         +    KG+      + +K    S S  + E+  ET         +   W    A R  +     C  +               G     +  G V    
Subjt:  KMLGEKKGMGINTTTVLEKRGEKSGSVGFGENERETRWTVARRGKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRA

Query:  DDDGEKKEKTQRKGGAWAIWNDRNSMIHNKPIPLVEGCC----AWIFDYLAEVTSVKNCSSKRRQQVDEVRSLFQGNDDMVIHVDATCDLREHRASVGVA
            E    T     AWAIW  RN ++ N  I  V   C        DY+  V+ +K         +  V        +  +++     +   +   GV 
Subjt:  DDDGEKKEKTQRKGGAWAIWNDRNSMIHNKPIPLVEGCC----AWIFDYLAEVTSVKNCSSKRRQQVDEVRSLFQGNDDMVIHVDATCDLREHRASVGVA

Query:  IRNSGGTLIAALHSPLVPFVSILCAEALAILEGLRLADHMNLKRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHT
        IR+S G + AA+ S L      L   A+ +L  L+ A  + L+R+++   +  L+ +++  T     +  II D+   +  F  +   ++ ++ N  AH 
Subjt:  IRNSGGTLIAALHSPLVPFVSILCAEALAILEGLRLADHMNLKRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHT

Query:  LARQSLSDGS-MLWLSTFPSWLSEIVSNE
        LA ++LS  S  +WL   P+ ++  V ++
Subjt:  LARQSLSDGS-MLWLSTFPSWLSEIVSNE

A0A6J1DX30 uncharacterized protein LOC1110248741.8e-17934.85Show/hide
Query:  FTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRDLGFDGGMFTWCNRQDVGVQVSLRLDRFLA
        FT FYGHP  H R  TWELLRR+ N D SPWL+GGD N ILW+ E +     D  Q+  F+  +D CSL D+GF GG+FTWCN +  G Q+  RLDRFL 
Subjt:  FTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRDLGFDGGMFTWCNRQDVGVQVSLRLDRFLA

Query:  NLSCSSIFPVCRALNLDWEKSDHR----PIGLMLSNVVGSW--------------------DAYGPTSSLLHQDLRKCAFELGCRGRRQNMELRQNINKE
        N + + +FP        W  + H           S+ +  W                    DAY     L    +     +L      + +  +Q   ++
Subjt:  NLSCSSIFPVCRALNLDWEKSDHR----PIGLMLSNVVGSW--------------------DAYGPTSSLLHQDLRKCAFELGCRGRRQNMELRQNINKE

Query:  YFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGP-------------------------------DVKDWNHTII
        +     +       D+  +++ I  ++T+++N  L   +TKEE+E+AI+   PTKA GP                               D+K WN T I
Subjt:  YFMNLFSSFKPSFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGP-------------------------------DVKDWNHTII

Query:  TLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYL
         LIPK++  R + D+RPISLCNVSYKII+K I NRL +V+  +I ++QS F+P R+I+DN+IIGHE LH + + ++     AALKLD+SKA+DRVEW YL
Subjt:  TLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYL

Query:  HHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAH--GRDLAGVSIARTCQKISHLFFADDSLIF
          +M ++GF+E W+  I+  IST  FSI +NG   G F+PSR IRQGDPLSPYLFLLC EGLS L +  +  GR L G+        I+HL FADDSLIF
Subjt:  HHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAH--GRDLAGVSIARTCQKISHLFFADDSLIF

Query:  LKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGKGGGREILIKSVAQAIPTYAMSCFRLPK
        L+++  E    + ++ +Y RASGQC+N++KS + FSPNV  +  +YL  IL +K+V+H G YLGLPS F+R +G                          
Subjt:  LKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGKGGGREILIKSVAQAIPTYAMSCFRLPK

Query:  GWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFV
                           E +K+H  +W  +C PKE GGLNFRDLEGFNQAL+AK VWR + +PN  V+ ++K +Y+ +   L  S+ + SSYFWKGF+
Subjt:  GWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDISSRNNSSYFWKGFV

Query:  WGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMMVVNFISNSLHWNVPMLKQYLDPMDIDVISCLSINE-SAPDRWIWHYD-RHEKS
        WG +LL KG+RL +GNG +IK FSDPWLPRP+TFK            V +FI+   +W+V  +       D D+I  + I+  +  D W+WHYD R   S
Subjt:  WGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMMVVNFISNSLHWNVPMLKQYLDPMDIDVISCLSINE-SAPDRWIWHYD-RHEKS

Query:  QRQPTMRYFGAREQRRCNAPGIRTSNWTHV---VIPREPPRLHHRARNTHHLQSLKMLGEKKGMG-INTTTVLEKRGEKSGSVGFGENERETRWTVARRG
         R     Y   +      +   R + W  +    +P +      R+ + H   +  +L   +G+G +   T+   R E      F            +R 
Subjt:  QRQPTMRYFGAREQRRCNAPGIRTSNWTHV---VIPREPPRLHHRARNTHHLQSLKMLGEKKGMG-INTTTVLEKRGEKSGSVGFGENERETRWTVARRG

Query:  KEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNKPIPLVEGCCAWIFD
        +++ R                C + E          +N    EL  W+         +  E K+        W IWNDRNS+IH K +  VE  C W+  
Subjt:  KEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNKPIPLVEGCCAWIFD

Query:  YLAEVTSVKNCSSKRRQQVDEVRSLFQ-----GNDDMVIHVDATCDLREHRASVGVAIRNSGGTLIAALHSPLVPF-VSILCAEALAILEGLRLADHMNL
        +L   +  +  +   R Q +  R + Q      +  + ++ DA C  R    S G  IR+S  +L+AA  S  VPF +S L AE   ILEGL+ A   N 
Subjt:  YLAEVTSVKNCSSKRRQQVDEVRSLFQ-----GNDDMVIHVDATCDLREHRASVGVAIRNSGGTLIAALHSPLVPF-VSILCAEALAILEGLRLADHMNL

Query:  KRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHTLARQSLSDGS--MLWLSTFPSWLSEIVSNE
          +++ S+SL  + +++ E     D  + + +++    CF  ++  + +R  N  AH LA+  ++  S    WL  FP+WL ++V  +
Subjt:  KRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHTLARQSLSDGS--MLWLSTFPSWLSEIVSNE

SwissProt top hitse value%identityAlignment
O00370 LINE-1 retrotransposable element ORF2 protein1.0e-2224.66Show/hide
Query:  QNINKEYFMNLFSSFKPSFEDLNRVLD-------------SISRKVTTD-----MNAFLTK--------------RFTKEEVEIAIKGFQPTKAPGPDVK
        Q   +EY+ +L+++   + E+++  LD             S++R +T       +N+  TK              R+ +E V   +K FQ  +  G    
Subjt:  QNINKEYFMNLFSSFKPSFEDLNRVLD-------------SISRKVTTD-----MNAFLTK--------------RFTKEEVEIAIKGFQPTKAPGPDVK

Query:  DWNHTIITLIPKV-RNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQN-HRTCKTNYAALKLDMSKA
         +    I LIPK  R+T    ++RPISL N+  KI+ K++ANR+   +  +I   Q  FIPG     N+    ++++ +Q+ +R    N+  + +D  KA
Subjt:  DWNHTIITLIPKV-RNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQN-HRTCKTNYAALKLDMSKA

Query:  YDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRDLAGVSIARTCQKISHLFF
        +D+++  ++   + +LG    ++ +I       T +I +NG+    F      RQG PLSP LF +  E L+   +    +++ G+ + +   K+S   F
Subjt:  YDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRDLAGVSIARTCQKISHLFF

Query:  ADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSI---LQMKVVAHLGAYL
        ADD +++L+   +       ++  + + SG  +N  KS      N +Q  S+ +  +   +  K + +LG  L
Subjt:  ADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSI---LQMKVVAHLGAYL

P08548 LINE-1 reverse transcriptase homolog5.6e-2123.98Show/hide
Query:  QNINKEYFMNLFSSFKPSFEDLNRVLDSIS-RKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD--VKDWNHTI-----------------------
        Q I  EY+  L+S    + +++++ L++    +++      L +  +  E+   I+     K+PGPD    ++  T                        
Subjt:  QNINKEYFMNLFSSFKPSFEDLNRVLDSIS-RKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD--VKDWNHTI-----------------------

Query:  ------ITLIPKV-RNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKT-NYAALKLDMSKA
              ITLIPK  ++     +YRPISL N+  KI+ K++ NR+   +  II   Q  FIPG     N+    ++++ +Q+    K  ++  L +D  KA
Subjt:  ------ITLIPKV-RNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKT-NYAALKLDMSKA

Query:  YDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRDLAGVSIARTCQKISHLFF
        +D ++  ++   + ++G    ++ LI    S  T +I +NG     F      RQG PLSP LF +  E L+   +    + + G+ I     K+S   F
Subjt:  YDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRDLAGVSIARTCQKISHLFF

Query:  ADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSI---LQMKVVAHLGAYLGLPSSFSRGKGGGREILIKSVAQAIPT
        ADD +++L+           ++  Y   SG  +N  KS+     N  Q      +SI   +  K + +LG Y  L            E L K +A+ +  
Subjt:  ADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSI---LQMKVVAHLGAYLGLPSSFSRGKGGGREILIKSVAQAIPT

Query:  YAMSCFRLPKGWIARVS
        +      +P  W+ R++
Subjt:  YAMSCFRLPKGWIARVS

P11369 LINE-1 retrotransposable element ORF2 protein5.8e-2623.53Show/hide
Query:  QNINKEYFMNLFSSFKPSFEDLNRVLDSIS-RKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------------------------------VK
        QN  + ++  L+S+   + +++++ LD     K+  D    L    + +E+E  I      K+PGPD                                 
Subjt:  QNINKEYFMNLFSSFKPSFEDLNRVLDSIS-RKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPD-------------------------------VK

Query:  DWNHTIITLIPKVRNTRLVID-YRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAY
         +    ITLIPK +     I+ +RPISL N+  KI+ K++ANR+   +  II   Q  FIPG     N+      +H++  ++    N+  + LD  KA+
Subjt:  DWNHTIITLIPKVRNTRLVID-YRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAY

Query:  DRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRDLAGVSIARTCQKISHLFFA
        D+++  ++  V+ R G    ++ +I    S    +IK+NGE           RQG PLSPYLF +  E L+   +    +++ G+ I +   KIS L  A
Subjt:  DRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAAHGRDLAGVSIARTCQKISHLFFA

Query:  DDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGL----------PSSFSRGKGGGREIL----
        DD ++++            ++ ++    G  +N  KS M F     +   + +       +V +   YLG+            +F   K   +E L    
Subjt:  DDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGL----------PSSFSRGKGGGREIL----

Query:  ---------IKSVAQAIPTYAMSCF-----RLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVW
                 I  V  AI   A+  F     ++P  +   +     KF W   ++  +I +   KD    +  GG+   DL+ + +A++ K  W
Subjt:  ---------IKSVAQAIPTYAMSCF-----RLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVW

P14381 Transposon TX1 uncharacterized 149 kDa protein4.9e-2525.13Show/hide
Query:  DMNAFLTKRFTKEEVEIAIKGFQPTKAPGPDVKDWNHTIITLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDN
        D +  LT+ F K E+ ++ +                  +++L+PK  + RL+ ++RP+SL +  YKI+ K I+ RL SVL ++I   QS  +PGR+I DN
Subjt:  DMNAFLTKRFTKEEVEIAIKGFQPTKAPGPDVKDWNHTIITLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFIPGRSITDN

Query:  MIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFE
        + +  + LHF    R    + A L LD  KA+DRV+  YL   +    F   +V  +    ++A   +KIN   T      R +RQG PLS  L+ L  E
Subjt:  MIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFE

Query:  GLSTLFSAAHGRDLAGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQD--TSRYLNSILQMKVVAHLG
            L      + L G+ +     ++    +ADD ++  + + ++    +     Y  AS   +N++KS      +++ D     + +   + K++ +LG
Subjt:  GLSTLFSAAHGRDLAGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQD--TSRYLNSILQMKVVAHLG

Query:  AYLG----------------LPSSFSRGKG-------GGREILIKSVAQAIPTYAMSCFRLPKGWIARVSSLCAKFWW
         YL                 + +   + KG        GR ++I  +  +   Y + C    + +IA++      F W
Subjt:  AYLG----------------LPSSFSRGKG-------GGREILIKSVAQAIPTYAMSCFRLPKGWIARVSSLCAKFWW

P93295 Uncharacterized mitochondrial protein AtMg003109.5e-2941.96Show/hide
Query:  AIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKE-VGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLD
        A+P YAMSCFRL K    +++S   +FWW S +  +KI    W+ LCK KE  GGL FRDL  FNQALLAKQ +R+I  P++ ++ +++ RY+P+   ++
Subjt:  AIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKE-VGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLD

Query:  ISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWL
         S     SY W+  + G ELL +G+   +G+G   K++ D W+
Subjt:  ISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWL

Arabidopsis top hitse value%identityAlignment
AT3G09510.1 Ribonuclease H-like superfamily protein6.1e-0723.14Show/hide
Query:  IKGRYWPNFPSLDISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKM----FSDPWLPRPSTFKVTFTGPSKTSMMVVNFISNS---LHWNVPMLKQ
        +K RY+ +   LD   R   SY W   + G+ LLKKG R L+G+G++I++      D   PRP   + T+       M + N          W+   + Q
Subjt:  IKGRYWPNFPSLDISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKM----FSDPWLPRPSTFKVTFTGPSKTSMMVVNFISNS---LHWNVPMLKQ

Query:  YLDPMDIDVISCLSINES-APDRWIWHYDRHEKSQRQPTMR--YFGAREQRRCNAPGIRTSNWTHVVIPREPPRLHHRARNTHHLQSLK-MLGEKKGMGI
        ++D  D   I  + + +S  PD+ IW+Y+    +  + T+R  Y+        N P I          P     L  R  N   +  LK  L       +
Subjt:  YLDPMDIDVISCLSINES-APDRWIWHYDRHEKSQRQPTMR--YFGAREQRRCNAPGIRTSNWTHVVIPREPPRLHHRARNTHHLQSLK-MLGEKKGMGI

Query:  NTTTVLEKRG---EKSGSVGFGENERETRWTVARRGKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRADDDGEKKE
         TT  L  RG   + S      ENE             MA W L  +     +               S + E +            +    D       
Subjt:  NTTTVLEKRG---EKSGSVGFGENERETRWTVARRGKEMARWELGTAGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRADDDGEKKE

Query:  KTQRKGGAWAIWNDRNSMIHNK----PIPLVEGCCAWIFDYLAEVTSVKNCSSKRRQQVDEVRSLFQGNDDMVI--HVDATCDLREHRASVGVAIRNSGG
        K       W IW  RN+++ NK    P   V    A   D+L    S K   S  R Q+ E +  ++      +  + DA  D+++  A+ G  IRN  G
Subjt:  KTQRKGGAWAIWNDRNSMIHNK----PIPLVEGCCAWIFDYLAEVTSVKNCSSKRRQQVDEVRSLFQGNDDMVI--HVDATCDLREHRASVGVAIRNSGG

Query:  TLIAALHSPLVPFVSILCAEALAILEGLRLADHMNLKRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHTLARQSL
        T I+     L    + L AE  A+L  L+        +V +  +  +L+ ++    + +  + + + D+      F S+   ++ R  N LAH LA+   
Subjt:  TLIAALHSPLVPFVSILCAEALAILEGLRLADHMNLKRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERKMECFRSVTVEYVNRNYNFLAHTLARQSL

Query:  SDGSMLWLS-TFPSWLSEIVSNE
        +  +    S + P WL     N+
Subjt:  SDGSMLWLS-TFPSWLSEIVSNE

AT4G20520.1 RNA binding;RNA-directed DNA polymerases2.6e-1334.19Show/hide
Query:  IANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKIN
        +  RL  ++ ++I  +Q++FIPGR  TDN++   E +H ++  +  K  +  LKLD+ KAYDR+ WDYL   +I  GF E W    +  I+ +TF  +  
Subjt:  IANRLNSVLHDIIDESQSTFIPGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKIN

Query:  GEATGFFKPSREIRQGD
            G    S+  R  D
Subjt:  GEATGFFKPSREIRQGD

AT4G29090.1 Ribonuclease H-like superfamily protein6.3e-2839.44Show/hide
Query:  AIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDI
        A+PTY M+CF LPK    ++ S+ A FWW +  E K +H K W  L   K  GG+ F+D+E FN ALL KQ+WR++  P S +A + K RY+     L+ 
Subjt:  AIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLDI

Query:  SSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWL
           +  S+ WK      E+L++G R ++GNG  I ++   WL
Subjt:  SSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWL

ATMG00310.1 RNA-directed DNA polymerase (reverse transcriptase)-related family protein6.7e-3041.96Show/hide
Query:  AIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKE-VGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLD
        A+P YAMSCFRL K    +++S   +FWW S +  +KI    W+ LCK KE  GGL FRDL  FNQALLAKQ +R+I  P++ ++ +++ RY+P+   ++
Subjt:  AIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKE-VGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSLD

Query:  ISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWL
         S     SY W+  + G ELL +G+   +G+G   K++ D W+
Subjt:  ISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWL

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase)5.4e-1152.94Show/hide
Query:  INGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAA--HGRDLAGVSIARTCQKISHLFFADDS
        ING   G   PSR +RQGDPLSPYLF+LC E LS L   A   GR L G+ ++    +I+HL FADD+
Subjt:  INGEATGFFKPSREIRQGDPLSPYLFLLCFEGLSTLFSAA--HGRDLAGVSIARTCQKISHLFFADDS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCCCTCTGGTTGGCCACACGATCTAAGTGTAACTGAACAGGTCGAGAACGACGAGTGGGAAGAGCTAAGCTCACCATTTCCAAAACGACATATAGTTCAACACCA
TATCAAACCAATATTACGCCCGTTTCGATTGTTTTTGGCCTGTGATGGAGTGATTTTAGCCACTGAGCTACTATATCAACACGTCGGGTGGCAATATGTTTTCTCATCAC
TTCCTGACTGCTCTGTGATTGATTCTTTATCTTTTCTTGTGTGTTTTTTGTCCCCTCTTTTGCCGAGTATGAATCCTATTTCTTTGGTGGAGGATGGGCCTAAGTTAATC
TTAACCGAATTTGAGGAGGACGTCTATGTGGATGCGGATCGAAGTGTGGTGGACAGCACGAGCCAACTACTTGGTTGCTGTTGGATTGGAAAGTTGCAGTCTCACCGTTT
TTTGGTAGCCGAAGTTATGAGGAAAACTTTCAAGGCGACTAGGCGGATTGATAGTCGGAATGGAGCTTCATCATCAGCGGTGGGCTTGGGTGCTGGTCGGTCGATCTTGG
GAGCTCCGGCGCATTGTACACTTTTCGGGGGCTCGGTTTTCGGGTCTCCAGCAAAGACAGACATTCGGAAATCAAGTGGGCTAAGACGATGGCTTCCAAAAGTCACCGGT
GGCGAAGCTTTGATTGGCGGTGTAGCTGTTGGAATGCCGGCGGTGGGGGATTTTTCGGTGGATCTAGAAAGGAAGACAATTACTCTTGGACATTCAATGAATGATTATGA
CCATTTAATGGCAAGAGGCAAGAGGCTTTTAAATGTCAAGCTGGAGGAGATTTTAAATTCCAACGATAATGGTGGCATTGAAGGAGGAGAGATATTGGGTTCAGATATGG
CAAATATGGAAAACGACATTGCAAAGACCCCGCCGGCGTGTGTTAAACGACTTGATTGGGATGACCAATTTTCAACATGTGGTAACGGTTCAAAGATGAATGAAGAAGGA
GCAGCGATTTTTATTAAGGAGCCTGGTATGGTTGTGCCGAAGGGTCTCGTTAATGATACACTTGGTGAGACTTCGGTCAACGGCCGTAAACTAGCCAACCCACCTACTGT
ACCGAAAGGAAATTCAAAGGCAGCAACTTGTAAGAAGCGGGCTCGAGTTGGATTTATCCCAAAAGGCCTAGATCCTAACGAGGCTCATGAGCTTACAAAGCGGAAGGAGG
GACCAGATGTAGAGCAAGCTGGTCTGAAACGACTTAAATTTCTTGATGATGATGACATGGAGGCGGGGTCTGCGGATAGCCCCGCCAGGACGAATGAACGAATATTATTT
GTTGGAATGTCCGTGAAGGATAATGTTGATGTCTCTATTCGAAATTATTCTATCCATCATATAGATGCAGATATTGTTTGGAATGGGACTCGTTGGTGGTTTACTGTTTT
TTATGGTCATCCGGAAACACATTTAAGGAAGCATACTTGGGAACTTCTCAGACGTTTGCATAACAATGATGATTCACCATGGTTAGTCGGAGGTGACTTTAATGAAATCC
TGTGGGATGATGAAAAGAATGGGGGTCCATTGCGTGACTTTCGCCAACTTAATGACTTTAAGGAGGCTATTGATGATTGCAGCCTTCGGGATTTGGGTTTTGACGGTGGT
ATGTTTACTTGGTGTAATCGACAGGATGTAGGAGTCCAAGTTAGCCTTCGTCTTGATCGATTTCTGGCTAATCTAAGTTGTAGTTCAATTTTTCCAGTCTGTCGGGCTTT
GAATTTGGATTGGGAAAAATCAGATCACCGACCAATTGGGCTTATGTTATCTAATGTTGTTGGTTCTTGGGATGCGTATGGTCCGACCTCTTCATTATTGCATCAGGATT
TGCGAAAATGTGCCTTTGAATTGGGTTGCCGGGGACGAAGGCAAAATATGGAGTTGAGACAGAACATTAATAAGGAATATTTTATGAATCTTTTTAGCTCTTTTAAGCCC
TCATTTGAGGACTTAAATCGAGTTTTGGATTCTATTTCGAGGAAAGTGACTACGGACATGAATGCCTTTTTGACGAAGCGATTCACAAAGGAAGAAGTGGAAATTGCCAT
TAAGGGTTTTCAACCTACAAAAGCCCCTGGTCCAGATGTTAAGGATTGGAACCACACCATTATCACCCTTATTCCAAAGGTTCGAAACACAAGGTTAGTAATTGATTATA
GGCCTATTAGTCTTTGTAATGTTTCATATAAAATCATAACAAAGGTAATTGCTAATCGGCTAAATTCGGTTCTTCATGATATTATTGATGAGAGCCAGTCAACATTTATT
CCAGGACGATCAATTACTGACAATATGATTATTGGTCATGAAACTTTACATTTCCTACAAAACCATAGAACTTGCAAAACAAACTATGCTGCTCTAAAATTAGATATGAG
TAAGGCGTATGATAGGGTGGAGTGGGATTATTTGCATCATGTAATGATTCGTTTGGGCTTCCATGAAAACTGGGTTGTTTTAATTATGGGTCGTATTTCTACTGCTACTT
TTTCGATTAAGATAAATGGGGAAGCGACAGGTTTTTTTAAACCTTCAAGGGAAATTCGACAAGGTGACCCATTGTCCCCTTATTTGTTTCTTCTATGCTTTGAAGGGCTA
TCGACTCTGTTTTCAGCCGCGCATGGTCGAGATTTAGCTGGAGTGTCTATTGCCCGCACATGCCAAAAAATTTCTCACTTATTTTTTGCTGATGACAGCTTGATTTTTCT
GAAAGCCATTGCGATGGAGTTTGGTGTTTTTAAGACCATTATGGGTGCATATGAACGGGCTTCCGGACAGTGTTTGAATTATGCTAAATCTATGATGTGTTTCTCTCCTA
ACGTCCAGCAAGATACTAGTAGGTATCTCAATAGTATTCTCCAGATGAAAGTTGTGGCTCACCTTGGGGCGTATCTTGGTCTTCCATCATCTTTCTCTCGGGGGAAGGGG
GGAGGGAGGGAGATATTAATTAAGAGTGTGGCGCAAGCAATTCCTACATATGCTATGAGTTGTTTCCGGTTGCCAAAAGGTTGGATAGCTCGCGTGTCTAGTTTGTGTGC
CAAATTTTGGTGGGGATCCACAGATGAGCACAAAAAAATTCACCGGAAGAGGTGGAAGGATCTATGTAAGCCAAAGGAGGTAGGTGGTTTAAATTTTAGGGATTTGGAAG
GTTTTAATCAGGCTTTGTTGGCTAAACAGGTGTGGCGAGTGATTGATAATCCTAACTCACGTGTAGCCCATATTATTAAAGGTCGTTATTGGCCTAACTTTCCTTCCCTA
GATATATCAAGTCGTAATAACTCCTCTTATTTTTGGAAGGGTTTCGTTTGGGGCATGGAGCTTTTGAAAAAAGGAATGAGGTTACTTTTGGGAAATGGGCGATCTATTAA
GATGTTCTCAGATCCTTGGCTACCTCGCCCTTCTACGTTTAAGGTAACATTCACTGGTCCAAGCAAAACTTCCATGATGGTGGTGAACTTTATTTCTAATTCCCTCCATT
GGAACGTTCCAATGTTGAAGCAATATTTAGATCCAATGGATATTGATGTGATTAGTTGTTTGTCGATCAATGAATCAGCTCCAGACAGATGGATTTGGCACTATGATAGA
CATGAAAAGAGCCAGAGACAACCGACCATGCGCTATTTCGGTGCAAGAGAGCAAAGAAGATGTAACGCCCCAGGAATCCGAACAAGTAACTGGACCCATGTGGTCATCCC
TCGTGAGCCTCCTCGTCTACATCACCGGGCACGTAATACTCATCACCTGCAAAGTCTGAAAATGTTGGGGGAAAAGAAGGGGATGGGTATCAATACAACCACAGTACTGG
AAAAGAGGGGGGAAAAGAGCGGCAGCGTGGGTTTCGGCGAAAACGAGCGAGAGACGAGATGGACGGTGGCGCGAAGAGGAAAGGAGATGGCGCGCTGGGAGCTCGGCACG
GCTGGGCGACGCGAGGAGGAAAGAGGCGGTGGCTGCGGCGCTCGCGAAGGGGAAGGCGACGGCGGCTCTCGCGAAGGGGAAAACGACGGTGGCTGGGAGCTTGGCGGCTG
GGCGAGAGGGGCGGTGACAGTGAGGGCAGACGACGATGGAGAAAAGAAAGAAAAAACGCAAAGAAAAGGGGGAGCTTGGGCTATTTGGAATGATCGAAATTCGATGATTC
ATAATAAGCCCATTCCTCTGGTTGAAGGGTGTTGTGCCTGGATTTTTGATTATTTGGCCGAGGTTACCTCTGTTAAAAATTGTTCCTCGAAGCGGAGGCAGCAGGTGGAT
GAAGTTAGATCACTCTTCCAAGGGAATGATGATATGGTGATTCATGTGGATGCGACTTGTGATTTGAGGGAGCATCGTGCGAGCGTTGGGGTGGCTATTCGGAATTCAGG
TGGTACTTTAATTGCGGCTTTACATAGTCCTTTGGTACCATTTGTTAGTATTTTATGTGCGGAGGCATTGGCCATTTTAGAAGGTCTTCGTTTGGCGGATCACATGAATC
TCAAAAGGGTAAAAATTTTCTCGAATTCTCTTTCTTTGGTGATGATGTTAAAGAAGGAAACTGCCATGAATATCGACGTCACCTCTATTATTTGGGATGTTGAAAGGAAG
ATGGAATGCTTTAGGTCGGTAACGGTTGAATATGTGAATCGAAATTATAATTTCTTAGCTCATACACTGGCTCGCCAGAGTCTTTCAGATGGATCGATGCTATGGTTATC
GACTTTCCCTTCTTGGTTATCTGAGATTGTTTCTAATGAGTTACATTTGTTTGTACCCCTGCGGGGAGATTGTTCTTAA
mRNA sequenceShow/hide mRNA sequence
ATGTCTCCCTCTGGTTGGCCACACGATCTAAGTGTAACTGAACAGGTCGAGAACGACGAGTGGGAAGAGCTAAGCTCACCATTTCCAAAACGACATATAGTTCAACACCA
TATCAAACCAATATTACGCCCGTTTCGATTGTTTTTGGCCTGTGATGGAGTGATTTTAGCCACTGAGCTACTATATCAACACGTCGGGTGGCAATATGTTTTCTCATCAC
TTCCTGACTGCTCTGTGATTGATTCTTTATCTTTTCTTGTGTGTTTTTTGTCCCCTCTTTTGCCGAGTATGAATCCTATTTCTTTGGTGGAGGATGGGCCTAAGTTAATC
TTAACCGAATTTGAGGAGGACGTCTATGTGGATGCGGATCGAAGTGTGGTGGACAGCACGAGCCAACTACTTGGTTGCTGTTGGATTGGAAAGTTGCAGTCTCACCGTTT
TTTGGTAGCCGAAGTTATGAGGAAAACTTTCAAGGCGACTAGGCGGATTGATAGTCGGAATGGAGCTTCATCATCAGCGGTGGGCTTGGGTGCTGGTCGGTCGATCTTGG
GAGCTCCGGCGCATTGTACACTTTTCGGGGGCTCGGTTTTCGGGTCTCCAGCAAAGACAGACATTCGGAAATCAAGTGGGCTAAGACGATGGCTTCCAAAAGTCACCGGT
GGCGAAGCTTTGATTGGCGGTGTAGCTGTTGGAATGCCGGCGGTGGGGGATTTTTCGGTGGATCTAGAAAGGAAGACAATTACTCTTGGACATTCAATGAATGATTATGA
CCATTTAATGGCAAGAGGCAAGAGGCTTTTAAATGTCAAGCTGGAGGAGATTTTAAATTCCAACGATAATGGTGGCATTGAAGGAGGAGAGATATTGGGTTCAGATATGG
CAAATATGGAAAACGACATTGCAAAGACCCCGCCGGCGTGTGTTAAACGACTTGATTGGGATGACCAATTTTCAACATGTGGTAACGGTTCAAAGATGAATGAAGAAGGA
GCAGCGATTTTTATTAAGGAGCCTGGTATGGTTGTGCCGAAGGGTCTCGTTAATGATACACTTGGTGAGACTTCGGTCAACGGCCGTAAACTAGCCAACCCACCTACTGT
ACCGAAAGGAAATTCAAAGGCAGCAACTTGTAAGAAGCGGGCTCGAGTTGGATTTATCCCAAAAGGCCTAGATCCTAACGAGGCTCATGAGCTTACAAAGCGGAAGGAGG
GACCAGATGTAGAGCAAGCTGGTCTGAAACGACTTAAATTTCTTGATGATGATGACATGGAGGCGGGGTCTGCGGATAGCCCCGCCAGGACGAATGAACGAATATTATTT
GTTGGAATGTCCGTGAAGGATAATGTTGATGTCTCTATTCGAAATTATTCTATCCATCATATAGATGCAGATATTGTTTGGAATGGGACTCGTTGGTGGTTTACTGTTTT
TTATGGTCATCCGGAAACACATTTAAGGAAGCATACTTGGGAACTTCTCAGACGTTTGCATAACAATGATGATTCACCATGGTTAGTCGGAGGTGACTTTAATGAAATCC
TGTGGGATGATGAAAAGAATGGGGGTCCATTGCGTGACTTTCGCCAACTTAATGACTTTAAGGAGGCTATTGATGATTGCAGCCTTCGGGATTTGGGTTTTGACGGTGGT
ATGTTTACTTGGTGTAATCGACAGGATGTAGGAGTCCAAGTTAGCCTTCGTCTTGATCGATTTCTGGCTAATCTAAGTTGTAGTTCAATTTTTCCAGTCTGTCGGGCTTT
GAATTTGGATTGGGAAAAATCAGATCACCGACCAATTGGGCTTATGTTATCTAATGTTGTTGGTTCTTGGGATGCGTATGGTCCGACCTCTTCATTATTGCATCAGGATT
TGCGAAAATGTGCCTTTGAATTGGGTTGCCGGGGACGAAGGCAAAATATGGAGTTGAGACAGAACATTAATAAGGAATATTTTATGAATCTTTTTAGCTCTTTTAAGCCC
TCATTTGAGGACTTAAATCGAGTTTTGGATTCTATTTCGAGGAAAGTGACTACGGACATGAATGCCTTTTTGACGAAGCGATTCACAAAGGAAGAAGTGGAAATTGCCAT
TAAGGGTTTTCAACCTACAAAAGCCCCTGGTCCAGATGTTAAGGATTGGAACCACACCATTATCACCCTTATTCCAAAGGTTCGAAACACAAGGTTAGTAATTGATTATA
GGCCTATTAGTCTTTGTAATGTTTCATATAAAATCATAACAAAGGTAATTGCTAATCGGCTAAATTCGGTTCTTCATGATATTATTGATGAGAGCCAGTCAACATTTATT
CCAGGACGATCAATTACTGACAATATGATTATTGGTCATGAAACTTTACATTTCCTACAAAACCATAGAACTTGCAAAACAAACTATGCTGCTCTAAAATTAGATATGAG
TAAGGCGTATGATAGGGTGGAGTGGGATTATTTGCATCATGTAATGATTCGTTTGGGCTTCCATGAAAACTGGGTTGTTTTAATTATGGGTCGTATTTCTACTGCTACTT
TTTCGATTAAGATAAATGGGGAAGCGACAGGTTTTTTTAAACCTTCAAGGGAAATTCGACAAGGTGACCCATTGTCCCCTTATTTGTTTCTTCTATGCTTTGAAGGGCTA
TCGACTCTGTTTTCAGCCGCGCATGGTCGAGATTTAGCTGGAGTGTCTATTGCCCGCACATGCCAAAAAATTTCTCACTTATTTTTTGCTGATGACAGCTTGATTTTTCT
GAAAGCCATTGCGATGGAGTTTGGTGTTTTTAAGACCATTATGGGTGCATATGAACGGGCTTCCGGACAGTGTTTGAATTATGCTAAATCTATGATGTGTTTCTCTCCTA
ACGTCCAGCAAGATACTAGTAGGTATCTCAATAGTATTCTCCAGATGAAAGTTGTGGCTCACCTTGGGGCGTATCTTGGTCTTCCATCATCTTTCTCTCGGGGGAAGGGG
GGAGGGAGGGAGATATTAATTAAGAGTGTGGCGCAAGCAATTCCTACATATGCTATGAGTTGTTTCCGGTTGCCAAAAGGTTGGATAGCTCGCGTGTCTAGTTTGTGTGC
CAAATTTTGGTGGGGATCCACAGATGAGCACAAAAAAATTCACCGGAAGAGGTGGAAGGATCTATGTAAGCCAAAGGAGGTAGGTGGTTTAAATTTTAGGGATTTGGAAG
GTTTTAATCAGGCTTTGTTGGCTAAACAGGTGTGGCGAGTGATTGATAATCCTAACTCACGTGTAGCCCATATTATTAAAGGTCGTTATTGGCCTAACTTTCCTTCCCTA
GATATATCAAGTCGTAATAACTCCTCTTATTTTTGGAAGGGTTTCGTTTGGGGCATGGAGCTTTTGAAAAAAGGAATGAGGTTACTTTTGGGAAATGGGCGATCTATTAA
GATGTTCTCAGATCCTTGGCTACCTCGCCCTTCTACGTTTAAGGTAACATTCACTGGTCCAAGCAAAACTTCCATGATGGTGGTGAACTTTATTTCTAATTCCCTCCATT
GGAACGTTCCAATGTTGAAGCAATATTTAGATCCAATGGATATTGATGTGATTAGTTGTTTGTCGATCAATGAATCAGCTCCAGACAGATGGATTTGGCACTATGATAGA
CATGAAAAGAGCCAGAGACAACCGACCATGCGCTATTTCGGTGCAAGAGAGCAAAGAAGATGTAACGCCCCAGGAATCCGAACAAGTAACTGGACCCATGTGGTCATCCC
TCGTGAGCCTCCTCGTCTACATCACCGGGCACGTAATACTCATCACCTGCAAAGTCTGAAAATGTTGGGGGAAAAGAAGGGGATGGGTATCAATACAACCACAGTACTGG
AAAAGAGGGGGGAAAAGAGCGGCAGCGTGGGTTTCGGCGAAAACGAGCGAGAGACGAGATGGACGGTGGCGCGAAGAGGAAAGGAGATGGCGCGCTGGGAGCTCGGCACG
GCTGGGCGACGCGAGGAGGAAAGAGGCGGTGGCTGCGGCGCTCGCGAAGGGGAAGGCGACGGCGGCTCTCGCGAAGGGGAAAACGACGGTGGCTGGGAGCTTGGCGGCTG
GGCGAGAGGGGCGGTGACAGTGAGGGCAGACGACGATGGAGAAAAGAAAGAAAAAACGCAAAGAAAAGGGGGAGCTTGGGCTATTTGGAATGATCGAAATTCGATGATTC
ATAATAAGCCCATTCCTCTGGTTGAAGGGTGTTGTGCCTGGATTTTTGATTATTTGGCCGAGGTTACCTCTGTTAAAAATTGTTCCTCGAAGCGGAGGCAGCAGGTGGAT
GAAGTTAGATCACTCTTCCAAGGGAATGATGATATGGTGATTCATGTGGATGCGACTTGTGATTTGAGGGAGCATCGTGCGAGCGTTGGGGTGGCTATTCGGAATTCAGG
TGGTACTTTAATTGCGGCTTTACATAGTCCTTTGGTACCATTTGTTAGTATTTTATGTGCGGAGGCATTGGCCATTTTAGAAGGTCTTCGTTTGGCGGATCACATGAATC
TCAAAAGGGTAAAAATTTTCTCGAATTCTCTTTCTTTGGTGATGATGTTAAAGAAGGAAACTGCCATGAATATCGACGTCACCTCTATTATTTGGGATGTTGAAAGGAAG
ATGGAATGCTTTAGGTCGGTAACGGTTGAATATGTGAATCGAAATTATAATTTCTTAGCTCATACACTGGCTCGCCAGAGTCTTTCAGATGGATCGATGCTATGGTTATC
GACTTTCCCTTCTTGGTTATCTGAGATTGTTTCTAATGAGTTACATTTGTTTGTACCCCTGCGGGGAGATTGTTCTTAA
Protein sequenceShow/hide protein sequence
MSPSGWPHDLSVTEQVENDEWEELSSPFPKRHIVQHHIKPILRPFRLFLACDGVILATELLYQHVGWQYVFSSLPDCSVIDSLSFLVCFLSPLLPSMNPISLVEDGPKLI
LTEFEEDVYVDADRSVVDSTSQLLGCCWIGKLQSHRFLVAEVMRKTFKATRRIDSRNGASSSAVGLGAGRSILGAPAHCTLFGGSVFGSPAKTDIRKSSGLRRWLPKVTG
GEALIGGVAVGMPAVGDFSVDLERKTITLGHSMNDYDHLMARGKRLLNVKLEEILNSNDNGGIEGGEILGSDMANMENDIAKTPPACVKRLDWDDQFSTCGNGSKMNEEG
AAIFIKEPGMVVPKGLVNDTLGETSVNGRKLANPPTVPKGNSKAATCKKRARVGFIPKGLDPNEAHELTKRKEGPDVEQAGLKRLKFLDDDDMEAGSADSPARTNERILF
VGMSVKDNVDVSIRNYSIHHIDADIVWNGTRWWFTVFYGHPETHLRKHTWELLRRLHNNDDSPWLVGGDFNEILWDDEKNGGPLRDFRQLNDFKEAIDDCSLRDLGFDGG
MFTWCNRQDVGVQVSLRLDRFLANLSCSSIFPVCRALNLDWEKSDHRPIGLMLSNVVGSWDAYGPTSSLLHQDLRKCAFELGCRGRRQNMELRQNINKEYFMNLFSSFKP
SFEDLNRVLDSISRKVTTDMNAFLTKRFTKEEVEIAIKGFQPTKAPGPDVKDWNHTIITLIPKVRNTRLVIDYRPISLCNVSYKIITKVIANRLNSVLHDIIDESQSTFI
PGRSITDNMIIGHETLHFLQNHRTCKTNYAALKLDMSKAYDRVEWDYLHHVMIRLGFHENWVVLIMGRISTATFSIKINGEATGFFKPSREIRQGDPLSPYLFLLCFEGL
STLFSAAHGRDLAGVSIARTCQKISHLFFADDSLIFLKAIAMEFGVFKTIMGAYERASGQCLNYAKSMMCFSPNVQQDTSRYLNSILQMKVVAHLGAYLGLPSSFSRGKG
GGREILIKSVAQAIPTYAMSCFRLPKGWIARVSSLCAKFWWGSTDEHKKIHRKRWKDLCKPKEVGGLNFRDLEGFNQALLAKQVWRVIDNPNSRVAHIIKGRYWPNFPSL
DISSRNNSSYFWKGFVWGMELLKKGMRLLLGNGRSIKMFSDPWLPRPSTFKVTFTGPSKTSMMVVNFISNSLHWNVPMLKQYLDPMDIDVISCLSINESAPDRWIWHYDR
HEKSQRQPTMRYFGAREQRRCNAPGIRTSNWTHVVIPREPPRLHHRARNTHHLQSLKMLGEKKGMGINTTTVLEKRGEKSGSVGFGENERETRWTVARRGKEMARWELGT
AGRREEERGGGCGAREGEGDGGSREGENDGGWELGGWARGAVTVRADDDGEKKEKTQRKGGAWAIWNDRNSMIHNKPIPLVEGCCAWIFDYLAEVTSVKNCSSKRRQQVD
EVRSLFQGNDDMVIHVDATCDLREHRASVGVAIRNSGGTLIAALHSPLVPFVSILCAEALAILEGLRLADHMNLKRVKIFSNSLSLVMMLKKETAMNIDVTSIIWDVERK
MECFRSVTVEYVNRNYNFLAHTLARQSLSDGSMLWLSTFPSWLSEIVSNELHLFVPLRGDCS