CuGenDBv2

Lag0028919 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0028919
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Reverse transcriptase domain-containing protein
Genome location: chr8:32867647..32872518
RNA-Seq Expression: Lag0028919
Synteny: Lag0028919
Gene Ontology terms: GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR000477 - Reverse transcriptase domain
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily
IPR043502 - DNA/RNA polymerase superfamily


Homology
GenBank top hits (e value, % identity, alignment)
OMO59710.1 reverse transcriptase [Corchorus capsularis] (e value: 5.3e-178; % identity: 41.45)
Query:  PPRIMSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQD
        PP  M  + WN +GLG+P  +R L +L++ + P + F+ ETK   F+++ ++RR +    F V   GRSGGLA+ WD +V   L+S+S +HID WV    
Subjt:  PPRIMSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQD

Query:  --SRWRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYER
          ++WR TG YG +    +  +W LL +     +  W   GDFN +L   EKDGGR++P A++ AF++A+D CGL D+G+ G+  TW       E I+ER
Subjt:  --SRWRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYER

Query:  LDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQGSRGQ-----RIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQA
        LDR   T  W   F  + +THL  S  DH P  + L   + Q  R +     +   F+  W +  D ++LV   W  T     G+         +R +Q 
Subjt:  LDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQGSRGQ-----RIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQA

Query:  MLAWGKAKLGNYPKRIREANQRVQ------SAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLEN
          + GK     Y ++ R   +R+       + I+G+       + V+   ++  +L EEE +W Q SR  WL EGDRNT +FH +AS RRK N I  LE 
Subjt:  MLAWGKAKLGNYPKRIREANQRVQ------SAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLEN

Query:  DQGMWSQDKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV------------------
        + G  S D  E+  + + YF+ LF SS       D  L+ V P + ++MN  LL  F++EE+  ALKQ HP KAPGPDG+ V                  
Subjt:  DQGMWSQDKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV------------------

Query:  -----------AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWA
                     N+T IVL+PKV  P+ +++F+PISLCNV YK+ISKVLVNR+K IL   IS +QSAF+PGR + DN ++ FE +H L+    G  G+ 
Subjt:  -----------AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWA

Query:  ALKLDMSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLAR
        ALKLDMSKAYDRVEW FL+ IML MGF R WV+LI+RCV SV+FS  +NG+   +  P  GLRQGDPLSPYLFL+C EGLS+L+   ++  L+SG  ++R
Subjt:  ALKLDMSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLAR

Query:  ASPAVTHLFFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDR
          P V+HLFFA+DSLLF +A + E G V++ L +YE  SGQ +NF KSV+ FS N ++  +  V  I AV  +    +YLGLP+F+ RN+     +IK+R
Subjt:  ASPAVTHLFFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDR

Query:  IWKQVQGWKGRFFSSGGKEVLLKAVVQAIP
        I K++  W  R+ S GG+EV++K+V+QAIP
Subjt:  IWKQVQGWKGRFFSSGGKEVLLKAVVQAIP

XP_024172304.2 uncharacterized protein LOC112178381 [Rosa chinensis] (e value: 3.0e-181; % identity: 42.13)
Query:  MSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQ---------GRSGGLALLWDSTVSFSLLSYSSNHIDGW
        M++L WN QG+G+P  +  L  LV   +P + F+SETK  T  M+ ++ +L +   F+VDCQ          R+GGL LLW   +  +L ++S NHID  
Subjt:  MSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQ---------GRSGGLALLWDSTVSFSLLSYSSNHIDGW

Query:  V--LWQDSRWRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGE
        +  +   +RWRFTGVYG S  EL+  TW+L+ K+  +   PWLIGGDFN IL   EK+GG  +   ++ AF+  V+ C L D+ FVG   TW  +R GGE
Subjt:  V--LWQDSRWRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGE

Query:  TIYERLDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQGSRGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSP-QRLRSAAERCMQ
         I  RLDR   T SW DLF  S VTHL  S  DH P+ + +   I +  R +R  RF+E WL   +   +V+  W   +    G  P Q +    E+  +
Subjt:  TIYERLDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQGSRGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSP-QRLRSAAERCMQ

Query:  AMLAWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMW
        A+  W   K G+    I     ++        SA   E+ ++ ET+L ++L+ E  YW+QRSR +WL +GD NTR+FH RAS R+K N I GL N+ G+W
Subjt:  AMLAWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMW

Query:  SQDKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFP-CVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV----------------------
          +  ++  +V DYF  LF++S+P   +L     ++FP  V   MNSEL+R F  EE+L AL Q HP KAPGPDG S                       
Subjt:  SQDKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFP-CVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV----------------------

Query:  -------AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKL
                +N T + L+PKVK    + + +PISLCNV YKL SKVL NR+K +L  II+P QSAF+PGR + DN +L FE  H L+R +GG+ G+ ALKL
Subjt:  -------AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKL

Query:  DMSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPA
        DMSKAYDRVEW F++ +M  MGF + W+  I+ CV++V++SF LNGE  GH+ PTRGLRQGD +SPYLFLLCAEGLS ++   E +  + G  +A  +P+
Subjt:  DMSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPA

Query:  VTHLFFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQ
        + HLFFA+DS +F +A   E   V+E+L  YE ASGQ VNF KS I+FS N    C++ ++++  V     H +YLGLP+ +  ++    +FI ++   +
Subjt:  VTHLFFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQ

Query:  VQGWKGRFFSSGGKEVLLKAVVQAIP
        ++ WK +  S  GKEV++K+VVQ++P
Subjt:  VQGWKGRFFSSGGKEVLLKAVVQAIP

XP_030497600.1 uncharacterized protein LOC115713257 [Cannabis sativa] (e value: 1.6e-174; % identity: 41.94)
Query:  MSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQDS-RW
        M LL WNA+GLGS  A RRL  L+  + P + F+ ETK     +   K  L F  G  V   G  GGL LLW   V  +LLS ++N+ D ++L+ D  RW
Subjt:  MSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQDS-RW

Query:  RFTGVYGFSSAELQCQTWSLLGKLRG-SPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYERLDRC
         F+ +YGF  A  +  TW L+ +L   SP  PWL+ GD N I  +E K+GG  +   ++ AF+  +D C L ++  VGD  TW   R    ++ ERLD C
Subjt:  RFTGVYGFSSAELQCQTWSLLGKLRG-SPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYERLDRC

Query:  FCTSSWQDLFSNSVVTHLDYSGFDHRPL----DLTLCPPIVQGSRGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSP-QRLRSAAERCMQAMLAW
        F    W+D F    ++HLDY G DHR L    D  L PP  Q  R  R  RF++ WL+  +  E++  SW    L S+   P  RL S+   C   +  W
Subjt:  FCTSSWQDLFSNSVVTHLDYSGFDHRPL----DLTLCPPIVQGSRGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSP-QRLRSAAERCMQAMLAW

Query:  GKAKLGNYPKRIREANQRVQSAIAGLRSAGSRE-----DLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMW
           K G   + I+ A    Q  + GL ++ S +      +  AE+ L+E+L  EE YW+QRSR  WL  GDRNT++FH +AS R   NRI  L +D G  
Subjt:  GKAKLGNYPKRIREANQRVQSAIAGLRSAGSRE-----DLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMW

Query:  SQDKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV-----------------------
           K+ + ++V DYFQ LFT+SN +   L   L  +   +  + N  L + F++ EV  ALK     K+PG DG+S                        
Subjt:  SQDKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV-----------------------

Query:  ------AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLD
              + N+T+I L+PK+K P+ + +F+PISLCNV+YK+ISK+L  R K +L+ +IS  QSAF+  R + DN ++ FE +H L+  + G+ G+AALKLD
Subjt:  ------AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLD

Query:  MSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAV
        MSKA+DRVEWSFL  +M  MGF    + LI+ C+ + +FSF +NGE  G V P RGLRQGDPLSPYLFL+C+EGLS L++  E    + G  ++R SP++
Subjt:  MSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAV

Query:  THLFFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQV
        THL FA+DSLLF +A     G ++  L +Y RASGQ +N  KSV++FSPNT E  K    QIL +    CH  YLGLP++  R++      IK+RIWK +
Subjt:  THLFFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQV

Query:  QGWKGRFFSSGGKEVLLKAVVQAIP
          W  + FS GGKEVLLKAVVQAIP
Subjt:  QGWKGRFFSSGGKEVLLKAVVQAIP

XP_030502555.1 uncharacterized protein LOC115717715 [Cannabis sativa] (e value: 7.9e-174; % identity: 41.46)
Query:  MSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQDS-RW
        M LL WNA+GLGS  A RRL  LV+ + P + F+ ETK     +   K  L F  G  V   G  GGL LLW   V  +LLS ++NH D ++L+ D  RW
Subjt:  MSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQDS-RW

Query:  RFTGVYGFSSAELQCQTWSLLGKLRG-SPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYERLDRC
          + +YGF  A  +  TW L+ +L   SP  PWL+ GD N I  +E K+GG  +   ++ AF+  +D C L ++   GD  TW   R     + ERLD C
Subjt:  RFTGVYGFSSAELQCQTWSLLGKLRG-SPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYERLDRC

Query:  FCTSSWQDLFSNSVVTHLDYSGFDHRPL----DLTLCPPIVQGSRGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQAMLAWG
        F    W+D  +   +THLDY G DHR L    D     P VQ  R  R  RF+++WL+  +  E++  SW      S   S  +L S+   C   +  W 
Subjt:  FCTSSWQDLFSNSVVTHLDYSGFDHRPL----DLTLCPPIVQGSRGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQAMLAWG

Query:  KAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLV-QAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKD
          K G   + I+ A Q V      + S    +  +  AET L+++L  EE YW+QRSR  WL  GDRNT++FH +AS R   NRI  L +D G     K+
Subjt:  KAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLV-QAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKD

Query:  EVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV----------------------------
         + Q+V DYFQ LFT+SN +   L   L  +   +  + N  L + F++ +V   LK     K+PG DG+S                             
Subjt:  EVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV----------------------------

Query:  -AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAY
         A N+T+I L+PK+K P+ + +F+PISLCNV+YK+ISK+L  R K +L+ +IS  QSAF+  R + DN ++ FE +H L+  + G+ G+ A KLDMSKA+
Subjt:  -AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAY

Query:  DRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFF
        DRVEWSF+  +M  MGF   W+ LI+ C+ + +FSF +NGE +G V P RGLRQGDPLSPYLFL+C+EGLS L++  E    + G  ++R SP++THL F
Subjt:  DRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFF

Query:  ANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQVQGWKG
        A+DSLLF RA     G ++  L +Y RASGQ +N  KSV++FSPNT +  K    QIL +    CH  YLGLP++  R++      IK+RIWK +  W  
Subjt:  ANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQVQGWKG

Query:  RFFSSGGKEVLLKAVVQAIP
        + FS GGKEVLLKAVVQAIP
Subjt:  RFFSSGGKEVLLKAVVQAIP

XP_030505068.1 uncharacterized protein LOC115720043 [Cannabis sativa] (e value: 3.2e-175; % identity: 40.71)
Query:  MSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQDS-RW
        M ++ WNA+GL +PRA+R+L  L+    P + FI E+K +   +   +  L+F  G  V   G SGGL  LW S V+ ++L+Y  N +D ++   D   W
Subjt:  MSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQDS-RW

Query:  RFTGVYGFSSAELQCQTWSLLGKLRG-SPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYERLDRC
         F+G YG      +  TW LL KL+  +P  PWL+ GDFN  + H +K GG  KP  ++ AF+ A+D CGL ++ F G+  TW N+   G  + ERLD  
Subjt:  RFTGVYGFSSAELQCQTWSLLGKLRG-SPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYERLDRC

Query:  FCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQGSRGQRIR-RFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQ-RLRSAAERCMQAMLAWGKA
        F  S+W D FS+ V++HLD+  FD R L  T+   +       + R RF+++W     LQE    S +  +L  TG  P  ++    + C   + AW ++
Subjt:  FCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQGSRGQRIR-RFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQ-RLRSAAERCMQAMLAWGKA

Query:  KLGNYPKRIREANQRVQSAIAGL-RSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKDEV
        K G+ PK+IR + ++V +    L  S    +DL Q+E  L+++L +EE YW+QRSR  WL  GD NTR+FH++A+ RR TN I  L +D G+       +
Subjt:  KLGNYPKRIREANQRVQSAIAGL-RSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKDEV

Query:  IQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV-----------------------------A
          ++  YFQN+FT+   +   +   +  +   + ++MN++L +PFS+ EV  AL       +PG DG+SV                             +
Subjt:  IQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV-----------------------------A

Query:  INETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAYDR
         N+T+I L+PKVK P  +S+ +PISLCNV YKL+SK +V R+K  L+ +IS +QSAF+  R + DN ++ FE +H L+    G  G+AA+KLDMSKA+DR
Subjt:  INETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAYDR

Query:  VEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFAN
        VEW FLQ ++  MGF    VDLI+RC+SSVT+SF++NG+  GHV+PTRG+ QGDPLSPYLF++CAEGL  L++  E R  + G +++R +PAV+HLFFA+
Subjt:  VEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFAN

Query:  DSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQVQGWKGRF
        DSL+  RA       ++  L  Y RASGQ +N  KSV++FSPN     +  V QIL +  + CH +YLGLPS+  R++      IK+++W  +  W+ + 
Subjt:  DSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQVQGWKGRF

Query:  FSSGGKEVLLKAVVQAIP
        FS GGKEVLLKAV QAIP
Subjt:  FSSGGKEVLLKAVVQAIP

TrEMBL top hits (e value, % identity, alignment)
A0A2N9EWI8 Uncharacterized protein (e value: 3.0e-187; % identity: 42.77)
Query:  PAPPRIMSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLW
        P P R +SL   N +GLG+P  +  L   V+ + P + F+ ET+   +++E ++ RL     F V+  G  GGLALLWD +V   + SYS +HID WV  
Subjt:  PAPPRIMSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLW

Query:  QDSR-WRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYE
             WRFTG YG      +  +W LL +L+G  D PWL+ GDFN I+  +EK G   +  A+++ F++A++ C L+D+GF G   TW N R   E + E
Subjt:  QDSR-WRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYE

Query:  RLDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQG-SRGQRIRR--FDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQAM
        RLDR   T  W DLF  S + H+ ++  DH  L L     ++QG  R  R RR  F+  WLR    +E +  +W        G +  RL    ++C   +
Subjt:  RLDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQG-SRGQRIRR--FDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQAM

Query:  LAWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQ
        ++W KA L   PK I +  +R+     G  S  +  +      +L  +L +EE+YW+QRSR  WL EGDRNT +FH  AS R+KTN IVG+ + Q +W +
Subjt:  LAWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQ

Query:  DKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV-------------------------
        ++ E+  +V+ YF  ++T+++P    +D  +++V   V SDMN ELL+PF+ EEV IAL Q  P KAPGPDG++                          
Subjt:  DKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV-------------------------

Query:  ----AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMS
            ++N T I L+PKVKSP  ++ F+PISLCNV YK+ISKVLVNRMK IL  ++S +QSAF+PGR + DN ++ FE IH L+    G     A KLDMS
Subjt:  ----AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMS

Query:  KAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTH
        KAY+RVEW +L++IML +GF   WV LI+ CV+SV++S  +NG+  G+V P+RGLRQGDPLSPYLFL+CAEGLS+L+R AE    I G  ++R  P V+H
Subjt:  KAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTH

Query:  LFFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQVQG
        LFFA+DSL+F RA  T+   ++++L LYERASGQ +N  K+ + FS N S   K  + ++   S      +YLGLP  + R++      IKDRIW+++QG
Subjt:  LFFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQVQG

Query:  WKGRFFSSGGKEVLLKAVVQAIP
        WK +F S  GKE+L+KAV+QAIP
Subjt:  WKGRFFSSGGKEVLLKAVVQAIP

A0A2N9GJ35 Uncharacterized protein (e value: 1.7e-182; % identity: 41.48)
Query:  APPRIMSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQ
        APP  M  L  N +GLG+P+ +  L  LV+ + P + F+ ET+     +E ++ RL       V+  G+ GGLALLWDS+V  ++ SYS +HIDG V+  
Subjt:  APPRIMSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQ

Query:  DS-RWRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYER
        D  RWR TG YG+  A L+ ++WSLL  LR   D PW+I GDFN I   EEK G  D+   +++AF++A+  C L DMGF G   TW N R  G+ +  R
Subjt:  DS-RWRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYER

Query:  LDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDL---TLCPPIVQGSRGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQAML
        LDR    ++W  LF ++ + HL  +  DH  L L   T  P      R +R+ RF++ WL+    +E+++++W    +G+   +  ++    ++C   ++
Subjt:  LDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDL---TLCPPIVQGSRGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQAML

Query:  AWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQD
         W ++ +   PK I    +++Q      +       +   +  L  +  + E+ W+QRSR +WL EGDRNT++FH  AS R+K N I+GL + Q  W  +
Subjt:  AWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQD

Query:  KDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV--------------------------
          EV Q+  DYF +LF SSNP    +D  L +V   V   MN+ L+RPF+ EE+  AL Q HP K+PGPDG+S                           
Subjt:  KDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV--------------------------

Query:  ---AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSK
           +IN T +VL+PKV +P  +++F+PISLCNV YK++SKVLVNRMK IL ++IS +QSAF+PGR + DN I+ FE IH L+    G +   A+KLDMSK
Subjt:  ---AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSK

Query:  AYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHL
        AYDRVEW +LQ IM+ +GF   WV L++ CV + T+S  +NGE  G++TP RGLRQGDPLSPYLFLLC EGLS+++R AE  +L+ G  + R  P V+HL
Subjt:  AYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHL

Query:  FFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQVQGW
        FFA+DS++F RA   +   ++ LL  Y  ASGQ VN  K+ + FSPNT +  +  +      S      +YLGLP  + R +      IKDR+W+++QGW
Subjt:  FFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQVQGW

Query:  KGRFFSSGGKEVLLKAVVQAIP
        K +  S  G+EVL+KAV+QAIP
Subjt:  KGRFFSSGGKEVLLKAVVQAIP

A0A2N9I946 Uncharacterized protein (e value: 2.1e-180; % identity: 41.85)
Query:  APPRIMSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQ
        APP  M     N +GLG+P  +R L   V+ + P + F+ ET+     +E ++ RL     F V+  G  GGLAL+W S+V+  + S+S+NHID  V+  
Subjt:  APPRIMSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQ

Query:  DS-RWRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYER
        D  +WR TG YG     L+  +W+LL +L    + PWL+ GDFN +L  EE+ G  D+ +++++AF+ A+  C L D+G+ G   +W NRR  G  +  R
Subjt:  DS-RWRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYER

Query:  LDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQGSRGQRIR--RFDEIWLRYPDLQELVRLSW-VPTSLGSTGMSPQRLRSAAERCMQAML
        LDRC   + W  LF +  V H+ ++  DH  L + L PP V  S G R +  RF+ +W+R    ++ ++ +W  P S     +  Q++++    C   +L
Subjt:  LDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQGSRGQRIR--RFDEIWLRYPDLQELVRLSW-VPTSLGSTGMSPQRLRSAAERCMQAML

Query:  AWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQD
         W ++++   P+ I +   R+    +      S  ++     ++  ++ +EE++W+QRSR  WL EGDRNT+++H  AS R+KTN I+GL +DQG+W  +
Subjt:  AWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQD

Query:  KDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV--------------------------
           +  +  +YF  LF SSNP+   +   +  V   V   MN  LLR FS+EE+  AL Q  P KAPGPDG++                           
Subjt:  KDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV--------------------------

Query:  ---AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSK
           +IN T IVL+PKVK+P  +S+F+PISLCNV YK+ SKVLVNRMK IL  IIS +QSAF+PGR + DN I+ FE +H L+    GA+   A KLDMSK
Subjt:  ---AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSK

Query:  AYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHL
        AYDRVEW FLQ I+L +GF R WVDLI+ CV+S ++S  +NG   G++ P+RGLRQGDPLSPYLFLLCAEGLS+LIR AE    I G  + R  P ++HL
Subjt:  AYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHL

Query:  FFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQVQGW
        FFA+DS++F RA   +GG +  +L LYERASGQ +N  K+ I FS NT    +  +  +   S      +YLGLP  + R++      IKDRIWK++QGW
Subjt:  FFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQVQGW

Query:  KGRFFSSGGKEVLLKAVVQAIP
        K +  S  G+E+L+KAVVQAIP
Subjt:  KGRFFSSGGKEVLLKAVVQAIP

A0A2N9IMU5 Uncharacterized protein (e value: 3.5e-180; % identity: 41.95)
Query:  APPRIMSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQ
        APP  M  L  N +GLG+P  +R L  LV+ + P + F+ ET+     +E  + RL     F VD  G  GGLALLW S+VS  + SYS+ HID  V+  
Subjt:  APPRIMSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHIDGWVLWQ

Query:  DS-RWRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYER
        D  +WR TG YG     L+  +W+LL  L  + + PWL+ GDFN I+  EE+ G  D+ + +++AF+DA+  C L D+G+ G   +W NRR  G  +  R
Subjt:  DS-RWRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYER

Query:  LDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQGS-RGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQAMLAW
        LDRC   + W  LF +  V H+ ++  DH  L + L PP        +++ RFD  W+R    +E ++++W   S   +G    R+    + C   +L W
Subjt:  LDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQGS-RGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQAMLAW

Query:  GKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKD
         + +    P+ I     R+    +   +  +  ++    +++  +  +EE++W+QRSR  WL EGDRNT++FH  A+ R+K N I GL +D G+W  +  
Subjt:  GKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKD

Query:  EVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV----------------------------
         +  +  +YF +LF SSNPN   +   +  V   V   MN  LL+  SSEE+  AL Q  P KAPGPDG++                             
Subjt:  EVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV----------------------------

Query:  -AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAY
         +IN T IVL+PKVK+P  +S+F+PISLCNV YK+ SKVLVNRMK IL +IIS +QSAF+PGR + DN I+ FE +H L+    GA+   A KLDMSKAY
Subjt:  -AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAY

Query:  DRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFF
        DRVEW+FLQ I+L  GF R WVDLI+ CVS+ +++  +NG   G++ P+RGLRQGDPLSPYLFLLCAEGLS+LIR AE    I G  + R  P ++HLFF
Subjt:  DRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFF

Query:  ANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQVQGWKG
        A+DS++F RA   +G V+  +L LYERASGQ +N  K+   FS NT    +  +  +   S      +YLGLP  + R++      IKDRIWK++QGWK 
Subjt:  ANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQVQGWKG

Query:  RFFSSGGKEVLLKAVVQAIP
           S  G+EVL+KAVVQAIP
Subjt:  RFFSSGGKEVLLKAVVQAIP

A0A7N2LIH6 Uncharacterized protein (e value: 4.6e-180; % identity: 41.04)
Query:  GGGWIPAPPRIMSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHID
        GGG   APP  M++L WN +GLG+  A+R L   V+ K P++ F+ ETKA+  +M+  + +L F  G  V   GRSGGLALLW         S S +HID
Subjt:  GGGWIPAPPRIMSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDSTVSFSLLSYSSNHID

Query:  GWVLWQDS--RWRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPG
          V    S   WR TG YG      +  +W LL  L    + PWL+ GDFN I+  +EK G +D+  A++ AF++ +  CGL+D+GFVG   TWCN R G
Subjt:  GWVLWQDS--RWRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPG

Query:  GETIYERLDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQGSRGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCM
         +    RLDR     +W  +F  + V H+  S  DH  L L L   +    RG++   F+E+W R  + +E+V L+W P    S     +RL    ERC 
Subjt:  GETIYERLDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQGSRGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCM

Query:  QAMLAWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGM
        + +  W +   GN  K I++   R+Q   +      + E++   + ++ E+   EE+ WKQRSR  WL  GD+N+++FH  AS RR+ NRI GL +D G+
Subjt:  QAMLAWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGM

Query:  WSQDKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSVA---------------------
        W +D++   +++ DYF+++++S+ P     D++L+ +   V  +MN EL + F + EV  AL+Q HP KAPGPDG+S                       
Subjt:  WSQDKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSVA---------------------

Query:  --------INETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKL
                IN+T I L+PK K+P++++EF+PISLCNV YK+ISKVL NR+K +L+ +I   QSAF+PGR + DN I+ FE +H + +   G  G  A+KL
Subjt:  --------INETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKL

Query:  DMSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPA
        DMSKAYDRVEW++L+ +M  MGF   W+ LI+ CV+SV+FS  +NGE  G  TP+RGLRQGDP+SPYLFLLC EGLS++I+  E   LI G   AR +P 
Subjt:  DMSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPA

Query:  VTHLFFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQ
        ++HLFFA+DS++F RA   E   V ++L +YE  SGQ +N  K+ + FS NT ++ K++   I        H +YLGLP  + R +      IKD++ ++
Subjt:  VTHLFFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGLPSFMPRNRFGTLKFIKDRIWKQ

Query:  VQGWKGRFFSSGGKEVLLKAVVQAIP
        + GWKG+  S+ G+EVL+KAV QA P
Subjt:  VQGWKGRFFSSGGKEVLLKAVVQAIP

SwissProt top hits (e value, % identity, alignment)
O00370 LINE-1 retrotransposable element ORF2 protein (e value: 2.0e-31; % identity: 22.7)
Query:  KAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKDE
        ++K+     +++E  ++ Q+       A  R+++ +   +L+E+  ++ L     SR  +    ++  R   R    +R+ N+I  ++ND+G  + D  E
Subjt:  KAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKDE

Query:  VIQMVNDYFQNLFTSSNPNFGDLDLALQD-VFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV----------------------------
        +   + +Y+++L+ +   N  ++D  L     P ++ +    L RP +  E++  +      K+PGPDG +                             
Subjt:  VIQMVNDYFQNLFTSSNPNFGDLDLALQD-VFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV----------------------------

Query:  -AINETMIVLVPKV-KSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKA
         +  E  I+L+PK  +   +   F+PISL N+  K+++K+L NR++  + K+I  +Q  FIPG     N       I  + R          + +D  KA
Subjt:  -AINETMIVLVPKV-KSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKA

Query:  YDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLF
        +D+++  F+ + +  +G    ++ +I       T +  LNG+++       G RQG PLSP LF +  E L+  IR  +    I G +L +    ++   
Subjt:  YDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLF

Query:  FANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGL------PSFMPRNRFGTLKFIKD--RI
        FA+D +++          + +L+  + + SG  +N  KS  AF  N +   +  +   L  +     ++YLG+            N    LK IK+    
Subjt:  FANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGL------PSFMPRNRFGTLKFIKD--RI

Query:  WKQVQ-GWKGR
        WK +   W GR
Subjt:  WKQVQ-GWKGR

P08548 LINE-1 reverse transcriptase homolog (e value: 4.7e-28; % identity: 24.38)
Query:  REDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDV
        R+++ +   +L E+ ++  +    +S+  +  + ++  +        +R  + I  + N     + D  E+ +++N+Y++ L++    N  ++D  L+  
Subjt:  REDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDV

Query:  -FPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLS-------------VAIN----------------ETMIVLVPKV-KSPRRVSEFKPISLC
          P +       L RP SS E+   ++     K+PGPDG +             + +N                E  I L+PK  K P R   ++PISL 
Subjt:  -FPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLS-------------VAIN----------------ETMIVLVPKV-KSPRRVSEFKPISLC

Query:  NVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCV
        N+  K+++K+L NR++  + KII  +Q  FIPG     N       I  + +          L +D  KA+D ++  F+ + +  +G    ++ LI    
Subjt:  NVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCV

Query:  SSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFANDSLLFFRARATEGGVVRELLLLYERAS
        S  T +  LNG ++       G RQG PLSP LF +  E L+  IR  E +A I G  +   S  +    FA+D +++          + E++  Y   S
Subjt:  SSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFANDSLLFFRARATEGGVVRELLLLYERAS

Query:  GQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGL
        G  +N  KSV AF    +   ++ V   +  +  P  ++YLG+
Subjt:  GQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYLGL

P11369 LINE-1 retrotransposable element ORF2 protein (e value: 1.9e-29; % identity: 22.06)
Query:  YPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQAMLAWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRE
        YP+L + ++ +++   L +   S ++  +A    +   L   + K  N PKR R                  R+++++   ++ +V     +    ++R 
Subjt:  YPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQAMLAWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRE

Query:  LWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDV-FPCVDSDMNSELLRPFSSEEVLIALKQ
         +  + ++  +   R     R    I  + N++G  + D +E+   +  +++ L+++   N  ++D  L     P ++ D    L  P S +E+   +  
Subjt:  LWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKDEVIQMVNDYFQNLFTSSNPNFGDLDLALQDV-FPCVDSDMNSELLRPFSSEEVLIALKQ

Query:  THPHKAPGPDGLSVAINETM-----------------------------IVLVPK-VKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQS
            K+PGPDG S    +T                              I L+PK  K P ++  F+PISL N+  K+++K+L NR++  +  II P+Q 
Subjt:  THPHKAPGPDGLSVAINETM-----------------------------IVLVPK-VKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQS

Query:  AFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDP
         FIPG     N       IH + +          + LD  KA+D+++  F+ +++   G    ++++I    S    +  +NGE++  +    G RQG P
Subjt:  AFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDP

Query:  LSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQI
        LSPYLF +  E L+  IR  +    I G ++ +    ++ L  A+D +++          +  L+  +    G  +N  KS +AF    ++  ++ + + 
Subjt:  LSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQI

Query:  LAVSCRPCHLQYLG------LPSFMPRNRFGTLKFIKD--RIWKQVQ-GWKGR
           S    +++YLG      +     +N     K IK+  R WK +   W GR
Subjt:  LAVSCRPCHLQYLG------LPSFMPRNRFGTLKFIKD--RIWKQVQ-GWKGR

P14381 Transposon TX1 uncharacterized 149 kDa protein (e value: 1.5e-29; %identity: 23.4)
Query:  DTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWC---NRRPGGETIYERLDRCFCTSSWQDLFSNSVVTHLDYSGFDHR
        D   +IGGDFN  L   +++  + +  +E S  ++ +    LVD+    +P T      R   G     R+DR + +S       +S +    +S  +  
Subjt:  DTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWC---NRRPGGETIYERLDRCFCTSSWQDLFSNSVVTHLDYSGFDHR

Query:  PLDLTLCPPIVQGSRGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQAMLAW--GKAKL----GNYPKRIREANQRVQSAIAG
         L +++ P + + +       F+   L      + VR +W            +  R+  +        W  GK  L      Y K +         A+ G
Subjt:  PLDLTLCPPIVQGSRGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQAMLAW--GKAKL----GNYPKRIREANQRVQSAIAG

Query:  ------LRSAGSREDLVQAE-TQLEEVLHEEELYWKQ----RSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKDEVIQMVNDYFQNLF
               R +GS +  +Q E  + +E L   E    +    RSR   L + DR +R+F+     +    +I  L  + G   +D + +      ++QNLF
Subjt:  ------LRSAGSREDLVQAE-TQLEEVLHEEELYWKQ----RSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKDEVIQMVNDYFQNLF

Query:  TSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV-----------------------------AINETMIVLVPKV
         S +P   D    L D  P V       L  P + +E+  AL+    +K+PG DGL++                             +    ++ L+PK 
Subjt:  TSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSV-----------------------------AINETMIVLVPKV

Query:  KSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAYDRVEWSFLQQIMLH
           R +  ++P+SL +  YK+++K +  R+K +L ++I P+QS  +PGR + DN  L  + +H  +R        A L LD  KA+DRV+  +L   +  
Subjt:  KSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAYDRVEWSFLQQIMLH

Query:  MGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFANDSLLFFRARATE
          F   +V  +    +S      +N      +   RG+RQG PLS  L+ L  E    L+     R  ++G  L      V    +A+D +L  +    +
Subjt:  MGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFANDSLLFFRARATE

Query:  GGVVRELLLLYERASGQTVNFAKS
            +E   +Y  AS   +N++KS
Subjt:  GGVVRELLLLYERASGQTVNFAKS

P92555 Uncharacterized mitochondrial protein AtMg01250 (e value: 2.2e-14; %identity: 55.07)
Query:  FNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFANDS
        F +NG   G VTP+RGLRQGDPLSPYLF+LC E LS L R A+ +  + G R++  SP + HL FA+D+
Subjt:  FNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFANDS
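The %identity reported for a hit can be cross-checked against the alignment itself. The sketch below is a minimal illustration, not the database's own method: it assumes the middle line of each block follows the usual BLAST convention (a letter marks an identical residue, `+` a conservative substitution, a space a mismatch) and that the Query line spans every alignment column. The function name `percent_identity` is hypothetical.

```python
def percent_identity(query_line: str, midline: str) -> float:
    """Percent identity of one BLAST-style alignment block.

    Assumes letters in the midline mark identical residues and that
    query_line covers every alignment column (gaps written as '-').
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    # Use the query line for the length: trailing spaces of the
    # midline are easily lost when a page is copied as text.
    return round(100.0 * identities / len(query_line), 2)


# Query and midline of the P92555 alignment shown above.
query = "FNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFANDS"
midline = "F +NG   G VTP+RGLRQGDPLSPYLF+LC E LS L R A+ +  + G R++  SP + HL FA+D+"

print(percent_identity(query, midline))  # 38 identities over 69 columns
```

For a hit split over several blocks, the identities and lengths would be summed across the blocks before dividing.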

Arabidopsis top hits (e value, %identity, alignment)
AT1G43760.1 DNAse I-like superfamily protein (e value: 1.4e-19; %identity: 25.45)
Query:  PVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYERLDRCFCTSSWQDLFSNSVVTHLDYSGF-DHRPLDLTLCPPIVQG--SRGQRIRRFDEI
        P+  L  FQ+ +    LVD+   G   TW N +     I  +LDR      W   F +++    + SG  DH P     C  I++    R ++  R+   
Subjt:  PVAELSAFQDAVDSCGLVDMGFVGDPLTWCNRRPGGETIYERLDRCFCTSSWQDLFSNSVVTHLDYSGF-DHRPLDLTLCPPIVQG--SRGQRIRRFDEI

Query:  WLRYPDLQELVRLSW-VPTSLGSTGMSPQRLRSAAERCMQAMLAWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLH----EEEL
           +P     + ++W     +GS   S      AA++C + +   G    GN   + +EA   ++S  + L +  S + L + E    +  +      E 
Subjt:  WLRYPDLQELVRLSW-VPTSLGSTGMSPQRLRSAAERCMQAMLAWGKAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLH----EEEL

Query:  YWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKDEVIQMVNDYFQNLFTS-SNPNFGDLDLALQDVFP--CVDSDMNSELLRPFS
        +++Q+SR  WL +GD NTR+FH+     +  N I  L  D  +  ++  +V +M+  Y+ +L  S S+    D    ++D+ P  C D+ + S L    S
Subjt:  YWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKDEVIQMVNDYFQNLFTS-SNPNFGDLDLALQDVFP--CVDSDMNSELLRPFS

Query:  SEEVLIALKQTHPHKAPGPDGLSV-----------------------------AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLIS
         +E+  A+     +KAPGPD  +                                N T I L+PKV    ++S F+P+S C V YK+I+
Subjt:  SEEVLIALKQTHPHKAPGPDGLSV-----------------------------AINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLIS

AT4G20520.1 RNA binding;RNA-directed DNA polymerases (e value: 2.2e-17; %identity: 32.89)
Query:  LVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLN
        +V R+K ++  +I P Q++FIPGR   DN +   E +H ++R   G  GW  LKLD+ KAYDR+ W +L+  ++  GF   W+  I R     TF     
Subjt:  LVNRMKGILNKIISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLN

Query:  GERVGHVTPTR---------GLRQGDPLSPYL--FLLCAEGLSSLIRGA
           VG    ++         G R  D  +P+    + CAE L  + RG+
Subjt:  GERVGHVTPTR---------GLRQGDPLSPYL--FLLCAEGLSSLIRGA

ATMG01250.1 RNA-directed DNA polymerase (reverse transcriptase) (e value: 1.6e-15; %identity: 55.07)
Query:  FNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFANDS
        F +NG   G VTP+RGLRQGDPLSPYLF+LC E LS L R A+ +  + G R++  SP + HL FA+D+
Subjt:  FNLNGERVGHVTPTRGLRQGDPLSPYLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFANDS


Sequences
CDS sequence
ATGAGATCTCTCTCTCAGCGGTCTCTCTCCCTCTCGTGTGTAGTCGTGTTCCGTCCAGCCGTCGCCGCCAGCTGGTTCGTCGTCCAGCTCGAGCGCTGCTTTCCATCAGG
CAGCGTCGCCGTGATCTCCCTCTGTAGTTTTCGGCCATGGCCCGTCTCTGTAGATCTCGCATGCCAGCAGCTTGAATCCCCTCTTCCGAGCGATTTCGCCTCTGTCCAGC
GATGTTTTGGCTTCGTTTTTGATCGTCGATTGGGTTCGTATCATTGCGAGCTCGAATACCCACTGCCCAAGGAGCATTCTAACATGTTGTTCGAGGAGCTTTCTTGGACT
GCTTGGATATTTGGTTGCTTTGGAAGCACTTTTGGTGGTGGACGGTGGGGTGGATGTGTCTGTGGACCAGATCATGGCGGAGGCTGGATACCAGCCCCGCCTAGGATTAT
GAGTCTCCTACTGTGGAACGCCCAAGGCTTGGGGTCCCCTCGGGCGCTCCGACGCTTGGCCAAGTTGGTACAGGCGAAATGGCCCTTGATGTTCTTCATCTCTGAAACAA
AGGCTACTACATTCAGGATGGAGGTAGTTAAAAGAAGGTTAGAGTTTGATTGTGGATTTTCGGTTGATTGTCAGGGCAGAAGTGGGGGCTTAGCTCTCCTTTGGGACTCG
ACAGTGTCATTTAGTTTGCTCTCGTACTCCAGTAATCATATTGATGGGTGGGTATTGTGGCAAGATAGTAGGTGGCGATTTACAGGGGTCTATGGTTTCTCGTCAGCTGA
GTTACAGTGTCAGACGTGGTCTCTTCTTGGTAAGTTGAGGGGAAGTCCTGATACCCCGTGGCTTATTGGGGGGGACTTCAATGCCATATTATGCCATGAGGAGAAGGATG
GGGGAAGGGATAAGCCAGTGGCTGAACTATCTGCATTTCAGGATGCTGTTGATTCTTGTGGCCTAGTTGATATGGGGTTTGTAGGGGACCCTTTAACGTGGTGCAATCGT
AGGCCAGGGGGTGAAACCATTTACGAGCGGTTGGATCGATGTTTCTGTACGTCCTCCTGGCAGGATTTGTTCTCCAACTCAGTGGTAACTCACCTGGACTATAGTGGATT
TGATCATCGTCCGCTGGATCTGACGCTATGCCCGCCTATAGTTCAGGGTTCCAGGGGGCAGCGTATTAGACGGTTTGATGAGATTTGGCTCCGGTATCCGGATCTTCAGG
AGCTGGTTCGACTGTCATGGGTACCTACATCATTGGGGTCCACTGGTATGAGTCCTCAGAGGCTTAGGTCAGCGGCTGAGAGGTGTATGCAAGCTATGTTGGCTTGGGGG
AAAGCTAAACTGGGGAACTATCCCAAGCGTATAAGGGAGGCGAATCAGAGGGTTCAGTCTGCTATTGCTGGCCTAAGGAGTGCGGGATCTAGGGAGGATTTGGTTCAGGC
AGAGACTCAGTTAGAGGAAGTCCTCCATGAGGAGGAATTATACTGGAAGCAGAGGTCCCGTGAGCTGTGGCTCTTAGAAGGAGACCGTAATACACGGTGGTTTCACCGTA
GGGCATCGTATAGACGGAAGACTAATCGAATAGTAGGTCTTGAAAATGATCAGGGTATGTGGTCTCAGGACAAGGATGAGGTTATACAGATGGTCAATGATTATTTCCAG
AATCTTTTCACTTCATCAAATCCCAACTTTGGGGATCTTGATCTGGCTTTGCAAGATGTATTTCCGTGTGTTGATAGTGATATGAATAGTGAGCTTCTTAGACCTTTTTC
TTCAGAGGAGGTTCTTATAGCCTTAAAGCAAACACACCCTCATAAAGCTCCGGGTCCGGATGGGCTGTCAGTGGCCATCAATGAAACAATGATAGTTCTTGTCCCAAAGG
TGAAGTCTCCCCGTCGAGTGTCAGAGTTTAAGCCTATCTCTCTATGCAATGTTAGTTACAAACTGATCTCGAAAGTGTTGGTGAACAGAATGAAGGGCATCTTGAATAAG
ATAATTTCCCCAAACCAGAGTGCTTTTATACCAGGGAGATGTGTTGTGGATAATGGCATCTTGGGTTTTGAATGCATACATGAGCTGCAGAGGCACTCAGGAGGGGCTTC
GGGGTGGGCGGCTCTGAAGCTGGATATGAGCAAAGCTTACGACAGAGTCGAATGGTCTTTCCTCCAACAGATCATGCTCCATATGGGTTTTGCTAGAGATTGGGTTGACT
TGATACTGCGTTGTGTTAGCTCTGTGACCTTTTCCTTTAATCTAAATGGGGAAAGGGTAGGCCATGTGACCCCGACCAGAGGTCTCCGTCAGGGGGACCCGTTATCTCCA
TACCTGTTCCTACTATGTGCAGAGGGTTTGTCCAGTTTGATCAGGGGGGCTGAGAGTCGGGCACTCATCTCAGGGTTTCGGTTGGCTCGAGCGAGCCCTGCGGTGACCCA
TCTATTCTTTGCTAACGATAGCTTGTTGTTCTTTCGGGCGAGAGCTACCGAAGGAGGAGTGGTTCGGGAGCTGTTATTGTTATATGAGAGAGCATCTGGCCAGACAGTTA
ATTTTGCAAAGTCAGTTATTGCTTTCAGTCCTAACACTAGCGAGGACTGCAAACAGTATGTGAGTCAGATCCTTGCGGTATCATGTAGGCCATGTCACCTTCAATACTTG
GGACTTCCCTCCTTCATGCCCCGAAATCGTTTTGGCACTTTGAAGTTCATTAAGGACAGGATATGGAAGCAAGTTCAGGGGTGGAAGGGCAGATTCTTCTCTTCGGGGGG
TAAAGAGGTCCTGCTTAAGGCAGTTGTGCAAGCAATCCCATGTTAA
mRNA sequence
ATGAGATCTCTCTCTCAGCGGTCTCTCTCCCTCTCGTGTGTAGTCGTGTTCCGTCCAGCCGTCGCCGCCAGCTGGTTCGTCGTCCAGCTCGAGCGCTGCTTTCCATCAGG
CAGCGTCGCCGTGATCTCCCTCTGTAGTTTTCGGCCATGGCCCGTCTCTGTAGATCTCGCATGCCAGCAGCTTGAATCCCCTCTTCCGAGCGATTTCGCCTCTGTCCAGC
GATGTTTTGGCTTCGTTTTTGATCGTCGATTGGGTTCGTATCATTGCGAGCTCGAATACCCACTGCCCAAGGAGCATTCTAACATGTTGTTCGAGGAGCTTTCTTGGACT
GCTTGGATATTTGGTTGCTTTGGAAGCACTTTTGGTGGTGGACGGTGGGGTGGATGTGTCTGTGGACCAGATCATGGCGGAGGCTGGATACCAGCCCCGCCTAGGATTAT
GAGTCTCCTACTGTGGAACGCCCAAGGCTTGGGGTCCCCTCGGGCGCTCCGACGCTTGGCCAAGTTGGTACAGGCGAAATGGCCCTTGATGTTCTTCATCTCTGAAACAA
AGGCTACTACATTCAGGATGGAGGTAGTTAAAAGAAGGTTAGAGTTTGATTGTGGATTTTCGGTTGATTGTCAGGGCAGAAGTGGGGGCTTAGCTCTCCTTTGGGACTCG
ACAGTGTCATTTAGTTTGCTCTCGTACTCCAGTAATCATATTGATGGGTGGGTATTGTGGCAAGATAGTAGGTGGCGATTTACAGGGGTCTATGGTTTCTCGTCAGCTGA
GTTACAGTGTCAGACGTGGTCTCTTCTTGGTAAGTTGAGGGGAAGTCCTGATACCCCGTGGCTTATTGGGGGGGACTTCAATGCCATATTATGCCATGAGGAGAAGGATG
GGGGAAGGGATAAGCCAGTGGCTGAACTATCTGCATTTCAGGATGCTGTTGATTCTTGTGGCCTAGTTGATATGGGGTTTGTAGGGGACCCTTTAACGTGGTGCAATCGT
AGGCCAGGGGGTGAAACCATTTACGAGCGGTTGGATCGATGTTTCTGTACGTCCTCCTGGCAGGATTTGTTCTCCAACTCAGTGGTAACTCACCTGGACTATAGTGGATT
TGATCATCGTCCGCTGGATCTGACGCTATGCCCGCCTATAGTTCAGGGTTCCAGGGGGCAGCGTATTAGACGGTTTGATGAGATTTGGCTCCGGTATCCGGATCTTCAGG
AGCTGGTTCGACTGTCATGGGTACCTACATCATTGGGGTCCACTGGTATGAGTCCTCAGAGGCTTAGGTCAGCGGCTGAGAGGTGTATGCAAGCTATGTTGGCTTGGGGG
AAAGCTAAACTGGGGAACTATCCCAAGCGTATAAGGGAGGCGAATCAGAGGGTTCAGTCTGCTATTGCTGGCCTAAGGAGTGCGGGATCTAGGGAGGATTTGGTTCAGGC
AGAGACTCAGTTAGAGGAAGTCCTCCATGAGGAGGAATTATACTGGAAGCAGAGGTCCCGTGAGCTGTGGCTCTTAGAAGGAGACCGTAATACACGGTGGTTTCACCGTA
GGGCATCGTATAGACGGAAGACTAATCGAATAGTAGGTCTTGAAAATGATCAGGGTATGTGGTCTCAGGACAAGGATGAGGTTATACAGATGGTCAATGATTATTTCCAG
AATCTTTTCACTTCATCAAATCCCAACTTTGGGGATCTTGATCTGGCTTTGCAAGATGTATTTCCGTGTGTTGATAGTGATATGAATAGTGAGCTTCTTAGACCTTTTTC
TTCAGAGGAGGTTCTTATAGCCTTAAAGCAAACACACCCTCATAAAGCTCCGGGTCCGGATGGGCTGTCAGTGGCCATCAATGAAACAATGATAGTTCTTGTCCCAAAGG
TGAAGTCTCCCCGTCGAGTGTCAGAGTTTAAGCCTATCTCTCTATGCAATGTTAGTTACAAACTGATCTCGAAAGTGTTGGTGAACAGAATGAAGGGCATCTTGAATAAG
ATAATTTCCCCAAACCAGAGTGCTTTTATACCAGGGAGATGTGTTGTGGATAATGGCATCTTGGGTTTTGAATGCATACATGAGCTGCAGAGGCACTCAGGAGGGGCTTC
GGGGTGGGCGGCTCTGAAGCTGGATATGAGCAAAGCTTACGACAGAGTCGAATGGTCTTTCCTCCAACAGATCATGCTCCATATGGGTTTTGCTAGAGATTGGGTTGACT
TGATACTGCGTTGTGTTAGCTCTGTGACCTTTTCCTTTAATCTAAATGGGGAAAGGGTAGGCCATGTGACCCCGACCAGAGGTCTCCGTCAGGGGGACCCGTTATCTCCA
TACCTGTTCCTACTATGTGCAGAGGGTTTGTCCAGTTTGATCAGGGGGGCTGAGAGTCGGGCACTCATCTCAGGGTTTCGGTTGGCTCGAGCGAGCCCTGCGGTGACCCA
TCTATTCTTTGCTAACGATAGCTTGTTGTTCTTTCGGGCGAGAGCTACCGAAGGAGGAGTGGTTCGGGAGCTGTTATTGTTATATGAGAGAGCATCTGGCCAGACAGTTA
ATTTTGCAAAGTCAGTTATTGCTTTCAGTCCTAACACTAGCGAGGACTGCAAACAGTATGTGAGTCAGATCCTTGCGGTATCATGTAGGCCATGTCACCTTCAATACTTG
GGACTTCCCTCCTTCATGCCCCGAAATCGTTTTGGCACTTTGAAGTTCATTAAGGACAGGATATGGAAGCAAGTTCAGGGGTGGAAGGGCAGATTCTTCTCTTCGGGGGG
TAAAGAGGTCCTGCTTAAGGCAGTTGTGCAAGCAATCCCATGTTAA
Protein sequence
MRSLSQRSLSLSCVVVFRPAVAASWFVVQLERCFPSGSVAVISLCSFRPWPVSVDLACQQLESPLPSDFASVQRCFGFVFDRRLGSYHCELEYPLPKEHSNMLFEELSWT
AWIFGCFGSTFGGGRWGGCVCGPDHGGGWIPAPPRIMSLLLWNAQGLGSPRALRRLAKLVQAKWPLMFFISETKATTFRMEVVKRRLEFDCGFSVDCQGRSGGLALLWDS
TVSFSLLSYSSNHIDGWVLWQDSRWRFTGVYGFSSAELQCQTWSLLGKLRGSPDTPWLIGGDFNAILCHEEKDGGRDKPVAELSAFQDAVDSCGLVDMGFVGDPLTWCNR
RPGGETIYERLDRCFCTSSWQDLFSNSVVTHLDYSGFDHRPLDLTLCPPIVQGSRGQRIRRFDEIWLRYPDLQELVRLSWVPTSLGSTGMSPQRLRSAAERCMQAMLAWG
KAKLGNYPKRIREANQRVQSAIAGLRSAGSREDLVQAETQLEEVLHEEELYWKQRSRELWLLEGDRNTRWFHRRASYRRKTNRIVGLENDQGMWSQDKDEVIQMVNDYFQ
NLFTSSNPNFGDLDLALQDVFPCVDSDMNSELLRPFSSEEVLIALKQTHPHKAPGPDGLSVAINETMIVLVPKVKSPRRVSEFKPISLCNVSYKLISKVLVNRMKGILNK
IISPNQSAFIPGRCVVDNGILGFECIHELQRHSGGASGWAALKLDMSKAYDRVEWSFLQQIMLHMGFARDWVDLILRCVSSVTFSFNLNGERVGHVTPTRGLRQGDPLSP
YLFLLCAEGLSSLIRGAESRALISGFRLARASPAVTHLFFANDSLLFFRARATEGGVVRELLLLYERASGQTVNFAKSVIAFSPNTSEDCKQYVSQILAVSCRPCHLQYL
GLPSFMPRNRFGTLKFIKDRIWKQVQGWKGRFFSSGGKEVLLKAVVQAIPC
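
The CDS and protein sequences above can be checked against each other by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code and reading frame 1; the names `CODON_TABLE` and `translate` are illustrative and not part of the database. Only the first and last codons of the CDS are used here to keep the example short.

```python
BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks from the wrapped record
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X for ambiguous codons
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


first_codons = "ATGAGATCTCTCTCTCAGCGGTCT"    # start of the CDS above
last_codons = "GCAGTTGTGCAAGCAATCCCATGTTAA"  # end of the CDS above, including the stop codon

print(translate(first_codons))  # MRSLSQRS, the start of the protein sequence
print(translate(last_codons))   # AVVQAIPC, the end of the protein sequence
```

Passing the complete CDS, with its line breaks, to `translate` would be expected to give the full protein sequence listed above.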