CuGenDBv2

HG10017841 (gene) of Bottle gourd (Hangzhou Gourd) v1 genome

Gene ID: HG10017841
Organism: Lagenaria siceraria cv. Hangzhou Gourd (Bottle gourd (Hangzhou Gourd) v1)
Description: ARM repeat superfamily protein
Genome location: Chr03:24098575..24135588
RNA-Seq Expression: HG10017841
Synteny: HG10017841
Gene Ontology terms: NA
InterPro domains: IPR016024 - Armadillo-type fold
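
The genome location value above follows a `chromosome:start..end` pattern. As a minimal sketch, the string can be split into its parts and a span length derived. The names `Locus` and `parse_locus` are hypothetical, and the length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from dataclasses import dataclass


@dataclass(frozen=True)
class Locus:
    chrom: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive on both ends.
        return self.end - self.start + 1


def parse_locus(text: str) -> Locus:
    """Parse a 'Chrom:start..end' string into a Locus."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised locus string: {text!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start must not exceed end")
    return Locus(chrom, start, end)


locus = parse_locus("Chr03:24098575..24135588")
print(locus.chrom, locus.start, locus.end, locus.length)
```

Under the inclusive-coordinate assumption the reported span is the full gene region on Chr03, including any introns, so it is expected to be much longer than the CDS.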


Homology
GenBank top hits    e value    %identity    Alignment
XP_022967683.1 uncharacterized protein LOC111467138 [Cucurbita maxima]    9.6e-217    77.95
Query:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE
        GSRLQSDRVLSLIP+WS SVQDWKFLIG LIDK+FAEPSNA+LVR LS+INEHLVKATDV+LKRILSYVKGQKEIDE FY K  SQN DIS SVQQSLFE
Subjt:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE

Query:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILP--DAATLMHSWSRVWGF
        RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRA ++G W    +LYSF  + L                + +S  L+  R +      AT++ S   ++  
Subjt:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILP--DAATLMHSWSRVWGF

Query:  GLYLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSK
          Y    AFSKFEFDDVRKLAAELCGRI+PQVLYP+VSL+LEDA  S NIPGIKACLFSMCTSLAVRG+H  SHFD+FEIVKTLEVVLSWPSQ+GDEVSK
Subjt:  GLYLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSK

Query:  SQHGCIDCMALMICAELQAPDSCSASNLEKIDID-KKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDS
        SQHGCIDCMALMICAELQAP++CS SNLEKID+D KKGHAS+KGSILGYVIHQLI+G KELVSTYDLD   NT+DNSTP+S  LCMANVLISACQKLSD 
Subjt:  SQHGCIDCMALMICAELQAPDSCSASNLEKIDID-KKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDS

Query:  RKKLFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPILECISGGLVEARDVL
        RKK FARKVLPRL+ F +V STQVDIRAACI VIFSAVYHLKSAILPYANDI RVS+NALK+G EKERIAGAKLMVSLMSSEDPIL+CISG L+EARDVL
Subjt:  RKKLFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPILECISGGLVEARDVL

Query:  SSVSSLDPSIEVQQICQKMLQCLLSP
        SSVSSLDPSIEVQQICQKMLQCLLSP
Subjt:  SSVSSLDPSIEVQQICQKMLQCLLSP

XP_038882125.1 uncharacterized protein LOC120073376 isoform X1 [Benincasa hispida]    6.6e-218    78.3
Query:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE
        GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDK+FAEPSNAILVRFLSMINEHLVKATDVVLK ILSYVKGQKEID+CF  K ESQ+EDI  SVQ  LFE
Subjt:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE

Query:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVWGFGL
        RLCPLLVIRMLPLEVFNDLSMS MYGQLPNRA +H                                                D   + H          
Subjt:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVWGFGL

Query:  YLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSKSQ
         LL  AFSKFEFDDVRKLAAELCGRI+PQVLYP+V+ +LEDAA+S NIP IKACLFSMCTSL VR +H FSHFD+FEIVKTLEVVLSWPSQNGDEVSKSQ
Subjt:  YLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSKSQ

Query:  HGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDSRKK
        HGCIDCMALMICAELQAPDSCSAS LEKIDIDKKGHASLKGSIL YVIHQ+I GTKELVSTYDLDNNDNTSDNSTPLS  LCM NVLISACQKLSDSRKK
Subjt:  HGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDSRKK

Query:  LFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEK-------ERIAGAKLMVSLMSSEDPILECISGGLVEA
         FARKVLPRLI FVEV STQVDIRAACI VIFSAVYHLKSAILPYANDI  VSLNALKNG EK       ERIAGAKLMVSLMSSEDPILECISGGL+EA
Subjt:  LFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEK-------ERIAGAKLMVSLMSSEDPILECISGGLVEA

Query:  RDVLSSVSSLDPSIEVQQICQKMLQCLLSP
        RDVLSSVSSLDPSIEVQQICQKMLQCLLSP
Subjt:  RDVLSSVSSLDPSIEVQQICQKMLQCLLSP

XP_038882126.1 uncharacterized protein LOC120073376 isoform X2 [Benincasa hispida]    6.6e-218    78.3
Query:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE
        GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDK+FAEPSNAILVRFLSMINEHLVKATDVVLK ILSYVKGQKEID+CF  K ESQ+EDI  SVQ  LFE
Subjt:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE

Query:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVWGFGL
        RLCPLLVIRMLPLEVFNDLSMS MYGQLPNRA +H                                                D   + H          
Subjt:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVWGFGL

Query:  YLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSKSQ
         LL  AFSKFEFDDVRKLAAELCGRI+PQVLYP+V+ +LEDAA+S NIP IKACLFSMCTSL VR +H FSHFD+FEIVKTLEVVLSWPSQNGDEVSKSQ
Subjt:  YLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSKSQ

Query:  HGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDSRKK
        HGCIDCMALMICAELQAPDSCSAS LEKIDIDKKGHASLKGSIL YVIHQ+I GTKELVSTYDLDNNDNTSDNSTPLS  LCM NVLISACQKLSDSRKK
Subjt:  HGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDSRKK

Query:  LFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEK-------ERIAGAKLMVSLMSSEDPILECISGGLVEA
         FARKVLPRLI FVEV STQVDIRAACI VIFSAVYHLKSAILPYANDI  VSLNALKNG EK       ERIAGAKLMVSLMSSEDPILECISGGL+EA
Subjt:  LFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEK-------ERIAGAKLMVSLMSSEDPILECISGGLVEA

Query:  RDVLSSVSSLDPSIEVQQICQKMLQCLLSP
        RDVLSSVSSLDPSIEVQQICQKMLQCLLSP
Subjt:  RDVLSSVSSLDPSIEVQQICQKMLQCLLSP

XP_038882127.1 uncharacterized protein LOC120073376 isoform X3 [Benincasa hispida]    5.4e-220    79.35
Query:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE
        GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDK+FAEPSNAILVRFLSMINEHLVKATDVVLK ILSYVKGQKEID+CF  K ESQ+EDI  SVQ  LFE
Subjt:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE

Query:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVWGFGL
        RLCPLLVIRMLPLEVFNDLSMS MYGQLPNRA +H                                                D   + H          
Subjt:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVWGFGL

Query:  YLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSKSQ
         LL  AFSKFEFDDVRKLAAELCGRI+PQVLYP+V+ +LEDAA+S NIP IKACLFSMCTSL VR +H FSHFD+FEIVKTLEVVLSWPSQNGDEVSKSQ
Subjt:  YLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSKSQ

Query:  HGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDSRKK
        HGCIDCMALMICAELQAPDSCSAS LEKIDIDKKGHASLKGSIL YVIHQ+I GTKELVSTYDLDNNDNTSDNSTPLS  LCM NVLISACQKLSDSRKK
Subjt:  HGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDSRKK

Query:  LFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPILECISGGLVEARDVLSSV
         FARKVLPRLI FVEV STQVDIRAACI VIFSAVYHLKSAILPYANDI  VSLNALKNG EKERIAGAKLMVSLMSSEDPILECISGGL+EARDVLSSV
Subjt:  LFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPILECISGGLVEARDVLSSV

Query:  SSLDPSIEVQQICQKMLQCLLSP
        SSLDPSIEVQQICQKMLQCLLSP
Subjt:  SSLDPSIEVQQICQKMLQCLLSP

XP_038882128.1 uncharacterized protein LOC120073376 isoform X4 [Benincasa hispida]    6.6e-218    78.3
Query:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE
        GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDK+FAEPSNAILVRFLSMINEHLVKATDVVLK ILSYVKGQKEID+CF  K ESQ+EDI  SVQ  LFE
Subjt:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE

Query:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVWGFGL
        RLCPLLVIRMLPLEVFNDLSMS MYGQLPNRA +H                                                D   + H          
Subjt:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVWGFGL

Query:  YLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSKSQ
         LL  AFSKFEFDDVRKLAAELCGRI+PQVLYP+V+ +LEDAA+S NIP IKACLFSMCTSL VR +H FSHFD+FEIVKTLEVVLSWPSQNGDEVSKSQ
Subjt:  YLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSKSQ

Query:  HGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDSRKK
        HGCIDCMALMICAELQAPDSCSAS LEKIDIDKKGHASLKGSIL YVIHQ+I GTKELVSTYDLDNNDNTSDNSTPLS  LCM NVLISACQKLSDSRKK
Subjt:  HGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDSRKK

Query:  LFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEK-------ERIAGAKLMVSLMSSEDPILECISGGLVEA
         FARKVLPRLI FVEV STQVDIRAACI VIFSAVYHLKSAILPYANDI  VSLNALKNG EK       ERIAGAKLMVSLMSSEDPILECISGGL+EA
Subjt:  LFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEK-------ERIAGAKLMVSLMSSEDPILECISGGLVEA

Query:  RDVLSSVSSLDPSIEVQQICQKMLQCLLSP
        RDVLSSVSSLDPSIEVQQICQKMLQCLLSP
Subjt:  RDVLSSVSSLDPSIEVQQICQKMLQCLLSP

TrEMBL top hits    e value    %identity    Alignment
A0A1S3B593 uncharacterized protein LOC103486160 isoform X6    9.1e-213    75.33
Query:  ELLPGSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQ
        EL PGSRLQSDRVLSLIPQWSQSVQ+WKFLIGPL+DK+FAEPSNAILVRFLSMINEH VKATDVVL+RILSYVKGQKEIDECFY K ++Q+ED+SLSVQQ
Subjt:  ELLPGSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQ

Query:  SLFERLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVW
        SLFERLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRA +H                                                D   + H      
Subjt:  SLFERLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVW

Query:  GFGLYLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEV
             LL  AFSK EFDDVRKLAAEL GRI+PQVLYPFV+ +LEDAA S NIP IKACLFSMCTSL VRG+H FSHFDMF+IVKTLE++LSWPSQNGDEV
Subjt:  GFGLYLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEV

Query:  SKSQHGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSD
        SKSQHGCIDC+ALMIC ELQAP+SCSASN  KIDI+KKGHASLKGSIL YV+ +LI+GTKE  + +DLDNNDNTSDNSTPLS +LCMANVL SACQKLSD
Subjt:  SKSQHGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSD

Query:  SRKKLFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPILECISGGLVEARDV
        SRKK FARKVLPRLI FVEV ST VDIR ACI VIFSAVYHLKSAILPY+ D+  VSLNALKNG E+ERIAGAKLMVSLMSSEDPILECISGGL+EARDV
Subjt:  SRKKLFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPILECISGGLVEARDV

Query:  LSSVSSLDPSIEVQQICQKMLQCLLSP
        LSSVSS DPSIEVQQICQKMLQCL+SP
Subjt:  LSSVSSLDPSIEVQQICQKMLQCLLSP

A0A1S4DUA0 uncharacterized protein LOC103486160 isoform X5    1.6e-209    73.33
Query:  ELLPGSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEI-------------DECFYIKH
        EL PGSRLQSDRVLSLIPQWSQSVQ+WKFLIGPL+DK+FAEPSNAILVRFLSMINEH VKATDVVL+RILSYVKGQKE              DECFY K 
Subjt:  ELLPGSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEI-------------DECFYIKH

Query:  ESQNEDISLSVQQSLFERLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILP
        ++Q+ED+SLSVQQSLFERLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRA +H                                                
Subjt:  ESQNEDISLSVQQSLFERLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILP

Query:  DAATLMHSWSRVWGFGLYLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLE
        D   + H           LL  AFSK EFDDVRKLAAEL GRI+PQVLYPFV+ +LEDAA S NIP IKACLFSMCTSL VRG+H FSHFDMF+IVKTLE
Subjt:  DAATLMHSWSRVWGFGLYLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLE

Query:  VVLSWPSQNGDEVSKSQHGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCM
        ++LSWPSQNGDEVSKSQHGCIDC+ALMIC ELQAP+SCSASN  KIDI+KKGHASLKGSIL YV+ +LI+GTKE  + +DLDNNDNTSDNSTPLS +LCM
Subjt:  VVLSWPSQNGDEVSKSQHGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCM

Query:  ANVLISACQKLSDSRKKLFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPIL
        ANVL SACQKLSDSRKK FARKVLPRLI FVEV ST VDIR ACI VIFSAVYHLKSAILPY+ D+  VSLNALKNG E+ERIAGAKLMVSLMSSEDPIL
Subjt:  ANVLISACQKLSDSRKKLFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPIL

Query:  ECISGGLVEARDVLSSVSSLDPSIEVQQICQKMLQCLLSP
        ECISGGL+EARDVLSSVSS DPSIEVQQICQKMLQCL+SP
Subjt:  ECISGGLVEARDVLSSVSSLDPSIEVQQICQKMLQCLLSP

A0A1S4DUA6 uncharacterized protein LOC103486160 isoform X10    1.6e-209    73.33
Query:  ELLPGSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEI-------------DECFYIKH
        EL PGSRLQSDRVLSLIPQWSQSVQ+WKFLIGPL+DK+FAEPSNAILVRFLSMINEH VKATDVVL+RILSYVKGQKE              DECFY K 
Subjt:  ELLPGSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEI-------------DECFYIKH

Query:  ESQNEDISLSVQQSLFERLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILP
        ++Q+ED+SLSVQQSLFERLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRA +H                                                
Subjt:  ESQNEDISLSVQQSLFERLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILP

Query:  DAATLMHSWSRVWGFGLYLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLE
        D   + H           LL  AFSK EFDDVRKLAAEL GRI+PQVLYPFV+ +LEDAA S NIP IKACLFSMCTSL VRG+H FSHFDMF+IVKTLE
Subjt:  DAATLMHSWSRVWGFGLYLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLE

Query:  VVLSWPSQNGDEVSKSQHGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCM
        ++LSWPSQNGDEVSKSQHGCIDC+ALMIC ELQAP+SCSASN  KIDI+KKGHASLKGSIL YV+ +LI+GTKE  + +DLDNNDNTSDNSTPLS +LCM
Subjt:  VVLSWPSQNGDEVSKSQHGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCM

Query:  ANVLISACQKLSDSRKKLFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPIL
        ANVL SACQKLSDSRKK FARKVLPRLI FVEV ST VDIR ACI VIFSAVYHLKSAILPY+ D+  VSLNALKNG E+ERIAGAKLMVSLMSSEDPIL
Subjt:  ANVLISACQKLSDSRKKLFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPIL

Query:  ECISGGLVEARDVLSSVSSLDPSIEVQQICQKMLQCLLSP
        ECISGGL+EARDVLSSVSS DPSIEVQQICQKMLQCL+SP
Subjt:  ECISGGLVEARDVLSSVSSLDPSIEVQQICQKMLQCLLSP

A0A1S4DUC1 uncharacterized protein LOC103486160 isoform X7    1.6e-209    73.33
Query:  ELLPGSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEI-------------DECFYIKH
        EL PGSRLQSDRVLSLIPQWSQSVQ+WKFLIGPL+DK+FAEPSNAILVRFLSMINEH VKATDVVL+RILSYVKGQKE              DECFY K 
Subjt:  ELLPGSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEI-------------DECFYIKH

Query:  ESQNEDISLSVQQSLFERLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILP
        ++Q+ED+SLSVQQSLFERLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRA +H                                                
Subjt:  ESQNEDISLSVQQSLFERLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILP

Query:  DAATLMHSWSRVWGFGLYLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLE
        D   + H           LL  AFSK EFDDVRKLAAEL GRI+PQVLYPFV+ +LEDAA S NIP IKACLFSMCTSL VRG+H FSHFDMF+IVKTLE
Subjt:  DAATLMHSWSRVWGFGLYLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLE

Query:  VVLSWPSQNGDEVSKSQHGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCM
        ++LSWPSQNGDEVSKSQHGCIDC+ALMIC ELQAP+SCSASN  KIDI+KKGHASLKGSIL YV+ +LI+GTKE  + +DLDNNDNTSDNSTPLS +LCM
Subjt:  VVLSWPSQNGDEVSKSQHGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCM

Query:  ANVLISACQKLSDSRKKLFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPIL
        ANVL SACQKLSDSRKK FARKVLPRLI FVEV ST VDIR ACI VIFSAVYHLKSAILPY+ D+  VSLNALKNG E+ERIAGAKLMVSLMSSEDPIL
Subjt:  ANVLISACQKLSDSRKKLFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPIL

Query:  ECISGGLVEARDVLSSVSSLDPSIEVQQICQKMLQCLLSP
        ECISGGL+EARDVLSSVSS DPSIEVQQICQKMLQCL+SP
Subjt:  ECISGGLVEARDVLSSVSSLDPSIEVQQICQKMLQCLLSP

A0A6J1HVT8 uncharacterized protein LOC111467138    4.6e-217    77.95
Query:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE
        GSRLQSDRVLSLIP+WS SVQDWKFLIG LIDK+FAEPSNA+LVR LS+INEHLVKATDV+LKRILSYVKGQKEIDE FY K  SQN DIS SVQQSLFE
Subjt:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE

Query:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILP--DAATLMHSWSRVWGF
        RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRA ++G W    +LYSF  + L                + +S  L+  R +      AT++ S   ++  
Subjt:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILP--DAATLMHSWSRVWGF

Query:  GLYLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSK
          Y    AFSKFEFDDVRKLAAELCGRI+PQVLYP+VSL+LEDA  S NIPGIKACLFSMCTSLAVRG+H  SHFD+FEIVKTLEVVLSWPSQ+GDEVSK
Subjt:  GLYLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSK

Query:  SQHGCIDCMALMICAELQAPDSCSASNLEKIDID-KKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDS
        SQHGCIDCMALMICAELQAP++CS SNLEKID+D KKGHAS+KGSILGYVIHQLI+G KELVSTYDLD   NT+DNSTP+S  LCMANVLISACQKLSD 
Subjt:  SQHGCIDCMALMICAELQAPDSCSASNLEKIDID-KKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDS

Query:  RKKLFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPILECISGGLVEARDVL
        RKK FARKVLPRL+ F +V STQVDIRAACI VIFSAVYHLKSAILPYANDI RVS+NALK+G EKERIAGAKLMVSLMSSEDPIL+CISG L+EARDVL
Subjt:  RKKLFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPILECISGGLVEARDVL

Query:  SSVSSLDPSIEVQQICQKMLQCLLSP
        SSVSSLDPSIEVQQICQKMLQCLLSP
Subjt:  SSVSSLDPSIEVQQICQKMLQCLLSP

SwissProt top hits    e value    %identity    Alignment
No hits found
Arabidopsis top hits    e value    %identity    Alignment
AT3G57570.1 ARM repeat superfamily protein    6.0e-116    46.15
Query:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE
        G    SDRVL LIP+W++SVQ+W  LIGPL+DK+F EPSNAI+VRFLS I+E L   +D+VL  +LS++K Q ++D  F  + ++++       ++SLF+
Subjt:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE

Query:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVWGFGL
         LCPLL++R+LP  VF+D+  S +YG+  +  +++                 + ++KF                         D   +            
Subjt:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVWGFGL

Query:  YLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSKSQ
        ++L+ AFSKFEF++VRKL+AELCGR++PQVL+P V L LE A   ++   IKACLFS+CTSL VRG    SH    +I K LE +L WPS   DE+SK Q
Subjt:  YLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSKSQ

Query:  HGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDSRKK
        HGCIDC+ALMICAELQ   S   S  EKI    K  +    S+L Y IH LI       S   L  +  T +N  P+   LCMANV+ISACQK  +S KK
Subjt:  HGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDSRKK

Query:  LFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPILECISGGLVEARDVLSSV
         FARK LP LI  ++VIS   ++RAACI+V+FSA YHLKS +LP ++D+L++SL  L+ G EKE++AGAKLM SLM+SED ILE IS GL+EAR VLS  
Subjt:  LFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPILECISGGLVEARDVLSSV

Query:  SSLDPSIEVQQICQKMLQCL
        S  DPS +V+++C K+L C+
Subjt:  SSLDPSIEVQQICQKMLQCL

AT3G57570.2 ARM repeat superfamily protein    6.0e-116    46.15
Query:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE
        G    SDRVL LIP+W++SVQ+W  LIGPL+DK+F EPSNAI+VRFLS I+E L   +D+VL  +LS++K Q ++D  F  + ++++       ++SLF+
Subjt:  GSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLFE

Query:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVWGFGL
         LCPLL++R+LP  VF+D+  S +YG+  +  +++                 + ++KF                         D   +            
Subjt:  RLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVWGFGL

Query:  YLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSKSQ
        ++L+ AFSKFEF++VRKL+AELCGR++PQVL+P V L LE A   ++   IKACLFS+CTSL VRG    SH    +I K LE +L WPS   DE+SK Q
Subjt:  YLLKMAFSKFEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSKSQ

Query:  HGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDSRKK
        HGCIDC+ALMICAELQ   S   S  EKI    K  +    S+L Y IH LI       S   L  +  T +N  P+   LCMANV+ISACQK  +S KK
Subjt:  HGCIDCMALMICAELQAPDSCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDSRKK

Query:  LFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPILECISGGLVEARDVLSSV
         FARK LP LI  ++VIS   ++RAACI+V+FSA YHLKS +LP ++D+L++SL  L+ G EKE++AGAKLM SLM+SED ILE IS GL+EAR VLS  
Subjt:  LFARKVLPRLICFVEVISTQVDIRAACIEVIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPILECISGGLVEARDVLSSV

Query:  SSLDPSIEVQQICQKMLQCL
        S  DPS +V+++C K+L C+
Subjt:  SSLDPSIEVQQICQKMLQCL


Sequences
CDS sequence
ATGATACTTGAATCTTACGTAGAGCTTTTGCCAGGATCAAGGTTGCAAAGTGACCGAGTGCTCAGCCTAATTCCTCAATGGTCTCAGAGTGTTCAAGATTGGAAATTCTT
GATAGGGCCGTTGATTGATAAGCTGTTTGCAGAACCATCTAATGCAATTCTCGTAAGGTTCCTGAGTATGATAAACGAGCACTTGGTGAAAGCCACTGATGTGGTCCTAA
AGCGTATTTTGTCATATGTGAAAGGACAGAAAGAGATAGATGAGTGCTTCTACATTAAACATGAGAGCCAAAACGAAGATATCTCTCTGAGTGTGCAACAATCTCTGTTT
GAGCGTCTTTGTCCACTACTTGTTATTAGGATGCTTCCTCTTGAAGTTTTTAATGACCTGAGTATGTCAGTCATGTATGGTCAGCTTCCTAACCGAGCAACTATGCATGG
TGGTTGGTGGAAGGCCCCTTTACTATATTCCTTCATCATACTGCCACTCTTCATTGAGGTGAAATTCTCCCGAAGGTTTTTGCTTGAGAAGTTTAACATGGGAGAATCAA
CATTGTTGACTTTATTCAGAGCCATTCTTCCTGATGCAGCCACATTGATGCATTCTTGGTCACGTGTATGGGGATTTGGGTTATATCTTTTGAAGATGGCATTTTCTAAG
TTTGAATTTGATGATGTACGGAAGTTGGCTGCTGAGCTGTGTGGACGCATTAATCCCCAGGTGCTTTATCCTTTTGTTAGCTTGATACTAGAAGATGCTGCCAGTTCTCG
TAATATACCAGGAATAAAAGCCTGCCTTTTTTCGATGTGCACGTCCCTTGCGGTAAGAGGCCAGCATAAGTTTTCACATTTTGACATGTTTGAAATTGTAAAAACCTTGG
AAGTAGTTCTATCGTGGCCATCTCAGAATGGAGATGAAGTTTCCAAATCACAACATGGATGCATTGATTGCATGGCGTTGATGATATGTGCTGAACTACAAGCTCCGGAC
TCATGCAGCGCCTCCAATTTGGAGAAAATTGACATTGATAAGAAAGGGCATGCCTCCTTAAAAGGTTCTATCCTGGGTTATGTGATCCATCAATTAATAAATGGTACAAA
AGAACTAGTTTCAACCTATGACTTGGACAATAATGACAACACATCCGACAATTCTACTCCTTTATCTCGTCACCTCTGCATGGCAAATGTGCTCATCAGTGCCTGCCAAA
AGCTTTCGGATTCAAGAAAGAAACTATTTGCTCGAAAAGTGCTTCCACGTCTGATTTGTTTTGTTGAGGTAATAAGTACACAGGTAGATATTAGAGCTGCATGTATTGAA
GTCATCTTTTCAGCCGTGTATCATCTGAAGTCGGCTATTCTACCTTATGCCAATGATATTCTCAGAGTCTCCTTAAACGCTTTGAAAAATGGGCCAGAAAAGGAAAGGAT
AGCCGGTGCTAAGCTGATGGTATCCCTTATGTCAAGTGAAGATCCAATTTTGGAGTGTATTTCAGGAGGATTAGTAGAAGCAAGAGATGTGCTCTCAAGTGTATCTTCTT
TGGATCCTTCAATTGAAGTCCAACAAATTTGCCAGAAAATGCTCCAATGTTTGCTTTCTCCATGA
mRNA sequence
ATGATACTTGAATCTTACGTAGAGCTTTTGCCAGGATCAAGGTTGCAAAGTGACCGAGTGCTCAGCCTAATTCCTCAATGGTCTCAGAGTGTTCAAGATTGGAAATTCTT
GATAGGGCCGTTGATTGATAAGCTGTTTGCAGAACCATCTAATGCAATTCTCGTAAGGTTCCTGAGTATGATAAACGAGCACTTGGTGAAAGCCACTGATGTGGTCCTAA
AGCGTATTTTGTCATATGTGAAAGGACAGAAAGAGATAGATGAGTGCTTCTACATTAAACATGAGAGCCAAAACGAAGATATCTCTCTGAGTGTGCAACAATCTCTGTTT
GAGCGTCTTTGTCCACTACTTGTTATTAGGATGCTTCCTCTTGAAGTTTTTAATGACCTGAGTATGTCAGTCATGTATGGTCAGCTTCCTAACCGAGCAACTATGCATGG
TGGTTGGTGGAAGGCCCCTTTACTATATTCCTTCATCATACTGCCACTCTTCATTGAGGTGAAATTCTCCCGAAGGTTTTTGCTTGAGAAGTTTAACATGGGAGAATCAA
CATTGTTGACTTTATTCAGAGCCATTCTTCCTGATGCAGCCACATTGATGCATTCTTGGTCACGTGTATGGGGATTTGGGTTATATCTTTTGAAGATGGCATTTTCTAAG
TTTGAATTTGATGATGTACGGAAGTTGGCTGCTGAGCTGTGTGGACGCATTAATCCCCAGGTGCTTTATCCTTTTGTTAGCTTGATACTAGAAGATGCTGCCAGTTCTCG
TAATATACCAGGAATAAAAGCCTGCCTTTTTTCGATGTGCACGTCCCTTGCGGTAAGAGGCCAGCATAAGTTTTCACATTTTGACATGTTTGAAATTGTAAAAACCTTGG
AAGTAGTTCTATCGTGGCCATCTCAGAATGGAGATGAAGTTTCCAAATCACAACATGGATGCATTGATTGCATGGCGTTGATGATATGTGCTGAACTACAAGCTCCGGAC
TCATGCAGCGCCTCCAATTTGGAGAAAATTGACATTGATAAGAAAGGGCATGCCTCCTTAAAAGGTTCTATCCTGGGTTATGTGATCCATCAATTAATAAATGGTACAAA
AGAACTAGTTTCAACCTATGACTTGGACAATAATGACAACACATCCGACAATTCTACTCCTTTATCTCGTCACCTCTGCATGGCAAATGTGCTCATCAGTGCCTGCCAAA
AGCTTTCGGATTCAAGAAAGAAACTATTTGCTCGAAAAGTGCTTCCACGTCTGATTTGTTTTGTTGAGGTAATAAGTACACAGGTAGATATTAGAGCTGCATGTATTGAA
GTCATCTTTTCAGCCGTGTATCATCTGAAGTCGGCTATTCTACCTTATGCCAATGATATTCTCAGAGTCTCCTTAAACGCTTTGAAAAATGGGCCAGAAAAGGAAAGGAT
AGCCGGTGCTAAGCTGATGGTATCCCTTATGTCAAGTGAAGATCCAATTTTGGAGTGTATTTCAGGAGGATTAGTAGAAGCAAGAGATGTGCTCTCAAGTGTATCTTCTT
TGGATCCTTCAATTGAAGTCCAACAAATTTGCCAGAAAATGCTCCAATGTTTGCTTTCTCCATGA
Protein sequence
MILESYVELLPGSRLQSDRVLSLIPQWSQSVQDWKFLIGPLIDKLFAEPSNAILVRFLSMINEHLVKATDVVLKRILSYVKGQKEIDECFYIKHESQNEDISLSVQQSLF
ERLCPLLVIRMLPLEVFNDLSMSVMYGQLPNRATMHGGWWKAPLLYSFIILPLFIEVKFSRRFLLEKFNMGESTLLTLFRAILPDAATLMHSWSRVWGFGLYLLKMAFSK
FEFDDVRKLAAELCGRINPQVLYPFVSLILEDAASSRNIPGIKACLFSMCTSLAVRGQHKFSHFDMFEIVKTLEVVLSWPSQNGDEVSKSQHGCIDCMALMICAELQAPD
SCSASNLEKIDIDKKGHASLKGSILGYVIHQLINGTKELVSTYDLDNNDNTSDNSTPLSRHLCMANVLISACQKLSDSRKKLFARKVLPRLICFVEVISTQVDIRAACIE
VIFSAVYHLKSAILPYANDILRVSLNALKNGPEKERIAGAKLMVSLMSSEDPILECISGGLVEARDVLSSVSSLDPSIEVQQICQKMLQCLLSP