CuGenDBv2

Sgr016727 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr016727
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Unknown protein
Genome location: tig00152985:1653877..1661614
RNA-Seq Expression: Sgr016727
Synteny: Sgr016727
Gene Ontology terms: GO:0016020 - membrane (cellular component)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG6590083.1 hypothetical protein SDJN03_15506, partial [Cucurbita argyrosperma subsp. sororia] (e value: 5.5e-107; identity: 60.96%)
Query:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT
        ME  R IDWNMEEQ SSGDG+SSEK+ SAVQK C++GKKLL+TGLAISS PVVLPPLVIMSAFG AASIPYGVFLASYACTE IMSVWLPIPP  +LD  
Subjt:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT

Query:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELLV--DEEDKIDTGSGEAAAIEVTSVEFEGNGGVDIGD---EEEQLKETKGLLERIKDEGRSGDDV
        DEE+VEE+ Y+++    +E  +    ++    + ++V  DEE + D GS   AAIEVT+VEFEGNG  DIGD   EEE+LKET+GLLERI+DEGR  +  
Subjt:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELLV--DEEDKIDTGSGEAAAIEVTSVEFEGNGGVDIGD---EEEQLKETKGLLERIKDEGRSGDDV

Query:  AEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSEVAVSSQPA---------------KEEAE-LPSVTIIDVVESGEDLSVS
         +ANG  D+VRELEI           E + LGLLNE +S+A++P A+Y TSE   S++ +                EEAE LP VT+IDV+ES E LS+S
Subjt:  AEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSEVAVSSQPA---------------KEEAE-LPSVTIIDVVESGEDLSVS

Query:  ALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDS-SDDDLNQLNQKLQFLMSIVGVK
        A+TIE K + N+PH++H++ +NE+L  EVKIRE+I SMKKIVGY AT LGTY+DEVNALYAF+GVEPPS ++DS +DDD+N LNQKLQFLMSIVGVK
Subjt:  ALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDS-SDDDLNQLNQKLQFLMSIVGVK

XP_022987097.1 uncharacterized protein LOC111484754 isoform X1 [Cucurbita maxima] (e value: 7.2e-107; identity: 60.64%)
Query:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT
        ME  R IDWN+EEQ SSGDG+SSEK+ SAVQK C++GKKLL+TGLAISS PVVLPPLVIMSAFG AASIPYGVFLASYACTE IMSVWLPIPP  +LD  
Subjt:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT

Query:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELLVD-----------EEDKIDTGSGEAAAIEVTSVEFEGNG-GVDIGDEEEQLKETKGLLERIKDE
        DEEMVEE+ Y         E+EE+ ++E  K  E L D           EE + D GS   AAIEVT+VEFEGNG   D  +EEE+LKET+GLLERI+DE
Subjt:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELLVD-----------EEDKIDTGSGEAAAIEVTSVEFEGNG-GVDIGDEEEQLKETKGLLERIKDE

Query:  GRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSE--------------VAVSSQPAK-EEAE-LPSVTIIDVVES
        GR  +   + NG  D+VRELEI           E + LGLLNE +S+A++P A+Y TSE                +S++ AK EEAE LP VT+IDV+ES
Subjt:  GRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSE--------------VAVSSQPAK-EEAE-LPSVTIIDVVES

Query:  GEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDS-SDDDLNQLNQKLQFLMSI
         E LS+SA+TIE K + N+PH++H++ +NE+L  EVKIRE+I SMKKIVGY AT LGTY+DEVNALYAF+GVEPPS ++DS +DDD+N LNQKLQFLMSI
Subjt:  GEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDS-SDDDLNQLNQKLQFLMSI

Query:  VGVK
        VGVK
Subjt:  VGVK

XP_022987099.1 uncharacterized protein LOC111484755 isoform X1 [Cucurbita maxima] (e value: 7.2e-107; identity: 60.84%)
Query:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT
        ME  R IDWN+EEQ SSGDG+SSEK+ SAVQK C++GKKLL+TGLAISS PVVLPPLVIMSAFG AASIPYGVFLASYACTE IMSVWLPIPP  +LD  
Subjt:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT

Query:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELLVD-----------EEDKIDTGSGEAAAIEVTSVEFEGNGGVDIGD---EEEQLKETKGLLERIK
        DEEMVEE+ Y         E+EE+ ++E  K  E L D           EE + D GS   AAIEVT+VEFEGNG  D GD   EEE+LKET+GLLERI+
Subjt:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELLVD-----------EEDKIDTGSGEAAAIEVTSVEFEGNGGVDIGD---EEEQLKETKGLLERIK

Query:  DEGRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSE--------------VAVSSQPAK-EEAE-LPSVTIIDVV
        DEGR  +   + NG  D+VRELEI           E + LGLLNE +S+A++P A+Y TSE                +S++ AK EEAE LP VT+IDV+
Subjt:  DEGRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSE--------------VAVSSQPAK-EEAE-LPSVTIIDVV

Query:  ESGEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDS-SDDDLNQLNQKLQFLM
        ES E LS+SA+TIE K + N+PH++H++ +NE+L  EVKIRE+I SMKKIVGY AT LGTY+DEVNALYAF+GVEPPS ++DS +DDD+N LNQKLQFLM
Subjt:  ESGEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDS-SDDDLNQLNQKLQFLM

Query:  SIVGVK
        SIVGVK
Subjt:  SIVGVK

XP_038894925.1 uncharacterized protein LOC120083308 isoform X1 [Benincasa hispida] (e value: 7.4e-120; identity: 67.61%)
Query:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT
        ME  R IDWNMEEQ SSGDG+SSEK+RS VQK C+VGKKLL+TGLAISSAPVVLPPLVIMSAFGF ASIPYGVFLASYAC E IMSVWLP+PPPPEL   
Subjt:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT

Query:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELL-----------VDEEDKIDTGSGE-AAAIEVTSVEFEGNGGVDIGDEEEQLKETKGLLERIKDE
        D+E+VEENGY+EDI   ++E+EE+  +E TKS E+L            DEED++D GS E  A IEVT+VEFE NG  DIGDEEEQL+ET+GLL+RI+DE
Subjt:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELL-----------VDEEDKIDTGSGE-AAAIEVTSVEFEGNGGVDIGDEEEQLKETKGLLERIKDE

Query:  GRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSEVAVSSQ-PAKEEAE-LPSVTIIDVVESGEDLSVSALTIESK
        GR  DD AEANG  D+VRELEI           E +   LLNE +S+ +HP  EY  SEVAVSS+ P  EEAE L SVT+IDV+ES E+LSVSA+TI+ K
Subjt:  GRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSEVAVSSQ-PAKEEAE-LPSVTIIDVVESGEDLSVSALTIESK

Query:  AQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDSSDDDLNQLNQKLQFLMSIVGVK
         + NAPH+++++ ++E+L SEVKIRE I SMKKI+GYNAT LGTYIDEVNALYAFVGVEPPS L+ SSD DLN LNQKLQFLMSIVGVK
Subjt:  AQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDSSDDDLNQLNQKLQFLMSIVGVK

XP_038894927.1 uncharacterized protein LOC120083308 isoform X2 [Benincasa hispida] (e value: 3.1e-118; identity: 67.01%)
Query:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT
        ME  R IDWNMEEQ SSGDG+SSEK+RS VQK C+VGKKLL+TGLAISSAPVVLPPLVIMSAFGF ASIPYGVFLASYAC E IMSVWLP+PPPPEL   
Subjt:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT

Query:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELL-----------VDEEDKIDTGSGE-AAAIEVTSVEFEGNGGVDIGDEEEQLKETKGLLERIKDE
        D+E+VEENGY+EDI   ++E+EE+  +E TKS E+L            DEED++D GS E  A IEVT+VEFE NG  DIGDEEEQL+ET+GLL+RI+DE
Subjt:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELL-----------VDEEDKIDTGSGE-AAAIEVTSVEFEGNGGVDIGDEEEQLKETKGLLERIKDE

Query:  GRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAE-LPSVTIIDVVESGEDLSVSALTIESKA
        GR  DD AEANG  D+VRELEI           E +   LLNE +S+ +HP  EY  SEV+ S  P  EEAE L SVT+IDV+ES E+LSVSA+TI+ K 
Subjt:  GRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAE-LPSVTIIDVVESGEDLSVSALTIESKA

Query:  QPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDSSDDDLNQLNQKLQFLMSIVGVK
        + NAPH+++++ ++E+L SEVKIRE I SMKKI+GYNAT LGTYIDEVNALYAFVGVEPPS L+ SSD DLN LNQKLQFLMSIVGVK
Subjt:  QPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDSSDDDLNQLNQKLQFLMSIVGVK

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LW30 Uncharacterized protein (e value: 8.5e-106; identity: 61.13%)
Query:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT
        ME  R IDWNMEEQ SSG+ +SS+K+RS ++K CNVGKKLLITGLAISSAPVVLPPLVIMSAFGF ASIPYGVFLASYACTE  MSVWLP+PPPPELD  
Subjt:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT

Query:  -DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELLVDEEDKIDTGSGE--------------AAAIEVTSVEFEGNGGVDIGDEEEQLKETKGLLERI
         DEE+ EEN Y+E I      E+E+  +E TKS  +L D ++ +    G+                 IE+T VEFE N   DI DE+EQL+ET+GLL+RI
Subjt:  -DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELLVDEEDKIDTGSGE--------------AAAIEVTSVEFEGNGGVDIGDEEEQLKETKGLLERI

Query:  KDEGRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAELP-SVTIIDVVESGEDLSVSALTIE
        +DEG+  DD  EANG+ D+VRELEI           E +  GLL+E +S+ +HP  EY  SEV+ S     EEAE P SVT+IDV+ES EDLS+SA+TIE
Subjt:  KDEGRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAELP-SVTIIDVVESGEDLSVSALTIE

Query:  SKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDSSDDDLNQLNQKLQFLMSIVGVK
         K + NAPH++ ++ ANE+L SE+KIRE I SMKKI+GYNAT +GTYIDEVNALY+FVGVEPP+ L+DSS DDLN L+QKLQFLMSIVGVK
Subjt:  SKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDSSDDDLNQLNQKLQFLMSIVGVK

A0A5A7TQ38 Uncharacterized protein (e value: 2.7e-104; identity: 60.88%)
Query:  MEDNRRIDWNMEEQSSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCTD
        ME  R IDWNMEE SSGD  S +++ S ++K CNVGKKLLI GLAISSAPV+LPPLVIMSAFGF ASIPYGVFLASYACTE IMSVWLP+P PPE+D  D
Subjt:  MEDNRRIDWNMEEQSSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCTD

Query:  EEMVEENGYQEDIDECVEEEEEESVVEATKSNELLVD----------EEDKIDTGS-GEAAAIEVTSVEFEGNGGVDIGDEEEQLKETKGLLERIKDEGR
        EE+VEEN Y+E I     +E+E+  +E  +S  +L D          +ED+ D GS  +   IEVT+VEFEGNG  DI D+EEQL+ET+GLL+RI+DEGR
Subjt:  EEMVEENGYQEDIDECVEEEEEESVVEATKSNELLVD----------EEDKIDTGS-GEAAAIEVTSVEFEGNGGVDIGDEEEQLKETKGLLERIKDEGR

Query:  SGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAELP-SVTIIDVVESGEDLSVSALTIESKAQP
          DD  EA  + D+VRELEI           E +  GLL+E +S  +HP  EY  SEV+ S     EEAE P SVT+IDV+ES EDLS+SA+TIE K + 
Subjt:  SGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAELP-SVTIIDVVESGEDLSVSALTIESKAQP

Query:  NAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDSSDDDLNQLNQKLQFLMSIVGVK
        NAP ++ ++ +NEDL SE+KIRE I SMKKI+GYN T +GTYIDEVNALY+ VGVEPP+ L+DSSDDDLN L+Q+LQFLMSIVGVK
Subjt:  NAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDSSDDDLNQLNQKLQFLMSIVGVK

A0A6J1H9E7 uncharacterized protein LOC111461753 isoform X1 (e value: 4.2e-105; identity: 60.1%)
Query:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT
        ME  R IDWNMEEQ SSGDG+SSEK+ SAVQK C++GKKLL+TGLAISS PVVLPPLVIMSAFG AASIPYGVFLASYACTE IMSVWLPIPP  +LD  
Subjt:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT

Query:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELL-----------VDEEDKIDTGSGEAAAIEVTSVEFEGNGGVDIGD---EEEQLKETKGLLERIK
        DEE+VEE+ Y         E+EE+ ++E  K  E L            DEE + D GS   AAIEVT+VEFEGNG  DIGD   EEE+LKET+GLLERI+
Subjt:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELL-----------VDEEDKIDTGSGEAAAIEVTSVEFEGNGGVDIGD---EEEQLKETKGLLERIK

Query:  DEGRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSE--------------VAVSSQPAK-EEAE-LPSVTIIDVV
        DEGR  +   + NG  ++VRELEI           E + LGLLNE +S+A++P   Y TSE                ++++ AK EEAE LP VT+IDV+
Subjt:  DEGRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSE--------------VAVSSQPAK-EEAE-LPSVTIIDVV

Query:  ESGEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDS-SDDDLNQLNQKLQFLM
        ES E LS+S +TIE K + N PH++H+  +NE+L  EVKIRE+I SMKKIVGY AT LGTY+DEVNALYAF+GVEPPS ++DS +DDD+N LNQKLQFLM
Subjt:  ESGEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDS-SDDDLNQLNQKLQFLM

Query:  SIVGVK
        SIVGVK
Subjt:  SIVGVK

A0A6J1J9E9 uncharacterized protein LOC111484754 isoform X1 (e value: 3.5e-107; identity: 60.64%)
Query:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT
        ME  R IDWN+EEQ SSGDG+SSEK+ SAVQK C++GKKLL+TGLAISS PVVLPPLVIMSAFG AASIPYGVFLASYACTE IMSVWLPIPP  +LD  
Subjt:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT

Query:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELLVD-----------EEDKIDTGSGEAAAIEVTSVEFEGNG-GVDIGDEEEQLKETKGLLERIKDE
        DEEMVEE+ Y         E+EE+ ++E  K  E L D           EE + D GS   AAIEVT+VEFEGNG   D  +EEE+LKET+GLLERI+DE
Subjt:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELLVD-----------EEDKIDTGSGEAAAIEVTSVEFEGNG-GVDIGDEEEQLKETKGLLERIKDE

Query:  GRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSE--------------VAVSSQPAK-EEAE-LPSVTIIDVVES
        GR  +   + NG  D+VRELEI           E + LGLLNE +S+A++P A+Y TSE                +S++ AK EEAE LP VT+IDV+ES
Subjt:  GRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSE--------------VAVSSQPAK-EEAE-LPSVTIIDVVES

Query:  GEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDS-SDDDLNQLNQKLQFLMSI
         E LS+SA+TIE K + N+PH++H++ +NE+L  EVKIRE+I SMKKIVGY AT LGTY+DEVNALYAF+GVEPPS ++DS +DDD+N LNQKLQFLMSI
Subjt:  GEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDS-SDDDLNQLNQKLQFLMSI

Query:  VGVK
        VGVK
Subjt:  VGVK

A0A6J1JHX4 uncharacterized protein LOC111484755 isoform X1 (e value: 3.5e-107; identity: 60.84%)
Query:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT
        ME  R IDWN+EEQ SSGDG+SSEK+ SAVQK C++GKKLL+TGLAISS PVVLPPLVIMSAFG AASIPYGVFLASYACTE IMSVWLPIPP  +LD  
Subjt:  MEDNRRIDWNMEEQ-SSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCT

Query:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELLVD-----------EEDKIDTGSGEAAAIEVTSVEFEGNGGVDIGD---EEEQLKETKGLLERIK
        DEEMVEE+ Y         E+EE+ ++E  K  E L D           EE + D GS   AAIEVT+VEFEGNG  D GD   EEE+LKET+GLLERI+
Subjt:  DEEMVEENGYQEDIDECVEEEEEESVVEATKSNELLVD-----------EEDKIDTGSGEAAAIEVTSVEFEGNGGVDIGD---EEEQLKETKGLLERIK

Query:  DEGRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSE--------------VAVSSQPAK-EEAE-LPSVTIIDVV
        DEGR  +   + NG  D+VRELEI           E + LGLLNE +S+A++P A+Y TSE                +S++ AK EEAE LP VT+IDV+
Subjt:  DEGRSGDDVAEANGADDNVRELEI-----------EATGLGLLNEANSSALHPPAEYETSE--------------VAVSSQPAK-EEAE-LPSVTIIDVV

Query:  ESGEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDS-SDDDLNQLNQKLQFLM
        ES E LS+SA+TIE K + N+PH++H++ +NE+L  EVKIRE+I SMKKIVGY AT LGTY+DEVNALYAF+GVEPPS ++DS +DDD+N LNQKLQFLM
Subjt:  ESGEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPSQLEDS-SDDDLNQLNQKLQFLM

Query:  SIVGVK
        SIVGVK
Subjt:  SIVGVK

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT1G65090.1 unknown protein (e value: 1.7e-13; identity: 30.71%)
Query:  MEEQSSGDGVSSEKLRSAV-QKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCTDEEMVEENGY
        MEE  S +   SE  RS +  K+ +VGKK+L  G+ +SSAP+++P L + S   F +S+P+ +FLA+YACT+K+MS  LP                    
Subjt:  MEEQSSGDGVSSEKLRSAV-QKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCTDEEMVEENGY

Query:  QEDIDECVEEEEEESVVEATKSNELLVDEEDKIDTGSGEAAAIEVTSVEFEGNG---GVDIGDEEEQLKETKGLLERIKDEGRSGDDVAEANGADD----
                 + EE   V     +E   DE  KI  G GE AA    +  F G      + + ++EE  KE+  LLE+I+DEGR+  + +E    DD    
Subjt:  QEDIDECVEEEEEESVVEATKSNELLVDEEDKIDTGSGEAAAIEVTSVEFEGNG---GVDIGDEEEQLKETKGLLERIKDEGRSGDDVAEANGADD----

Query:  NVRELEIEATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAELPSVTIID
        N +  E++             A     E ET    + +   K++ E+ S   ID
Subjt:  NVRELEIEATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAELPSVTIID

AT1G65090.2 unknown protein (e value: 7.6e-22; identity: 27.16%)
Query:  MEEQSSGDGVSSEKLRSAV-QKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCTDEEMVEENGY
        MEE  S +   SE  RS +  K+ +VGKK+L  G+ +SSAP+++P L + S   F +S+P+ +FLA+YACT+K+MS  LP                    
Subjt:  MEEQSSGDGVSSEKLRSAV-QKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCTDEEMVEENGY

Query:  QEDIDECVEEEEEESVVEATKSNELLVDEEDKIDTGSGEAAAIEVTSVEFEGNG---GVDIGDEEEQLKETKGLLERIKDEGRSGDDVAEANGADD----
                 + EE   V     +E   DE  KI  G GE AA    +  F G      + + ++EE  KE+  LLE+I+DEGR+  + +E    DD    
Subjt:  QEDIDECVEEEEEESVVEATKSNELLVDEEDKIDTGSGEAAAIEVTSVEFEGNG---GVDIGDEEEQLKETKGLLERIKDEGRSGDDVAEANGADD----

Query:  NVRELEIEATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAELPSVTIIDVVE----SGED-----------------------------------
        N +  E++             A     E ET    + +   K++ E+ S   ID       +GE+                                   
Subjt:  NVRELEIEATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAELPSVTIIDVVE----SGED-----------------------------------

Query:  ----------------LSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVG-VEPPSQLEDSSDDDL
                        ++ SAL++ S+A  +       +  N  +YSE ++ E +++++K+VGY+     T  +E+ ALY F G VEPP    +    D+
Subjt:  ----------------LSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVG-VEPPSQLEDSSDDDL

Query:  NQLNQKLQFLMSIVGV
          L  +L+FLMS++G+
Subjt:  NQLNQKLQFLMSIVGV

AT1G65090.3 unknown protein (e value: 7.3e-25; identity: 29.97%)
Query:  MEEQSSGDGVSSEKLRSAV-QKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCTDEEMVEENGY
        MEE  S +   SE  RS +  K+ +VGKK+L  G+ +SSAP+++P L + S   F +S+P+ +FLA+YACT+K+MS  LP                    
Subjt:  MEEQSSGDGVSSEKLRSAV-QKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCTDEEMVEENGY

Query:  QEDIDECVEEEEEESVVEATKSNELLVDEEDKIDTGSGEAAAIEVTSVEFEGNG---GVDIGDEEEQLKETKGLLERIKDEGRSGDDVAEANGADDNVRE
                 + EE   V     +E   DE  KI  G GE AA    +  F G      + + ++EE  KE+  LLE+I+DEGR+  + +E    DD    
Subjt:  QEDIDECVEEEEEESVVEATKSNELLVDEEDKIDTGSGEAAAIEVTSVEFEGNG---GVDIGDEEEQLKETKGLLERIKDEGRSGDDVAEANGADDNVRE

Query:  LEIEATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAELPSVTIIDVVESGEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMK
                      N+ +             V  QP K EA        +    GE       T  +K + +   ++ ++ +NE +YSE ++ E +++++
Subjt:  LEIEATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAELPSVTIIDVVESGEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMK

Query:  KIVGYNATALGTYIDEVNALYAFVG-VEPPSQLEDSSDDDLNQLNQKLQFLMSIVGV
        K+VGY+     T  +E+ ALY F G VEPP    +    D+  L  +L+FLMS++G+
Subjt:  KIVGYNATALGTYIDEVNALYAFVG-VEPPSQLEDSSDDDLNQLNQKLQFLMSIVGV

AT5G36100.1 unknown protein (e value: 4.1e-20; identity: 27.73%)
Query:  MEEQSSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCTDEEMVEENGYQ
        MEE +  DG    K+    +K  +VGKK+L     + SAP ++P LV+ S     +S+PY  FL SY CTEK+M   LP                 N + 
Subjt:  MEEQSSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCTDEEMVEENGYQ

Query:  EDIDECVEEEEEESVVEATKSNELLVDEEDKIDTGSGEAAAIEVTSVEFEGNGGVDIG--DEEEQLKETKGLLERIKDEGRSGDDVAEANGADDNVRELE
           D        E V+   K     +   D  D      A  E   V+ E    + I   ++E+  KE K  LE I+DEG++   +              
Subjt:  EDIDECVEEEEEESVVEATKSNELLVDEEDKIDTGSGEAAAIEVTSVEFEGNGGVDIG--DEEEQLKETKGLLERIKDEGRSGDDVAEANGADDNVRELE

Query:  IEATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAELPSVTIIDVV-ESGEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKK
              G++ E           +E  +   S  P   ++E     + D++ +  E +++    +ES     +  ++ ++ +   LYSE +I  +I++++K
Subjt:  IEATGLGLLNEANSSALHPPAEYETSEVAVSSQPAKEEAELPSVTIIDVV-ESGEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKK

Query:  IVGYNATALGTYIDEVNALYAFVGVE-PPSQLEDSSDDDLNQLNQKLQFLMSIVGVK
        +VGYN T   TY +E+ ALY F GVE P S LE   + D+ ++++ L FLMS++G+K
Subjt:  IVGYNATALGTYIDEVNALYAFVGVE-PPSQLEDSSDDDLNQLNQKLQFLMSIVGVK

AT5G36100.2 unknown protein (e value: 2.0e-06; identity: 29.57%)
Query:  MEEQSSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCTDEEMVEENGYQ
        MEE +  DG    K+    +K  +VGKK+L     + SAP ++P LV+ S     +S+PY  FL SY CTEK+M   LP                 N + 
Subjt:  MEEQSSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCTDEEMVEENGYQ

Query:  EDIDECVEEEEEESVVEATKSNELLVDEEDKIDTGSGEAAAIEVTSVEFEGNGGVDIG--DEEEQLKETKGLLERIKDEGRSGDDV
           D        E V+   K     +   D  D      A  E   V+ E    + I   ++E+  KE K  LE I+DEG++   +
Subjt:  EDIDECVEEEEEESVVEATKSNELLVDEEDKIDTGSGEAAAIEVTSVEFEGNGGVDIG--DEEEQLKETKGLLERIKDEGRSGDDV


Sequences
CDS sequence
ATGTCTGCATGCACTCCCAATCTTCTCTCTCTCTCTTTCCTTCTCTCTATTCATTCTGCAACCGCCATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCCCTCTCTGCC
ATGCGACTCTTCTCCAGGGGTTCGAAGTAGGGCGGCGCTCCAGATTATGGCCGGAGAACTACCTGATCTCATTAGGTTTCTTCATCTGATCTTCTTTTGTTGTTTTGTTT
TGTTTCCTCCACTGTTTGAAACTCTGGGAAATTTTTGCAGGGTTTTGGTTCTCTGTGACCACTGTTGTGATGGTGTTAGTTTGACAGAAATGATGGAGGACAACCGCAGG
ATAGACTGGAACATGGAAGAACAGTCCTCCGGTGACGGAGTGAGCAGCGAAAAGCTGCGCTCCGCCGTGCAAAAGAGTTGCAATGTAGGGAAGAAGCTTCTCATCACTGG
TTTAGCCATATCTTCTGCTCCAGTGGTTCTTCCTCCATTGGTGATCATGTCAGCCTTTGGATTTGCAGCCTCAATTCCCTACGGAGTCTTCCTCGCTTCTTACGCATGCA
CCGAGAAGATCATGAGTGTTTGGCTTCCAATTCCTCCACCACCGGAACTCGATTGTACAGATGAAGAAATGGTGGAGGAAAACGGCTACCAGGAAGACATCGACGAGTGT
GTGGAAGAAGAAGAAGAAGAATCTGTGGTGGAGGCAACAAAGAGCAATGAGCTTCTGGTGGACGAAGAGGACAAAATCGATACCGGAAGTGGAGAGGCAGCAGCCATTGA
AGTAACCAGTGTAGAATTTGAAGGAAATGGAGGGGTTGATATTGGAGATGAGGAAGAACAGTTGAAAGAAACAAAGGGTTTGCTTGAAAGAATCAAGGATGAGGGAAGAA
GCGGTGATGATGTTGCTGAGGCAAATGGAGCTGATGATAATGTTCGAGAGCTGGAGATTGAAGCAACTGGGCTTGGTTTGTTGAACGAAGCCAACTCTTCTGCTCTTCAT
CCTCCTGCAGAATATGAAACTTCTGAAGTTGCAGTGTCAAGCCAGCCTGCCAAGGAAGAAGCTGAACTTCCTTCAGTGACAATAATTGATGTGGTGGAATCTGGGGAGGA
TTTGTCTGTGTCAGCTTTGACAATTGAATCTAAAGCTCAGCCAAATGCTCCACATGAAGAGCACAAGCTGCCTGCCAATGAGGACTTGTACAGTGAGGTGAAGATAAGGG
AACAAATTGATTCAATGAAGAAGATCGTAGGATACAACGCCACTGCGCTCGGAACCTACATAGATGAAGTGAATGCCCTCTATGCCTTTGTCGGAGTCGAGCCACCGTCC
CAACTCGAAGATTCTTCTGATGATGATCTCAATCAACTTAACCAGAAGTTGCAGTTTCTCATGTCCATAGTAGGGGTCAAATCTCAGCCGCCTCTTGCATGGAGGCAGCC
CAACGGAACCACCTCCATCAACCTGGCTACACTCTGTTTTACCTTCCCGAACTTCTTTGCAGAGTTTCTGCCGACCGCTGAACTTGCTCGATGCATGGGGAGAAGGAGCA
TAACACCCCATCTTGTTTCAACATTCTCTCAGCTGAAGGAATGGCCAACCAAGGTTGAGGAAGGAGCTACAGCTCTAGCTAGGGACGTTGTCAAAGATCCACTTCCTGTT
CCTGATTCTAGAACCAAACAGCCAGGAACTACTTCTAAATACATAATTATGAAGCTGATGTCCGCAATGTATAGAATCTGTGTCCTGTGGCTTAGTACCAAAGTCCATAA
TTCAGGAGTTGGAGCTAACAGTTTGGAGCAAATCGTTTTGAATAGACTCCTAACCAAAGAGGGAGGGGAATATGTACGGGGGAATATGAACAGTCTTTTTTGGCCATCCA
TACTTTTCTTCTATGCACTCAAAAGAGCAAGTGACCCCACAAAAGTTCAGAGGCCAAGCTTCTGTATTTTCTTTCGTCAACCAAAGATCAAGAGTCAAGACTTCTCAAGA
GTCAAGACTTCTCTTTTGCTTCATTCACGATGTGGGTTTCATATGCTAGCCCCTAATCTTTTGAGTTTCCTCTCCCCACCAACTCATTTCTCCTCTATTTTTCCCTTCTA
TTACCAAAGCATTTTAGATTCACAACGTTGCAATTTCCCATTGTTTTCTTTCCCAAATCAATCAAACTCTATGAACCCCAAAAACAACCACGAGATGGGGCCATCACCGA
AAAATAAAAATGATGAGAGAACATACCTAATGATGCCAATGACGACAACTTATAACCGAGAGATTCTTCAGCCGAATCCTTGCCGTAAATTGGAACTACACGTCGTGATG
TTCCGGTCGTACCTGGACTTGCTTCCAGAGGCTGAAACGATGGCGCTTCTCGAGAGTAGATGTCTTCAAAAGCGAGTAAATTTTGAGGGTTTGAGGGTTCGAGGGTAA
mRNA sequence
ATGTCTGCATGCACTCCCAATCTTCTCTCTCTCTCTTTCCTTCTCTCTATTCATTCTGCAACCGCCATCTCTCTCTCTCTCTCTCTCTCTCTCTCTCTCCCCTCTCTGCC
ATGCGACTCTTCTCCAGGGGTTCGAAGTAGGGCGGCGCTCCAGATTATGGCCGGAGAACTACCTGATCTCATTAGGTTTCTTCATCTGATCTTCTTTTGTTGTTTTGTTT
TGTTTCCTCCACTGTTTGAAACTCTGGGAAATTTTTGCAGGGTTTTGGTTCTCTGTGACCACTGTTGTGATGGTGTTAGTTTGACAGAAATGATGGAGGACAACCGCAGG
ATAGACTGGAACATGGAAGAACAGTCCTCCGGTGACGGAGTGAGCAGCGAAAAGCTGCGCTCCGCCGTGCAAAAGAGTTGCAATGTAGGGAAGAAGCTTCTCATCACTGG
TTTAGCCATATCTTCTGCTCCAGTGGTTCTTCCTCCATTGGTGATCATGTCAGCCTTTGGATTTGCAGCCTCAATTCCCTACGGAGTCTTCCTCGCTTCTTACGCATGCA
CCGAGAAGATCATGAGTGTTTGGCTTCCAATTCCTCCACCACCGGAACTCGATTGTACAGATGAAGAAATGGTGGAGGAAAACGGCTACCAGGAAGACATCGACGAGTGT
GTGGAAGAAGAAGAAGAAGAATCTGTGGTGGAGGCAACAAAGAGCAATGAGCTTCTGGTGGACGAAGAGGACAAAATCGATACCGGAAGTGGAGAGGCAGCAGCCATTGA
AGTAACCAGTGTAGAATTTGAAGGAAATGGAGGGGTTGATATTGGAGATGAGGAAGAACAGTTGAAAGAAACAAAGGGTTTGCTTGAAAGAATCAAGGATGAGGGAAGAA
GCGGTGATGATGTTGCTGAGGCAAATGGAGCTGATGATAATGTTCGAGAGCTGGAGATTGAAGCAACTGGGCTTGGTTTGTTGAACGAAGCCAACTCTTCTGCTCTTCAT
CCTCCTGCAGAATATGAAACTTCTGAAGTTGCAGTGTCAAGCCAGCCTGCCAAGGAAGAAGCTGAACTTCCTTCAGTGACAATAATTGATGTGGTGGAATCTGGGGAGGA
TTTGTCTGTGTCAGCTTTGACAATTGAATCTAAAGCTCAGCCAAATGCTCCACATGAAGAGCACAAGCTGCCTGCCAATGAGGACTTGTACAGTGAGGTGAAGATAAGGG
AACAAATTGATTCAATGAAGAAGATCGTAGGATACAACGCCACTGCGCTCGGAACCTACATAGATGAAGTGAATGCCCTCTATGCCTTTGTCGGAGTCGAGCCACCGTCC
CAACTCGAAGATTCTTCTGATGATGATCTCAATCAACTTAACCAGAAGTTGCAGTTTCTCATGTCCATAGTAGGGGTCAAATCTCAGCCGCCTCTTGCATGGAGGCAGCC
CAACGGAACCACCTCCATCAACCTGGCTACACTCTGTTTTACCTTCCCGAACTTCTTTGCAGAGTTTCTGCCGACCGCTGAACTTGCTCGATGCATGGGGAGAAGGAGCA
TAACACCCCATCTTGTTTCAACATTCTCTCAGCTGAAGGAATGGCCAACCAAGGTTGAGGAAGGAGCTACAGCTCTAGCTAGGGACGTTGTCAAAGATCCACTTCCTGTT
CCTGATTCTAGAACCAAACAGCCAGGAACTACTTCTAAATACATAATTATGAAGCTGATGTCCGCAATGTATAGAATCTGTGTCCTGTGGCTTAGTACCAAAGTCCATAA
TTCAGGAGTTGGAGCTAACAGTTTGGAGCAAATCGTTTTGAATAGACTCCTAACCAAAGAGGGAGGGGAATATGTACGGGGGAATATGAACAGTCTTTTTTGGCCATCCA
TACTTTTCTTCTATGCACTCAAAAGAGCAAGTGACCCCACAAAAGTTCAGAGGCCAAGCTTCTGTATTTTCTTTCGTCAACCAAAGATCAAGAGTCAAGACTTCTCAAGA
GTCAAGACTTCTCTTTTGCTTCATTCACGATGTGGGTTTCATATGCTAGCCCCTAATCTTTTGAGTTTCCTCTCCCCACCAACTCATTTCTCCTCTATTTTTCCCTTCTA
TTACCAAAGCATTTTAGATTCACAACGTTGCAATTTCCCATTGTTTTCTTTCCCAAATCAATCAAACTCTATGAACCCCAAAAACAACCACGAGATGGGGCCATCACCGA
AAAATAAAAATGATGAGAGAACATACCTAATGATGCCAATGACGACAACTTATAACCGAGAGATTCTTCAGCCGAATCCTTGCCGTAAATTGGAACTACACGTCGTGATG
TTCCGGTCGTACCTGGACTTGCTTCCAGAGGCTGAAACGATGGCGCTTCTCGAGAGTAGATGTCTTCAAAAGCGAGTAAATTTTGAGGGTTTGAGGGTTCGAGGGTAA
Protein sequence
MSACTPNLLSLSFLLSIHSATAISLSLSLSLSLPSLPCDSSPGVRSRAALQIMAGELPDLIRFLHLIFFCCFVLFPPLFETLGNFCRVLVLCDHCCDGVSLTEMMEDNRR
IDWNMEEQSSGDGVSSEKLRSAVQKSCNVGKKLLITGLAISSAPVVLPPLVIMSAFGFAASIPYGVFLASYACTEKIMSVWLPIPPPPELDCTDEEMVEENGYQEDIDEC
VEEEEEESVVEATKSNELLVDEEDKIDTGSGEAAAIEVTSVEFEGNGGVDIGDEEEQLKETKGLLERIKDEGRSGDDVAEANGADDNVRELEIEATGLGLLNEANSSALH
PPAEYETSEVAVSSQPAKEEAELPSVTIIDVVESGEDLSVSALTIESKAQPNAPHEEHKLPANEDLYSEVKIREQIDSMKKIVGYNATALGTYIDEVNALYAFVGVEPPS
QLEDSSDDDLNQLNQKLQFLMSIVGVKSQPPLAWRQPNGTTSINLATLCFTFPNFFAEFLPTAELARCMGRRSITPHLVSTFSQLKEWPTKVEEGATALARDVVKDPLPV
PDSRTKQPGTTSKYIIMKLMSAMYRICVLWLSTKVHNSGVGANSLEQIVLNRLLTKEGGEYVRGNMNSLFWPSILFFYALKRASDPTKVQRPSFCIFFRQPKIKSQDFSR
VKTSLLLHSRCGFHMLAPNLLSFLSPPTHFSSIFPFYYQSILDSQRCNFPLFSFPNQSNSMNPKNNHEMGPSPKNKNDERTYLMMPMTTTYNREILQPNPCRKLELHVVM
FRSYLDLLPEAETMALLESRCLQKRVNFEGLRVRG