; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS021072 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS021072
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold290:1027402..1030197
RNA-Seq ExpressionMS021072
SyntenyMS021072
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR036259 - MFS transporter superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577745.1 hypothetical protein SDJN03_25319, partial [Cucurbita argyrosperma subsp. sororia]6.2e-23574.65Show/hide
Query:  MAEQ-------EIQNLPTPRTASGGGRSERSTSTPRSASGGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIV
        MAEQ       EIQ+ P PR+ SG     R TST RS S GG R++TPD  S AAKLERAKEVYRAYEGHGE+PS++E+ GW FYELCS  V+TVLIP+V
Subjt:  MAEQ-------EIQNLPTPRTASGGGRSERSTSTPRSASGGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIV

Query:  FPIIISQIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSC
        FP+IISQI GA  EPPQGW KS MGFDC P EM+LYQ LTEHTIK+  + FSPLIWTSISWALGL++A P+LVFASFHLDYGFNQ +IA+ AVAAGA+SC
Subjt:  FPIIISQIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSC

Query:  LPIGVFRTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLS
        LP GVFRTVKIFPLYIVLIV+AHSVAFTSHTRHLGLMLRGL      KAKF+ RR G+GLISS S AVGGLG+AAISAFTYHMLRR+RQ KE D++HFLS
Subjt:  LPIGVFRTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLS

Query:  LWIVSIFGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLP
        LWIV+IFGGLKWLLG+VHVF+TNRS+S++IP DSELH ++IFKYPHAIG+VIS G LSSFTT+ +F    L+LIGQ+CFKP  +LYLWLIYFL+PLISLP
Subjt:  LWIVSIFGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLP

Query:  LLQQLQNRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALV
        LL Q Q RIK DASKMQ+LGFILSA TS  CFYFH+ AWR+  VF+FAALQGT+AALLH++GRVLVLDCSPAGKE AISMWFSW +AIG C+GFT+AA+V
Subjt:  LLQQLQNRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALV

Query:  PAKVQVSSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP
        P ++QVSSGVAFC AVVGGV+LI+GN+TDYGGAVAAGHV+ DSEKGSPV+GLESRSVSK+L SP
Subjt:  PAKVQVSSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP

KAG7015784.1 hypothetical protein SDJN02_23422, partial [Cucurbita argyrosperma subsp. argyrosperma]1.4e-23474.65Show/hide
Query:  MAEQ-------EIQNLPTPRTASGGGRSERSTSTPRSASGGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIV
        MAEQ       EIQ+ P PR+ SG     R TST RS S GG R++TPD  S AAKLERAKEVYRAYEGHGE+PS++E+ GW FYELCS  V+TVLIP+V
Subjt:  MAEQ-------EIQNLPTPRTASGGGRSERSTSTPRSASGGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIV

Query:  FPIIISQIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSC
        FP+IISQI GA  EPPQGW KS MGFDC P EM+LYQ LTEHTIK+  + FSPLIWTSISWALGL++A P+LVFASFHLDYGFNQ +IA+ AVAAGA+SC
Subjt:  FPIIISQIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSC

Query:  LPIGVFRTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLS
        LP GVFRTVKIFPLYIVLIV+AHSVAFTSHTRHLGLMLRGL      KAKF+ RR G+GLISS S AVGGLG+AAISAFTYHMLRR+RQ KE D++HFLS
Subjt:  LPIGVFRTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLS

Query:  LWIVSIFGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLP
        LWIV+IFGGLKWLLGIVHVF+TNRS+S++IP DSELH ++IFKYPHAIG+VIS G LSSFTT+ +F    L+LIGQ+CFKP  +LYLWLIYFL+PLISLP
Subjt:  LWIVSIFGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLP

Query:  LLQQLQNRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALV
        LL Q Q RIK DASKMQ+LGFILSA TS  CFYFH+ AWR+  VF+FA+LQGT+AALLH++GRVLVLDCSPAGKE AISMWFSW +AIG C+GFT+AA+V
Subjt:  LLQQLQNRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALV

Query:  PAKVQVSSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP
        P ++QVSSGVAFC AVVGGV+LI+GN+TDYGGAVAAGHV+ DSEKGSPV+GLESRSVSK+L SP
Subjt:  PAKVQVSSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP

XP_008448612.1 PREDICTED: uncharacterized protein LOC103490734 [Cucumis melo]1.7e-23274.69Show/hide
Query:  EQEIQNLPTPRTASGGGRSERSTSTPRSASGGGG--RRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIVFPIIISQ
        + EIQNLP  ++ SG     RS STPRSA+GGGG  RRETPD  STAAKLERAKEVY+AYEGHGERP++VEIVGW FYELCS FV+T+LIP+VFP+IISQ
Subjt:  EQEIQNLPTPRTASGGGRSERSTSTPRSASGGGG--RRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIVFPIIISQ

Query:  IGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSCLPIGVFR
        I G P  PPQGW KSFMGFDCP +EM+LYQ LTE TIK+ ++EFSPLIWTSISWA+GL++A P+L  ASFHLDYGFNQ +I LAAVAAGA++CLP G+F+
Subjt:  IGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSCLPIGVFR

Query:  TVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLSLWIVSIF
        TVKIFPLYIVLIV+AHSVAFTSHTRHLGLMLRGL      KAKFS RRIG+G ISS+S AVGG+G++ ISAFTYHMLRRD+QV+E  D+HFL+LWIV+IF
Subjt:  TVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLSLWIVSIF

Query:  GGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLPLLQQLQN
         GLKWL+GI HVF+TNRS+SISIP +SELH +SIFKYP+AI +VISGG LSSF T+ IFT  LL+LIGQ+CFKPV +LYL LIYFLVPLISLPLL Q Q 
Subjt:  GGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLPLLQQLQN

Query:  RIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALVPAKVQVS
        RIK DASKM +LGFILSAATS TCFYFH   WR+  VF+FA LQGT+AA+LHA+GR LVLDCSPAGKE AISMWFSW ++IG C+GFT+AA+VPA++QVS
Subjt:  RIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALVPAKVQVS

Query:  SGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP
        SGV FC AVVGGV+LIFGN+TDY GAVAAGHVR+DSEKGSPVIGL+SRS SK+L SP
Subjt:  SGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP

XP_022965332.1 uncharacterized protein LOC111465229 isoform X1 [Cucurbita maxima]1.5e-23675.71Show/hide
Query:  MAEQ-------EIQNLPTPRTASGGGRSERSTSTPRSASGGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIV
        MAEQ       EIQN P PR+ SG     R TST RS S GG R++TPD  S AAKLERAKEVYRAYEGHGE+PS++E+ GW FYELCS  V+TVLIP+V
Subjt:  MAEQ-------EIQNLPTPRTASGGGRSERSTSTPRSASGGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIV

Query:  FPIIISQIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSC
        FP+IISQI GA  EPPQGW +SFMGFDCPP EM+LYQ LT+HTIKI  + FSPLIWTSISWALGL++A P+L FASFHLDYGFNQ +IA+ AVAAGA+SC
Subjt:  FPIIISQIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSC

Query:  LPIGVFRTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLS
        LP GVFRTVKIFPLYIVLIV+AHSVAFTSHTRHLGLMLRGL      KAKF+ RR G+GLISS S AVGGLG+AAISAFTYHMLRR+RQ KE DD+HFLS
Subjt:  LPIGVFRTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLS

Query:  LWIVSIFGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLP
        LWIV+IFGGLKWLLGI HVF+TNRS+S++IP DSELH ++IFKYPHAIG+VIS G LSSFTT+ IF    L+LIGQ+CFKPV +LYLWLIYFL+PLISLP
Subjt:  LWIVSIFGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLP

Query:  LLQQLQNRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALV
        LL Q Q RIK DASKMQ+LGFILSA TS  CFYFH+ AWR   VF+FAALQGT+AALLH +GRVLVLDCSPAGKE AISMWFSW +AIG C+GFT+AA+V
Subjt:  LLQQLQNRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALV

Query:  PAKVQVSSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP
        PA++QVSSGVAFC AVVGGV+LI+GNITDYGGAV+AGHV+ DSEKGSPVIGLESRSVSK+L SP
Subjt:  PAKVQVSSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP

XP_022965333.1 uncharacterized protein LOC111465229 isoform X2 [Cucurbita maxima]3.1e-23475.53Show/hide
Query:  MAEQ-------EIQNLPTPRTASGGGRSERSTSTPRSASGGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIV
        MAEQ       EIQN P PR+ SG     R TST RS S GG R++TPD  S AAKLERAKEVYRAYEGHGE+PS++E+ GW FYELCS  V+TVLIP+V
Subjt:  MAEQ-------EIQNLPTPRTASGGGRSERSTSTPRSASGGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIV

Query:  FPIIISQIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSC
        FP+IISQI GA  EPPQGW +SFMGFDCPP EM+LYQ LT+HTIKI  + FSPLIWTSISWALGL++A P+L FASFHLDYGFNQ +IA+ AVAAGA+SC
Subjt:  FPIIISQIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSC

Query:  LPIGVFRTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLS
        LP GVFRTVKIFPLYIVLIV+AHSVAFTSHTRHLGLMLRGL      KAKF+ RR G+GLISS S AVGGLG+AAISAFTYHMLR  RQ KE DD+HFLS
Subjt:  LPIGVFRTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLS

Query:  LWIVSIFGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLP
        LWIV+IFGGLKWLLGI HVF+TNRS+S++IP DSELH ++IFKYPHAIG+VIS G LSSFTT+ IF    L+LIGQ+CFKPV +LYLWLIYFL+PLISLP
Subjt:  LWIVSIFGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLP

Query:  LLQQLQNRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALV
        LL Q Q RIK DASKMQ+LGFILSA TS  CFYFH+ AWR   VF+FAALQGT+AALLH +GRVLVLDCSPAGKE AISMWFSW +AIG C+GFT+AA+V
Subjt:  LLQQLQNRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALV

Query:  PAKVQVSSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP
        PA++QVSSGVAFC AVVGGV+LI+GNITDYGGAV+AGHV+ DSEKGSPVIGLESRSVSK+L SP
Subjt:  PAKVQVSSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP

TrEMBL top hitse value%identityAlignment
A0A0A0L1Q8 Uncharacterized protein4.2e-22171.51Show/hide
Query:  EQEIQNLPTPRTASGGGRSERSTSTPRSAS---GGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIVFPIIIS
        + EI NLP P++ S      RS STPRSA+   GGG RRETPD  STAAKLERAKEVYRAYEGHGERP++ EI+GW FYELCS FV+ +LIP+VFP+IIS
Subjt:  EQEIQNLPTPRTASGGGRSERSTSTPRSAS---GGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIVFPIIIS

Query:  QIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSCLPIGVF
        QI G P  PPQGW KSF GFDC  +EM+LYQ LTE TI + +++FSPLIWTSISWA+GL++A P+L  ASFHLDYGF+Q +I LAAVAAGA++CLP G F
Subjt:  QIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSCLPIGVF

Query:  RTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLSLWIVSI
        +TVKIFPLYI+LIV+AHSVA TSHTRHLGLMLRGL      KAKFS R IG+G ISS+S  VGG+G+AAISAFTYHMLR D+QV +  D HFL+LWIV+I
Subjt:  RTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLSLWIVSI

Query:  FGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLPLLQQLQ
        F GLKWL+GI HVF+TNRS+S+SIP DSE+H +SIFKYPHAI +VISGG LSSF T+ IFT  LL+LI Q+CFKPV + YL LIYFLVPLISLPLL QLQ
Subjt:  FGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLPLLQQLQ

Query:  NRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALVPAKVQV
         RIK DASKM +LGFILSAATS TCFYFH  AW++  VF+FA LQGT+AA+LHA+GR LV+ CSPAGKE AISMWFSW +AIG C+GFT+AA+VP  +QV
Subjt:  NRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALVPAKVQV

Query:  SSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP
        SSGV FC AVVGG+LLIFGN+TDY GAVAAGHVR+DSEKGSPV GL+SRS SK+L SP
Subjt:  SSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP

A0A1S3BK45 uncharacterized protein LOC1034907348.1e-23374.69Show/hide
Query:  EQEIQNLPTPRTASGGGRSERSTSTPRSASGGGG--RRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIVFPIIISQ
        + EIQNLP  ++ SG     RS STPRSA+GGGG  RRETPD  STAAKLERAKEVY+AYEGHGERP++VEIVGW FYELCS FV+T+LIP+VFP+IISQ
Subjt:  EQEIQNLPTPRTASGGGRSERSTSTPRSASGGGG--RRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIVFPIIISQ

Query:  IGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSCLPIGVFR
        I G P  PPQGW KSFMGFDCP +EM+LYQ LTE TIK+ ++EFSPLIWTSISWA+GL++A P+L  ASFHLDYGFNQ +I LAAVAAGA++CLP G+F+
Subjt:  IGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSCLPIGVFR

Query:  TVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLSLWIVSIF
        TVKIFPLYIVLIV+AHSVAFTSHTRHLGLMLRGL      KAKFS RRIG+G ISS+S AVGG+G++ ISAFTYHMLRRD+QV+E  D+HFL+LWIV+IF
Subjt:  TVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLSLWIVSIF

Query:  GGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLPLLQQLQN
         GLKWL+GI HVF+TNRS+SISIP +SELH +SIFKYP+AI +VISGG LSSF T+ IFT  LL+LIGQ+CFKPV +LYL LIYFLVPLISLPLL Q Q 
Subjt:  GGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLPLLQQLQN

Query:  RIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALVPAKVQVS
        RIK DASKM +LGFILSAATS TCFYFH   WR+  VF+FA LQGT+AA+LHA+GR LVLDCSPAGKE AISMWFSW ++IG C+GFT+AA+VPA++QVS
Subjt:  RIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALVPAKVQVS

Query:  SGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP
        SGV FC AVVGGV+LIFGN+TDY GAVAAGHVR+DSEKGSPVIGL+SRS SK+L SP
Subjt:  SGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP

A0A5D3CJT7 Uncharacterized protein8.1e-23374.69Show/hide
Query:  EQEIQNLPTPRTASGGGRSERSTSTPRSASGGGG--RRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIVFPIIISQ
        + EIQNLP  ++ SG     RS STPRSA+GGGG  RRETPD  STAAKLERAKEVY+AYEGHGERP++VEIVGW FYELCS FV+T+LIP+VFP+IISQ
Subjt:  EQEIQNLPTPRTASGGGRSERSTSTPRSASGGGG--RRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIVFPIIISQ

Query:  IGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSCLPIGVFR
        I G P  PPQGW KSFMGFDCP +EM+LYQ LTE TIK+ ++EFSPLIWTSISWA+GL++A P+L  ASFHLDYGFNQ +I LAAVAAGA++CLP G+F+
Subjt:  IGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSCLPIGVFR

Query:  TVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLSLWIVSIF
        TVKIFPLYIVLIV+AHSVAFTSHTRHLGLMLRGL      KAKFS RRIG+G ISS+S AVGG+G++ ISAFTYHMLRRD+QV+E  D+HFL+LWIV+IF
Subjt:  TVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLSLWIVSIF

Query:  GGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLPLLQQLQN
         GLKWL+GI HVF+TNRS+SISIP +SELH +SIFKYP+AI +VISGG LSSF T+ IFT  LL+LIGQ+CFKPV +LYL LIYFLVPLISLPLL Q Q 
Subjt:  GGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLPLLQQLQN

Query:  RIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALVPAKVQVS
        RIK DASKM +LGFILSAATS TCFYFH   WR+  VF+FA LQGT+AA+LHA+GR LVLDCSPAGKE AISMWFSW ++IG C+GFT+AA+VPA++QVS
Subjt:  RIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALVPAKVQVS

Query:  SGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP
        SGV FC AVVGGV+LIFGN+TDY GAVAAGHVR+DSEKGSPVIGL+SRS SK+L SP
Subjt:  SGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP

A0A6J1HK19 uncharacterized protein LOC111465229 isoform X17.1e-23775.71Show/hide
Query:  MAEQ-------EIQNLPTPRTASGGGRSERSTSTPRSASGGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIV
        MAEQ       EIQN P PR+ SG     R TST RS S GG R++TPD  S AAKLERAKEVYRAYEGHGE+PS++E+ GW FYELCS  V+TVLIP+V
Subjt:  MAEQ-------EIQNLPTPRTASGGGRSERSTSTPRSASGGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIV

Query:  FPIIISQIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSC
        FP+IISQI GA  EPPQGW +SFMGFDCPP EM+LYQ LT+HTIKI  + FSPLIWTSISWALGL++A P+L FASFHLDYGFNQ +IA+ AVAAGA+SC
Subjt:  FPIIISQIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSC

Query:  LPIGVFRTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLS
        LP GVFRTVKIFPLYIVLIV+AHSVAFTSHTRHLGLMLRGL      KAKF+ RR G+GLISS S AVGGLG+AAISAFTYHMLRR+RQ KE DD+HFLS
Subjt:  LPIGVFRTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLS

Query:  LWIVSIFGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLP
        LWIV+IFGGLKWLLGI HVF+TNRS+S++IP DSELH ++IFKYPHAIG+VIS G LSSFTT+ IF    L+LIGQ+CFKPV +LYLWLIYFL+PLISLP
Subjt:  LWIVSIFGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLP

Query:  LLQQLQNRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALV
        LL Q Q RIK DASKMQ+LGFILSA TS  CFYFH+ AWR   VF+FAALQGT+AALLH +GRVLVLDCSPAGKE AISMWFSW +AIG C+GFT+AA+V
Subjt:  LLQQLQNRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALV

Query:  PAKVQVSSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP
        PA++QVSSGVAFC AVVGGV+LI+GNITDYGGAV+AGHV+ DSEKGSPVIGLESRSVSK+L SP
Subjt:  PAKVQVSSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP

A0A6J1HNK9 uncharacterized protein LOC111465229 isoform X21.5e-23475.53Show/hide
Query:  MAEQ-------EIQNLPTPRTASGGGRSERSTSTPRSASGGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIV
        MAEQ       EIQN P PR+ SG     R TST RS S GG R++TPD  S AAKLERAKEVYRAYEGHGE+PS++E+ GW FYELCS  V+TVLIP+V
Subjt:  MAEQ-------EIQNLPTPRTASGGGRSERSTSTPRSASGGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIV

Query:  FPIIISQIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSC
        FP+IISQI GA  EPPQGW +SFMGFDCPP EM+LYQ LT+HTIKI  + FSPLIWTSISWALGL++A P+L FASFHLDYGFNQ +IA+ AVAAGA+SC
Subjt:  FPIIISQIGGAPPEPPQGWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSC

Query:  LPIGVFRTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLS
        LP GVFRTVKIFPLYIVLIV+AHSVAFTSHTRHLGLMLRGL      KAKF+ RR G+GLISS S AVGGLG+AAISAFTYHMLR  RQ KE DD+HFLS
Subjt:  LPIGVFRTVKIFPLYIVLIVVAHSVAFTSHTRHLGLMLRGLA-----KAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLS

Query:  LWIVSIFGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLP
        LWIV+IFGGLKWLLGI HVF+TNRS+S++IP DSELH ++IFKYPHAIG+VIS G LSSFTT+ IF    L+LIGQ+CFKPV +LYLWLIYFL+PLISLP
Subjt:  LWIVSIFGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIFKYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLP

Query:  LLQQLQNRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALV
        LL Q Q RIK DASKMQ+LGFILSA TS  CFYFH+ AWR   VF+FAALQGT+AALLH +GRVLVLDCSPAGKE AISMWFSW +AIG C+GFT+AA+V
Subjt:  LLQQLQNRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQGTSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALV

Query:  PAKVQVSSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP
        PA++QVSSGVAFC AVVGGV+LI+GNITDYGGAV+AGHV+ DSEKGSPVIGLESRSVSK+L SP
Subjt:  PAKVQVSSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLGSP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAGCAGGAGATCCAAAACCTACCGACGCCGAGGACCGCGAGCGGGGGCGGCCGGAGCGAGAGGAGCACATCGACACCGAGGTCGGCGAGCGGCGGCGGAGGGCG
GAGGGAGACGCCGGATGTTCAGAGCACGGCGGCGAAGCTGGAGAGGGCGAAGGAGGTATACAGGGCGTACGAAGGGCACGGAGAGAGGCCATCGGTTGTGGAGATTGTGG
GATGGAGCTTCTACGAGCTGTGTTCTTCGTTTGTTGTGACGGTTCTGATTCCGATTGTGTTTCCGATCATTATCAGCCAGATCGGCGGCGCTCCGCCGGAGCCGCCGCAG
GGCTGGTCCAAGAGTTTCATGGGCTTCGATTGCCCTCCTAAAGAAATGAAACTGTACCAGAAGCTGACGGAGCATACGATAAAGATCGGCAGCTCCGAATTTTCGCCATT
AATTTGGACATCAATTTCATGGGCTTTGGGCCTTCTTGTGGCCGCCCCTGTCCTTGTCTTCGCCTCCTTCCACCTCGATTACGGCTTCAATCAACAGATCATCGCTCTCG
CCGCCGTCGCCGCCGGCGCCGTCTCCTGTCTTCCGATCGGCGTCTTCAGAACCGTCAAGATTTTCCCTCTCTACATCGTCCTCATCGTCGTCGCTCACTCCGTCGCCTTC
ACCTCCCACACGCGCCACCTCGGCCTCATGCTCCGCGGCCTCGCCAAGGCCAAATTCTCCGACAGAAGAATTGGCGCCGGCCTGATCTCGTCGTACTCCGTCGCCGTAGG
TGGCTTGGGCTCCGCCGCCATCTCCGCCTTCACCTACCACATGCTTCGACGTGACAGGCAGGTCAAAGAGGAAGACGACGACCACTTTCTTAGCCTGTGGATCGTGTCGA
TCTTCGGTGGCCTGAAATGGCTTCTCGGAATTGTTCACGTTTTCGTTACGAATCGATCGCTCTCCATCAGCATCCCTTGCGATTCAGAACTCCATTGTATTTCGATTTTC
AAGTATCCTCACGCAATCGGAAGCGTAATCTCCGGCGGAATTCTTTCTTCTTTCACTACTATGTGCATCTTCACCGGAGCTTTGCTGTATCTAATCGGCCAACTCTGCTT
CAAACCAGTGTTCGTTCTCTATCTCTGGCTAATCTACTTCCTCGTCCCTCTGATTTCACTGCCATTACTGCAACAATTGCAGAATCGCATCAAAGTCGACGCCTCCAAAA
TGCAACTCCTAGGTTTCATCCTCTCCGCCGCCACTTCCGGCACTTGCTTCTACTTCCACGACGGCGCGTGGCGGAAACGCGCCGTGTTCCTCTTCGCCGCGCTCCAAGGA
ACATCGGCCGCTCTTCTGCACGCTTTCGGAAGAGTTCTGGTGCTCGATTGCTCGCCTGCGGGGAAAGAAGGTGCCATTTCGATGTGGTTTTCGTGGAAGAAAGCAATCGG
CGCTTGCATTGGTTTCACGATCGCAGCCCTAGTTCCAGCGAAGGTGCAGGTTTCTTCCGGCGTGGCGTTCTGCAGCGCCGTCGTCGGAGGAGTGCTGTTGATCTTCGGGA
ACATCACCGATTACGGCGGCGCTGTGGCGGCGGGGCATGTCAGAGAGGATAGCGAAAAAGGCTCGCCGGTGATCGGATTGGAGTCTCGGAGTGTGAGTAAGGACCTTGGA
TCGCCG
mRNA sequenceShow/hide mRNA sequence
ATGGCCGAGCAGGAGATCCAAAACCTACCGACGCCGAGGACCGCGAGCGGGGGCGGCCGGAGCGAGAGGAGCACATCGACACCGAGGTCGGCGAGCGGCGGCGGAGGGCG
GAGGGAGACGCCGGATGTTCAGAGCACGGCGGCGAAGCTGGAGAGGGCGAAGGAGGTATACAGGGCGTACGAAGGGCACGGAGAGAGGCCATCGGTTGTGGAGATTGTGG
GATGGAGCTTCTACGAGCTGTGTTCTTCGTTTGTTGTGACGGTTCTGATTCCGATTGTGTTTCCGATCATTATCAGCCAGATCGGCGGCGCTCCGCCGGAGCCGCCGCAG
GGCTGGTCCAAGAGTTTCATGGGCTTCGATTGCCCTCCTAAAGAAATGAAACTGTACCAGAAGCTGACGGAGCATACGATAAAGATCGGCAGCTCCGAATTTTCGCCATT
AATTTGGACATCAATTTCATGGGCTTTGGGCCTTCTTGTGGCCGCCCCTGTCCTTGTCTTCGCCTCCTTCCACCTCGATTACGGCTTCAATCAACAGATCATCGCTCTCG
CCGCCGTCGCCGCCGGCGCCGTCTCCTGTCTTCCGATCGGCGTCTTCAGAACCGTCAAGATTTTCCCTCTCTACATCGTCCTCATCGTCGTCGCTCACTCCGTCGCCTTC
ACCTCCCACACGCGCCACCTCGGCCTCATGCTCCGCGGCCTCGCCAAGGCCAAATTCTCCGACAGAAGAATTGGCGCCGGCCTGATCTCGTCGTACTCCGTCGCCGTAGG
TGGCTTGGGCTCCGCCGCCATCTCCGCCTTCACCTACCACATGCTTCGACGTGACAGGCAGGTCAAAGAGGAAGACGACGACCACTTTCTTAGCCTGTGGATCGTGTCGA
TCTTCGGTGGCCTGAAATGGCTTCTCGGAATTGTTCACGTTTTCGTTACGAATCGATCGCTCTCCATCAGCATCCCTTGCGATTCAGAACTCCATTGTATTTCGATTTTC
AAGTATCCTCACGCAATCGGAAGCGTAATCTCCGGCGGAATTCTTTCTTCTTTCACTACTATGTGCATCTTCACCGGAGCTTTGCTGTATCTAATCGGCCAACTCTGCTT
CAAACCAGTGTTCGTTCTCTATCTCTGGCTAATCTACTTCCTCGTCCCTCTGATTTCACTGCCATTACTGCAACAATTGCAGAATCGCATCAAAGTCGACGCCTCCAAAA
TGCAACTCCTAGGTTTCATCCTCTCCGCCGCCACTTCCGGCACTTGCTTCTACTTCCACGACGGCGCGTGGCGGAAACGCGCCGTGTTCCTCTTCGCCGCGCTCCAAGGA
ACATCGGCCGCTCTTCTGCACGCTTTCGGAAGAGTTCTGGTGCTCGATTGCTCGCCTGCGGGGAAAGAAGGTGCCATTTCGATGTGGTTTTCGTGGAAGAAAGCAATCGG
CGCTTGCATTGGTTTCACGATCGCAGCCCTAGTTCCAGCGAAGGTGCAGGTTTCTTCCGGCGTGGCGTTCTGCAGCGCCGTCGTCGGAGGAGTGCTGTTGATCTTCGGGA
ACATCACCGATTACGGCGGCGCTGTGGCGGCGGGGCATGTCAGAGAGGATAGCGAAAAAGGCTCGCCGGTGATCGGATTGGAGTCTCGGAGTGTGAGTAAGGACCTTGGA
TCGCCG
Protein sequenceShow/hide protein sequence
MAEQEIQNLPTPRTASGGGRSERSTSTPRSASGGGGRRETPDVQSTAAKLERAKEVYRAYEGHGERPSVVEIVGWSFYELCSSFVVTVLIPIVFPIIISQIGGAPPEPPQ
GWSKSFMGFDCPPKEMKLYQKLTEHTIKIGSSEFSPLIWTSISWALGLLVAAPVLVFASFHLDYGFNQQIIALAAVAAGAVSCLPIGVFRTVKIFPLYIVLIVVAHSVAF
TSHTRHLGLMLRGLAKAKFSDRRIGAGLISSYSVAVGGLGSAAISAFTYHMLRRDRQVKEEDDDHFLSLWIVSIFGGLKWLLGIVHVFVTNRSLSISIPCDSELHCISIF
KYPHAIGSVISGGILSSFTTMCIFTGALLYLIGQLCFKPVFVLYLWLIYFLVPLISLPLLQQLQNRIKVDASKMQLLGFILSAATSGTCFYFHDGAWRKRAVFLFAALQG
TSAALLHAFGRVLVLDCSPAGKEGAISMWFSWKKAIGACIGFTIAALVPAKVQVSSGVAFCSAVVGGVLLIFGNITDYGGAVAAGHVREDSEKGSPVIGLESRSVSKDLG
SP