; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg021277 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg021277
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold6:47326071..47330532
RNA-Seq ExpressionSpg021277
SyntenySpg021277
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR036259 - MFS transporter superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6577745.1 hypothetical protein SDJN03_25319, partial [Cucurbita argyrosperma subsp. sororia]1.4e-25583.78Show/hide
Query:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL
        MAEQSPRP   EIQ+ PP RS S GR  ST R  SG GGSR++TPD HS AAKLERAKEVYRAYEGHGE+P+I+E+ GWCFYELCSLSV TVLIPVVFPL
Subjt:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL

Query:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV
        IISQISGA TEPPQGWFKS MGFDC P E +LYQ LTEHTIKV+ + FSPLIWTS+SWALGL+LAGPIL FASFHLDYGFNQHLI + AVAAGALSCLP 
Subjt:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV

Query:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLR--RLVKEEDETHFLSLWI
        GVF+TVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGL GPTV KA+FA RR GSG ISSCS AVGGLG+AAISAFTYHMLR  R  KE DE HFLSLWI
Subjt:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLR--RLVKEEDETHFLSLWI

Query:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH
        VTIFGGLKWLLG+ HVF+TNRS+SVTIPSDSELH+L+IFKYPHAIG+VIS GFLSSF TI +F AV LFLIGQICFKP LILYLWLIYFLIPLISLPLLH
Subjt:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH

Query:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK
        QFQIRIK+DASKMQILGFILSA TSAICFYFH+ AWRR VVFVFA LQGTAAA+L +YGRVLVLDCSPAGKE AISMWFSWMRAIGGC GFTVAAVVP +
Subjt:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK

Query:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP
        LQVS+GVAFC AVVGGVVLIYGNVTDY GAVAAGHV++DSEKGSPV+GLESRS SKELESP
Subjt:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP

KAG7015784.1 hypothetical protein SDJN02_23422, partial [Cucurbita argyrosperma subsp. argyrosperma]2.4e-25583.96Show/hide
Query:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL
        MAEQSPRP   EIQ+ PP RS S GR  ST R  SG GGSR++TPD HS AAKLERAKEVYRAYEGHGE+P+I+E+ GWCFYELCSLSV TVLIPVVFPL
Subjt:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL

Query:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV
        IISQISGA TEPPQGWFKS MGFDC P E +LYQ LTEHTIKV+ + FSPLIWTS+SWALGL+LAGPIL FASFHLDYGFNQHLI + AVAAGALSCLP 
Subjt:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV

Query:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLR--RLVKEEDETHFLSLWI
        GVF+TVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGL GPTV KA+FA RR GSG ISSCS AVGGLG+AAISAFTYHMLR  R  KE DE HFLSLWI
Subjt:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLR--RLVKEEDETHFLSLWI

Query:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH
        VTIFGGLKWLLGI HVF+TNRS+SVTIPSDSELH+L+IFKYPHAIG+VIS GFLSSF TI +F AV LFLIGQICFKP LILYLWLIYFLIPLISLPLLH
Subjt:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH

Query:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK
        QFQIRIK+DASKMQILGFILSA TSAICFYFH+ AWRR VVFVFA LQGTAAA+L +YGRVLVLDCSPAGKE AISMWFSWMRAIGGC GFTVAAVVP +
Subjt:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK

Query:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP
        LQVS+GVAFC AVVGGVVLIYGNVTDY GAVAAGHV++DSEKGSPV+GLESRS SKELESP
Subjt:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP

XP_008448612.1 PREDICTED: uncharacterized protein LOC103490734 [Cucumis melo]2.4e-25584.14Show/hide
Query:  AEQSPRP---EIQNLPPARSASGGRSMSTPRSAS-GGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL
        AEQSPRP   EIQNLPP++S S GRS+STPRSA+ GGGGSRRETPD HSTAAKLERAKEVY+AYEGHGERPTIVEIVGWCFYELCS  V T+LIPVVFPL
Subjt:  AEQSPRP---EIQNLPPARSASGGRSMSTPRSAS-GGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL

Query:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV
        IISQISG  T PPQGWFKSFMGFDCP +E +LYQ LTE TIKV+ +EFSPLIWTS+SWA+GLVLAGPILA ASFHLDYGFNQHLITLAAVAAGAL+CLP 
Subjt:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV

Query:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLRR--LVKEEDETHFLSLWI
        G+FKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGL GP V KA+F+ RR GSG ISS SAAVGG+G++ ISAFTYHMLRR   V+E  + HFL+LWI
Subjt:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLRR--LVKEEDETHFLSLWI

Query:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH
        VTIF GLKWL+GIFHVF+TNRS+S++IPS+SELHILSIFKYP+AI +VISGGFLSSFATI IFTAVLLFLIGQICFKPVLILYL LIYFL+PLISLPLLH
Subjt:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH

Query:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK
        QFQIRIK+DASKM ILGFILSA TSA CFYFH+  WRR +VFVFAVLQGTAAAVL AYGR LVLDCSPAGKE AISMWFSWMR+IGGC GFTVAAVVPA+
Subjt:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK

Query:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP
        LQVS+GV FC AVVGGVVLI+GNVTDYDGAVAAGHVRDDSEKGSPVIGL+SRSESKELESP
Subjt:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP

XP_022965332.1 uncharacterized protein LOC111465229 isoform X1 [Cucurbita maxima]7.6e-25783.96Show/hide
Query:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL
        MAEQSPRP   EIQN PP RS S GR  ST R  SG GGSR++TPD HS AAKLERAKEVYRAYEGHGE+P+I+E+ GWCFYELCSLSV TVLIPVVFPL
Subjt:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL

Query:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV
        IISQISGA  EPPQGWF+SFMGFDCPP E +LYQ LT+HTIK++ + FSPLIWTS+SWALGL++AGPILAFASFHLDYGFNQHLI + AVAAGALSCLP 
Subjt:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV

Query:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLR--RLVKEEDETHFLSLWI
        GVF+TVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGL GPTV KA+FA RR GSG ISSCS AVGGLG+AAISAFTYHMLR  R  KE D+ HFLSLWI
Subjt:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLR--RLVKEEDETHFLSLWI

Query:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH
        VTIFGGLKWLLGIFHVF+TNRS+SVTIPSDSELH+L+IFKYPHAIG+VIS GFLSSF TI IF AV LFLIGQICFKPVLILYLWLIYFLIPLISLPLLH
Subjt:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH

Query:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK
        QFQIRIK+DASKMQILGFILSA TSAICFYFH+ AWR  VVFVFA LQGTAAA+L  YGRVLVLDCSPAGKE AISMWFSWMRAIGGC GFTVAAVVPA+
Subjt:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK

Query:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP
        LQVS+GVAFC AVVGGVVLIYGN+TDY GAV+AGHV++DSEKGSPVIGLESRS SKELESP
Subjt:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP

XP_022965333.1 uncharacterized protein LOC111465229 isoform X2 [Cucurbita maxima]2.3e-25884.26Show/hide
Query:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL
        MAEQSPRP   EIQN PP RS S GR  ST R  SG GGSR++TPD HS AAKLERAKEVYRAYEGHGE+P+I+E+ GWCFYELCSLSV TVLIPVVFPL
Subjt:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL

Query:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV
        IISQISGA  EPPQGWF+SFMGFDCPP E +LYQ LT+HTIK++ + FSPLIWTS+SWALGL++AGPILAFASFHLDYGFNQHLI + AVAAGALSCLP 
Subjt:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV

Query:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLRRLVKEEDETHFLSLWIVT
        GVF+TVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGL GPTV KA+FA RR GSG ISSCS AVGGLG+AAISAFTYHMLRR  KE D+ HFLSLWIVT
Subjt:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLRRLVKEEDETHFLSLWIVT

Query:  IFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQF
        IFGGLKWLLGIFHVF+TNRS+SVTIPSDSELH+L+IFKYPHAIG+VIS GFLSSF TI IF AV LFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQF
Subjt:  IFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQF

Query:  QIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAKLQ
        QIRIK+DASKMQILGFILSA TSAICFYFH+ AWR  VVFVFA LQGTAAA+L  YGRVLVLDCSPAGKE AISMWFSWMRAIGGC GFTVAAVVPA+LQ
Subjt:  QIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAKLQ

Query:  VSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP
        VS+GVAFC AVVGGVVLIYGN+TDY GAV+AGHV++DSEKGSPVIGLESRS SKELESP
Subjt:  VSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP

TrEMBL top hitse value%identityAlignment
A0A0A0L1Q8 Uncharacterized protein4.3e-24279.89Show/hide
Query:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSAS--GGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVF
        M EQSPRP   EI NLPP +S S  RS+STPRSA+  GGGGSRRETPD HSTAAKLERAKEVYRAYEGHGERPTI EI+GWCFYELCS  V  +LIPVVF
Subjt:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSAS--GGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVF

Query:  PLIISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCL
        PLIISQISG  T PPQGWFKSF GFDC  +E +LYQ LTE TI V+ ++FSPLIWTS+SWA+GLVLAGPILA ASFHLDYGF+Q+LITLAAVAAGAL+CL
Subjt:  PLIISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCL

Query:  PVGVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLRRLVKEED-ETHFLSLW
        P G FKTVKIFPLYI+LIVIAHSVA TSHTRHLGLMLRGL GP + KA+F+ R  GSG ISS SA VGG+G+AAISAFTYHMLR   + +  ++HFL+LW
Subjt:  PVGVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLRRLVKEED-ETHFLSLW

Query:  IVTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLL
        IVTIF GLKWL+GIFHVF+TNRS+SV+IPSDSE+HILSIFKYPHAI +VISGGFLSSFATI IFT+VLLFLI QICFKPVLI YL LIYFL+PLISLPLL
Subjt:  IVTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLL

Query:  HQFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPA
        HQ QIRIK+DASKM ILGFILSA TSA CFYFH+ AW+R +VFVFAVLQGTAAAVL AYGR LV+ CSPAGKE AISMWFSWMRAIGGC GFTVAAVVP 
Subjt:  HQFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPA

Query:  KLQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP
         LQVS+GV FC AVVGG++LI+GNVTDYDGAVAAGHVRDDSEKGSPV GL+SRSESKELESP
Subjt:  KLQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP

A0A1S3BK45 uncharacterized protein LOC1034907341.2e-25584.14Show/hide
Query:  AEQSPRP---EIQNLPPARSASGGRSMSTPRSAS-GGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL
        AEQSPRP   EIQNLPP++S S GRS+STPRSA+ GGGGSRRETPD HSTAAKLERAKEVY+AYEGHGERPTIVEIVGWCFYELCS  V T+LIPVVFPL
Subjt:  AEQSPRP---EIQNLPPARSASGGRSMSTPRSAS-GGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL

Query:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV
        IISQISG  T PPQGWFKSFMGFDCP +E +LYQ LTE TIKV+ +EFSPLIWTS+SWA+GLVLAGPILA ASFHLDYGFNQHLITLAAVAAGAL+CLP 
Subjt:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV

Query:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLRR--LVKEEDETHFLSLWI
        G+FKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGL GP V KA+F+ RR GSG ISS SAAVGG+G++ ISAFTYHMLRR   V+E  + HFL+LWI
Subjt:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLRR--LVKEEDETHFLSLWI

Query:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH
        VTIF GLKWL+GIFHVF+TNRS+S++IPS+SELHILSIFKYP+AI +VISGGFLSSFATI IFTAVLLFLIGQICFKPVLILYL LIYFL+PLISLPLLH
Subjt:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH

Query:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK
        QFQIRIK+DASKM ILGFILSA TSA CFYFH+  WRR +VFVFAVLQGTAAAVL AYGR LVLDCSPAGKE AISMWFSWMR+IGGC GFTVAAVVPA+
Subjt:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK

Query:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP
        LQVS+GV FC AVVGGVVLI+GNVTDYDGAVAAGHVRDDSEKGSPVIGL+SRSESKELESP
Subjt:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP

A0A5D3CJT7 Uncharacterized protein1.2e-25584.14Show/hide
Query:  AEQSPRP---EIQNLPPARSASGGRSMSTPRSAS-GGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL
        AEQSPRP   EIQNLPP++S S GRS+STPRSA+ GGGGSRRETPD HSTAAKLERAKEVY+AYEGHGERPTIVEIVGWCFYELCS  V T+LIPVVFPL
Subjt:  AEQSPRP---EIQNLPPARSASGGRSMSTPRSAS-GGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL

Query:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV
        IISQISG  T PPQGWFKSFMGFDCP +E +LYQ LTE TIKV+ +EFSPLIWTS+SWA+GLVLAGPILA ASFHLDYGFNQHLITLAAVAAGAL+CLP 
Subjt:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV

Query:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLRR--LVKEEDETHFLSLWI
        G+FKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGL GP V KA+F+ RR GSG ISS SAAVGG+G++ ISAFTYHMLRR   V+E  + HFL+LWI
Subjt:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLRR--LVKEEDETHFLSLWI

Query:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH
        VTIF GLKWL+GIFHVF+TNRS+S++IPS+SELHILSIFKYP+AI +VISGGFLSSFATI IFTAVLLFLIGQICFKPVLILYL LIYFL+PLISLPLLH
Subjt:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH

Query:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK
        QFQIRIK+DASKM ILGFILSA TSA CFYFH+  WRR +VFVFAVLQGTAAAVL AYGR LVLDCSPAGKE AISMWFSWMR+IGGC GFTVAAVVPA+
Subjt:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK

Query:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP
        LQVS+GV FC AVVGGVVLI+GNVTDYDGAVAAGHVRDDSEKGSPVIGL+SRSESKELESP
Subjt:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP

A0A6J1HK19 uncharacterized protein LOC111465229 isoform X13.7e-25783.96Show/hide
Query:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL
        MAEQSPRP   EIQN PP RS S GR  ST R  SG GGSR++TPD HS AAKLERAKEVYRAYEGHGE+P+I+E+ GWCFYELCSLSV TVLIPVVFPL
Subjt:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL

Query:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV
        IISQISGA  EPPQGWF+SFMGFDCPP E +LYQ LT+HTIK++ + FSPLIWTS+SWALGL++AGPILAFASFHLDYGFNQHLI + AVAAGALSCLP 
Subjt:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV

Query:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLR--RLVKEEDETHFLSLWI
        GVF+TVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGL GPTV KA+FA RR GSG ISSCS AVGGLG+AAISAFTYHMLR  R  KE D+ HFLSLWI
Subjt:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLR--RLVKEEDETHFLSLWI

Query:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH
        VTIFGGLKWLLGIFHVF+TNRS+SVTIPSDSELH+L+IFKYPHAIG+VIS GFLSSF TI IF AV LFLIGQICFKPVLILYLWLIYFLIPLISLPLLH
Subjt:  VTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLH

Query:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK
        QFQIRIK+DASKMQILGFILSA TSAICFYFH+ AWR  VVFVFA LQGTAAA+L  YGRVLVLDCSPAGKE AISMWFSWMRAIGGC GFTVAAVVPA+
Subjt:  QFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAK

Query:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP
        LQVS+GVAFC AVVGGVVLIYGN+TDY GAV+AGHV++DSEKGSPVIGLESRS SKELESP
Subjt:  LQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP

A0A6J1HNK9 uncharacterized protein LOC111465229 isoform X21.1e-25884.26Show/hide
Query:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL
        MAEQSPRP   EIQN PP RS S GR  ST R  SG GGSR++TPD HS AAKLERAKEVYRAYEGHGE+P+I+E+ GWCFYELCSLSV TVLIPVVFPL
Subjt:  MAEQSPRP---EIQNLPPARSASGGRSMSTPRSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPL

Query:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV
        IISQISGA  EPPQGWF+SFMGFDCPP E +LYQ LT+HTIK++ + FSPLIWTS+SWALGL++AGPILAFASFHLDYGFNQHLI + AVAAGALSCLP 
Subjt:  IISQISGATTEPPQGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPV

Query:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLRRLVKEEDETHFLSLWIVT
        GVF+TVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGL GPTV KA+FA RR GSG ISSCS AVGGLG+AAISAFTYHMLRR  KE D+ HFLSLWIVT
Subjt:  GVFKTVKIFPLYIVLIVIAHSVAFTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLRRLVKEEDETHFLSLWIVT

Query:  IFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQF
        IFGGLKWLLGIFHVF+TNRS+SVTIPSDSELH+L+IFKYPHAIG+VIS GFLSSF TI IF AV LFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQF
Subjt:  IFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHILSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQF

Query:  QIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAKLQ
        QIRIK+DASKMQILGFILSA TSAICFYFH+ AWR  VVFVFA LQGTAAA+L  YGRVLVLDCSPAGKE AISMWFSWMRAIGGC GFTVAAVVPA+LQ
Subjt:  QIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFAVLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAKLQ

Query:  VSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP
        VS+GVAFC AVVGGVVLIYGN+TDY GAV+AGHV++DSEKGSPVIGLESRS SKELESP
Subjt:  VSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSESKELESP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCCGAACAGAGCCCCAGACCAGAGATCCAAAACCTTCCTCCGGCGAGGTCCGCCAGTGGCGGGCGGAGCATGTCGACGCCGAGGTCGGCGAGCGGCGGAGGCGGCAG
CCGGAGGGAGACGCCGGACGTTCACAGCACGGCGGCGAAGCTGGAGAGGGCGAAGGAGGTGTACAGGGCGTACGAAGGGCACGGGGAAAGGCCGACGATTGTGGAGATTG
TGGGATGGTGCTTCTATGAACTTTGCTCGTTGTCTGTTGCGACGGTGCTGATTCCGGTGGTGTTTCCGTTGATTATCAGCCAGATTAGTGGAGCTACGACGGAGCCGCCG
CAGGGATGGTTTAAGAGCTTTATGGGCTTCGATTGTCCTCCTAAAGAAACTAAACTGTACCAAAGGCTAACGGAGCACACAATAAAAGTGGCCACCTCTGAATTCTCCCC
ATTAATATGGACCTCAATGTCATGGGCTTTGGGCCTGGTTTTAGCCGGCCCAATCCTCGCATTCGCTTCCTTCCACCTCGATTACGGCTTCAACCAACACCTAATCACTC
TCGCCGCCGTCGCCGCCGGAGCCCTCTCCTGTCTGCCGGTCGGCGTCTTCAAAACCGTCAAGATTTTTCCTCTTTACATTGTTTTAATCGTTATTGCTCACTCCGTCGCC
TTTACTTCTCACACGCGCCACCTCGGCCTCATGCTCCGTGGCCTCGCCGGACCGACCGTCCGTAAGGCCAGATTTGCTGCGAGAAGAAACGGATCTGGTTTTATTTCGTC
GTGCTCTGCGGCCGTCGGCGGCTTGGGCTCCGCCGCTATCTCCGCCTTCACTTACCACATGCTTCGACGGTTGGTGAAAGAAGAGGACGAAACTCACTTCCTCAGCCTAT
GGATCGTCACGATCTTCGGCGGCCTGAAATGGCTTCTCGGAATCTTCCACGTCTTCGTCACAAATCGATCACTCTCTGTTACAATCCCTTCCGATTCAGAGCTTCACATT
CTTTCAATCTTCAAATATCCTCACGCAATCGGCAGCGTAATCTCCGGCGGATTCCTCTCTTCCTTCGCCACAATCTGCATCTTCACTGCCGTTTTACTCTTCCTAATCGG
CCAAATCTGCTTCAAACCAGTCCTGATTCTCTATCTATGGCTGATCTACTTCCTCATCCCTCTAATTTCCCTTCCATTGCTCCATCAATTCCAGATCCGAATCAAATCCG
ACGCCTCAAAAATGCAAATCCTAGGGTTCATCCTCTCCGCCACAACTTCCGCCATTTGCTTCTACTTCCACAGCGGCGCCTGGCGGCGGAGTGTGGTTTTCGTCTTCGCC
GTTCTCCAAGGCACGGCGGCGGCGGTTCTCCAGGCGTACGGAAGAGTTTTAGTGCTCGATTGCTCGCCGGCCGGGAAGGAAGGTGCAATTTCGATGTGGTTTTCGTGGAT
GAGAGCGATCGGAGGTTGCGCTGGATTTACGGTCGCCGCGGTGGTTCCGGCGAAGTTGCAGGTTTCTGCCGGTGTGGCGTTTTGTAGCGCCGTCGTCGGAGGAGTGGTGT
TGATTTATGGTAATGTTACTGATTATGACGGCGCTGTGGCGGCGGGGCATGTGAGAGATGACAGTGAAAAGGGATCGCCGGTGATTGGATTGGAGTCTCGGAGTGAGAGT
AAAGAACTTGAGTCGCCCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCCGAACAGAGCCCCAGACCAGAGATCCAAAACCTTCCTCCGGCGAGGTCCGCCAGTGGCGGGCGGAGCATGTCGACGCCGAGGTCGGCGAGCGGCGGAGGCGGCAG
CCGGAGGGAGACGCCGGACGTTCACAGCACGGCGGCGAAGCTGGAGAGGGCGAAGGAGGTGTACAGGGCGTACGAAGGGCACGGGGAAAGGCCGACGATTGTGGAGATTG
TGGGATGGTGCTTCTATGAACTTTGCTCGTTGTCTGTTGCGACGGTGCTGATTCCGGTGGTGTTTCCGTTGATTATCAGCCAGATTAGTGGAGCTACGACGGAGCCGCCG
CAGGGATGGTTTAAGAGCTTTATGGGCTTCGATTGTCCTCCTAAAGAAACTAAACTGTACCAAAGGCTAACGGAGCACACAATAAAAGTGGCCACCTCTGAATTCTCCCC
ATTAATATGGACCTCAATGTCATGGGCTTTGGGCCTGGTTTTAGCCGGCCCAATCCTCGCATTCGCTTCCTTCCACCTCGATTACGGCTTCAACCAACACCTAATCACTC
TCGCCGCCGTCGCCGCCGGAGCCCTCTCCTGTCTGCCGGTCGGCGTCTTCAAAACCGTCAAGATTTTTCCTCTTTACATTGTTTTAATCGTTATTGCTCACTCCGTCGCC
TTTACTTCTCACACGCGCCACCTCGGCCTCATGCTCCGTGGCCTCGCCGGACCGACCGTCCGTAAGGCCAGATTTGCTGCGAGAAGAAACGGATCTGGTTTTATTTCGTC
GTGCTCTGCGGCCGTCGGCGGCTTGGGCTCCGCCGCTATCTCCGCCTTCACTTACCACATGCTTCGACGGTTGGTGAAAGAAGAGGACGAAACTCACTTCCTCAGCCTAT
GGATCGTCACGATCTTCGGCGGCCTGAAATGGCTTCTCGGAATCTTCCACGTCTTCGTCACAAATCGATCACTCTCTGTTACAATCCCTTCCGATTCAGAGCTTCACATT
CTTTCAATCTTCAAATATCCTCACGCAATCGGCAGCGTAATCTCCGGCGGATTCCTCTCTTCCTTCGCCACAATCTGCATCTTCACTGCCGTTTTACTCTTCCTAATCGG
CCAAATCTGCTTCAAACCAGTCCTGATTCTCTATCTATGGCTGATCTACTTCCTCATCCCTCTAATTTCCCTTCCATTGCTCCATCAATTCCAGATCCGAATCAAATCCG
ACGCCTCAAAAATGCAAATCCTAGGGTTCATCCTCTCCGCCACAACTTCCGCCATTTGCTTCTACTTCCACAGCGGCGCCTGGCGGCGGAGTGTGGTTTTCGTCTTCGCC
GTTCTCCAAGGCACGGCGGCGGCGGTTCTCCAGGCGTACGGAAGAGTTTTAGTGCTCGATTGCTCGCCGGCCGGGAAGGAAGGTGCAATTTCGATGTGGTTTTCGTGGAT
GAGAGCGATCGGAGGTTGCGCTGGATTTACGGTCGCCGCGGTGGTTCCGGCGAAGTTGCAGGTTTCTGCCGGTGTGGCGTTTTGTAGCGCCGTCGTCGGAGGAGTGGTGT
TGATTTATGGTAATGTTACTGATTATGACGGCGCTGTGGCGGCGGGGCATGTGAGAGATGACAGTGAAAAGGGATCGCCGGTGATTGGATTGGAGTCTCGGAGTGAGAGT
AAAGAACTTGAGTCGCCCTGA
Protein sequenceShow/hide protein sequence
MAEQSPRPEIQNLPPARSASGGRSMSTPRSASGGGGSRRETPDVHSTAAKLERAKEVYRAYEGHGERPTIVEIVGWCFYELCSLSVATVLIPVVFPLIISQISGATTEPP
QGWFKSFMGFDCPPKETKLYQRLTEHTIKVATSEFSPLIWTSMSWALGLVLAGPILAFASFHLDYGFNQHLITLAAVAAGALSCLPVGVFKTVKIFPLYIVLIVIAHSVA
FTSHTRHLGLMLRGLAGPTVRKARFAARRNGSGFISSCSAAVGGLGSAAISAFTYHMLRRLVKEEDETHFLSLWIVTIFGGLKWLLGIFHVFVTNRSLSVTIPSDSELHI
LSIFKYPHAIGSVISGGFLSSFATICIFTAVLLFLIGQICFKPVLILYLWLIYFLIPLISLPLLHQFQIRIKSDASKMQILGFILSATTSAICFYFHSGAWRRSVVFVFA
VLQGTAAAVLQAYGRVLVLDCSPAGKEGAISMWFSWMRAIGGCAGFTVAAVVPAKLQVSAGVAFCSAVVGGVVLIYGNVTDYDGAVAAGHVRDDSEKGSPVIGLESRSES
KELESP