; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS011983 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS011983
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionUnknown protein
Genome locationscaffold123_2:346787..350878
RNA-Seq ExpressionMS011983
SyntenyMS011983
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022149995.1 uncharacterized protein LOC111018276 isoform X1 [Momordica charantia]0.0e+0099.22Show/hide
Query:  MGSLFSASFFFLFLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCLLQKVNES
        MGSLFSASFFFLFL LIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCLLQKVNES
Subjt:  MGSLFSASFFFLFLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCLLQKVNES

Query:  DNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRL
        DNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRL
Subjt:  DNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRL

Query:  VSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLT
        VSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDA TGF+GGYHYDGRGIMRKLPESPNFKVRLT
Subjt:  VSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLT

Query:  LAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEK
        LAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEK
Subjt:  LAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEK

Query:  PYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLV
        PYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVW SINVGAEIYISEGGVTAEWSVSDFDVLV
Subjt:  PYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLV

Query:  PQDPRDANCCSY
        PQDPRDANCCSY
Subjt:  PQDPRDANCCSY

XP_022149996.1 uncharacterized protein LOC111018276 isoform X2 [Momordica charantia]4.2e-29494.14Show/hide
Query:  MGSLFSASFFFLFLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCLLQKVNES
        MGSLFSASFFFLFL LIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCLLQKVNES
Subjt:  MGSLFSASFFFLFLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCLLQKVNES

Query:  DNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRL
        DNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRL
Subjt:  DNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRL

Query:  VSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLT
        VSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVST                           TGF+GGYHYDGRGIMRKLPESPNFKVRLT
Subjt:  VSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLT

Query:  LAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEK
        LAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEK
Subjt:  LAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEK

Query:  PYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLV
        PYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVW SINVGAEIYISEGGVTAEWSVSDFDVLV
Subjt:  PYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLV

Query:  PQDPRDANCCSY
        PQDPRDANCCSY
Subjt:  PQDPRDANCCSY

XP_022925493.1 uncharacterized protein LOC111432779 isoform X2 [Cucurbita moschata]1.9e-27586.82Show/hide
Query:  MGSLFSASFFFL----FLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCL-LQ
        MGSLFS+SFF L    FLNL H S HE++E+ SAIGDPGMKNPNVRV FEAWNFCNEVGAEA  MGSPR+ADCADLRAPLASD +DCF   SD+ C+ L 
Subjt:  MGSLFSASFFFL----FLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCL-LQ

Query:  KVNESDNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYH
        KVNESDNKLGAGEKFPSERFKPY DPDLY VEKERYLGSLCEVHDSS+PW FWMIMLKNGNFDKNSTLC ENGKNV KI+TDRTFPCFGEGCMNQPLVYH
Subjt:  KVNESDNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYH

Query:  KSSRLVSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNF
          SRLVS  +RMVSLTGGFYGTYELDADLSNGIGKNSYFSV+W KNVS+GSWIF +RLTTSSKYPWLMLYLRSDA  GFNGGYHYDGRGIMRKLPESPNF
Subjt:  KSSRLVSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNF

Query:  KVRLTLAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNA
        KVRLTL +KSGGG N+QFYLIDIGSCWKNNGD CNGDTTTDVTRYSEMIINPETTS C+PSNL +CPPYHV A+GEKIYRNETSRFPYSAYHLYCSPGN 
Subjt:  KVRLTLAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNA

Query:  MHLEKPYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSD
        MHLEKPYDICDPYSNPQAQEL+QILPHPEW VHGYP KQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARR+WTSINVG EIYISEG  TAEWSVSD
Subjt:  MHLEKPYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSD

Query:  FDVLVPQDPRDANCCS
        FDV+VP D RDANCCS
Subjt:  FDVLVPQDPRDANCCS

XP_022973970.1 uncharacterized protein LOC111472586 isoform X2 [Cucurbita maxima]1.6e-27486.24Show/hide
Query:  MGSLFSASFFFL----FLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCL-LQ
        MGSLFS+SFF L    FLNL H S HES+E+ SAIGDPGMK+PNVRV FEAWNFCNEVGAEA  MGSPR+ADCADLRAPLASD +DCF   SD+ C+ L 
Subjt:  MGSLFSASFFFL----FLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCL-LQ

Query:  KVNESDNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYH
        KVNESDNKL AGEKFPS+RFKPY DPDLY VEKERYLGSLCEVHDSS+PW FWMIMLKNGNFDKNSTLCPENGKN  KI+TDRTFPCFGEGCMNQPLVYH
Subjt:  KVNESDNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYH

Query:  KSSRLVSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNF
          SRLVS  +RMVSLTGGFYGTYELDADLSNGIGKNSYFSV+W KNVS+GSWIF +RLTTSSKYPWLMLYLRSDA  GFNGGYHYDGRGIMRKLPESPNF
Subjt:  KSSRLVSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNF

Query:  KVRLTLAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNA
        KVRLTL +KSGGG N+QFYLIDIGSCWKNNGD CNGDTTTDVTRYSEMIINPETTS C+PSNL +CPPYHV A+GEKIYRNETSRFPYSAYHLYCSPGNA
Subjt:  KVRLTLAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNA

Query:  MHLEKPYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSD
        MHLEKPYD+CDPYSNPQAQEL+QILPHPEW VHGYP KQGDGW+GDPRTWELDVGALSNRLYFYQDPGTKPARR+WTSINVG EIYISE G TAEWSVSD
Subjt:  MHLEKPYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSD

Query:  FDVLVPQDPRDANCCS
        FDV+VP D RDANCCS
Subjt:  FDVLVPQDPRDANCCS

XP_038889615.1 uncharacterized protein LOC120079486 [Benincasa hispida]5.5e-27885.52Show/hide
Query:  MGSLFSASFFF------------LFLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLES
        MGSLFS+SFFF             FLNL   SPHES+EY SAIGDPGMKNPNVRV FEAWNFCNEVGAEA HMGSPR+ADCADLR P ASD +DC  LE 
Subjt:  MGSLFSASFFF------------LFLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLES

Query:  DNKCL-LQKVNESDNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGC
        DN CL LQKV+E+DNKLGAGEKFPSERFK YQDPDLY VEKERYLGSLCEVHDSS+PW FWMIMLKNGNFDKNSTLCPENGKN++K++TDR FPCFG+GC
Subjt:  DNKCL-LQKVNESDNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGC

Query:  MNQPLVYHKSSRLVSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMR
        MNQP++YH  SRLVS G+RMVSLTGGFYGTYELDADLS+GIGKNSYFSV+W KNVSTGSWIF +RL TSSKYPWLMLYLRSDA TGFNGGYHYDGRGIMR
Subjt:  MNQPLVYHKSSRLVSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMR

Query:  KLPESPNFKVRLTLAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYH
        KLPESPNFKVRLTL IK+GGG N+QFYLIDIGSCWKNNGD CNGDTTTDVTRYSEMIINPET+S C+P+NL +CPPYHVSA+GEKIYRNETSRFPYSAYH
Subjt:  KLPESPNFKVRLTLAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYH

Query:  LYCSPGNAMHLEKPYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGV
        LYCSPGNAMHLEKPYDICDPYSNPQAQEL+QILPHPEWAVHGYPKKQGDGW+GDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVG EIYISE G 
Subjt:  LYCSPGNAMHLEKPYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGV

Query:  TAEWSVSDFDVLVPQDPRDANCCSY
        TAEWSVSDFDV+VP D RDANCCSY
Subjt:  TAEWSVSDFDVLVPQDPRDANCCSY

TrEMBL top hitse value%identityAlignment
A0A6J1D795 uncharacterized protein LOC111018276 isoform X10.0e+0099.22Show/hide
Query:  MGSLFSASFFFLFLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCLLQKVNES
        MGSLFSASFFFLFL LIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCLLQKVNES
Subjt:  MGSLFSASFFFLFLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCLLQKVNES

Query:  DNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRL
        DNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRL
Subjt:  DNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRL

Query:  VSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLT
        VSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDA TGF+GGYHYDGRGIMRKLPESPNFKVRLT
Subjt:  VSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLT

Query:  LAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEK
        LAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEK
Subjt:  LAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEK

Query:  PYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLV
        PYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVW SINVGAEIYISEGGVTAEWSVSDFDVLV
Subjt:  PYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLV

Query:  PQDPRDANCCSY
        PQDPRDANCCSY
Subjt:  PQDPRDANCCSY

A0A6J1DA43 uncharacterized protein LOC111018276 isoform X22.0e-29494.14Show/hide
Query:  MGSLFSASFFFLFLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCLLQKVNES
        MGSLFSASFFFLFL LIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCLLQKVNES
Subjt:  MGSLFSASFFFLFLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCLLQKVNES

Query:  DNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRL
        DNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRL
Subjt:  DNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRL

Query:  VSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLT
        VSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVST                           TGF+GGYHYDGRGIMRKLPESPNFKVRLT
Subjt:  VSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLT

Query:  LAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEK
        LAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEK
Subjt:  LAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEK

Query:  PYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLV
        PYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVW SINVGAEIYISEGGVTAEWSVSDFDVLV
Subjt:  PYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLV

Query:  PQDPRDANCCSY
        PQDPRDANCCSY
Subjt:  PQDPRDANCCSY

A0A6J1EBU8 uncharacterized protein LOC111432779 isoform X12.3e-27486.65Show/hide
Query:  MGSLFSASFFFL----FLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLAS-DYRDCFRLESDNKCL-L
        MGSLFS+SFF L    FLNL H S HE++E+ SAIGDPGMKNPNVRV FEAWNFCNEVGAEA  MGSPR+ADCADLRAPLAS D +DCF   SD+ C+ L
Subjt:  MGSLFSASFFFL----FLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLAS-DYRDCFRLESDNKCL-L

Query:  QKVNESDNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVY
         KVNESDNKLGAGEKFPSERFKPY DPDLY VEKERYLGSLCEVHDSS+PW FWMIMLKNGNFDKNSTLC ENGKNV KI+TDRTFPCFGEGCMNQPLVY
Subjt:  QKVNESDNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVY

Query:  HKSSRLVSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPN
        H  SRLVS  +RMVSLTGGFYGTYELDADLSNGIGKNSYFSV+W KNVS+GSWIF +RLTTSSKYPWLMLYLRSDA  GFNGGYHYDGRGIMRKLPESPN
Subjt:  HKSSRLVSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPN

Query:  FKVRLTLAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGN
        FKVRLTL +KSGGG N+QFYLIDIGSCWKNNGD CNGDTTTDVTRYSEMIINPETTS C+PSNL +CPPYHV A+GEKIYRNETSRFPYSAYHLYCSPGN
Subjt:  FKVRLTLAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGN

Query:  AMHLEKPYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVS
         MHLEKPYDICDPYSNPQAQEL+QILPHPEW VHGYP KQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARR+WTSINVG EIYISEG  TAEWSVS
Subjt:  AMHLEKPYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVS

Query:  DFDVLVPQDPRDANCCS
        DFDV+VP D RDANCCS
Subjt:  DFDVLVPQDPRDANCCS

A0A6J1EFC7 uncharacterized protein LOC111432779 isoform X29.4e-27686.82Show/hide
Query:  MGSLFSASFFFL----FLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCL-LQ
        MGSLFS+SFF L    FLNL H S HE++E+ SAIGDPGMKNPNVRV FEAWNFCNEVGAEA  MGSPR+ADCADLRAPLASD +DCF   SD+ C+ L 
Subjt:  MGSLFSASFFFL----FLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCL-LQ

Query:  KVNESDNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYH
        KVNESDNKLGAGEKFPSERFKPY DPDLY VEKERYLGSLCEVHDSS+PW FWMIMLKNGNFDKNSTLC ENGKNV KI+TDRTFPCFGEGCMNQPLVYH
Subjt:  KVNESDNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYH

Query:  KSSRLVSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNF
          SRLVS  +RMVSLTGGFYGTYELDADLSNGIGKNSYFSV+W KNVS+GSWIF +RLTTSSKYPWLMLYLRSDA  GFNGGYHYDGRGIMRKLPESPNF
Subjt:  KSSRLVSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNF

Query:  KVRLTLAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNA
        KVRLTL +KSGGG N+QFYLIDIGSCWKNNGD CNGDTTTDVTRYSEMIINPETTS C+PSNL +CPPYHV A+GEKIYRNETSRFPYSAYHLYCSPGN 
Subjt:  KVRLTLAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNA

Query:  MHLEKPYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSD
        MHLEKPYDICDPYSNPQAQEL+QILPHPEW VHGYP KQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARR+WTSINVG EIYISEG  TAEWSVSD
Subjt:  MHLEKPYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSD

Query:  FDVLVPQDPRDANCCS
        FDV+VP D RDANCCS
Subjt:  FDVLVPQDPRDANCCS

A0A6J1ICQ8 uncharacterized protein LOC111472586 isoform X28.0e-27586.24Show/hide
Query:  MGSLFSASFFFL----FLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCL-LQ
        MGSLFS+SFF L    FLNL H S HES+E+ SAIGDPGMK+PNVRV FEAWNFCNEVGAEA  MGSPR+ADCADLRAPLASD +DCF   SD+ C+ L 
Subjt:  MGSLFSASFFFL----FLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCL-LQ

Query:  KVNESDNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYH
        KVNESDNKL AGEKFPS+RFKPY DPDLY VEKERYLGSLCEVHDSS+PW FWMIMLKNGNFDKNSTLCPENGKN  KI+TDRTFPCFGEGCMNQPLVYH
Subjt:  KVNESDNKLGAGEKFPSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYH

Query:  KSSRLVSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNF
          SRLVS  +RMVSLTGGFYGTYELDADLSNGIGKNSYFSV+W KNVS+GSWIF +RLTTSSKYPWLMLYLRSDA  GFNGGYHYDGRGIMRKLPESPNF
Subjt:  KSSRLVSLGRRMVSLTGGFYGTYELDADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNF

Query:  KVRLTLAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNA
        KVRLTL +KSGGG N+QFYLIDIGSCWKNNGD CNGDTTTDVTRYSEMIINPETTS C+PSNL +CPPYHV A+GEKIYRNETSRFPYSAYHLYCSPGNA
Subjt:  KVRLTLAIKSGGGSNNQFYLIDIGSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNA

Query:  MHLEKPYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSD
        MHLEKPYD+CDPYSNPQAQEL+QILPHPEW VHGYP KQGDGW+GDPRTWELDVGALSNRLYFYQDPGTKPARR+WTSINVG EIYISE G TAEWSVSD
Subjt:  MHLEKPYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSD

Query:  FDVLVPQDPRDANCCS
        FDV+VP D RDANCCS
Subjt:  FDVLVPQDPRDANCCS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G17030.1 unknown protein4.0e-16254.24Show/hide
Query:  SVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKC----LLQKVNESDNKLGAGEKFPSERFKPYQ
        ++ Y SA+GDPGM+N N+RV  EAWN CNEVG EA +MGSPRMADC D+                DN      ++ KV+E DN+LG G            
Subjt:  SVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKC----LLQKVNESDNKLGAGEKFPSERFKPYQ

Query:  DPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRLVSLGRRMVSLTGGFYGTYE
        + D+YA +KE YLG+ C+V D  +PW FWMIMLKNGN D  + +CPENGK          FPCFG+GCMN P ++H+ + LV        ++G FYGT++
Subjt:  DPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRLVSLGRRMVSLTGGFYGTYE

Query:  LDADLSNGIGKNSYFSVTWVKNV-STGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLTLAIKSGGGSNNQFYLIDI
        LD D  + +G NSY+ V W K +    SW+F H L TSSKYPWLMLYLR+DA  GF+GGYHYD RG+M+   +SP+FKV+  L I  GGGS +QFYL+D+
Subjt:  LDADLSNGIGKNSYFSVTWVKNV-STGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLTLAIKSGGGSNNQFYLIDI

Query:  GSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEKPYDICDPYSNPQAQELVQ
        GSCWKN+G  C+GD TTDVTRYSEMIINP  T+ C  + L ACPP H    G K++R +  +FP+ AYH YC PGNA   E PY++CDPYSNPQ QE++Q
Subjt:  GSCWKNNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEKPYDICDPYSNPQAQELVQ

Query:  ILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLVP
        ILPHP W   GYP K+G GWIGDPRTWELDVG LS  L+FYQDPGTKP  R W+SI++G EIY+S+  + AEW+V+DFD+++P
Subjt:  ILPHPEWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLVP

AT2G47010.1 unknown protein1.3e-17960.17Show/hide
Query:  KSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDY-RDCFRLESDNKCLLQKVNESDNKLGAGEKFP---SERFKPYQDPDL
        +SA+GDPGMK   +RV FEAWNFCNEVG EA HMGSPR ADC DL +     Y  D     +    L+ KV++SDN+LG G+  P   SE      +PDL
Subjt:  KSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDY-RDCFRLESDNKCLLQKVNESDNKLGAGEKFP---SERFKPYQDPDL

Query:  YAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRLVSLGRRMVSLTGGFYGTYELDAD
        YAVEKE YLGSLC+V D  +PW FWM+MLKNGN+D  S LCP+NGK +        FPCFG GCMNQP + H  + L   G+ M    G F GTYE  AD
Subjt:  YAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRLVSLGRRMVSLTGGFYGTYELDAD

Query:  LSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLTLAIKSGGGSNNQFYLIDIGSCWK
          NG+   SY+ V W K V  G W+F H+L TS+KYPWLMLYLR+DA  GF+GGYHYD RG+++ LPESPNFKVRLTL +K GGG+ +QFYL+DIGSCWK
Subjt:  LSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLTLAIKSGGGSNNQFYLIDIGSCWK

Query:  NNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEKPYDICDPYSNPQAQELVQILPHP
        NNG PC+GD TTDVTRYSEMIINPET   C P +L  CPPYH    G +++R +   FPY AYH+YC+PGNA HLE P   CD YSNPQAQE++Q+LPHP
Subjt:  NNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEKPYDICDPYSNPQAQELVQILPHP

Query:  EWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLV
         W  +GYP + GDGW+GDPRTW+LDVG LS+RL+FYQDPGT PARR+WTS++VG EIY  E    AEW +SDFDVL+
Subjt:  EWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLV

AT2G47010.2 unknown protein1.3e-17960.17Show/hide
Query:  KSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDY-RDCFRLESDNKCLLQKVNESDNKLGAGEKFP---SERFKPYQDPDL
        +SA+GDPGMK   +RV FEAWNFCNEVG EA HMGSPR ADC DL +     Y  D     +    L+ KV++SDN+LG G+  P   SE      +PDL
Subjt:  KSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDY-RDCFRLESDNKCLLQKVNESDNKLGAGEKFP---SERFKPYQDPDL

Query:  YAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRLVSLGRRMVSLTGGFYGTYELDAD
        YAVEKE YLGSLC+V D  +PW FWM+MLKNGN+D  S LCP+NGK +        FPCFG GCMNQP + H  + L   G+ M    G F GTYE  AD
Subjt:  YAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRLVSLGRRMVSLTGGFYGTYELDAD

Query:  LSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLTLAIKSGGGSNNQFYLIDIGSCWK
          NG+   SY+ V W K V  G W+F H+L TS+KYPWLMLYLR+DA  GF+GGYHYD RG+++ LPESPNFKVRLTL +K GGG+ +QFYL+DIGSCWK
Subjt:  LSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLTLAIKSGGGSNNQFYLIDIGSCWK

Query:  NNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEKPYDICDPYSNPQAQELVQILPHP
        NNG PC+GD TTDVTRYSEMIINPET   C P +L  CPPYH    G +++R +   FPY AYH+YC+PGNA HLE P   CD YSNPQAQE++Q+LPHP
Subjt:  NNGDPCNGDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEKPYDICDPYSNPQAQELVQILPHP

Query:  EWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLV
         W  +GYP + GDGW+GDPRTW+LDVG LS+RL+FYQDPGT PARR+WTS++VG EIY  E    AEW +SDFDVL+
Subjt:  EWAVHGYPKKQGDGWIGDPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLV

AT4G09965.1 unknown protein8.8e-2471.83Show/hide
Query:  KQGDGWIGDPRTWELDVGALSNRLYFYQD-PGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLVPQ
        KQG+GWIGD RTWE++ GALS+RLYFYQ+ PGTKPA+R+WTSINV  +IY+S    TAEW+VSDFDVLV Q
Subjt:  KQGDGWIGDPRTWELDVGALSNRLYFYQD-PGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLVPQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTCTCTGTTTTCTGCTTCCTTCTTCTTTCTTTTTCTGAATTTGATCCATTCTTCGCCTCACGAATCAGTGGAGTATAAATCTGCCATTGGAGACCCGGGAATGAA
GAACCCAAATGTTCGAGTCGGATTTGAGGCGTGGAACTTCTGTAATGAAGTTGGAGCGGAAGCGGCCCACATGGGCAGCCCCAGAATGGCTGATTGTGCTGATTTGCGAG
CCCCATTGGCTTCTGATTATCGCGATTGTTTCCGTCTTGAGAGCGATAACAAGTGTCTACTACAAAAAGTAAATGAATCAGATAATAAACTTGGGGCAGGGGAAAAATTC
CCATCGGAACGTTTTAAGCCATACCAGGACCCAGATCTGTATGCTGTGGAGAAGGAGCGTTATCTTGGATCATTATGTGAGGTTCATGATTCTTCGGATCCATGGTACTT
CTGGATGATTATGCTAAAAAATGGAAACTTTGACAAGAACTCTACTCTCTGCCCTGAAAATGGAAAGAATGTTAGTAAGATTGTAACTGATAGAACATTCCCTTGTTTTG
GAGAAGGATGTATGAACCAGCCACTTGTTTACCATAAAAGTTCAAGATTGGTATCTCTTGGCAGACGAATGGTCTCTTTAACAGGCGGGTTTTATGGAACCTATGAACTT
GATGCTGATTTGAGTAACGGTATAGGGAAGAACTCTTACTTTTCAGTCACCTGGGTGAAGAATGTTAGTACAGGTAGTTGGATTTTCTTGCATCGTTTGACGACATCGTC
CAAGTATCCTTGGCTTATGCTGTACCTCCGTTCCGATGCAGTAACAGGTTTCAACGGTGGATATCACTACGATGGTCGTGGCATCATGAGAAAGTTGCCTGAATCCCCAA
ATTTCAAAGTAAGATTAACACTTGCCATAAAAAGTGGAGGTGGAAGCAACAACCAATTCTATCTCATTGACATAGGAAGCTGTTGGAAGAACAATGGAGATCCTTGCAAT
GGCGACACGACCACCGACGTAACTCGATACAGTGAAATGATTATCAACCCGGAGACTACTAGCCGGTGCAAACCGAGCAATCTACAGGCTTGTCCACCATATCATGTTAG
TGCTGCTGGTGAGAAAATATATAGGAATGAGACATCAAGGTTCCCATATTCAGCTTATCACCTGTACTGCAGTCCTGGAAATGCTATGCATTTGGAGAAACCATATGATA
TTTGTGATCCATATAGCAACCCACAGGCTCAGGAGTTGGTACAAATTCTTCCACATCCTGAATGGGCTGTACATGGCTATCCAAAGAAGCAAGGAGATGGATGGATTGGA
GATCCTAGAACTTGGGAGCTTGACGTTGGAGCTTTGTCGAACCGCTTGTACTTCTACCAGGATCCGGGAACGAAGCCAGCAAGGCGGGTATGGACATCGATCAATGTCGG
TGCAGAAATATATATTAGTGAAGGGGGGGTGACAGCAGAGTGGAGTGTAAGTGATTTTGATGTTCTGGTTCCACAAGATCCTAGAGATGCCAATTGCTGCTCTTAT
mRNA sequenceShow/hide mRNA sequence
ATGGGTTCTCTGTTTTCTGCTTCCTTCTTCTTTCTTTTTCTGAATTTGATCCATTCTTCGCCTCACGAATCAGTGGAGTATAAATCTGCCATTGGAGACCCGGGAATGAA
GAACCCAAATGTTCGAGTCGGATTTGAGGCGTGGAACTTCTGTAATGAAGTTGGAGCGGAAGCGGCCCACATGGGCAGCCCCAGAATGGCTGATTGTGCTGATTTGCGAG
CCCCATTGGCTTCTGATTATCGCGATTGTTTCCGTCTTGAGAGCGATAACAAGTGTCTACTACAAAAAGTAAATGAATCAGATAATAAACTTGGGGCAGGGGAAAAATTC
CCATCGGAACGTTTTAAGCCATACCAGGACCCAGATCTGTATGCTGTGGAGAAGGAGCGTTATCTTGGATCATTATGTGAGGTTCATGATTCTTCGGATCCATGGTACTT
CTGGATGATTATGCTAAAAAATGGAAACTTTGACAAGAACTCTACTCTCTGCCCTGAAAATGGAAAGAATGTTAGTAAGATTGTAACTGATAGAACATTCCCTTGTTTTG
GAGAAGGATGTATGAACCAGCCACTTGTTTACCATAAAAGTTCAAGATTGGTATCTCTTGGCAGACGAATGGTCTCTTTAACAGGCGGGTTTTATGGAACCTATGAACTT
GATGCTGATTTGAGTAACGGTATAGGGAAGAACTCTTACTTTTCAGTCACCTGGGTGAAGAATGTTAGTACAGGTAGTTGGATTTTCTTGCATCGTTTGACGACATCGTC
CAAGTATCCTTGGCTTATGCTGTACCTCCGTTCCGATGCAGTAACAGGTTTCAACGGTGGATATCACTACGATGGTCGTGGCATCATGAGAAAGTTGCCTGAATCCCCAA
ATTTCAAAGTAAGATTAACACTTGCCATAAAAAGTGGAGGTGGAAGCAACAACCAATTCTATCTCATTGACATAGGAAGCTGTTGGAAGAACAATGGAGATCCTTGCAAT
GGCGACACGACCACCGACGTAACTCGATACAGTGAAATGATTATCAACCCGGAGACTACTAGCCGGTGCAAACCGAGCAATCTACAGGCTTGTCCACCATATCATGTTAG
TGCTGCTGGTGAGAAAATATATAGGAATGAGACATCAAGGTTCCCATATTCAGCTTATCACCTGTACTGCAGTCCTGGAAATGCTATGCATTTGGAGAAACCATATGATA
TTTGTGATCCATATAGCAACCCACAGGCTCAGGAGTTGGTACAAATTCTTCCACATCCTGAATGGGCTGTACATGGCTATCCAAAGAAGCAAGGAGATGGATGGATTGGA
GATCCTAGAACTTGGGAGCTTGACGTTGGAGCTTTGTCGAACCGCTTGTACTTCTACCAGGATCCGGGAACGAAGCCAGCAAGGCGGGTATGGACATCGATCAATGTCGG
TGCAGAAATATATATTAGTGAAGGGGGGGTGACAGCAGAGTGGAGTGTAAGTGATTTTGATGTTCTGGTTCCACAAGATCCTAGAGATGCCAATTGCTGCTCTTAT
Protein sequenceShow/hide protein sequence
MGSLFSASFFFLFLNLIHSSPHESVEYKSAIGDPGMKNPNVRVGFEAWNFCNEVGAEAAHMGSPRMADCADLRAPLASDYRDCFRLESDNKCLLQKVNESDNKLGAGEKF
PSERFKPYQDPDLYAVEKERYLGSLCEVHDSSDPWYFWMIMLKNGNFDKNSTLCPENGKNVSKIVTDRTFPCFGEGCMNQPLVYHKSSRLVSLGRRMVSLTGGFYGTYEL
DADLSNGIGKNSYFSVTWVKNVSTGSWIFLHRLTTSSKYPWLMLYLRSDAVTGFNGGYHYDGRGIMRKLPESPNFKVRLTLAIKSGGGSNNQFYLIDIGSCWKNNGDPCN
GDTTTDVTRYSEMIINPETTSRCKPSNLQACPPYHVSAAGEKIYRNETSRFPYSAYHLYCSPGNAMHLEKPYDICDPYSNPQAQELVQILPHPEWAVHGYPKKQGDGWIG
DPRTWELDVGALSNRLYFYQDPGTKPARRVWTSINVGAEIYISEGGVTAEWSVSDFDVLVPQDPRDANCCSY