; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc05G11150 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc05G11150
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionProtein IWS1-like protein A isoform X1
Genome locationClcChr05:8981419..8984458
RNA-Seq ExpressionClc05G11150
SyntenyClc05G11150
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0050461.1 protein IWS1-like protein A isoform X1 [Cucumis melo var. makuwa]1.2e-23774.84Show/hide
Query:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELK---------------------VRIEDGKKEVV
        MDVEEVRHKVE  D+CKCSELKLRCE+AE KCEELELKIQIKKIEYELLQ QMKKLESEKSVIE ELK                     +  ED K EVV
Subjt:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELK---------------------VRIEDGKKEVV

Query:  NLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAG
        NLV+ NE+++E+ QLLIENNVLAVEKK AE+EAELWK KFK LE Q SKMDE+KGD +PVL GKFEG IGLS+S E  A DGVASNHYA  K+I      
Subjt:  NLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAG

Query:  APPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGS
         PPNGS P PT+SFG G+K  TIES+F+HGNRV+KQL FEE+SPSKKMAPLTPGVSRPLHGVINIDDSDDE+ +AKLP HT++KSEACVSA QDLS C S
Subjt:  APPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGS

Query:  ICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPA
        IC EEKID+GNK+++SDC T N ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTN TQV S+LN+ PTS CDKVVVSA P 
Subjt:  ICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPA

Query:  RRRLVSLRNYEE-GSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW
        RRRLVSLRNY+E GS KENS+SQ     N + D EDELEDV S   +++DS EGFIV+ SD S  GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKW
Subjt:  RRRLVSLRNYEE-GSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW

Query:  EYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLF
        EYEADMLA+FGKDP+LCMKAVCALYR+QTSEEK+ KETLFSNKRGFSKFDAIRGT LAEFLT+G+PQGNL KSVKELEEYDPKAI+LCR LATHYSKQLF
Subjt:  EYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLF

Query:  EIYGNKEDPFFL
        EIYGNKEDPFFL
Subjt:  EIYGNKEDPFFL

TYK03142.1 protein IWS1-like protein A isoform X1 [Cucumis melo var. makuwa]7.3e-23574.35Show/hide
Query:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAEL---------------------KVRIEDGKKEVV
        MDVEEVRHKVE  D+CKCSELKLRCE+AE KCEELELKIQIKKIEYELLQ QMKKLESEKSVIE EL                     K+  ED K EVV
Subjt:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAEL---------------------KVRIEDGKKEVV

Query:  NLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAG
        NLV+ NE+++E+ +LLIENNVLAVEKK AE+EAELWK KFK LE Q SKMDE+KGD +PVL GKFEG IGLS+S E  A DGVASNHYA           
Subjt:  NLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAG

Query:  APPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGS
         PPNGS   PTVSFG G+K  TIES+F+HGNRV+KQL FEE+SPSKKMAPLTPGVSRPLHGVINIDDSDDE+ +AKLP HT++KSEACVSA QDLS C S
Subjt:  APPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGS

Query:  ICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPA
        IC EEKID+GNK+++SDC T N ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTN TQV S+LN+ PTS CDKVVVSA P 
Subjt:  ICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPA

Query:  RRRLVSLRNYEE-GSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW
        RRRLVSLRNY+E GS KENS+SQ     N + D EDELEDV S   +++DS EGFIV+ SD S  GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKW
Subjt:  RRRLVSLRNYEE-GSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW

Query:  EYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLF
        EYEADMLA+FGKDP+LCMKAVCALYR+QTSEEK+ KETLFSNKRGFSKFDAIRGT LAEFLT+G+PQGNL KSVKELEEYDPKAI+LCR LATHYSKQLF
Subjt:  EYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLF

Query:  EIYGNKEDPFFL
        EIYGNKEDPFFL
Subjt:  EIYGNKEDPFFL

XP_008465383.1 PREDICTED: uncharacterized protein LOC103503017 isoform X1 [Cucumis melo]9.5e-24375.65Show/hide
Query:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELK---------------------VRIEDGKKEVV
        MDVEEVRHKVE  D+CKCSELKLRCE+AE KCEELELKIQIKKIEYELLQ QMKKLESEKSVIE ELK                     +  ED K EVV
Subjt:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELK---------------------VRIEDGKKEVV

Query:  NLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAG
        NLV+ NE+++E+ QLLIENNVLAVEKK AE+EAELWK KFK LE Q SKMDE+KGD +PVL GKFEG IGLS+S E  A DGVASNHYA  K+ISN+EAG
Subjt:  NLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAG

Query:  APPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGS
         PPNGS P PT+SFG G+K  TIES+F+HGNRV+KQL FEE+SPSKKMAPLTPGVSRPLHGVINIDDSDDE+ +AKLP HT++KSEACVSA QDLS C S
Subjt:  APPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGS

Query:  ICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPA
        IC EEKID+GNK+++SDC T N ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTN TQV S+LN+ PTS CDKVVVSA P 
Subjt:  ICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPA

Query:  RRRLVSLRNYEE-GSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW
        RRRLVSLRNY+E GS KENS+SQ     N + D EDELEDV S   +++DS EGFIV+ SD S  GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKW
Subjt:  RRRLVSLRNYEE-GSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW

Query:  EYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLF
        EYEADMLA+FGKDP+LCMKAVCALYR+QTSEEK+ KETLFSNKRGFSKFDAIRGT LAEFLT+G+PQGNL KSVKELEEYDPKAI+LCR LATHYSKQLF
Subjt:  EYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLF

Query:  EIYGNKEDPFFL
        EIYGNKEDPFFL
Subjt:  EIYGNKEDPFFL

XP_008465387.1 PREDICTED: uncharacterized protein LOC103503017 isoform X2 [Cucumis melo]2.7e-22974.28Show/hide
Query:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELKVRIEDGKKEVVNLVNKNEMEDEVVQLLIENNV
        MDVEEVRHKVE  D+CKCSELKLRCE+AE KCEELELKIQIKKIEYELLQ QMKKLESEKSVIE ELK  +E   +                        
Subjt:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELKVRIEDGKKEVVNLVNKNEMEDEVVQLLIENNV

Query:  LAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAGAPPNGSPPTPTVSFGGGKKLR
           ++K AE+EAELWK KFK LE Q SKMDE+KGD +PVL GKFEG IGLS+S E  A DGVASNHYA  K+ISN+EAG PPNGS P PT+SFG G+K  
Subjt:  LAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAGAPPNGSPPTPTVSFGGGKKLR

Query:  TIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGSICSEEKIDHGNKLSKSDCCTI
        TIES+F+HGNRV+KQL FEE+SPSKKMAPLTPGVSRPLHGVINIDDSDDE+ +AKLP HT++KSEACVSA QDLS C SIC EEKID+GNK+++SDC T 
Subjt:  TIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGSICSEEKIDHGNKLSKSDCCTI

Query:  NVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPARRRLVSLRNYEE-GSGKENSS
        N ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTN TQV S+LN+ PTS CDKVVVSA P RRRLVSLRNY+E GS KENS+
Subjt:  NVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPARRRLVSLRNYEE-GSGKENSS

Query:  SQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPKLCMKAV
        SQ     N + D EDELEDV S   +++DS EGFIV+ SD S  GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKWEYEADMLA+FGKDP+LCMKAV
Subjt:  SQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPKLCMKAV

Query:  CALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFL
        CALYR+QTSEEK+ KETLFSNKRGFSKFDAIRGT LAEFLT+G+PQGNL KSVKELEEYDPKAI+LCR LATHYSKQLFEIYGNKEDPFFL
Subjt:  CALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFL

XP_038898046.1 uncharacterized protein LOC120085872 [Benincasa hispida]1.0e-27385.42Show/hide
Query:  KMDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELKVRIEDGKKEVVNLVNKNEMEDEVVQLLIENN
        KMDVEE+R+KVE +D+CKC+ELKLRCE+AEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIE+ELK+ IE+GKKE VNLV+KNEM+DEVVQLLIENN
Subjt:  KMDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELKVRIEDGKKEVVNLVNKNEMEDEVVQLLIENN

Query:  VLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAGAPPNGSPPTPTVSFGGGKKL
        VLAVEKKKAESEAELWKDKFKQLE QISKMDE+KGD+EPVL GKFEGGIGLSLSPEARA DGVA NHY L K+ISNLEAG PPNGS P PTVSFG GKK+
Subjt:  VLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAGAPPNGSPPTPTVSFGGGKKL

Query:  RTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGSICSEEKIDHGNKLSKSDCCT
         T+ESYFKHGNRV+K+L FEE+SP KKMAPLTPGVSRPLHGVINIDDSDDE+++AKLP   RKKSEACVSA QDLSGCGSICS+EKI   NKLS+SDCCT
Subjt:  RTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGSICSEEKIDHGNKLSKSDCCT

Query:  INVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPARRRLVSLR-NYEEGSGKENS
         NVENVE FSD+FIF+ST KR RASNIITSDTETDDDDN+PICKLK  PLQKTN TQVGSE  D PTSACDKVVVS TPARRRLV LR NYE+GSGKENS
Subjt:  INVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPARRRLVSLR-NYEEGSGKENS

Query:  SSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQSEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPKLCMKAVCA
        SSQ     N ++DVEDELEDVGSELETESDSLEGFIV DS+VS  GSTSQSEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDP+LCMKAVCA
Subjt:  SSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQSEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPKLCMKAVCA

Query:  LYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFLP
        LYR+QTSEEKV K TLFSNK+GFSKFDAIRGTSLAEFLTDG PQGNL KSVKE+EEYDP AIDLCRKLATHYSKQLFEI+GNKEDPFFLP
Subjt:  LYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFLP

TrEMBL top hitse value%identityAlignment
A0A1S3CNP3 uncharacterized protein LOC103503017 isoform X21.3e-22974.28Show/hide
Query:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELKVRIEDGKKEVVNLVNKNEMEDEVVQLLIENNV
        MDVEEVRHKVE  D+CKCSELKLRCE+AE KCEELELKIQIKKIEYELLQ QMKKLESEKSVIE ELK  +E   +                        
Subjt:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELKVRIEDGKKEVVNLVNKNEMEDEVVQLLIENNV

Query:  LAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAGAPPNGSPPTPTVSFGGGKKLR
           ++K AE+EAELWK KFK LE Q SKMDE+KGD +PVL GKFEG IGLS+S E  A DGVASNHYA  K+ISN+EAG PPNGS P PT+SFG G+K  
Subjt:  LAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAGAPPNGSPPTPTVSFGGGKKLR

Query:  TIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGSICSEEKIDHGNKLSKSDCCTI
        TIES+F+HGNRV+KQL FEE+SPSKKMAPLTPGVSRPLHGVINIDDSDDE+ +AKLP HT++KSEACVSA QDLS C SIC EEKID+GNK+++SDC T 
Subjt:  TIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGSICSEEKIDHGNKLSKSDCCTI

Query:  NVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPARRRLVSLRNYEE-GSGKENSS
        N ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTN TQV S+LN+ PTS CDKVVVSA P RRRLVSLRNY+E GS KENS+
Subjt:  NVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPARRRLVSLRNYEE-GSGKENSS

Query:  SQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPKLCMKAV
        SQ     N + D EDELEDV S   +++DS EGFIV+ SD S  GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKWEYEADMLA+FGKDP+LCMKAV
Subjt:  SQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFGKDPKLCMKAV

Query:  CALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFL
        CALYR+QTSEEK+ KETLFSNKRGFSKFDAIRGT LAEFLT+G+PQGNL KSVKELEEYDPKAI+LCR LATHYSKQLFEIYGNKEDPFFL
Subjt:  CALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFL

A0A1S3CNR9 uncharacterized protein LOC103503017 isoform X14.6e-24375.65Show/hide
Query:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELK---------------------VRIEDGKKEVV
        MDVEEVRHKVE  D+CKCSELKLRCE+AE KCEELELKIQIKKIEYELLQ QMKKLESEKSVIE ELK                     +  ED K EVV
Subjt:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELK---------------------VRIEDGKKEVV

Query:  NLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAG
        NLV+ NE+++E+ QLLIENNVLAVEKK AE+EAELWK KFK LE Q SKMDE+KGD +PVL GKFEG IGLS+S E  A DGVASNHYA  K+ISN+EAG
Subjt:  NLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAG

Query:  APPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGS
         PPNGS P PT+SFG G+K  TIES+F+HGNRV+KQL FEE+SPSKKMAPLTPGVSRPLHGVINIDDSDDE+ +AKLP HT++KSEACVSA QDLS C S
Subjt:  APPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGS

Query:  ICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPA
        IC EEKID+GNK+++SDC T N ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTN TQV S+LN+ PTS CDKVVVSA P 
Subjt:  ICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPA

Query:  RRRLVSLRNYEE-GSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW
        RRRLVSLRNY+E GS KENS+SQ     N + D EDELEDV S   +++DS EGFIV+ SD S  GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKW
Subjt:  RRRLVSLRNYEE-GSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW

Query:  EYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLF
        EYEADMLA+FGKDP+LCMKAVCALYR+QTSEEK+ KETLFSNKRGFSKFDAIRGT LAEFLT+G+PQGNL KSVKELEEYDPKAI+LCR LATHYSKQLF
Subjt:  EYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLF

Query:  EIYGNKEDPFFL
        EIYGNKEDPFFL
Subjt:  EIYGNKEDPFFL

A0A5A7UA97 Protein IWS1-like protein A isoform X15.8e-23874.84Show/hide
Query:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELK---------------------VRIEDGKKEVV
        MDVEEVRHKVE  D+CKCSELKLRCE+AE KCEELELKIQIKKIEYELLQ QMKKLESEKSVIE ELK                     +  ED K EVV
Subjt:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELK---------------------VRIEDGKKEVV

Query:  NLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAG
        NLV+ NE+++E+ QLLIENNVLAVEKK AE+EAELWK KFK LE Q SKMDE+KGD +PVL GKFEG IGLS+S E  A DGVASNHYA  K+I      
Subjt:  NLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAG

Query:  APPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGS
         PPNGS P PT+SFG G+K  TIES+F+HGNRV+KQL FEE+SPSKKMAPLTPGVSRPLHGVINIDDSDDE+ +AKLP HT++KSEACVSA QDLS C S
Subjt:  APPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGS

Query:  ICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPA
        IC EEKID+GNK+++SDC T N ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTN TQV S+LN+ PTS CDKVVVSA P 
Subjt:  ICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPA

Query:  RRRLVSLRNYEE-GSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW
        RRRLVSLRNY+E GS KENS+SQ     N + D EDELEDV S   +++DS EGFIV+ SD S  GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKW
Subjt:  RRRLVSLRNYEE-GSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW

Query:  EYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLF
        EYEADMLA+FGKDP+LCMKAVCALYR+QTSEEK+ KETLFSNKRGFSKFDAIRGT LAEFLT+G+PQGNL KSVKELEEYDPKAI+LCR LATHYSKQLF
Subjt:  EYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLF

Query:  EIYGNKEDPFFL
        EIYGNKEDPFFL
Subjt:  EIYGNKEDPFFL

A0A5D3BWC5 Protein IWS1-like protein A isoform X13.5e-23574.35Show/hide
Query:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAEL---------------------KVRIEDGKKEVV
        MDVEEVRHKVE  D+CKCSELKLRCE+AE KCEELELKIQIKKIEYELLQ QMKKLESEKSVIE EL                     K+  ED K EVV
Subjt:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAEL---------------------KVRIEDGKKEVV

Query:  NLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAG
        NLV+ NE+++E+ +LLIENNVLAVEKK AE+EAELWK KFK LE Q SKMDE+KGD +PVL GKFEG IGLS+S E  A DGVASNHYA           
Subjt:  NLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKISNLEAG

Query:  APPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGS
         PPNGS   PTVSFG G+K  TIES+F+HGNRV+KQL FEE+SPSKKMAPLTPGVSRPLHGVINIDDSDDE+ +AKLP HT++KSEACVSA QDLS C S
Subjt:  APPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTRKKSEACVSATQDLSGCGS

Query:  ICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPA
        IC EEKID+GNK+++SDC T N ENVEA SDDFI NST KRKRA NI+TSDTETDDDDNVPI KLK   LQKTN TQV S+LN+ PTS CDKVVVSA P 
Subjt:  ICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPA

Query:  RRRLVSLRNYEE-GSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW
        RRRLVSLRNY+E GS KENS+SQ     N + D EDELEDV S   +++DS EGFIV+ SD S  GSTSQ   SEDLSNG EGYDEI+SKLGRRKKELKW
Subjt:  RRRLVSLRNYEE-GSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQ---SEDLSNGSEGYDEIISKLGRRKKELKW

Query:  EYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLF
        EYEADMLA+FGKDP+LCMKAVCALYR+QTSEEK+ KETLFSNKRGFSKFDAIRGT LAEFLT+G+PQGNL KSVKELEEYDPKAI+LCR LATHYSKQLF
Subjt:  EYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLF

Query:  EIYGNKEDPFFL
        EIYGNKEDPFFL
Subjt:  EIYGNKEDPFFL

A0A6J1CI11 uncharacterized protein LOC1110116871.3e-22471.61Show/hide
Query:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELKV---------------------RIEDGKKE-V
        MD+ EVRHKVEQSD+C+CSELKLRCERAEEKCEELE+++Q++KIE ELLQ QMKKL+SEK  IE ELKV                       ED KKE  
Subjt:  MDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQIKKIEYELLQAQMKKLESEKSVIEAELKV---------------------RIEDGKKE-V

Query:  VNLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEK---KGDDE-PVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKIS
        VNL++KNE+EDEVVQL+IENNVLA EKK AESEAELWKDKFK+LE QI K+D     KGD+   VL G FEGG    LSPEAR   GVAS  Y+L K I 
Subjt:  VNLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEK---KGDDE-PVLYGKFEGGIGLSLSPEARALDGVASNHYALGKKIS

Query:  NLEAGAPPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPC-HTRKKSEACVSATQD
        +L+AG PPNGSP   TVS G G K   IESYFKHGNR ++QL FEE++PS+KMAPLTP VSRP+HG+INIDDSDDE  +AKLP  +++KKSE C SA Q 
Subjt:  NLEAGAPPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPC-HTRKKSEACVSATQD

Query:  LSGCGSICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGP----TSAC
        LSGC S  SEEKI +GN+         N EN+EAFSDDF F+ST KRKRASNIITSDTETDDDDNVPICKLKG PLQKTN TQ  SELN+GP    T AC
Subjt:  LSGCGSICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGP----TSAC

Query:  DKVVVSATPARRRLVSLRN-YEEGSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQSEDLSNGSEGYDEIISKLGR
        DKV +SATPARRRLV LRN YE+GSGK+NSSSQ     N +EDV D LEDVGSELETE +SL+GFIVD SDVS  GS SQSEDLSNGSE YDEII KLGR
Subjt:  DKVVVSATPARRRLVSLRN-YEEGSGKENSSSQ-----NDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQSEDLSNGSEGYDEIISKLGR

Query:  RKKELKWEYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLAT
        R+KELKWEYEADMLAAFGKDP+LCMK+VCALYR+QT EEKV KETLF NKRGFSKFDAI+GT LAEFLTDG+PQG+L KSVKELEEYDPKAIDLCRKLAT
Subjt:  RKKELKWEYEADMLAAFGKDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLAT

Query:  HYSKQLFEIYGNKEDPFFLP
        HYSKQLFEIYG KEDPFFLP
Subjt:  HYSKQLFEIYGNKEDPFFLP

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G53220.1 unknown protein3.3e-5247Show/hide
Query:  STRKRKRA-SNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPARRRLVSLRNYEEGSGKENSSSQNDVEDV-------
        S RKRKR  ++    D + DD+DN+PI  LK   L+ TN  Q  S+L D P     +   S   + +R VS R  ++   +E S+S   +  +       
Subjt:  STRKRKRA-SNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPARRRLVSLRNYEEGSGKENSSSQNDVEDV-------

Query:  EDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQSE---DLSNGSEGYDEIISKLGRRKK--ELKWEYEADMLAAFGKDPKLCMKAVCALYRRQTSEE
        +DE E+ GS  E+E +SL+GFI+DD D   + S    E   + S+G  GY +++S+L R KK  + KWEYEADMLA FGKDP+LCM+AVC L+R QT +E
Subjt:  EDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQSE---DLSNGSEGYDEIISKLGRRKK--ELKWEYEADMLAAFGKDPKLCMKAVCALYRRQTSEE

Query:  KVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFF-LPP
        KV + +  SN RGFSK DA+RGTS+A FLTDG+  G++KKSV+EL+ +D K ++ C +LA  YSKQLF+IY N+EDPFF LPP
Subjt:  KVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFF-LPP

AT5G53220.2 unknown protein3.3e-5247Show/hide
Query:  STRKRKRA-SNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPARRRLVSLRNYEEGSGKENSSSQNDVEDV-------
        S RKRKR  ++    D + DD+DN+PI  LK   L+ TN  Q  S+L D P     +   S   + +R VS R  ++   +E S+S   +  +       
Subjt:  STRKRKRA-SNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPARRRLVSLRNYEEGSGKENSSSQNDVEDV-------

Query:  EDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQSE---DLSNGSEGYDEIISKLGRRKK--ELKWEYEADMLAAFGKDPKLCMKAVCALYRRQTSEE
        +DE E+ GS  E+E +SL+GFI+DD D   + S    E   + S+G  GY +++S+L R KK  + KWEYEADMLA FGKDP+LCM+AVC L+R QT +E
Subjt:  EDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQSE---DLSNGSEGYDEIISKLGRRKK--ELKWEYEADMLAAFGKDPKLCMKAVCALYRRQTSEE

Query:  KVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFF-LPP
        KV + +  SN RGFSK DA+RGTS+A FLTDG+  G++KKSV+EL+ +D K ++ C +LA  YSKQLF+IY N+EDPFF LPP
Subjt:  KVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFF-LPP

AT5G53220.3 unknown protein3.3e-5247Show/hide
Query:  STRKRKRA-SNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPARRRLVSLRNYEEGSGKENSSSQNDVEDV-------
        S RKRKR  ++    D + DD+DN+PI  LK   L+ TN  Q  S+L D P     +   S   + +R VS R  ++   +E S+S   +  +       
Subjt:  STRKRKRA-SNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDKVVVSATPARRRLVSLRNYEEGSGKENSSSQNDVEDV-------

Query:  EDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQSE---DLSNGSEGYDEIISKLGRRKK--ELKWEYEADMLAAFGKDPKLCMKAVCALYRRQTSEE
        +DE E+ GS  E+E +SL+GFI+DD D   + S    E   + S+G  GY +++S+L R KK  + KWEYEADMLA FGKDP+LCM+AVC L+R QT +E
Subjt:  EDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQSE---DLSNGSEGYDEIISKLGRRKK--ELKWEYEADMLAAFGKDPKLCMKAVCALYRRQTSEE

Query:  KVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFF-LPP
        KV + +  SN RGFSK DA+RGTS+A FLTDG+  G++KKSV+EL+ +D K ++ C +LA  YSKQLF+IY N+EDPFF LPP
Subjt:  KVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFF-LPP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTCCTGATCACTCGGAGCAATTTCATTACACCTTTTTCTAAACCCTCGCAATCCCAATCCCCATTAACAATTTGCCCGAACCCATTTGATTTTTACGTATACGCTTC
TCTTTCTTCGCTCATTTCCCTGTTATTTTATTCTATTCTTACCTGTTTGGTTCCCGAGAAAACTATGGAACAAGAAGACAAACAGAAAATCAAAAAGATGGACGTGGAGG
AAGTTAGACATAAAGTAGAACAAAGCGATCTATGTAAGTGCAGTGAACTGAAATTGAGGTGTGAAAGAGCTGAAGAAAAGTGTGAAGAGTTGGAATTGAAGATTCAGATA
AAGAAGATTGAGTATGAATTACTACAGGCCCAAATGAAGAAGTTGGAGTCAGAAAAGTCAGTCATTGAAGCAGAGCTAAAAGTTCGGATTGAAGATGGAAAGAAAGAAGT
TGTGAATTTAGTTAATAAGAACGAAATGGAAGATGAGGTTGTTCAGCTTCTGATTGAGAATAATGTTTTGGCTGTTGAGAAGAAGAAGGCTGAAAGTGAGGCTGAGCTGT
GGAAAGATAAGTTCAAACAATTGGAGAATCAGATCTCAAAGATGGATGAAAAGAAAGGTGATGACGAGCCAGTGCTTTATGGGAAGTTTGAAGGGGGAATTGGATTATCA
TTGAGTCCTGAAGCTCGTGCTTTGGATGGTGTGGCCTCTAACCATTATGCTCTTGGTAAAAAAATTAGTAACTTGGAAGCAGGAGCACCTCCAAATGGCTCACCGCCCAC
TCCTACCGTTTCATTTGGGGGAGGAAAAAAACTTCGTACAATAGAATCTTATTTCAAGCATGGAAATCGAGTTAAAAAACAGTTGAGATTTGAGGAACAGAGCCCCAGCA
AAAAGATGGCACCATTAACTCCAGGTGTTTCTAGACCTTTACATGGCGTCATTAACATTGATGATAGTGATGATGAACTAACTGTTGCCAAATTGCCCTGTCATACTCGG
AAAAAGAGTGAGGCTTGTGTTTCAGCCACTCAAGATTTAAGTGGATGTGGATCTATTTGTTCTGAGGAGAAAATAGACCATGGAAACAAGTTGAGCAAGTCTGATTGCTG
TACAATAAATGTGGAAAATGTTGAAGCTTTTAGTGATGATTTTATCTTTAATTCAACACGCAAAAGGAAACGAGCTTCTAACATTATAACCAGCGACACTGAAACTGATG
ATGACGATAATGTTCCAATCTGCAAACTGAAGGGATCGCCTCTTCAGAAAACAAACTCTACACAAGTAGGTTCAGAATTGAACGATGGCCCTACTTCTGCATGTGACAAA
GTTGTGGTTAGTGCAACCCCTGCAAGGAGGCGTTTAGTCTCATTAAGAAATTACGAGGAAGGAAGTGGAAAGGAGAACTCTTCAAGTCAGAACGACGTTGAGGATGTTGA
AGATGAACTGGAAGATGTTGGGTCAGAGCTAGAGACCGAGAGTGATAGCTTGGAAGGATTTATTGTTGACGACTCTGATGTGTCGGCCGCTGGAAGCACTTCTCAATCAG
AAGATTTATCAAATGGCAGTGAAGGATATGATGAGATAATATCTAAGCTAGGAAGGAGAAAAAAAGAGTTGAAATGGGAGTATGAGGCTGATATGCTTGCAGCTTTTGGC
AAGGATCCTAAATTATGTATGAAGGCTGTGTGTGCTCTATATAGACGGCAAACTTCCGAGGAGAAAGTAAGGAAAGAAACGTTGTTTTCTAACAAGAGAGGATTCAGCAA
GTTCGATGCTATCAGGGGCACCAGTTTGGCTGAGTTTCTCACTGATGGCGAACCACAAGGTAATCTCAAAAAATCTGTTAAAGAGTTGGAGGAGTATGACCCAAAGGCCA
TTGACCTCTGCAGAAAGTTAGCGACACATTACTCCAAGCAGTTGTTCGAGATATATGGAAATAAAGAAGATCCTTTCTTTTTACCTCCATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTCCTGATCACTCGGAGCAATTTCATTACACCTTTTTCTAAACCCTCGCAATCCCAATCCCCATTAACAATTTGCCCGAACCCATTTGATTTTTACGTATACGCTTC
TCTTTCTTCGCTCATTTCCCTGTTATTTTATTCTATTCTTACCTGTTTGGTTCCCGAGAAAACTATGGAACAAGAAGACAAACAGAAAATCAAAAAGATGGACGTGGAGG
AAGTTAGACATAAAGTAGAACAAAGCGATCTATGTAAGTGCAGTGAACTGAAATTGAGGTGTGAAAGAGCTGAAGAAAAGTGTGAAGAGTTGGAATTGAAGATTCAGATA
AAGAAGATTGAGTATGAATTACTACAGGCCCAAATGAAGAAGTTGGAGTCAGAAAAGTCAGTCATTGAAGCAGAGCTAAAAGTTCGGATTGAAGATGGAAAGAAAGAAGT
TGTGAATTTAGTTAATAAGAACGAAATGGAAGATGAGGTTGTTCAGCTTCTGATTGAGAATAATGTTTTGGCTGTTGAGAAGAAGAAGGCTGAAAGTGAGGCTGAGCTGT
GGAAAGATAAGTTCAAACAATTGGAGAATCAGATCTCAAAGATGGATGAAAAGAAAGGTGATGACGAGCCAGTGCTTTATGGGAAGTTTGAAGGGGGAATTGGATTATCA
TTGAGTCCTGAAGCTCGTGCTTTGGATGGTGTGGCCTCTAACCATTATGCTCTTGGTAAAAAAATTAGTAACTTGGAAGCAGGAGCACCTCCAAATGGCTCACCGCCCAC
TCCTACCGTTTCATTTGGGGGAGGAAAAAAACTTCGTACAATAGAATCTTATTTCAAGCATGGAAATCGAGTTAAAAAACAGTTGAGATTTGAGGAACAGAGCCCCAGCA
AAAAGATGGCACCATTAACTCCAGGTGTTTCTAGACCTTTACATGGCGTCATTAACATTGATGATAGTGATGATGAACTAACTGTTGCCAAATTGCCCTGTCATACTCGG
AAAAAGAGTGAGGCTTGTGTTTCAGCCACTCAAGATTTAAGTGGATGTGGATCTATTTGTTCTGAGGAGAAAATAGACCATGGAAACAAGTTGAGCAAGTCTGATTGCTG
TACAATAAATGTGGAAAATGTTGAAGCTTTTAGTGATGATTTTATCTTTAATTCAACACGCAAAAGGAAACGAGCTTCTAACATTATAACCAGCGACACTGAAACTGATG
ATGACGATAATGTTCCAATCTGCAAACTGAAGGGATCGCCTCTTCAGAAAACAAACTCTACACAAGTAGGTTCAGAATTGAACGATGGCCCTACTTCTGCATGTGACAAA
GTTGTGGTTAGTGCAACCCCTGCAAGGAGGCGTTTAGTCTCATTAAGAAATTACGAGGAAGGAAGTGGAAAGGAGAACTCTTCAAGTCAGAACGACGTTGAGGATGTTGA
AGATGAACTGGAAGATGTTGGGTCAGAGCTAGAGACCGAGAGTGATAGCTTGGAAGGATTTATTGTTGACGACTCTGATGTGTCGGCCGCTGGAAGCACTTCTCAATCAG
AAGATTTATCAAATGGCAGTGAAGGATATGATGAGATAATATCTAAGCTAGGAAGGAGAAAAAAAGAGTTGAAATGGGAGTATGAGGCTGATATGCTTGCAGCTTTTGGC
AAGGATCCTAAATTATGTATGAAGGCTGTGTGTGCTCTATATAGACGGCAAACTTCCGAGGAGAAAGTAAGGAAAGAAACGTTGTTTTCTAACAAGAGAGGATTCAGCAA
GTTCGATGCTATCAGGGGCACCAGTTTGGCTGAGTTTCTCACTGATGGCGAACCACAAGGTAATCTCAAAAAATCTGTTAAAGAGTTGGAGGAGTATGACCCAAAGGCCA
TTGACCTCTGCAGAAAGTTAGCGACACATTACTCCAAGCAGTTGTTCGAGATATATGGAAATAAAGAAGATCCTTTCTTTTTACCTCCATGA
Protein sequenceShow/hide protein sequence
MFLITRSNFITPFSKPSQSQSPLTICPNPFDFYVYASLSSLISLLFYSILTCLVPEKTMEQEDKQKIKKMDVEEVRHKVEQSDLCKCSELKLRCERAEEKCEELELKIQI
KKIEYELLQAQMKKLESEKSVIEAELKVRIEDGKKEVVNLVNKNEMEDEVVQLLIENNVLAVEKKKAESEAELWKDKFKQLENQISKMDEKKGDDEPVLYGKFEGGIGLS
LSPEARALDGVASNHYALGKKISNLEAGAPPNGSPPTPTVSFGGGKKLRTIESYFKHGNRVKKQLRFEEQSPSKKMAPLTPGVSRPLHGVINIDDSDDELTVAKLPCHTR
KKSEACVSATQDLSGCGSICSEEKIDHGNKLSKSDCCTINVENVEAFSDDFIFNSTRKRKRASNIITSDTETDDDDNVPICKLKGSPLQKTNSTQVGSELNDGPTSACDK
VVVSATPARRRLVSLRNYEEGSGKENSSSQNDVEDVEDELEDVGSELETESDSLEGFIVDDSDVSAAGSTSQSEDLSNGSEGYDEIISKLGRRKKELKWEYEADMLAAFG
KDPKLCMKAVCALYRRQTSEEKVRKETLFSNKRGFSKFDAIRGTSLAEFLTDGEPQGNLKKSVKELEEYDPKAIDLCRKLATHYSKQLFEIYGNKEDPFFLPP