; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS007067 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS007067
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionProtein of unknown function, DUF547
Genome locationscaffold430:285720..290694
RNA-Seq ExpressionMS007067
SyntenyMS007067
Gene Ontology termsNA
InterPro domainsIPR006869 - Domain of unknown function DUF547
IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008437070.1 PREDICTED: uncharacterized protein LOC103482606 isoform X2 [Cucumis melo]4.3e-29087.77Show/hide
Query:  MSGFD--MRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLEN
        MSG D  MRGEE+ +GKRELRDYLASQRVH+RHRRSRSSSD+NSN FRGG LHSN KND+SDAQASPLSTSGIRA+SPLHE ST FNDNSS+K RASLEN
Subjt:  MSGFD--MRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSV
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFE CVSKPSSQQNSVTASPAHGKHESRKHPS+
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSV

Query:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVA
        ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS+I+TGKT SGT+KVREA SQ+KRTSLR+LKDHLFECPSKLSEEMVRCMA IYCSLHRVA
Subjt:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVA

Query:  SNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSS
        SNKAQKK GSFP VKQPQ  P+EEQ   GKAMLEIH ISTNNSQFSRAS+AIN +RVLVEQLEKVNVSKM IDAQTAFWINVYNAL+MHAYLAYGIP  S
Subjt:  SNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSS

Query:  LRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV
        LRRLALFHK            AAYNIGGHIISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV
Subjt:  LRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV

Query:  YTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLT
        YTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLERFAREASI  DEL K VS+NVD +L +SIQKCM+HR GKK SQIIEWLPYSSRFRYVFSTNLT
Subjt:  YTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLT

Query:  EKTWW
        E+ WW
Subjt:  EKTWW

XP_011654811.1 uncharacterized protein LOC101204173 isoform X1 [Cucumis sativus]3.0e-29187.93Show/hide
Query:  MSGFD--MRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLEN
        MSG D  MRGEE+ +GKRELRDYLASQRVH+RHRRSRSSSD+NSN FRG  LHSN KND+SDAQASPLSTSGIRA+SPLHE ST FNDNSS+K RASLEN
Subjt:  MSGFD--MRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSV
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH AQTKDLI+EIELLEEEVANREQHVLSLYRSIFE CVSKPSSQQNSVTASPAHGKHESRKHPS+
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSV

Query:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVA
        ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGPNSLFG KS+I+TGKT SGT+KVREA SQ+KRTSLR+LKDHLFECPSKLSEEMVRCMA IYCSLHRVA
Subjt:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVA

Query:  SNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSS
        SNKAQKK GSFP VKQPQCGP+EEQ   GKAMLEIH ISTNNSQFSRAS+AIN +RVLVEQLEKVNVSKM IDAQTAFWINVYNAL+MHAYLAYGIP  S
Subjt:  SNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSS

Query:  LRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV
        LRRLALFHK            AAYNIGGHIISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV
Subjt:  LRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV

Query:  YTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLT
        YTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASI  DEL K VSENVD +L +SIQKCM+HR GKK SQIIEWLPYSSRFRYVFSTNLT
Subjt:  YTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLT

Query:  EKTWW
        E+ WW
Subjt:  EKTWW

XP_022154827.1 uncharacterized protein LOC111021990 isoform X1 [Momordica charantia]0.0e+0096.69Show/hide
Query:  MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI
        MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI
Subjt:  MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI

Query:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSVIS
        ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSVIS
Subjt:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSVIS

Query:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVASN
        SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVRE ISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVASN
Subjt:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVASN

Query:  KAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLR
        KAQKKRGS PDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINT+RVLVEQLEKVNVSKMEIDAQTAFWINVYNAL+MHAYLAYGIPQSSLR
Subjt:  KAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLR

Query:  RLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYT
        RLALFHK            AAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYT
Subjt:  RLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYT

Query:  ASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLTEK
        ASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASI PDELLK+VS+NVDVELHDSIQKCMDHR GKKASQIIEWLPYSSRFRYVFSTNLTEK
Subjt:  ASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLTEK

Query:  TWWP
        TWWP
Subjt:  TWWP

XP_022154828.1 uncharacterized protein LOC111021990 isoform X2 [Momordica charantia]0.0e+0095.7Show/hide
Query:  MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI
        MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI
Subjt:  MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI

Query:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSVIS
        ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSVIS
Subjt:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSVIS

Query:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVASN
        SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSK       VKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVASN
Subjt:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVASN

Query:  KAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLR
        KAQKKRGS PDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINT+RVLVEQLEKVNVSKMEIDAQTAFWINVYNAL+MHAYLAYGIPQSSLR
Subjt:  KAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLR

Query:  RLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYT
        RLALFHK            AAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYT
Subjt:  RLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYT

Query:  ASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLTEK
        ASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASI PDELLK+VS+NVDVELHDSIQKCMDHR GKKASQIIEWLPYSSRFRYVFSTNLTEK
Subjt:  ASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLTEK

Query:  TWWP
        TWWP
Subjt:  TWWP

XP_038874743.1 uncharacterized protein LOC120067282 isoform X1 [Benincasa hispida]4.2e-29388.6Show/hide
Query:  MSGFD--MRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLEN
        MSGFD  MRGEE+ + KRELRD+LASQRVH+ HRRSRSSSDRNSNVFRGGVLHS+ KND+SDAQASPLSTSGIRA+SPLHESS   NDNSSSK RASLEN
Subjt:  MSGFD--MRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSV
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFE CVSKPSSQQNSVTASPAHGKHESRKHPS+
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSV

Query:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVA
        ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS+++TGKT SGT+KVREA SQVKR SLRTLKDHLFECPSKLSEEMVRCMA IYCSLHR A
Subjt:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVA

Query:  SNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSS
        SNKAQKK GSFP VKQPQCGP+EEQ    KAMLEIH IST+NSQFSRAS+AIN +RVLVEQLEKVNVSKM IDAQTAFWINVYNAL+MHAYLAYGIP  S
Subjt:  SNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSS

Query:  LRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV
        LRRLALFHK            AAYNIGGHIISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV
Subjt:  LRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV

Query:  YTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLT
        YTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASI  DELLK VSENVD +LH+SIQKCM+HR GKKASQIIEWLPYSSRFRYVFSTNLT
Subjt:  YTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLT

Query:  EKTWW
        EK WW
Subjt:  EKTWW

TrEMBL top hitse value%identityAlignment
A0A1S3AT95 uncharacterized protein LOC103482606 isoform X22.1e-29087.77Show/hide
Query:  MSGFD--MRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLEN
        MSG D  MRGEE+ +GKRELRDYLASQRVH+RHRRSRSSSD+NSN FRGG LHSN KND+SDAQASPLSTSGIRA+SPLHE ST FNDNSS+K RASLEN
Subjt:  MSGFD--MRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSV
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFE CVSKPSSQQNSVTASPAHGKHESRKHPS+
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSV

Query:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVA
        ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS+I+TGKT SGT+KVREA SQ+KRTSLR+LKDHLFECPSKLSEEMVRCMA IYCSLHRVA
Subjt:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVA

Query:  SNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSS
        SNKAQKK GSFP VKQPQ  P+EEQ   GKAMLEIH ISTNNSQFSRAS+AIN +RVLVEQLEKVNVSKM IDAQTAFWINVYNAL+MHAYLAYGIP  S
Subjt:  SNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSS

Query:  LRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV
        LRRLALFHK            AAYNIGGHIISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV
Subjt:  LRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV

Query:  YTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLT
        YTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLERFAREASI  DEL K VS+NVD +L +SIQKCM+HR GKK SQIIEWLPYSSRFRYVFSTNLT
Subjt:  YTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLT

Query:  EKTWW
        E+ WW
Subjt:  EKTWW

A0A1S4E657 uncharacterized protein LOC103482606 isoform X11.4e-28684.69Show/hide
Query:  MSGFD--MRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLEN
        MSG D  MRGEE+ +GKRELRDYLASQRVH+RHRRSRSSSD+NSN FRGG LHSN KND+SDAQASPLSTSGIRA+SPLHE ST FNDNSS+K RASLEN
Subjt:  MSGFD--MRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA----------------------QTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSS
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA                      QTKDLI+EIELLEEEVANREQHVLSLYRSIFE CVSKPSS
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA----------------------QTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSS

Query:  QQNSVTASPAHGKHESRKHPSVISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECP
        QQNSVTASPAHGKHESRKHPS+ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS+I+TGKT SGT+KVREA SQ+KRTSLR+LKDHLFECP
Subjt:  QQNSVTASPAHGKHESRKHPSVISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECP

Query:  SKLSEEMVRCMADIYCSLHRVASNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAF
        SKLSEEMVRCMA IYCSLHRVASNKAQKK GSFP VKQPQ  P+EEQ   GKAMLEIH ISTNNSQFSRAS+AIN +RVLVEQLEKVNVSKM IDAQTAF
Subjt:  SKLSEEMVRCMADIYCSLHRVASNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAF

Query:  WINVYNALIMHAYLAYGIPQSSLRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLP
        WINVYNAL+MHAYLAYGIP  SLRRLALFHK            AAYNIGGHIISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGLP
Subjt:  WINVYNALIMHAYLAYGIPQSSLRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLP

Query:  SPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKA
        SPQPLVCFGLCTGASSDPVLKVYTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLERFAREASI  DEL K VS+NVD +L +SIQKCM+HR GKK 
Subjt:  SPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKA

Query:  SQIIEWLPYSSRFRYVFSTNLTEKTWW
        SQIIEWLPYSSRFRYVFSTNLTE+ WW
Subjt:  SQIIEWLPYSSRFRYVFSTNLTEKTWW

A0A6J1DLC9 uncharacterized protein LOC111021990 isoform X20.0e+0095.7Show/hide
Query:  MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI
        MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI
Subjt:  MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI

Query:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSVIS
        ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSVIS
Subjt:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSVIS

Query:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVASN
        SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSK       VKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVASN
Subjt:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVASN

Query:  KAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLR
        KAQKKRGS PDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINT+RVLVEQLEKVNVSKMEIDAQTAFWINVYNAL+MHAYLAYGIPQSSLR
Subjt:  KAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLR

Query:  RLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYT
        RLALFHK            AAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYT
Subjt:  RLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYT

Query:  ASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLTEK
        ASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASI PDELLK+VS+NVDVELHDSIQKCMDHR GKKASQIIEWLPYSSRFRYVFSTNLTEK
Subjt:  ASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLTEK

Query:  TWWP
        TWWP
Subjt:  TWWP

A0A6J1DMR5 uncharacterized protein LOC111021990 isoform X10.0e+0096.69Show/hide
Query:  MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI
        MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI
Subjt:  MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI

Query:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSVIS
        ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSVIS
Subjt:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSVIS

Query:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVASN
        SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVRE ISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVASN
Subjt:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVASN

Query:  KAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLR
        KAQKKRGS PDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINT+RVLVEQLEKVNVSKMEIDAQTAFWINVYNAL+MHAYLAYGIPQSSLR
Subjt:  KAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLR

Query:  RLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYT
        RLALFHK            AAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYT
Subjt:  RLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYT

Query:  ASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLTEK
        ASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASI PDELLK+VS+NVDVELHDSIQKCMDHR GKKASQIIEWLPYSSRFRYVFSTNLTEK
Subjt:  ASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLTEK

Query:  TWWP
        TWWP
Subjt:  TWWP

A0A6J1E2A4 uncharacterized protein LOC1114301779.7e-28085.1Show/hide
Query:  MSGFDM--RGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLEN
        MSGFDM  RGEE  +G RELRDYLAS  VHARHRRSRSSSDRNSNV RGGVLHSN KN +SD QASPLSTSGIRA+SPLHE +T FNDNS SKHRASLEN
Subjt:  MSGFDM--RGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSV
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFE CVSK SSQQ+SVT SPAHGKHES+KHPS+
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSV

Query:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVA
        ISSAFCSS+KFPLGPLQPFSVNDLGKR SNAGPNSL GGK +I+TGK  SG +KVREA+S VK+TSLRTLKDHLFECPSKLSEEMVRCMA IYCSLHRVA
Subjt:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVA

Query:  SNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSS
        SN A+KK  SF  VK+P+ GP+EEQC   KAMLEIH ISTNN+QFSRAS+AIN +RVLVEQLEKVNVSKMEIDAQTAFWINVYNAL+MHAYLAYGIP  S
Subjt:  SNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSS

Query:  LRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV
        LRRLALFHK            AAYNIGG IISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGLPS QPLVCFGLCTGASSDPVLKV
Subjt:  LRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKV

Query:  YTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLT
        YTASNVKEELE+AKR+FLQANIVVKKSKKVFLPKVLERFAREASI  DEL K +SENVD +LH+SIQKCMD + GKKAS IIEWLPYSSRFRYVFSTNLT
Subjt:  YTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLT

Query:  EKTW
        EK W
Subjt:  EKTW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G47380.1 Protein of unknown function, DUF5474.5e-15251.47Show/hide
Query:  MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI
        M GFD+   + G  +R   D       H   R   +SS+R+ +    G   S   N+ +  QAS + T+  +   PLH       +N SS  RASLE D+
Subjt:  MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDI

Query:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA-QTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHG-KHESRKH-PS
        E L LRLQQE+SMR +LERAMGRASS+LSPGHRH A Q  +LITEIELLE EV NRE HVLSLYRSIFEQ VS+  S+Q+S  +SPAH  K   RK  P+
Subjt:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA-QTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHG-KHESRKH-PS

Query:  VISSAFCSSKKFPLGPLQPF-SVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSK---VREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIY--
        VIS+AFCSS  FPL P     ++ D  ++TS    +S F  ++ I +  + S  +K   ++++++ VK  S RTLKDHL++CP+KLSE+MV+CM+ +Y  
Subjt:  VISSAFCSSKKFPLGPLQPF-SVNDLGKRTSNAGPNSLFGGKSNINTGKTSSGTSK---VREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIY--

Query:  --CSLHRVASNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAY
          CS       K    R S  +V  P+    E++  S ++M+E+ WIS++  +FS+ ++AIN +R+LVEQLE+V +++ME +A+ AFWIN+YNAL+MHAY
Subjt:  --CSLHRVASNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAY

Query:  LAYGIPQSSLRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTG
        LAYG+P  SLRRLALFHK            +AYNIGGHII+AN IE SIFCF+TPR G WLETIISTALRKK  E++  + S   L  P+PLVCF LC G
Subjt:  LAYGIPQSSLRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTG

Query:  ASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHR-IGKKASQIIEWLPYSSR
        A SDPVLK YTASNVKEEL+ +KR+FL AN+VVK  KKV LPK++ERF +EAS+  D+L++ + +N D +L +SIQKC+  +   KKASQ++EWLPYSS+
Subjt:  ASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHR-IGKKASQIIEWLPYSSR

Query:  FRYVFSTNLTEK
        FRYVFS +L EK
Subjt:  FRYVFSTNLTEK

AT5G66600.1 Protein of unknown function, DUF5471.3e-5528.82Show/hide
Query:  RVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDIELLQLRLQQERSMRSMLERAMG--R
        R+   H+RS+S+S               KK  + D  ++    +  R +  +  S+   ++   S    SL+ +I  L+ RLQ +  +R  LE+A+G   
Subjt:  RVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDIELLQLRLQQERSMRSMLERAMG--R

Query:  ASSTLSPGHRHLAQTK---DLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSVISSAF
        ASS +      +A  K   DLI ++ +LE EV + EQ++LSLYR  FEQ +S  S + +N    SP                   K +    P ++    
Subjt:  ASSTLSPGHRHLAQTK---DLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSVISSAF

Query:  CSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFG----------GKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCS
          SKK  +  +    ++   +R+ +    S FG          GK++ +              IS  +    R + DH+ E P+KLSE MV+CM++IYC 
Subjt:  CSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFG----------GKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCS

Query:  -------LHR-VASNKAQKKRGSFP-----DVKQPQCGPLE--------------EQCVSG--KAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVN
               LHR ++S  +     +F      D   P  G                 E+  SG   +++E+  I  +  + S     +  F+ L+ +LE+V+
Subjt:  -------LHR-VASNKAQKKRGSFP-----DVKQPQCGPLE--------------EQCVSG--KAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVN

Query:  VSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGE
          K++ + + AFWINV+NAL+MHA+LAYGIPQ++++R            +L +++AAYNIGGH ISA AI+ SI   K    G WL  + ++  + K+G+
Subjt:  VSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGE

Query:  ERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSI
        ER   +    +  P+PL+ F L +G+ SDP ++VYT   +++ELE +K ++++ N+ ++K +++ LPK++E FA+++ +CP  L + V+ ++       +
Subjt:  ERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSI

Query:  QKCMDHRIGKKASQIIEWLPYSSRFRYV
        ++C       K  + I+W+P+S  FRY+
Subjt:  QKCMDHRIGKKASQIIEWLPYSSRFRYV

AT5G66600.2 Protein of unknown function, DUF5471.3e-5529.17Show/hide
Query:  KKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDIELLQLRLQQERSMRSMLERAMG--RASSTLSPGHRHLAQTK---DLITEIELL
        KK  + D  ++    +  R +  +  S+   ++   S    SL+ +I  L+ RLQ +  +R  LE+A+G   ASS +      +A  K   DLI ++ +L
Subjt:  KKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDIELLQLRLQQERSMRSMLERAMG--RASSTLSPGHRHLAQTK---DLITEIELL

Query:  EEEVANREQHVLSLYRSIFEQCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSVISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGP
        E EV + EQ++LSLYR  FEQ +S  S + +N    SP                   K +    P ++      SKK  +  +    ++   +R+ +   
Subjt:  EEEVANREQHVLSLYRSIFEQCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSVISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGP

Query:  NSLFG----------GKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCS-------LHR-VASNKAQKKRGSFP---
         S FG          GK++ +              IS  +    R + DH+ E P+KLSE MV+CM++IYC        LHR ++S  +     +F    
Subjt:  NSLFG----------GKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCS-------LHR-VASNKAQKKRGSFP---

Query:  --DVKQPQCGPLE--------------EQCVSG--KAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAY
          D   P  G                 E+  SG   +++E+  I  +  + S     +  F+ L+ +LE+V+  K++ + + AFWINV+NAL+MHA+LAY
Subjt:  --DVKQPQCGPLE--------------EQCVSG--KAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAY

Query:  GIPQSSLRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASS
        GIPQ++++R            +L +++AAYNIGGH ISA AI+ SI   K    G WL  + ++  + K+G+ER   +    +  P+PL+ F L +G+ S
Subjt:  GIPQSSLRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASS

Query:  DPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYV
        DP ++VYT   +++ELE +K ++++ N+ ++K +++ LPK++E FA+++ +CP  L + V+ ++       +++C       K  + I+W+P+S  FRY+
Subjt:  DPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYV

AT5G66600.3 Protein of unknown function, DUF5471.3e-5528.82Show/hide
Query:  RVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDIELLQLRLQQERSMRSMLERAMG--R
        R+   H+RS+S+S               KK  + D  ++    +  R +  +  S+   ++   S    SL+ +I  L+ RLQ +  +R  LE+A+G   
Subjt:  RVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDIELLQLRLQQERSMRSMLERAMG--R

Query:  ASSTLSPGHRHLAQTK---DLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSVISSAF
        ASS +      +A  K   DLI ++ +LE EV + EQ++LSLYR  FEQ +S  S + +N    SP                   K +    P ++    
Subjt:  ASSTLSPGHRHLAQTK---DLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSVISSAF

Query:  CSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFG----------GKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCS
          SKK  +  +    ++   +R+ +    S FG          GK++ +              IS  +    R + DH+ E P+KLSE MV+CM++IYC 
Subjt:  CSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFG----------GKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCS

Query:  -------LHR-VASNKAQKKRGSFP-----DVKQPQCGPLE--------------EQCVSG--KAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVN
               LHR ++S  +     +F      D   P  G                 E+  SG   +++E+  I  +  + S     +  F+ L+ +LE+V+
Subjt:  -------LHR-VASNKAQKKRGSFP-----DVKQPQCGPLE--------------EQCVSG--KAMLEIHWISTNNSQFSRASFAINTFRVLVEQLEKVN

Query:  VSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGE
          K++ + + AFWINV+NAL+MHA+LAYGIPQ++++R            +L +++AAYNIGGH ISA AI+ SI   K    G WL  + ++  + K+G+
Subjt:  VSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGE

Query:  ERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSI
        ER   +    +  P+PL+ F L +G+ SDP ++VYT   +++ELE +K ++++ N+ ++K +++ LPK++E FA+++ +CP  L + V+ ++       +
Subjt:  ERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVELHDSI

Query:  QKCMDHRIGKKASQIIEWLPYSSRFRYV
        ++C       K  + I+W+P+S  FRY+
Subjt:  QKCMDHRIGKKASQIIEWLPYSSRFRYV

AT5G66600.4 Protein of unknown function, DUF5473.9e-5529.07Show/hide
Query:  RVHARHRRSRSSSDRNSNVFRG-----GVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDIELLQLRLQQERSMRSMLERA
        R+   H+RS+ SS     VF G           KK  + D  ++    +  R +  +  S+   ++   S    SL+ +I  L+ RLQ +  +R  LE+A
Subjt:  RVHARHRRSRSSSDRNSNVFRG-----GVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDIELLQLRLQQERSMRSMLERA

Query:  MG--RASSTLSPGHRHLAQTK---DLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSV
        +G   ASS +      +A  K   DLI ++ +LE EV + EQ++LSLYR  FEQ +S  S + +N    SP                   K +    P +
Subjt:  MG--RASSTLSPGHRHLAQTK---DLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSV

Query:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFG----------GKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMA
        +      SKK  +  +    ++   +R+ +    S FG          GK++ +              IS  +    R + DH+ E P+KLSE MV+CM+
Subjt:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFG----------GKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMA

Query:  DIYCS-------LHR-VASNKAQKKRGSFP-----DVKQPQCGPLE--------------EQCVSG--KAMLEIHWISTNNSQFSRASFAINTFRVLVEQ
        +IYC        LHR ++S  +     +F      D   P  G                 E+  SG   +++E+  I  +  + S     +  F+ L+ +
Subjt:  DIYCS-------LHR-VASNKAQKKRGSFP-----DVKQPQCGPLE--------------EQCVSG--KAMLEIHWISTNNSQFSRASFAINTFRVLVEQ

Query:  LEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALR
        LE+V+  K++ + + AFWINV+NAL+MHA+LAYGIPQ++++R            +L +++AAYNIGGH ISA AI+ SI   K    G WL  + ++  +
Subjt:  LEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALR

Query:  KKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVE
         K+G+ER   +    +  P+PL+ F L +G+ SDP ++VYT   +++ELE +K ++++ N+ ++K +++ LPK++E FA+++ +CP  L + V+ ++   
Subjt:  KKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLKQVSENVDVE

Query:  LHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYV
            +++C       K  + I+W+P+S  FRY+
Subjt:  LHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGATTTGATATGCGTGGAGAAGAAACGGGTGCTGGGAAGAGAGAGCTTAGAGATTATTTAGCTTCACAGCGTGTTCATGCCCGGCATAGGCGCTCTAGAAGTTC
TTCAGACAGGAACTCCAATGTCTTTAGAGGTGGGGTTCTGCATTCCAATAAGAAGAATGACCAAAGTGACGCACAAGCATCCCCACTTTCTACAAGTGGTATTAGAGCAC
AAAGTCCTCTGCATGAGAGTTCTACTAAATTCAATGATAATTCATCATCTAAACACAGAGCGTCTTTGGAAAATGATATTGAGCTGCTACAGCTGCGCTTGCAACAAGAG
AGATCCATGCGAAGTATGCTTGAAAGGGCAATGGGTCGTGCATCAAGTACTTTATCTCCTGGGCACAGGCACTTAGCCCAGACAAAGGATTTGATCACAGAAATTGAATT
ACTTGAGGAAGAGGTCGCAAACCGTGAACAGCATGTGCTCTCCCTCTATAGGAGTATTTTTGAACAATGTGTTAGTAAGCCATCTTCTCAGCAAAATTCTGTCACAGCTT
CTCCAGCTCATGGGAAGCATGAATCAAGAAAGCACCCAAGTGTCATTTCAAGTGCATTTTGTTCTTCAAAGAAGTTTCCTTTAGGACCTTTGCAACCCTTCTCTGTAAAC
GACTTGGGAAAAAGAACCTCAAATGCTGGTCCTAATTCCTTGTTTGGCGGCAAAAGCAACATAAATACAGGAAAAACTTCTTCAGGCACTTCAAAGGTTCGTGAAGCAAT
TTCACAGGTGAAGAGAACTTCTCTTCGAACTCTAAAGGATCATCTTTTTGAGTGTCCAAGCAAGTTATCGGAAGAGATGGTGAGGTGCATGGCTGATATATACTGCTCTC
TGCATAGGGTGGCATCAAACAAGGCTCAAAAAAAGAGAGGTTCCTTCCCCGATGTTAAACAGCCACAATGTGGACCTTTGGAGGAACAATGTGTGAGTGGGAAAGCAATG
CTGGAAATACATTGGATATCAACCAATAACAGCCAGTTCTCTCGTGCCTCGTTCGCAATTAACACTTTCAGAGTATTAGTTGAGCAGCTGGAAAAAGTGAATGTCAGTAA
AATGGAGATTGACGCTCAAACTGCATTCTGGATAAATGTGTATAATGCTCTTATAATGCATGCATATTTGGCATATGGGATTCCCCAGAGCTCTCTAAGAAGGTTGGCAT
TGTTTCATAAGACAAGCATTGTTGACGGGCTTCTCTGTATAGTGCAGGCTGCTTACAACATCGGTGGCCATATCATCAGTGCAAATGCAATAGAGCAATCAATTTTTTGC
TTCAAAACTCCCCGAATAGGATGGTGGCTTGAAACTATCATTTCCACTGCACTGAGGAAAAAGTCTGGGGAAGAAAGACAACTTATCTCTTCAAAATTGGGCCTTCCAAG
TCCTCAACCTCTTGTTTGCTTTGGCCTCTGCACTGGCGCCTCTTCGGATCCTGTGCTGAAAGTGTACACTGCATCAAATGTCAAAGAGGAACTAGAAGTGGCCAAAAGGG
ACTTTCTCCAAGCAAATATAGTTGTGAAGAAGTCAAAGAAAGTATTCCTACCAAAGGTACTCGAAAGATTTGCACGCGAAGCATCCATCTGCCCAGATGAACTTCTGAAA
CAAGTTTCCGAAAATGTTGATGTGGAACTCCACGATTCGATACAGAAATGTATGGATCATCGGATTGGAAAGAAAGCATCTCAGATCATCGAGTGGTTACCTTACAGCTC
AAGGTTTCGGTATGTATTTTCTACCAATCTAACTGAAAAGACATGGTGGCCG
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGATTTGATATGCGTGGAGAAGAAACGGGTGCTGGGAAGAGAGAGCTTAGAGATTATTTAGCTTCACAGCGTGTTCATGCCCGGCATAGGCGCTCTAGAAGTTC
TTCAGACAGGAACTCCAATGTCTTTAGAGGTGGGGTTCTGCATTCCAATAAGAAGAATGACCAAAGTGACGCACAAGCATCCCCACTTTCTACAAGTGGTATTAGAGCAC
AAAGTCCTCTGCATGAGAGTTCTACTAAATTCAATGATAATTCATCATCTAAACACAGAGCGTCTTTGGAAAATGATATTGAGCTGCTACAGCTGCGCTTGCAACAAGAG
AGATCCATGCGAAGTATGCTTGAAAGGGCAATGGGTCGTGCATCAAGTACTTTATCTCCTGGGCACAGGCACTTAGCCCAGACAAAGGATTTGATCACAGAAATTGAATT
ACTTGAGGAAGAGGTCGCAAACCGTGAACAGCATGTGCTCTCCCTCTATAGGAGTATTTTTGAACAATGTGTTAGTAAGCCATCTTCTCAGCAAAATTCTGTCACAGCTT
CTCCAGCTCATGGGAAGCATGAATCAAGAAAGCACCCAAGTGTCATTTCAAGTGCATTTTGTTCTTCAAAGAAGTTTCCTTTAGGACCTTTGCAACCCTTCTCTGTAAAC
GACTTGGGAAAAAGAACCTCAAATGCTGGTCCTAATTCCTTGTTTGGCGGCAAAAGCAACATAAATACAGGAAAAACTTCTTCAGGCACTTCAAAGGTTCGTGAAGCAAT
TTCACAGGTGAAGAGAACTTCTCTTCGAACTCTAAAGGATCATCTTTTTGAGTGTCCAAGCAAGTTATCGGAAGAGATGGTGAGGTGCATGGCTGATATATACTGCTCTC
TGCATAGGGTGGCATCAAACAAGGCTCAAAAAAAGAGAGGTTCCTTCCCCGATGTTAAACAGCCACAATGTGGACCTTTGGAGGAACAATGTGTGAGTGGGAAAGCAATG
CTGGAAATACATTGGATATCAACCAATAACAGCCAGTTCTCTCGTGCCTCGTTCGCAATTAACACTTTCAGAGTATTAGTTGAGCAGCTGGAAAAAGTGAATGTCAGTAA
AATGGAGATTGACGCTCAAACTGCATTCTGGATAAATGTGTATAATGCTCTTATAATGCATGCATATTTGGCATATGGGATTCCCCAGAGCTCTCTAAGAAGGTTGGCAT
TGTTTCATAAGACAAGCATTGTTGACGGGCTTCTCTGTATAGTGCAGGCTGCTTACAACATCGGTGGCCATATCATCAGTGCAAATGCAATAGAGCAATCAATTTTTTGC
TTCAAAACTCCCCGAATAGGATGGTGGCTTGAAACTATCATTTCCACTGCACTGAGGAAAAAGTCTGGGGAAGAAAGACAACTTATCTCTTCAAAATTGGGCCTTCCAAG
TCCTCAACCTCTTGTTTGCTTTGGCCTCTGCACTGGCGCCTCTTCGGATCCTGTGCTGAAAGTGTACACTGCATCAAATGTCAAAGAGGAACTAGAAGTGGCCAAAAGGG
ACTTTCTCCAAGCAAATATAGTTGTGAAGAAGTCAAAGAAAGTATTCCTACCAAAGGTACTCGAAAGATTTGCACGCGAAGCATCCATCTGCCCAGATGAACTTCTGAAA
CAAGTTTCCGAAAATGTTGATGTGGAACTCCACGATTCGATACAGAAATGTATGGATCATCGGATTGGAAAGAAAGCATCTCAGATCATCGAGTGGTTACCTTACAGCTC
AAGGTTTCGGTATGTATTTTCTACCAATCTAACTGAAAAGACATGGTGGCCG
Protein sequenceShow/hide protein sequence
MSGFDMRGEETGAGKRELRDYLASQRVHARHRRSRSSSDRNSNVFRGGVLHSNKKNDQSDAQASPLSTSGIRAQSPLHESSTKFNDNSSSKHRASLENDIELLQLRLQQE
RSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEQCVSKPSSQQNSVTASPAHGKHESRKHPSVISSAFCSSKKFPLGPLQPFSVN
DLGKRTSNAGPNSLFGGKSNINTGKTSSGTSKVREAISQVKRTSLRTLKDHLFECPSKLSEEMVRCMADIYCSLHRVASNKAQKKRGSFPDVKQPQCGPLEEQCVSGKAM
LEIHWISTNNSQFSRASFAINTFRVLVEQLEKVNVSKMEIDAQTAFWINVYNALIMHAYLAYGIPQSSLRRLALFHKTSIVDGLLCIVQAAYNIGGHIISANAIEQSIFC
FKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSPQPLVCFGLCTGASSDPVLKVYTASNVKEELEVAKRDFLQANIVVKKSKKVFLPKVLERFAREASICPDELLK
QVSENVDVELHDSIQKCMDHRIGKKASQIIEWLPYSSRFRYVFSTNLTEKTWWP