; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr024275 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr024275
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function, DUF547
Genome locationtig00001291:1343480..1348141
RNA-Seq ExpressionSgr024275
SyntenySgr024275
Gene Ontology termsNA
InterPro domainsIPR006869 - Domain of unknown function DUF547
IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008437070.1 PREDICTED: uncharacterized protein LOC103482606 isoform X2 [Cucumis melo]2.4e-29388.28Show/hide
Query:  MSGFD--MRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLEN
        MSG D  MRGE + +GKR LRDYLASQRVH+RHRRS+SSSD+NSN FRGG LHS  KND+ DAQASPLSTSG RA+SPLHE ST+F+DNSS+K RASLEN
Subjt:  MSGFD--MRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFE+CVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVA
        ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDI+TGKT SGTAKVREA SQMKRTSLR+LKDHLFECPSKLSEEMVRCMA IYCSLHRVA
Subjt:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVA

Query:  SNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSS
        SN+A+KK GSF  VKQP   PVEEQ G GKAMLEIHCIS NNSQFS ASYAINNYRVLVEQLEKVNVSKM IDAQTAFWINVYNALLMHAYLAYGIPH S
Subjt:  SNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSS

Query:  LRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKV
        LRRLALFHK            AAYNIGGHIISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGLPS QP+VCFGLCTGASSDPVLKV
Subjt:  LRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKV

Query:  YTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLT
        YTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDEL KWVS+NVDGKL +SI KCM+HRTGKK SQIIEWLPYSSRFRYVFSTNLT
Subjt:  YTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLT

Query:  EKPWWS
        E+PWWS
Subjt:  EKPWWS

XP_011654811.1 uncharacterized protein LOC101204173 isoform X1 [Cucumis sativus]2.9e-29488.28Show/hide
Query:  MSGFD--MRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLEN
        MSG D  MRGE + +GKR LRDYLASQRVH+RHRRS+SSSD+NSN FRG  LHS  KND+ DAQASPLSTSG RA+SPLHE ST+F+DNSS+K RASLEN
Subjt:  MSGFD--MRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRH AQTKDLI+EIELLEEEVANREQHVLSLYRSIFE+CVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVA
        ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGPNSLFG KSDI+TGKT SGTAKVREA SQMKRTSLR+LKDHLFECPSKLSEEMVRCMA IYCSLHRVA
Subjt:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVA

Query:  SNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSS
        SN+A+KK GSF  VKQP CGPVEEQ G GKAMLEIHCIS NNSQFS ASYAINNYRVLVEQLEKVNVSKM IDAQTAFWINVYNALLMHAYLAYGIPH S
Subjt:  SNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSS

Query:  LRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKV
        LRRLALFHK            AAYNIGGHIISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGLPS QP+VCFGLCTGASSDPVLKV
Subjt:  LRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKV

Query:  YTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLT
        YTASNVKEELE+AKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDEL KWVSENVDGKL +SI KCM+HRTGKK SQIIEWLPYSSRFRYVFSTNLT
Subjt:  YTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLT

Query:  EKPWWS
        E+PWWS
Subjt:  EKPWWS

XP_022154827.1 uncharacterized protein LOC111021990 isoform X1 [Momordica charantia]2.6e-30391.21Show/hide
Query:  MSGFDMRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDI
        MSGFDMRGE TGAGKR LRDYLASQRVHARHRRS+SSSDRNSNVFRGGVLHS KKNDQ DAQASPLSTSG RAQSPLHESST F+DNSSSKHRASLENDI
Subjt:  MSGFDMRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDI

Query:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSIIS
        ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFE CVSKPSSQQNSVTASPAHGKHESRKHPS+IS
Subjt:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSIIS

Query:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVASN
        SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS+INTGKTSSGT+KVRE ISQ+KRTSLRTLKDHLFECPSKLSEEMVRCMA IYCSLHRVASN
Subjt:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVASN

Query:  RAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLR
        +A+KK GS  DVKQP CGP+EEQ  SGKAMLEIH IS NNSQFS AS+AIN YRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIP SSLR
Subjt:  RAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLR

Query:  RLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYT
        RLALFHK            AAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPS QP+VCFGLCTGASSDPVLKVYT
Subjt:  RLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYT

Query:  ASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLTEK
        ASNVKEELE+AKRDFLQANIVVKKSKKVFLPKVLERFAREASIS DELLK VS+NVD +LHDSI KCMDHRTGKK SQIIEWLPYSSRFRYVFSTNLTEK
Subjt:  ASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLTEK

Query:  PWW
         WW
Subjt:  PWW

XP_022154828.1 uncharacterized protein LOC111021990 isoform X2 [Momordica charantia]1.5e-29890.38Show/hide
Query:  MSGFDMRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDI
        MSGFDMRGE TGAGKR LRDYLASQRVHARHRRS+SSSDRNSNVFRGGVLHS KKNDQ DAQASPLSTSG RAQSPLHESST F+DNSSSKHRASLENDI
Subjt:  MSGFDMRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDI

Query:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSIIS
        ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFE CVSKPSSQQNSVTASPAHGKHESRKHPS+IS
Subjt:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSIIS

Query:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVASN
        SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS+INTGKTSSGT+KV       KRTSLRTLKDHLFECPSKLSEEMVRCMA IYCSLHRVASN
Subjt:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVASN

Query:  RAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLR
        +A+KK GS  DVKQP CGP+EEQ  SGKAMLEIH IS NNSQFS AS+AIN YRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIP SSLR
Subjt:  RAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLR

Query:  RLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYT
        RLALFHK            AAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPS QP+VCFGLCTGASSDPVLKVYT
Subjt:  RLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYT

Query:  ASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLTEK
        ASNVKEELE+AKRDFLQANIVVKKSKKVFLPKVLERFAREASIS DELLK VS+NVD +LHDSI KCMDHRTGKK SQIIEWLPYSSRFRYVFSTNLTEK
Subjt:  ASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLTEK

Query:  PWW
         WW
Subjt:  PWW

XP_038874743.1 uncharacterized protein LOC120067282 isoform X1 [Benincasa hispida]1.2e-29588.61Show/hide
Query:  MSGFD--MRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLEN
        MSGFD  MRGE + + KR LRD+LASQRVH+ HRRS+SSSDRNSNVFRGGVLHS  KND+ DAQASPLSTSG RA+SPLHESS N +DNSSSK RASLEN
Subjt:  MSGFD--MRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFE+CVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVA
        ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSD++TGKT SGTAKVREA SQ+KR SLRTLKDHLFECPSKLSEEMVRCMA IYCSLHR A
Subjt:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVA

Query:  SNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSS
        SN+A+KK GSF  VKQP CGPVEEQ G  KAMLEIHCIS +NSQFS ASYAINNYRVLVEQLEKVNVSKM IDAQTAFWINVYNALLMHAYLAYGIPH S
Subjt:  SNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSS

Query:  LRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKV
        LRRLALFHK            AAYNIGGHIISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGLPS QP+VCFGLCTGASSDPVLKV
Subjt:  LRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKV

Query:  YTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLT
        YTASNVKEELE+AKRDFLQANIVVKKSKKVFLPKVLERFAREASI SDELLKWVSENVDGKLH+SI KCM+HRTGKK SQIIEWLPYSSRFRYVFSTNLT
Subjt:  YTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLT

Query:  EKPWWS
        EKPWWS
Subjt:  EKPWWS

TrEMBL top hitse value%identityAlignment
A0A1S3AT95 uncharacterized protein LOC103482606 isoform X21.2e-29388.28Show/hide
Query:  MSGFD--MRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLEN
        MSG D  MRGE + +GKR LRDYLASQRVH+RHRRS+SSSD+NSN FRGG LHS  KND+ DAQASPLSTSG RA+SPLHE ST+F+DNSS+K RASLEN
Subjt:  MSGFD--MRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLI+EIELLEEEVANREQHVLSLYRSIFE+CVSKPSSQQNSVTASPAHGKHESRKHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVA
        ISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDI+TGKT SGTAKVREA SQMKRTSLR+LKDHLFECPSKLSEEMVRCMA IYCSLHRVA
Subjt:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVA

Query:  SNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSS
        SN+A+KK GSF  VKQP   PVEEQ G GKAMLEIHCIS NNSQFS ASYAINNYRVLVEQLEKVNVSKM IDAQTAFWINVYNALLMHAYLAYGIPH S
Subjt:  SNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSS

Query:  LRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKV
        LRRLALFHK            AAYNIGGHIISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGLPS QP+VCFGLCTGASSDPVLKV
Subjt:  LRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKV

Query:  YTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLT
        YTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDEL KWVS+NVDGKL +SI KCM+HRTGKK SQIIEWLPYSSRFRYVFSTNLT
Subjt:  YTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLT

Query:  EKPWWS
        E+PWWS
Subjt:  EKPWWS

A0A1S4E657 uncharacterized protein LOC103482606 isoform X17.9e-29085.19Show/hide
Query:  MSGFD--MRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLEN
        MSG D  MRGE + +GKR LRDYLASQRVH+RHRRS+SSSD+NSN FRGG LHS  KND+ DAQASPLSTSG RA+SPLHE ST+F+DNSS+K RASLEN
Subjt:  MSGFD--MRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA----------------------QTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSS
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA                      QTKDLI+EIELLEEEVANREQHVLSLYRSIFE+CVSKPSS
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA----------------------QTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSS

Query:  QQNSVTASPAHGKHESRKHPSIISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECP
        QQNSVTASPAHGKHESRKHPSIISSAFCSS+KFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDI+TGKT SGTAKVREA SQMKRTSLR+LKDHLFECP
Subjt:  QQNSVTASPAHGKHESRKHPSIISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECP

Query:  SKLSEEMVRCMAVIYCSLHRVASNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAF
        SKLSEEMVRCMA IYCSLHRVASN+A+KK GSF  VKQP   PVEEQ G GKAMLEIHCIS NNSQFS ASYAINNYRVLVEQLEKVNVSKM IDAQTAF
Subjt:  SKLSEEMVRCMAVIYCSLHRVASNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAF

Query:  WINVYNALLMHAYLAYGIPHSSLRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLP
        WINVYNALLMHAYLAYGIPH SLRRLALFHK            AAYNIGGHIISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGLP
Subjt:  WINVYNALLMHAYLAYGIPHSSLRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLP

Query:  SLQPIVCFGLCTGASSDPVLKVYTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKV
        S QP+VCFGLCTGASSDPVLKVYTASNVKEELE AKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDEL KWVS+NVDGKL +SI KCM+HRTGKK 
Subjt:  SLQPIVCFGLCTGASSDPVLKVYTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKV

Query:  SQIIEWLPYSSRFRYVFSTNLTEKPWWS
        SQIIEWLPYSSRFRYVFSTNLTE+PWWS
Subjt:  SQIIEWLPYSSRFRYVFSTNLTEKPWWS

A0A6J1DLC9 uncharacterized protein LOC111021990 isoform X27.2e-29990.38Show/hide
Query:  MSGFDMRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDI
        MSGFDMRGE TGAGKR LRDYLASQRVHARHRRS+SSSDRNSNVFRGGVLHS KKNDQ DAQASPLSTSG RAQSPLHESST F+DNSSSKHRASLENDI
Subjt:  MSGFDMRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDI

Query:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSIIS
        ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFE CVSKPSSQQNSVTASPAHGKHESRKHPS+IS
Subjt:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSIIS

Query:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVASN
        SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS+INTGKTSSGT+KV       KRTSLRTLKDHLFECPSKLSEEMVRCMA IYCSLHRVASN
Subjt:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVASN

Query:  RAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLR
        +A+KK GS  DVKQP CGP+EEQ  SGKAMLEIH IS NNSQFS AS+AIN YRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIP SSLR
Subjt:  RAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLR

Query:  RLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYT
        RLALFHK            AAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPS QP+VCFGLCTGASSDPVLKVYT
Subjt:  RLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYT

Query:  ASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLTEK
        ASNVKEELE+AKRDFLQANIVVKKSKKVFLPKVLERFAREASIS DELLK VS+NVD +LHDSI KCMDHRTGKK SQIIEWLPYSSRFRYVFSTNLTEK
Subjt:  ASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLTEK

Query:  PWW
         WW
Subjt:  PWW

A0A6J1DMR5 uncharacterized protein LOC111021990 isoform X11.3e-30391.21Show/hide
Query:  MSGFDMRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDI
        MSGFDMRGE TGAGKR LRDYLASQRVHARHRRS+SSSDRNSNVFRGGVLHS KKNDQ DAQASPLSTSG RAQSPLHESST F+DNSSSKHRASLENDI
Subjt:  MSGFDMRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDI

Query:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSIIS
        ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFE CVSKPSSQQNSVTASPAHGKHESRKHPS+IS
Subjt:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSIIS

Query:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVASN
        SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS+INTGKTSSGT+KVRE ISQ+KRTSLRTLKDHLFECPSKLSEEMVRCMA IYCSLHRVASN
Subjt:  SAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVASN

Query:  RAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLR
        +A+KK GS  DVKQP CGP+EEQ  SGKAMLEIH IS NNSQFS AS+AIN YRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIP SSLR
Subjt:  RAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLR

Query:  RLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYT
        RLALFHK            AAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPS QP+VCFGLCTGASSDPVLKVYT
Subjt:  RLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYT

Query:  ASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLTEK
        ASNVKEELE+AKRDFLQANIVVKKSKKVFLPKVLERFAREASIS DELLK VS+NVD +LHDSI KCMDHRTGKK SQIIEWLPYSSRFRYVFSTNLTEK
Subjt:  ASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLTEK

Query:  PWW
         WW
Subjt:  PWW

A0A6J1E2A4 uncharacterized protein LOC1114301771.2e-28586.14Show/hide
Query:  MSGFDM--RGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLEN
        MSGFDM  RGE   +G R LRDYLAS  VHARHRRS+SSSDRNSNV RGGVLHS  KN + D QASPLSTSG RA+SPLHE +TNF+DNS SKHRASLEN
Subjt:  MSGFDM--RGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLEN

Query:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSI
        DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFE+CVSK SSQQ+SVT SPAHGKHES+KHPSI
Subjt:  DIELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSI

Query:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVA
        ISSAFCSS+KFPLGPLQPFSVNDLGKR SNAGPNSL GGK DI+TGK  SG AKVREA+S +K+TSLRTLKDHLFECPSKLSEEMVRCMA IYCSLHRVA
Subjt:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVA

Query:  SNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSS
        SN AKKK  SF+ VK+P  GPVEEQ G  KAMLEIHCIS NN+QFS ASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPH S
Subjt:  SNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSS

Query:  LRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKV
        LRRLALFHK            AAYNIGG IISANAIEQSIF FK+PRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQP+VCFGLCTGASSDPVLKV
Subjt:  LRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKV

Query:  YTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLT
        YTASNVKEELE+AKR+FLQANIVVKKSKKVFLPKVLERFAREASISSDEL KW+SENVDGKLH+SI KCMD +TGKK S IIEWLPYSSRFRYVFSTNLT
Subjt:  YTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLT

Query:  EKPWWS
        EKPW S
Subjt:  EKPWWS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT5G47380.1 Protein of unknown function, DUF5471.1e-15051.31Show/hide
Query:  MSGFDMRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDI
        M GFD+  +  G  +R   D       H   R   +SS+R+ +    G   S   N+    QAS + T+ ++   PLH       +N SS  RASLE D+
Subjt:  MSGFDMRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDI

Query:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA-QTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHG-KHESRKH-PS
        E L LRLQQE+SMR +LERAMGRASS+LSPGHRH A Q  +LITEIELLE EV NRE HVLSLYRSIFE  VS+  S+Q+S  +SPAH  K   RK  P+
Subjt:  ELLQLRLQQERSMRSMLERAMGRASSTLSPGHRHLA-QTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHG-KHESRKH-PS

Query:  IISSAFCSSKKFPLGPLQPF-SVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAK---VREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIY--
        +IS+AFCSS  FPL P     ++ D  ++TS    +S F  ++ I +  + S  AK   ++++++ +K  S RTLKDHL++CP+KLSE+MV+CM+ +Y  
Subjt:  IISSAFCSSKKFPLGPLQPF-SVNDLGKRTSNAGPNSLFGGKSDINTGKTSSGTAK---VREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIY--

Query:  --CSLHRVASNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAY
          CS       +      S S+V  P     E+++ S ++M+E+  IS++  +FS  +YAINNYR+LVEQLE+V +++ME +A+ AFWIN+YNALLMHAY
Subjt:  --CSLHRVASNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAY

Query:  LAYGIPHSSLRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTG
        LAYG+P  SLRRLALFHK            +AYNIGGHII+AN IE SIFCF+TPR G WLETIISTALRKK  E++  + S   L   +P+VCF LC G
Subjt:  LAYGIPHSSLRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTG

Query:  ASSDPVLKVYTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHR-TGKKVSQIIEWLPYSSR
        A SDPVLK YTASNVKEEL+ +KR+FL AN+VVK  KKV LPK++ERF +EAS+S D+L++W+ +N D KL +SI KC+  +   KK SQ++EWLPYSS+
Subjt:  ASSDPVLKVYTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHR-TGKKVSQIIEWLPYSSR

Query:  FRYVFSTNLTEK
        FRYVFS +L EK
Subjt:  FRYVFSTNLTEK

AT5G66600.1 Protein of unknown function, DUF5471.1e-5228.34Show/hide
Query:  RVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDIELLQLRLQQERSMRSMLERAMG--R
        R+   H+RSKS+S               KK  + D  ++    +  R +  +  S+ +  +   S    SL+ +I  L+ RLQ +  +R  LE+A+G   
Subjt:  RVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDIELLQLRLQQERSMRSMLERAMG--R

Query:  ASSTLSPGHRHLAQTK---DLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSIISSAF
        ASS +      +A  K   DLI ++ +LE EV + EQ++LSLYR  FE  +S  S + +N    SP                   K +    P ++    
Subjt:  ASSTLSPGHRHLAQTK---DLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSIISSAF

Query:  CSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS---DINTGKTSSG-------TAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCS
          SKK  +  +    ++   +R+ +    S FG +    + + GK S                IS  +    R + DH+ E P+KLSE MV+CM+ IYC 
Subjt:  CSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS---DINTGKTSSG-------TAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCS

Query:  -------LHR-VASNRAKKKTGSFS-----DVKQP----------------HCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVN
               LHR ++S  +   + +FS     D   P                H    ++ SG   +++E+ CI  +  + S     + N++ L+ +LE+V+
Subjt:  -------LHR-VASNRAKKKTGSFS-----DVKQP----------------HCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVN

Query:  VSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGE
          K++ + + AFWINV+NAL+MHA+LAYGIP ++++R+ L            +++AAYNIGGH ISA AI+ SI   K    G WL  + ++  + K+G+
Subjt:  VSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGE

Query:  ERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSI
        ER   +    +   +P++ F L +G+ SDP ++VYT   +++ELE +K ++++ N+ ++K +++ LPK++E FA+++ +    L + V+ ++       +
Subjt:  ERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSI

Query:  LKCMDHRTGKKVSQIIEWLPYSSRFRYV
         +C    +  K  + I+W+P+S  FRY+
Subjt:  LKCMDHRTGKKVSQIIEWLPYSSRFRYV

AT5G66600.2 Protein of unknown function, DUF5472.4e-5228.5Show/hide
Query:  KKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDIELLQLRLQQERSMRSMLERAMG--RASSTLSPGHRHLAQTK---DLITEIELL
        KK  + D  ++    +  R +  +  S+ +  +   S    SL+ +I  L+ RLQ +  +R  LE+A+G   ASS +      +A  K   DLI ++ +L
Subjt:  KKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDIELLQLRLQQERSMRSMLERAMG--RASSTLSPGHRHLAQTK---DLITEIELL

Query:  EEEVANREQHVLSLYRSIFEHCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSIISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGP
        E EV + EQ++LSLYR  FE  +S  S + +N    SP                   K +    P ++      SKK  +  +    ++   +R+ +   
Subjt:  EEEVANREQHVLSLYRSIFEHCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSIISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGP

Query:  NSLFGGKS---DINTGKTSSG-------TAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCS-------LHR-VASNRAKKKTGSFS---
         S FG +    + + GK S                IS  +    R + DH+ E P+KLSE MV+CM+ IYC        LHR ++S  +   + +FS   
Subjt:  NSLFGGKS---DINTGKTSSG-------TAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCS-------LHR-VASNRAKKKTGSFS---

Query:  --DVKQP----------------HCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAY
          D   P                H    ++ SG   +++E+ CI  +  + S     + N++ L+ +LE+V+  K++ + + AFWINV+NAL+MHA+LAY
Subjt:  --DVKQP----------------HCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAY

Query:  GIPHSSLRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASS
        GIP ++++R+ L            +++AAYNIGGH ISA AI+ SI   K    G WL  + ++  + K+G+ER   +    +   +P++ F L +G+ S
Subjt:  GIPHSSLRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASS

Query:  DPVLKVYTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYV
        DP ++VYT   +++ELE +K ++++ N+ ++K +++ LPK++E FA+++ +    L + V+ ++       + +C    +  K  + I+W+P+S  FRY+
Subjt:  DPVLKVYTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYV

AT5G66600.3 Protein of unknown function, DUF5471.1e-5228.34Show/hide
Query:  RVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDIELLQLRLQQERSMRSMLERAMG--R
        R+   H+RSKS+S               KK  + D  ++    +  R +  +  S+ +  +   S    SL+ +I  L+ RLQ +  +R  LE+A+G   
Subjt:  RVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDIELLQLRLQQERSMRSMLERAMG--R

Query:  ASSTLSPGHRHLAQTK---DLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSIISSAF
        ASS +      +A  K   DLI ++ +LE EV + EQ++LSLYR  FE  +S  S + +N    SP                   K +    P ++    
Subjt:  ASSTLSPGHRHLAQTK---DLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSIISSAF

Query:  CSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS---DINTGKTSSG-------TAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCS
          SKK  +  +    ++   +R+ +    S FG +    + + GK S                IS  +    R + DH+ E P+KLSE MV+CM+ IYC 
Subjt:  CSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS---DINTGKTSSG-------TAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCS

Query:  -------LHR-VASNRAKKKTGSFS-----DVKQP----------------HCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVN
               LHR ++S  +   + +FS     D   P                H    ++ SG   +++E+ CI  +  + S     + N++ L+ +LE+V+
Subjt:  -------LHR-VASNRAKKKTGSFS-----DVKQP----------------HCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQLEKVN

Query:  VSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGE
          K++ + + AFWINV+NAL+MHA+LAYGIP ++++R+ L            +++AAYNIGGH ISA AI+ SI   K    G WL  + ++  + K+G+
Subjt:  VSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALRKKSGE

Query:  ERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSI
        ER   +    +   +P++ F L +G+ SDP ++VYT   +++ELE +K ++++ N+ ++K +++ LPK++E FA+++ +    L + V+ ++       +
Subjt:  ERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGKLHDSI

Query:  LKCMDHRTGKKVSQIIEWLPYSSRFRYV
         +C    +  K  + I+W+P+S  FRY+
Subjt:  LKCMDHRTGKKVSQIIEWLPYSSRFRYV

AT5G66600.4 Protein of unknown function, DUF5472.4e-5228.59Show/hide
Query:  RVHARHRRSKSSSDRNSNVFRG-----GVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDIELLQLRLQQERSMRSMLERA
        R+   H+RSK SS     VF G           KK  + D  ++    +  R +  +  S+ +  +   S    SL+ +I  L+ RLQ +  +R  LE+A
Subjt:  RVHARHRRSKSSSDRNSNVFRG-----GVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDIELLQLRLQQERSMRSMLERA

Query:  MG--RASSTLSPGHRHLAQTK---DLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSI
        +G   ASS +      +A  K   DLI ++ +LE EV + EQ++LSLYR  FE  +S  S + +N    SP                   K +    P +
Subjt:  MG--RASSTLSPGHRHLAQTK---DLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPS-SQQNSVTASP----------------AHGKHESRKHPSI

Query:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS---DINTGKTSSG-------TAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMA
        +      SKK  +  +    ++   +R+ +    S FG +    + + GK S                IS  +    R + DH+ E P+KLSE MV+CM+
Subjt:  ISSAFCSSKKFPLGPLQPFSVNDLGKRTSNAGPNSLFGGKS---DINTGKTSSG-------TAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMA

Query:  VIYCS-------LHR-VASNRAKKKTGSFS-----DVKQP----------------HCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQ
         IYC        LHR ++S  +   + +FS     D   P                H    ++ SG   +++E+ CI  +  + S     + N++ L+ +
Subjt:  VIYCS-------LHR-VASNRAKKKTGSFS-----DVKQP----------------HCGPVEEQSGSGKAMLEIHCISANNSQFSHASYAINNYRVLVEQ

Query:  LEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALR
        LE+V+  K++ + + AFWINV+NAL+MHA+LAYGIP ++++R+ L            +++AAYNIGGH ISA AI+ SI   K    G WL  + ++  +
Subjt:  LEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFCFKTPRIGWWLETIISTALR

Query:  KKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGK
         K+G+ER   +    +   +P++ F L +G+ SDP ++VYT   +++ELE +K ++++ N+ ++K +++ LPK++E FA+++ +    L + V+ ++   
Subjt:  KKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLKWVSENVDGK

Query:  LHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYV
            + +C    +  K  + I+W+P+S  FRY+
Subjt:  LHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGTGGATTTGATATGCGTGGAGAAGCAACAGGGGCTGGGAAGAGAGCGCTTAGAGATTATTTAGCGTCCCAACGTGTTCATGCCCGGCACAGGCGCTCCAAAAGTTC
TTCAGACAGGAACTCCAATGTCTTTAGAGGTGGGGTGCTGCATTCTATTAAGAAGAATGACCAGATTGACGCACAAGCATCCCCACTTTCTACAAGTGGTTCTAGAGCAC
AGAGTCCTCTGCATGAGAGTTCTACTAATTTTGATGATAATTCATCGTCTAAACACAGAGCCTCTTTGGAAAATGATATCGAGCTCCTACAGCTGCGCTTGCAACAAGAG
AGATCTATGCGAAGTATGCTTGAAAGGGCAATGGGTCGTGCATCAAGTACTTTGTCTCCTGGGCACAGGCACTTAGCACAGACAAAGGATTTGATCACAGAAATTGAATT
ACTCGAAGAAGAGGTTGCAAACCGTGAACAGCATGTGCTCTCTCTCTATAGGAGTATTTTTGAACATTGTGTTAGTAAGCCATCTTCTCAGCAAAATTCTGTCACAGCCT
CTCCAGCTCATGGGAAGCATGAATCAAGAAAACACCCAAGTATCATTTCGAGTGCATTTTGTTCTTCAAAGAAGTTTCCTTTAGGACCTTTGCAACCTTTCTCCGTAAAT
GACTTGGGAAAGAGAACCTCAAATGCTGGTCCTAATTCCTTGTTCGGAGGCAAAAGTGACATAAATACAGGAAAAACTTCTTCAGGCACTGCAAAGGTTCGTGAAGCAAT
TTCACAGATGAAGAGAACTTCTCTGCGAACTCTAAAAGATCATCTTTTTGAGTGTCCAAGCAAGTTATCAGAGGAGATGGTGAGGTGCATGGCTGTTATATACTGCTCTC
TTCATAGAGTGGCATCAAACAGAGCTAAAAAAAAGACAGGTTCCTTCTCTGATGTTAAACAGCCCCATTGTGGACCTGTGGAGGAACAAAGTGGGAGTGGGAAAGCAATG
CTGGAAATACATTGCATATCAGCCAATAACAGCCAGTTCTCTCATGCTTCATACGCAATCAACAATTATAGAGTATTAGTTGAGCAGCTGGAAAAAGTGAATGTCAGTAA
AATGGAGATTGATGCCCAAACTGCATTCTGGATAAATGTGTATAATGCTCTTCTAATGCATGCGTATTTGGCTTATGGAATCCCTCATAGCTCTCTCAGAAGGTTGGCAT
TGTTTCATAAGACAAGCATTGTTGATGGGCTTCTCAGTATAGTGCAGGCTGCTTACAACATCGGTGGTCATATCATCAGTGCAAATGCAATAGAGCAATCAATTTTTTGC
TTCAAAACTCCCCGAATAGGATGGTGGCTTGAAACTATCATTTCCACTGCACTGAGGAAAAAGTCTGGGGAAGAAAGACAACTTATCTCTTCAAAATTGGGCCTTCCAAG
TCTGCAACCTATTGTTTGTTTTGGCCTCTGCACTGGCGCCTCTTCAGATCCTGTGCTGAAAGTATACACTGCATCAAATGTTAAAGAGGAACTAGAAATGGCTAAAAGGG
ATTTTCTTCAAGCAAATATAGTTGTGAAGAAGTCAAAGAAAGTATTTCTACCAAAGGTGCTCGAAAGATTTGCGCGCGAAGCATCCATCAGCTCAGATGAACTTCTGAAA
TGGGTTTCCGAAAACGTCGATGGGAAGCTCCACGATTCAATACTGAAATGTATGGATCATCGGACTGGCAAGAAGGTATCTCAGATCATCGAGTGGTTACCTTACAGCTC
AAGGTTCCGGTATGTATTTTCTACGAATCTAACTGAAAAGCCGTGGTGGTCGTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGTGGATTTGATATGCGTGGAGAAGCAACAGGGGCTGGGAAGAGAGCGCTTAGAGATTATTTAGCGTCCCAACGTGTTCATGCCCGGCACAGGCGCTCCAAAAGTTC
TTCAGACAGGAACTCCAATGTCTTTAGAGGTGGGGTGCTGCATTCTATTAAGAAGAATGACCAGATTGACGCACAAGCATCCCCACTTTCTACAAGTGGTTCTAGAGCAC
AGAGTCCTCTGCATGAGAGTTCTACTAATTTTGATGATAATTCATCGTCTAAACACAGAGCCTCTTTGGAAAATGATATCGAGCTCCTACAGCTGCGCTTGCAACAAGAG
AGATCTATGCGAAGTATGCTTGAAAGGGCAATGGGTCGTGCATCAAGTACTTTGTCTCCTGGGCACAGGCACTTAGCACAGACAAAGGATTTGATCACAGAAATTGAATT
ACTCGAAGAAGAGGTTGCAAACCGTGAACAGCATGTGCTCTCTCTCTATAGGAGTATTTTTGAACATTGTGTTAGTAAGCCATCTTCTCAGCAAAATTCTGTCACAGCCT
CTCCAGCTCATGGGAAGCATGAATCAAGAAAACACCCAAGTATCATTTCGAGTGCATTTTGTTCTTCAAAGAAGTTTCCTTTAGGACCTTTGCAACCTTTCTCCGTAAAT
GACTTGGGAAAGAGAACCTCAAATGCTGGTCCTAATTCCTTGTTCGGAGGCAAAAGTGACATAAATACAGGAAAAACTTCTTCAGGCACTGCAAAGGTTCGTGAAGCAAT
TTCACAGATGAAGAGAACTTCTCTGCGAACTCTAAAAGATCATCTTTTTGAGTGTCCAAGCAAGTTATCAGAGGAGATGGTGAGGTGCATGGCTGTTATATACTGCTCTC
TTCATAGAGTGGCATCAAACAGAGCTAAAAAAAAGACAGGTTCCTTCTCTGATGTTAAACAGCCCCATTGTGGACCTGTGGAGGAACAAAGTGGGAGTGGGAAAGCAATG
CTGGAAATACATTGCATATCAGCCAATAACAGCCAGTTCTCTCATGCTTCATACGCAATCAACAATTATAGAGTATTAGTTGAGCAGCTGGAAAAAGTGAATGTCAGTAA
AATGGAGATTGATGCCCAAACTGCATTCTGGATAAATGTGTATAATGCTCTTCTAATGCATGCGTATTTGGCTTATGGAATCCCTCATAGCTCTCTCAGAAGGTTGGCAT
TGTTTCATAAGACAAGCATTGTTGATGGGCTTCTCAGTATAGTGCAGGCTGCTTACAACATCGGTGGTCATATCATCAGTGCAAATGCAATAGAGCAATCAATTTTTTGC
TTCAAAACTCCCCGAATAGGATGGTGGCTTGAAACTATCATTTCCACTGCACTGAGGAAAAAGTCTGGGGAAGAAAGACAACTTATCTCTTCAAAATTGGGCCTTCCAAG
TCTGCAACCTATTGTTTGTTTTGGCCTCTGCACTGGCGCCTCTTCAGATCCTGTGCTGAAAGTATACACTGCATCAAATGTTAAAGAGGAACTAGAAATGGCTAAAAGGG
ATTTTCTTCAAGCAAATATAGTTGTGAAGAAGTCAAAGAAAGTATTTCTACCAAAGGTGCTCGAAAGATTTGCGCGCGAAGCATCCATCAGCTCAGATGAACTTCTGAAA
TGGGTTTCCGAAAACGTCGATGGGAAGCTCCACGATTCAATACTGAAATGTATGGATCATCGGACTGGCAAGAAGGTATCTCAGATCATCGAGTGGTTACCTTACAGCTC
AAGGTTCCGGTATGTATTTTCTACGAATCTAACTGAAAAGCCGTGGTGGTCGTGA
Protein sequenceShow/hide protein sequence
MSGFDMRGEATGAGKRALRDYLASQRVHARHRRSKSSSDRNSNVFRGGVLHSIKKNDQIDAQASPLSTSGSRAQSPLHESSTNFDDNSSSKHRASLENDIELLQLRLQQE
RSMRSMLERAMGRASSTLSPGHRHLAQTKDLITEIELLEEEVANREQHVLSLYRSIFEHCVSKPSSQQNSVTASPAHGKHESRKHPSIISSAFCSSKKFPLGPLQPFSVN
DLGKRTSNAGPNSLFGGKSDINTGKTSSGTAKVREAISQMKRTSLRTLKDHLFECPSKLSEEMVRCMAVIYCSLHRVASNRAKKKTGSFSDVKQPHCGPVEEQSGSGKAM
LEIHCISANNSQFSHASYAINNYRVLVEQLEKVNVSKMEIDAQTAFWINVYNALLMHAYLAYGIPHSSLRRLALFHKTSIVDGLLSIVQAAYNIGGHIISANAIEQSIFC
FKTPRIGWWLETIISTALRKKSGEERQLISSKLGLPSLQPIVCFGLCTGASSDPVLKVYTASNVKEELEMAKRDFLQANIVVKKSKKVFLPKVLERFAREASISSDELLK
WVSENVDGKLHDSILKCMDHRTGKKVSQIIEWLPYSSRFRYVFSTNLTEKPWWS