; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr021985 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr021985
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionProtein of unknown function, DUF547
Genome locationtig00153870:386198..393780
RNA-Seq ExpressionSgr021985
SyntenySgr021985
Gene Ontology termsNA
InterPro domainsIPR006869 - Domain of unknown function DUF547
IPR025757 - Ternary complex factor MIP1, leucine-zipper


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK06707.1 uncharacterized protein E5676_scaffold13G00080 [Cucumis melo var. makuwa]1.9e-20180.04Show/hide
Query:  LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTM
        ++  K+Q+ D  VQ SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+A MELIKQIAVLE+EVVYLEKYLLSLYRRTF QQVSSFSTM
Subjt:  LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTM

Query:  DDQLESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASN
        DD+LESY  P+ VI+ EHS IHSDHIVSP+T   NQSKGRN VEE EKL H  RS SSL QRS GSS+NY LSKYMAKAVDSYHS PLSMLEQS+ D  +
Subjt:  DDQLESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASN

Query:  SLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK
        S SL EH GAC+  +   SPNWLSEEMIKSISAIY ELAEPPL+NHNNPSPI+PLSSMYELSS+D GSMRNYEK    NSHF+NPFHIEEF APY TMLK
Subjt:  SLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK

Query:  VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLP
        VQWISR+RKKDSDI+HMLQGFRS I+RLKEV LK MKH E+LAFWINVHNTLVMHAYLQYGIPK+ LKRISLI KAAYN+GGHIISVD IQSSILGCRLP
Subjt:  VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLP

Query:  RLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
        R GQWLHLFLSSKTKFKVND  KSF INHPEPRLYFALCCG+ SDPAVR+YTAKRVNE+LE
Subjt:  RLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE

XP_008454883.1 PREDICTED: uncharacterized protein LOC103495193 [Cucumis melo]1.9e-20180.04Show/hide
Query:  LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTM
        ++  K+Q+ D  VQ SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+A MELIKQIAVLE+EVVYLEKYLLSLYRRTF QQVSSFSTM
Subjt:  LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTM

Query:  DDQLESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASN
        DD+LESY  P+ VI+ EHS IHSDHIVSP+T   NQSKGRN VEE EKL H  RS SSL QRS GSS+NY LSKYMAKAVDSYHS PLSMLEQS+ D  +
Subjt:  DDQLESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASN

Query:  SLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK
        S SL EH GAC+  +   SPNWLSEEMIKSISAIY ELAEPPL+NHNNPSPI+PLSSMYELSS+D GSMRNYEK    NSHF+NPFHIEEF APY TMLK
Subjt:  SLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK

Query:  VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLP
        VQWISR+RKKDSDI+HMLQGFRS I+RLKEV LK MKH E+LAFWINVHNTLVMHAYLQYGIPK+ LKRISLI KAAYN+GGHIISVD IQSSILGCRLP
Subjt:  VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLP

Query:  RLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
        R GQWLHLFLSSKTKFKVND  KSF INHPEPRLYFALCCG+ SDPAVR+YTAKRVNE+LE
Subjt:  RLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE

XP_011658927.1 uncharacterized protein LOC101203131 isoform X2 [Cucumis sativus]4.0e-19979.61Show/hide
Query:  LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTM
        ++  K+Q+ D   Q SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+A MELIKQIAVLE+EVVYLEKYLLSLYRRTF QQVSSFSTM
Subjt:  LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTM

Query:  DDQLESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASN
        DD+LESY  P+ VI+ EHS IHSDHI SP+T   NQSKGRN VEE E L H  RS SSL QRS GSS+NY LSK MAKAVDSYHS PLSMLEQS+ D  +
Subjt:  DDQLESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASN

Query:  SLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK
        S SL EH GAC+  +   SPNWLSEEMIKSISAIY ELAEPPL+NHNNPSPI+PLSSMYELSS+D GSMRNYEK    NSHF+NPFH EEF APY TMLK
Subjt:  SLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK

Query:  VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLP
        VQWISR+RK DSDI+HMLQGFRS I+RLKEV LKAMKH E+LAFWINVHNTLVMHAYLQYGI K+ LKRISLI KAAYN+GGHIISVD IQSSILGCRLP
Subjt:  VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLP

Query:  RLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
        R GQWLHLFLSSKTKFKVND  KSF INHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
Subjt:  RLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE

XP_022135648.1 uncharacterized protein LOC111007555 isoform X1 [Momordica charantia]2.4e-22086.47Show/hide
Query:  MEKVGACLEAQKKQLPDSHV-QNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQ
        ME  GA LEA+KKQLPDSHV QNSLKQEI QLQEQLQSQFVIRHALEKA+NFQP SLDSATE+SIPKAAMELIKQIAVLE+EVVYLEKYLLSLYRRTFKQ
Subjt:  MEKVGACLEAQKKQLPDSHV-QNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQ

Query:  QVSSFSTMDDQLESYSGPHIVIDRE--HSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSM
        QVSS STMDD+LESYSGP  VI+ E  HSFIHSDHIVSPQTS  NQSKGRNEVEE EKL H  RSYSSLL+RSPGSS NYPLSK +AKAVDSYHSLPLSM
Subjt:  QVSSFSTMDDQLESYSGPHIVIDRE--HSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSM

Query:  LEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLI-NHNNPSPITPLSSMYELSSRD-LGSMRNYEKFALFNSHFDNPFHI
        LEQSQSDASNS+SL EH GA +P++A  SPNW+SEEMIKSIS IYCELA+PPL+ NHNNPSPI+PLSSM ELSS+D LGSMRNYEK   FNS+F NPFHI
Subjt:  LEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLI-NHNNPSPITPLSSMYELSSRD-LGSMRNYEKFALFNSHFDNPFHI

Query:  EEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVD
        EEFS PY TMLKVQWISR+RKKDSDI+HMLQGFRS IYRLKEVDLKAMKH+E+LAFWINVHNTLVMHAYLQYGIPKNSLKR SLI KAAYNVGGHIISVD
Subjt:  EEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVD

Query:  MIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
        MIQSSILGC LPR GQWLHLFLSSKTKFKVNDA KSF+INHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
Subjt:  MIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE

XP_022135649.1 uncharacterized protein LOC111007555 isoform X2 [Momordica charantia]3.3e-21785.84Show/hide
Query:  MEKVGACLEAQKKQLPDSHV-QNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQ
        ME  GA LEA+KKQLPDSHV QNSLKQEI QLQEQLQSQFVIRHALEKA+NFQP SLDSATE+SIPKAAMELIKQIAVLE+EVVYLEKYLLSLYRRTFKQ
Subjt:  MEKVGACLEAQKKQLPDSHV-QNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQ

Query:  QVSSFSTMDDQLESYSGPHIVIDRE--HSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSM
        QVSS STMDD+LESYSGP  VI+ E  HSFIHSDHIVSPQTS  NQSKGRNEVEE EKL H  RSYSSLL+RSPGSS NYPLSK +AKAVDSYHSLPLSM
Subjt:  QVSSFSTMDDQLESYSGPHIVIDRE--HSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSM

Query:  LEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLI-NHNNPSPITPLSSMYELSSRD-LGSMRNYEKFALFNSHFDNPFHI
        LE   SDASNS+SL EH GA +P++A  SPNW+SEEMIKSIS IYCELA+PPL+ NHNNPSPI+PLSSM ELSS+D LGSMRNYEK   FNS+F NPFHI
Subjt:  LEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLI-NHNNPSPITPLSSMYELSSRD-LGSMRNYEKFALFNSHFDNPFHI

Query:  EEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVD
        EEFS PY TMLKVQWISR+RKKDSDI+HMLQGFRS IYRLKEVDLKAMKH+E+LAFWINVHNTLVMHAYLQYGIPKNSLKR SLI KAAYNVGGHIISVD
Subjt:  EEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVD

Query:  MIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
        MIQSSILGC LPR GQWLHLFLSSKTKFKVNDA KSF+INHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
Subjt:  MIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE

TrEMBL top hitse value%identityAlignment
A0A0A0K861 Uncharacterized protein1.9e-19979.61Show/hide
Query:  LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTM
        ++  K+Q+ D   Q SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+A MELIKQIAVLE+EVVYLEKYLLSLYRRTF QQVSSFSTM
Subjt:  LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTM

Query:  DDQLESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASN
        DD+LESY  P+ VI+ EHS IHSDHI SP+T   NQSKGRN VEE E L H  RS SSL QRS GSS+NY LSK MAKAVDSYHS PLSMLEQS+ D  +
Subjt:  DDQLESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASN

Query:  SLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK
        S SL EH GAC+  +   SPNWLSEEMIKSISAIY ELAEPPL+NHNNPSPI+PLSSMYELSS+D GSMRNYEK    NSHF+NPFH EEF APY TMLK
Subjt:  SLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK

Query:  VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLP
        VQWISR+RK DSDI+HMLQGFRS I+RLKEV LKAMKH E+LAFWINVHNTLVMHAYLQYGI K+ LKRISLI KAAYN+GGHIISVD IQSSILGCRLP
Subjt:  VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLP

Query:  RLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
        R GQWLHLFLSSKTKFKVND  KSF INHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
Subjt:  RLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE

A0A1S3BZ51 uncharacterized protein LOC1034951939.3e-20280.04Show/hide
Query:  LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTM
        ++  K+Q+ D  VQ SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+A MELIKQIAVLE+EVVYLEKYLLSLYRRTF QQVSSFSTM
Subjt:  LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTM

Query:  DDQLESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASN
        DD+LESY  P+ VI+ EHS IHSDHIVSP+T   NQSKGRN VEE EKL H  RS SSL QRS GSS+NY LSKYMAKAVDSYHS PLSMLEQS+ D  +
Subjt:  DDQLESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASN

Query:  SLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK
        S SL EH GAC+  +   SPNWLSEEMIKSISAIY ELAEPPL+NHNNPSPI+PLSSMYELSS+D GSMRNYEK    NSHF+NPFHIEEF APY TMLK
Subjt:  SLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK

Query:  VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLP
        VQWISR+RKKDSDI+HMLQGFRS I+RLKEV LK MKH E+LAFWINVHNTLVMHAYLQYGIPK+ LKRISLI KAAYN+GGHIISVD IQSSILGCRLP
Subjt:  VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLP

Query:  RLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
        R GQWLHLFLSSKTKFKVND  KSF INHPEPRLYFALCCG+ SDPAVR+YTAKRVNE+LE
Subjt:  RLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE

A0A5D3C4C9 Uncharacterized protein9.3e-20280.04Show/hide
Query:  LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTM
        ++  K+Q+ D  VQ SLKQEILQL+EQLQSQF  RHALEKA+NFQPLSL SATE++IP+A MELIKQIAVLE+EVVYLEKYLLSLYRRTF QQVSSFSTM
Subjt:  LEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTM

Query:  DDQLESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASN
        DD+LESY  P+ VI+ EHS IHSDHIVSP+T   NQSKGRN VEE EKL H  RS SSL QRS GSS+NY LSKYMAKAVDSYHS PLSMLEQS+ D  +
Subjt:  DDQLESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASN

Query:  SLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK
        S SL EH GAC+  +   SPNWLSEEMIKSISAIY ELAEPPL+NHNNPSPI+PLSSMYELSS+D GSMRNYEK    NSHF+NPFHIEEF APY TMLK
Subjt:  SLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLK

Query:  VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLP
        VQWISR+RKKDSDI+HMLQGFRS I+RLKEV LK MKH E+LAFWINVHNTLVMHAYLQYGIPK+ LKRISLI KAAYN+GGHIISVD IQSSILGCRLP
Subjt:  VQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLP

Query:  RLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
        R GQWLHLFLSSKTKFKVND  KSF INHPEPRLYFALCCG+ SDPAVR+YTAKRVNE+LE
Subjt:  RLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE

A0A6J1C220 uncharacterized protein LOC111007555 isoform X21.6e-21785.84Show/hide
Query:  MEKVGACLEAQKKQLPDSHV-QNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQ
        ME  GA LEA+KKQLPDSHV QNSLKQEI QLQEQLQSQFVIRHALEKA+NFQP SLDSATE+SIPKAAMELIKQIAVLE+EVVYLEKYLLSLYRRTFKQ
Subjt:  MEKVGACLEAQKKQLPDSHV-QNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQ

Query:  QVSSFSTMDDQLESYSGPHIVIDRE--HSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSM
        QVSS STMDD+LESYSGP  VI+ E  HSFIHSDHIVSPQTS  NQSKGRNEVEE EKL H  RSYSSLL+RSPGSS NYPLSK +AKAVDSYHSLPLSM
Subjt:  QVSSFSTMDDQLESYSGPHIVIDRE--HSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSM

Query:  LEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLI-NHNNPSPITPLSSMYELSSRD-LGSMRNYEKFALFNSHFDNPFHI
        LE   SDASNS+SL EH GA +P++A  SPNW+SEEMIKSIS IYCELA+PPL+ NHNNPSPI+PLSSM ELSS+D LGSMRNYEK   FNS+F NPFHI
Subjt:  LEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLI-NHNNPSPITPLSSMYELSSRD-LGSMRNYEKFALFNSHFDNPFHI

Query:  EEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVD
        EEFS PY TMLKVQWISR+RKKDSDI+HMLQGFRS IYRLKEVDLKAMKH+E+LAFWINVHNTLVMHAYLQYGIPKNSLKR SLI KAAYNVGGHIISVD
Subjt:  EEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVD

Query:  MIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
        MIQSSILGC LPR GQWLHLFLSSKTKFKVNDA KSF+INHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
Subjt:  MIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE

A0A6J1C5D8 uncharacterized protein LOC111007555 isoform X11.2e-22086.47Show/hide
Query:  MEKVGACLEAQKKQLPDSHV-QNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQ
        ME  GA LEA+KKQLPDSHV QNSLKQEI QLQEQLQSQFVIRHALEKA+NFQP SLDSATE+SIPKAAMELIKQIAVLE+EVVYLEKYLLSLYRRTFKQ
Subjt:  MEKVGACLEAQKKQLPDSHV-QNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQ

Query:  QVSSFSTMDDQLESYSGPHIVIDRE--HSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSM
        QVSS STMDD+LESYSGP  VI+ E  HSFIHSDHIVSPQTS  NQSKGRNEVEE EKL H  RSYSSLL+RSPGSS NYPLSK +AKAVDSYHSLPLSM
Subjt:  QVSSFSTMDDQLESYSGPHIVIDRE--HSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSM

Query:  LEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLI-NHNNPSPITPLSSMYELSSRD-LGSMRNYEKFALFNSHFDNPFHI
        LEQSQSDASNS+SL EH GA +P++A  SPNW+SEEMIKSIS IYCELA+PPL+ NHNNPSPI+PLSSM ELSS+D LGSMRNYEK   FNS+F NPFHI
Subjt:  LEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLI-NHNNPSPITPLSSMYELSSRD-LGSMRNYEKFALFNSHFDNPFHI

Query:  EEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVD
        EEFS PY TMLKVQWISR+RKKDSDI+HMLQGFRS IYRLKEVDLKAMKH+E+LAFWINVHNTLVMHAYLQYGIPKNSLKR SLI KAAYNVGGHIISVD
Subjt:  EEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVD

Query:  MIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
        MIQSSILGC LPR GQWLHLFLSSKTKFKVNDA KSF+INHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE
Subjt:  MIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELE

SwissProt top hitse value%identityAlignment
Q9XII1 Plastid division protein PDV21.0e-4843.73Show/hide
Query:  EEQSTAIILARAMELRLKIRSSV-NTTTTASSTVTSQE---IGDDR-SAVDGNGVAGYSGTGSRRT-EADASG--------EAEEDDEADLQ---QRQRY
        +E+   +ILARA ELRLKI   + N++TT S      E    G+ R S + GN    +    S    EA+A          EA E   A LQ   QRQ+Y
Subjt:  EEQSTAIILARAMELRLKIRSSV-NTTTTASSTVTSQE---IGDDR-SAVDGNGVAGYSGTGSRRT-EADASG--------EAEEDDEADLQ---QRQRY

Query:  EKEAALSEIEHSRKILLDKLKKYKGEDLEVIHEASAFAGDTVQHNQDLMLPPYPSHP---LHSPLGNGHVHPFPSGHKSVSNGLKDIATNKATKEPNESE
        EK+ ALSEI++SRK+LL+KLK+YKG+D EV+ E + FAG+ V +  DL+LPPYP HP   L     NG++   PS  KS +NG        +    NE+E
Subjt:  EKEAALSEIEHSRKILLDKLKKYKGEDLEVIHEASAFAGDTVQHNQDLMLPPYPSHP---LHSPLGNGHVHPFPSGHKSVSNGLKDIATNKATKEPNESE

Query:  RKCSQTDSRNSRNGLGSFVSVAAKSVFTIVGIVSILHLSGFRPKFGGKVAALKVLDLLRQSAAEDNGSHNECPPGKFLVMEDGEARCVVKERIEIPFSSV
         K     S  S +G+  F+   AK V  I+G++S+L  SG+ P+   + A+L +  LL   A     + N+CPPGK LV+EDGEARC+VKER+EIPF SV
Subjt:  RKCSQTDSRNSRNGLGSFVSVAAKSVFTIVGIVSILHLSGFRPKFGGKVAALKVLDLLRQSAAEDNGSHNECPPGKFLVMEDGEARCVVKERIEIPFSSV

Query:  VAKPDVNYGCG
        VAK DV YG G
Subjt:  VAKPDVNYGCG

Arabidopsis top hitse value%identityAlignment
AT2G23700.1 Protein of unknown function, DUF5471.3e-7536.27Show/hide
Query:  EAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMD
        E +K   PD   ++SLKQEI +L+++LQ+QF +R ALEKA+ ++  S D    +S PK   ELIK+IAVLE+EV +LE+YLLSLYR+ F QQ SS S   
Subjt:  EAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMD

Query:  DQLESYSGPHIVI----------DREHSFIHSDHIVSP---------------QTSLSNQ--------------SKGRNEVEEAEKL-------------
         + +S   P   +               F   + + SP               Q SL+ Q              S GR   +E  ++             
Subjt:  DQLESYSGPHIVI----------DREHSFIHSDHIVSP---------------QTSLSNQ--------------SKGRNEVEEAEKL-------------

Query:  -----LHFG-------------------------------------------------------------RSYSSLLQRSPGSSKNYPLSKYMAKAVDSY
              HF                                                              R  SSL QRS  +++  P       +V + 
Subjt:  -----LHFG-------------------------------------------------------------RSYSSLLQRSPGSSKNYPLSKYMAKAVDSY

Query:  HSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFD
        HS PLS+ E  Q + SN  SL EH G  I D   ++PN LSEEMIK  SAIY +LA+PP INH   SP +  SS  E S +D   M  +      NS FD
Subjt:  HSLPLSMLEQSQSDASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFD

Query:  NPFHIEEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGH
        + F   EFS PY +M++V  I R+RK+  D+  M + F   + +L+ VD + + H+E+LAFWINVHN LVMH +L  GIP+N+ KR  L+ K AY +GG 
Subjt:  NPFHIEEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGH

Query:  IISVDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES
        ++S++ IQS IL  ++PR GQWL L L  K KF+  D  + +S+ H EP LYFALC G+HSDPA+R++T K + +ELE+
Subjt:  IISVDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES

AT5G66600.1 Protein of unknown function, DUF5472.0e-9544.96Show/hide
Query:  SHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLS---LDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESY
        S+ + SLKQEI  L+ +LQ QF +R ALEKA+ ++  S   L    + ++PK A +LIK +AVLE+EV++LE+YLLSLYR+ F+QQ+SS S   +  +  
Subjt:  SHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLS---LDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESY

Query:  SGPHIVIDREHSFIHSD--------HIVSPQTSLSNQSKGRN--EVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSD
        S P     R   F   D        H V       NQSK      V+  +    F RS+S   QRS   S+         KA  S HS PL +      +
Subjt:  SGPHIVIDREHSFIHSD--------HIVSPQTSLSNQSKGRN--EVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSD

Query:  ASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNN-PSPITPLSS-------MYELSSRDLGSMRNYEKFALFNSHFDNPFHIE
          N +SL EH G  I D    +PN LSE M+K +S IYC+LAEPP + H    SP + LSS        Y+ SS   G+  +      F+   DN FH+E
Subjt:  ASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNN-PSPITPLSS-------MYELSSRDLGSMRNYEKFALFNSHFDNPFHIE

Query:  ---EFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIIS
           +FS PY ++++V  I RD KK S++  +LQ F+S I RL+EVD + +KH+E+LAFWINVHN LVMHA+L YGIP+N++KR+ L+ KAAYN+GGH IS
Subjt:  ---EFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIIS

Query:  VDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES
         + IQSSILGC++   GQWL L  +S+ KFK  D   +++I+HPEP L+FAL  GSHSDPAVR+YT KR+ +ELE+
Subjt:  VDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES

AT5G66600.2 Protein of unknown function, DUF5472.0e-9544.96Show/hide
Query:  SHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLS---LDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESY
        S+ + SLKQEI  L+ +LQ QF +R ALEKA+ ++  S   L    + ++PK A +LIK +AVLE+EV++LE+YLLSLYR+ F+QQ+SS S   +  +  
Subjt:  SHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLS---LDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESY

Query:  SGPHIVIDREHSFIHSD--------HIVSPQTSLSNQSKGRN--EVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSD
        S P     R   F   D        H V       NQSK      V+  +    F RS+S   QRS   S+         KA  S HS PL +      +
Subjt:  SGPHIVIDREHSFIHSD--------HIVSPQTSLSNQSKGRN--EVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSD

Query:  ASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNN-PSPITPLSS-------MYELSSRDLGSMRNYEKFALFNSHFDNPFHIE
          N +SL EH G  I D    +PN LSE M+K +S IYC+LAEPP + H    SP + LSS        Y+ SS   G+  +      F+   DN FH+E
Subjt:  ASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNN-PSPITPLSS-------MYELSSRDLGSMRNYEKFALFNSHFDNPFHIE

Query:  ---EFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIIS
           +FS PY ++++V  I RD KK S++  +LQ F+S I RL+EVD + +KH+E+LAFWINVHN LVMHA+L YGIP+N++KR+ L+ KAAYN+GGH IS
Subjt:  ---EFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIIS

Query:  VDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES
         + IQSSILGC++   GQWL L  +S+ KFK  D   +++I+HPEP L+FAL  GSHSDPAVR+YT KR+ +ELE+
Subjt:  VDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES

AT5G66600.3 Protein of unknown function, DUF5472.0e-9544.96Show/hide
Query:  SHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLS---LDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESY
        S+ + SLKQEI  L+ +LQ QF +R ALEKA+ ++  S   L    + ++PK A +LIK +AVLE+EV++LE+YLLSLYR+ F+QQ+SS S   +  +  
Subjt:  SHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLS---LDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESY

Query:  SGPHIVIDREHSFIHSD--------HIVSPQTSLSNQSKGRN--EVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSD
        S P     R   F   D        H V       NQSK      V+  +    F RS+S   QRS   S+         KA  S HS PL +      +
Subjt:  SGPHIVIDREHSFIHSD--------HIVSPQTSLSNQSKGRN--EVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSD

Query:  ASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNN-PSPITPLSS-------MYELSSRDLGSMRNYEKFALFNSHFDNPFHIE
          N +SL EH G  I D    +PN LSE M+K +S IYC+LAEPP + H    SP + LSS        Y+ SS   G+  +      F+   DN FH+E
Subjt:  ASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNN-PSPITPLSS-------MYELSSRDLGSMRNYEKFALFNSHFDNPFHIE

Query:  ---EFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIIS
           +FS PY ++++V  I RD KK S++  +LQ F+S I RL+EVD + +KH+E+LAFWINVHN LVMHA+L YGIP+N++KR+ L+ KAAYN+GGH IS
Subjt:  ---EFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIIS

Query:  VDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES
         + IQSSILGC++   GQWL L  +S+ KFK  D   +++I+HPEP L+FAL  GSHSDPAVR+YT KR+ +ELE+
Subjt:  VDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES

AT5G66600.4 Protein of unknown function, DUF5472.0e-9544.96Show/hide
Query:  SHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLS---LDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESY
        S+ + SLKQEI  L+ +LQ QF +R ALEKA+ ++  S   L    + ++PK A +LIK +AVLE+EV++LE+YLLSLYR+ F+QQ+SS S   +  +  
Subjt:  SHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLS---LDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQLESY

Query:  SGPHIVIDREHSFIHSD--------HIVSPQTSLSNQSKGRN--EVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSD
        S P     R   F   D        H V       NQSK      V+  +    F RS+S   QRS   S+         KA  S HS PL +      +
Subjt:  SGPHIVIDREHSFIHSD--------HIVSPQTSLSNQSKGRN--EVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSD

Query:  ASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNN-PSPITPLSS-------MYELSSRDLGSMRNYEKFALFNSHFDNPFHIE
          N +SL EH G  I D    +PN LSE M+K +S IYC+LAEPP + H    SP + LSS        Y+ SS   G+  +      F+   DN FH+E
Subjt:  ASNSLSLKEHPGACIPDQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNN-PSPITPLSS-------MYELSSRDLGSMRNYEKFALFNSHFDNPFHIE

Query:  ---EFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIIS
           +FS PY ++++V  I RD KK S++  +LQ F+S I RL+EVD + +KH+E+LAFWINVHN LVMHA+L YGIP+N++KR+ L+ KAAYN+GGH IS
Subjt:  ---EFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRSFIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIIS

Query:  VDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES
         + IQSSILGC++   GQWL L  +S+ KFK  D   +++I+HPEP L+FAL  GSHSDPAVR+YT KR+ +ELE+
Subjt:  VDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPRLYFALCCGSHSDPAVRIYTAKRVNEELES


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAAGGTTGGTGCCTGTTTGGAAGCTCAGAAAAAGCAACTCCCTGATAGTCATGTTCAGAATTCCTTGAAGCAGGAGATTTTACAGCTTCAAGAACAACTACAGAG
CCAATTTGTCATTCGTCATGCCTTGGAGAAGGCAATGAACTTTCAGCCTCTCTCACTTGATTCGGCAACCGAAAACTCGATCCCGAAGGCTGCGATGGAACTGATTAAGC
AAATCGCAGTCTTGGAGATAGAAGTTGTTTACTTGGAAAAATATCTTCTGTCACTATATCGTCGAACGTTCAAGCAACAAGTATCCTCTTTTTCAACCATGGATGATCAG
CTTGAATCCTATTCTGGGCCTCATATTGTGATAGACAGAGAACATTCTTTCATTCATTCTGACCATATCGTGTCGCCACAAACTTCATTGAGCAATCAATCAAAAGGAAG
AAATGAAGTTGAGGAAGCGGAGAAGCTGTTACACTTTGGTCGCAGCTATTCATCTCTTTTGCAGAGATCGCCTGGTTCATCTAAAAACTACCCTCTGTCAAAGTATATGG
CTAAAGCAGTAGATTCATACCATTCCCTTCCATTATCAATGCTGGAGCAATCTCAGAGTGATGCTTCAAATTCTCTGAGCCTCAAGGAGCATCCCGGTGCCTGTATACCT
GATCAAGCACATGTGTCGCCGAACTGGCTTTCGGAGGAGATGATCAAGTCTATCTCTGCAATATACTGTGAACTTGCAGAACCTCCTTTGATAAATCATAACAATCCTTC
TCCAATCACACCATTGTCATCCATGTATGAGCTTTCTTCACGAGACTTAGGCAGCATGAGGAACTACGAGAAATTTGCGTTGTTCAACTCGCATTTTGATAACCCTTTTC
ACATTGAAGAATTTAGTGCACCATACTACACAATGTTGAAGGTGCAATGGATTTCTAGAGATAGAAAGAAGGACTCAGATATCAGCCACATGCTACAAGGCTTCAGGTCG
TTTATTTATCGGCTCAAAGAAGTTGATCTCAAAGCGATGAAACACAAGGAAAGGCTCGCGTTTTGGATTAATGTACACAACACACTTGTAATGCATGCATATTTGCAATA
TGGGATTCCCAAAAATAGTTTGAAGAGAATATCGTTGATACAGAAGGCTGCATATAATGTTGGGGGTCACATAATAAGTGTAGATATGATACAAAGCTCAATTCTCGGGT
GTCGTTTGCCTCGTTTGGGACAGTGGCTGCACCTGTTCCTCTCTTCAAAAACAAAATTTAAGGTTAATGATGCACTGAAATCCTTTTCAATCAACCACCCCGAACCTCGG
TTATACTTCGCTCTATGTTGCGGGAGCCATTCTGATCCAGCGGTCCGTATCTATACGGCTAAGAGGGTGAATGAGGAGCTGGAGAGTCTTTTGCCAAGATTCAGGTTCAT
GCCTGGAAGATTTGGTGGACATTGTGGAGCGTTTAAAACCCGACGGGCAGGCAAACGACATTCAGCAGCAGCAACGGAAAAAGATTTGGAAAAGTATTGGGTGGATACCT
CACAACTTCACCTTCAGCTTTCTGCTGTCCAAAGAATTGGCATGCCAGTCCCTGCCCTGATAGTTCCATTTCCAACAAGATTCGATCTCGGAGCTACATCAACTCATCAT
TTGGCGTTCAAGGCCGGAAAGTTAAGGGCTACATCCATTTACGTATGGAGAGAGAAGGAAATTGGCCTCGAAAGTCGAATCCCAACAAAGAAGCACCCGCCACAGAGTCA
TAGCTTGGCCGTTCTAAACCCTCTTGCAGATAGGGACGCGGTTGGGCTGGGTTCGGGGCAGATGGAAGAACAAAGCACGGCTATAATTTTAGCAAGAGCAATGGAGCTGA
GGCTGAAGATTAGAAGCTCTGTTAACACCACGACGACGGCCAGTTCGACGGTGACTTCCCAGGAAATTGGGGATGATCGGTCCGCCGTAGATGGAAATGGGGTTGCGGGA
TATAGTGGCACTGGTTCACGGCGGACTGAGGCCGATGCGAGTGGGGAGGCGGAGGAAGATGACGAAGCGGATTTACAACAACGGCAAAGGTATGAGAAAGAAGCAGCCCT
TTCCGAGATTGAGCATAGTCGTAAGATTTTACTAGATAAACTGAAGAAGTACAAAGGGGAGGATTTGGAAGTGATACATGAGGCTTCAGCTTTTGCTGGGGACACAGTGC
AGCACAACCAGGATCTCATGCTTCCGCCATATCCAAGCCATCCTCTTCATTCCCCTTTAGGTAATGGCCACGTACATCCCTTCCCTTCTGGACACAAGTCTGTGAGTAAT
GGGCTAAAGGACATTGCGACAAATAAAGCTACAAAGGAACCCAATGAATCAGAAAGAAAATGCTCGCAAACGGATTCCAGGAACTCGAGGAATGGATTGGGATCTTTTGT
TAGTGTAGCTGCAAAATCAGTGTTCACGATTGTTGGCATAGTATCCATATTGCACTTGTCTGGTTTTAGACCAAAGTTTGGGGGGAAAGTTGCTGCTTTGAAGGTTCTGG
ACCTTCTTCGACAGTCTGCAGCTGAAGATAATGGATCACACAATGAATGTCCTCCGGGTAAATTCCTCGTGATGGAAGATGGGGAGGCTCGATGCGTTGTGAAAGAGAGA
ATTGAAATTCCATTTTCTTCAGTTGTGGCTAAACCAGATGTAAACTATGGATGCGGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAAGGTTGGTGCCTGTTTGGAAGCTCAGAAAAAGCAACTCCCTGATAGTCATGTTCAGAATTCCTTGAAGCAGGAGATTTTACAGCTTCAAGAACAACTACAGAG
CCAATTTGTCATTCGTCATGCCTTGGAGAAGGCAATGAACTTTCAGCCTCTCTCACTTGATTCGGCAACCGAAAACTCGATCCCGAAGGCTGCGATGGAACTGATTAAGC
AAATCGCAGTCTTGGAGATAGAAGTTGTTTACTTGGAAAAATATCTTCTGTCACTATATCGTCGAACGTTCAAGCAACAAGTATCCTCTTTTTCAACCATGGATGATCAG
CTTGAATCCTATTCTGGGCCTCATATTGTGATAGACAGAGAACATTCTTTCATTCATTCTGACCATATCGTGTCGCCACAAACTTCATTGAGCAATCAATCAAAAGGAAG
AAATGAAGTTGAGGAAGCGGAGAAGCTGTTACACTTTGGTCGCAGCTATTCATCTCTTTTGCAGAGATCGCCTGGTTCATCTAAAAACTACCCTCTGTCAAAGTATATGG
CTAAAGCAGTAGATTCATACCATTCCCTTCCATTATCAATGCTGGAGCAATCTCAGAGTGATGCTTCAAATTCTCTGAGCCTCAAGGAGCATCCCGGTGCCTGTATACCT
GATCAAGCACATGTGTCGCCGAACTGGCTTTCGGAGGAGATGATCAAGTCTATCTCTGCAATATACTGTGAACTTGCAGAACCTCCTTTGATAAATCATAACAATCCTTC
TCCAATCACACCATTGTCATCCATGTATGAGCTTTCTTCACGAGACTTAGGCAGCATGAGGAACTACGAGAAATTTGCGTTGTTCAACTCGCATTTTGATAACCCTTTTC
ACATTGAAGAATTTAGTGCACCATACTACACAATGTTGAAGGTGCAATGGATTTCTAGAGATAGAAAGAAGGACTCAGATATCAGCCACATGCTACAAGGCTTCAGGTCG
TTTATTTATCGGCTCAAAGAAGTTGATCTCAAAGCGATGAAACACAAGGAAAGGCTCGCGTTTTGGATTAATGTACACAACACACTTGTAATGCATGCATATTTGCAATA
TGGGATTCCCAAAAATAGTTTGAAGAGAATATCGTTGATACAGAAGGCTGCATATAATGTTGGGGGTCACATAATAAGTGTAGATATGATACAAAGCTCAATTCTCGGGT
GTCGTTTGCCTCGTTTGGGACAGTGGCTGCACCTGTTCCTCTCTTCAAAAACAAAATTTAAGGTTAATGATGCACTGAAATCCTTTTCAATCAACCACCCCGAACCTCGG
TTATACTTCGCTCTATGTTGCGGGAGCCATTCTGATCCAGCGGTCCGTATCTATACGGCTAAGAGGGTGAATGAGGAGCTGGAGAGTCTTTTGCCAAGATTCAGGTTCAT
GCCTGGAAGATTTGGTGGACATTGTGGAGCGTTTAAAACCCGACGGGCAGGCAAACGACATTCAGCAGCAGCAACGGAAAAAGATTTGGAAAAGTATTGGGTGGATACCT
CACAACTTCACCTTCAGCTTTCTGCTGTCCAAAGAATTGGCATGCCAGTCCCTGCCCTGATAGTTCCATTTCCAACAAGATTCGATCTCGGAGCTACATCAACTCATCAT
TTGGCGTTCAAGGCCGGAAAGTTAAGGGCTACATCCATTTACGTATGGAGAGAGAAGGAAATTGGCCTCGAAAGTCGAATCCCAACAAAGAAGCACCCGCCACAGAGTCA
TAGCTTGGCCGTTCTAAACCCTCTTGCAGATAGGGACGCGGTTGGGCTGGGTTCGGGGCAGATGGAAGAACAAAGCACGGCTATAATTTTAGCAAGAGCAATGGAGCTGA
GGCTGAAGATTAGAAGCTCTGTTAACACCACGACGACGGCCAGTTCGACGGTGACTTCCCAGGAAATTGGGGATGATCGGTCCGCCGTAGATGGAAATGGGGTTGCGGGA
TATAGTGGCACTGGTTCACGGCGGACTGAGGCCGATGCGAGTGGGGAGGCGGAGGAAGATGACGAAGCGGATTTACAACAACGGCAAAGGTATGAGAAAGAAGCAGCCCT
TTCCGAGATTGAGCATAGTCGTAAGATTTTACTAGATAAACTGAAGAAGTACAAAGGGGAGGATTTGGAAGTGATACATGAGGCTTCAGCTTTTGCTGGGGACACAGTGC
AGCACAACCAGGATCTCATGCTTCCGCCATATCCAAGCCATCCTCTTCATTCCCCTTTAGGTAATGGCCACGTACATCCCTTCCCTTCTGGACACAAGTCTGTGAGTAAT
GGGCTAAAGGACATTGCGACAAATAAAGCTACAAAGGAACCCAATGAATCAGAAAGAAAATGCTCGCAAACGGATTCCAGGAACTCGAGGAATGGATTGGGATCTTTTGT
TAGTGTAGCTGCAAAATCAGTGTTCACGATTGTTGGCATAGTATCCATATTGCACTTGTCTGGTTTTAGACCAAAGTTTGGGGGGAAAGTTGCTGCTTTGAAGGTTCTGG
ACCTTCTTCGACAGTCTGCAGCTGAAGATAATGGATCACACAATGAATGTCCTCCGGGTAAATTCCTCGTGATGGAAGATGGGGAGGCTCGATGCGTTGTGAAAGAGAGA
ATTGAAATTCCATTTTCTTCAGTTGTGGCTAAACCAGATGTAAACTATGGATGCGGGTAA
Protein sequenceShow/hide protein sequence
MEKVGACLEAQKKQLPDSHVQNSLKQEILQLQEQLQSQFVIRHALEKAMNFQPLSLDSATENSIPKAAMELIKQIAVLEIEVVYLEKYLLSLYRRTFKQQVSSFSTMDDQ
LESYSGPHIVIDREHSFIHSDHIVSPQTSLSNQSKGRNEVEEAEKLLHFGRSYSSLLQRSPGSSKNYPLSKYMAKAVDSYHSLPLSMLEQSQSDASNSLSLKEHPGACIP
DQAHVSPNWLSEEMIKSISAIYCELAEPPLINHNNPSPITPLSSMYELSSRDLGSMRNYEKFALFNSHFDNPFHIEEFSAPYYTMLKVQWISRDRKKDSDISHMLQGFRS
FIYRLKEVDLKAMKHKERLAFWINVHNTLVMHAYLQYGIPKNSLKRISLIQKAAYNVGGHIISVDMIQSSILGCRLPRLGQWLHLFLSSKTKFKVNDALKSFSINHPEPR
LYFALCCGSHSDPAVRIYTAKRVNEELESLLPRFRFMPGRFGGHCGAFKTRRAGKRHSAAATEKDLEKYWVDTSQLHLQLSAVQRIGMPVPALIVPFPTRFDLGATSTHH
LAFKAGKLRATSIYVWREKEIGLESRIPTKKHPPQSHSLAVLNPLADRDAVGLGSGQMEEQSTAIILARAMELRLKIRSSVNTTTTASSTVTSQEIGDDRSAVDGNGVAG
YSGTGSRRTEADASGEAEEDDEADLQQRQRYEKEAALSEIEHSRKILLDKLKKYKGEDLEVIHEASAFAGDTVQHNQDLMLPPYPSHPLHSPLGNGHVHPFPSGHKSVSN
GLKDIATNKATKEPNESERKCSQTDSRNSRNGLGSFVSVAAKSVFTIVGIVSILHLSGFRPKFGGKVAALKVLDLLRQSAAEDNGSHNECPPGKFLVMEDGEARCVVKER
IEIPFSSVVAKPDVNYGCG