; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0016852 (gene) of Snake gourd v1 genome

Gene IDTan0016852
OrganismTrichosanthes anguina (Snake gourd v1)
DescriptionCUE domain-containing protein
Genome locationLG06:8095682..8101384
RNA-Seq ExpressionTan0016852
SyntenyTan0016852
Gene Ontology termsGO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0043130 - ubiquitin binding (molecular function)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6575892.1 hypothetical protein SDJN03_26531, partial [Cucurbita argyrosperma subsp. sororia]1.2e-21067.35Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD
        M F +VYKCL E+FPEVDHR+LRAVALENPKDVH A+NDVL EV+P F  + I LP +D         EVENK +MDS  + +  MDSL+S T G E SD
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD

Query:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT
         T   S VAH D ALNQSVSEN+Y+ASDDC+QS ENTET SL   A +QED S+V L+ VAPGK N L++ED  YN  E+  +  +I+N VTEDIKQ+K+
Subjt:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT

Query:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL
         F GV+      HSH I DL+   + E +    +   FSA+ N D+Q ANP   NHSPVQS  H HSE D+QKSNANGTSNP P+QECSTGE+ VIEDGL
Subjt:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL

Query:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA
        +GHTI+TQSGQ C+I+LL++ IEDAK+NKITLFSAMQSVI++MK LEH EKYVEKVKE++ANGE EILAK+EEMK+ V   +EANDM+AGEVYGEKAILA
Subjt:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA

Query:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI
        TETRELQSRLLSLS+ERDKSLSILDEMH T+ AR+ AV+A+LK  +EE  A+EE ARKALAE+EALMEKV++ES IL+ EA+EN+KLREFLIH GQ+VDI
Subjt:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI

Query:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE
        LQGEISVI QDVR LKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAA D+ARFC PL D   S++D E G+SSN  KEGSRASS   S++  SSN+L+
Subjt:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE

Query:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE
        EER  RNH KA SDDGWD+FDK+AEFADAP+ VDAKE
Subjt:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE

XP_022953512.1 uncharacterized protein LOC111456040 isoform X1 [Cucurbita moschata]6.9e-21468.29Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD
        M F +VYKCL E+FPEVDHR+LRAVALENPKDVH A+NDVL EV+P F  + I LP +DP        EVENK +MDS  + +  MDSL+S T G E SD
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD

Query:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT
         T   S VAH D ALNQSVSEN+Y+ASDDC+QS ENTET SL   A +QED S+V L+ VAPGK N L++ED  YN  E+  +  +I+N VTEDIKQ+KT
Subjt:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT

Query:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL
         F GV+      HSH I DL+   V E +    +   FSA+ N D+Q ANP   NHSPVQS IH HSE D+QKSNANGTSNP P+QECSTGE+ VIEDGL
Subjt:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL

Query:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA
        +GHTI+TQSGQ C+I+LL++ IEDAK+NKITLFSAMQSVI++MK LEH EKYVEKVKE++ANGE EILAK+EEMK+ V   KEANDM+AGEVYGEKAILA
Subjt:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA

Query:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI
        TETRELQSRLLSLS+ERDKSLSILDEMH T+ AR+ AV+A+LK  +EE LA+EE ARKALAE+EALMEKV++ES IL+ EA+EN+KLREFLIH GQ+VDI
Subjt:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI

Query:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE
        LQGEISVI QDVR LKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAA D+ARFC PL D   S++D E G+SSN  KEGSRASS   S++  SSN+L+
Subjt:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE

Query:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE
        EER  RNH KA SDDGWD+FDK+AEFADAP+ VDAKE
Subjt:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE

XP_022953515.1 uncharacterized protein LOC111456040 isoform X2 [Cucurbita moschata]7.1e-21167.97Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD
        M F +VYKCL E+FPEVDHR+LRAVALENPKDVH A+NDVL EV+P F  + I LP +DP        EVENK          M MDSL+S T G E SD
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD

Query:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT
         T   S VAH D ALNQSVSEN+Y+ASDDC+QS ENTET SL   A +QED S+V L+ VAPGK N L++ED  YN  E+  +  +I+N VTEDIKQ+KT
Subjt:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT

Query:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL
         F GV+      HSH I DL+   V E +    +   FSA+ N D+Q ANP   NHSPVQS IH HSE D+QKSNANGTSNP P+QECSTGE+ VIEDGL
Subjt:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL

Query:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA
        +GHTI+TQSGQ C+I+LL++ IEDAK+NKITLFSAMQSVI++MK LEH EKYVEKVKE++ANGE EILAK+EEMK+ V   KEANDM+AGEVYGEKAILA
Subjt:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA

Query:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI
        TETRELQSRLLSLS+ERDKSLSILDEMH T+ AR+ AV+A+LK  +EE LA+EE ARKALAE+EALMEKV++ES IL+ EA+EN+KLREFLIH GQ+VDI
Subjt:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI

Query:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE
        LQGEISVI QDVR LKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAA D+ARFC PL D   S++D E G+SSN  KEGSRASS   S++  SSN+L+
Subjt:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE

Query:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE
        EER  RNH KA SDDGWD+FDK+AEFADAP+ VDAKE
Subjt:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE

XP_022992500.1 uncharacterized protein LOC111488818 isoform X1 [Cucurbita maxima]1.2e-21368.29Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD
        M  ++VYKCL E+FPEVDHR+LRAVALENPKDVH A+NDVL EV+P F  + I LP +DP        EVENK +MDS  + +  MDSLES T G E SD
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD

Query:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT
         T   S VAH D ALNQSVSE +Y+ASDDC+QS ENTET SL V A +QED S+V L+ VAPGK N L+HED  YN  E+  +  +I+N VTEDIKQ+KT
Subjt:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT

Query:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL
        LF GV+      HSH I DL+   V E +    +   FSA+ N D+Q ANP   NHSPVQS IH HSE D+QKSNANGTSNP  +Q+CSTGE+ VIEDGL
Subjt:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL

Query:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA
        +GHTI+TQSGQ C+I+LL++ IEDAK+NKITLFSAMQSVI++MK LEH EKYVEKVKE++ NGE EILAKVEEMK+ V   KEANDM+AGEVYGEKAILA
Subjt:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA

Query:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI
        TETRELQSRLLSLS+ERDKSLSILDEMH T+ AR+ AV+A+LK  +EE LA+EE ARKALAE+EALMEKV++ES ILQ EAEEN+KLREFL+H GQ+VDI
Subjt:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI

Query:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE
        LQGEISVI QDVR LKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAA D+ARF  PL D   S++D E G+SSN  KEGSRASS   S++  SSN+L+
Subjt:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE

Query:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE
        EER  RNH KA SDDGWD+FDK+AEFADAP+ VDAKE
Subjt:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE

XP_023548577.1 uncharacterized protein LOC111807200 [Cucurbita pepo subsp. pepo]1.0e-21268.13Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD
        M F++VYKCL E+FPEVDHR+LRAVALENPKDVHVA+NDVL EVIP F  +  +LP +DP        EVENK +MDS  + +  MDSL+S T G E SD
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD

Query:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT
         T   S VAH D ALNQSVSE +Y+ASDD +QS ENTET SL   A +QED S+V L+ VAPGK N L+HED  YN  E+  +  +I+N VTEDIKQ+KT
Subjt:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT

Query:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL
         F GV+      HSH I DL+   V E +    +   F+A+ N D+Q ANP   NHSPVQS IH HSE D+QKSNANGTSNP P+QECSTGE+ VIEDGL
Subjt:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL

Query:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA
        +GHTI+TQSGQ C+I+LL++ IEDAK+NKITLFSAMQSVI++MK LEH EKYVEKVKE++ANGE EILAKVEEMK+ V   KEANDM+AGEVYGEKAILA
Subjt:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA

Query:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI
        TETRELQSRLLSLS+ERDKSLSILDEMH T+ AR+ AV+A+LK  +EE LA+EE ARKALAE+EALMEKV++ES IL+ EA+EN+KLREFLIH GQ+VDI
Subjt:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI

Query:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE
        LQGEISVI QDVR LKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAA D+ RFC PL D   S++D E G+SSN  KEGSRASS   S++  SSN+L+
Subjt:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE

Query:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE
        EER  RNH KA SDDGWD+FDK+AEFADAP+ VDAKE
Subjt:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE

TrEMBL top hitse value%identityAlignment
A0A6J1GN67 uncharacterized protein LOC111456040 isoform X33.3e-20666.72Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD
        M F +VYKCL E+FPEVDHR+LRAVALENPKDVH A+NDVL EV+P F  + I LP +DP        EVENK +MDS  + +  MDSL+S T G E SD
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD

Query:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT
         T   S VAH D ALNQSVSEN+Y+ASDDC+QS ENTET SL   A +QED S+V L+ VAPGK N L++ED  YN  E+  +  +I+N VTEDIKQ+KT
Subjt:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT

Query:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL
         F GV+      HSH I DL+   V E +    +   FSA+ N D+Q ANP   NHSPVQS IH HSE D+QKSNANGTSNP P+QECSTGE+ VIEDGL
Subjt:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL

Query:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA
        +GHTI+TQSGQ C+I+LL++ IEDAK+NKITLFSAMQSVI++MK LEH EKYVEKVKE++ANGE EILAK+EEMK+ V   KEANDM+AGEVYGEKAILA
Subjt:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA

Query:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI
        TETRELQSRLLSLS           EMH T+ AR+ AV+A+LK  +EE LA+EE ARKALAE+EALMEKV++ES IL+ EA+EN+KLREFLIH GQ+VDI
Subjt:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI

Query:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE
        LQGEISVI QDVR LKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAA D+ARFC PL D   S++D E G+SSN  KEGSRASS   S++  SSN+L+
Subjt:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE

Query:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE
        EER  RNH KA SDDGWD+FDK+AEFADAP+ VDAKE
Subjt:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE

A0A6J1GNI7 uncharacterized protein LOC111456040 isoform X13.3e-21468.29Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD
        M F +VYKCL E+FPEVDHR+LRAVALENPKDVH A+NDVL EV+P F  + I LP +DP        EVENK +MDS  + +  MDSL+S T G E SD
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD

Query:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT
         T   S VAH D ALNQSVSEN+Y+ASDDC+QS ENTET SL   A +QED S+V L+ VAPGK N L++ED  YN  E+  +  +I+N VTEDIKQ+KT
Subjt:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT

Query:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL
         F GV+      HSH I DL+   V E +    +   FSA+ N D+Q ANP   NHSPVQS IH HSE D+QKSNANGTSNP P+QECSTGE+ VIEDGL
Subjt:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL

Query:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA
        +GHTI+TQSGQ C+I+LL++ IEDAK+NKITLFSAMQSVI++MK LEH EKYVEKVKE++ANGE EILAK+EEMK+ V   KEANDM+AGEVYGEKAILA
Subjt:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA

Query:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI
        TETRELQSRLLSLS+ERDKSLSILDEMH T+ AR+ AV+A+LK  +EE LA+EE ARKALAE+EALMEKV++ES IL+ EA+EN+KLREFLIH GQ+VDI
Subjt:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI

Query:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE
        LQGEISVI QDVR LKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAA D+ARFC PL D   S++D E G+SSN  KEGSRASS   S++  SSN+L+
Subjt:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE

Query:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE
        EER  RNH KA SDDGWD+FDK+AEFADAP+ VDAKE
Subjt:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE

A0A6J1GPU9 uncharacterized protein LOC111456040 isoform X23.5e-21167.97Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD
        M F +VYKCL E+FPEVDHR+LRAVALENPKDVH A+NDVL EV+P F  + I LP +DP        EVENK          M MDSL+S T G E SD
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD

Query:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT
         T   S VAH D ALNQSVSEN+Y+ASDDC+QS ENTET SL   A +QED S+V L+ VAPGK N L++ED  YN  E+  +  +I+N VTEDIKQ+KT
Subjt:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT

Query:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL
         F GV+      HSH I DL+   V E +    +   FSA+ N D+Q ANP   NHSPVQS IH HSE D+QKSNANGTSNP P+QECSTGE+ VIEDGL
Subjt:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL

Query:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA
        +GHTI+TQSGQ C+I+LL++ IEDAK+NKITLFSAMQSVI++MK LEH EKYVEKVKE++ANGE EILAK+EEMK+ V   KEANDM+AGEVYGEKAILA
Subjt:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA

Query:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI
        TETRELQSRLLSLS+ERDKSLSILDEMH T+ AR+ AV+A+LK  +EE LA+EE ARKALAE+EALMEKV++ES IL+ EA+EN+KLREFLIH GQ+VDI
Subjt:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI

Query:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE
        LQGEISVI QDVR LKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAA D+ARFC PL D   S++D E G+SSN  KEGSRASS   S++  SSN+L+
Subjt:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE

Query:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE
        EER  RNH KA SDDGWD+FDK+AEFADAP+ VDAKE
Subjt:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE

A0A6J1JQ13 uncharacterized protein LOC111488818 isoform X25.9e-21167.97Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD
        M  ++VYKCL E+FPEVDHR+LRAVALENPKDVH A+NDVL EV+P F  + I LP +DP        EVENK          M MDSLES T G E SD
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD

Query:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT
         T   S VAH D ALNQSVSE +Y+ASDDC+QS ENTET SL V A +QED S+V L+ VAPGK N L+HED  YN  E+  +  +I+N VTEDIKQ+KT
Subjt:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT

Query:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL
        LF GV+      HSH I DL+   V E +    +   FSA+ N D+Q ANP   NHSPVQS IH HSE D+QKSNANGTSNP  +Q+CSTGE+ VIEDGL
Subjt:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL

Query:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA
        +GHTI+TQSGQ C+I+LL++ IEDAK+NKITLFSAMQSVI++MK LEH EKYVEKVKE++ NGE EILAKVEEMK+ V   KEANDM+AGEVYGEKAILA
Subjt:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA

Query:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI
        TETRELQSRLLSLS+ERDKSLSILDEMH T+ AR+ AV+A+LK  +EE LA+EE ARKALAE+EALMEKV++ES ILQ EAEEN+KLREFL+H GQ+VDI
Subjt:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI

Query:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE
        LQGEISVI QDVR LKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAA D+ARF  PL D   S++D E G+SSN  KEGSRASS   S++  SSN+L+
Subjt:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE

Query:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE
        EER  RNH KA SDDGWD+FDK+AEFADAP+ VDAKE
Subjt:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE

A0A6J1JVV4 uncharacterized protein LOC111488818 isoform X15.7e-21468.29Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD
        M  ++VYKCL E+FPEVDHR+LRAVALENPKDVH A+NDVL EV+P F  + I LP +DP        EVENK +MDS  + +  MDSLES T G E SD
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD

Query:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT
         T   S VAH D ALNQSVSE +Y+ASDDC+QS ENTET SL V A +QED S+V L+ VAPGK N L+HED  YN  E+  +  +I+N VTEDIKQ+KT
Subjt:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKT

Query:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL
        LF GV+      HSH I DL+   V E +    +   FSA+ N D+Q ANP   NHSPVQS IH HSE D+QKSNANGTSNP  +Q+CSTGE+ VIEDGL
Subjt:  LFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGL

Query:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA
        +GHTI+TQSGQ C+I+LL++ IEDAK+NKITLFSAMQSVI++MK LEH EKYVEKVKE++ NGE EILAKVEEMK+ V   KEANDM+AGEVYGEKAILA
Subjt:  MGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILA

Query:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI
        TETRELQSRLLSLS+ERDKSLSILDEMH T+ AR+ AV+A+LK  +EE LA+EE ARKALAE+EALMEKV++ES ILQ EAEEN+KLREFL+H GQ+VDI
Subjt:  TETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDI

Query:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE
        LQGEISVI QDVR LKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAA D+ARF  PL D   S++D E G+SSN  KEGSRASS   S++  SSN+L+
Subjt:  LQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV--SRTISSSNDLE

Query:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE
        EER  RNH KA SDDGWD+FDK+AEFADAP+ VDAKE
Subjt:  EERG-RNHFKAISDDGWDLFDKEAEFADAPYFVDAKE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G03290.1 unknown protein1.1e-7637.36Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVE--SVSEIEVENKTKMDSLRYGQ-MNMDSLESDTTGSE
        MGF +VY+ L E+FP++D RILRAVA+E+PKD   A   VL E+IP F  ++    ++  +    S+SE EVE+  +  + R    +     ++ T+ S 
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVE--SVSEIEVENKTKMDSLRYGQ-MNMDSLESDTTGSE

Query:  TSDDTF-------HESSVAHQD--AALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIV
        +S +T        H +     D  + +N+  +    +  D C +  E+ E  S+    G +  N ++                   +  C ++   A+I 
Subjt:  TSDDTF-------HESSVAHQD--AALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIV

Query:  NQVTEDIKQDKTLFHGVDGSLNDFHSHVIRDLNTMLV--QELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSN-ANGTSNPGPQ
          V ED          +D       S+   DL   +   Q   +V    DS   +  +  Q    C               E  H  +N  + TSN    
Subjt:  NQVTEDIKQDKTLFHGVDGSLNDFHSHVIRDLNTMLV--QELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSN-ANGTSNPGPQ

Query:  QECSTGELSVIEDGLMGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEAN
         E    E+             + S  +C++D L++IIEDAK+NK  L + M++V N M+E+E KEK  EK KEEAA G  + L KVEE+K+ +  AKEAN
Subjt:  QECSTGELSVIEDGLMGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEAN

Query:  DMNAGEVYGEKAILATETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENS
        DM+AGEVYGEK+ILATE +EL++RLL+LS+ER+KSL+ILDEM  +L+ R+AA   + K  E+EK  +E+ A KALAE+EA MEKVVQES +LQ EAEENS
Subjt:  DMNAGEVYGEKAILATETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENS

Query:  KLREFLIHRGQVVDILQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRAS
        KLR+FL+ RGQ+VD LQGEISVICQDV+ LKEKF+  VPL+KS+SSS TS    S GSS+KS   +               ++  T  S+N +   + A 
Subjt:  KLREFLIHRGQVVDILQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRAS

Query:  SVSRTISSSNDLEEERGRNHFKAISDDGWDLFDKEAE
         +++      DL E            DGWD+FDKE E
Subjt:  SVSRTISSSNDLEEERGRNHFKAISDDGWDLFDKEAE

AT1G03290.2 unknown protein1.1e-7637.36Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVE--SVSEIEVENKTKMDSLRYGQ-MNMDSLESDTTGSE
        MGF +VY+ L E+FP++D RILRAVA+E+PKD   A   VL E+IP F  ++    ++  +    S+SE EVE+  +  + R    +     ++ T+ S 
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVE--SVSEIEVENKTKMDSLRYGQ-MNMDSLESDTTGSE

Query:  TSDDTF-------HESSVAHQD--AALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIV
        +S +T        H +     D  + +N+  +    +  D C +  E+ E  S+    G +  N ++                   +  C ++   A+I 
Subjt:  TSDDTF-------HESSVAHQD--AALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIV

Query:  NQVTEDIKQDKTLFHGVDGSLNDFHSHVIRDLNTMLV--QELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSN-ANGTSNPGPQ
          V ED          +D       S+   DL   +   Q   +V    DS   +  +  Q    C               E  H  +N  + TSN    
Subjt:  NQVTEDIKQDKTLFHGVDGSLNDFHSHVIRDLNTMLV--QELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSN-ANGTSNPGPQ

Query:  QECSTGELSVIEDGLMGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEAN
         E    E+             + S  +C++D L++IIEDAK+NK  L + M++V N M+E+E KEK  EK KEEAA G  + L KVEE+K+ +  AKEAN
Subjt:  QECSTGELSVIEDGLMGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEAN

Query:  DMNAGEVYGEKAILATETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENS
        DM+AGEVYGEK+ILATE +EL++RLL+LS+ER+KSL+ILDEM  +L+ R+AA   + K  E+EK  +E+ A KALAE+EA MEKVVQES +LQ EAEENS
Subjt:  DMNAGEVYGEKAILATETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENS

Query:  KLREFLIHRGQVVDILQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRAS
        KLR+FL+ RGQ+VD LQGEISVICQDV+ LKEKF+  VPL+KS+SSS TS    S GSS+KS   +               ++  T  S+N +   + A 
Subjt:  KLREFLIHRGQVVDILQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRAS

Query:  SVSRTISSSNDLEEERGRNHFKAISDDGWDLFDKEAE
         +++      DL E            DGWD+FDKE E
Subjt:  SVSRTISSSNDLEEERGRNHFKAISDDGWDLFDKEAE

AT4G02880.1 unknown protein3.4e-7837.28Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD
        MG++ VY+ L E+FP++D R+L+AVA+E+PKDV+ A   V+ E++P F  ++    S  P  ++   +  E +  M    +  ++   +    +GS +  
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSD

Query:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRK----------FARIVNQ
          +HE+      A + +SVS+ N +                  V   IQ    ++GLS    G   S V       +C+   K          F    NQ
Subjt:  DTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRK----------FARIVNQ

Query:  VTEDIKQDKTLFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFS-ANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECS
               D         S +  H  V    N  + Q    ++  + S    N  S   +A   +EN     +E+   +  D     +    N  P+    
Subjt:  VTEDIKQDKTLFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFS-ANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECS

Query:  TGELSVIEDGLMGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNA
                DG +  ++  +S Q CN+  L++IIEDAK+NK TLF+ M+S++N M+E+E +EK  EK KE+A+ G  + L KVEE+K+ +  AKEANDM A
Subjt:  TGELSVIEDGLMGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNA

Query:  GEVYGEKAILATETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLRE
        GEVYGE++IL TE  EL++RL+SLS+ERD SLS+LDEM   L+ R+A    +  A E+EK  +E  ARKA AE+EA+ME+VVQES +LQ EAEENSKLRE
Subjt:  GEVYGEKAILATETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLRE

Query:  FLIHRGQVVDILQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSVSR
        FL+  G++VD LQGEISVICQD+R LKEKFD  VPLS+S+SSSQTSC LASS SS+KS   +      PL+                        +S   
Subjt:  FLIHRGQVVDILQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSVSR

Query:  TISSSNDLEEERGRNHFKAISDDGWDLFDKEAE
          +SSN+   +   N  K + DDGWD FDKE E
Subjt:  TISSSNDLEEERGRNHFKAISDDGWDLFDKEAE

AT4G02880.2 unknown protein2.4e-7937.64Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEI--EVENKTKMDSLRYGQMNMDSLESDTTGSET
        MG++ VY+ L E+FP++D R+L+AVA+E+PKDV+ A   V+ E++P F  ++    S  P  ++   +  EVEN  + D + +  ++   +    +GS +
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEI--EVENKTKMDSLRYGQMNMDSLESDTTGSET

Query:  SDDTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRK----------FARIV
            +HE+      A + +SVS+ N +                  V   IQ    ++GLS    G   S V       +C+   K          F    
Subjt:  SDDTFHESSVAHQDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRK----------FARIV

Query:  NQVTEDIKQDKTLFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFS-ANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQE
        NQ       D         S +  H  V    N  + Q    ++  + S    N  S   +A   +EN     +E+   +  D     +    N  P+  
Subjt:  NQVTEDIKQDKTLFHGVDGSLNDFHSHVIRDLNTMLVQELFSVKSNYDSFS-ANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQE

Query:  CSTGELSVIEDGLMGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDM
                  DG +  ++  +S Q CN+  L++IIEDAK+NK TLF+ M+S++N M+E+E +EK  EK KE+A+ G  + L KVEE+K+ +  AKEANDM
Subjt:  CSTGELSVIEDGLMGHTIITQSGQLCNIDLLDEIIEDAKNNKITLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDM

Query:  NAGEVYGEKAILATETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKL
         AGEVYGE++IL TE  EL++RL+SLS+ERD SLS+LDEM   L+ R+A    +  A E+EK  +E  ARKA AE+EA+ME+VVQES +LQ EAEENSKL
Subjt:  NAGEVYGEKAILATETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDAMLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKL

Query:  REFLIHRGQVVDILQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV
        REFL+  G++VD LQGEISVICQD+R LKEKFD  VPLS+S+SSSQTSC LASS SS+KS   +      PL+                        +S 
Subjt:  REFLIHRGQVVDILQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAACDVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSV

Query:  SRTISSSNDLEEERGRNHFKAISDDGWDLFDKEAE
            +SSN+   +   N  K + DDGWD FDKE E
Subjt:  SRTISSSNDLEEERGRNHFKAISDDGWDLFDKEAE

AT5G64980.1 unknown protein2.3e-0545.65Show/hide
Query:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIP
        MGF +VY+ L E+FP++D +ILR VA+E+  D   A + V+ E+ P
Subjt:  MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTCGAGAACGTTTACAAGTGTCTGAGGGAAGTATTTCCGGAGGTTGACCATCGGATACTAAGGGCTGTAGCTCTTGAAAATCCTAAGGATGTTCATGTAGCTGT
TAATGATGTCCTCATGGAGGTTATCCCTCGTTTTGATGAAGATATTATTAAATTACCCTCGAAGGATCCCTTTGTTGAGTCTGTATCTGAAATAGAAGTTGAGAACAAAA
CAAAAATGGATTCATTGAGATATGGGCAGATGAATATGGATTCTTTGGAATCAGACACAACTGGTAGTGAAACATCTGATGACACTTTTCATGAGAGTAGTGTGGCACAT
CAAGATGCAGCATTGAATCAATCTGTCTCTGAAAATAATTATATTGCTAGTGATGATTGTGAACAATCACGTGAGAATACTGAAACTACAAGCCTCACTGTACCAGCTGG
CATACAAGAAGACAACAGTGAAGTTGGATTGAGTCATGTGGCACCTGGAAAATCAAATAGTTTGGTCCATGAAGATGGTGGATATAACACGTGTGAAGAAATTCGTAAAT
TTGCCAGAATCGTGAACCAGGTTACTGAGGATATCAAACAGGATAAGACACTGTTTCATGGAGTCGATGGAAGCTTGAACGATTTTCATTCTCATGTCATTCGTGATCTG
AACACCATGCTCGTGCAAGAATTATTCAGTGTTAAATCAAATTATGACTCCTTTTCTGCAAATGGGAATTCTGATAATCAGATTGCCAATCCATGTATTGAAAATCACTC
TCCCGTTCAATCTGAAATCCATTTTCACTCGGAATTTGATCATCAGAAATCAAATGCTAATGGCACTTCAAATCCTGGTCCCCAGCAAGAGTGTTCTACCGGTGAACTGA
GTGTGATAGAAGATGGGTTGATGGGGCATACCATAATCACACAGTCAGGCCAACTGTGTAATATCGATCTTCTTGATGAGATCATTGAGGATGCTAAAAACAACAAGATA
ACGTTGTTCTCAGCGATGCAATCAGTTATCAATGAGATGAAAGAATTGGAACATAAAGAGAAATATGTTGAAAAAGTCAAAGAAGAAGCTGCCAATGGAGAGTCAGAAAT
TCTGGCCAAAGTGGAGGAAATGAAGCGGACGGTAGTCCTTGCTAAGGAAGCAAATGATATGAATGCTGGAGAAGTTTATGGAGAGAAGGCGATTTTAGCGACGGAAACAA
GGGAACTCCAATCTCGTCTGCTTAGCTTGTCAGATGAACGAGACAAATCTCTTTCAATTCTTGATGAGATGCATACAACTCTTAAAGCAAGAATAGCTGCAGTAGATGCT
ATGCTGAAAGCAACAGAGGAAGAAAAGTTAGCCAGGGAAGAACATGCGCGAAAGGCTCTTGCGGAGAAAGAGGCCCTCATGGAGAAGGTCGTCCAGGAATCAATGATTCT
ACAAATGGAGGCTGAGGAGAATTCCAAGTTGCGAGAGTTTCTAATCCATCGTGGGCAAGTAGTTGACATATTACAAGGAGAAATTTCAGTTATTTGTCAAGACGTGAGGC
AGCTGAAGGAGAAGTTTGACTTGGACGTACCATTAAGCAAATCTCTTTCCTCCAGTCAAACAAGCTGCATTCTAGCTTCATCAGGTTCATCTCTAAAAAGTGCAGCTTGC
GATGTGGCTCGTTTTTGCTTACCCCTAAAGGATATCATACCATCTAATATGGATGATGAGACAGGTGCATCATCGAATCCCAGAAAGGAAGGAAGCCGAGCATCCTCGGT
CAGTAGAACTATCTCGTCATCGAACGATCTTGAAGAAGAAAGAGGAAGAAACCATTTCAAAGCAATTTCAGATGATGGGTGGGATTTATTTGACAAAGAGGCTGAGTTTG
CTGATGCTCCATATTTTGTGGATGCAAAAGAATAA
mRNA sequenceShow/hide mRNA sequence
TCTCATTTGATGGAGAACCAGATTGAATTCCTTGCCGGCTAAGTAGCATGCTCTTTGTTACACCCAATTAATCAATTCTCGTTTTCCCACTCTTCGGATTCTCTCTCGAT
CACTGATTAATCTATTTGGGAAATGGGTTTCGAGAACGTTTACAAGTGTCTGAGGGAAGTATTTCCGGAGGTTGACCATCGGATACTAAGGGCTGTAGCTCTTGAAAATC
CTAAGGATGTTCATGTAGCTGTTAATGATGTCCTCATGGAGGTTATCCCTCGTTTTGATGAAGATATTATTAAATTACCCTCGAAGGATCCCTTTGTTGAGTCTGTATCT
GAAATAGAAGTTGAGAACAAAACAAAAATGGATTCATTGAGATATGGGCAGATGAATATGGATTCTTTGGAATCAGACACAACTGGTAGTGAAACATCTGATGACACTTT
TCATGAGAGTAGTGTGGCACATCAAGATGCAGCATTGAATCAATCTGTCTCTGAAAATAATTATATTGCTAGTGATGATTGTGAACAATCACGTGAGAATACTGAAACTA
CAAGCCTCACTGTACCAGCTGGCATACAAGAAGACAACAGTGAAGTTGGATTGAGTCATGTGGCACCTGGAAAATCAAATAGTTTGGTCCATGAAGATGGTGGATATAAC
ACGTGTGAAGAAATTCGTAAATTTGCCAGAATCGTGAACCAGGTTACTGAGGATATCAAACAGGATAAGACACTGTTTCATGGAGTCGATGGAAGCTTGAACGATTTTCA
TTCTCATGTCATTCGTGATCTGAACACCATGCTCGTGCAAGAATTATTCAGTGTTAAATCAAATTATGACTCCTTTTCTGCAAATGGGAATTCTGATAATCAGATTGCCA
ATCCATGTATTGAAAATCACTCTCCCGTTCAATCTGAAATCCATTTTCACTCGGAATTTGATCATCAGAAATCAAATGCTAATGGCACTTCAAATCCTGGTCCCCAGCAA
GAGTGTTCTACCGGTGAACTGAGTGTGATAGAAGATGGGTTGATGGGGCATACCATAATCACACAGTCAGGCCAACTGTGTAATATCGATCTTCTTGATGAGATCATTGA
GGATGCTAAAAACAACAAGATAACGTTGTTCTCAGCGATGCAATCAGTTATCAATGAGATGAAAGAATTGGAACATAAAGAGAAATATGTTGAAAAAGTCAAAGAAGAAG
CTGCCAATGGAGAGTCAGAAATTCTGGCCAAAGTGGAGGAAATGAAGCGGACGGTAGTCCTTGCTAAGGAAGCAAATGATATGAATGCTGGAGAAGTTTATGGAGAGAAG
GCGATTTTAGCGACGGAAACAAGGGAACTCCAATCTCGTCTGCTTAGCTTGTCAGATGAACGAGACAAATCTCTTTCAATTCTTGATGAGATGCATACAACTCTTAAAGC
AAGAATAGCTGCAGTAGATGCTATGCTGAAAGCAACAGAGGAAGAAAAGTTAGCCAGGGAAGAACATGCGCGAAAGGCTCTTGCGGAGAAAGAGGCCCTCATGGAGAAGG
TCGTCCAGGAATCAATGATTCTACAAATGGAGGCTGAGGAGAATTCCAAGTTGCGAGAGTTTCTAATCCATCGTGGGCAAGTAGTTGACATATTACAAGGAGAAATTTCA
GTTATTTGTCAAGACGTGAGGCAGCTGAAGGAGAAGTTTGACTTGGACGTACCATTAAGCAAATCTCTTTCCTCCAGTCAAACAAGCTGCATTCTAGCTTCATCAGGTTC
ATCTCTAAAAAGTGCAGCTTGCGATGTGGCTCGTTTTTGCTTACCCCTAAAGGATATCATACCATCTAATATGGATGATGAGACAGGTGCATCATCGAATCCCAGAAAGG
AAGGAAGCCGAGCATCCTCGGTCAGTAGAACTATCTCGTCATCGAACGATCTTGAAGAAGAAAGAGGAAGAAACCATTTCAAAGCAATTTCAGATGATGGGTGGGATTTA
TTTGACAAAGAGGCTGAGTTTGCTGATGCTCCATATTTTGTGGATGCAAAAGAATAATGTTTGAGGGAGTTGTGAAGAGTCCACAGTTGAAGATTTGTTAGTTTAGCCAC
TTGCTTTTAAGTTTTAAATTCTTTTTCTGTCTCAGTTTTAAGTAGAATTTTGATAGTTAGGAGGCCAAATAAATTTCAATCTCTTGTACATAGATGGAAACTTTTCCGAA
GGGTTTTTCATATTTATTTTTTATATTGGTTTTGCTTCAGTGTATATTGCATTGGATTGGATATTAGATGTTCTTTTGTGATCATAATAAATGTAGTGGGTAAGTGCCAA
GCTTGGATTGTTCTCACTCAGCCTTGTTGATATTGGAAGTTTAACTATTATTGTTATTTGATAA
Protein sequenceShow/hide protein sequence
MGFENVYKCLREVFPEVDHRILRAVALENPKDVHVAVNDVLMEVIPRFDEDIIKLPSKDPFVESVSEIEVENKTKMDSLRYGQMNMDSLESDTTGSETSDDTFHESSVAH
QDAALNQSVSENNYIASDDCEQSRENTETTSLTVPAGIQEDNSEVGLSHVAPGKSNSLVHEDGGYNTCEEIRKFARIVNQVTEDIKQDKTLFHGVDGSLNDFHSHVIRDL
NTMLVQELFSVKSNYDSFSANGNSDNQIANPCIENHSPVQSEIHFHSEFDHQKSNANGTSNPGPQQECSTGELSVIEDGLMGHTIITQSGQLCNIDLLDEIIEDAKNNKI
TLFSAMQSVINEMKELEHKEKYVEKVKEEAANGESEILAKVEEMKRTVVLAKEANDMNAGEVYGEKAILATETRELQSRLLSLSDERDKSLSILDEMHTTLKARIAAVDA
MLKATEEEKLAREEHARKALAEKEALMEKVVQESMILQMEAEENSKLREFLIHRGQVVDILQGEISVICQDVRQLKEKFDLDVPLSKSLSSSQTSCILASSGSSLKSAAC
DVARFCLPLKDIIPSNMDDETGASSNPRKEGSRASSVSRTISSSNDLEEERGRNHFKAISDDGWDLFDKEAEFADAPYFVDAKE