; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr022562 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr022562
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionSequence-specific DNA binding transcription factors
Genome locationtig00000289:1001288..1002625
RNA-Seq ExpressionSgr022562
SyntenySgr022562
Gene Ontology termsNA
InterPro domainsIPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0054953.1 putative transcription factor [Cucumis melo var. makuwa]6.7e-22488.64Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ QHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHT+S+V+YNKGERCKNSASD++PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG

Query:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD DG GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE
        GTSCQVVENPALLDVIDYLT+K+K+DVRKILNSKQLFYEEMCSYHN+NRLHLPHDPALQRSLQLAFR RDDHDNDEPRRHQNDDFD++E GETD+HDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE

Query:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAY--PQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLEL
        ENF PH DNRR LGV  GS+KR +R QDHDDAH CGNSLS  DCNKSSH +   QFAQ DTAHLETESMKASTSQKQWME RLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAY--PQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH
        EKQKFKWERFNKKKDRELEKM+MVNE MKLENER+ALDLKQK+ GSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH

XP_008441519.2 PREDICTED: LOW QUALITY PROTEIN: uncharacterized protein LOC103485620 [Cucumis melo]3.3e-22388.42Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ QHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHT+S+V+YNKGERCKNSASD++PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG

Query:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD DG GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE
        GTSCQVVENPALLDVIDYLT+K+K+DVRKILNSKQLFYEEMCSYHN+NRLHLPHDPALQRSLQLAFR RDDHDNDEPRRHQNDDFD++E GETD+HDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE

Query:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAY--PQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLEL
        ENF PH DNRR LGV  GS+KR +R QDHDDAH CGNSLS  DCNKSSH +   QFAQ DTAHLETESMKASTSQKQWME RLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAY--PQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH
        EKQKFKWERFNK KDRELEKM+MVNE MKLENER+ALDLKQK+ GSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH

XP_022134251.1 uncharacterized protein LOC111006553 [Momordica charantia]2.7e-22589.66Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQGSFKVHNQA H H      H HTRQGSSANPSIQEGFSLSM  +QNCDHT+SMVDYNKGER KNS SDDEPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG

Query:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASD+DGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE
        GTSCQVVENPALLDV+DYLTDK+K+DVRKILNSKQLFYEEMCSYHN+NRLHLPHDPALQRSLQLAFR RDDHDNDEPRRHQNDDFD+NEHGETD+HDDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE

Query:  ENFAPHGDNRRLLGVGSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLELEKQK
        ENFAPHGDNRRL G GS KR RR QDHD+AH CGNSL+SHDCNKSSH Y QF   DTA LETESMKASTSQKQWME RLLQ+EDQKLQIQVEMLELEKQ+
Subjt:  ENFAPHGDNRRLLGVGSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLELEKQK

Query:  FKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH
        FKWERFNKKKD ELEKM+MVNE MKLENERIALDLKQKE GSGFH
Subjt:  FKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH

XP_023551421.1 uncharacterized protein LOC111809238 [Cucurbita pepo subsp. pepo]6.3e-22288.59Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQG FKVH+QAQHSHALHQQHHPHTRQGS+ANPSIQEGFSLSMGVVQNCDH +S+VDYNKGERCKNSASD+EPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG

Query:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DGHNETSKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD+DG GRRK  IIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE
        GTSC+VVENPALLDV++YLTDKEK+DVRKILNSKQLFYEEMCSYHN+NRLHLPHDPALQRSLQLAFR RDDHDNDEPRRHQNDDFD+NEH ETD+ DDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE

Query:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLELEK
        ENFAPHGD+RR  GV  GS+KR RR QDHDD H CG SLSSH     +HA  QFAQ DTAHLETE MK STSQKQWME RLLQLEDQKLQIQVEMLELEK
Subjt:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLELEK

Query:  QKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH
        QKFKWERFNKKKDRELEKM+MVNE MKLENERIALDLKQKE GSGFH
Subjt:  QKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH

XP_038885368.1 uncharacterized protein LOC120075776 [Benincasa hispida]4.0e-22488.64Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQG FKVHNQ QHSHALHQ HHPHTRQGSSANPSIQEGFSLSMGVV NCDHT+ +V+YNKGERCKNSASD+EPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG

Query:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD DGGGR+K QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE
        GTSCQVVENPALLDVIDYLTDKEK+DVRKILNSKQLFYEEMCSYHN+NRLHLPHDPALQRSLQLAFR RDDHDNDEPRRHQNDDFD+ EHGETD+HDDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE

Query:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQ--FAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLEL
        ENF PH DNRR LGV  GS+KR +R QDHDDAH CGNSLSS DCNKSSH + Q  FAQ DTAHLETESMKASTSQKQWME RLLQLE+QKLQIQVEMLEL
Subjt:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQ--FAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH
        EKQKFKW+RFNKKKDRELE M+MVNE MKLEN+R+ALDLKQK+ GSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH

TrEMBL top hitse value%identityAlignment
A0A0A0KBC2 Uncharacterized protein3.1e-22288.2Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ Q SHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHT+S+V+YNKGERCKNSASD++PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG

Query:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD DGGGRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE
        GTSCQVVENPALLDVIDYLT+K+K+DVRKILNSKQLFYEEMCSYHN+NRLHLPHDPALQRSLQLAFR RDDHDNDEPRRHQNDDFD++E  ETD+HDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE

Query:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAY--PQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLEL
        ENF PH DNRR LGV  GS+KR +R QDHDDAH CGNSLS  DCNKSSH +   QF Q DTAHLETESMKASTSQKQWME RLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAY--PQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH
        EKQKFKWERFNKKKDRELEKM+MVNE MKLENER+ALDLKQK+ GSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH

A0A1S3B4A7 LOW QUALITY PROTEIN: uncharacterized protein LOC1034856201.6e-22388.42Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ QHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHT+S+V+YNKGERCKNSASD++PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG

Query:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD DG GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE
        GTSCQVVENPALLDVIDYLT+K+K+DVRKILNSKQLFYEEMCSYHN+NRLHLPHDPALQRSLQLAFR RDDHDNDEPRRHQNDDFD++E GETD+HDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE

Query:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAY--PQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLEL
        ENF PH DNRR LGV  GS+KR +R QDHDDAH CGNSLS  DCNKSSH +   QFAQ DTAHLETESMKASTSQKQWME RLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAY--PQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH
        EKQKFKWERFNK KDRELEKM+MVNE MKLENER+ALDLKQK+ GSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH

A0A5D3DGK7 Putative transcription factor3.3e-22488.64Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQG FKVHNQ QHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHT+S+V+YNKGERCKNSASD++PSF ED 
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG

Query:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DGHNE SKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDIASD DG GRRK QIIQKKGKWKLISKV+AERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE
        GTSCQVVENPALLDVIDYLT+K+K+DVRKILNSKQLFYEEMCSYHN+NRLHLPHDPALQRSLQLAFR RDDHDNDEPRRHQNDDFD++E GETD+HDD+E
Subjt:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE

Query:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAY--PQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLEL
        ENF PH DNRR LGV  GS+KR +R QDHDDAH CGNSLS  DCNKSSH +   QFAQ DTAHLETESMKASTSQKQWME RLLQLEDQKLQIQVEMLEL
Subjt:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAY--PQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLEL

Query:  EKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH
        EKQKFKWERFNKKKDRELEKM+MVNE MKLENER+ALDLKQK+ GSGFH
Subjt:  EKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH

A0A6J1BXD6 uncharacterized protein LOC1110065531.3e-22589.66Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG
        MEGNLSQGGLIPGG+SYGGLDLQGSFKVHNQA H H      H HTRQGSSANPSIQEGFSLSM  +QNCDHT+SMVDYNKGER KNS SDDEPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG

Query:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASD+DGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE
        GTSCQVVENPALLDV+DYLTDK+K+DVRKILNSKQLFYEEMCSYHN+NRLHLPHDPALQRSLQLAFR RDDHDNDEPRRHQNDDFD+NEHGETD+HDDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE

Query:  ENFAPHGDNRRLLGVGSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLELEKQK
        ENFAPHGDNRRL G GS KR RR QDHD+AH CGNSL+SHDCNKSSH Y QF   DTA LETESMKASTSQKQWME RLLQ+EDQKLQIQVEMLELEKQ+
Subjt:  ENFAPHGDNRRLLGVGSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLELEKQK

Query:  FKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH
        FKWERFNKKKD ELEKM+MVNE MKLENERIALDLKQKE GSGFH
Subjt:  FKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH

A0A6J1FFN2 uncharacterized protein LOC1114452949.8e-22188.37Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG
        MEGNLSQGGLIPGG SYGGLDLQG FKVH+QAQHSHALHQQHHPHTRQGS+ANPSIQEGFSLSMGVVQNCDH +S+VDYNKGERCKNSASD+EPSFTEDG
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG

Query:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
         DGHNETSKGKKGS+WHRVKWTDKMVKLLITAVSYIGDDI SD+DG GRRK Q IQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
Subjt:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE
        GTSC+VVENPALLD+++YLTDKEK+DVRKILNSKQLFYEEMCSYHN+NRLHLPHDPALQRSLQLAFR RDDHDNDEPRRHQNDDFD+NEH ETD+ DDFE
Subjt:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFE

Query:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLELEK
        ENFAPHGDNRR  GV  GS+KR RR QDHDD H CG SLSSH     +H+  QFAQ DTAHLETE MK STSQKQWME RLLQLEDQKLQIQVEMLELEK
Subjt:  ENFAPHGDNRRLLGV--GSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLELEK

Query:  QKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH
        QKFKWERFNKKKDRELEKM+MVNE MKLENERIALDLKQKE GSGFH
Subjt:  QKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGSGFH

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G21200.1 sequence-specific DNA binding transcription factors1.0e-13760.84Show/hide
Query:  MEGNLSQGGLIPGGA-SYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDH----TVSMVDYNKGERCKNSAS-DDEP
        M+GN  QGG++  GA SYGG DLQGS +VH    H  +++QQH    R   ++ P + EG   +M   Q CDH     +SM +  K ER KNS S DDEP
Subjt:  MEGNLSQGGLIPGGA-SYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDH----TVSMVDYNKGERCKNSAS-DDEP

Query:  SFTEDGTDG-HNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKR
        SFTE+G DG HNE ++  KGS W RVKWTDKMVKLLITAVSYIGDD  S  D   RRKF ++QKKGKWK +SKVMAERGY VSPQQCEDKFNDLNKRYK+
Subjt:  SFTEDGTDG-HNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKR

Query:  LNDIIGRGTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEH-GE
        LND++GRGTSCQVVENPALLD I YL DKEK+DVRKI++SK LFYEEMCSYHN NRLHLPHD ALQRSLQLA R+RDDHDND+ R+HQ +D DD +H G+
Subjt:  LNDIIGRGTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEH-GE

Query:  TDDHDDFEENFAPHGD---NRRLLGVGSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQI
         D+HD++EE    +GD   N    G G LK+ R +  H+D     + ++S +CNK S     F+Q D      ES +A + QKQWME R LQLE+QKLQI
Subjt:  TDDHDDFEENFAPHGD---NRRLLGVGSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQI

Query:  QVEMLELEKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETG
        QVE+LELEKQ+F+W+RF+KK+D+ELE+M+M NE MKLEN+R+ L+LKQ+E G
Subjt:  QVEMLELEKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETG

AT1G76870.1 BEST Arabidopsis thaliana protein match is: sequence-specific DNA binding transcription factors (TAIR:AT1G21200.1)8.2e-9548.98Show/hide
Query:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG
        MEGN SQG      +S    DL+ +    NQ        +QHHP++RQ S        GF+ +M              +N  +R K S S+D+       
Subjt:  MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDG

Query:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR
        +DG N   K K+ S W RVKW DKMVKL+ITA+SYIG+D  SD      +KF ++QKKGKW+ +SKVM ERGY VSPQQCEDKFNDLNKRYK+LN+++GR
Subjt:  TDGHNETSKGKKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGR

Query:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQL-AFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDF
        GTSC+VVENP+LLD IDYL +KEK++VR+I++SK LFYEEMCSYHN NRLHLPHDPA+QRSL L    +RDDHDNDE  +HQN+D DD+        DD+
Subjt:  GTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQL-AFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDF

Query:  EENFAPHGDNRRLLGVGSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLELEKQ
        EE      D+   L    LKR R++Q H+D    G+    +D        P+        +  +S KA+  Q+Q +E + L+LE +KLQIQ EM+ELE+Q
Subjt:  EENFAPHGDNRRLLGVGSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLELEKQ

Query:  KFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGS
        +FKWE F+K+++++L KM+M NE MKLENER++L+LK+ E G+
Subjt:  KFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKETGS

AT3G10040.1 sequence-specific DNA binding transcription factors4.3e-5137.53Show/hide
Query:  DDEPSFTEDGTDGHNETSKGKKG----SLWHRVKWTDKMVKLLITAVSYIGDDIA----------SDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVS
        DDE   +  G+  + E S G  G    S WHR+KWTD MV+LLI AV YIGD+            +   GGG     ++QKKGKWK +S+ M E+G+ VS
Subjt:  DDEPSFTEDGTDGHNETSKGKKG----SLWHRVKWTDKMVKLLITAVSYIGDDIA----------SDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVS

Query:  PQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHD--PALQRSLQLAFRTRDDHDN
        PQQCEDKFNDLNKRYKR+NDI+G+G +C+VVEN  LL+ +D+LT K K++V+K+LNSK LF+ EMC+YHN+      HD  P  Q  + +          
Subjt:  PQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLTDKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHD--PALQRSLQLAFRTRDDHDN

Query:  DEPRRHQNDDFDDNEHGETDDHDDFEENFAPHGDNRRLLGVGSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQW
          P + QN  F   E G+     +  E             V        A+D +          +    + S A  +  + + A +  +  K+   +K+W
Subjt:  DEPRRHQNDDFDDNEHGETDDHDDFEENFAPHGDNRRLLGVGSLKRPRRAQDHDDAHTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQW

Query:  MEHRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKE
        +  ++L++E++K+  + E +E+EKQ+ KW R+  KK+RE+EK K+ N+  +LE ER+ L L++ E
Subjt:  MEHRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGGGAATTTATCACAAGGAGGGTTGATTCCAGGAGGGGCCTCTTATGGAGGTCTCGATTTGCAAGGATCTTTTAAGGTTCATAATCAGGCACAACACTCTCACGC
TTTACACCAGCAACATCATCCTCATACTCGTCAGGGTTCTTCAGCCAATCCCTCCATTCAGGAGGGCTTTTCACTTTCCATGGGAGTCGTACAAAATTGTGATCACACCG
TGTCCATGGTAGATTATAACAAAGGTGAAAGGTGTAAAAACTCAGCCAGTGACGACGAGCCGAGCTTTACTGAAGATGGTACTGATGGTCATAATGAGACTAGTAAGGGG
AAGAAGGGATCTCTATGGCATCGCGTGAAATGGACAGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTGCCTCAGATTATGATGGGGG
TGGAAGAAGGAAATTTCAAATTATACAGAAGAAAGGTAAATGGAAATTGATATCAAAGGTCATGGCTGAAAGAGGATATCAAGTTTCACCCCAGCAGTGTGAGGATAAAT
TCAATGACCTCAATAAGAGGTATAAGAGGCTCAATGATATCATTGGAAGAGGCACTTCTTGCCAGGTTGTTGAGAACCCTGCACTTCTTGATGTCATAGATTATTTAACA
GACAAAGAAAAGGAAGATGTGAGAAAAATTTTAAACTCAAAGCAATTGTTCTATGAAGAGATGTGTTCTTACCATAATGCAAACCGACTCCATCTGCCTCATGATCCTGC
TTTGCAGCGTTCTTTGCAGTTGGCTTTTAGAACAAGGGATGATCACGATAATGATGAGCCAAGGAGACACCAAAATGATGATTTTGATGATAACGAGCATGGTGAAACTG
ATGATCATGACGATTTTGAGGAGAATTTTGCACCCCATGGGGACAACAGGCGATTACTTGGAGTAGGCTCGCTGAAGAGGCCAAGGCGAGCCCAAGACCATGATGATGCT
CATACTTGTGGAAATTCCTTGAGTTCTCATGATTGCAACAAAAGTTCTCATGCTTACCCACAATTTGCACAAGTCGATACAGCTCACTTAGAAACTGAAAGTATGAAAGC
TTCTACGTCACAAAAGCAGTGGATGGAGCACCGCTTACTTCAGTTGGAAGATCAGAAGCTTCAAATTCAAGTTGAAATGTTGGAATTGGAGAAACAGAAGTTCAAGTGGG
AGAGATTTAACAAGAAAAAGGACCGTGAGTTGGAAAAAATGAAGATGGTAAATGAGAGTATGAAGCTTGAAAATGAGCGCATCGCACTCGACTTAAAGCAAAAGGAAACT
GGATCAGGATTTCATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGGGAATTTATCACAAGGAGGGTTGATTCCAGGAGGGGCCTCTTATGGAGGTCTCGATTTGCAAGGATCTTTTAAGGTTCATAATCAGGCACAACACTCTCACGC
TTTACACCAGCAACATCATCCTCATACTCGTCAGGGTTCTTCAGCCAATCCCTCCATTCAGGAGGGCTTTTCACTTTCCATGGGAGTCGTACAAAATTGTGATCACACCG
TGTCCATGGTAGATTATAACAAAGGTGAAAGGTGTAAAAACTCAGCCAGTGACGACGAGCCGAGCTTTACTGAAGATGGTACTGATGGTCATAATGAGACTAGTAAGGGG
AAGAAGGGATCTCTATGGCATCGCGTGAAATGGACAGATAAAATGGTGAAGCTTCTGATTACAGCAGTGTCTTATATAGGAGATGATATTGCCTCAGATTATGATGGGGG
TGGAAGAAGGAAATTTCAAATTATACAGAAGAAAGGTAAATGGAAATTGATATCAAAGGTCATGGCTGAAAGAGGATATCAAGTTTCACCCCAGCAGTGTGAGGATAAAT
TCAATGACCTCAATAAGAGGTATAAGAGGCTCAATGATATCATTGGAAGAGGCACTTCTTGCCAGGTTGTTGAGAACCCTGCACTTCTTGATGTCATAGATTATTTAACA
GACAAAGAAAAGGAAGATGTGAGAAAAATTTTAAACTCAAAGCAATTGTTCTATGAAGAGATGTGTTCTTACCATAATGCAAACCGACTCCATCTGCCTCATGATCCTGC
TTTGCAGCGTTCTTTGCAGTTGGCTTTTAGAACAAGGGATGATCACGATAATGATGAGCCAAGGAGACACCAAAATGATGATTTTGATGATAACGAGCATGGTGAAACTG
ATGATCATGACGATTTTGAGGAGAATTTTGCACCCCATGGGGACAACAGGCGATTACTTGGAGTAGGCTCGCTGAAGAGGCCAAGGCGAGCCCAAGACCATGATGATGCT
CATACTTGTGGAAATTCCTTGAGTTCTCATGATTGCAACAAAAGTTCTCATGCTTACCCACAATTTGCACAAGTCGATACAGCTCACTTAGAAACTGAAAGTATGAAAGC
TTCTACGTCACAAAAGCAGTGGATGGAGCACCGCTTACTTCAGTTGGAAGATCAGAAGCTTCAAATTCAAGTTGAAATGTTGGAATTGGAGAAACAGAAGTTCAAGTGGG
AGAGATTTAACAAGAAAAAGGACCGTGAGTTGGAAAAAATGAAGATGGTAAATGAGAGTATGAAGCTTGAAAATGAGCGCATCGCACTCGACTTAAAGCAAAAGGAAACT
GGATCAGGATTTCATTAA
Protein sequenceShow/hide protein sequence
MEGNLSQGGLIPGGASYGGLDLQGSFKVHNQAQHSHALHQQHHPHTRQGSSANPSIQEGFSLSMGVVQNCDHTVSMVDYNKGERCKNSASDDEPSFTEDGTDGHNETSKG
KKGSLWHRVKWTDKMVKLLITAVSYIGDDIASDYDGGGRRKFQIIQKKGKWKLISKVMAERGYQVSPQQCEDKFNDLNKRYKRLNDIIGRGTSCQVVENPALLDVIDYLT
DKEKEDVRKILNSKQLFYEEMCSYHNANRLHLPHDPALQRSLQLAFRTRDDHDNDEPRRHQNDDFDDNEHGETDDHDDFEENFAPHGDNRRLLGVGSLKRPRRAQDHDDA
HTCGNSLSSHDCNKSSHAYPQFAQVDTAHLETESMKASTSQKQWMEHRLLQLEDQKLQIQVEMLELEKQKFKWERFNKKKDRELEKMKMVNESMKLENERIALDLKQKET
GSGFH