CuGenDBv2

MC08g0079 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC08g0079
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: trihelix transcription factor GT-1-like
Genome location: MC08:478504..484197
RNA-Seq Expression: MC08g0079
Synteny: MC08g0079
Gene Ontology terms: NA
InterPro domains: IPR001005 - SANT/Myb domain
                  IPR044822 - Myb/SANT-like DNA-binding domain 4
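The genome location field above uses a `seqid:start..end` form. The following is a minimal sketch of how such a string could be split into its parts and the gene span computed. The helper names `parse_location` and `span_length` are hypothetical, and the 1-based inclusive coordinate convention is an assumption, not something this record states.

```python
import re

# Hypothetical helper: assumes the "seqid:start..end" form shown in this record.
_LOCATION = re.compile(r"([^:]+):(\d+)\.\.(\d+)")


def parse_location(text):
    """Split a location string into (seqid, start, end)."""
    match = _LOCATION.fullmatch(text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.groups()
    return seqid, int(start), int(end)


def span_length(start, end):
    """Interval length, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("MC08:478504..484197")
print(seqid, start, end, span_length(start, end))
```

Under the inclusive-coordinate assumption, the gene spans 5694 bp on MC08. With a 0-based half-open convention the length would instead be `end - start`.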


Homology
GenBank top hits (e value, % identity, alignment)
XP_022131736.1 trihelix transcription factor GT-1-like isoform X1 [Momordica charantia] (e value: 3.31e-292; identity: 100%)
Query:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
        MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
Subjt:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN

Query:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
        KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
Subjt:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG

Query:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
        GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
Subjt:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR

Query:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
Subjt:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS

XP_022131737.1 trihelix transcription factor GT-1-like isoform X2 [Momordica charantia] (e value: 6.16e-278; identity: 96.61%)
Query:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
        MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
Subjt:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN

Query:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
        KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDK             AG
Subjt:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG

Query:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
        GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
Subjt:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR

Query:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
Subjt:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS

XP_022951166.1 trihelix transcription factor GT-1-like [Cucurbita moschata] (e value: 8.25e-271; identity: 92.97%)
Query:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
        MYL DKPRPIDIYKEEGSRDMMIEVASNGDHH+ PHQ QQ      TH QHQ+MLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRR MDGLFNTSKSN
Subjt:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN

Query:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
        KHLWEQIS+KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSK+A YKSPTPPKIDSY+QFSDKGIEDN LSFGPVEAG
Subjt:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG

Query:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
        GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGE QAFGGRVI+VKWGDYTRRIG+DGTS+AIKEAIKSAFRLRTKRAFWLEDEDQVVR
Subjt:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR

Query:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTE+KIFY ED+YREFLARR WTCLREFDGYRNIDNMDDLRPGA+YRGVS
Subjt:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS

XP_038885458.1 trihelix transcription factor GT-1-like isoform X1 [Benincasa hispida] (e value: 1.78e-272; identity: 92.82%)
Query:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
        MYL DKPRPIDIYKEEGSRDMMIEVASNGDHH+ PHQ  QQ Q QQTH QHQ+MLGDSSGEDHEVKAPKKRAETWVQDETRSLI+LRREMDGLFNTSKSN
Subjt:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN

Query:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
        KHLWEQIS+KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMS YKEIEEILKERSKS  YKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
Subjt:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG

Query:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
        GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQ FGGRVISVKWGDYTRRIG+DGTS+AIKEAIKSAFRLRTKRAFWLEDEDQVVR
Subjt:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR

Query:  SLDRDMPLGNYTLHLDE------GLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        SLDRDMPLGNYTLHLDE      G+AVKICLYDESDHLPVHTEDK+FY+E+DYR+FLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
Subjt:  SLDRDMPLGNYTLHLDE------GLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS

XP_038885460.1 trihelix transcription factor GT-1-like isoform X3 [Benincasa hispida] (e value: 3.65e-275; identity: 94.27%)
Query:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
        MYL DKPRPIDIYKEEGSRDMMIEVASNGDHH+ PHQ  QQ Q QQTH QHQ+MLGDSSGEDHEVKAPKKRAETWVQDETRSLI+LRREMDGLFNTSKSN
Subjt:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN

Query:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
        KHLWEQIS+KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMS YKEIEEILKERSKS  YKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
Subjt:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG

Query:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
        GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQ FGGRVISVKWGDYTRRIG+DGTS+AIKEAIKSAFRLRTKRAFWLEDEDQVVR
Subjt:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR

Query:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        SLDRDMPLGNYTLHLDEG+AVKICLYDESDHLPVHTEDK+FY+E+DYR+FLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
Subjt:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS

TrEMBL top hits (e value, % identity, alignment)
A0A6J1BQB9 trihelix transcription factor GT-1-like isoform X1 (e value: 1.60e-292; identity: 100%)
Query:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
        MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
Subjt:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN

Query:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
        KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
Subjt:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG

Query:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
        GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
Subjt:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR

Query:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
Subjt:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS

A0A6J1BRV7 trihelix transcription factor GT-1-like isoform X2 (e value: 2.98e-278; identity: 96.61%)
Query:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
        MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
Subjt:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN

Query:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
        KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDK             AG
Subjt:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG

Query:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
        GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
Subjt:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR

Query:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
Subjt:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS

A0A6J1GGT5 trihelix transcription factor GT-1-like (e value: 4.00e-271; identity: 92.97%)
Query:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
        MYL DKPRPIDIYKEEGSRDMMIEVASNGDHH+ PHQ QQ      TH QHQ+MLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRR MDGLFNTSKSN
Subjt:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN

Query:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
        KHLWEQIS+KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSK+A YKSPTPPKIDSY+QFSDKGIEDN LSFGPVEAG
Subjt:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG

Query:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
        GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGE QAFGGRVI+VKWGDYTRRIG+DGTS+AIKEAIKSAFRLRTKRAFWLEDEDQVVR
Subjt:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR

Query:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTE+KIFY ED+YREFLARR WTCLREFDGYRNIDNMDDLRPGA+YRGVS
Subjt:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS

A0A6J1HBR7 trihelix transcription factor GT-1-like isoform X1 (e value: 3.03e-268; identity: 92.19%)
Query:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
        MYL DKPRPIDIYKEEGSRDMMIEVASNGDHH+ PHQ Q       TH QHQ+MLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
Subjt:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN

Query:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
        KHLWEQIS+KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKS  YKSPTPPKIDSY+QF+DKGIEDNGL+FGPVE G
Subjt:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG

Query:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
        GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGN GESQAFGGRVISVKWGDYTRRIGIDGTS+AIKEAIKSAFRLRTKRAFWLEDEDQV+R
Subjt:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR

Query:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTE+KIFY+E+DYR+FL RRGWTCLREFDGYRNID MDDLRPGAIYRG+S
Subjt:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS

A0A6J1KR52 trihelix transcription factor GT-1-like (e value: 4.00e-271; identity: 92.97%)
Query:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN
        MYL DKPRPIDIYKEEGSRDMMIEVASNGDHH+ PHQ QQ      TH QHQ+MLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRR MDGLFNTSKSN
Subjt:  MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSN

Query:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG
        KHLWEQIS+KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSK+A YKSPTPPKIDSY+QFSDKGIEDN LSFGPVEAG
Subjt:  KHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAG

Query:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR
        GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGE QAFGGRVI+VKWGDYTRRIG+DGTS+AIKEAIKSAFRLRTKRAFWLEDEDQVVR
Subjt:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVR

Query:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTE+KIFY ED+YREFLARR WTCLREFDGYRNIDNMDDLRPGA+YRGVS
Subjt:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS

SwissProt top hits (e value, % identity, alignment)
O80450 Trihelix transcription factor GT-3b (e value: 3.6e-11; identity: 34.65%)
Query:  HQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFK
        HQH    Q Q  + +H L     + E     A   R   W  +ET+ LI +R E+D  F  +K NK LWE IS+KMR++ F RSP  C  KW+NL+  FK
Subjt:  HQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFK

Query:  KAKHHDRGSGSAKMSYYKEIEEILKER
          +  +  +   +  +Y +++ I   R
Subjt:  KAKHHDRGSGSAKMSYYKEIEEILKER

Q9C6K3 Trihelix transcription factor DF1 (e value: 5.3e-10; identity: 31.82%)
Query:  DSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHD--RGSGSAKMSYYKE
        D+ G+ +   A    +  W + E  +LI LR  +D  +  +     LWE+IS+ MR  GF+R+   C +KW N+ K FKK K  +  R   S    Y+ +
Subjt:  DSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHD--RGSGSAKMSYYKE

Query:  IEEILKERSK
        ++ + +ER+K
Subjt:  IEEILKERSK

Q9FX53 Trihelix transcription factor GT-1 (e value: 2.1e-160; identity: 69.7%)
Query:  MYLPDKPRPIDIYKEE--------GSRDMMIEV--ASNGDHHLPPHQHQQQQQQ--QQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRR
        M++ DK RP D YK++         +RDMMI+V   +N    L  H H         Q+ PQ Q++LG+SSGEDHEVKAPKKRAETWVQDETRSLI  RR
Subjt:  MYLPDKPRPIDIYKEE--------GSRDMMIEV--ASNGDHHLPPHQHQQQQQQ--QQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRR

Query:  EMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSA----HYKSP-TPP---KIDSYM
         MDGLFNTSKSNKHLWEQISSKMRE+GFDRSPTMCTDKWRNLLKEFKKAKHHDRG+GSAKMSYYKEIE+IL+ERSK      + KSP TPP   K+DS+M
Subjt:  EMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSA----HYKSP-TPP---KIDSYM

Query:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAI-TAADAVAATGIPPWNWREAPGNGGES--QAFGGRVISVKWGDYTRRIGIDGTSDAIKEA
        QF+DKG +D  +SFG VEA GRP+LNLER+LDHDGHPLAI TA DAVAA G+ PWNWRE PGNG +S  Q FGGRVI+VK+GDYTRRIG+DG+++AIKE 
Subjt:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAI-TAADAVAATGIPPWNWREAPGNGGES--QAFGGRVISVKWGDYTRRIGIDGTSDAIKEA

Query:  IKSAFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPG
        I+SAF LRT+RAFWLEDEDQ++R LDRDMPLGNY L LD+GLA+++C YDES+ LPVH+E+KIFY E+DYREFLAR+GW+ L + DG+RNI+NMDDL+PG
Subjt:  IKSAFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPG

Query:  AIYRGV
        A+YRGV
Subjt:  AIYRGV

Q9LU92 Trihelix transcription factor GT-4 (e value: 3.0e-146; identity: 68.95%)
Query:  SRD--MMI-EVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSS-GEDHE-VKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSNKHLWEQISSKMR
        SRD  MMI +V SNGD  L P               HQ++LG+SS GEDHE +KAPKKRAETW QDETR+LISLRREMD LFNTSKSNKHLWEQIS KMR
Subjt:  SRD--MMI-EVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSS-GEDHE-VKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSNKHLWEQISSKMR

Query:  ERGFDRSPTMCTDKWRNLLKEFKKAKHHD---RGSGSAKMSYYKEIEEILKERSKS-AHYKSP---TP--PKIDSYMQFSDKGIEDNGLSFGPVEAGGRP
        E+GFDRSP+MCTDKWRN+LKEFKKAK H+      GS KMSYY EIE+I +ER K  A YKSP   TP   K+DS+MQF+DKG ED G+SF  VEA GRP
Subjt:  ERGFDRSPTMCTDKWRNLLKEFKKAKHHD---RGSGSAKMSYYKEIEEILKERSKS-AHYKSP---TP--PKIDSYMQFSDKGIEDNGLSFGPVEAGGRP

Query:  SLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLD
        +LNLE +LDHDG PL I AAD + A G+PPWNWR+ PGNG + Q F GR+I+VK+GDYTRR+GIDGT++AIKEAI+SAFRLRT+RAFWLEDE+QV+RSLD
Subjt:  SLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLD

Query:  RDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGV
        RDMPLGNY L +DEG+AV++C YDESD LPVH E+KIFY E+DYR+FLARRGWTCLREFD ++NIDNMD+L+ G +YRG+
Subjt:  RDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGV

Q9SDW0 Trihelix transcription factor GT-3a (e value: 5.6e-12; identity: 30.86%)
Query:  HHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNL
        HH   H H  QQQQ    P       D  G         +R   W  +ET+ L+++R E+D  F  +K NK LWE +++KM ++GF RS   C  KW+NL
Subjt:  HHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNL

Query:  LKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDS---YMQFSDKGIED
        +  +K  +  +  +   +  +Y EI+ I + R +   +   T P   S   + QFS    E+
Subjt:  LKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDS---YMQFSDKGIED

Arabidopsis top hits (e value, % identity, alignment)
AT1G13450.1 Homeodomain-like superfamily protein (e value: 1.5e-161; identity: 69.7%)
Query:  MYLPDKPRPIDIYKEE--------GSRDMMIEV--ASNGDHHLPPHQHQQQQQQ--QQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRR
        M++ DK RP D YK++         +RDMMI+V   +N    L  H H         Q+ PQ Q++LG+SSGEDHEVKAPKKRAETWVQDETRSLI  RR
Subjt:  MYLPDKPRPIDIYKEE--------GSRDMMIEV--ASNGDHHLPPHQHQQQQQQ--QQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRR

Query:  EMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSA----HYKSP-TPP---KIDSYM
         MDGLFNTSKSNKHLWEQISSKMRE+GFDRSPTMCTDKWRNLLKEFKKAKHHDRG+GSAKMSYYKEIE+IL+ERSK      + KSP TPP   K+DS+M
Subjt:  EMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSA----HYKSP-TPP---KIDSYM

Query:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAI-TAADAVAATGIPPWNWREAPGNGGES--QAFGGRVISVKWGDYTRRIGIDGTSDAIKEA
        QF+DKG +D  +SFG VEA GRP+LNLER+LDHDGHPLAI TA DAVAA G+ PWNWRE PGNG +S  Q FGGRVI+VK+GDYTRRIG+DG+++AIKE 
Subjt:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAI-TAADAVAATGIPPWNWREAPGNGGES--QAFGGRVISVKWGDYTRRIGIDGTSDAIKEA

Query:  IKSAFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPG
        I+SAF LRT+RAFWLEDEDQ++R LDRDMPLGNY L LD+GLA+++C YDES+ LPVH+E+KIFY E+DYREFLAR+GW+ L + DG+RNI+NMDDL+PG
Subjt:  IKSAFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPG

Query:  AIYRGV
        A+YRGV
Subjt:  AIYRGV

AT1G13450.2 Homeodomain-like superfamily protein (e value: 3.9e-133; identity: 61.04%)
Query:  MYLPDKPRPIDIYKEE--------GSRDMMIEV--ASNGDHHLPPHQHQQQQQQ--QQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRR
        M++ DK RP D YK++         +RDMMI+V   +N    L  H H         Q+ PQ Q++LG+SSGEDHEVKAPKKRAETWVQDETRSLI  RR
Subjt:  MYLPDKPRPIDIYKEE--------GSRDMMIEV--ASNGDHHLPPHQHQQQQQQ--QQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRR

Query:  EMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSA----HYKSP-TPP---KIDSYM
         MDGLFNTSKSNKHLWEQISSKMRE+GFDRSPTMCTDKWRNLLKEFKKAKHHDRG+GSAKMSYYKEIE+IL+ERSK      + KSP TPP   K+DS+M
Subjt:  EMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSA----HYKSP-TPP---KIDSYM

Query:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKS
        QF+DKG +D  +SFG VE                                          G+    Q FGGRVI+VK+GDYTRRIG+DG+++AIKE I+S
Subjt:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKS

Query:  AFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIY
        AF LRT+RAFWLEDEDQ++R LDRDMPLGNY L LD+GLA+++C YDES+ LPVH+E+KIFY E+DYREFLAR+GW+ L + DG+RNI+NMDDL+PGA+Y
Subjt:  AFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIY

Query:  RGV
        RGV
Subjt:  RGV

AT1G13450.3 Homeodomain-like superfamily protein (e value: 1.1e-95; identity: 68.28%)
Query:  MYLPDKPRPIDIYKEE--------GSRDMMIEV--ASNGDHHLPPHQHQQQQQQ--QQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRR
        M++ DK RP D YK++         +RDMMI+V   +N    L  H H         Q+ PQ Q++LG+SSGEDHEVKAPKKRAETWVQDETRSLI  RR
Subjt:  MYLPDKPRPIDIYKEE--------GSRDMMIEV--ASNGDHHLPPHQHQQQQQQ--QQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRR

Query:  EMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSA----HYKSP-TPP---KIDSYM
         MDGLFNTSKSNKHLWEQISSKMRE+GFDRSPTMCTDKWRNLLKEFKKAKHHDRG+GSAKMSYYKEIE+IL+ERSK      + KSP TPP   K+DS+M
Subjt:  EMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSA----HYKSP-TPP---KIDSYM

Query:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAI-TAADAVAATGIPPWNWREAPGNGGESQ
        QF+DKG +D  +SFG VEA GRP+LNLER+LDHDGHPLAI TA DAVAA G+ PWNWRE PGNG  S+
Subjt:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAI-TAADAVAATGIPPWNWREAPGNGGESQ

AT3G25990.1 Homeodomain-like superfamily protein (e value: 2.1e-147; identity: 68.95%)
Query:  SRD--MMI-EVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSS-GEDHE-VKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSNKHLWEQISSKMR
        SRD  MMI +V SNGD  L P               HQ++LG+SS GEDHE +KAPKKRAETW QDETR+LISLRREMD LFNTSKSNKHLWEQIS KMR
Subjt:  SRD--MMI-EVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSS-GEDHE-VKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSNKHLWEQISSKMR

Query:  ERGFDRSPTMCTDKWRNLLKEFKKAKHHD---RGSGSAKMSYYKEIEEILKERSKS-AHYKSP---TP--PKIDSYMQFSDKGIEDNGLSFGPVEAGGRP
        E+GFDRSP+MCTDKWRN+LKEFKKAK H+      GS KMSYY EIE+I +ER K  A YKSP   TP   K+DS+MQF+DKG ED G+SF  VEA GRP
Subjt:  ERGFDRSPTMCTDKWRNLLKEFKKAKHHD---RGSGSAKMSYYKEIEEILKERSKS-AHYKSP---TP--PKIDSYMQFSDKGIEDNGLSFGPVEAGGRP

Query:  SLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLD
        +LNLE +LDHDG PL I AAD + A G+PPWNWR+ PGNG + Q F GR+I+VK+GDYTRR+GIDGT++AIKEAI+SAFRLRT+RAFWLEDE+QV+RSLD
Subjt:  SLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLD

Query:  RDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGV
        RDMPLGNY L +DEG+AV++C YDESD LPVH E+KIFY E+DYR+FLARRGWTCLREFD ++NIDNMD+L+ G +YRG+
Subjt:  RDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGV

AT5G01380.1 Homeodomain-like superfamily protein (e value: 4.0e-13; identity: 30.86%)
Query:  HHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNL
        HH   H H  QQQQ    P       D  G         +R   W  +ET+ L+++R E+D  F  +K NK LWE +++KM ++GF RS   C  KW+NL
Subjt:  HHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSNKHLWEQISSKMRERGFDRSPTMCTDKWRNL

Query:  LKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDS---YMQFSDKGIED
        +  +K  +  +  +   +  +Y EI+ I + R +   +   T P   S   + QFS    E+
Subjt:  LKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDS---YMQFSDKGIED


Sequences
CDS sequence
ATGTACTTGCCTGATAAGCCTCGCCCAATCGATATTTACAAGGAAGAAGGTAGCAGAGATATGATGATCGAGGTCGCCTCTAATGGCGACCACCATCTTCCTCCTCACCA
GCATCAGCAGCAGCAGCAGCAGCAGCAGACTCATCCGCAGCACCAACTTATGCTCGGCGATAGCAGTGGGGAGGATCACGAAGTCAAGGCTCCGAAGAAACGGGCGGAGA
CTTGGGTTCAGGACGAGACTCGGAGCTTAATTTCCCTACGCCGGGAGATGGATGGCTTGTTCAATACCTCCAAATCCAACAAGCATTTGTGGGAGCAGATATCCTCCAAG
ATGAGGGAAAGGGGCTTCGATCGCTCCCCGACTATGTGTACTGATAAGTGGAGGAACTTGCTCAAGGAGTTCAAGAAGGCAAAGCACCATGACAGGGGAAGTGGCTCTGC
CAAGATGTCGTATTACAAGGAGATTGAAGAAATCTTGAAGGAGAGAAGCAAGAGTGCGCATTACAAGAGCCCCACACCACCCAAGATTGATTCATATATGCAATTCTCAG
ACAAAGGAATTGAGGATAATGGTCTATCGTTCGGACCTGTTGAAGCTGGTGGCAGACCATCGCTCAATCTTGAAAGACAGTTAGATCACGATGGACATCCCCTTGCCATC
ACAGCTGCTGATGCAGTTGCTGCGACGGGTATTCCACCATGGAATTGGAGAGAGGCACCTGGAAATGGTGGTGAGAGTCAGGCATTTGGCGGGAGAGTTATATCAGTCAA
GTGGGGGGATTACACAAGAAGAATCGGTATTGATGGCACCTCAGATGCCATCAAGGAGGCAATTAAATCTGCTTTTAGGTTAAGAACTAAACGGGCATTTTGGTTAGAGG
ATGAGGACCAGGTTGTCAGAAGTCTGGACCGGGACATGCCTTTAGGAAACTACACTCTTCACCTCGATGAAGGGTTGGCTGTTAAAATCTGCCTCTATGATGAATCTGAC
CACTTACCAGTACATACTGAAGATAAAATTTTTTACGTTGAAGATGATTACCGGGAGTTTTTAGCTCGTCGGGGCTGGACATGCCTACGGGAGTTTGATGGGTATAGAAA
CATCGATAATATGGATGATCTCCGTCCTGGTGCGATATATCGCGGAGTGAGTTGA
mRNA sequence
GAAATGTCTTGAATTATTATTGAACCAGTTGAAACATCATTTATGTTTAAATCCCCAATTTTCTTAAACTCAAGCTCAATGAGAGAGATTAGTAGCTTTAATTACCACTC
GACCCAATTAAACTAAATCCAAGTACAAAATTGGACCCAAAAAAAAAAAAGAAAAAACCATCTTAAATTCTAAACGAGTCTACAAATTTCAGAATTTTGCTATTTCTCAT
TTTTATCCGTACAATTATTCCCTGTTTCTCCGGCATCACACGCGACCTTCGTCTCTCTCTGTCTCTCTCTTCATTAGCTAAAGAAGAGAACCCTCAGAATTGCGGCCTGC
GGAAAAAGGACTCTGAACTTACAGTCTCAGGAAGAAAGAAAAAAGAGAAATGAAAATTTAAGCCACCATTTCTTATCTCGTACCCTCCCACAGGCCACAACGTGGGCTCA
CCCTTCGCTCTACGAACTCCCAGCTTTCTCGAAATTTCACGAATCCAGACATGTACTTGCCTGATAAGCCTCGCCCAATCGATATTTACAAGGAAGAAGGTAGCAGAGAT
ATGATGATCGAGGTCGCCTCTAATGGCGACCACCATCTTCCTCCTCACCAGCATCAGCAGCAGCAGCAGCAGCAGCAGACTCATCCGCAGCACCAACTTATGCTCGGCGA
TAGCAGTGGGGAGGATCACGAAGTCAAGGCTCCGAAGAAACGGGCGGAGACTTGGGTTCAGGACGAGACTCGGAGCTTAATTTCCCTACGCCGGGAGATGGATGGCTTGT
TCAATACCTCCAAATCCAACAAGCATTTGTGGGAGCAGATATCCTCCAAGATGAGGGAAAGGGGCTTCGATCGCTCCCCGACTATGTGTACTGATAAGTGGAGGAACTTG
CTCAAGGAGTTCAAGAAGGCAAAGCACCATGACAGGGGAAGTGGCTCTGCCAAGATGTCGTATTACAAGGAGATTGAAGAAATCTTGAAGGAGAGAAGCAAGAGTGCGCA
TTACAAGAGCCCCACACCACCCAAGATTGATTCATATATGCAATTCTCAGACAAAGGAATTGAGGATAATGGTCTATCGTTCGGACCTGTTGAAGCTGGTGGCAGACCAT
CGCTCAATCTTGAAAGACAGTTAGATCACGATGGACATCCCCTTGCCATCACAGCTGCTGATGCAGTTGCTGCGACGGGTATTCCACCATGGAATTGGAGAGAGGCACCT
GGAAATGGTGGTGAGAGTCAGGCATTTGGCGGGAGAGTTATATCAGTCAAGTGGGGGGATTACACAAGAAGAATCGGTATTGATGGCACCTCAGATGCCATCAAGGAGGC
AATTAAATCTGCTTTTAGGTTAAGAACTAAACGGGCATTTTGGTTAGAGGATGAGGACCAGGTTGTCAGAAGTCTGGACCGGGACATGCCTTTAGGAAACTACACTCTTC
ACCTCGATGAAGGGTTGGCTGTTAAAATCTGCCTCTATGATGAATCTGACCACTTACCAGTACATACTGAAGATAAAATTTTTTACGTTGAAGATGATTACCGGGAGTTT
TTAGCTCGTCGGGGCTGGACATGCCTACGGGAGTTTGATGGGTATAGAAACATCGATAATATGGATGATCTCCGTCCTGGTGCGATATATCGCGGAGTGAGTTGAGCAAG
TACACAGTTATGCCAATATGTTATGAATTTTGGCTTTAGTGCCCTTTGTGGGACTTGGGTTTCAGCACTCCATCTTTACTTGTAATCCATACACTAAATCAACCATGTAA
ACAAATATGGCCTTTATGCACAGACACTGCTTGTGCATTTTTTGACAGGTTATCAGTTATCTTCTCTCTGTATATATAATATGGATTGTTGGTAGTATTGTCTCATCTGT
ACAGTGCCTATGCATGTTTTCAAAGATGTCGATAAATATAAACATGTGAAAAATGATACCATTACATTACTTGTTTATTTTTCCTTAAACCAAGCCGATGATTACTGCTC
GTCGTCGTTGCCTTTCAAGTGAACCAACAGTTTCTTAATTCCTGACTAGCATTTGGGGAATCTTGATACACTCAACATCCAGATGTTTGGAAATTTGGATGCTCAGTGAT
TCTCAGACGAAAGAAGTTTGGGGAAATGTTGACGTTCGGGTCCGGTATCCCTCTTTAAAACTAGAACACATGATTCCATAGTTGTAGAAAAAGATGAGTTAATATCTCAG
GTATGCTATGTGGGTGGTGCTACACTCGTCTTTTCCAGCCACAACAGAACAAAAAAACTTCCATTTGTTCAATAGCACTTCATTATTTGAAACAACTCGAGTTATGCATC
GACAACAAATTCAGCAGGCAATCAACAGCTAATCTGAATGGTCAGTAGCAGAATTGGATTTAGTTTGGATCTGAGAAGAATTTGTTGATGGAAGGCATAATCTGAGGGGA
TCGTGTTCTGAGGAACTTCCTGAGGCCATGTTCAGCATACTTTTCACCTTCTTCAAAGTTGTCAAGCACTCCTGCAGCTCTCTTCTCCTTGATAACCCTGCCATATTCGT
CCACAACAATCAGAACAATTCTGATTTCTGAACAGGAAATATCCCTCTTTTTACTGATCAAAATATTAACATTTCGATGATATTTACCTGACTTCAGGGTTGGTGGTTAT
GTTTCTAACAAGCTGCATCCCACAGATGCCAACAGCCACACCAACTGCAGCAAAGAGAGGATACACCTGCACATAATATTCAACAAAAGCAAGATTTTGATGAGATAAGT
TATTGTTTTCTTCATGTTTTTTTTTTCTCAGCAGCCAAACAGAAAGTAAAGGAAAAGGAAAGGGAGAAGAGAAAAAAACCAAACCTCAGGCTTTAACCATCTGTTGGTGG
CGGCCATGGCAGCAGAGCAGAGAGAAGAGAATGAGTCTTCTCGTGGAGAAATATGAGCAGCGGGTGAGGCATGTTTTTTGATTGTGTGAATTTATAAGGGAAGGGAAGGG
AAATGCAAAGAACCAAGAGGGGTCAGAAATTAGGGGCAGCCAATAGCCGTCTAAGTTTGCCAACTTATTTCCTTCCTTATTTGGCGTCTCTATGCCTAATATATCTTTTG
ACTTTTCTGCTGCCTTTCATTCACACCATTTACACCCTTTTTTTAAATCTAAATTAATACCCTTTTTCTAGGTACAAGCATGGTAGATTATAAAGAGATTTGTAGTGGTA
CCTAAGATCCAAAAAAACTTCGAGCATGAGTGGTTAAAGACACCCTACTCCCTCTTCGC
Protein sequence
MYLPDKPRPIDIYKEEGSRDMMIEVASNGDHHLPPHQHQQQQQQQQTHPQHQLMLGDSSGEDHEVKAPKKRAETWVQDETRSLISLRREMDGLFNTSKSNKHLWEQISSK
MRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKSAHYKSPTPPKIDSYMQFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAI
TAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSDAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESD
HLPVHTEDKIFYVEDDYREFLARRGWTCLREFDGYRNIDNMDDLRPGAIYRGVS