; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Tan0009013 (gene) of Snake gourd v1 genome

Gene IDTan0009013
OrganismTrichosanthes anguina (Snake gourd v1)
Descriptiontrihelix transcription factor GT-1-like
Genome locationLG02:91016807..91020818
RNA-Seq ExpressionTan0009013
SyntenyTan0009013
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6585203.1 Trihelix transcription factor GT-1, partial [Cucurbita argyrosperma subsp. sororia]9.8e-21696.03Show/hide
Query:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ
        MYLSDKPRPIDIYKEEGSRDMMIEVASNG+HHIQPHQ Q THQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLI LRR MDGLFNTSKSNKHLWEQ
Subjt:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ

Query:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN
        IS KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDN LSFGPVEAGGRPSLN
Subjt:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN

Query:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
        LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGE QAFGGRVI+VKWGDYTRRIG+DGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
Subjt:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM

Query:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTED+YR+FL RRAWTCLREFDGYRNIDNMDDLRPGA+YRG S
Subjt:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS

KAG6598266.1 Trihelix transcription factor GT-1, partial [Cucurbita argyrosperma subsp. sororia]3.2e-21495.77Show/hide
Query:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ
        MYLSDKPRPIDIYKEEGSRDMMIEVASNG+HHIQPHQ QPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLI LRREMDGLFNTSKSNKHLWEQ
Subjt:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ

Query:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN
        IS KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSK+ QYKSPTPPKIDSYIQFSDKGIEDNGL+FGPVE GGRPSLN
Subjt:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN

Query:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
        LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGN GESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQV+RSLDRDM
Subjt:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM

Query:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFY E+DYRDFL RR WTCLREFDGYRNID MDDLRPGAIYRG+S
Subjt:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS

XP_022951166.1 trihelix transcription factor GT-1-like [Cucurbita moschata]2.6e-21696.3Show/hide
Query:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ
        MYLSDKPRPIDIYKEEGSRDMMIEVASNG+HHIQPHQ Q THQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLI LRR MDGLFNTSKSNKHLWEQ
Subjt:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ

Query:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN
        IS KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDN LSFGPVEAGGRPSLN
Subjt:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN

Query:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
        LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGE QAFGGRVI+VKWGDYTRRIG+DGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
Subjt:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM

Query:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTED+YR+FL RRAWTCLREFDGYRNIDNMDDLRPGA+YRGVS
Subjt:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS

XP_023538198.1 trihelix transcription factor GT-1-like [Cucurbita pepo subsp. pepo]9.8e-21696.03Show/hide
Query:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ
        MYLSDKPRPIDIYKEEGSRDMMIEVA NG+HHIQPHQ Q THQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLI LRR MDGLFNTSKSNKHLWEQ
Subjt:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ

Query:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN
        IS KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDN LSFGPVEAGGRPSLN
Subjt:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN

Query:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
        LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGE QAFGGRVI+VKWGDYTRRIG+DGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
Subjt:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM

Query:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTED+YR+FL RRAWTCLREFDGYRNIDNMDDLRPGA+YRGVS
Subjt:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS

XP_038885460.1 trihelix transcription factor GT-1-like isoform X3 [Benincasa hispida]2.4e-21494.76Show/hide
Query:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQP----THQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKH
        MYLSDKPRPIDIYKEEGSRDMMIEVASNG+HHIQPHQQQP    THQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLI LRREMDGLFNTSKSNKH
Subjt:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQP----THQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKH

Query:  LWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGR
        LWEQIS KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMS YKEIEEILKERSK+ QYKSPTPPKIDSY+QFSDKGIEDNGLSFGPVEAGGR
Subjt:  LWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGR

Query:  PSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSL
        PSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQ FGGRVISVKWGDYTRRIG+DGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSL
Subjt:  PSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSL

Query:  DRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        DRDMPLGNYTLHLDEG+AVKICLYDESDHLPVHTE+K+FY E+DYRDFL RR WTCLREFDGYRNIDNMDDLRPGAIYRGVS
Subjt:  DRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS

TrEMBL top hitse value%identityAlignment
A0A6J1BQB9 trihelix transcription factor GT-1-like isoform X18.4e-21393.75Show/hide
Query:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPH------QQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSN
        MYL DKPRPIDIYKEEGSRDMMIEVASNG+HH+ PH      QQQ TH QHQ+MLGDSSGEDHEVKAPKKRAETWVQDETRSLI LRREMDGLFNTSKSN
Subjt:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPH------QQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSN

Query:  KHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAG
        KHLWEQIS+KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSK+A YKSPTPPKIDSY+QFSDKGIEDNGLSFGPVEAG
Subjt:  KHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAG

Query:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVR
        GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTS+AIKEAIKSAFRLRTKRAFWLEDEDQVVR
Subjt:  GRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVR

Query:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTE+KIFY EDDYR+FL RR WTCLREFDGYRNIDNMDDLRPGAIYRGVS
Subjt:  SLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS

A0A6J1GGT5 trihelix transcription factor GT-1-like1.2e-21696.3Show/hide
Query:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ
        MYLSDKPRPIDIYKEEGSRDMMIEVASNG+HHIQPHQ Q THQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLI LRR MDGLFNTSKSNKHLWEQ
Subjt:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ

Query:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN
        IS KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDN LSFGPVEAGGRPSLN
Subjt:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN

Query:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
        LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGE QAFGGRVI+VKWGDYTRRIG+DGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
Subjt:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM

Query:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTED+YR+FL RRAWTCLREFDGYRNIDNMDDLRPGA+YRGVS
Subjt:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS

A0A6J1HBR7 trihelix transcription factor GT-1-like isoform X13.4e-21495.5Show/hide
Query:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ
        MYLSDKPRPIDIYKEEGSRDMMIEVASNG+HHIQPHQ QPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLI LRREMDGLFNTSKSNKHLWEQ
Subjt:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ

Query:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN
        IS KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSK+ QYKSPTPPKIDSYIQF+DKGIEDNGL+FGPVE GGRPSLN
Subjt:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN

Query:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
        LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGN GESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQV+RSLDRDM
Subjt:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM

Query:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFY E+DYRDFL RR WTCLREFDGYRNID MDDLRPGAIYRG+S
Subjt:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS

A0A6J1K3D5 trihelix transcription factor GT-1-like isoform X12.6e-21495.77Show/hide
Query:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ
        MYLSDKPRPIDIYKEEGSRDMMIEVASNG+HHIQPHQ QPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLI LRREMDGLFNTSKSNKHLWEQ
Subjt:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ

Query:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN
        IS KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSG+AKMSYYKEIEEILKERSK+ QYKSPTPPKIDSYIQFSDKGIEDNGL+FGPVE GGRPSLN
Subjt:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN

Query:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
        LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGN GESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
Subjt:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM

Query:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFY E+DYRDFL RR WTCLREFDGYRNID MDDLRPGAIYRG+S
Subjt:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS

A0A6J1KR52 trihelix transcription factor GT-1-like1.2e-21696.3Show/hide
Query:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ
        MYLSDKPRPIDIYKEEGSRDMMIEVASNG+HHIQPHQ Q THQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLI LRR MDGLFNTSKSNKHLWEQ
Subjt:  MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQ

Query:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN
        IS KMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDN LSFGPVEAGGRPSLN
Subjt:  ISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLN

Query:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
        LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGE QAFGGRVI+VKWGDYTRRIG+DGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM
Subjt:  LERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDM

Query:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS
        PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTED+YR+FL RRAWTCLREFDGYRNIDNMDDLRPGA+YRGVS
Subjt:  PLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS

SwissProt top hitse value%identityAlignment
O80450 Trihelix transcription factor GT-3b3.2e-1233.33Show/hide
Query:  NGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAP---KKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNL
        +G  H   HQ Q  ++ H      +  +  E+ +P     R   W  +ET+ LIG+R E+D  F  +K NK LWE IS KMR++ F RSP  C  KW+NL
Subjt:  NGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAP---KKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNL

Query:  LKEFKKAKHHDRGSGSAKMSYYKEIEEILKER
        +  FK  +  +  +   +  +Y +++ I   R
Subjt:  LKEFKKAKHHDRGSGSAKMSYYKEIEEILKER

Q9C6K3 Trihelix transcription factor DF15.2e-1031.82Show/hide
Query:  DSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHD--RGSGSAKMSYYKE
        D+ G+ +   A    +  W + E  +LI LR  +D  +  +     LWE+IS  MR  GF+R+   C +KW N+ K FKK K  +  R   S    Y+ +
Subjt:  DSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHD--RGSGSAKMSYYKE

Query:  IEEILKERSK
        ++ + +ER+K
Subjt:  IEEILKERSK

Q9FX53 Trihelix transcription factor GT-16.1e-16069.7Show/hide
Query:  MYLSDKPRPIDIYKEE--------GSRDMMIEVASN-------GEHHIQPHQQQPTHQ---QHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRR
        M++SDK RP D YK++         +RDMMI+V +          HH   H     HQ   Q Q++LG+SSGEDHEVKAPKKRAETWVQDETRSLI  RR
Subjt:  MYLSDKPRPIDIYKEE--------GSRDMMIEVASN-------GEHHIQPHQQQPTHQ---QHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRR

Query:  EMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNA---QY-KSP-TPP---KIDSYI
         MDGLFNTSKSNKHLWEQIS+KMRE+GFDRSPTMCTDKWRNLLKEFKKAKHHDRG+GSAKMSYYKEIE+IL+ERSK     QY KSP TPP   K+DS++
Subjt:  EMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNA---QY-KSP-TPP---KIDSYI

Query:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAI-TAADAVAATGIPPWNWREAPGNGGES--QAFGGRVISVKWGDYTRRIGIDGTSEAIKEA
        QF+DKG +D  +SFG VEA GRP+LNLER+LDHDGHPLAI TA DAVAA G+ PWNWRE PGNG +S  Q FGGRVI+VK+GDYTRRIG+DG++EAIKE 
Subjt:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAI-TAADAVAATGIPPWNWREAPGNGGES--QAFGGRVISVKWGDYTRRIGIDGTSEAIKEA

Query:  IKSAFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPG
        I+SAF LRT+RAFWLEDEDQ++R LDRDMPLGNY L LD+GLA+++C YDES+ LPVH+EEKIFYTE+DYR+FL R+ W+ L + DG+RNI+NMDDL+PG
Subjt:  IKSAFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPG

Query:  AIYRGV
        A+YRGV
Subjt:  AIYRGV

Q9LU92 Trihelix transcription factor GT-45.9e-14767.77Show/hide
Query:  MYLSDKPRPIDIYKEEGSRD--MMI-EVASNGEHHIQPHQQQPTHQQHQMMLGDSS-GEDHE-VKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNK
        M++SD   P        SRD  MMI +V SNG+  +QP         HQ++LG+SS GEDHE +KAPKKRAETW QDETR+LI LRREMD LFNTSKSNK
Subjt:  MYLSDKPRPIDIYKEEGSRD--MMI-EVASNGEHHIQPHQQQPTHQQHQMMLGDSS-GEDHE-VKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNK

Query:  HLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHD---RGSGSAKMSYYKEIEEILKERSKN-AQYKSP---TP--PKIDSYIQFSDKGIEDNGL
        HLWEQIS KMRE+GFDRSP+MCTDKWRN+LKEFKKAK H+      GS KMSYY EIE+I +ER K  A YKSP   TP   K+DS++QF+DKG ED G+
Subjt:  HLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHD---RGSGSAKMSYYKEIEEILKERSKN-AQYKSP---TP--PKIDSYIQFSDKGIEDNGL

Query:  SFGPVEAGGRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWL
        SF  VEA GRP+LNLE +LDHDG PL I AAD + A G+PPWNWR+ PGNG + Q F GR+I+VK+GDYTRR+GIDGT+EAIKEAI+SAFRLRT+RAFWL
Subjt:  SFGPVEAGGRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWL

Query:  EDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGV
        EDE+QV+RSLDRDMPLGNY L +DEG+AV++C YDESD LPVH EEKIFYTE+DYRDFL RR WTCLREFD ++NIDNMD+L+ G +YRG+
Subjt:  EDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGV

Q9SDW0 Trihelix transcription factor GT-3a7.2e-1229.49Show/hide
Query:  HHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKK
        HH+   QQ P          D  G         +R   W  +ET+ L+ +R E+D  F  +K NK LWE ++ KM ++GF RS   C  KW+NL+  +K 
Subjt:  HHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKK

Query:  AKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDS---YIQFSDKGIED
         +  +  +   +  +Y EI+ I + R +   +   T P   S   + QFS    E+
Subjt:  AKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDS---YIQFSDKGIED

Arabidopsis top hitse value%identityAlignment
AT1G13450.1 Homeodomain-like superfamily protein4.3e-16169.7Show/hide
Query:  MYLSDKPRPIDIYKEE--------GSRDMMIEVASN-------GEHHIQPHQQQPTHQ---QHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRR
        M++SDK RP D YK++         +RDMMI+V +          HH   H     HQ   Q Q++LG+SSGEDHEVKAPKKRAETWVQDETRSLI  RR
Subjt:  MYLSDKPRPIDIYKEE--------GSRDMMIEVASN-------GEHHIQPHQQQPTHQ---QHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRR

Query:  EMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNA---QY-KSP-TPP---KIDSYI
         MDGLFNTSKSNKHLWEQIS+KMRE+GFDRSPTMCTDKWRNLLKEFKKAKHHDRG+GSAKMSYYKEIE+IL+ERSK     QY KSP TPP   K+DS++
Subjt:  EMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNA---QY-KSP-TPP---KIDSYI

Query:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAI-TAADAVAATGIPPWNWREAPGNGGES--QAFGGRVISVKWGDYTRRIGIDGTSEAIKEA
        QF+DKG +D  +SFG VEA GRP+LNLER+LDHDGHPLAI TA DAVAA G+ PWNWRE PGNG +S  Q FGGRVI+VK+GDYTRRIG+DG++EAIKE 
Subjt:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAI-TAADAVAATGIPPWNWREAPGNGGES--QAFGGRVISVKWGDYTRRIGIDGTSEAIKEA

Query:  IKSAFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPG
        I+SAF LRT+RAFWLEDEDQ++R LDRDMPLGNY L LD+GLA+++C YDES+ LPVH+EEKIFYTE+DYR+FL R+ W+ L + DG+RNI+NMDDL+PG
Subjt:  IKSAFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPG

Query:  AIYRGV
        A+YRGV
Subjt:  AIYRGV

AT1G13450.2 Homeodomain-like superfamily protein8.5e-13361.04Show/hide
Query:  MYLSDKPRPIDIYKEE--------GSRDMMIEVASN-------GEHHIQPHQQQPTHQ---QHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRR
        M++SDK RP D YK++         +RDMMI+V +          HH   H     HQ   Q Q++LG+SSGEDHEVKAPKKRAETWVQDETRSLI  RR
Subjt:  MYLSDKPRPIDIYKEE--------GSRDMMIEVASN-------GEHHIQPHQQQPTHQ---QHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRR

Query:  EMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNA---QY-KSP-TPP---KIDSYI
         MDGLFNTSKSNKHLWEQIS+KMRE+GFDRSPTMCTDKWRNLLKEFKKAKHHDRG+GSAKMSYYKEIE+IL+ERSK     QY KSP TPP   K+DS++
Subjt:  EMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNA---QY-KSP-TPP---KIDSYI

Query:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKS
        QF+DKG +D  +SFG VE                                          G+    Q FGGRVI+VK+GDYTRRIG+DG++EAIKE I+S
Subjt:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKS

Query:  AFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIY
        AF LRT+RAFWLEDEDQ++R LDRDMPLGNY L LD+GLA+++C YDES+ LPVH+EEKIFYTE+DYR+FL R+ W+ L + DG+RNI+NMDDL+PGA+Y
Subjt:  AFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIY

Query:  RGV
        RGV
Subjt:  RGV

AT1G13450.3 Homeodomain-like superfamily protein8.3e-9668.28Show/hide
Query:  MYLSDKPRPIDIYKEE--------GSRDMMIEVASN-------GEHHIQPHQQQPTHQ---QHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRR
        M++SDK RP D YK++         +RDMMI+V +          HH   H     HQ   Q Q++LG+SSGEDHEVKAPKKRAETWVQDETRSLI  RR
Subjt:  MYLSDKPRPIDIYKEE--------GSRDMMIEVASN-------GEHHIQPHQQQPTHQ---QHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRR

Query:  EMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNA---QY-KSP-TPP---KIDSYI
         MDGLFNTSKSNKHLWEQIS+KMRE+GFDRSPTMCTDKWRNLLKEFKKAKHHDRG+GSAKMSYYKEIE+IL+ERSK     QY KSP TPP   K+DS++
Subjt:  EMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNA---QY-KSP-TPP---KIDSYI

Query:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAI-TAADAVAATGIPPWNWREAPGNGGESQ
        QF+DKG +D  +SFG VEA GRP+LNLER+LDHDGHPLAI TA DAVAA G+ PWNWRE PGNG  S+
Subjt:  QFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAI-TAADAVAATGIPPWNWREAPGNGGESQ

AT2G38250.1 Homeodomain-like superfamily protein2.3e-1333.33Show/hide
Query:  NGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAP---KKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNL
        +G  H   HQ Q  ++ H      +  +  E+ +P     R   W  +ET+ LIG+R E+D  F  +K NK LWE IS KMR++ F RSP  C  KW+NL
Subjt:  NGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAP---KKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQISTKMRERGFDRSPTMCTDKWRNL

Query:  LKEFKKAKHHDRGSGSAKMSYYKEIEEILKER
        +  FK  +  +  +   +  +Y +++ I   R
Subjt:  LKEFKKAKHHDRGSGSAKMSYYKEIEEILKER

AT3G25990.1 Homeodomain-like superfamily protein4.2e-14867.77Show/hide
Query:  MYLSDKPRPIDIYKEEGSRD--MMI-EVASNGEHHIQPHQQQPTHQQHQMMLGDSS-GEDHE-VKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNK
        M++SD   P        SRD  MMI +V SNG+  +QP         HQ++LG+SS GEDHE +KAPKKRAETW QDETR+LI LRREMD LFNTSKSNK
Subjt:  MYLSDKPRPIDIYKEEGSRD--MMI-EVASNGEHHIQPHQQQPTHQQHQMMLGDSS-GEDHE-VKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNK

Query:  HLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHD---RGSGSAKMSYYKEIEEILKERSKN-AQYKSP---TP--PKIDSYIQFSDKGIEDNGL
        HLWEQIS KMRE+GFDRSP+MCTDKWRN+LKEFKKAK H+      GS KMSYY EIE+I +ER K  A YKSP   TP   K+DS++QF+DKG ED G+
Subjt:  HLWEQISTKMRERGFDRSPTMCTDKWRNLLKEFKKAKHHD---RGSGSAKMSYYKEIEEILKERSKN-AQYKSP---TP--PKIDSYIQFSDKGIEDNGL

Query:  SFGPVEAGGRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWL
        SF  VEA GRP+LNLE +LDHDG PL I AAD + A G+PPWNWR+ PGNG + Q F GR+I+VK+GDYTRR+GIDGT+EAIKEAI+SAFRLRT+RAFWL
Subjt:  SFGPVEAGGRPSLNLERQLDHDGHPLAITAADAVAATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWL

Query:  EDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGV
        EDE+QV+RSLDRDMPLGNY L +DEG+AV++C YDESD LPVH EEKIFYTE+DYRDFL RR WTCLREFD ++NIDNMD+L+ G +YRG+
Subjt:  EDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHTEEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACTTGTCTGATAAGCCTCGCCCGATTGATATCTACAAGGAAGAAGGTAGCAGAGATATGATGATCGAGGTGGCCTCTAATGGGGAACATCATATTCAACCTCATCA
GCAACAACCGACCCATCAGCAGCACCAAATGATGCTCGGGGATAGCAGTGGTGAGGATCATGAAGTCAAGGCGCCGAAGAAGCGGGCGGAGACTTGGGTTCAGGACGAGA
CTCGGAGCTTAATTGGCCTAAGAAGGGAGATGGATGGATTGTTTAATACCTCGAAATCCAACAAGCATTTGTGGGAGCAGATATCCACTAAGATGAGGGAAAGGGGATTT
GATCGCTCTCCGACTATGTGTACTGATAAATGGAGGAACTTGCTCAAGGAGTTCAAGAAGGCTAAGCATCATGACAGGGGAAGTGGCTCTGCTAAGATGTCGTATTACAA
GGAGATTGAAGAAATTTTGAAGGAGAGAAGCAAAAATGCGCAGTACAAGAGCCCCACGCCACCCAAGATTGACTCCTACATACAATTCTCAGACAAAGGAATTGAGGATA
ATGGTCTATCGTTCGGACCTGTCGAAGCTGGTGGCAGGCCCTCACTCAATCTTGAAAGACAGCTAGATCACGATGGACATCCCCTTGCCATCACAGCAGCTGATGCAGTT
GCTGCAACGGGCATTCCGCCTTGGAATTGGAGAGAGGCACCCGGAAATGGTGGGGAGAGTCAGGCATTTGGCGGGAGAGTTATATCAGTCAAGTGGGGAGACTACACAAG
AAGAATCGGTATTGATGGCACCTCAGAAGCCATCAAGGAGGCTATCAAGTCTGCATTTAGGTTGAGAACTAAACGTGCATTTTGGTTAGAGGATGAGGACCAGGTTGTCA
GAAGTCTGGACAGGGACATGCCTTTAGGAAACTACACTCTTCACCTTGATGAAGGATTGGCTGTTAAAATCTGCCTCTATGATGAATCTGATCACTTACCAGTACATACT
GAAGAAAAAATTTTTTACACTGAAGATGATTACCGAGATTTTTTAGTTCGTCGGGCCTGGACATGCCTTCGGGAGTTTGATGGGTATAGAAACATCGATAATATGGATGA
TCTACGTCCTGGTGCAATATACCGCGGTGTAAGTTGA
mRNA sequenceShow/hide mRNA sequence
ATGTACTTGTCTGATAAGCCTCGCCCGATTGATATCTACAAGGAAGAAGGTAGCAGAGATATGATGATCGAGGTGGCCTCTAATGGGGAACATCATATTCAACCTCATCA
GCAACAACCGACCCATCAGCAGCACCAAATGATGCTCGGGGATAGCAGTGGTGAGGATCATGAAGTCAAGGCGCCGAAGAAGCGGGCGGAGACTTGGGTTCAGGACGAGA
CTCGGAGCTTAATTGGCCTAAGAAGGGAGATGGATGGATTGTTTAATACCTCGAAATCCAACAAGCATTTGTGGGAGCAGATATCCACTAAGATGAGGGAAAGGGGATTT
GATCGCTCTCCGACTATGTGTACTGATAAATGGAGGAACTTGCTCAAGGAGTTCAAGAAGGCTAAGCATCATGACAGGGGAAGTGGCTCTGCTAAGATGTCGTATTACAA
GGAGATTGAAGAAATTTTGAAGGAGAGAAGCAAAAATGCGCAGTACAAGAGCCCCACGCCACCCAAGATTGACTCCTACATACAATTCTCAGACAAAGGAATTGAGGATA
ATGGTCTATCGTTCGGACCTGTCGAAGCTGGTGGCAGGCCCTCACTCAATCTTGAAAGACAGCTAGATCACGATGGACATCCCCTTGCCATCACAGCAGCTGATGCAGTT
GCTGCAACGGGCATTCCGCCTTGGAATTGGAGAGAGGCACCCGGAAATGGTGGGGAGAGTCAGGCATTTGGCGGGAGAGTTATATCAGTCAAGTGGGGAGACTACACAAG
AAGAATCGGTATTGATGGCACCTCAGAAGCCATCAAGGAGGCTATCAAGTCTGCATTTAGGTTGAGAACTAAACGTGCATTTTGGTTAGAGGATGAGGACCAGGTTGTCA
GAAGTCTGGACAGGGACATGCCTTTAGGAAACTACACTCTTCACCTTGATGAAGGATTGGCTGTTAAAATCTGCCTCTATGATGAATCTGATCACTTACCAGTACATACT
GAAGAAAAAATTTTTTACACTGAAGATGATTACCGAGATTTTTTAGTTCGTCGGGCCTGGACATGCCTTCGGGAGTTTGATGGGTATAGAAACATCGATAATATGGATGA
TCTACGTCCTGGTGCAATATACCGCGGTGTAAGTTGA
Protein sequenceShow/hide protein sequence
MYLSDKPRPIDIYKEEGSRDMMIEVASNGEHHIQPHQQQPTHQQHQMMLGDSSGEDHEVKAPKKRAETWVQDETRSLIGLRREMDGLFNTSKSNKHLWEQISTKMRERGF
DRSPTMCTDKWRNLLKEFKKAKHHDRGSGSAKMSYYKEIEEILKERSKNAQYKSPTPPKIDSYIQFSDKGIEDNGLSFGPVEAGGRPSLNLERQLDHDGHPLAITAADAV
AATGIPPWNWREAPGNGGESQAFGGRVISVKWGDYTRRIGIDGTSEAIKEAIKSAFRLRTKRAFWLEDEDQVVRSLDRDMPLGNYTLHLDEGLAVKICLYDESDHLPVHT
EEKIFYTEDDYRDFLVRRAWTCLREFDGYRNIDNMDDLRPGAIYRGVS