; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0007773 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0007773
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Descriptiontrihelix transcription factor GT-2-like
Genome locationchr9:4524302..4526070
RNA-Seq ExpressionLag0007773
SyntenyLag0007773
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040781.1 trihelix transcription factor GT-2-like [Cucumis melo var. makuwa]2.9e-22885.3Show/hide
Query:  MLEISPSPENSSAAATAAA-----EDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVS-----------------RK
        MLEISPSPENSSAAA  AA     ++D AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVS                 RK
Subjt:  MLEISPSPENSSAAATAAA-----EDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVS-----------------RK

Query:  LAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTS
        L ELGYNR+AKKCKEKFENIYKYHKRTKD RSGK NGKNYRYFEQL+ALDNHPLLPSQA SMEEIP+IIPNN+VHNAIPCSVVNPG+NFVETTTTS+STS
Subjt:  LAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTS

Query:  TTSCSSKESGGTRKKKRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVG
        TTSCSSKESGGTRKKKRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+ SEQ G
Subjt:  TTSCSSKESGGTRKKKRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVG

Query:  AVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWEN
         VQFPE+ +LMENL EKQDD N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ++GPKGPLWEEISLAMKK GYDR+AKRCKEKWEN
Subjt:  AVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWEN

Query:  INKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDED
        INKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKV++NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE E EDEDED
Subjt:  INKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDED

Query:  YQIVANNNSN---QMEV
        Y+IVAN+N+N   QM+V
Subjt:  YQIVANNNSN---QMEV

XP_004147355.2 trihelix transcription factor GT-2 [Cucumis sativus]5.3e-23088.18Show/hide
Query:  MLEISPSPENSSAA---ATAAAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFEN
        MLEISPSPENSSAA   A    +++ AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSRKL ELGYNR+AKKCKEKFEN
Subjt:  MLEISPSPENSSAA---ATAAAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFEN

Query:  IYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKRKF
        IYKYHKRTKD RSGK NGKNYRYFEQL+ALDNH LLPSQA SMEEIPRIIPNN+VHNAIPCSVVNPG+NFVETTTTS+STSTTS SSKESGGTRKKKRKF
Subjt:  IYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKRKF

Query:  VEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQD
        VEFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+FSEQ G VQFPE+ +LMENL EKQD
Subjt:  VEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQD

Query:  DGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDS
        D N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ+NGPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPEDS
Subjt:  DGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDS

Query:  KTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDEDYQIVA----NNNSNQMEV
        KTCPYFQQLDALYKQKSKKV++NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE E EDEDEDY+IVA    NNN+NQM+V
Subjt:  KTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDEDYQIVA----NNNSNQMEV

XP_008460913.1 PREDICTED: trihelix transcription factor GT-2-like [Cucumis melo]1.7e-23188.2Show/hide
Query:  MLEISPSPENSSAAATAAA-----EDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKF
        MLEISPSPENSSAAA  AA     ++D AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSRKL ELGYNR+AKKCKEKF
Subjt:  MLEISPSPENSSAAATAAA-----EDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKF

Query:  ENIYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKR
        ENIYKYHKRTKD RSGK NGKNYRYFEQL+ALDNHPLLPSQA SMEEIP+IIPNN+VHNAIPCSVVNPG+NFVETTTTS+STSTTSCSSKESGGTRKKKR
Subjt:  ENIYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKR

Query:  KFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEK
        KFVEFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+ SEQ G VQFPE+ +LMENL EK
Subjt:  KFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEK

Query:  QDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPE
        QDD N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ++GPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPE
Subjt:  QDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPE

Query:  DSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDEDYQIVANNNSN---QMEV
        DSKTCPYFQQLDALYKQKSKKV++NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE E EDEDEDY+IVAN+N+N   QM+V
Subjt:  DSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDEDYQIVANNNSN---QMEV

XP_022159187.1 trihelix transcription factor GTL1-like [Momordica charantia]3.8e-23690.98Show/hide
Query:  MLEISPSPENSSAAATAAAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYK
        MLE S SPENS+    AAAE+D A  S GLAEEADRSWAGNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSR LAELGYNRSAKKCKEKFENIYK
Subjt:  MLEISPSPENSSAAATAAAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYK

Query:  YHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKRKFVEF
        YHKRTKDSRSGK NGKNYRYFEQL+ALDNHPLLPSQA SMEE+PRIIPNNIVHNAIPCSVVNPGSNFV+TTTTSISTS TSCSSKESGGT KKKRKFVEF
Subjt:  YHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKRKFVEF

Query:  FERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGN
        FERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLN ERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILME+ M+KQDDGN
Subjt:  FERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGN

Query:  ADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTC
         DRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPEDSKTC
Subjt:  ADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTC

Query:  PYFQQLDALYKQKSKKVV----DNPSNPNYELKPEELLMHMMGGQEESH-QPESATDDGEAENADQNQEDE-GEDEDEDEDEDYQIVANNNSNQMEVAS
        PYFQQLDALYK+KSKKV     +N +NPNYELKPEELLMHMMGGQEE H QPESATDDGE ENADQNQEDE  EDE+E+EDEDYQIVANNNSNQM VAS
Subjt:  PYFQQLDALYKQKSKKVV----DNPSNPNYELKPEELLMHMMGGQEESH-QPESATDDGEAENADQNQEDE-GEDEDEDEDEDYQIVANNNSNQMEVAS

XP_038901714.1 trihelix transcription factor GT-2-like [Benincasa hispida]6.1e-23489.09Show/hide
Query:  MLEISPSPENSSAAATA----AAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFE
        MLEISPSPENS+  A A    AAE+DGAA SAGL+EE DR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNR+AKKCKEKFE
Subjt:  MLEISPSPENSSAAATA----AAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFE

Query:  NIYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKRK
        NIYKYHKRTKD RSGK NGKNYRYFEQL+A DNHPLLPSQA SMEEIPRIIPNN+VHNAIPCSVV PG+NFVETTTTSISTSTTSCSSKESGGTRKKKRK
Subjt:  NIYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKRK

Query:  FVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQ
        FVEFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+FSEQVG VQFPE+ ILMENL EKQ
Subjt:  FVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQ

Query:  DDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPED
        DDGN DRNTS +EN NNGNS+QISSSRWPKEEIDALIQLRT+LQMKYQ+NGPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRV+ESNKKRPED
Subjt:  DDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPED

Query:  SKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENADQNQEDEGEDEDEDEDEDYQIVANNNSNQMEV
        SKTCPYFQQLDALYKQKSKK+++NP+NPNYELKPEELLMHMMGGQEESHQPESATDDG     DQNQEDE E E EDEDEDYQIVANN++NQMEV
Subjt:  SKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENADQNQEDEGEDEDEDEDEDYQIVANNNSNQMEV

TrEMBL top hitse value%identityAlignment
A0A0A0LK12 Uncharacterized protein2.6e-23088.18Show/hide
Query:  MLEISPSPENSSAA---ATAAAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFEN
        MLEISPSPENSSAA   A    +++ AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSRKL ELGYNR+AKKCKEKFEN
Subjt:  MLEISPSPENSSAA---ATAAAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFEN

Query:  IYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKRKF
        IYKYHKRTKD RSGK NGKNYRYFEQL+ALDNH LLPSQA SMEEIPRIIPNN+VHNAIPCSVVNPG+NFVETTTTS+STSTTS SSKESGGTRKKKRKF
Subjt:  IYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKRKF

Query:  VEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQD
        VEFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+FSEQ G VQFPE+ +LMENL EKQD
Subjt:  VEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQD

Query:  DGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDS
        D N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ+NGPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPEDS
Subjt:  DGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDS

Query:  KTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDEDYQIVA----NNNSNQMEV
        KTCPYFQQLDALYKQKSKKV++NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE E EDEDEDY+IVA    NNN+NQM+V
Subjt:  KTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDEDYQIVA----NNNSNQMEV

A0A1S3CD19 trihelix transcription factor GT-2-like8.0e-23288.2Show/hide
Query:  MLEISPSPENSSAAATAAA-----EDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKF
        MLEISPSPENSSAAA  AA     ++D AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSRKL ELGYNR+AKKCKEKF
Subjt:  MLEISPSPENSSAAATAAA-----EDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKF

Query:  ENIYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKR
        ENIYKYHKRTKD RSGK NGKNYRYFEQL+ALDNHPLLPSQA SMEEIP+IIPNN+VHNAIPCSVVNPG+NFVETTTTS+STSTTSCSSKESGGTRKKKR
Subjt:  ENIYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKR

Query:  KFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEK
        KFVEFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+ SEQ G VQFPE+ +LMENL EK
Subjt:  KFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEK

Query:  QDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPE
        QDD N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ++GPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPE
Subjt:  QDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPE

Query:  DSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDEDYQIVANNNSN---QMEV
        DSKTCPYFQQLDALYKQKSKKV++NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE E EDEDEDY+IVAN+N+N   QM+V
Subjt:  DSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDEDYQIVANNNSN---QMEV

A0A5A7TGJ1 Trihelix transcription factor GT-2-like1.4e-22885.3Show/hide
Query:  MLEISPSPENSSAAATAAA-----EDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVS-----------------RK
        MLEISPSPENSSAAA  AA     ++D AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVS                 RK
Subjt:  MLEISPSPENSSAAATAAA-----EDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVS-----------------RK

Query:  LAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTS
        L ELGYNR+AKKCKEKFENIYKYHKRTKD RSGK NGKNYRYFEQL+ALDNHPLLPSQA SMEEIP+IIPNN+VHNAIPCSVVNPG+NFVETTTTS+STS
Subjt:  LAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTS

Query:  TTSCSSKESGGTRKKKRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVG
        TTSCSSKESGGTRKKKRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+ SEQ G
Subjt:  TTSCSSKESGGTRKKKRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVG

Query:  AVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWEN
         VQFPE+ +LMENL EKQDD N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ++GPKGPLWEEISLAMKK GYDR+AKRCKEKWEN
Subjt:  AVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWEN

Query:  INKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDED
        INKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKV++NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE E EDEDED
Subjt:  INKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDED

Query:  YQIVANNNSN---QMEV
        Y+IVAN+N+N   QM+V
Subjt:  YQIVANNNSN---QMEV

A0A5D3BRR0 Trihelix transcription factor GT-2-like8.0e-23288.2Show/hide
Query:  MLEISPSPENSSAAATAAA-----EDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKF
        MLEISPSPENSSAAA  AA     ++D AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSRKL ELGYNR+AKKCKEKF
Subjt:  MLEISPSPENSSAAATAAA-----EDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKF

Query:  ENIYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKR
        ENIYKYHKRTKD RSGK NGKNYRYFEQL+ALDNHPLLPSQA SMEEIP+IIPNN+VHNAIPCSVVNPG+NFVETTTTS+STSTTSCSSKESGGTRKKKR
Subjt:  ENIYKYHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKR

Query:  KFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEK
        KFVEFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+ SEQ G VQFPE+ +LMENL EK
Subjt:  KFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEK

Query:  QDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPE
        QDD N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ++GPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPE
Subjt:  QDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPE

Query:  DSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDEDYQIVANNNSN---QMEV
        DSKTCPYFQQLDALYKQKSKKV++NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE E EDEDEDY+IVAN+N+N   QM+V
Subjt:  DSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGEDEDEDEDEDYQIVANNNSN---QMEV

A0A6J1DZ47 trihelix transcription factor GTL1-like1.8e-23690.98Show/hide
Query:  MLEISPSPENSSAAATAAAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYK
        MLE S SPENS+    AAAE+D A  S GLAEEADRSWAGNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSR LAELGYNRSAKKCKEKFENIYK
Subjt:  MLEISPSPENSSAAATAAAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYK

Query:  YHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKRKFVEF
        YHKRTKDSRSGK NGKNYRYFEQL+ALDNHPLLPSQA SMEE+PRIIPNNIVHNAIPCSVVNPGSNFV+TTTTSISTS TSCSSKESGGT KKKRKFVEF
Subjt:  YHKRTKDSRSGKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKRKFVEF

Query:  FERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGN
        FERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLN ERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILME+ M+KQDDGN
Subjt:  FERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGN

Query:  ADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTC
         DRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPEDSKTC
Subjt:  ADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTC

Query:  PYFQQLDALYKQKSKKVV----DNPSNPNYELKPEELLMHMMGGQEESH-QPESATDDGEAENADQNQEDE-GEDEDEDEDEDYQIVANNNSNQMEVAS
        PYFQQLDALYK+KSKKV     +N +NPNYELKPEELLMHMMGGQEE H QPESATDDGE ENADQNQEDE  EDE+E+EDEDYQIVANNNSNQM VAS
Subjt:  PYFQQLDALYKQKSKKVV----DNPSNPNYELKPEELLMHMMGGQEESH-QPESATDDGEAENADQNQEDE-GEDEDEDEDEDYQIVANNNSNQMEVAS

SwissProt top hitse value%identityAlignment
Q39117 Trihelix transcription factor GT-21.1e-8741.09Show/hide
Query:  EDDGAAPSAGLAEEADRSW---------AGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRS
        E  G      + EE D             GNRWPR ET+ALL++RS MD AFRD++LKAPLWEE+SRK+ ELGY RS+KKCKEKFEN+YKYHKRTK+ R+
Subjt:  EDDGAAPSAGLAEEADRSW---------AGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRS

Query:  GKPNGKNYRYFEQLQALD------------------------------------------------------------NHPLLPSQAGSMEEIPRIIPNN
        GK  GK YR+FE+L+A +                                                            N   L  Q  S    P    NN
Subjt:  GKPNGKNYRYFEQLQALD------------------------------------------------------------NHPLLPSQAGSMEEIPRIIPNN

Query:  IVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESG-----GTRKKKRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARI
            + P    +  +N       S STS+++ S +E        +RKK++ +   F +L  E++EKQEK+QK+F+E LE  E+ER++REE W++QE+ RI
Subjt:  IVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESG-----GTRKKKRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARI

Query:  KKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPES---------------SILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEID
         +E E L  ERS AAAKDAA++SFL   S   G  Q P+                SI  E+   K+        T    N +N +S   SSSRWPK E++
Subjt:  KKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPES---------------SILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEID

Query:  ALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKP
        ALI++R NL+  YQENG KGPLWEEIS  M++ GY+RSAKRCKEKWENINKYFK+VKESNKKRP DSKTCPYF QL+ALY +++K        P      
Subjt:  ALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKP

Query:  EELLMHMMGGQE-ESHQPESATD-----DGEAENADQNQEDEGEDEDEDEDEDYQIVANNNSNQMEV
         +LL+      E E+ Q E   D     +GE+E  + ++E+EG  E ++E  +++IV N  S+ M++
Subjt:  EELLMHMMGGQE-ESHQPESATD-----DGEAENADQNQEDEGEDEDEDEDEDYQIVANNNSNQMEV

Q8H181 Trihelix transcription factor GTL21.0e-3729.93Show/hide
Query:  WPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPN-----------GKNYRYFEQLQALDNH-
        W  +E +ALL+ RS+++  F + +     WE  SRKLAE+G+ RS ++CKEKFE   + +  + ++ +   N           G NYR F +++   +H 
Subjt:  WPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPN-----------GKNYRYFEQLQALDNH-

Query:  ------------------PLLPSQAGSMEEIPRIIPNNIVHN-----AIPCSVVNPGSNF-------VETTTTSISTSTTSCSSKESGGTRKKKRK----
                           L+  +    E +  ++  + + +         S+ N  ++        VE    S S+S+     KE    ++KK K    
Subjt:  ------------------PLLPSQAGSMEEIPRIIPNNIVHN-----AIPCSVVNPGSNF-------VETTTTSISTSTTSCSSKESGGTRKKKRK----

Query:  -FVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQ-------------------
            F E L+  +I +QE++ KK +E + K E+E++AREE WK QE+ R+ KE E   QE+++A+ ++  ++ F+  F++                    
Subjt:  -FVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQ-------------------

Query:  ----VGAVQFPESSILMENLMEKQDDGNADRN-------TSNQENSN----NGNSNQISSSRWPKEEIDALIQLRTNL----------QMKYQENGPKGP
             G  +F  SS L+   +   +    D++       T   +N N      +       RWPK+E+ ALI +R ++          +     +    P
Subjt:  ----VGAVQFPESSILMENLMEKQDDGNADRN-------TSNQENSN----NGNSNQISSSRWPKEEIDALIQLRTNL----------QMKYQENGPKGP

Query:  LWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQ
        LWE IS  M + GY RSAKRCKEKWENINKYF++ K+ NKKRP DS+TCPYF QL ALY Q
Subjt:  LWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQ

Q9C6K3 Trihelix transcription factor DF11.4e-10044.83Show/hide
Query:  SPENSSAAATAAAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTK
        S  N SAA  AAA   GA   +   E  DR + GNRWPR+ET+ALLK+RS M  AFRDAS+K PLWEEVSRK+AE GY R+AKKCKEKFEN+YKYHKRTK
Subjt:  SPENSSAAATAAAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTK

Query:  DSRSGKPNGKNYRYFEQLQALDNH------------PLLPSQAGSMEE--------IPRIIPNNIVHNAIPCSVVNP-------------GSNFVETTTT
        + R+GK  GK YR+F+QL+AL++             PL P Q  +                P   V   +P S + P               +F+   +T
Subjt:  DSRSGKPNGKNYRYFEQLQALDNH------------PLLPSQAGSMEE--------IPRIIPNNIVHNAIPCSVVNP-------------GSNFVETTTT

Query:  SISTSTTSCSSKESGG----TRKK-KRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLS
        S S+S ++ S  E GG    TRKK KRK+  FFERLM +V++KQE+LQ+KF+EA+EK E ERL REE W++QE+ARI +E E L QERS++AAKDAAV++
Subjt:  SISTSTTSCSSKESGG----TRKK-KRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLS

Query:  FLKMFSEQVGAVQFPE-------SSILMENLMEKQDDGNA---------------------DRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQ
        FL+  SE+      P+        S+ + N  ++Q    +                        T N  + N   +   SSSRWPK EI+ALI+LRTNL 
Subjt:  FLKMFSEQVGAVQFPE-------SSILMENLMEKQDDGNA---------------------DRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQ

Query:  MKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVVDN----PSNPNYELKPEELLMH
         KYQENGPKGPLWEEIS  M++ G++R++KRCKEKWENINKYFK+VKESNKKRPEDSKTCPYF QLDALY++++K   +N     S+ +  +KP+  +  
Subjt:  MKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVVDN----PSNPNYELKPEELLMH

Query:  MMGGQ--------------------EESHQPESATDDGEAENADQNQEDEGEDEDEDEDEDYQIVANNNSN
        M+  +                    ++S   E   DD E  + + + EDE E+ +E+E  ++++V +NN+N
Subjt:  MMGGQ--------------------EESHQPESATDDGEAENADQNQEDEGEDEDEDEDEDYQIVANNNSN

Q9C882 Trihelix transcription factor GTL14.4e-7838.52Show/hide
Query:  TAAAEDDG-AAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPN
        +AAA+D G      G    +  S +GNRWPREET+ALL++RS MD+ FRDA+LKAPLWE VSRKL ELGY RS+KKCKEKFEN+ KY+KRTK++R G+ +
Subjt:  TAAAEDDG-AAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPN

Query:  GKNYRYFEQLQALDNHP----------------LLPSQAGS------------MEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKE
        GK Y++F QL+AL+  P                L+PS + S              + P+    +      P  + + G  F   T +S S+ST S    +
Subjt:  GKNYRYFEQLQALDNHP----------------LLPSQAGS------------MEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKE

Query:  --------------SGGTRKKKR-------KFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAA
                         +RK+KR       K +E FE L+ +V++KQ  +Q+ F+EALEK EQERL REE WK QE+AR+ +E E ++QER+ +A++DAA
Subjt:  --------------SGGTRKKKR-------KFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAA

Query:  VLSFLKMFS---------------------------------------EQVGAVQFPESSIL----MENLMEKQDDGNADRNTSNQENSNNGNSNQISSS
        ++S ++  +                                        Q   +  P+  IL      +    Q +    +    +   ++  S+  SSS
Subjt:  VLSFLKMFS---------------------------------------EQVGAVQFPESSIL----MENLMEKQDDGNADRNTSNQENSNNGNSNQISSS

Query:  RWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQK---------
        RWPK EI ALI LR+ ++ +YQ+N PKG LWEEIS +MK+ GY+R+AKRCKEKWENINKY+K+VKESNKKRP+D+KTCPYF +LD LY+ K         
Subjt:  RWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQK---------

Query:  -SKKVVDNPSNPNYELK-PEELLMHMMGGQEESHQPESATDDGEAENADQNQE
         S    D   +P   +K P+E L+++    +++H   S  ++   E + Q  E
Subjt:  -SKKVVDNPSNPNYELK-PEELLMHMMGGQEESHQPESATDDGEAENADQNQE

Q9LZS0 Trihelix transcription factor PTL3.6e-4334.13Show/hide
Query:  RWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLA-ELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLQAL--DNHPLLPSQAG
        RWPR+ET+ LL++RS +D  F++A+ K PLW+EVSR ++ E GY RS KKC+EKFEN+YKY+++TK+ ++G+ +GK+YR+F QL+AL  D++ L+     
Subjt:  RWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLA-ELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLQAL--DNHPLLPSQAG

Query:  SMEEIPRIIPNNIVHNAIPCSVVNPGSNFVET----------------TTTSISTSTTSCSSKESGGTRKK---KRKFVEFFERLMNEVIEKQEKLQKKF
        + + +   +     H   P +V    SN                     ++ +   T+S    +S   RKK   K K  EF +  M  +IE+Q+   +K 
Subjt:  SMEEIPRIIPNNIVHNAIPCSVVNPGSNFVET----------------TTTSISTSTTSCSSKESGGTRKK---KRKFVEFFERLMNEVIEKQEKLQKKF

Query:  VEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGN---S
         + +E  E++R+ +EEEW+  E ARI KE     +ER+   A+D AV+  L+  + +      P    L  +  E+ +  N  RN S  +N N  +   +
Subjt:  VEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGN---S

Query:  NQI----SSSRWPKEEIDALIQLRTNLQMKYQE--NGPKGP-LWEEISLAMKKHGYD-RSAKRCKEKWENI-NKYFKRVKESNKKRPEDSKTCPYF---Q
        N +    SSS W ++EI  L+++RT++   +QE   G     LWEEI+  + + G+D RSA  CKEKWE I N   K  K+ NKKR ++S +C  +    
Subjt:  NQI----SSSRWPKEEIDALIQLRTNLQMKYQE--NGPKGP-LWEEISLAMKKHGYD-RSAKRCKEKWENI-NKYFKRVKESNKKRPEDSKTCPYF---Q

Query:  QLDALYKQKSKKVVDN
        + + +Y  +     DN
Subjt:  QLDALYKQKSKKVVDN

Arabidopsis top hitse value%identityAlignment
AT1G33240.1 GT-2-like 11.9e-7636.48Show/hide
Query:  TAAAEDDG-AAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPN
        +AAA+D G      G    +  S +GNRWPREET+ALL++RS MD+ FRDA+LKAPLWE VSRKL ELGY RS+KKCKEKFEN+ KY+KRTK++R G+ +
Subjt:  TAAAEDDG-AAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPN

Query:  GKNYRYFEQLQALDNHP----------------LLPSQAGS------------MEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKE
        GK Y++F QL+AL+  P                L+PS + S              + P+    +      P  + + G  F   T +S S+ST S    +
Subjt:  GKNYRYFEQLQALDNHP----------------LLPSQAGS------------MEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKE

Query:  --------------SGGTRKKKR-------KFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAA
                         +RK+KR       K +E FE L+ +V++KQ  +Q+ F+EALEK EQERL REE WK QE+AR+ +E E ++QER+ +A++DAA
Subjt:  --------------SGGTRKKKR-------KFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAA

Query:  VLSFLKMFS---------------------------------------EQVGAVQFPESSIL----MENLMEKQDDGNADRNTSNQENSNNGNSNQISSS
        ++S ++  +                                        Q   +  P+  IL      +    Q +    +    +   ++  S+  SSS
Subjt:  VLSFLKMFS---------------------------------------EQVGAVQFPESSIL----MENLMEKQDDGNADRNTSNQENSNNGNSNQISSS

Query:  RWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYK-----------
        RWPK EI ALI LR+ ++ +YQ+N PKG LWEEIS +MK+ GY+R+AKRCKEKWENINKY+K+VKESNKKRP+D+KTCPYF +LD LY+           
Subjt:  RWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYK-----------

Query:  -------QKSKKVV-----------------------DNP--SNPNYELKPEELLM-HMMGGQEESHQPESATDDGEAENADQNQEDEGEDEDEDEDEDY
               QK   V                        + P   +P    KPE+L+M  ++  Q++  Q ES   + E      N  +  E+ED++ DE+ 
Subjt:  -------QKSKKVV-----------------------DNP--SNPNYELKPEELLM-HMMGGQEESHQPESATDDGEAENADQNQEDEGEDEDEDEDEDY

Query:  QIVANNNSNQMEVA
        ++  +  S   E+A
Subjt:  QIVANNNSNQMEVA

AT1G76880.1 Duplicated homeodomain-like superfamily protein1.0e-10144.83Show/hide
Query:  SPENSSAAATAAAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTK
        S  N SAA  AAA   GA   +   E  DR + GNRWPR+ET+ALLK+RS M  AFRDAS+K PLWEEVSRK+AE GY R+AKKCKEKFEN+YKYHKRTK
Subjt:  SPENSSAAATAAAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTK

Query:  DSRSGKPNGKNYRYFEQLQALDNH------------PLLPSQAGSMEE--------IPRIIPNNIVHNAIPCSVVNP-------------GSNFVETTTT
        + R+GK  GK YR+F+QL+AL++             PL P Q  +                P   V   +P S + P               +F+   +T
Subjt:  DSRSGKPNGKNYRYFEQLQALDNH------------PLLPSQAGSMEE--------IPRIIPNNIVHNAIPCSVVNP-------------GSNFVETTTT

Query:  SISTSTTSCSSKESGG----TRKK-KRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLS
        S S+S ++ S  E GG    TRKK KRK+  FFERLM +V++KQE+LQ+KF+EA+EK E ERL REE W++QE+ARI +E E L QERS++AAKDAAV++
Subjt:  SISTSTTSCSSKESGG----TRKK-KRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLS

Query:  FLKMFSEQVGAVQFPE-------SSILMENLMEKQDDGNA---------------------DRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQ
        FL+  SE+      P+        S+ + N  ++Q    +                        T N  + N   +   SSSRWPK EI+ALI+LRTNL 
Subjt:  FLKMFSEQVGAVQFPE-------SSILMENLMEKQDDGNA---------------------DRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQ

Query:  MKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVVDN----PSNPNYELKPEELLMH
         KYQENGPKGPLWEEIS  M++ G++R++KRCKEKWENINKYFK+VKESNKKRPEDSKTCPYF QLDALY++++K   +N     S+ +  +KP+  +  
Subjt:  MKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVVDN----PSNPNYELKPEELLMH

Query:  MMGGQ--------------------EESHQPESATDDGEAENADQNQEDEGEDEDEDEDEDYQIVANNNSN
        M+  +                    ++S   E   DD E  + + + EDE E+ +E+E  ++++V +NN+N
Subjt:  MMGGQ--------------------EESHQPESATDDGEAENADQNQEDEGEDEDEDEDEDYQIVANNNSN

AT1G76890.2 Duplicated homeodomain-like superfamily protein7.5e-8941.09Show/hide
Query:  EDDGAAPSAGLAEEADRSW---------AGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRS
        E  G      + EE D             GNRWPR ET+ALL++RS MD AFRD++LKAPLWEE+SRK+ ELGY RS+KKCKEKFEN+YKYHKRTK+ R+
Subjt:  EDDGAAPSAGLAEEADRSW---------AGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRS

Query:  GKPNGKNYRYFEQLQALD------------------------------------------------------------NHPLLPSQAGSMEEIPRIIPNN
        GK  GK YR+FE+L+A +                                                            N   L  Q  S    P    NN
Subjt:  GKPNGKNYRYFEQLQALD------------------------------------------------------------NHPLLPSQAGSMEEIPRIIPNN

Query:  IVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESG-----GTRKKKRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARI
            + P    +  +N       S STS+++ S +E        +RKK++ +   F +L  E++EKQEK+QK+F+E LE  E+ER++REE W++QE+ RI
Subjt:  IVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESG-----GTRKKKRKFVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARI

Query:  KKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPES---------------SILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEID
         +E E L  ERS AAAKDAA++SFL   S   G  Q P+                SI  E+   K+        T    N +N +S   SSSRWPK E++
Subjt:  KKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPES---------------SILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEID

Query:  ALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKP
        ALI++R NL+  YQENG KGPLWEEIS  M++ GY+RSAKRCKEKWENINKYFK+VKESNKKRP DSKTCPYF QL+ALY +++K        P      
Subjt:  ALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKP

Query:  EELLMHMMGGQE-ESHQPESATD-----DGEAENADQNQEDEGEDEDEDEDEDYQIVANNNSNQMEV
         +LL+      E E+ Q E   D     +GE+E  + ++E+EG  E ++E  +++IV N  S+ M++
Subjt:  EELLMHMMGGQE-ESHQPESATD-----DGEAENADQNQEDEGEDEDEDEDEDYQIVANNNSNQMEV

AT5G03680.1 Duplicated homeodomain-like superfamily protein2.5e-4434.13Show/hide
Query:  RWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLA-ELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLQAL--DNHPLLPSQAG
        RWPR+ET+ LL++RS +D  F++A+ K PLW+EVSR ++ E GY RS KKC+EKFEN+YKY+++TK+ ++G+ +GK+YR+F QL+AL  D++ L+     
Subjt:  RWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLA-ELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLQAL--DNHPLLPSQAG

Query:  SMEEIPRIIPNNIVHNAIPCSVVNPGSNFVET----------------TTTSISTSTTSCSSKESGGTRKK---KRKFVEFFERLMNEVIEKQEKLQKKF
        + + +   +     H   P +V    SN                     ++ +   T+S    +S   RKK   K K  EF +  M  +IE+Q+   +K 
Subjt:  SMEEIPRIIPNNIVHNAIPCSVVNPGSNFVET----------------TTTSISTSTTSCSSKESGGTRKK---KRKFVEFFERLMNEVIEKQEKLQKKF

Query:  VEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGN---S
         + +E  E++R+ +EEEW+  E ARI KE     +ER+   A+D AV+  L+  + +      P    L  +  E+ +  N  RN S  +N N  +   +
Subjt:  VEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGN---S

Query:  NQI----SSSRWPKEEIDALIQLRTNLQMKYQE--NGPKGP-LWEEISLAMKKHGYD-RSAKRCKEKWENI-NKYFKRVKESNKKRPEDSKTCPYF---Q
        N +    SSS W ++EI  L+++RT++   +QE   G     LWEEI+  + + G+D RSA  CKEKWE I N   K  K+ NKKR ++S +C  +    
Subjt:  NQI----SSSRWPKEEIDALIQLRTNLQMKYQE--NGPKGP-LWEEISLAMKKHGYD-RSAKRCKEKWENI-NKYFKRVKESNKKRPEDSKTCPYF---Q

Query:  QLDALYKQKSKKVVDN
        + + +Y  +     DN
Subjt:  QLDALYKQKSKKVVDN

AT5G28300.1 Duplicated homeodomain-like superfamily protein7.1e-3929.93Show/hide
Query:  WPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPN-----------GKNYRYFEQLQALDNH-
        W  +E +ALL+ RS+++  F + +     WE  SRKLAE+G+ RS ++CKEKFE   + +  + ++ +   N           G NYR F +++   +H 
Subjt:  WPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPN-----------GKNYRYFEQLQALDNH-

Query:  ------------------PLLPSQAGSMEEIPRIIPNNIVHN-----AIPCSVVNPGSNF-------VETTTTSISTSTTSCSSKESGGTRKKKRK----
                           L+  +    E +  ++  + + +         S+ N  ++        VE    S S+S+     KE    ++KK K    
Subjt:  ------------------PLLPSQAGSMEEIPRIIPNNIVHN-----AIPCSVVNPGSNF-------VETTTTSISTSTTSCSSKESGGTRKKKRK----

Query:  -FVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQ-------------------
            F E L+  +I +QE++ KK +E + K E+E++AREE WK QE+ R+ KE E   QE+++A+ ++  ++ F+  F++                    
Subjt:  -FVEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQ-------------------

Query:  ----VGAVQFPESSILMENLMEKQDDGNADRN-------TSNQENSN----NGNSNQISSSRWPKEEIDALIQLRTNL----------QMKYQENGPKGP
             G  +F  SS L+   +   +    D++       T   +N N      +       RWPK+E+ ALI +R ++          +     +    P
Subjt:  ----VGAVQFPESSILMENLMEKQDDGNADRN-------TSNQENSN----NGNSNQISSSRWPKEEIDALIQLRTNL----------QMKYQENGPKGP

Query:  LWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQ
        LWE IS  M + GY RSAKRCKEKWENINKYF++ K+ NKKRP DS+TCPYF QL ALY Q
Subjt:  LWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGAAATTTCTCCTTCACCGGAAAACTCTTCCGCCGCCGCCACCGCCGCCGCCGAGGATGACGGTGCGGCGCCCTCCGCCGGACTTGCGGAGGAGGCTGACCGGAG
CTGGGCCGGCAACCGGTGGCCACGAGAGGAGACCATCGCTTTGCTTAAGGTGAGGTCGAGTATGGACACTGCGTTTAGAGATGCAAGCCTTAAAGCTCCTCTATGGGAGG
AAGTTTCCAGGAAATTGGCTGAGCTTGGGTATAATCGAAGTGCGAAAAAATGCAAAGAGAAATTTGAGAACATTTATAAGTATCACAAAAGGACTAAAGATAGCAGATCG
GGCAAACCCAATGGGAAAAATTATAGGTATTTTGAGCAATTACAAGCTTTAGATAATCATCCATTGCTACCTTCTCAAGCTGGTTCAATGGAAGAAATCCCAAGGATTAT
CCCAAACAATATTGTTCACAATGCAATTCCATGTTCCGTTGTGAACCCGGGTTCGAATTTTGTTGAAACTACCACCACATCGATATCGACTTCGACGACTTCTTGCTCGA
GCAAAGAGTCGGGTGGGACGAGGAAGAAGAAGAGGAAGTTTGTGGAGTTCTTTGAGAGGTTAATGAATGAGGTGATTGAGAAGCAGGAGAAATTGCAAAAGAAGTTTGTC
GAGGCATTGGAGAAATGTGAACAAGAAAGGTTAGCTAGAGAAGAAGAATGGAAGATGCAAGAATTAGCTCGAATCAAGAAAGAGCGAGAGCGTTTAAATCAAGAGAGATC
AATTGCGGCTGCAAAGGATGCAGCTGTTCTTTCATTCTTGAAGATGTTCTCTGAACAGGTGGGCGCAGTGCAGTTTCCCGAGAGCTCGATTTTGATGGAGAATTTAATGG
AGAAACAAGATGATGGCAATGCCGACAGAAATACAAGCAATCAAGAGAATAGCAACAACGGAAATTCGAATCAAATTAGCTCATCCCGGTGGCCGAAAGAAGAGATCGAT
GCTCTGATTCAGCTTAGGACTAATCTGCAGATGAAGTACCAAGAAAATGGGCCTAAAGGTCCTCTCTGGGAGGAAATATCACTGGCCATGAAGAAACATGGGTATGATAG
AAGTGCAAAGAGGTGTAAAGAGAAATGGGAGAACATCAACAAATACTTCAAGAGAGTAAAGGAAAGCAACAAAAAACGACCCGAGGATTCAAAGACATGCCCTTACTTCC
AGCAGCTTGACGCATTGTACAAACAGAAATCCAAGAAAGTCGTTGACAATCCATCTAATCCAAATTACGAACTAAAACCCGAAGAACTATTGATGCACATGATGGGCGGG
CAAGAGGAAAGCCACCAACCCGAATCAGCAACAGACGACGGGGAAGCTGAGAATGCAGATCAGAATCAAGAAGACGAAGGCGAAGACGAGGACGAGGACGAAGATGAAGA
TTATCAAATTGTAGCCAACAACAACAGTAATCAAATGGAAGTAGCAAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGGAAATTTCTCCTTCACCGGAAAACTCTTCCGCCGCCGCCACCGCCGCCGCCGAGGATGACGGTGCGGCGCCCTCCGCCGGACTTGCGGAGGAGGCTGACCGGAG
CTGGGCCGGCAACCGGTGGCCACGAGAGGAGACCATCGCTTTGCTTAAGGTGAGGTCGAGTATGGACACTGCGTTTAGAGATGCAAGCCTTAAAGCTCCTCTATGGGAGG
AAGTTTCCAGGAAATTGGCTGAGCTTGGGTATAATCGAAGTGCGAAAAAATGCAAAGAGAAATTTGAGAACATTTATAAGTATCACAAAAGGACTAAAGATAGCAGATCG
GGCAAACCCAATGGGAAAAATTATAGGTATTTTGAGCAATTACAAGCTTTAGATAATCATCCATTGCTACCTTCTCAAGCTGGTTCAATGGAAGAAATCCCAAGGATTAT
CCCAAACAATATTGTTCACAATGCAATTCCATGTTCCGTTGTGAACCCGGGTTCGAATTTTGTTGAAACTACCACCACATCGATATCGACTTCGACGACTTCTTGCTCGA
GCAAAGAGTCGGGTGGGACGAGGAAGAAGAAGAGGAAGTTTGTGGAGTTCTTTGAGAGGTTAATGAATGAGGTGATTGAGAAGCAGGAGAAATTGCAAAAGAAGTTTGTC
GAGGCATTGGAGAAATGTGAACAAGAAAGGTTAGCTAGAGAAGAAGAATGGAAGATGCAAGAATTAGCTCGAATCAAGAAAGAGCGAGAGCGTTTAAATCAAGAGAGATC
AATTGCGGCTGCAAAGGATGCAGCTGTTCTTTCATTCTTGAAGATGTTCTCTGAACAGGTGGGCGCAGTGCAGTTTCCCGAGAGCTCGATTTTGATGGAGAATTTAATGG
AGAAACAAGATGATGGCAATGCCGACAGAAATACAAGCAATCAAGAGAATAGCAACAACGGAAATTCGAATCAAATTAGCTCATCCCGGTGGCCGAAAGAAGAGATCGAT
GCTCTGATTCAGCTTAGGACTAATCTGCAGATGAAGTACCAAGAAAATGGGCCTAAAGGTCCTCTCTGGGAGGAAATATCACTGGCCATGAAGAAACATGGGTATGATAG
AAGTGCAAAGAGGTGTAAAGAGAAATGGGAGAACATCAACAAATACTTCAAGAGAGTAAAGGAAAGCAACAAAAAACGACCCGAGGATTCAAAGACATGCCCTTACTTCC
AGCAGCTTGACGCATTGTACAAACAGAAATCCAAGAAAGTCGTTGACAATCCATCTAATCCAAATTACGAACTAAAACCCGAAGAACTATTGATGCACATGATGGGCGGG
CAAGAGGAAAGCCACCAACCCGAATCAGCAACAGACGACGGGGAAGCTGAGAATGCAGATCAGAATCAAGAAGACGAAGGCGAAGACGAGGACGAGGACGAAGATGAAGA
TTATCAAATTGTAGCCAACAACAACAGTAATCAAATGGAAGTAGCAAGCTGA
Protein sequenceShow/hide protein sequence
MLEISPSPENSSAAATAAAEDDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRS
GKPNGKNYRYFEQLQALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSTTSCSSKESGGTRKKKRKFVEFFERLMNEVIEKQEKLQKKFV
EALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEID
ALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVVDNPSNPNYELKPEELLMHMMGG
QEESHQPESATDDGEAENADQNQEDEGEDEDEDEDEDYQIVANNNSNQMEVAS