; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg027222 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg027222
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
Descriptiontrihelix transcription factor GT-2-like
Genome locationscaffold8:5319287..5321634
RNA-Seq ExpressionSpg027222
SyntenySpg027222
Gene Ontology termsNA
InterPro domainsIPR001005 - SANT/Myb domain
IPR009057 - Homeobox-like domain superfamily
IPR044822 - Myb/SANT-like DNA-binding domain 4


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0040781.1 trihelix transcription factor GT-2-like [Cucumis melo var. makuwa]8.5e-22885.08Show/hide
Query:  MLEISPSPENSSAAATAAA-----EEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVS-----------------RK
        MLEISPSPENSSAAA  AA     +ED AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVS                 RK
Subjt:  MLEISPSPENSSAAATAAA-----EEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVS-----------------RK

Query:  LAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTS
        L ELGYNR+AKKCKEKFENIYKYHKRTKD RSGK NGKNYRYFEQLEALDNHPLLPSQA SMEEIP+IIPNN+VHNAIPCSVVNPG+NFVETTTTS+STS
Subjt:  LAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTS

Query:  MTSCSSKDSGGTRKKKRKFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVG
         TSCSSK+SGGTRKKKRKF+EFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+ SEQ G
Subjt:  MTSCSSKDSGGTRKKKRKFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVG

Query:  AVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWEN
         VQFPE+ +LMENL EKQDD N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ++GPKGPLWEEISLAMKK GYDR+AKRCKEKWEN
Subjt:  AVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWEN

Query:  INKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDY
        INKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVI+NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE+ E+EDEDY
Subjt:  INKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDY

Query:  QIVANNNSN---QMEV
        +IVAN+N+N   QM+V
Subjt:  QIVANNNSN---QMEV

XP_004147355.2 trihelix transcription factor GT-2 [Cucumis sativus]1.5e-22987.95Show/hide
Query:  MLEISPSPENSSAA---ATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFEN
        MLEISPSPENSSAA   A    +E+ AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSRKL ELGYNR+AKKCKEKFEN
Subjt:  MLEISPSPENSSAA---ATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFEN

Query:  IYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKRKF
        IYKYHKRTKD RSGK NGKNYRYFEQLEALDNH LLPSQA SMEEIPRIIPNN+VHNAIPCSVVNPG+NFVETTTTS+STS TS SSK+SGGTRKKKRKF
Subjt:  IYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKRKF

Query:  LEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQD
        +EFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+FSEQ G VQFPE+ +LMENL EKQD
Subjt:  LEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQD

Query:  DGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDS
        D N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ+NGPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPEDS
Subjt:  DGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDS

Query:  KTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDYQIVA----NNNSNQMEV
        KTCPYFQQLDALYKQKSKKVI+NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE+ E+EDEDY+IVA    NNN+NQM+V
Subjt:  KTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDYQIVA----NNNSNQMEV

XP_008460913.1 PREDICTED: trihelix transcription factor GT-2-like [Cucumis melo]4.8e-23187.98Show/hide
Query:  MLEISPSPENSSAAATAAA-----EEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKF
        MLEISPSPENSSAAA  AA     +ED AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSRKL ELGYNR+AKKCKEKF
Subjt:  MLEISPSPENSSAAATAAA-----EEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKF

Query:  ENIYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKR
        ENIYKYHKRTKD RSGK NGKNYRYFEQLEALDNHPLLPSQA SMEEIP+IIPNN+VHNAIPCSVVNPG+NFVETTTTS+STS TSCSSK+SGGTRKKKR
Subjt:  ENIYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKR

Query:  KFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEK
        KF+EFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+ SEQ G VQFPE+ +LMENL EK
Subjt:  KFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEK

Query:  QDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPE
        QDD N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ++GPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPE
Subjt:  QDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPE

Query:  DSKTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDYQIVANNNSN---QMEV
        DSKTCPYFQQLDALYKQKSKKVI+NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE+ E+EDEDY+IVAN+N+N   QM+V
Subjt:  DSKTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDYQIVANNNSN---QMEV

XP_022159187.1 trihelix transcription factor GTL1-like [Momordica charantia]1.1e-23590.58Show/hide
Query:  MLEISPSPENSSAAATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYK
        MLE S SPENS+    AAAEED A  S GLAEEADRSWAGNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSR LAELGYNRSAKKCKEKFENIYK
Subjt:  MLEISPSPENSSAAATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYK

Query:  YHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKRKFLEF
        YHKRTKDSRSGK NGKNYRYFEQLEALDNHPLLPSQA SMEE+PRIIPNNIVHNAIPCSVVNPGSNFV+TTTTSISTS TSCSSK+SGGT KKKRKF+EF
Subjt:  YHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKRKFLEF

Query:  FERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGN
        FERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLN ERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILME+ M+KQDDGN
Subjt:  FERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGN

Query:  ADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTC
         DRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPEDSKTC
Subjt:  ADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTC

Query:  PYFQQLDALYKQKSKKVI----DNPSNPNYELKPEELLMHMMGGQEESH-QPESATDDGEAENADQNQEDEGEDEE---EDEDYQIVANNNSNQMEVAS
        PYFQQLDALYK+KSKKV     +N +NPNYELKPEELLMHMMGGQEE H QPESATDDGE ENADQNQEDE E++E   EDEDYQIVANNNSNQM VAS
Subjt:  PYFQQLDALYKQKSKKVI----DNPSNPNYELKPEELLMHMMGGQEESH-QPESATDDGEAENADQNQEDEGEDEE---EDEDYQIVANNNSNQMEVAS

XP_038901714.1 trihelix transcription factor GT-2-like [Benincasa hispida]5.1e-23388.89Show/hide
Query:  MLEISPSPENSSAAATA----AAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFE
        MLEISPSPENS+  A A    AAEEDGAA SAGL+EE DR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNR+AKKCKEKFE
Subjt:  MLEISPSPENSSAAATA----AAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFE

Query:  NIYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKRK
        NIYKYHKRTKD RSGK NGKNYRYFEQLEA DNHPLLPSQA SMEEIPRIIPNN+VHNAIPCSVV PG+NFVETTTTSISTS TSCSSK+SGGTRKKKRK
Subjt:  NIYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKRK

Query:  FLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQ
        F+EFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+FSEQVG VQFPE+ ILMENL EKQ
Subjt:  FLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQ

Query:  DDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPED
        DDGN DRNTS +EN NNGNS+QISSSRWPKEEIDALIQLRT+LQMKYQ+NGPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRV+ESNKKRPED
Subjt:  DDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPED

Query:  SKTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENADQNQED--EGEDEEEDEDYQIVANNNSNQMEV
        SKTCPYFQQLDALYKQKSKK+I+NP+NPNYELKPEELLMHMMGGQEESHQPESATDDG     DQNQED  EGE E+EDEDYQIVANN++NQMEV
Subjt:  SKTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENADQNQED--EGEDEEEDEDYQIVANNNSNQMEV

TrEMBL top hitse value%identityAlignment
A0A0A0LK12 Uncharacterized protein7.5e-23087.95Show/hide
Query:  MLEISPSPENSSAA---ATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFEN
        MLEISPSPENSSAA   A    +E+ AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSRKL ELGYNR+AKKCKEKFEN
Subjt:  MLEISPSPENSSAA---ATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFEN

Query:  IYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKRKF
        IYKYHKRTKD RSGK NGKNYRYFEQLEALDNH LLPSQA SMEEIPRIIPNN+VHNAIPCSVVNPG+NFVETTTTS+STS TS SSK+SGGTRKKKRKF
Subjt:  IYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKRKF

Query:  LEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQD
        +EFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+FSEQ G VQFPE+ +LMENL EKQD
Subjt:  LEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQD

Query:  DGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDS
        D N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ+NGPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPEDS
Subjt:  DGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDS

Query:  KTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDYQIVA----NNNSNQMEV
        KTCPYFQQLDALYKQKSKKVI+NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE+ E+EDEDY+IVA    NNN+NQM+V
Subjt:  KTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDYQIVA----NNNSNQMEV

A0A1S3CD19 trihelix transcription factor GT-2-like2.3e-23187.98Show/hide
Query:  MLEISPSPENSSAAATAAA-----EEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKF
        MLEISPSPENSSAAA  AA     +ED AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSRKL ELGYNR+AKKCKEKF
Subjt:  MLEISPSPENSSAAATAAA-----EEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKF

Query:  ENIYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKR
        ENIYKYHKRTKD RSGK NGKNYRYFEQLEALDNHPLLPSQA SMEEIP+IIPNN+VHNAIPCSVVNPG+NFVETTTTS+STS TSCSSK+SGGTRKKKR
Subjt:  ENIYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKR

Query:  KFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEK
        KF+EFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+ SEQ G VQFPE+ +LMENL EK
Subjt:  KFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEK

Query:  QDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPE
        QDD N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ++GPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPE
Subjt:  QDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPE

Query:  DSKTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDYQIVANNNSN---QMEV
        DSKTCPYFQQLDALYKQKSKKVI+NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE+ E+EDEDY+IVAN+N+N   QM+V
Subjt:  DSKTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDYQIVANNNSN---QMEV

A0A5A7TGJ1 Trihelix transcription factor GT-2-like4.1e-22885.08Show/hide
Query:  MLEISPSPENSSAAATAAA-----EEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVS-----------------RK
        MLEISPSPENSSAAA  AA     +ED AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVS                 RK
Subjt:  MLEISPSPENSSAAATAAA-----EEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVS-----------------RK

Query:  LAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTS
        L ELGYNR+AKKCKEKFENIYKYHKRTKD RSGK NGKNYRYFEQLEALDNHPLLPSQA SMEEIP+IIPNN+VHNAIPCSVVNPG+NFVETTTTS+STS
Subjt:  LAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTS

Query:  MTSCSSKDSGGTRKKKRKFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVG
         TSCSSK+SGGTRKKKRKF+EFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+ SEQ G
Subjt:  MTSCSSKDSGGTRKKKRKFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVG

Query:  AVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWEN
         VQFPE+ +LMENL EKQDD N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ++GPKGPLWEEISLAMKK GYDR+AKRCKEKWEN
Subjt:  AVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWEN

Query:  INKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDY
        INKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVI+NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE+ E+EDEDY
Subjt:  INKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDY

Query:  QIVANNNSN---QMEV
        +IVAN+N+N   QM+V
Subjt:  QIVANNNSN---QMEV

A0A5D3BRR0 Trihelix transcription factor GT-2-like2.3e-23187.98Show/hide
Query:  MLEISPSPENSSAAATAAA-----EEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKF
        MLEISPSPENSSAAA  AA     +ED AA SAG+ EEADR+W GNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSRKL ELGYNR+AKKCKEKF
Subjt:  MLEISPSPENSSAAATAAA-----EEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKF

Query:  ENIYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKR
        ENIYKYHKRTKD RSGK NGKNYRYFEQLEALDNHPLLPSQA SMEEIP+IIPNN+VHNAIPCSVVNPG+NFVETTTTS+STS TSCSSK+SGGTRKKKR
Subjt:  ENIYKYHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKR

Query:  KFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEK
        KF+EFFERLMNEVIEKQEKLQKKFVEALEKCE ERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLK+ SEQ G VQFPE+ +LMENL EK
Subjt:  KFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEK

Query:  QDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPE
        QDD N +RNTS QEN NNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQ++GPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPE
Subjt:  QDDGNADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPE

Query:  DSKTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDYQIVANNNSN---QMEV
        DSKTCPYFQQLDALYKQKSKKVI+NP+NPNYELKPEELLMHMMG QEE+HQPESATDDGEAENAD QNQEDEGE+ E+EDEDY+IVAN+N+N   QM+V
Subjt:  DSKTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGGQEESHQPESATDDGEAENAD-QNQEDEGED-EEEDEDYQIVANNNSN---QMEV

A0A6J1DZ47 trihelix transcription factor GTL1-like5.4e-23690.58Show/hide
Query:  MLEISPSPENSSAAATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYK
        MLE S SPENS+    AAAEED A  S GLAEEADRSWAGNRWPREET+ALLKVRSSMDTAFRDASLKAPLWEEVSR LAELGYNRSAKKCKEKFENIYK
Subjt:  MLEISPSPENSSAAATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYK

Query:  YHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKRKFLEF
        YHKRTKDSRSGK NGKNYRYFEQLEALDNHPLLPSQA SMEE+PRIIPNNIVHNAIPCSVVNPGSNFV+TTTTSISTS TSCSSK+SGGT KKKRKF+EF
Subjt:  YHKRTKDSRSGKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKRKFLEF

Query:  FERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGN
        FERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLN ERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILME+ M+KQDDGN
Subjt:  FERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGN

Query:  ADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTC
         DRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKK GYDR+AKRCKEKWENINKYFKRVKESNKKRPEDSKTC
Subjt:  ADRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTC

Query:  PYFQQLDALYKQKSKKVI----DNPSNPNYELKPEELLMHMMGGQEESH-QPESATDDGEAENADQNQEDEGEDEE---EDEDYQIVANNNSNQMEVAS
        PYFQQLDALYK+KSKKV     +N +NPNYELKPEELLMHMMGGQEE H QPESATDDGE ENADQNQEDE E++E   EDEDYQIVANNNSNQM VAS
Subjt:  PYFQQLDALYKQKSKKVI----DNPSNPNYELKPEELLMHMMGGQEESH-QPESATDDGEAENADQNQEDEGEDEE---EDEDYQIVANNNSNQMEVAS

SwissProt top hitse value%identityAlignment
Q39117 Trihelix transcription factor GT-22.1e-8841.27Show/hide
Query:  ENSSAAATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDS
        E+S      + EE+         E A     GNRWPR ET+ALL++RS MD AFRD++LKAPLWEE+SRK+ ELGY RS+KKCKEKFEN+YKYHKRTK+ 
Subjt:  ENSSAAATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDS

Query:  RSGKPNGKNYRYFEQLEALD------------------------------------------------------------NHPLLPSQAGSMEEIPRIIP
        R+GK  GK YR+FE+LEA +                                                            N   L  Q  S    P    
Subjt:  RSGKPNGKNYRYFEQLEALD------------------------------------------------------------NHPLLPSQAGSMEEIPRIIP

Query:  NNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSG-----GTRKKKRKFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELA
        NN    + P    +  +N       S STS ++ S ++        +RKK++ +   F +L  E++EKQEK+QK+F+E LE  E+ER++REE W++QE+ 
Subjt:  NNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSG-----GTRKKKRKFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELA

Query:  RIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPES---------------SILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEE
        RI +E E L  ERS AAAKDAA++SFL   S   G  Q P+                SI  E+   K+        T    N +N +S   SSSRWPK E
Subjt:  RIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPES---------------SILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEE

Query:  IDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVIDNPSNPNYEL
        ++ALI++R NL+  YQENG KGPLWEEIS  M++ GY+RSAKRCKEKWENINKYFK+VKESNKKRP DSKTCPYF QL+ALY +++K        P    
Subjt:  IDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVIDNPSNPNYEL

Query:  KPEELLMHMMGGQE-ESHQPESATD-----DGEAENADQNQEDEGEDEEEDEDYQIVANNNSNQMEV
           +LL+      E E+ Q E   D     +GE+E  + ++E+EGE + E  +++IV N  S+ M++
Subjt:  KPEELLMHMMGGQE-ESHQPESATD-----DGEAENADQNQEDEGEDEEEDEDYQIVANNNSNQMEV

Q8H181 Trihelix transcription factor GTL26.5e-3729.12Show/hide
Query:  EDGAAPSAGLAEEADRSWAGNR--------WPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGK
        +DG A +     + D   + N         W  +E +ALL+ RS+++  F + +     WE  SRKLAE+G+ RS ++CKEKFE   + +  + ++ +  
Subjt:  EDGAAPSAGLAEEADRSWAGNR--------WPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGK

Query:  PN-----------GKNYRYFEQLEALDNH-------------------PLLPSQAGSMEEIPRIIPNNIVHN-----AIPCSVVNPGSNF-------VET
         N           G NYR F ++E   +H                    L+  +    E +  ++  + + +         S+ N  ++        VE 
Subjt:  PN-----------GKNYRYFEQLEALDNH-------------------PLLPSQAGSMEEIPRIIPNNIVHN-----AIPCSVVNPGSNF-------VET

Query:  TTTSISTSMTSCSSKDSGGTRKKKRK-----FLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAA
           S S+S      K+    ++KK K        F E L+  +I +QE++ KK +E + K E+E++AREE WK QE+ R+ KE E   QE+++A+ ++  
Subjt:  TTTSISTSMTSCSSKDSGGTRKKKRK-----FLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAA

Query:  VLSFLKMFSEQ-----------------------VGAVQFPESSILMENLMEKQDDGNADRN-------TSNQENSN----NGNSNQISSSRWPKEEIDA
        ++ F+  F++                         G  +F  SS L+   +   +    D++       T   +N N      +       RWPK+E+ A
Subjt:  VLSFLKMFSEQ-----------------------VGAVQFPESSILMENLMEKQDDGNADRN-------TSNQENSN----NGNSNQISSSRWPKEEIDA

Query:  LIQLRTNL----------QMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQ
        LI +R ++          +     +    PLWE IS  M + GY RSAKRCKEKWENINKYF++ K+ NKKRP DS+TCPYF QL ALY Q
Subjt:  LIQLRTNL----------QMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQ

Q9C6K3 Trihelix transcription factor DF13.7e-10145.01Show/hide
Query:  SPENSSAAATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTK
        S  N SAA  AAA   GA   +   E  DR + GNRWPR+ET+ALLK+RS M  AFRDAS+K PLWEEVSRK+AE GY R+AKKCKEKFEN+YKYHKRTK
Subjt:  SPENSSAAATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTK

Query:  DSRSGKPNGKNYRYFEQLEALDNH------------PLLPSQAGSMEE--------IPRIIPNNIVHNAIPCSVVNP-------------GSNFVETTTT
        + R+GK  GK YR+F+QLEAL++             PL P Q  +                P   V   +P S + P               +F+   +T
Subjt:  DSRSGKPNGKNYRYFEQLEALDNH------------PLLPSQAGSMEE--------IPRIIPNNIVHNAIPCSVVNP-------------GSNFVETTTT

Query:  SISTSMTSCSSKDSGG----TRKK-KRKFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLS
        S S+S ++ S  + GG    TRKK KRK+  FFERLM +V++KQE+LQ+KF+EA+EK E ERL REE W++QE+ARI +E E L QERS++AAKDAAV++
Subjt:  SISTSMTSCSSKDSGG----TRKK-KRKFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLS

Query:  FLKMFSEQVGAVQFPE-------SSILMENLMEKQDDGNA---------------------DRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQ
        FL+  SE+      P+        S+ + N  ++Q    +                        T N  + N   +   SSSRWPK EI+ALI+LRTNL 
Subjt:  FLKMFSEQVGAVQFPE-------SSILMENLMEKQDDGNA---------------------DRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQ

Query:  MKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVIDN----PSNPNYELKPEELLMH
         KYQENGPKGPLWEEIS  M++ G++R++KRCKEKWENINKYFK+VKESNKKRPEDSKTCPYF QLDALY++++K   +N     S+ +  +KP+  +  
Subjt:  MKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVIDN----PSNPNYELKPEELLMH

Query:  MMGGQ--------------------EESHQPESATDDGEAENADQNQEDEGEDEEEDE--DYQIVANNNSN
        M+  +                    ++S   E   DD E  + + + EDE E+ EE+E  ++++V +NN+N
Subjt:  MMGGQ--------------------EESHQPESATDDGEAENADQNQEDEGEDEEEDE--DYQIVANNNSN

Q9C882 Trihelix transcription factor GTL12.6e-7838.52Show/hide
Query:  TAAAEEDG-AAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPN
        +AAA++ G      G    +  S +GNRWPREET+ALL++RS MD+ FRDA+LKAPLWE VSRKL ELGY RS+KKCKEKFEN+ KY+KRTK++R G+ +
Subjt:  TAAAEEDG-AAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPN

Query:  GKNYRYFEQLEALDNHP----------------LLPSQAGS------------MEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKD
        GK Y++F QLEAL+  P                L+PS + S              + P+    +      P  + + G  F   T +S S+S  S    D
Subjt:  GKNYRYFEQLEALDNHP----------------LLPSQAGS------------MEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKD

Query:  --------------SGGTRKKKR-------KFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAA
                         +RK+KR       K +E FE L+ +V++KQ  +Q+ F+EALEK EQERL REE WK QE+AR+ +E E ++QER+ +A++DAA
Subjt:  --------------SGGTRKKKR-------KFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAA

Query:  VLSFLKMFS---------------------------------------EQVGAVQFPESSIL----MENLMEKQDDGNADRNTSNQENSNNGNSNQISSS
        ++S ++  +                                        Q   +  P+  IL      +    Q +    +    +   ++  S+  SSS
Subjt:  VLSFLKMFS---------------------------------------EQVGAVQFPESSIL----MENLMEKQDDGNADRNTSNQENSNNGNSNQISSS

Query:  RWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQK---------
        RWPK EI ALI LR+ ++ +YQ+N PKG LWEEIS +MK+ GY+R+AKRCKEKWENINKY+K+VKESNKKRP+D+KTCPYF +LD LY+ K         
Subjt:  RWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQK---------

Query:  -SKKVIDNPSNPNYELK-PEELLMHMMGGQEESHQPESATDDGEAENADQNQE
         S    D   +P   +K P+E L+++    +++H   S  ++   E + Q  E
Subjt:  -SKKVIDNPSNPNYELK-PEELLMHMMGGQEESHQPESATDDGEAENADQNQE

Q9LZS0 Trihelix transcription factor PTL1.1e-4435.58Show/hide
Query:  RWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLA-ELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLEAL--DNHPLLPSQAG
        RWPR+ET+ LL++RS +D  F++A+ K PLW+EVSR ++ E GY RS KKC+EKFEN+YKY+++TK+ ++G+ +GK+YR+F QLEAL  D++ L+     
Subjt:  RWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLA-ELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLEAL--DNHPLLPSQAG

Query:  SMEEIPRIIPNNIVHNAIPCSVVNPGSNF-----VETTTTSISTS----------MTSCSSKDSGGTRKKKR----KFLEFFERLMNEVIEKQEKLQKKF
        + + +   +     H   P +V    SN      V     S+S S          MTS S  +   +R+KKR    K  EF +  M  +IE+Q+   +K 
Subjt:  SMEEIPRIIPNNIVHNAIPCSVVNPGSNF-----VETTTTSISTS----------MTSCSSKDSGGTRKKKR----KFLEFFERLMNEVIEKQEKLQKKF

Query:  VEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGN---S
         + +E  E++R+ +EEEW+  E ARI KE     +ER+   A+D AV+  L+  + +      P    L  +  E+ +  N  RN S  +N N  +   +
Subjt:  VEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGN---S

Query:  NQI----SSSRWPKEEIDALIQLRTNLQMKYQE--NGPKGP-LWEEISLAMKKHGYD-RSAKRCKEKWENI-NKYFKRVKESNKKRPEDSKTCPYF---Q
        N +    SSS W ++EI  L+++RT++   +QE   G     LWEEI+  + + G+D RSA  CKEKWE I N   K  K+ NKKR ++S +C  +    
Subjt:  NQI----SSSRWPKEEIDALIQLRTNLQMKYQE--NGPKGP-LWEEISLAMKKHGYD-RSAKRCKEKWENI-NKYFKRVKESNKKRPEDSKTCPYF---Q

Query:  QLDALYKQKSKKVIDN
        + + +Y  +     DN
Subjt:  QLDALYKQKSKKVIDN

Arabidopsis top hitse value%identityAlignment
AT1G33240.1 GT-2-like 11.1e-7637.35Show/hide
Query:  TAAAEEDG-AAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPN
        +AAA++ G      G    +  S +GNRWPREET+ALL++RS MD+ FRDA+LKAPLWE VSRKL ELGY RS+KKCKEKFEN+ KY+KRTK++R G+ +
Subjt:  TAAAEEDG-AAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPN

Query:  GKNYRYFEQLEALDNHP----------------LLPSQAGS------------MEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKD
        GK Y++F QLEAL+  P                L+PS + S              + P+    +      P  + + G  F   T +S S+S  S    D
Subjt:  GKNYRYFEQLEALDNHP----------------LLPSQAGS------------MEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKD

Query:  --------------SGGTRKKKR-------KFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAA
                         +RK+KR       K +E FE L+ +V++KQ  +Q+ F+EALEK EQERL REE WK QE+AR+ +E E ++QER+ +A++DAA
Subjt:  --------------SGGTRKKKR-------KFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAA

Query:  VLSFLKMFS---------------------------------------EQVGAVQFPESSIL----MENLMEKQDDGNADRNTSNQENSNNGNSNQISSS
        ++S ++  +                                        Q   +  P+  IL      +    Q +    +    +   ++  S+  SSS
Subjt:  VLSFLKMFS---------------------------------------EQVGAVQFPESSIL----MENLMEKQDDGNADRNTSNQENSNNGNSNQISSS

Query:  RWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQK---------
        RWPK EI ALI LR+ ++ +YQ+N PKG LWEEIS +MK+ GY+R+AKRCKEKWENINKY+K+VKESNKKRP+D+KTCPYF +LD LY+ K         
Subjt:  RWPKEEIDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQK---------

Query:  -SKKVIDNPSNPNYELK-PEELLMHMM----GGQEESHQPESATDDGEAENAD--------QNQEDEGEDEEEDEDYQIVANNNSNQME
         S    D   +P   +K P+E L+++         E  +P   +  G  +  D        Q Q+ + ++    E  +I  ++N N ME
Subjt:  -SKKVIDNPSNPNYELK-PEELLMHMM----GGQEESHQPESATDDGEAENAD--------QNQEDEGEDEEEDEDYQIVANNNSNQME

AT1G76880.1 Duplicated homeodomain-like superfamily protein2.6e-10245.01Show/hide
Query:  SPENSSAAATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTK
        S  N SAA  AAA   GA   +   E  DR + GNRWPR+ET+ALLK+RS M  AFRDAS+K PLWEEVSRK+AE GY R+AKKCKEKFEN+YKYHKRTK
Subjt:  SPENSSAAATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTK

Query:  DSRSGKPNGKNYRYFEQLEALDNH------------PLLPSQAGSMEE--------IPRIIPNNIVHNAIPCSVVNP-------------GSNFVETTTT
        + R+GK  GK YR+F+QLEAL++             PL P Q  +                P   V   +P S + P               +F+   +T
Subjt:  DSRSGKPNGKNYRYFEQLEALDNH------------PLLPSQAGSMEE--------IPRIIPNNIVHNAIPCSVVNP-------------GSNFVETTTT

Query:  SISTSMTSCSSKDSGG----TRKK-KRKFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLS
        S S+S ++ S  + GG    TRKK KRK+  FFERLM +V++KQE+LQ+KF+EA+EK E ERL REE W++QE+ARI +E E L QERS++AAKDAAV++
Subjt:  SISTSMTSCSSKDSGG----TRKK-KRKFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLS

Query:  FLKMFSEQVGAVQFPE-------SSILMENLMEKQDDGNA---------------------DRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQ
        FL+  SE+      P+        S+ + N  ++Q    +                        T N  + N   +   SSSRWPK EI+ALI+LRTNL 
Subjt:  FLKMFSEQVGAVQFPE-------SSILMENLMEKQDDGNA---------------------DRNTSNQENSNNGNSNQISSSRWPKEEIDALIQLRTNLQ

Query:  MKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVIDN----PSNPNYELKPEELLMH
         KYQENGPKGPLWEEIS  M++ G++R++KRCKEKWENINKYFK+VKESNKKRPEDSKTCPYF QLDALY++++K   +N     S+ +  +KP+  +  
Subjt:  MKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVIDN----PSNPNYELKPEELLMH

Query:  MMGGQ--------------------EESHQPESATDDGEAENADQNQEDEGEDEEEDE--DYQIVANNNSN
        M+  +                    ++S   E   DD E  + + + EDE E+ EE+E  ++++V +NN+N
Subjt:  MMGGQ--------------------EESHQPESATDDGEAENADQNQEDEGEDEEEDE--DYQIVANNNSN

AT1G76890.2 Duplicated homeodomain-like superfamily protein1.5e-8941.27Show/hide
Query:  ENSSAAATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDS
        E+S      + EE+         E A     GNRWPR ET+ALL++RS MD AFRD++LKAPLWEE+SRK+ ELGY RS+KKCKEKFEN+YKYHKRTK+ 
Subjt:  ENSSAAATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDS

Query:  RSGKPNGKNYRYFEQLEALD------------------------------------------------------------NHPLLPSQAGSMEEIPRIIP
        R+GK  GK YR+FE+LEA +                                                            N   L  Q  S    P    
Subjt:  RSGKPNGKNYRYFEQLEALD------------------------------------------------------------NHPLLPSQAGSMEEIPRIIP

Query:  NNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSG-----GTRKKKRKFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELA
        NN    + P    +  +N       S STS ++ S ++        +RKK++ +   F +L  E++EKQEK+QK+F+E LE  E+ER++REE W++QE+ 
Subjt:  NNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSG-----GTRKKKRKFLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELA

Query:  RIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPES---------------SILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEE
        RI +E E L  ERS AAAKDAA++SFL   S   G  Q P+                SI  E+   K+        T    N +N +S   SSSRWPK E
Subjt:  RIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPES---------------SILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEE

Query:  IDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVIDNPSNPNYEL
        ++ALI++R NL+  YQENG KGPLWEEIS  M++ GY+RSAKRCKEKWENINKYFK+VKESNKKRP DSKTCPYF QL+ALY +++K        P    
Subjt:  IDALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVIDNPSNPNYEL

Query:  KPEELLMHMMGGQE-ESHQPESATD-----DGEAENADQNQEDEGEDEEEDEDYQIVANNNSNQMEV
           +LL+      E E+ Q E   D     +GE+E  + ++E+EGE + E  +++IV N  S+ M++
Subjt:  KPEELLMHMMGGQE-ESHQPESATD-----DGEAENADQNQEDEGEDEEEDEDYQIVANNNSNQMEV

AT5G03680.1 Duplicated homeodomain-like superfamily protein7.8e-4635.58Show/hide
Query:  RWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLA-ELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLEAL--DNHPLLPSQAG
        RWPR+ET+ LL++RS +D  F++A+ K PLW+EVSR ++ E GY RS KKC+EKFEN+YKY+++TK+ ++G+ +GK+YR+F QLEAL  D++ L+     
Subjt:  RWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLA-ELGYNRSAKKCKEKFENIYKYHKRTKDSRSGKPNGKNYRYFEQLEAL--DNHPLLPSQAG

Query:  SMEEIPRIIPNNIVHNAIPCSVVNPGSNF-----VETTTTSISTS----------MTSCSSKDSGGTRKKKR----KFLEFFERLMNEVIEKQEKLQKKF
        + + +   +     H   P +V    SN      V     S+S S          MTS S  +   +R+KKR    K  EF +  M  +IE+Q+   +K 
Subjt:  SMEEIPRIIPNNIVHNAIPCSVVNPGSNF-----VETTTTSISTS----------MTSCSSKDSGGTRKKKR----KFLEFFERLMNEVIEKQEKLQKKF

Query:  VEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGN---S
         + +E  E++R+ +EEEW+  E ARI KE     +ER+   A+D AV+  L+  + +      P    L  +  E+ +  N  RN S  +N N  +   +
Subjt:  VEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGN---S

Query:  NQI----SSSRWPKEEIDALIQLRTNLQMKYQE--NGPKGP-LWEEISLAMKKHGYD-RSAKRCKEKWENI-NKYFKRVKESNKKRPEDSKTCPYF---Q
        N +    SSS W ++EI  L+++RT++   +QE   G     LWEEI+  + + G+D RSA  CKEKWE I N   K  K+ NKKR ++S +C  +    
Subjt:  NQI----SSSRWPKEEIDALIQLRTNLQMKYQE--NGPKGP-LWEEISLAMKKHGYD-RSAKRCKEKWENI-NKYFKRVKESNKKRPEDSKTCPYF---Q

Query:  QLDALYKQKSKKVIDN
        + + +Y  +     DN
Subjt:  QLDALYKQKSKKVIDN

AT5G28300.1 Duplicated homeodomain-like superfamily protein4.6e-3829.12Show/hide
Query:  EDGAAPSAGLAEEADRSWAGNR--------WPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGK
        +DG A +     + D   + N         W  +E +ALL+ RS+++  F + +     WE  SRKLAE+G+ RS ++CKEKFE   + +  + ++ +  
Subjt:  EDGAAPSAGLAEEADRSWAGNR--------WPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRSGK

Query:  PN-----------GKNYRYFEQLEALDNH-------------------PLLPSQAGSMEEIPRIIPNNIVHN-----AIPCSVVNPGSNF-------VET
         N           G NYR F ++E   +H                    L+  +    E +  ++  + + +         S+ N  ++        VE 
Subjt:  PN-----------GKNYRYFEQLEALDNH-------------------PLLPSQAGSMEEIPRIIPNNIVHN-----AIPCSVVNPGSNF-------VET

Query:  TTTSISTSMTSCSSKDSGGTRKKKRK-----FLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAA
           S S+S      K+    ++KK K        F E L+  +I +QE++ KK +E + K E+E++AREE WK QE+ R+ KE E   QE+++A+ ++  
Subjt:  TTTSISTSMTSCSSKDSGGTRKKKRK-----FLEFFERLMNEVIEKQEKLQKKFVEALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAA

Query:  VLSFLKMFSEQ-----------------------VGAVQFPESSILMENLMEKQDDGNADRN-------TSNQENSN----NGNSNQISSSRWPKEEIDA
        ++ F+  F++                         G  +F  SS L+   +   +    D++       T   +N N      +       RWPK+E+ A
Subjt:  VLSFLKMFSEQ-----------------------VGAVQFPESSILMENLMEKQDDGNADRN-------TSNQENSN----NGNSNQISSSRWPKEEIDA

Query:  LIQLRTNL----------QMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQ
        LI +R ++          +     +    PLWE IS  M + GY RSAKRCKEKWENINKYF++ K+ NKKRP DS+TCPYF QL ALY Q
Subjt:  LIQLRTNL----------QMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTGGAAATTTCTCCTTCACCGGAAAACTCTTCCGCCGCCGCCACCGCCGCCGCCGAGGAGGACGGTGCGGCGCCCTCCGCCGGACTTGCGGAGGAGGCTGACCGGAG
CTGGGCCGGTAACCGGTGGCCGCGAGAGGAGACCATCGCTTTGCTTAAGGTGAGGTCGAGTATGGACACTGCGTTCAGAGATGCAAGCCTTAAAGCTCCTCTATGGGAGG
AAGTTTCCAGGAAATTGGCTGAGCTTGGGTATAATCGAAGTGCGAAAAAATGCAAAGAGAAATTTGAGAACATTTATAAGTATCACAAAAGGACTAAAGATAGCAGATCG
GGCAAACCCAATGGGAAAAATTATAGGTATTTTGAGCAATTAGAAGCTTTAGATAATCATCCATTGCTACCTTCTCAAGCTGGTTCAATGGAAGAAATCCCAAGGATTAT
CCCAAACAATATTGTTCACAATGCAATTCCATGTTCCGTAGTGAACCCGGGTTCGAATTTTGTTGAAACTACCACCACATCGATATCGACTTCGATGACGTCTTGCTCGA
GCAAAGACTCGGGTGGGACGAGGAAGAAGAAGAGGAAGTTTCTGGAGTTCTTTGAGAGGTTAATGAATGAGGTGATTGAGAAGCAGGAGAAATTGCAAAAGAAGTTTGTC
GAGGCATTGGAGAAATGTGAACAAGAAAGGTTAGCTAGAGAAGAAGAATGGAAGATGCAAGAATTAGCTCGAATCAAGAAAGAGCGAGAGCGTTTAAATCAAGAGAGATC
AATTGCGGCTGCAAAGGATGCAGCTGTTCTTTCATTCTTGAAGATGTTCTCTGAACAGGTGGGCGCAGTGCAGTTTCCCGAGAGCTCGATTTTGATGGAGAATTTAATGG
AGAAACAAGATGATGGCAATGCCGACAGAAATACAAGCAATCAAGAGAATAGCAACAATGGAAATTCGAATCAAATTAGCTCATCCCGGTGGCCGAAAGAAGAGATCGAT
GCTCTGATTCAGCTTAGGACTAATCTGCAGATGAAGTACCAAGAAAATGGGCCTAAAGGTCCTCTCTGGGAGGAAATATCACTGGCCATGAAGAAACACGGGTATGATAG
AAGTGCAAAGAGGTGTAAAGAGAAATGGGAGAACATCAACAAATACTTCAAGAGAGTAAAGGAAAGCAACAAAAAACGACCCGAGGATTCAAAGACATGCCCTTACTTCC
AGCAGCTCGACGCATTGTACAAACAGAAATCCAAGAAAGTCATTGACAATCCATCTAATCCAAATTACGAACTAAAACCCGAGGAACTATTGATGCACATGATGGGCGGG
CAAGAGGAAAGCCACCAACCCGAATCAGCAACAGACGACGGGGAAGCTGAGAATGCAGATCAGAATCAAGAAGACGAAGGCGAAGACGAGGAGGAGGACGAAGATTATCA
AATTGTAGCCAACAACAACAGTAATCAAATGGAAGTAGCAAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTGGAAATTTCTCCTTCACCGGAAAACTCTTCCGCCGCCGCCACCGCCGCCGCCGAGGAGGACGGTGCGGCGCCCTCCGCCGGACTTGCGGAGGAGGCTGACCGGAG
CTGGGCCGGTAACCGGTGGCCGCGAGAGGAGACCATCGCTTTGCTTAAGGTGAGGTCGAGTATGGACACTGCGTTCAGAGATGCAAGCCTTAAAGCTCCTCTATGGGAGG
AAGTTTCCAGGAAATTGGCTGAGCTTGGGTATAATCGAAGTGCGAAAAAATGCAAAGAGAAATTTGAGAACATTTATAAGTATCACAAAAGGACTAAAGATAGCAGATCG
GGCAAACCCAATGGGAAAAATTATAGGTATTTTGAGCAATTAGAAGCTTTAGATAATCATCCATTGCTACCTTCTCAAGCTGGTTCAATGGAAGAAATCCCAAGGATTAT
CCCAAACAATATTGTTCACAATGCAATTCCATGTTCCGTAGTGAACCCGGGTTCGAATTTTGTTGAAACTACCACCACATCGATATCGACTTCGATGACGTCTTGCTCGA
GCAAAGACTCGGGTGGGACGAGGAAGAAGAAGAGGAAGTTTCTGGAGTTCTTTGAGAGGTTAATGAATGAGGTGATTGAGAAGCAGGAGAAATTGCAAAAGAAGTTTGTC
GAGGCATTGGAGAAATGTGAACAAGAAAGGTTAGCTAGAGAAGAAGAATGGAAGATGCAAGAATTAGCTCGAATCAAGAAAGAGCGAGAGCGTTTAAATCAAGAGAGATC
AATTGCGGCTGCAAAGGATGCAGCTGTTCTTTCATTCTTGAAGATGTTCTCTGAACAGGTGGGCGCAGTGCAGTTTCCCGAGAGCTCGATTTTGATGGAGAATTTAATGG
AGAAACAAGATGATGGCAATGCCGACAGAAATACAAGCAATCAAGAGAATAGCAACAATGGAAATTCGAATCAAATTAGCTCATCCCGGTGGCCGAAAGAAGAGATCGAT
GCTCTGATTCAGCTTAGGACTAATCTGCAGATGAAGTACCAAGAAAATGGGCCTAAAGGTCCTCTCTGGGAGGAAATATCACTGGCCATGAAGAAACACGGGTATGATAG
AAGTGCAAAGAGGTGTAAAGAGAAATGGGAGAACATCAACAAATACTTCAAGAGAGTAAAGGAAAGCAACAAAAAACGACCCGAGGATTCAAAGACATGCCCTTACTTCC
AGCAGCTCGACGCATTGTACAAACAGAAATCCAAGAAAGTCATTGACAATCCATCTAATCCAAATTACGAACTAAAACCCGAGGAACTATTGATGCACATGATGGGCGGG
CAAGAGGAAAGCCACCAACCCGAATCAGCAACAGACGACGGGGAAGCTGAGAATGCAGATCAGAATCAAGAAGACGAAGGCGAAGACGAGGAGGAGGACGAAGATTATCA
AATTGTAGCCAACAACAACAGTAATCAAATGGAAGTAGCAAGCTGA
Protein sequenceShow/hide protein sequence
MLEISPSPENSSAAATAAAEEDGAAPSAGLAEEADRSWAGNRWPREETIALLKVRSSMDTAFRDASLKAPLWEEVSRKLAELGYNRSAKKCKEKFENIYKYHKRTKDSRS
GKPNGKNYRYFEQLEALDNHPLLPSQAGSMEEIPRIIPNNIVHNAIPCSVVNPGSNFVETTTTSISTSMTSCSSKDSGGTRKKKRKFLEFFERLMNEVIEKQEKLQKKFV
EALEKCEQERLAREEEWKMQELARIKKERERLNQERSIAAAKDAAVLSFLKMFSEQVGAVQFPESSILMENLMEKQDDGNADRNTSNQENSNNGNSNQISSSRWPKEEID
ALIQLRTNLQMKYQENGPKGPLWEEISLAMKKHGYDRSAKRCKEKWENINKYFKRVKESNKKRPEDSKTCPYFQQLDALYKQKSKKVIDNPSNPNYELKPEELLMHMMGG
QEESHQPESATDDGEAENADQNQEDEGEDEEEDEDYQIVANNNSNQMEVAS