; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc06G17430 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc06G17430
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionTranscription factor bhlh
Genome locationClcChr06:28034472..28043023
RNA-Seq ExpressionClc06G17430
SyntenyClc06G17430
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0010106 - cellular response to iron ion starvation (biological process)
GO:0005634 - nucleus (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB49971.1 hypothetical protein L484_005308 [Morus notabilis]3.6e-17859.8Show/hide
Query:  AHSTTVALIWFTSAVLFFFLFKMALHN--------SSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA
        + S T+ LIW  SA +F+ LF+MAL N        SSS+SSS SSVSN E RSKLYD M +DLD+ GA FL  GETSQSLSL DIF +KDGSVTP   AA
Subjt:  AHSTTVALIWFTSAVLFFFLFKMALHN--------SSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA

Query:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW
        NPPVRANVL+L T+YSVPI EAVK IF+PYFDK+IWFQNSSLYHFSMFHASHHITP+PAT+ EIEAEAAAV +  + +C + IVLDRV+LTSTGVLLGCW
Subjt:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW

Query:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM
        QV+SGTDP+T+RA+LR ALPHAPEKQLYDAAILHTSFARLLG PK S       D ++ FHELV RLN +IRGF AVVSELWYVEEYDVLALALNGRMK 
Subjt:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM

Query:  RWQRDIGGFQVIGQESKMRGKERKRQNGSVSPFSHPHSFCLQPFPASLFLEVWHFTRFHLLLKIMVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPK
               G + IG                                                 K M SE     V + +  +  S  R+CPG KNQ KVPK
Subjt:  RWQRDIGGFQVIGQESKMRGKERKRQNGSVSPFSHPHSFCLQPFPASLFLEVWHFTRFHLLLKIMVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPK

Query:  KIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELREETSSLASQIEKLQRELQSRAL
        +IHKAEREK KRE LN+LF +L+N+L+L +PNNGKAS+LCEA+RLLKDL  QIECLRKE+ SLLSES YV +EKNELREE S+L +QI KLQ E+++R +
Subjt:  KIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELREETSSLASQIEKLQRELQSRAL

Query:  HSKPDLNIPPPSEFLQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPDLPSYPAMDAAQSPTMPTSHVSKPHARYPTPADSWPAGLLK
         SKPDLN PPP               +CL +P  EPTL Q H V ++P   DL +YP  DAAQ  + PTS+VSKPHARYPT  DSWPA LL+
Subjt:  HSKPDLNIPPPSEFLQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPDLPSYPAMDAAQSPTMPTSHVSKPHARYPTPADSWPAGLLK

OMO91912.1 TB2/DP1/HVA22-related protein [Corchorus capsularis]3.3e-14755.35Show/hide
Query:  MALHNSSSAS----SSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AANPPVRANVLYLSTEYSVPIFEAVKSI
        M L N+SS+S    SSDS +SNS+ RS LYDKME+DLDE+GA FL+ GETSQSLSLSD+FT+KDGSVTP   AANPPVRANVLY+STEYSVPI EAVK +
Subjt:  MALHNSSSAS----SSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AANPPVRANVLYLSTEYSVPIFEAVKSI

Query:  FDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCWQVISGTDPVTIRAKLRAALPHAPEKQ
        F+P FDKAIWFQNSSLYHFSMFHASHHITPIPA++ EIEAEAAA++S T  +C L+IVLDRV+LTSTGVLLGCWQVISGTDPV+IRAKLR+ALP APEKQ
Subjt:  FDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCWQVISGTDPVTIRAKLRAALPHAPEKQ

Query:  LYDAAILHTSFARLLGHPKVS-ETLHRSFDELQLFHEL-------------VARLNKQI-------RGFEAVVSELWYVEEYDVLALALNGRMKMRWQRD
        LYDAAILHTSFARLLG PK S    H + ++++LFH+L             VA  +  +           A+VSELWYVEEYDVLALALNGRMK+  +  
Subjt:  LYDAAILHTSFARLLGHPKVS-ETLHRSFDELQLFHEL-------------VARLNKQI-------RGFEAVVSELWYVEEYDVLALALNGRMKMRWQRD

Query:  IGGFQVIGQESKMRGKERKRQNGSVSPFSHPHSFCLQPFPASLFLEVWHFTRFHLLLKIMVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPKKIHKA
                                V  FS    F L  F    F+++         +K M SE P+  V +T+ P+  S           +  P  IH +
Subjt:  IGGFQVIGQESKMRGKERKRQNGSVSPFSHPHSFCLQPFPASLFLEVWHFTRFHLLLKIMVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPKKIHKA

Query:  EREKLKREHLNDLFLDL---ANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELREETSSLASQIEKLQRELQSRALHS
                HL    +D+      +E +  NNGKASILCEASRLLKDLFGQIE LRKE+ASLLSES YV+IEKNEL+EE S+L +QI+KL+ E+ ++   S
Subjt:  EREKLKREHLNDLFLDL---ANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELREETSSLASQIEKLQRELQSRALHS

Query:  KPDLNIPPPSEFLQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPDLPSYPAMDAAQSPTMPTSHVSKPHARYPTPADSWPAGLL
        KPDLN P P   +QQ     HF  +  GLP  EP LQQ  A+ +VP+  D+  YP  D+ Q    P S VSKPHARYPT ADSWP+ LL
Subjt:  KPDLNIPPPSEFLQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPDLPSYPAMDAAQSPTMPTSHVSKPHARYPTPADSWPAGLL

XP_008453197.1 PREDICTED: uncharacterized protein LOC103493987 isoform X1 [Cucumis melo]4.0e-14591.69Show/hide
Query:  MGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA
        MGRT IFSAHSTTVALIWFT AVLFFFLF+MALHNS+  SSSDSSVS SELRSKLYDKME+DLDEKGA+FLK GETSQSLSLSDIFT+KDGSVTP   AA
Subjt:  MGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA

Query:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW
        NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPA++ EIEAE AAVKSATEHMCRLKIVLDRVILTSTGVLLGCW
Subjt:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW

Query:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM
        QVISGTDPVTIRAKLR ALPHAPEKQLYDAAILHTSFARLLGHPK+S+TL RS DELQ FHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMK+
Subjt:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM

Query:  R
        R
Subjt:  R

XP_011660055.1 uncharacterized protein LOC101220816 isoform X1 [Cucumis sativus]2.1e-14691.69Show/hide
Query:  MGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA
        MGRT IFSAHSTTVALIW TSAVLFFFLF+MALHNS+  SSSDSSVSNSELRSKLYDKME+DLDEKGA+FLK GETSQSLSLSDIFT+KDG+VTP   AA
Subjt:  MGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA

Query:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW
        NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFD+AIWFQNSSLYHFSMFHASHHITPIPA++DEIEAEA+AVKSATEHMC LKIVLDRVILTSTGVLLGCW
Subjt:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW

Query:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM
        QVISGTDPVTIRAKLR ALPHAPEKQLYDAAILHTSFARLLGHPK+S+TL RS DELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMK+
Subjt:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM

Query:  R
        R
Subjt:  R

XP_038878981.1 uncharacterized protein LOC120071051 [Benincasa hispida]8.7e-14893.02Show/hide
Query:  MGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA
        MGRT+IFS HS TVALIWFTSAVLFFFLF+MALHNSS ASSSDS VSNSELRSKLYDKME+DLDEKGALFLK GETSQSLSLSDIFT+KDGSVTP   AA
Subjt:  MGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA

Query:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW
        NPPVRANVLYLSTEYSVPI EAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHI+PIPATD+EIEAEAAAVK+ATEHMCRLKIVLDRVILTSTGVLLG W
Subjt:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW

Query:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM
        QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPK SETLHRS DELQLFHELVARLNKQIRG EAVVSELWYVEEYDVLALALNGRMK+
Subjt:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM

Query:  R
        R
Subjt:  R

TrEMBL top hitse value%identityAlignment
A0A0A0LPD1 Uncharacterized protein1.0e-14691.69Show/hide
Query:  MGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA
        MGRT IFSAHSTTVALIW TSAVLFFFLF+MALHNS+  SSSDSSVSNSELRSKLYDKME+DLDEKGA+FLK GETSQSLSLSDIFT+KDG+VTP   AA
Subjt:  MGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA

Query:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW
        NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFD+AIWFQNSSLYHFSMFHASHHITPIPA++DEIEAEA+AVKSATEHMC LKIVLDRVILTSTGVLLGCW
Subjt:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW

Query:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM
        QVISGTDPVTIRAKLR ALPHAPEKQLYDAAILHTSFARLLGHPK+S+TL RS DELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMK+
Subjt:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM

Query:  R
        R
Subjt:  R

A0A1R3JAR2 TB2/DP1/HVA22-related protein1.6e-14755.35Show/hide
Query:  MALHNSSSAS----SSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AANPPVRANVLYLSTEYSVPIFEAVKSI
        M L N+SS+S    SSDS +SNS+ RS LYDKME+DLDE+GA FL+ GETSQSLSLSD+FT+KDGSVTP   AANPPVRANVLY+STEYSVPI EAVK +
Subjt:  MALHNSSSAS----SSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AANPPVRANVLYLSTEYSVPIFEAVKSI

Query:  FDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCWQVISGTDPVTIRAKLRAALPHAPEKQ
        F+P FDKAIWFQNSSLYHFSMFHASHHITPIPA++ EIEAEAAA++S T  +C L+IVLDRV+LTSTGVLLGCWQVISGTDPV+IRAKLR+ALP APEKQ
Subjt:  FDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCWQVISGTDPVTIRAKLRAALPHAPEKQ

Query:  LYDAAILHTSFARLLGHPKVS-ETLHRSFDELQLFHEL-------------VARLNKQI-------RGFEAVVSELWYVEEYDVLALALNGRMKMRWQRD
        LYDAAILHTSFARLLG PK S    H + ++++LFH+L             VA  +  +           A+VSELWYVEEYDVLALALNGRMK+  +  
Subjt:  LYDAAILHTSFARLLGHPKVS-ETLHRSFDELQLFHEL-------------VARLNKQI-------RGFEAVVSELWYVEEYDVLALALNGRMKMRWQRD

Query:  IGGFQVIGQESKMRGKERKRQNGSVSPFSHPHSFCLQPFPASLFLEVWHFTRFHLLLKIMVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPKKIHKA
                                V  FS    F L  F    F+++         +K M SE P+  V +T+ P+  S           +  P  IH +
Subjt:  IGGFQVIGQESKMRGKERKRQNGSVSPFSHPHSFCLQPFPASLFLEVWHFTRFHLLLKIMVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPKKIHKA

Query:  EREKLKREHLNDLFLDL---ANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELREETSSLASQIEKLQRELQSRALHS
                HL    +D+      +E +  NNGKASILCEASRLLKDLFGQIE LRKE+ASLLSES YV+IEKNEL+EE S+L +QI+KL+ E+ ++   S
Subjt:  EREKLKREHLNDLFLDL---ANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELREETSSLASQIEKLQRELQSRALHS

Query:  KPDLNIPPPSEFLQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPDLPSYPAMDAAQSPTMPTSHVSKPHARYPTPADSWPAGLL
        KPDLN P P   +QQ     HF  +  GLP  EP LQQ  A+ +VP+  D+  YP  D+ Q    P S VSKPHARYPT ADSWP+ LL
Subjt:  KPDLNIPPPSEFLQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPDLPSYPAMDAAQSPTMPTSHVSKPHARYPTPADSWPAGLL

A0A1S3BVM1 uncharacterized protein LOC103493987 isoform X12.0e-14591.69Show/hide
Query:  MGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA
        MGRT IFSAHSTTVALIWFT AVLFFFLF+MALHNS+  SSSDSSVS SELRSKLYDKME+DLDEKGA+FLK GETSQSLSLSDIFT+KDGSVTP   AA
Subjt:  MGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA

Query:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW
        NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPA++ EIEAE AAVKSATEHMCRLKIVLDRVILTSTGVLLGCW
Subjt:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW

Query:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM
        QVISGTDPVTIRAKLR ALPHAPEKQLYDAAILHTSFARLLGHPK+S+TL RS DELQ FHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMK+
Subjt:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM

Query:  R
        R
Subjt:  R

A0A6J1C0W4 uncharacterized protein LOC111007329 isoform X12.4e-14389.11Show/hide
Query:  MGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHN--SSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---
        M RT I SAHSTTVA +WFTSAVLFFFLF+MALHN  S S SSS S  SNSELRSKLYDKMEKDLDEKGA+FLKDGETSQSLSLSD+FT+KDGSVTP   
Subjt:  MGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHN--SSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---

Query:  AANPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLG
        AANPPVRANVLYLSTEYSVPI EAVKS+F+P+FDKAIWFQNSS+YHFSMFHASHHITP+PAT+DEIEAE AAVKSATEHMCRLKIVLDRVILTSTGVLLG
Subjt:  AANPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLG

Query:  CWQVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRM
        CWQVISGTDPVTIRAKLR ALPHAPEKQLYDAAILHTSFARLLGHPK+SE LH S DELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRM
Subjt:  CWQVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRM

Query:  KMR
        K+R
Subjt:  KMR

W9QXK2 BHLH domain-containing protein1.7e-17859.8Show/hide
Query:  AHSTTVALIWFTSAVLFFFLFKMALHN--------SSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA
        + S T+ LIW  SA +F+ LF+MAL N        SSS+SSS SSVSN E RSKLYD M +DLD+ GA FL  GETSQSLSL DIF +KDGSVTP   AA
Subjt:  AHSTTVALIWFTSAVLFFFLFKMALHN--------SSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP---AA

Query:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW
        NPPVRANVL+L T+YSVPI EAVK IF+PYFDK+IWFQNSSLYHFSMFHASHHITP+PAT+ EIEAEAAAV +  + +C + IVLDRV+LTSTGVLLGCW
Subjt:  NPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCW

Query:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM
        QV+SGTDP+T+RA+LR ALPHAPEKQLYDAAILHTSFARLLG PK S       D ++ FHELV RLN +IRGF AVVSELWYVEEYDVLALALNGRMK 
Subjt:  QVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKM

Query:  RWQRDIGGFQVIGQESKMRGKERKRQNGSVSPFSHPHSFCLQPFPASLFLEVWHFTRFHLLLKIMVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPK
               G + IG                                                 K M SE     V + +  +  S  R+CPG KNQ KVPK
Subjt:  RWQRDIGGFQVIGQESKMRGKERKRQNGSVSPFSHPHSFCLQPFPASLFLEVWHFTRFHLLLKIMVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPK

Query:  KIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELREETSSLASQIEKLQRELQSRAL
        +IHKAEREK KRE LN+LF +L+N+L+L +PNNGKAS+LCEA+RLLKDL  QIECLRKE+ SLLSES YV +EKNELREE S+L +QI KLQ E+++R +
Subjt:  KIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELREETSSLASQIEKLQRELQSRAL

Query:  HSKPDLNIPPPSEFLQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPDLPSYPAMDAAQSPTMPTSHVSKPHARYPTPADSWPAGLLK
         SKPDLN PPP               +CL +P  EPTL Q H V ++P   DL +YP  DAAQ  + PTS+VSKPHARYPT  DSWPA LL+
Subjt:  HSKPDLNIPPPSEFLQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPDLPSYPAMDAAQSPTMPTSHVSKPHARYPTPADSWPAGLLK

SwissProt top hitse value%identityAlignment
Q10KL8 Protein IRON-RELATED TRANSCRIPTION FACTOR 33.8e-2940.19Show/hide
Query:  KKNQVKVPKKIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELREETSSLASQIEKL
        KK + KVP+K+HK+EREKLKR HLNDLF +L N LE    +NGKA IL + +R+L+DL  Q++ LR+E+++L +ES YV +E+NEL++E  +L S+I  L
Subjt:  KKNQVKVPKKIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELREETSSLASQIEKL

Query:  QRELQSRALHSK--------PDLNIPP------PSEFLQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPDLPSYPAMDAAQSPTMPT-SHVSKPH
        Q EL+ RA  S           L +PP      PS+   Q + +   +   L  P+ +PT+ +  A   + ++  L + PA D   S      ++V++P 
Subjt:  QRELQSRALHSK--------PDLNIPP------PSEFLQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPDLPSYPAMDAAQSPTMPT-SHVSKPH

Query:  ARYPTPADSWPAGL
         RYPT A SWP  L
Subjt:  ARYPTPADSWPAGL

Q69V10 Transcription factor BHLH0622.7e-2738.22Show/hide
Query:  DGPVVASASRNCPGKKNQVKVPKKIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNEL
        +G +V S   N   KK   K PK+IHK+EREKLKR+  NDLF +L N LE    NNGKA +L E +R+LKDL  Q+E LRKE++SL +ES YV +E+NEL
Subjt:  DGPVVASASRNCPGKKNQVKVPKKIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNEL

Query:  REETSSLASQIEKLQRELQSRA--------LHSKPDLNIPPPSEFLQQGTTVPHFS-GECLGLPVMEPTLQQTHAVFIVPVRPDLPSYPAMDAAQSPTMP
         ++ S L ++I +LQ EL++R         ++++P L +P P+  +     +PH         P   P + + H     P    L    A      P+  
Subjt:  REETSSLASQIEKLQRELQSRA--------LHSKPDLNIPPPSEFLQQGTTVPHFS-GECLGLPVMEPTLQQTHAVFIVPVRPDLPSYPAMDAAQSPTMP

Query:  ---TSHVSKPHARYPTPADSWPAGL
           + HV++P  RYPTP  + P  L
Subjt:  ---TSHVSKPHARYPTPADSWPAGL

Q8W2F2 Transcription factor bHLH111.3e-1341.18Show/hide
Query:  ASASRNCPGKKNQVKVPKKI---HKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELRE
        +S+ +N  G +  V+V K+     KAEREKL+R+ L + FL+L NAL+   P + KAS+L +  ++LKD+  Q++ L+ E+ +L  ES+ +  EK+ELRE
Subjt:  ASASRNCPGKKNQVKVPKKI---HKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELRE

Query:  ETSSLASQIEKLQRELQSR
        E ++L S IE L  + Q R
Subjt:  ETSSLASQIEKLQRELQSR

Q9LT23 Transcription factor bHLH1217.7e-1431.84Show/hide
Query:  KKIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELREETSSLASQIEKLQRELQSRA
        +K  KA REKL+RE LN+ F++L N L+   P N KA+IL +  +LLK+L  ++  L+ E+ +L  ES+ +  EKN+LREE +SL S IE L  + Q R 
Subjt:  KKIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELREETSSLASQIEKLQRELQSRA

Query:  LHSKP--------DLNIPPPSEFLQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPDLPSY-------PAMDAAQSPTM-----PTSHVSKPHARY
            P         +  PPPS         P+       +P+  P          +P+ P +PSY       P+M  A  PT      P + V +     
Subjt:  LHSKP--------DLNIPPPSEFLQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPDLPSY-------PAMDAAQSPTM-----PTSHVSKPHARY

Query:  P-TPADSWPAGLLKLSRTTEASK
        P  P +       K+SR + + K
Subjt:  P-TPADSWPAGLLKLSRTTEASK

Q9SN74 Transcription factor bHLH473.4e-4649.59Show/hide
Query:  MVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPKKIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLL
        MVS+ PS S  E +    A+A   C     + KVPK+I+KA RE+LKREHLN+LF++LA+ LEL + N+GKASILCEA+R LKD+FGQIE LRKEHASLL
Subjt:  MVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPKKIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLL

Query:  SESQYVDIEKNELREETSSLASQIEKLQRELQSRALHSKPDLNIPPPSEF------------LQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPD
        SES YV  EKNEL+EETS L ++I KLQ E+++RA  SKPDLN  P  E+            + Q   +P F G   G      TL     V ++P++PD
Subjt:  SESQYVDIEKNELREETSSLASQIEKLQRELQSRALHSKPDLNIPPPSEF------------LQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPD

Query:  --LPSYPAMDAAQSPTM-PTSHVSKPHARYPTPADSWPAGLL
                M  AQ P M  +S+VSKP  RY + ADSW + LL
Subjt:  --LPSYPAMDAAQSPTM-PTSHVSKPHARYPTPADSWPAGLL

Arabidopsis top hitse value%identityAlignment
AT1G74530.1 unknown protein8.8e-9062.37Show/hide
Query:  ASMGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSAS--SSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP-
        A+   ++I + +  T  L W  S  +F+ LF+M + NS S S  SSDS VS +E  ++LY+KME+DL E G +FLK GETSQSLSLSD+FT+KDG + P 
Subjt:  ASMGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSAS--SSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP-

Query:  --AANPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVL
           ANPPVRANVL+LSTEYSVP+ E VK++F PYF+  IWFQ+S +YHFSMFHAS+HI  +PAT+ E+EAEAAAVK+  + +C L+I+LDRV+LTSTGVL
Subjt:  --AANPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVL

Query:  LGCWQVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFE
        LGCW+V SG DP+TIR KLR+ LP APEKQLYDAAILHTS ARLLG P +S T   S D LQL HELV RLN QIRGF+
Subjt:  LGCWQVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFE

AT1G74530.2 unknown protein1.1e-7660.71Show/hide
Query:  ASMGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSAS--SSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP-
        A+   ++I + +  T  L W  S  +F+ LF+M + NS S S  SSDS VS +E  ++LY+KME+DL E G +FLK GETSQSLSLSD+FT+KDG + P 
Subjt:  ASMGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSAS--SSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP-

Query:  --AANPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDK----AIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTS
           ANPPVRANVL+LSTEYSVP+ E VK++F PYF+     AI   +S +YHFSMFHAS+HI  +PAT+ E+EAEAAAVK+  + +C L+I+LDRV+LTS
Subjt:  --AANPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDK----AIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTS

Query:  TGVLLGCWQVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHP
        TGVLLGCW+V SG DP+TIR KLR+ LP APEKQLYDAAILHTS ARLLG P
Subjt:  TGVLLGCWQVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHP

AT1G74530.3 unknown protein2.6e-10263.61Show/hide
Query:  ASMGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSAS--SSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP-
        A+   ++I + +  T  L W  S  +F+ LF+M + NS S S  SSDS VS +E  ++LY+KME+DL E G +FLK GETSQSLSLSD+FT+KDG + P 
Subjt:  ASMGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSAS--SSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDGSVTP-

Query:  --AANPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVL
           ANPPVRANVL+LSTEYSVP+ E VK++F PYF+  IWFQ+S +YHFSMFHAS+HI  +PAT+ E+EAEAAAVK+  + +C L+I+LDRV+LTSTGVL
Subjt:  --AANPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVL

Query:  LGCWQVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNG
        LGCW+V SG DP+TIR KLR+ LP APEKQLYDAAILHTS ARLLG P +S T   S D LQL HELV RLN QIRGF+A+VSELWYVEE+D+LALAL G
Subjt:  LGCWQVISGTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNG

Query:  RMKMR
        RM +R
Subjt:  RMKMR

AT3G47640.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.4e-4749.59Show/hide
Query:  MVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPKKIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLL
        MVS+ PS S  E +    A+A   C     + KVPK+I+KA RE+LKREHLN+LF++LA+ LEL + N+GKASILCEA+R LKD+FGQIE LRKEHASLL
Subjt:  MVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPKKIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLL

Query:  SESQYVDIEKNELREETSSLASQIEKLQRELQSRALHSKPDLNIPPPSEF------------LQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPD
        SES YV  EKNEL+EETS L ++I KLQ E+++RA  SKPDLN  P  E+            + Q   +P F G   G      TL     V ++P++PD
Subjt:  SESQYVDIEKNELREETSSLASQIEKLQRELQSRALHSKPDLNIPPPSEF------------LQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPD

Query:  --LPSYPAMDAAQSPTM-PTSHVSKPHARYPTPADSWPAGLL
                M  AQ P M  +S+VSKP  RY + ADSW + LL
Subjt:  --LPSYPAMDAAQSPTM-PTSHVSKPHARYPTPADSWPAGLL

AT3G47640.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein2.4e-4749.59Show/hide
Query:  MVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPKKIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLL
        MVS+ PS S  E +    A+A   C     + KVPK+I+KA RE+LKREHLN+LF++LA+ LEL + N+GKASILCEA+R LKD+FGQIE LRKEHASLL
Subjt:  MVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPKKIHKAEREKLKREHLNDLFLDLANALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLL

Query:  SESQYVDIEKNELREETSSLASQIEKLQRELQSRALHSKPDLNIPPPSEF------------LQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPD
        SES YV  EKNEL+EETS L ++I KLQ E+++RA  SKPDLN  P  E+            + Q   +P F G   G      TL     V ++P++PD
Subjt:  SESQYVDIEKNELREETSSLASQIEKLQRELQSRALHSKPDLNIPPPSEF------------LQQGTTVPHFSGECLGLPVMEPTLQQTHAVFIVPVRPD

Query:  --LPSYPAMDAAQSPTM-PTSHVSKPHARYPTPADSWPAGLL
                M  AQ P M  +S+VSKP  RY + ADSW + LL
Subjt:  --LPSYPAMDAAQSPTM-PTSHVSKPHARYPTPADSWPAGLL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGACGGACCGACCGAGTCACAACTCACTGCCCAACTCGATTGCTCAAAAGCTTCCATGGGAAGGACTTCAATTTTCTCGGCTCACTCTACCACTGTCGCGTTGATCTG
GTTCACCTCTGCCGTCCTCTTCTTCTTTCTCTTCAAAATGGCCCTCCACAACTCTAGTTCCGCTTCTTCCTCAGATTCTTCAGTATCAAATTCTGAGTTAAGATCAAAAT
TATATGATAAGATGGAGAAGGATTTGGATGAGAAAGGGGCACTTTTCCTCAAGGACGGTGAAACATCTCAGTCTTTGTCACTTTCTGATATCTTCACTGTTAAGGATGGG
TCTGTGACCCCTGCAGCAAATCCTCCTGTTCGTGCTAATGTTTTGTATCTAAGCACAGAGTACTCGGTTCCCATATTTGAGGCTGTAAAATCTATATTTGATCCTTATTT
TGATAAAGCAATTTGGTTTCAGAACTCTAGCTTGTACCATTTTAGCATGTTCCATGCATCACATCACATTACACCGATTCCTGCTACTGATGATGAGATTGAAGCTGAAG
CAGCTGCTGTGAAGTCTGCAACAGAGCATATGTGCCGATTGAAAATAGTTTTAGATAGGGTTATTCTAACTTCAACAGGTGTTCTTCTAGGATGTTGGCAGGTAATATCA
GGGACAGATCCTGTGACCATCCGTGCTAAACTAAGGGCAGCACTCCCACATGCACCTGAGAAGCAGCTTTATGATGCTGCTATTCTTCACACATCATTTGCTCGGCTTTT
GGGTCACCCTAAAGTTTCAGAGACGCTCCATAGAAGTTTTGATGAACTCCAACTCTTTCATGAGCTGGTTGCTCGCTTGAATAAACAAATCCGTGGATTTGAGGCAGTGG
TGTCGGAGCTGTGGTATGTGGAGGAATATGATGTTTTAGCTCTGGCCTTGAATGGAAGAATGAAGATGAGGTGGCAAAGAGATATTGGAGGATTCCAAGTCATAGGGCAA
GAAAGCAAGATGAGGGGCAAAGAGAGGAAACGACAAAATGGGAGCGTTTCTCCCTTTTCTCACCCTCATTCTTTCTGTCTTCAACCTTTCCCTGCTTCTTTGTTCCTTGA
AGTCTGGCATTTCACTCGGTTTCACTTGCTGTTGAAGATCATGGTCTCTGAGGTTCCTTCGAAATCGGTTGTTGAAACTGATGGTCCAGTTGTGGCGTCGGCGTCTAGGA
ATTGTCCTGGTAAGAAGAATCAGGTGAAAGTTCCTAAGAAAATTCACAAGGCTGAGAGGGAGAAGTTGAAGCGAGAGCATTTAAACGATCTCTTCCTCGACCTTGCAAAT
GCACTTGAGCTGACAGAACCGAATAATGGAAAGGCATCTATATTGTGTGAAGCAAGTCGGCTGCTGAAAGACTTGTTTGGTCAGATTGAGTGCCTTAGAAAGGAGCATGC
ATCGTTGTTGTCAGAATCCCAATACGTTGACATCGAGAAGAATGAACTGCGAGAAGAGACTTCTTCTTTAGCATCTCAGATTGAGAAACTGCAAAGAGAGTTACAATCAA
GGGCTCTCCATTCTAAACCCGACTTAAATATTCCCCCACCTTCAGAGTTTCTGCAACAAGGGACAACGGTGCCACATTTCTCAGGGGAATGCCTTGGATTGCCTGTTATG
GAACCTACATTGCAGCAAACACATGCCGTTTTTATCGTCCCTGTGCGGCCAGATCTCCCATCTTATCCAGCAATGGATGCAGCTCAATCCCCAACAATGCCTACCTCACA
TGTAAGCAAACCACATGCCAGATATCCAACACCAGCAGATTCGTGGCCTGCTGGACTTCTCAAATTGTCACGAACAACAGAAGCAAGTAAAGAAGTTTTAGCCATTGGTT
GCAAGAGCATTGCAAATATTGGAGAAAGAGGATCTGACAGAAGTGAGAGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGGACGGACCGACCGAGTCACAACTCACTGCCCAACTCGATTGCTCAAAAGCTTCCATGGGAAGGACTTCAATTTTCTCGGCTCACTCTACCACTGTCGCGTTGATCTG
GTTCACCTCTGCCGTCCTCTTCTTCTTTCTCTTCAAAATGGCCCTCCACAACTCTAGTTCCGCTTCTTCCTCAGATTCTTCAGTATCAAATTCTGAGTTAAGATCAAAAT
TATATGATAAGATGGAGAAGGATTTGGATGAGAAAGGGGCACTTTTCCTCAAGGACGGTGAAACATCTCAGTCTTTGTCACTTTCTGATATCTTCACTGTTAAGGATGGG
TCTGTGACCCCTGCAGCAAATCCTCCTGTTCGTGCTAATGTTTTGTATCTAAGCACAGAGTACTCGGTTCCCATATTTGAGGCTGTAAAATCTATATTTGATCCTTATTT
TGATAAAGCAATTTGGTTTCAGAACTCTAGCTTGTACCATTTTAGCATGTTCCATGCATCACATCACATTACACCGATTCCTGCTACTGATGATGAGATTGAAGCTGAAG
CAGCTGCTGTGAAGTCTGCAACAGAGCATATGTGCCGATTGAAAATAGTTTTAGATAGGGTTATTCTAACTTCAACAGGTGTTCTTCTAGGATGTTGGCAGGTAATATCA
GGGACAGATCCTGTGACCATCCGTGCTAAACTAAGGGCAGCACTCCCACATGCACCTGAGAAGCAGCTTTATGATGCTGCTATTCTTCACACATCATTTGCTCGGCTTTT
GGGTCACCCTAAAGTTTCAGAGACGCTCCATAGAAGTTTTGATGAACTCCAACTCTTTCATGAGCTGGTTGCTCGCTTGAATAAACAAATCCGTGGATTTGAGGCAGTGG
TGTCGGAGCTGTGGTATGTGGAGGAATATGATGTTTTAGCTCTGGCCTTGAATGGAAGAATGAAGATGAGGTGGCAAAGAGATATTGGAGGATTCCAAGTCATAGGGCAA
GAAAGCAAGATGAGGGGCAAAGAGAGGAAACGACAAAATGGGAGCGTTTCTCCCTTTTCTCACCCTCATTCTTTCTGTCTTCAACCTTTCCCTGCTTCTTTGTTCCTTGA
AGTCTGGCATTTCACTCGGTTTCACTTGCTGTTGAAGATCATGGTCTCTGAGGTTCCTTCGAAATCGGTTGTTGAAACTGATGGTCCAGTTGTGGCGTCGGCGTCTAGGA
ATTGTCCTGGTAAGAAGAATCAGGTGAAAGTTCCTAAGAAAATTCACAAGGCTGAGAGGGAGAAGTTGAAGCGAGAGCATTTAAACGATCTCTTCCTCGACCTTGCAAAT
GCACTTGAGCTGACAGAACCGAATAATGGAAAGGCATCTATATTGTGTGAAGCAAGTCGGCTGCTGAAAGACTTGTTTGGTCAGATTGAGTGCCTTAGAAAGGAGCATGC
ATCGTTGTTGTCAGAATCCCAATACGTTGACATCGAGAAGAATGAACTGCGAGAAGAGACTTCTTCTTTAGCATCTCAGATTGAGAAACTGCAAAGAGAGTTACAATCAA
GGGCTCTCCATTCTAAACCCGACTTAAATATTCCCCCACCTTCAGAGTTTCTGCAACAAGGGACAACGGTGCCACATTTCTCAGGGGAATGCCTTGGATTGCCTGTTATG
GAACCTACATTGCAGCAAACACATGCCGTTTTTATCGTCCCTGTGCGGCCAGATCTCCCATCTTATCCAGCAATGGATGCAGCTCAATCCCCAACAATGCCTACCTCACA
TGTAAGCAAACCACATGCCAGATATCCAACACCAGCAGATTCGTGGCCTGCTGGACTTCTCAAATTGTCACGAACAACAGAAGCAAGTAAAGAAGTTTTAGCCATTGGTT
GCAAGAGCATTGCAAATATTGGAGAAAGAGGATCTGACAGAAGTGAGAGCTGA
Protein sequenceShow/hide protein sequence
MDGPTESQLTAQLDCSKASMGRTSIFSAHSTTVALIWFTSAVLFFFLFKMALHNSSSASSSDSSVSNSELRSKLYDKMEKDLDEKGALFLKDGETSQSLSLSDIFTVKDG
SVTPAANPPVRANVLYLSTEYSVPIFEAVKSIFDPYFDKAIWFQNSSLYHFSMFHASHHITPIPATDDEIEAEAAAVKSATEHMCRLKIVLDRVILTSTGVLLGCWQVIS
GTDPVTIRAKLRAALPHAPEKQLYDAAILHTSFARLLGHPKVSETLHRSFDELQLFHELVARLNKQIRGFEAVVSELWYVEEYDVLALALNGRMKMRWQRDIGGFQVIGQ
ESKMRGKERKRQNGSVSPFSHPHSFCLQPFPASLFLEVWHFTRFHLLLKIMVSEVPSKSVVETDGPVVASASRNCPGKKNQVKVPKKIHKAEREKLKREHLNDLFLDLAN
ALELTEPNNGKASILCEASRLLKDLFGQIECLRKEHASLLSESQYVDIEKNELREETSSLASQIEKLQRELQSRALHSKPDLNIPPPSEFLQQGTTVPHFSGECLGLPVM
EPTLQQTHAVFIVPVRPDLPSYPAMDAAQSPTMPTSHVSKPHARYPTPADSWPAGLLKLSRTTEASKEVLAIGCKSIANIGERGSDRSES