; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC06g2116 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC06g2116
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionheat stress transcription factor B-2b
Genome locationMC06:28631244..28634355
RNA-Seq ExpressionMC06g2116
SyntenyMC06g2116
Gene Ontology termsGO:0006012 - galactose metabolic process (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003978 - UDP-glucose 4-epimerase activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domainsIPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR027725 - Heat shock transcription factor family
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0042789.1 bifunctional UDP-glucose 4-epimerase and UDP-xylose 4-epimerase 1 [Cucumis melo var. makuwa]8.87e-19588.63Show/hide
Query:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        M+PSPAEPIG+SGTGDSQRSIPTPFLTKT+QLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQL
        R+GEKGLLRDIQRRKVALSV TT  T AA+  PV VAA+PAV AHVISPANS EEQVTSSNSSPMAFQR TSCTTTPEL+RENERLRKENMQLSHELTQL
Subjt:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQL

Query:  KGLCNNILSLMTNYASG--HQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV
        KGLCNNILSLMTNYASG  H  ES SVRDGKALEL+PA QVM EDEGAVSDG  E+RLK+EE  TAAAAAA GVTPKLFGVSIGVKR+RRE EEEEEEMV
Subjt:  KGLCNNILSLMTNYASG--HQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV

Query:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS
        GQNHVQSEEGE GSEIKAEPLDENS++P+GSAS WLELGNQGS
Subjt:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS

XP_004143930.1 heat stress transcription factor B-2b [Cucumis sativus]2.43e-19987.46Show/hide
Query:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        M+PSPAEPIG+SGTGDSQRSIPTPFLTKT+QLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQL
        R+GEKGLLRDIQRRKV LSV TT  T AA+  PVTVA +PAV AHVISPANS EEQVTSSNSSPMAFQR TSCTTTPEL+RENERLRKENMQLSHELTQL
Subjt:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQL

Query:  KGLCNNILSLMTNYASGH--QSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV
        KGLCNNILSLMTNYASG   Q ES SVRDGKALEL+PA QVM EDEGAVSDG  E+RLK+EE  TAAAAA  G+TPKLFGVSIG+KR+RRE EEEEEEMV
Subjt:  KGLCNNILSLMTNYASGH--QSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV

Query:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS
        GQNHVQSEEGE GSEIKAEPLDENS++P+GSAS WLELGNQGS
Subjt:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS

XP_008437221.1 PREDICTED: heat stress transcription factor B-2b [Cucumis melo]2.16e-19787.76Show/hide
Query:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        M+PSPAEPIG+SGTGDSQRSIPTPFLTKT+QLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQL
        R+GEKGLLRDIQRRKVALSV TT  T AA+  PV VAA+PAV AHVISPANS EEQVTSSNSSPMAFQR TSCTTTPEL+RENERLRKENMQLSHELTQL
Subjt:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQL

Query:  KGLCNNILSLMTNYASG--HQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV
        KGLCNNILSLMTNYASG  H  ES SVRDGKALEL+PA QVM EDEGAVSDG  E+RLK+ E   AAAAA  GVTPKLFGVSIGVKR+RRE EEEEEEMV
Subjt:  KGLCNNILSLMTNYASG--HQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV

Query:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS
        GQNHVQSEEGE GSEIKAEPLDENS++P+GSAS WLELGNQGS
Subjt:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS

XP_022157722.1 heat stress transcription factor B-2b [Momordica charantia]4.47e-236100Show/hide
Query:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQLK
        RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQLK
Subjt:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQLK

Query:  GLCNNILSLMTNYASGHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMVGQN
        GLCNNILSLMTNYASGHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMVGQN
Subjt:  GLCNNILSLMTNYASGHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMVGQN

Query:  HVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS
        HVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS
Subjt:  HVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS

XP_038875590.1 heat stress transcription factor B-2b [Benincasa hispida]3.00e-20288.12Show/hide
Query:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        M+PSPAEPIG+SGTGDSQRSIPTPFLTKT+QLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RRGEKGLLRDIQRRKVALSVATTPPT--PAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELT
        RRGEKGLLRDIQRRKVALS+ TT  T  PAA+  PV VAA+PAV AHVISPANSGEEQVTSSNSSPM FQRGTSCTTTPEL+RENERLRKENMQLSHELT
Subjt:  RRGEKGLLRDIQRRKVALSVATTPPT--PAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELT

Query:  QLKGLCNNILSLMTNYAS--GHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEE
        QLKGLCNNILSLMTNYAS   HQ ESVSVRDGKALEL+PA QVM EDEGAVSDG QE+RLK+EE      AAA G+TPKLFGVSIGVKR+RREE++EEEE
Subjt:  QLKGLCNNILSLMTNYAS--GHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEE

Query:  MVGQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS
        MVGQNHVQSEEGE GSEIKAEPLDENS+NPEGSAS WLELGNQGS
Subjt:  MVGQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS

TrEMBL top hitse value%identityAlignment
A0A0A0KNZ8 HSF_DOMAIN domain-containing protein1.18e-19987.46Show/hide
Query:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        M+PSPAEPIG+SGTGDSQRSIPTPFLTKT+QLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQL
        R+GEKGLLRDIQRRKV LSV TT  T AA+  PVTVA +PAV AHVISPANS EEQVTSSNSSPMAFQR TSCTTTPEL+RENERLRKENMQLSHELTQL
Subjt:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQL

Query:  KGLCNNILSLMTNYASGH--QSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV
        KGLCNNILSLMTNYASG   Q ES SVRDGKALEL+PA QVM EDEGAVSDG  E+RLK+EE  TAAAAA  G+TPKLFGVSIG+KR+RRE EEEEEEMV
Subjt:  KGLCNNILSLMTNYASGH--QSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV

Query:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS
        GQNHVQSEEGE GSEIKAEPLDENS++P+GSAS WLELGNQGS
Subjt:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS

A0A1S3AU16 heat stress transcription factor B-2b1.05e-19787.76Show/hide
Query:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        M+PSPAEPIG+SGTGDSQRSIPTPFLTKT+QLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQL
        R+GEKGLLRDIQRRKVALSV TT  T AA+  PV VAA+PAV AHVISPANS EEQVTSSNSSPMAFQR TSCTTTPEL+RENERLRKENMQLSHELTQL
Subjt:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQL

Query:  KGLCNNILSLMTNYASG--HQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV
        KGLCNNILSLMTNYASG  H  ES SVRDGKALEL+PA QVM EDEGAVSDG  E+RLK+ E   AAAAA  GVTPKLFGVSIGVKR+RRE EEEEEEMV
Subjt:  KGLCNNILSLMTNYASG--HQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV

Query:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS
        GQNHVQSEEGE GSEIKAEPLDENS++P+GSAS WLELGNQGS
Subjt:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS

A0A5A7TN46 Bifunctional UDP-glucose 4-epimerase and UDP-xylose 4-epimerase 14.30e-19588.63Show/hide
Query:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        M+PSPAEPIG+SGTGDSQRSIPTPFLTKT+QLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQL
        R+GEKGLLRDIQRRKVALSV TT  T AA+  PV VAA+PAV AHVISPANS EEQVTSSNSSPMAFQR TSCTTTPEL+RENERLRKENMQLSHELTQL
Subjt:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAV-AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQL

Query:  KGLCNNILSLMTNYASG--HQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV
        KGLCNNILSLMTNYASG  H  ES SVRDGKALEL+PA QVM EDEGAVSDG  E+RLK+EE  TAAAAAA GVTPKLFGVSIGVKR+RRE EEEEEEMV
Subjt:  KGLCNNILSLMTNYASG--HQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV

Query:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS
        GQNHVQSEEGE GSEIKAEPLDENS++P+GSAS WLELGNQGS
Subjt:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS

A0A6J1DU11 heat stress transcription factor B-2b2.16e-236100Show/hide
Query:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQLK
        RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQLK
Subjt:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQLK

Query:  GLCNNILSLMTNYASGHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMVGQN
        GLCNNILSLMTNYASGHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMVGQN
Subjt:  GLCNNILSLMTNYASGHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMVGQN

Query:  HVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS
        HVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS
Subjt:  HVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS

A0A6J1H317 heat stress transcription factor B-2b-like3.09e-19487.17Show/hide
Query:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        M+PSPAEPIG+SGTGDSQRSIPTPFLTKT+QLVDDP VDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSP-VTVAAAPAVAH-VISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQ
        +RGEK LLRDIQRRKVALSVA  P TPA++  P VTVAA+PAVA  VISP NS EEQVTSSNSSPM FQRGTSC TTPEL+RENERLRKENMQLSHELTQ
Subjt:  RRGEKGLLRDIQRRKVALSVATTPPTPAAMVSP-VTVAAAPAVAH-VISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQ

Query:  LKGLCNNILSLMTNYASGHQS-ESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV
        LKGLCNNILSLMTNYASGHQ  ESVSVRDGKAL+L+PA Q MM+DEGAVSDGIQE+RLKVEEA   A A AEG TPKLFGVSIGVKRVRREE++EE  MV
Subjt:  LKGLCNNILSLMTNYASGHQS-ESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMV

Query:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS
        G NHVQSEE E GSEIKAEPLDENS+NPEGSASQWLELGNQGS
Subjt:  GQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLELGNQGS

SwissProt top hitse value%identityAlignment
P22335 Heat shock factor protein HSF241.0e-4942.59Show/hide
Query:  SQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLRDIQRRKV
        SQR+ P PFL KT+QLVDD A DD+ISWNE G+TF+VW+ AEFA+DLLPKYFKHNNFSSFVRQLNTYGFRK+VPD+WEFAN+ F+RG+K LL  I+RRK 
Subjt:  SQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLRDIQRRKV

Query:  ALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMAFQRGT-----SCTTTPELLRENERLRKENMQLSHELTQLKGLCNNILSLMT
             T   TPA   S    A+A       SP NSG++  +SS SSP +   G+       +   +L  ENE+L+K+N  LS EL Q K  CN +++ ++
Subjt:  ALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMAFQRGT-----SCTTTPELLRENERLRKENMQLSHELTQLKGLCNNILSLMT

Query:  NYASG-----HQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMVGQNHVQSEE
         Y        ++  S     G +LE       ++++ G V D   E +    +         +G T KLFGV +  K+ +R  +E  E   G+  +    
Subjt:  NYASG-----HQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMVGQNHVQSEE

Query:  GENGSEIK-AEPLDENS
          NG  +K + P  E+S
Subjt:  GENGSEIK-AEPLDENS

Q652B0 Heat stress transcription factor B-2c1.5e-6949.86Show/hide
Query:  QRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLRDIQRRKVA
        QRS+PTPFLTKT+QLV+DPAVDD+ISWNEDGSTF+VWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRK+VPDRWEFANDCFRRGEK LL DI RRKV 
Subjt:  QRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLRDIQRRKVA

Query:  LSVATTPPTP--------AAMVS-PVTVAAAP-----AVAHVISPANSGEEQVTSSNSSPMAFQR------------GTSCTTTPELLRENERLRKENMQ
         + A  PP P        AA+ S  VTVAAAP      V    SPA+S EEQV SSNS      R            G    +  ++  ENERLR+EN +
Subjt:  LSVATTPPTP--------AAMVS-PVTVAAAP-----AVAHVISPANSGEEQVTSSNSSPMAFQR------------GTSCTTTPELLRENERLRKENMQ

Query:  LSHELTQLKGLCNNILSLMTNYASGHQSE------SVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGV------TPKLFGVS
        L+ EL  +K LCNNIL LM+ YA+    E      S++   G++ E +P           +   I +L        TAAAAA   +      + +LFGVS
Subjt:  LSHELTQLKGLCNNILSLMTNYASGHQSE------SVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGV------TPKLFGVS

Query:  IGVKRVRREEEEEEEEMVGQNHVQSEEGENGSEIKAEPLDENSDNPEGSAS
        IG+KR R +     +E  G    Q+E G  G+++K E  D +     G +S
Subjt:  IGVKRVRREEEEEEEEMVGQNHVQSEEGENGSEIKAEPLDENSDNPEGSAS

Q6Z9C8 Heat stress transcription factor B-2b8.9e-7048.03Show/hide
Query:  SPSPAEPIGES---GTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFAND
        SP P  P  E+   G G  QR++PTPFLTKT+QLVDDPAVDD+ISWN+DGSTF+VWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRK+VPDRWEFAND
Subjt:  SPSPAEPIGES---GTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFAND

Query:  CFRRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVI----SPANSGEEQVTSSNSSP--------MAFQRGTSCTTTPELLRENERLR
        CFRRGE+ LL +I RRKV      TPP PAA  + V  A   A+        SP  SGEEQV SS+SSP             G+    + ++  ENERLR
Subjt:  CFRRGEKGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVI----SPANSGEEQVTSSNSSP--------MAFQRGTSCTTTPELLRENERLR

Query:  KENMQLSHELTQLKGLCNNILSLMTNYASGHQSESVSVRDGKALELMPATQVMMEDEGAVS----DGIQELRLKVEEATTAAAAAAEG----VTPKLFGV
        +EN QL+ EL+Q++ LCNNIL LM+ YAS  Q ++ +     A           E   A +      + +L      A +AAA  ++     ++ KLFGV
Subjt:  KENMQLSHELTQLKGLCNNILSLMTNYASGHQSESVSVRDGKALELMPATQVMMEDEGAVS----DGIQELRLKVEEATTAAAAAAEG----VTPKLFGV

Query:  SIGVKRVRREEEEEEEEMVGQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLE
        SIG KR+R              H    + ++ + +KAEP+D     P G   Q  E
Subjt:  SIGVKRVRREEEEEEEEMVGQNHVQSEEGENGSEIKAEPLDENSDNPEGSASQWLE

Q9SCW4 Heat stress transcription factor B-2a1.8e-6247.67Show/hide
Query:  SGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLRDI
        +G   SQRSIPTPFLTKTF LV+D ++DD+ISWNEDGS+FIVW P +FA+DLLPK+FKHNNFSSFVRQLNTYGF+KVVPDRWEF+ND F+RGEK LLR+I
Subjt:  SGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLRDI

Query:  QRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSS--NSSPMAFQRGTSCTT-----TPELLRENERLRKENMQLSHELTQLKGLCN
        QRRK+         T   +V+P +      +  V+SP+NSGE+   +   +SSP ++    + TT     + ELL ENE+LR +N+QL+ ELTQ+K +C+
Subjt:  QRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSS--NSSPMAFQRGTSCTT-----TPELLRENERLRKENMQLSHELTQLKGLCN

Query:  NILSLMTNYASGH---QSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRRE-EEEEEEEMVGQN
        NI SLM+NY       +S S      + +E +PA +              E+ ++ EE            +P+LFGV IG+KR R E  + +   +VG+N
Subjt:  NILSLMTNYASGH---QSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRRE-EEEEEEEMVGQN

Q9T0D3 Heat stress transcription factor B-2b1.9e-9658.96Show/hide
Query:  GESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLR
        G  G GDSQRSIPTPFLTKT+QLV+DP  D+LISWNEDG+TFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEF+NDCF+RGEK LLR
Subjt:  GESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLR

Query:  DIQRRKV---ALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMA-------------FQRGTSCTTTPELLRENERLRKENMQLS
        DIQRRK+   A++ A      A   S VTVAA P VAH++SP+NSGEEQV SSNSSP A              QR TSCTT PEL+ ENERLRK+N +L 
Subjt:  DIQRRKV---ALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMA-------------FQRGTSCTTTPELLRENERLRKENMQLS

Query:  HELTQLKGLCNNILSLMTNYASGHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEE--E
         E+T+LKGL  NI +LM N+  G +  +  + +GK L+L+P  Q M            E  +  E  T       E +TP+LFGVSIGVKR RREEE   
Subjt:  HELTQLKGLCNNILSLMTNYASGHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEE--E

Query:  EEEEMVGQNHVQSEEGENGSEIKAEPLDE-NSDNPEGSASQWLELG
         EEE   +    ++EGE  S++KAEP++E NS N  GS   WLELG
Subjt:  EEEEMVGQNHVQSEEGENGSEIKAEPLDE-NSDNPEGSASQWLELG

Arabidopsis top hitse value%identityAlignment
AT1G46264.1 heat shock transcription factor B41.9e-4347.29Show/hide
Query:  RSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLRDIQRRKVAL
        +++P PFLTKT+QLVDDPA D ++SW +D +TF+VWRP EFARDLLP YFKHNNFSSFVRQLNTYGFRK+VPDRWEFAN+ F+RGEK LL +I RRK + 
Subjt:  RSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLRDIQRRKVAL

Query:  SVATT-PPTPAAMVSPVTVAAAPAVAHVISP--ANSGEEQVTSSNSSP-----MAFQRGTSCTTTPELLRENERLRKENMQLSHELTQLKGLCNNILSLM
         +     P  +   +P  +  +      + P    + EE     + SP     +  Q+  +      L  +NERLR+ N  L  EL  +K L N+I+  +
Subjt:  SVATT-PPTPAAMVSPVTVAAAPAVAHVISP--ANSGEEQVTSSNSSP-----MAFQRGTSCTTTPELLRENERLRKENMQLSHELTQLKGLCNNILSLM

Query:  TNY
         N+
Subjt:  TNY

AT2G41690.1 heat shock transcription factor B31.3e-3640.08Show/hide
Query:  PAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGE
        P E +  + T  ++   P PFL KT+++V+DP  D +ISWNE G+ F+VW+PAEFARDLLP  FKH NFSSFVRQLNTYGFRKV   RWEF+N+ FR+G+
Subjt:  PAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGE

Query:  KGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQLKGLCN
        + L+ +I+RRK   S   +       V P T          I   +  E+Q +S+ SS   +           LL EN+ L+ EN  LS EL + K  C 
Subjt:  KGLLRDIQRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQLKGLCN

Query:  NILSLMTNYASGHQSESVSVRDGKALELMPATQVMME
         ++ L+  Y  G   ++    D +  E +    V +E
Subjt:  NILSLMTNYASGHQSESVSVRDGKALELMPATQVMME

AT4G11660.1 winged-helix DNA-binding transcription factor family protein1.4e-9758.96Show/hide
Query:  GESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLR
        G  G GDSQRSIPTPFLTKT+QLV+DP  D+LISWNEDG+TFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEF+NDCF+RGEK LLR
Subjt:  GESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLR

Query:  DIQRRKV---ALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMA-------------FQRGTSCTTTPELLRENERLRKENMQLS
        DIQRRK+   A++ A      A   S VTVAA P VAH++SP+NSGEEQV SSNSSP A              QR TSCTT PEL+ ENERLRK+N +L 
Subjt:  DIQRRKV---ALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMA-------------FQRGTSCTTTPELLRENERLRKENMQLS

Query:  HELTQLKGLCNNILSLMTNYASGHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEE--E
         E+T+LKGL  NI +LM N+  G +  +  + +GK L+L+P  Q M            E  +  E  T       E +TP+LFGVSIGVKR RREEE   
Subjt:  HELTQLKGLCNNILSLMTNYASGHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEE--E

Query:  EEEEMVGQNHVQSEEGENGSEIKAEPLDE-NSDNPEGSASQWLELG
         EEE   +    ++EGE  S++KAEP++E NS N  GS   WLELG
Subjt:  EEEEMVGQNHVQSEEGENGSEIKAEPLDE-NSDNPEGSASQWLELG

AT4G36990.1 heat shock factor 47.1e-4640.7Show/hide
Query:  SQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLRDIQRRKV
        +QRS+P PFL+KT+QLVDD + DD++SWNE+G+ F+VW+ AEFA+DLLP+YFKHNNFSSF+RQLNTYGFRK VPD+WEFAND FRRG + LL DI+RRK 
Subjt:  SQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLRDIQRRKV

Query:  ALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMAFQR-GTSCTTTPELLRENERLRKENMQLSHELTQLKGLCNNILSLMTNYAS
         +         A+      V  +P+     S +  G++  +SS SSP + +  G+      +L  ENE+L++EN  LS EL   K   + +++ +T    
Subjt:  ALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMAFQR-GTSCTTTPELLRENERLRKENMQLSHELTQLKGLCNNILSLMTNYAS

Query:  GHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTP--KLFGVSIGVKRVRREEEEEEEEMVG
        GH            L++ P  Q+    +G     ++       E       A EGV    KLFGV +  +R +R+ +E+   + G
Subjt:  GHQSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTP--KLFGVSIGVKRVRREEEEEEEEMVG

AT5G62020.1 heat shock transcription factor B2A1.3e-6347.67Show/hide
Query:  SGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLRDI
        +G   SQRSIPTPFLTKTF LV+D ++DD+ISWNEDGS+FIVW P +FA+DLLPK+FKHNNFSSFVRQLNTYGF+KVVPDRWEF+ND F+RGEK LLR+I
Subjt:  SGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLRDI

Query:  QRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSS--NSSPMAFQRGTSCTT-----TPELLRENERLRKENMQLSHELTQLKGLCN
        QRRK+         T   +V+P +      +  V+SP+NSGE+   +   +SSP ++    + TT     + ELL ENE+LR +N+QL+ ELTQ+K +C+
Subjt:  QRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSS--NSSPMAFQRGTSCTT-----TPELLRENERLRKENMQLSHELTQLKGLCN

Query:  NILSLMTNYASGH---QSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRRE-EEEEEEEMVGQN
        NI SLM+NY       +S S      + +E +PA +              E+ ++ EE            +P+LFGV IG+KR R E  + +   +VG+N
Subjt:  NILSLMTNYASGH---QSESVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRRE-EEEEEEEMVGQN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTCTCCGTCGCCGGCGGAACCGATCGGCGAATCCGGAACCGGAGATTCTCAGAGATCTATTCCGACGCCTTTTCTAACAAAAACTTTTCAGCTCGTGGATGATCCTGC
TGTTGACGACCTCATCTCCTGGAACGAAGATGGATCTACCTTCATAGTTTGGCGACCTGCTGAATTCGCCCGAGATTTACTTCCTAAATACTTTAAACACAATAACTTTT
CTAGTTTCGTCCGCCAACTTAACACTTACGGATTTCGAAAGGTTGTGCCGGACCGATGGGAATTTGCGAACGATTGTTTCCGGAGAGGCGAGAAAGGACTTCTCCGAGAC
ATCCAGCGGCGGAAAGTAGCGCTGTCGGTTGCGACGACTCCGCCAACGCCGGCCGCCATGGTGTCGCCGGTGACGGTTGCAGCGGCCCCGGCAGTGGCGCACGTGATATC
GCCGGCGAACTCTGGAGAAGAGCAGGTGACGTCCTCGAACTCATCGCCCATGGCATTTCAGCGAGGCACGAGCTGCACCACCACACCGGAACTACTAAGAGAGAACGAGC
GGCTGAGGAAGGAGAACATGCAACTGAGTCACGAGTTGACTCAGTTGAAAGGTCTCTGTAACAACATACTGTCGTTGATGACGAATTACGCTTCAGGTCACCAGTCGGAG
TCGGTGAGCGTCCGGGATGGGAAGGCGCTGGAGCTGATGCCGGCGACGCAGGTGATGATGGAAGACGAAGGCGCCGTGAGCGACGGGATACAGGAGTTGAGGCTGAAGGT
GGAGGAGGCAACAACGGCGGCGGCGGCGGCGGCGGAGGGAGTGACGCCGAAGCTGTTCGGGGTTTCGATCGGAGTGAAGCGCGTGAGGAGAGAGGAGGAAGAGGAAGAGG
AAGAAATGGTGGGGCAGAATCATGTACAGTCCGAGGAAGGTGAGAACGGGTCAGAGATTAAAGCAGAGCCGTTGGATGAGAACTCCGATAATCCAGAGGGATCCGCGTCA
CAGTGGCTCGAACTCGGGAATCAAGGCTCCTGA
mRNA sequenceShow/hide mRNA sequence
GGGGACTCAAAATAAATTTCATTAAAAATCTATATGTTAGTTAATACAAATACTAATTTGTTCATAAAATATATAATTTAAACATGTTTATCGACATTTAATTTTGGATT
GACTTGTTATATAATAAAATAAAGAAATACAGGAGTATAAAATAAACAGTGTTAAAAGTCCGTAAATTTGTTGATAGCAGGGATGGGAATCGCTCAGTTCTGATTGGGCC
GATAATGTAAAGGAGAATCCATTGGAACGGCCCAAACGAAAAAGGGTAAAATGGGAAAGAAGGAGGGGGAAGCTTCTGGAGAAGATTTTAAGAACCTGAAAAAAATAATA
ATAAATAAAATAAAATCCACAAACGCACACGCTTCCAAACCTCAATCCTTCTGGTGCCTTCCTATCCTTTCCCCCTTCTCTCTCTCTCTCTCTCTCTCTCTATAATGTCT
CTTCTGTTGTGGACCCCACTCCCTTTTTTAGTTTTTTTAATTTTATTTCTTTTCCTTTTCCTTCTCGAACCCACGTTTTCCTTTTTTCTTCCTTATCATTTACTCTCCTT
ACCCAGAAACTTCTAGAAACCTCTCTCTCTCTCTCTCTCTCTCTCTCTACGGGCGGCTTAGTTGAGCTTCAGACGTTGCAGTGCAGTGGCGGAGGACATTCTCAGGCGCG
GCGGTGTTCTTAACTCCTTCGATTCCGCCCCAGATTGAAGCGGAACTTACTGAAAATTGGAGATCCTTACAGTTCAGATCTGGGAGAACAGGCGATGTCTCCGTCGCCGG
CGGAACCGATCGGCGAATCCGGAACCGGAGATTCTCAGAGATCTATTCCGACGCCTTTTCTAACAAAAACTTTTCAGCTCGTGGATGATCCTGCTGTTGACGACCTCATC
TCCTGGAACGAAGATGGATCTACCTTCATAGTTTGGCGACCTGCTGAATTCGCCCGAGATTTACTTCCTAAATACTTTAAACACAATAACTTTTCTAGTTTCGTCCGCCA
ACTTAACACTTACGGATTTCGAAAGGTTGTGCCGGACCGATGGGAATTTGCGAACGATTGTTTCCGGAGAGGCGAGAAAGGACTTCTCCGAGACATCCAGCGGCGGAAAG
TAGCGCTGTCGGTTGCGACGACTCCGCCAACGCCGGCCGCCATGGTGTCGCCGGTGACGGTTGCAGCGGCCCCGGCAGTGGCGCACGTGATATCGCCGGCGAACTCTGGA
GAAGAGCAGGTGACGTCCTCGAACTCATCGCCCATGGCATTTCAGCGAGGCACGAGCTGCACCACCACACCGGAACTACTAAGAGAGAACGAGCGGCTGAGGAAGGAGAA
CATGCAACTGAGTCACGAGTTGACTCAGTTGAAAGGTCTCTGTAACAACATACTGTCGTTGATGACGAATTACGCTTCAGGTCACCAGTCGGAGTCGGTGAGCGTCCGGG
ATGGGAAGGCGCTGGAGCTGATGCCGGCGACGCAGGTGATGATGGAAGACGAAGGCGCCGTGAGCGACGGGATACAGGAGTTGAGGCTGAAGGTGGAGGAGGCAACAACG
GCGGCGGCGGCGGCGGCGGAGGGAGTGACGCCGAAGCTGTTCGGGGTTTCGATCGGAGTGAAGCGCGTGAGGAGAGAGGAGGAAGAGGAAGAGGAAGAAATGGTGGGGCA
GAATCATGTACAGTCCGAGGAAGGTGAGAACGGGTCAGAGATTAAAGCAGAGCCGTTGGATGAGAACTCCGATAATCCAGAGGGATCCGCGTCACAGTGGCTCGAACTCG
GGAATCAAGGCTCCTGATGGTGTATAAAAACGACGTCGTAGCAGTAGTCGATCATAATTCATAAATGACTTCGAGAAGAAGATAACGTTTTTGAGATCAGAGAGTTCGCA
GATGATTGGCCCTGCCCTGCCCTGCCCTGGACAGCTCCAAGAATCTCACGTGCCAAACCCGTGAGCTGGAAAATATACAAATTTAAAATATCTCTTTTTTCTTTTTTCTT
TTTGGGAAAGAGCTTGAGCTTGGTTGGTCCTTGTAATTGTAAAAGACGAAACCGTGGAAGATCAACCGGCTGGGTCGGTTGGTGAAGGTCTAGAATGACGGAAATACCCA
GGAAGATTGGCAAATGGCAACTGGGAACTTGTAATTTTAACCTCTCCTATTTTTATCTTCCAATTCTAATTTAAGGGGGCGAAGAAAAAAAAAAAAAGAACTGCCAGTTC
TGTTTCCGGTGTCCGGTGGCGGTTGTCCAATGTCACGTGTAATTGTATGGGCTTATTATGAGGCCCAATTAACGTTCTTCTCTTTTTCTGGCAATGGTAACGTTCTTCCT
TTGTCCTTCCCATTAAACAAAGAAAAATGTCACAATTTGTAATCTTGATTGATAACTATTTTTTTCTTCTAATTAAGCTTATAAATACTGCTTCTCATTTTATTCTAAGT
TAAATTTTGAAAAGTAAAAGAAAGGTCATATTTAAGGGCAAATATTTTTTAAGATTAAGTTTATAAATAGTACTTTATGTTAACTTCTTTTAAACATATATTCTCGACCA
AGAGATTTATTCTAAC
Protein sequenceShow/hide protein sequence
MSPSPAEPIGESGTGDSQRSIPTPFLTKTFQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFRRGEKGLLRD
IQRRKVALSVATTPPTPAAMVSPVTVAAAPAVAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELLRENERLRKENMQLSHELTQLKGLCNNILSLMTNYASGHQSE
SVSVRDGKALELMPATQVMMEDEGAVSDGIQELRLKVEEATTAAAAAAEGVTPKLFGVSIGVKRVRREEEEEEEEMVGQNHVQSEEGENGSEIKAEPLDENSDNPEGSAS
QWLELGNQGS