CuGenDBv2

Tan0022188 (gene) of Snake gourd v1 genome

Gene ID: Tan0022188
Organism: Trichosanthes anguina (Snake gourd v1)
Description: heat stress transcription factor B-2b
Genome location: LG07:68285934..68288401
RNA-Seq Expression: Tan0022188
Synteny: Tan0022188
Gene Ontology terms:
GO:0006012 - galactose metabolic process (biological process)
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0003978 - UDP-glucose 4-epimerase activity (molecular function)
GO:0008168 - methyltransferase activity (molecular function)
InterPro domains:
IPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR027725 - Heat shock transcription factor family
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily
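
The genome location field uses a `sequence:start..end` layout. As a minimal sketch, the following Python splits that string into its parts and computes the length of the genomic span. The names `Location` and `parse_location` are hypothetical helpers introduced for illustration, and the length calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A genomic interval such as LG07:68285934..68288401."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("LG07:68285934..68288401")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene span is 2468 bp on LG07.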


Homology
GenBank top hits (e-value, % identity, alignment)
KAE8647868.1 hypothetical protein Csa_000645 [Cucumis sativus] (e-value 4.3e-162, identity 90.35%)
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ
        ++GEKGLLRDIQRRKV +SV TT TT AAVA  P TVA SPAV AHVISPANS EEQVTSSNSSPMAFQR TSCTTTPELVRENERLRKENMQLSHELTQ
Subjt:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ

Query:  LKGLCNNILSLMTNYASGHHQS-ESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVE-GMTPKLFGVSIGVKRVRRE-EEEEEEMVG
        LKGLCNNILSLMTNYASG HQ  ES SVRDGKALEL+PARQVMEDEGAVSDG  E+RLK+EE M AAA   GMTPKLFGVSIG+KR+RRE EEEEEEMVG
Subjt:  LKGLCNNILSLMTNYASGHHQS-ESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVE-GMTPKLFGVSIGVKRVRRE-EEEEEEMVG

Query:  QNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS
        QNHVQSEEGETGSEIKAEPLDENSE+P+GSASPWLELGNQGS
Subjt:  QNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS

XP_004143930.1 heat stress transcription factor B-2b [Cucumis sativus] (e-value 4.3e-162, identity 90.35%)
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ
        ++GEKGLLRDIQRRKV +SV TT TT AAVA  P TVA SPAV AHVISPANS EEQVTSSNSSPMAFQR TSCTTTPELVRENERLRKENMQLSHELTQ
Subjt:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ

Query:  LKGLCNNILSLMTNYASGHHQS-ESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVE-GMTPKLFGVSIGVKRVRRE-EEEEEEMVG
        LKGLCNNILSLMTNYASG HQ  ES SVRDGKALEL+PARQVMEDEGAVSDG  E+RLK+EE M AAA   GMTPKLFGVSIG+KR+RRE EEEEEEMVG
Subjt:  LKGLCNNILSLMTNYASGHHQS-ESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVE-GMTPKLFGVSIGVKRVRRE-EEEEEEMVG

Query:  QNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS
        QNHVQSEEGETGSEIKAEPLDENSE+P+GSASPWLELGNQGS
Subjt:  QNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS

XP_008437221.1 PREDICTED: heat stress transcription factor B-2b [Cucumis melo] (e-value 4.7e-161, identity 90.32%)
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ
        ++GEKGLLRDIQRRKV +SV TT TT AAVA  P  VAASPAV AHVISPANS EEQVTSSNSSPMAFQR TSCTTTPELVRENERLRKENMQLSHELTQ
Subjt:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ

Query:  LKGLCNNILSLMTNYASG-HHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRRE-EEEEEEMVGQ
        LKGLCNNILSLMTNYASG HH  ES SVRDGKALEL+PARQVMEDEGAVSDG  E+RLK+ E M AAA  G+TPKLFGVSIGVKR+RRE EEEEEEMVGQ
Subjt:  LKGLCNNILSLMTNYASG-HHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRRE-EEEEEEMVGQ

Query:  NHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS
        NHVQSEEGETGSEIKAEPLDENSE+P+GSASPWLELGNQGS
Subjt:  NHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS

XP_022157722.1 heat stress transcription factor B-2b [Momordica charantia] (e-value 1.3e-163, identity 91.84%)
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        M+PSPAEPIG+SGTGDSQRSIPTPFLTKT+QLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ
        +RGEKGLLRDIQRRKV +SVATTP TPAA+ S P TVAA+PAV AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPEL+RENERLRKENMQLSHELTQ
Subjt:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ

Query:  LKGLCNNILSLMTNYASGHHQSESVSVRDGKALELMPARQV-MEDEGAVSDGIQEIRLKVEE--TMRAAAVEGMTPKLFGVSIGVKRVRR-EEEEEEEMV
        LKGLCNNILSLMTNYASG HQSESVSVRDGKALELMPA QV MEDEGAVSDGIQE+RLKVEE  T  AAA EG+TPKLFGVSIGVKRVRR EEEEEEEMV
Subjt:  LKGLCNNILSLMTNYASGHHQSESVSVRDGKALELMPARQV-MEDEGAVSDGIQEIRLKVEE--TMRAAAVEGMTPKLFGVSIGVKRVRR-EEEEEEEMV

Query:  GQNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS
        GQNHVQSEEGE GSEIKAEPLDENS+NPEGSAS WLELGNQGS
Subjt:  GQNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS

XP_038875590.1 heat stress transcription factor B-2b [Benincasa hispida] (e-value 1.6e-164, identity 91.84%)
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  KRGEKGLLRDIQRRKVTVSVATT--PTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHEL
        +RGEKGLLRDIQRRKV +S+ TT    TPAAVA P A VAASPAV AHVISPANSGEEQVTSSNSSPM FQRGTSCTTTPELVRENERLRKENMQLSHEL
Subjt:  KRGEKGLLRDIQRRKVTVSVATT--PTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHEL

Query:  TQLKGLCNNILSLMTNYASGH-HQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRREE-EEEEEMV
        TQLKGLCNNILSLMTNYAS H HQ ESVSVRDGKALEL+PARQVMEDEGAVSDG QE+RLK+EETM AA   GMTPKLFGVSIGVKR+RREE +EEEEMV
Subjt:  TQLKGLCNNILSLMTNYASGH-HQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRREE-EEEEEMV

Query:  GQNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS
        GQNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS
Subjt:  GQNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0KNZ8 HSF_DOMAIN domain-containing protein (e-value 2.1e-162, identity 90.35%)
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ
        ++GEKGLLRDIQRRKV +SV TT TT AAVA  P TVA SPAV AHVISPANS EEQVTSSNSSPMAFQR TSCTTTPELVRENERLRKENMQLSHELTQ
Subjt:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ

Query:  LKGLCNNILSLMTNYASGHHQS-ESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVE-GMTPKLFGVSIGVKRVRRE-EEEEEEMVG
        LKGLCNNILSLMTNYASG HQ  ES SVRDGKALEL+PARQVMEDEGAVSDG  E+RLK+EE M AAA   GMTPKLFGVSIG+KR+RRE EEEEEEMVG
Subjt:  LKGLCNNILSLMTNYASGHHQS-ESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVE-GMTPKLFGVSIGVKRVRRE-EEEEEEMVG

Query:  QNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS
        QNHVQSEEGETGSEIKAEPLDENSE+P+GSASPWLELGNQGS
Subjt:  QNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS

A0A1S3AU16 heat stress transcription factor B-2b (e-value 2.3e-161, identity 90.32%)
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ
        ++GEKGLLRDIQRRKV +SV TT TT AAVA  P  VAASPAV AHVISPANS EEQVTSSNSSPMAFQR TSCTTTPELVRENERLRKENMQLSHELTQ
Subjt:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ

Query:  LKGLCNNILSLMTNYASG-HHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRRE-EEEEEEMVGQ
        LKGLCNNILSLMTNYASG HH  ES SVRDGKALEL+PARQVMEDEGAVSDG  E+RLK+ E M AAA  G+TPKLFGVSIGVKR+RRE EEEEEEMVGQ
Subjt:  LKGLCNNILSLMTNYASG-HHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRRE-EEEEEEMVGQ

Query:  NHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS
        NHVQSEEGETGSEIKAEPLDENSE+P+GSASPWLELGNQGS
Subjt:  NHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS

A0A5A7TN46 Bifunctional UDP-glucose 4-epimerase and UDP-xylose 4-epimerase 1 (e-value 8.7e-161, identity 90.35%)
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ
        ++GEKGLLRDIQRRKV +SV TT TT AAVA  P  VAASPAV AHVISPANS EEQVTSSNSSPMAFQR TSCTTTPELVRENERLRKENMQLSHELTQ
Subjt:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ

Query:  LKGLCNNILSLMTNYASG-HHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETM-RAAAVEGMTPKLFGVSIGVKRVRRE-EEEEEEMVG
        LKGLCNNILSLMTNYASG HH  ES SVRDGKALEL+PARQVMEDEGAVSDG  E+RLK+EE M  AAA  G+TPKLFGVSIGVKR+RRE EEEEEEMVG
Subjt:  LKGLCNNILSLMTNYASG-HHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETM-RAAAVEGMTPKLFGVSIGVKRVRRE-EEEEEEMVG

Query:  QNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS
        QNHVQSEEGETGSEIKAEPLDENSE+P+GSASPWLELGNQGS
Subjt:  QNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS

A0A6J1DU11 heat stress transcription factor B-2b (e-value 6.4e-164, identity 91.84%)
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        M+PSPAEPIG+SGTGDSQRSIPTPFLTKT+QLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ
        +RGEKGLLRDIQRRKV +SVATTP TPAA+ S P TVAA+PAV AHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPEL+RENERLRKENMQLSHELTQ
Subjt:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ

Query:  LKGLCNNILSLMTNYASGHHQSESVSVRDGKALELMPARQV-MEDEGAVSDGIQEIRLKVEE--TMRAAAVEGMTPKLFGVSIGVKRVRR-EEEEEEEMV
        LKGLCNNILSLMTNYASG HQSESVSVRDGKALELMPA QV MEDEGAVSDGIQE+RLKVEE  T  AAA EG+TPKLFGVSIGVKRVRR EEEEEEEMV
Subjt:  LKGLCNNILSLMTNYASGHHQSESVSVRDGKALELMPARQV-MEDEGAVSDGIQEIRLKVEE--TMRAAAVEGMTPKLFGVSIGVKRVRR-EEEEEEEMV

Query:  GQNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS
        GQNHVQSEEGE GSEIKAEPLDENS+NPEGSAS WLELGNQGS
Subjt:  GQNHVQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS

A0A6J1H317 heat stress transcription factor B-2b-like (e-value 2.5e-160, identity 89.09%)
Query:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
        MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDP VDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF
Subjt:  MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCF

Query:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ
        KRGEK LLRDIQRRKV +SVA  P TPA+V+ P  TVAASPAVAA VISP NS EEQVTSSNSSPM FQRGTSC TTPELVRENERLRKENMQLSHELTQ
Subjt:  KRGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQ

Query:  LKGLCNNILSLMTNYASGHHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRREEEEEEEMVGQNH
        LKGLCNNILSLMTNYASGH Q ESVSVRDGKAL+L+PARQ+M+DEGAVSDGIQE+RLKVEE +  A  EG TPKLFGVSIGVKRVRR EE++EEMVG NH
Subjt:  LKGLCNNILSLMTNYASGHHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRREEEEEEEMVGQNH

Query:  VQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS
        VQSEE ETGSEIKAEPLDENSENPEGSAS WLELGNQGS
Subjt:  VQSEEGETGSEIKAEPLDENSENPEGSASPWLELGNQGS

SwissProt top hits (e-value, % identity, alignment)
P22335 Heat shock factor protein HSF24 (e-value 1.3e-52, identity 44.83%)
Query:  SQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLRDIQRRKV
        SQR+ P PFL KTYQLVDD A DD+ISWNE G+TF+VW+ AEFA+DLLPKYFKHNNFSSFVRQLNTYGFRK+VPD+WEFAN+ FKRG+K LL  I+RRK 
Subjt:  SQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLRDIQRRKV

Query:  TVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGT-----SCTTTPELVRENERLRKENMQLSHELTQLKGLCNNILSL
             T  +TPA   S  A  +A         SP NSG++  +SS SSP +   G+       +   +L  ENE+L+K+N  LS EL Q K  CN +++ 
Subjt:  TVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGT-----SCTTTPELVRENERLRKENMQLSHELTQLKGLCNNILSL

Query:  MTNYASGH----HQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRREEEEEEEMVG
        ++ Y        ++  S     G +LE     +++++ G V D  ++      +       +G T KLFGV +  K+ +R  +E  E  G
Subjt:  MTNYASGH----HQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRREEEEEEEMVG

Q652B0 Heat stress transcription factor B-2c (e-value 3.1e-70, identity 48.71%)
Query:  QRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLRDIQRRKVT
        QRS+PTPFLTKTYQLV+DPAVDD+ISWNEDGSTF+VWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRK+VPDRWEFANDCF+RGEK LL DI RRKV 
Subjt:  QRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLRDIQRRKVT

Query:  VSVATTPTTP--------AAVASPPATVAASPAVAAHVI----SPANSGEEQVTSSNSSPMAFQR------------GTSCTTTPELVRENERLRKENMQ
         + A  P  P        AAVAS   TVAA+P   A  +    SPA+S EEQV SSNS      R            G    +  ++  ENERLR+EN +
Subjt:  VSVATTPTTP--------AAVASPPATVAASPAVAAHVI----SPANSGEEQVTSSNSSPMAFQR------------GTSCTTTPELVRENERLRKENMQ

Query:  LSHELTQLKGLCNNILSLMTNYASGHHQSESVSVR-----DGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGM--------TPKLFGVSI
        L+ EL  +K LCNNIL LM+ YA+  H   S  +       G++ E +P          +   I ++         AAA  G+        + +LFGVSI
Subjt:  LSHELTQLKGLCNNILSLMTNYASGHHQSESVSVR-----DGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGM--------TPKLFGVSI

Query:  GVKRVRREEEEEEEMVGQNHVQSEEGETGSEIKAEPLDENSENPEGSAS
        G+KR R +     +  G    Q+E G  G+++K E  D +     G +S
Subjt:  GVKRVRREEEEEEEMVGQNHVQSEEGETGSEIKAEPLDENSENPEGSAS

Q6Z9C8 Heat stress transcription factor B-2b (e-value 3.1e-70, identity 48.37%)
Query:  PSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKR
        P+PA      G G  QR++PTPFLTKTYQLVDDPAVDD+ISWN+DGSTF+VWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRK+VPDRWEFANDCF+R
Subjt:  PSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKR

Query:  GEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSP--------MAFQRGTSCTTTPELVRENERLRKENMQL
        GE+ LL +I RRKVT       T   A A P     A P       SP  SGEEQV SS+SSP             G+    + ++  ENERLR+EN QL
Subjt:  GEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSP--------MAFQRGTSCTTTPELVRENERLRKENMQL

Query:  SHELTQLKGLCNNILSLMTNYAS-------------GHHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIG
        + EL+Q++ LCNNIL LM+ YAS             G++ + + S    +A   +P   V+ D      G       V +         M+ KLFGVSIG
Subjt:  SHELTQLKGLCNNILSLMTNYAS-------------GHHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIG

Query:  VKRVRREEEEEEEMVGQNHVQSEEGETGSEIKAEPLD
         KR+R             H    + +  + +KAEP+D
Subjt:  VKRVRREEEEEEEMVGQNHVQSEEGETGSEIKAEPLD

Q9SCW4 Heat stress transcription factor B-2a (e-value 1.1e-64, identity 48.49%)
Query:  SGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLRDI
        +G   SQRSIPTPFLTKT+ LV+D ++DD+ISWNEDGS+FIVW P +FA+DLLPK+FKHNNFSSFVRQLNTYGF+KVVPDRWEF+ND FKRGEK LLR+I
Subjt:  SGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLRDI

Query:  QRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSS--NSSPMAFQRGTSCTT-----TPELVRENERLRKENMQLSHELTQLKGL
        QRRK+T       TT   V +P    ++       V+SP+NSGE+   +   +SSP ++    + TT     + EL+ ENE+LR +N+QL+ ELTQ+K +
Subjt:  QRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSS--NSSPMAFQRGTSCTT-----TPELVRENERLRKENMQLSHELTQLKGL

Query:  CNNILSLMTNYASGHHQSESVSV--RDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRRE--EEEEEEMVGQN
        C+NI SLM+NY        S S      + +E +PA++  E             +++EE       E  +P+LFGV IG+KR R E  + +   +VG+N
Subjt:  CNNILSLMTNYASGHHQSESVSV--RDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRRE--EEEEEEMVGQN

Q9T0D3 Heat stress transcription factor B-2b (e-value 1.1e-96, identity 60.58%)
Query:  GDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLR
        G  G GDSQRSIPTPFLTKTYQLV+DP  D+LISWNEDG+TFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEF+NDCFKRGEK LLR
Subjt:  GDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLR

Query:  DIQRRKVT--VSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMA-------------FQRGTSCTTTPELVRENERLRKENMQL
        DIQRRK++     A      AAVA+   TVAA P V AH++SP+NSGEEQV SSNSSP A              QR TSCTT PELV ENERLRK+N +L
Subjt:  DIQRRKVT--VSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMA-------------FQRGTSCTTTPELVRENERLRKENMQL

Query:  SHELTQLKGLCNNILSLMTNYASGHHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRREEE---E
          E+T+LKGL  NI +LM N+  G      + + +GK L+L+P RQ M +    S+    I LK+         E +TP+LFGVSIGVKR RREEE    
Subjt:  SHELTQLKGLCNNILSLMTNYASGHHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRREEE---E

Query:  EEEMVGQNHVQSEEGETGSEIKAEPLDE-NSENPEGSASPWLELG
        EEE   +    ++EGE  S++KAEP++E NS N  GS   WLELG
Subjt:  EEEMVGQNHVQSEEGETGSEIKAEPLDE-NSENPEGSASPWLELG

Arabidopsis top hits (e-value, % identity, alignment)
AT1G46264.1 heat shock transcription factor B4 (e-value 1.7e-44, identity 48.77%)
Query:  RSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLRDIQRRKVTV
        +++P PFLTKTYQLVDDPA D ++SW +D +TF+VWRP EFARDLLP YFKHNNFSSFVRQLNTYGFRK+VPDRWEFAN+ FKRGEK LL +I RRK + 
Subjt:  RSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLRDIQRRKVTV

Query:  SVATTPTTPAAVASPPATVAAS-----PAVAAHVISPANSGEEQVTSSNSSPMAF-QRGTSCTTTPELVRENERLRKENMQLSHELTQLKGLCNNILSLM
         +    +   +    P  +  S     P     V +P         S  S P    Q+  +      L  +NERLR+ N  L  EL  +K L N+I+  +
Subjt:  SVATTPTTPAAVASPPATVAAS-----PAVAAHVISPANSGEEQVTSSNSSPMAF-QRGTSCTTTPELVRENERLRKENMQLSHELTQLKGLCNNILSLM

Query:  TNY
         N+
Subjt:  TNY

AT4G11660.1 winged-helix DNA-binding transcription factor family protein (e-value 7.9e-98, identity 60.58%)
Query:  GDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLR
        G  G GDSQRSIPTPFLTKTYQLV+DP  D+LISWNEDG+TFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEF+NDCFKRGEK LLR
Subjt:  GDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLR

Query:  DIQRRKVT--VSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMA-------------FQRGTSCTTTPELVRENERLRKENMQL
        DIQRRK++     A      AAVA+   TVAA P V AH++SP+NSGEEQV SSNSSP A              QR TSCTT PELV ENERLRK+N +L
Subjt:  DIQRRKVT--VSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMA-------------FQRGTSCTTTPELVRENERLRKENMQL

Query:  SHELTQLKGLCNNILSLMTNYASGHHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRREEE---E
          E+T+LKGL  NI +LM N+  G      + + +GK L+L+P RQ M +    S+    I LK+         E +TP+LFGVSIGVKR RREEE    
Subjt:  SHELTQLKGLCNNILSLMTNYASGHHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRREEE---E

Query:  EEEMVGQNHVQSEEGETGSEIKAEPLDE-NSENPEGSASPWLELG
        EEE   +    ++EGE  S++KAEP++E NS N  GS   WLELG
Subjt:  EEEMVGQNHVQSEEGETGSEIKAEPLDE-NSENPEGSASPWLELG

AT4G17750.1 heat shock factor 1 (e-value 4.6e-37, identity 34.88%)
Query:  APSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFK
        AP P  P     T  +  S+P PFL+KTY +V+DPA D ++SW+   ++FIVW P EF+RDLLPKYFKHNNFSSFVRQLNTYGFRKV PDRWEFAN+ F 
Subjt:  APSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFK

Query:  RGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPE--LVRENERLRKENMQLSHELT
        RG+K LL+ I RRK                          +V  H  S +N   +Q++    S  A    +SC    +  L  E E+L+++   L  EL 
Subjt:  RGEKGLLRDIQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPE--LVRENERLRKENMQLSHELT

Query:  QLKGLCNNILSLMTNYASGHHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEE--TMRAAAVEGMTPKLFGVSIGVKRVRREEEEEEEMVG
        +L+                    +     D K   L+   QVME        I     K  +  T  +  ++  T     V+   K+ R  E+       
Subjt:  QLKGLCNNILSLMTNYASGHHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEE--TMRAAAVEGMTPKLFGVSIGVKRVRREEEEEEEMVG

Query:  QNHVQSEEGETGSEIKAEPLDENS
         +H  S E   G  +K +PL  +S
Subjt:  QNHVQSEEGETGSEIKAEPLDENS

AT4G36990.1 heat shock factor 4 (e-value 4.4e-48, identity 41.28%)
Query:  SQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLRDIQRRKV
        +QRS+P PFL+KTYQLVDD + DD++SWNE+G+ F+VW+ AEFA+DLLP+YFKHNNFSSF+RQLNTYGFRK VPD+WEFAND F+RG + LL DI+RRK 
Subjt:  SQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLRDIQRRKV

Query:  TVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQR-GTSCTTTPELVRENERLRKENMQLSHELTQLKGLCNNILSLMTNY
           +A+T      V SP               S +  G++  +SS SSP + +  G+      +L  ENE+L++EN  LS EL   K   + +++ +T +
Subjt:  TVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQR-GTSCTTTPELVRENERLRKENMQLSHELTQLKGLCNNILSLMTNY

Query:  ASGHHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRREEEEEEEMV
             +     ++ GK  + + + +  E EG    G  E              EG+  KLFGV +  +R +R+ +E+  +V
Subjt:  ASGHHQSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRREEEEEEEMV

AT5G62020.1 heat shock transcription factor B2A (e-value 8.0e-66, identity 48.49%)
Query:  SGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLRDI
        +G   SQRSIPTPFLTKT+ LV+D ++DD+ISWNEDGS+FIVW P +FA+DLLPK+FKHNNFSSFVRQLNTYGF+KVVPDRWEF+ND FKRGEK LLR+I
Subjt:  SGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLRDI

Query:  QRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSS--NSSPMAFQRGTSCTT-----TPELVRENERLRKENMQLSHELTQLKGL
        QRRK+T       TT   V +P    ++       V+SP+NSGE+   +   +SSP ++    + TT     + EL+ ENE+LR +N+QL+ ELTQ+K +
Subjt:  QRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSS--NSSPMAFQRGTSCTT-----TPELVRENERLRKENMQLSHELTQLKGL

Query:  CNNILSLMTNYASGHHQSESVSV--RDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRRE--EEEEEEMVGQN
        C+NI SLM+NY        S S      + +E +PA++  E             +++EE       E  +P+LFGV IG+KR R E  + +   +VG+N
Subjt:  CNNILSLMTNYASGHHQSESVSV--RDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRRE--EEEEEEMVGQN
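
In BLAST-style output, the unlabeled middle row of each alignment block marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. Percent identity is the number of identical columns divided by the alignment length, gaps included. The sketch below shows that calculation on a small made-up example. The function name `percent_identity` is hypothetical, and because the rows on this page are wrapped and may have lost trailing whitespace, a value recomputed from them is not guaranteed to reproduce the listed figures.

```python
from typing import Sequence


def percent_identity(query_rows: Sequence[str], match_rows: Sequence[str]) -> float:
    """Percent identity over all wrapped blocks of one alignment.

    query_rows: residues of the 'Query:' rows (label removed), gaps as '-'.
    match_rows: the middle rows; letters mark identical columns.
    """
    aligned = 0
    identical = 0
    for query, match in zip(query_rows, match_rows):
        # Trailing spaces of the middle row are often stripped; restore them.
        match = match.ljust(len(query))
        aligned += len(query)
        identical += sum(1 for symbol in match[:len(query)] if symbol.isalpha())
    if aligned == 0:
        raise ValueError("empty alignment")
    return 100.0 * identical / aligned


# Made-up toy alignment: 4 identical columns out of 8.
print(percent_identity(["MAPSK-LT"], ["MAP +  T"]))
```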


Sequences
CDS sequence
ATGGCTCCGTCGCCGGCGGAACCGATCGGCGATTCCGGAACCGGAGATTCTCAGAGATCTATTCCGACGCCGTTTCTAACGAAAACGTATCAACTCGTTGATGATCCGGC
CGTCGACGACCTCATCTCGTGGAACGAAGATGGATCTACCTTCATAGTTTGGCGACCTGCCGAATTTGCTCGAGATTTACTTCCTAAATACTTTAAACACAATAACTTTT
CTAGTTTCGTCCGTCAACTTAACACTTACGGATTCCGAAAGGTCGTGCCGGACCGATGGGAATTTGCGAACGATTGTTTTAAGAGAGGTGAGAAAGGACTTCTCCGAGAC
ATCCAGCGGCGGAAGGTAACAGTGTCGGTAGCGACCACGCCGACAACGCCGGCCGCCGTGGCTTCACCACCGGCGACAGTTGCAGCGTCTCCGGCAGTGGCGGCGCACGT
GATATCGCCGGCGAACTCTGGGGAAGAGCAGGTGACATCCTCGAACTCGTCGCCGATGGCATTTCAACGAGGTACAAGCTGCACCACCACGCCGGAACTGGTGAGAGAGA
ATGAGCGGCTGAGGAAGGAGAACATGCAACTGAGTCACGAGTTGACTCAGTTAAAAGGTCTCTGTAACAACATACTATCGTTAATGACGAATTACGCCTCCGGTCACCAC
CAGTCAGAGTCGGTGAGCGTCCGGGATGGGAAGGCGCTGGAGCTCATGCCGGCGAGGCAGGTGATGGAAGACGAAGGCGCGGTCAGCGACGGGATTCAGGAGATAAGGCT
GAAGGTGGAGGAGACGATGAGGGCGGCGGCGGTAGAAGGAATGACGCCGAAGCTGTTCGGAGTTTCGATCGGAGTGAAGCGCGTGAGGAGAGAGGAGGAAGAGGAAGAAG
AAATGGTGGGGCAGAATCACGTACAGTCGGAAGAAGGTGAGACCGGGTCGGAGATTAAAGCCGAGCCGTTGGATGAGAACTCTGAAAATCCAGAGGGATCCGCATCGCCA
TGGCTCGAACTCGGGAATCAAGGCTCCTGA
mRNA sequence
CCTCAATCCTTCTGGTGCCTTCCTATCCTCTCTCTCTCTCTCTATAATGTCTCCTGTTGTGGTCCCCACCCCCCTTTTTCTCCTTTTTTTTAATTTTTTTTTTCCAAAAA
CATTTTCCTCTTTCCTTCTCGAACCCACGTTTTCATTTTTTTCTTCCCTCTTCTCTTCCTTATCATTTACTCTCCTTACCCAGAAACTTCTAGAAACAACAGAACCTCTC
TCTTCTCTCTCTCTCTCTCTCTACGCGGCGGCTTTGCTGAGCTTCTGACGTTGCAGTGGCGGAGGTTATTCTCAGGCGCGGTTGTCTTCTTCACTCATCCGATTCCGATT
CCGATTCCAGTTCCTTCTTTTTCTCTCTGGTTCCAGGGGAAAATCTAGTCCGATTAGACGAAACTCCTCGAAAATCGGAGATCCTTACGGTTCAGATCTGGGAGACGAGG
CGATGGCTCCGTCGCCGGCGGAACCGATCGGCGATTCCGGAACCGGAGATTCTCAGAGATCTATTCCGACGCCGTTTCTAACGAAAACGTATCAACTCGTTGATGATCCG
GCCGTCGACGACCTCATCTCGTGGAACGAAGATGGATCTACCTTCATAGTTTGGCGACCTGCCGAATTTGCTCGAGATTTACTTCCTAAATACTTTAAACACAATAACTT
TTCTAGTTTCGTCCGTCAACTTAACACTTACGGATTCCGAAAGGTCGTGCCGGACCGATGGGAATTTGCGAACGATTGTTTTAAGAGAGGTGAGAAAGGACTTCTCCGAG
ACATCCAGCGGCGGAAGGTAACAGTGTCGGTAGCGACCACGCCGACAACGCCGGCCGCCGTGGCTTCACCACCGGCGACAGTTGCAGCGTCTCCGGCAGTGGCGGCGCAC
GTGATATCGCCGGCGAACTCTGGGGAAGAGCAGGTGACATCCTCGAACTCGTCGCCGATGGCATTTCAACGAGGTACAAGCTGCACCACCACGCCGGAACTGGTGAGAGA
GAATGAGCGGCTGAGGAAGGAGAACATGCAACTGAGTCACGAGTTGACTCAGTTAAAAGGTCTCTGTAACAACATACTATCGTTAATGACGAATTACGCCTCCGGTCACC
ACCAGTCAGAGTCGGTGAGCGTCCGGGATGGGAAGGCGCTGGAGCTCATGCCGGCGAGGCAGGTGATGGAAGACGAAGGCGCGGTCAGCGACGGGATTCAGGAGATAAGG
CTGAAGGTGGAGGAGACGATGAGGGCGGCGGCGGTAGAAGGAATGACGCCGAAGCTGTTCGGAGTTTCGATCGGAGTGAAGCGCGTGAGGAGAGAGGAGGAAGAGGAAGA
AGAAATGGTGGGGCAGAATCACGTACAGTCGGAAGAAGGTGAGACCGGGTCGGAGATTAAAGCCGAGCCGTTGGATGAGAACTCTGAAAATCCAGAGGGATCCGCATCGC
CATGGCTCGAACTCGGGAATCAAGGCTCCTGATGTAAAACGGCGTCGTCGCGATGACGGTGATCGTCGTCATAAATGAGCGAGAAAGAAGAAAATAACGGGATAACGTTA
TTTGAAAATAGAGAGAATGCAGAGACAAGAGGAGACCAAACCGGCCCAAGACAGCAAGAAGAATCTCACGTGCGCAACACGTGAGCTTTAAAATATACGAATTTAAAAAT
GTTTGTTTTTTCCTTTTTTTTTTAATCTTTTTGGGAAAGAGCTTAGGTTGATTGGTTGGTCCTTGTGAATGACGAAACCGTGGAAAATCAACTGGCTGGGTCGGTTGGTG
AAGGGTGATAAAATGACCAAAAAACCCATGAAGAATGGCAATTGGGAATTTGTAATTTGTAAGTTTGTAACCTCTCCAATTTCTATCTTCAAAATTTTAATTTAAGAAAG
GAAAAAAAAAAAACCAAAAACTGTCGGTTCTGGGGTTCTCTTGTTTCCCGGTGTCCGATGGCCCGATTCGAATGTCACCTGTAATTATATGGGCTATTATTAGCCCGGTA
ATCTAATATCGTTCTTTTTTTC
Protein sequence
MAPSPAEPIGDSGTGDSQRSIPTPFLTKTYQLVDDPAVDDLISWNEDGSTFIVWRPAEFARDLLPKYFKHNNFSSFVRQLNTYGFRKVVPDRWEFANDCFKRGEKGLLRD
IQRRKVTVSVATTPTTPAAVASPPATVAASPAVAAHVISPANSGEEQVTSSNSSPMAFQRGTSCTTTPELVRENERLRKENMQLSHELTQLKGLCNNILSLMTNYASGHH
QSESVSVRDGKALELMPARQVMEDEGAVSDGIQEIRLKVEETMRAAAVEGMTPKLFGVSIGVKRVRREEEEEEEMVGQNHVQSEEGETGSEIKAEPLDENSENPEGSASP
WLELGNQGS
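
The protein sequence corresponds to a translation of the CDS with the standard genetic code. As a minimal sketch, the Python below builds the standard codon table and translates the first 30 nucleotides and the last 30 nucleotides of the CDS listed above. The function name `translate` is a hypothetical helper for illustration, and the code assumes an unambiguous, in-frame DNA sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str, to_stop: bool = True) -> str:
    """Translate an in-frame DNA sequence; stop at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE[cds[i:i + 3]]  # KeyError on ambiguous bases
        if aa == "*" and to_stop:
            break
        protein.append(aa)
    return "".join(protein)


cds_first_30 = "ATGGCTCCGTCGCCGGCGGAACCGATCGGC"
cds_last_30 = "TGGCTCGAACTCGGGAATCAAGGCTCCTGA"
print(translate(cds_first_30))  # MAPSPAEPIG
print(translate(cds_last_30))   # WLELGNQGS
```

Both fragments match the start and end of the listed protein sequence. Passing the full CDS, with line breaks included, works the same way.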