; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr030095 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr030095
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionTransmembrane protein 161B
Genome locationtig00153554:2878303..2888340
RNA-Seq ExpressionSgr030095
SyntenySgr030095
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000232 - Heat shock factor (HSF)-type, DNA-binding
IPR019395 - Transmembrane protein 161A/B
IPR036388 - Winged helix-like DNA-binding domain superfamily
IPR036390 - Winged helix DNA-binding domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0033628.1 transmembrane protein 161B [Cucumis melo var. makuwa]3.9e-19780.58Show/hide
Query:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ
        M+LQ+LSIY+N LLHV LSLSLSVF+IFFK+P+LFLHGIFTYIHPDNTNSGVRAAIRRP+ S SGSGL+GYRNLSST  +EI+KRTKSKDKVEFDESKAQ
Subjt:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG
        IFRLKLDENHLQTRIYFKEYRD FTFSFVGISCLLLQ F+G SK SGVW +GIFVPLLF IFAGCKLF SL KVA EKSASRTLDRQLSLLFGV GFLFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG

Query:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI
        LLTCSA SPLILDFDLGEIGGLGACF+AILMG  AGFLFIPATKI+RSFWLGTDQIRCNL+MVYCGWFSRM+LY+SQ AMAFTTLLWVNPLAEIFI KNI
Subjt:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI

Query:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP
        GE      SS+ R ADRLVG++GFSK DFAKLRLWCL+LS  LQI+A+R NLQM+LNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCV+ALQFF+PP
Subjt:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP

Query:  ALVLLFVGLSQIGVNSLENTREGGGFKERRLRPGDLLSHFLFVLAATS
        ALVLLFVGLSQI + S +NT    G     +R   +L +F+ ++  T+
Subjt:  ALVLLFVGLSQIGVNSLENTREGGGFKERRLRPGDLLSHFLFVLAATS

KAG6574167.1 Heat stress transcription factor B-2b, partial [Cucurbita argyrosperma subsp. sororia]2.1e-30769.59Show/hide
Query:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKD--KVEFDESK
        M+LQ  S+ RN LLHV LSLSLS F+IFFK+PTLFLHGIFTYIHPDN +SGVRAAIRRPD S SGSGLEGYRNLSST A+EIRKRTKSKD  KVEFDESK
Subjt:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKD--KVEFDESK

Query:  AQIFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFL
        AQIFRLKLDENHLQTRIYFKEYRD FTFSFVGISCLLLQ FLG S++SGVW +G+FVPLLFAIFAGCKLF SL KVALEKSASR+LDRQLSLLFGV GFL
Subjt:  AQIFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFL

Query:  FGLLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINK
        FGLLTCS+ SP ILDFDL +I GLG  FVA+LMGCF+GFLFIPATKI+RSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAM FTTLLWV PLAEIFINK
Subjt:  FGLLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINK

Query:  NIGESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFS
        NIG S+ E   +EI  ADRLVGN+GFSK DFAKLRLWCL+LSG LQIIA+R NLQMYLNEALLSWYQRLHAGKVP+LDFSRAKVFLHNHYLCVA+LQFF+
Subjt:  NIGESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFS

Query:  PPALVLLFVGLSQIGVNSLENTREGGGFKERRLRPGDLLSHFLFVLAATSEIVCRIKMASSLKVNLEKAVARGIDINMERVVAMAAELGCSSEVKRKMLV
        PPALVLLF GLSQI +NSLE T                L+ F+                                            L CS  +K+  L 
Subjt:  PPALVLLFVGLSQIGVNSLENTREGGGFKERRLRPGDLLSHFLFVLAATSEIVCRIKMASSLKVNLEKAVARGIDINMERVVAMAAELGCSSEVKRKMLV

Query:  LGNCDFTFSKRWRWCWEFEEKGCGFAYEMALENCKSTQQFPGYFSLLFPVKFASPSLLLRAGSD--FLILVAPLMGEPSAK--AAGGDSQRSAPTPFLTK
        +    F F   W    +F  K   ++    + +  ST    G                LR+  D  FLILV PLM EP+AK  +AG DS RS PTPFL+K
Subjt:  LGNCDFTFSKRWRWCWEFEEKGCGFAYEMALENCKSTQQFPGYFSLLFPVKFASPSLLLRAGSD--FLILVAPLMGEPSAK--AAGGDSQRSAPTPFLTK

Query:  TYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKKQLLCEIQRRKLLTPLPSTATNVV
        TYQLVDD+T+DHVISWNDDGSTFIVWNTL FARDLLPK+FKHNNFTSFLRQLNTYGFRKVV+DRWEFANECF KGKKQLLCEIQRRKL TP+ +TA+N V
Subjt:  TYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKKQLLCEIQRRKLLTPLPSTATNVV

Query:  VSTTAMTTVGAAASPTAQPLTLPGNSGEDQVISSSASPTRALAELIDENDRLRKEKVRLTEQLVEVKSVCNNIFSLMSSFVENQFKSSFKVRESVSTSTK
        V    +T VGA+A P+A  LT   +SGE+QVISS  +PT+ LAELIDEND+L+KEKVRLTEQL EVKS+CNNIFSLMSSFVENQ KSS KVRESV  S K
Subjt:  VSTTAMTTVGAAASPTAQPLTLPGNSGEDQVISSSASPTRALAELIDENDRLRKEKVRLTEQLVEVKSVCNNIFSLMSSFVENQFKSSFKVRESVSTSTK

Query:  SLDLFPVKRSSGEDE-AEATEAERNQIGAPIGAKRAREHREGYSTPAEDDTALRLQPPERSEVKSEQL
        SLDLFPVKR S +DE AEATEAE  Q    IGAKR RE+REG    AEDDT LRLQPPERS VKS+++
Subjt:  SLDLFPVKRSSGEDE-AEATEAERNQIGAPIGAKRAREHREGYSTPAEDDTALRLQPPERSEVKSEQL

XP_004140782.1 uncharacterized protein LOC101217739 [Cucumis sativus]1.0e-20086.19Show/hide
Query:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ
        M+LQILSIYRN LLHV LSLSLSVF+IFF +P++FLHGIFTYIHPDNTNSGVRAAIRRPD+S SG+GL+GYRNLSST+A+EI+KRTKSKDK EFDESKAQ
Subjt:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG
        IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQ F+G SKNSGVW +GIFVPLLF IFAGCKLF SL KVA EKSASRTLDRQLSLLFGV GFLFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG

Query:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI
        LLTCSA SPLILDFDL EIGG+GACFVAILMG  AGFLFIPATKI+RSFWLGTDQIRCNL+MVYCGWFSR++LY+SQ AMAFTTLLWVNPLAEIFI KNI
Subjt:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI

Query:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP
        GE      SSEIR ADRLVG++GFSK DFAKLRLWCL+LSG LQIIA+R NLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCV+ALQFF+PP
Subjt:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP

Query:  ALVLLFVGLSQIGVNSLENT
        ALVLLFVGLSQI +NS +NT
Subjt:  ALVLLFVGLSQIGVNSLENT

XP_022141264.1 uncharacterized protein LOC111011705 [Momordica charantia]2.1e-20689.05Show/hide
Query:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ
        MVLQ  S Y N LL V+LSLSLSVFLIFF++PTLFLHGIFTYIHPDN +SGVRAAIRRP+NSGSGSGL+GYRNLSS +ASEIRKRTKSKDKVEFDESKAQ
Subjt:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG
        IFRLKLDENHLQTRIYFKEYRD FTF+FVGISCLLLQ+FLGVSK+SG+W +GI VPLLF+IFAGCKLF +LAKVA+EKSASRTLDRQLSLLFGVSG LFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG

Query:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI
        LLTCSAL+PLILDF+LGEIGG GACFVAILMGCFAGFLFIPATKI+RSFWLGTDQIRCNLEMVYCGWFSRMILYASQ+AMAFTTLLWVNPL EIFINKNI
Subjt:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI

Query:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP
        GES SE T SEIR ADRLVGNMGFSKPDF KLRLWCLSLSGLLQIIA+R NLQM+LNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCV+ALQFF+PP
Subjt:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP

Query:  ALVLLFVGLSQIGVNSLENT
        ALV LFVGLSQI VNSLENT
Subjt:  ALVLLFVGLSQIGVNSLENT

XP_038882659.1 transmembrane protein 161B [Benincasa hispida]3.0e-20588.33Show/hide
Query:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ
        M+LQILSIYRN +LHV LSLSLS F+IFFK+P+LFLHGIFTYIHPDNTNSGVRAAI RPD S S SGLEGYRNLSST A+EIRKRTKSKDKVEFDESKAQ
Subjt:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG
        IFRLKLDENHLQTRIY+KEYRD FTF+FVGISCLLLQ FLG SKNSGVW +GIFVPLLFAIFAGCKLF SLAKVALEKSASRTLDRQLSLLFGV GFLFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG

Query:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI
        LLTCSA SPLILDFDLGEIGGLGAC VAILMGCF GFLFIPATKI+RSFWLGTDQIRCNL+MVYCGWFSRM+LY+SQVAMA TTLLWVNPLAEIFINKNI
Subjt:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI

Query:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP
        GE   E  SSEIR ADRLVG++GFS+ DFAKL+LWCLSLSG LQIIA+R NLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCV+ALQFF+PP
Subjt:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP

Query:  ALVLLFVGLSQIGVNSLENT
        ALVLLFVGLSQIG+NSL+NT
Subjt:  ALVLLFVGLSQIGVNSLENT

TrEMBL top hitse value%identityAlignment
A0A0A0L8M8 Uncharacterized protein4.9e-20186.19Show/hide
Query:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ
        M+LQILSIYRN LLHV LSLSLSVF+IFF +P++FLHGIFTYIHPDNTNSGVRAAIRRPD+S SG+GL+GYRNLSST+A+EI+KRTKSKDK EFDESKAQ
Subjt:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG
        IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQ F+G SKNSGVW +GIFVPLLF IFAGCKLF SL KVA EKSASRTLDRQLSLLFGV GFLFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG

Query:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI
        LLTCSA SPLILDFDL EIGG+GACFVAILMG  AGFLFIPATKI+RSFWLGTDQIRCNL+MVYCGWFSR++LY+SQ AMAFTTLLWVNPLAEIFI KNI
Subjt:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI

Query:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP
        GE      SSEIR ADRLVG++GFSK DFAKLRLWCL+LSG LQIIA+R NLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCV+ALQFF+PP
Subjt:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP

Query:  ALVLLFVGLSQIGVNSLENT
        ALVLLFVGLSQI +NS +NT
Subjt:  ALVLLFVGLSQIGVNSLENT

A0A1S3AYU7 transmembrane protein 161B7.3e-19784.76Show/hide
Query:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ
        M+LQ+LSIY+N LLHV LSLSLSVF+IFFK+P+LFLHGIFTYIHPDNTNSGVRAAIRRP+ S SGSGL+GYRNLSST  +EI+KRTKSKDKVEFDESKAQ
Subjt:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG
        IFRLKLDENHLQTRIYFKEYRD FTFSFVGISCLLLQ F+G SK SGVW +GIFVPLLF IFAGCKLF SL KVA EKSASRTLDRQLSLLFGV GFLFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG

Query:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI
        LLTCSA SPLILDFDLGEIGGLGACF+AILMG  AGFLFIPATKI+RSFWLGTDQIRCNL+MVYCGWFSRM+LY+SQ AMAFTTLLWVNPLAEIFI KNI
Subjt:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI

Query:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP
        GE      SS+ R ADRLVG++GFSK DFAKLRLWCL+LS  LQI+A+R NLQM+LNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCV+ALQFF+PP
Subjt:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP

Query:  ALVLLFVGLSQIGVNSLENT
        ALVLLFVGLSQI + S +NT
Subjt:  ALVLLFVGLSQIGVNSLENT

A0A5A7SSR6 Transmembrane protein 161B1.9e-19780.58Show/hide
Query:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ
        M+LQ+LSIY+N LLHV LSLSLSVF+IFFK+P+LFLHGIFTYIHPDNTNSGVRAAIRRP+ S SGSGL+GYRNLSST  +EI+KRTKSKDKVEFDESKAQ
Subjt:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG
        IFRLKLDENHLQTRIYFKEYRD FTFSFVGISCLLLQ F+G SK SGVW +GIFVPLLF IFAGCKLF SL KVA EKSASRTLDRQLSLLFGV GFLFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG

Query:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI
        LLTCSA SPLILDFDLGEIGGLGACF+AILMG  AGFLFIPATKI+RSFWLGTDQIRCNL+MVYCGWFSRM+LY+SQ AMAFTTLLWVNPLAEIFI KNI
Subjt:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI

Query:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP
        GE      SS+ R ADRLVG++GFSK DFAKLRLWCL+LS  LQI+A+R NLQM+LNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCV+ALQFF+PP
Subjt:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP

Query:  ALVLLFVGLSQIGVNSLENTREGGGFKERRLRPGDLLSHFLFVLAATS
        ALVLLFVGLSQI + S +NT    G     +R   +L +F+ ++  T+
Subjt:  ALVLLFVGLSQIGVNSLENTREGGGFKERRLRPGDLLSHFLFVLAATS

A0A6J1CI38 uncharacterized protein LOC1110117051.0e-20689.05Show/hide
Query:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ
        MVLQ  S Y N LL V+LSLSLSVFLIFF++PTLFLHGIFTYIHPDN +SGVRAAIRRP+NSGSGSGL+GYRNLSS +ASEIRKRTKSKDKVEFDESKAQ
Subjt:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG
        IFRLKLDENHLQTRIYFKEYRD FTF+FVGISCLLLQ+FLGVSK+SG+W +GI VPLLF+IFAGCKLF +LAKVA+EKSASRTLDRQLSLLFGVSG LFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFG

Query:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI
        LLTCSAL+PLILDF+LGEIGG GACFVAILMGCFAGFLFIPATKI+RSFWLGTDQIRCNLEMVYCGWFSRMILYASQ+AMAFTTLLWVNPL EIFINKNI
Subjt:  LLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNI

Query:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP
        GES SE T SEIR ADRLVGNMGFSKPDF KLRLWCLSLSGLLQIIA+R NLQM+LNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCV+ALQFF+PP
Subjt:  GESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPP

Query:  ALVLLFVGLSQIGVNSLENT
        ALV LFVGLSQI VNSLENT
Subjt:  ALVLLFVGLSQIGVNSLENT

A0A6J1G034 uncharacterized protein LOC1114494561.2e-19484.36Show/hide
Query:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKD--KVEFDESK
        M+LQ +S+ RN LLHV LSLSLS F+IFFK+PTLFLHGIFTYIHPDN +SGVRAAIRRPD S SGSGLEGYRNLSSTSA+EIRKRTKSKD  KVEFDESK
Subjt:  MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKD--KVEFDESK

Query:  AQIFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFL
        AQIFRLKLDENHLQTRIYFKEYRD FTFSFVGISCLLLQ FLG S++SGVW +G+FVPLLFAIFAGCKLF SL KVALEKSASR+LDRQLSLLFGV GFL
Subjt:  AQIFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFL

Query:  FGLLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINK
        FGLLTCS+ SP ILDFDL +I GLG  FVA+LMGCF+GFLFIPATKI+RSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAM FTTLLWV PLAEIFINK
Subjt:  FGLLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINK

Query:  NIGESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFS
        NIG S+ E   +EI  ADRLVGN+GFSK DF KLRLWCL+LSG LQIIA+R NLQMYLNEALLSWYQRLHAGKVP+LDFSRAKVFLHNHYLCV +LQFF+
Subjt:  NIGESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFS

Query:  PPALVLLFVGLSQIGVNSLENT
        PPALVLLF GLSQI +NSLE T
Subjt:  PPALVLLFVGLSQIGVNSLENT

SwissProt top hitse value%identityAlignment
P22335 Heat shock factor protein HSF246.6e-4648.79Show/hide
Query:  SQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKKQLLCEIQRRKL
        SQR+AP PFL KTYQLVDD   D VISWN+ G+TF+VW T  FA+DLLPKYFKHNNF+SF+RQLNTYGFRK+V D+WEFANE F +G+K+LL  I+RRK 
Subjt:  SQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKKQLLCEIQRRKL

Query:  LTPLPSTATNVVVSTTAMTTVGAAASPTAQPLTLPGNSGEDQVISSSASP-------------TRALAELIDENDRLRKEKVRLTEQLVEVKSVCNNIFS
        +T  P+   +V          GA+AS        P NSG+D   SS++SP                  +L DEN++L+K+   L+ +LV+ K  CN + +
Subjt:  LTPLPSTATNVVVSTTAMTTVGAAASPTAQPLTLPGNSGEDQVISSSASP-------------TRALAELIDENDRLRKEKVRLTEQLVEVKSVCNNIFS

Query:  LMSSFVE
         +S +V+
Subjt:  LMSSFVE

Q652B0 Heat stress transcription factor B-2c7.3e-5348.36Show/hide
Query:  GEP------SAKAAGGDSQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANEC
        GEP      +A A     QRS PTPFLTKTYQLV+D  +D VISWN+DGSTF+VW    FARDLLPKYFKHNNF+SF+RQLNTYGFRK+V DRWEFAN+C
Subjt:  GEP------SAKAAGGDSQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANEC

Query:  FLKGKKQLLCEIQRRKLLT--------PLPSTATNVVVSTTAMTTVGAAASPTAQPLT---LPGNSGEDQVISSSASP--------------------TR
        F +G+K+LLC+I RRK++         P P  AT      +   TV AA  P A P+T    P +S E+QV+SS++                      + 
Subjt:  FLKGKKQLLCEIQRRKLLT--------PLPSTATNVVVSTTAMTTVGAAASPTAQPLT---LPGNSGEDQVISSSASP--------------------TR

Query:  ALAELIDENDRLRKEKVRLTEQLVEVKSVCNNIFSLMSSFVENQ
        +  ++ +EN+RLR+E  RLT +L  +K +CNNI  LMS +   Q
Subjt:  ALAELIDENDRLRKEKVRLTEQLVEVKSVCNNIFSLMSSFVENQ

Q6Z9C8 Heat stress transcription factor B-2b4.1e-5642.57Show/hide
Query:  APLMGEPS-------AKAAG---GDSQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTD
        A  MGEPS       A+AAG   G  QR+ PTPFLTKTYQLVDD  +D VISWNDDGSTF+VW    FARDLLPKYFKHNNF+SF+RQLNTYGFRK+V D
Subjt:  APLMGEPS-------AKAAG---GDSQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTD

Query:  RWEFANECFLKGKKQLLCEIQRRKLLTPLPSTATNVVVSTTAMTTVGAAASPTAQPLTL-----PGNSGEDQVISSSASPTRAL----------------
        RWEFAN+CF +G+++LLCEI RRK+  P P+  T  V          AAA P A P+T      P  SGE+QVISSS+SP   L                
Subjt:  RWEFANECFLKGKKQLLCEIQRRKLLTPLPSTATNVVVSTTAMTTVGAAASPTAQPLTL-----PGNSGEDQVISSSASPTRAL----------------

Query:  AELIDENDRLRKEKVRLTEQLVEVKSVCNNIFSLMSSFVENQFKSSFKVRESVSTSTKS------------------LDLFPVKRSSGEDEAEATEAERN
         ++ DEN+RLR+E  +L  +L +++ +CNNI  LMS +   Q   +     +   +  +                  LDL P    +    A  ++ E  
Subjt:  AELIDENDRLRKEKVRLTEQLVEVKSVCNNIFSLMSSFVENQFKSSFKVRESVSTSTKS------------------LDLFPVKRSSGEDEAEATEAERN

Query:  QI-----GAPIGAKRAREHREGYSTPAEDDTA--LRLQPPE-RSEVKSEQ
         +     G  IG KR R    G     +DD A  ++ +P + R   K EQ
Subjt:  QI-----GAPIGAKRAREHREGYSTPAEDDTA--LRLQPPE-RSEVKSEQ

Q9SCW4 Heat stress transcription factor B-2a6.6e-5446.35Show/hide
Query:  SQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKKQLLCEIQRRKL
        SQRS PTPFLTKT+ LV+D +ID VISWN+DGS+FIVWN   FA+DLLPK+FKHNNF+SF+RQLNTYGF+KVV DRWEF+N+ F +G+K+LL EIQRRK 
Subjt:  SQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKKQLLCEIQRRKL

Query:  LTPLPSTATNVVVSTTAMTTVGAAASPTAQPLTL-PGNSGED----QVISSSAS-----PTRAL------AELIDENDRLRKEKVRLTEQLVEVKSVCNN
                    ++TT  T V  ++    Q + + P NSGED    QV+SSS S      T+         EL++EN++LR + ++L  +L ++KS+C+N
Subjt:  LTPLPSTATNVVVSTTAMTTVGAAASPTAQPLTL-PGNSGED----QVISSSAS-----PTRAL------AELIDENDRLRKEKVRLTEQLVEVKSVCNN

Query:  IFSLMSSFVENQFKSSFKVRESVSTSTKSLDLFPVKRSSGEDEAEATEAERNQIGAPIGAKRAREHREGYSTPA
        I+SLMS++V +Q   + +      +S++ ++  P KR S  +  E  EA     G PIG KR R       T A
Subjt:  IFSLMSSFVENQFKSSFKVRESVSTSTKSLDLFPVKRSSGEDEAEATEAERNQIGAPIGAKRAREHREGYSTPA

Q9T0D3 Heat stress transcription factor B-2b5.6e-6143.84Show/hide
Query:  GEPSAKAAGGDSQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKK
        G       GGDSQRS PTPFLTKTYQLV+D   D +ISWN+DG+TFIVW    FARDLLPKYFKHNNF+SF+RQLNTYGFRKVV DRWEF+N+CF +G+K
Subjt:  GEPSAKAAGGDSQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKK

Query:  QLLCEIQRRKLLTPLPSTATNVVVSTTAMTTVGAAASPTAQPLTLPGNSGEDQVISSSASPTRALA---------------------ELIDENDRLRKEK
         LL +IQRRK+  P  + A     +  A + V  AA P    +  P NSGE+QVISS++SP  A A                     EL++EN+RLRK+ 
Subjt:  QLLCEIQRRKLLTPLPSTATNVVVSTTAMTTVGAAASPTAQPLTLPGNSGEDQVISSSASPTRALA---------------------ELIDENDRLRKEK

Query:  VRLTEQLVEVKSVCNNIFSLMSSFVENQFKSSFKVRESVSTSTKSLDLFPVKRSSGEDEAEATEAER------------NQIGAPIGAKRAREHREGYST
         RL +++ ++K +  NI++LM++F   Q   +  + E      K LDL P +R    +   A+E E                G  IG KRAR   E  + 
Subjt:  VRLTEQLVEVKSVCNNIFSLMSSFVENQFKSSFKVRESVSTSTKSLDLFPVKRSSGEDEAEATEAER------------NQIGAPIGAKRAREHREGYST

Query:  PAEDD---TALRLQPPERSEVKSEQLNSSSSGN
          EDD    A   +  + S+VK+E +  ++SGN
Subjt:  PAEDD---TALRLQPPERSEVKSEQLNSSSSGN

Arabidopsis top hitse value%identityAlignment
AT1G46264.1 heat shock transcription factor B41.6e-3944.5Show/hide
Query:  RSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKKQLLCEIQRRKLLT
        ++ P PFLTKTYQLVDD   DHV+SW DD +TF+VW    FARDLLP YFKHNNF+SF+RQLNTYGFRK+V DRWEFANE F +G+K LLCEI RRK   
Subjt:  RSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKKQLLCEIQRRKLLT

Query:  PLPSTATNVVVSTTAMTTV---GAAASPTAQP-LTLPGNS---------GEDQVISSSASPTRALAELIDENDRLRKEKVRLTEQLVEVKSVCNNIFSLM
         +P   +  +    A   +   G +  P   P +T P               +VI         +  L ++N+RLR+    L  +L  +K + N+I    
Subjt:  PLPSTATNVVVSTTAMTTV---GAAASPTAQP-LTLPGNS---------GEDQVISSSASPTRALAELIDENDRLRKEKVRLTEQLVEVKSVCNNIFSLM

Query:  SSFVENQFK
          FV+N  K
Subjt:  SSFVENQFK

AT4G11660.1 winged-helix DNA-binding transcription factor family protein4.0e-6243.84Show/hide
Query:  GEPSAKAAGGDSQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKK
        G       GGDSQRS PTPFLTKTYQLV+D   D +ISWN+DG+TFIVW    FARDLLPKYFKHNNF+SF+RQLNTYGFRKVV DRWEF+N+CF +G+K
Subjt:  GEPSAKAAGGDSQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKK

Query:  QLLCEIQRRKLLTPLPSTATNVVVSTTAMTTVGAAASPTAQPLTLPGNSGEDQVISSSASPTRALA---------------------ELIDENDRLRKEK
         LL +IQRRK+  P  + A     +  A + V  AA P    +  P NSGE+QVISS++SP  A A                     EL++EN+RLRK+ 
Subjt:  QLLCEIQRRKLLTPLPSTATNVVVSTTAMTTVGAAASPTAQPLTLPGNSGEDQVISSSASPTRALA---------------------ELIDENDRLRKEK

Query:  VRLTEQLVEVKSVCNNIFSLMSSFVENQFKSSFKVRESVSTSTKSLDLFPVKRSSGEDEAEATEAER------------NQIGAPIGAKRAREHREGYST
         RL +++ ++K +  NI++LM++F   Q   +  + E      K LDL P +R    +   A+E E                G  IG KRAR   E  + 
Subjt:  VRLTEQLVEVKSVCNNIFSLMSSFVENQFKSSFKVRESVSTSTKSLDLFPVKRSSGEDEAEATEAER------------NQIGAPIGAKRAREHREGYST

Query:  PAEDD---TALRLQPPERSEVKSEQLNSSSSGN
          EDD    A   +  + S+VK+E +  ++SGN
Subjt:  PAEDD---TALRLQPPERSEVKSEQLNSSSSGN

AT4G36990.1 heat shock factor 45.2e-3841.03Show/hide
Query:  SQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKKQLLCEIQRRKL
        +QRS P PFL+KTYQLVDD + D V+SWN++G+ F+VW T  FA+DLLP+YFKHNNF+SF+RQLNTYGFRK V D+WEFAN+ F +G + LL +I+RRK 
Subjt:  SQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKKQLLCEIQRRKL

Query:  LTPLPSTATN-VVVSTTAMTTVGAAASPTAQPLTLPGNSGEDQVISSSASPTRALAELIDENDRLRKEKVRLTEQLVEVKSVCNNIFSLMSSFVENQFKS
         + + STA   VVV + + +  G      +   + PG+S       +  S    +A+L  EN++L++E   L+ +L   K   + + + ++  +      
Subjt:  LTPLPSTATN-VVVSTTAMTTVGAAASPTAQPLTLPGNSGEDQVISSSASPTRALAELIDENDRLRKEKVRLTEQLVEVKSVCNNIFSLMSSFVENQFKS

Query:  SFKVR-ESVSTSTKSLDLFPVKRSSGEDEAEATE
          KVR E +    K     PV+ S  E E E  +
Subjt:  SFKVR-ESVSTSTKSLDLFPVKRSSGEDEAEATE

AT5G52180.1 LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Transmembrane protein 161AB, predicted (InterPro:IPR019395); Has 82 Blast hits to 82 proteins in 35 species: Archae - 0; Bacteria - 0; Metazoa - 47; Fungi - 0; Plants - 33; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink).6.4e-11349.29Show/hide
Query:  VLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPD-----NTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDE
        +L++L  YRN  L +LLSL L++ L F K+  +FLHG+ TYI P+     NT +G+R AIRRP ++               S  E+R+R +SKDK EFDE
Subjt:  VLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPD-----NTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDE

Query:  SKAQIFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNS--GVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGV
        S AQIFR+KLDE+HL++R+YF EY   F  SF+ +SC LL  + G+ +++  GV  +G+  P++    A CK+F +L K+++E+SAS+  +++LSL+FGV
Subjt:  SKAQIFRLKLDENHLQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNS--GVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGV

Query:  SGFLFGLLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEI
         GF+FG++  + + P   DF LG +       ++  M C  GFL++PA + +RSFW+GTDQIR NL ++ CGWF RMILYA+ +   FT+LLW++PLAE+
Subjt:  SGFLFGLLTCSALSPLILDFDLGEIGGLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEI

Query:  FINKNIGESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAAL
         + +    S    TS   +    LVGN+G    DFAK R+ CL LSGLLQ +A+RPNLQM+LNEA+LSWYQRLH  K PDLDFSRAK+FLHNHYLC+ AL
Subjt:  FINKNIGESDSELTSSEIRYADRLVGNMGFSKPDFAKLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAAL

Query:  QFFSPPALVLLFVGLSQIGVNS
        QF +P  LV+LF+GLSQI ++S
Subjt:  QFFSPPALVLLFVGLSQIGVNS

AT5G62020.1 heat shock transcription factor B2A4.7e-5546.35Show/hide
Query:  SQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKKQLLCEIQRRKL
        SQRS PTPFLTKT+ LV+D +ID VISWN+DGS+FIVWN   FA+DLLPK+FKHNNF+SF+RQLNTYGF+KVV DRWEF+N+ F +G+K+LL EIQRRK 
Subjt:  SQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWEFANECFLKGKKQLLCEIQRRKL

Query:  LTPLPSTATNVVVSTTAMTTVGAAASPTAQPLTL-PGNSGED----QVISSSAS-----PTRAL------AELIDENDRLRKEKVRLTEQLVEVKSVCNN
                    ++TT  T V  ++    Q + + P NSGED    QV+SSS S      T+         EL++EN++LR + ++L  +L ++KS+C+N
Subjt:  LTPLPSTATNVVVSTTAMTTVGAAASPTAQPLTL-PGNSGED----QVISSSAS-----PTRAL------AELIDENDRLRKEKVRLTEQLVEVKSVCNN

Query:  IFSLMSSFVENQFKSSFKVRESVSTSTKSLDLFPVKRSSGEDEAEATEAERNQIGAPIGAKRAREHREGYSTPA
        I+SLMS++V +Q   + +      +S++ ++  P KR S  +  E  EA     G PIG KR R       T A
Subjt:  IFSLMSSFVENQFKSSFKVRESVSTSTKSLDLFPVKRSSGEDEAEATEAERNQIGAPIGAKRAREHREGYSTPA


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCTGCAAATTCTTTCGATATACAGGAATTTTCTTCTTCATGTCCTACTGTCTCTCTCGCTTTCTGTTTTTCTCATCTTCTTCAAAGTCCCAACCCTTTTCCTCCA
TGGCATATTCACCTATATTCACCCAGATAACACCAACAGCGGCGTCCGAGCCGCTATTAGAAGACCTGACAACTCTGGCTCCGGTTCTGGACTGGAGGGGTACCGAAACT
TGTCCTCAACTAGTGCCTCTGAGATCAGGAAAAGAACGAAGTCGAAGGACAAGGTTGAGTTCGACGAAAGCAAAGCTCAGATCTTCAGGTTAAAGCTGGATGAGAATCAT
CTGCAAACGCGGATCTATTTCAAAGAATACAGAGATGCTTTCACTTTTTCATTCGTGGGTATTTCTTGTTTACTGCTGCAAAATTTCTTGGGTGTATCTAAAAATTCTGG
GGTTTGGGAAAGTGGGATTTTCGTTCCTCTTCTGTTCGCGATCTTCGCTGGATGTAAGCTGTTTGAATCGCTCGCAAAGGTTGCTTTGGAGAAATCTGCGTCAAGGACGT
TGGATAGGCAATTGAGCTTACTGTTTGGAGTTTCTGGGTTTCTTTTTGGACTTCTAACCTGTTCTGCTCTTAGCCCTCTAATATTGGATTTTGATCTTGGTGAAATTGGC
GGTCTAGGGGCATGCTTCGTAGCTATCTTAATGGGCTGTTTTGCGGGGTTTTTGTTTATACCTGCGACAAAAATCTCTCGATCATTTTGGCTTGGAACCGATCAAATTAG
ATGCAACTTGGAAATGGTTTATTGTGGATGGTTTTCTCGGATGATTTTGTATGCAAGTCAAGTAGCCATGGCTTTCACCACTCTGCTTTGGGTTAACCCATTAGCTGAAA
TTTTCATTAACAAGAATATTGGCGAAAGTGATAGTGAACTTACGTCCAGTGAAATCCGATATGCTGACAGATTGGTTGGAAATATGGGATTCTCAAAGCCAGATTTTGCT
AAGCTTAGGCTCTGGTGTTTGTCACTGTCTGGTCTCTTGCAGATCATCGCCCTAAGGCCAAACTTGCAAATGTATCTGAACGAAGCTCTGTTATCTTGGTACCAAAGACT
GCATGCTGGGAAGGTTCCAGACTTGGATTTCAGTAGAGCAAAGGTTTTTCTGCACAATCACTACTTATGTGTGGCTGCTTTGCAGTTTTTTTCTCCACCAGCCTTAGTTT
TACTTTTTGTTGGGTTATCTCAAATTGGTGTCAACTCTTTGGAAAATACAAGAGAGGGTGGAGGCTTCAAGGAGAGGAGGCTTAGGCCAGGGGATCTACTTTCTCATTTT
CTTTTTGTGTTGGCGGCGACGTCTGAAATCGTTTGTCGAATAAAGATGGCATCAAGTTTGAAGGTGAATTTGGAGAAAGCTGTTGCTCGTGGAATTGATATCAATATGGA
AAGAGTGGTTGCTATGGCTGCAGAATTGGGCTGTTCCTCAGAGGTAAAGAGAAAAATGTTAGTATTGGGAAATTGTGACTTTACCTTTAGCAAAAGGTGGCGTTGGTGTT
GGGAATTTGAAGAAAAAGGATGTGGCTTTGCTTATGAAATGGCTTTAGAGAATTGCAAATCCACCCAACAATTCCCGGGATATTTCTCTCTACTTTTTCCGGTGAAATTC
GCATCACCAAGCTTACTTTTGCGTGCGGGATCTGATTTCCTGATTCTTGTGGCTCCTCTGATGGGGGAGCCGAGCGCCAAAGCGGCGGGTGGTGATTCGCAGAGATCTGC
GCCGACTCCCTTTTTGACGAAAACGTATCAGCTCGTTGACGACGAGACAATCGACCATGTAATTTCGTGGAACGACGATGGATCTACTTTTATCGTCTGGAACACCCTAA
CTTTTGCCAGAGATTTGCTGCCCAAGTATTTCAAGCACAATAACTTCACCAGCTTTCTCCGCCAGCTCAACACTTATGGATTCAGAAAGGTTGTAACGGACCGATGGGAA
TTCGCGAACGAGTGCTTTCTCAAAGGCAAGAAACAACTTCTATGTGAGATTCAACGCCGTAAATTGTTGACTCCGTTGCCGTCGACTGCTACTAACGTGGTTGTCTCCAC
AACGGCGATGACAACCGTCGGTGCTGCTGCAAGTCCGACTGCGCAACCACTAACATTGCCGGGAAATTCCGGTGAAGACCAAGTGATCTCGTCGAGTGCATCTCCGACGA
GAGCTCTTGCGGAGCTGATCGACGAGAATGATAGGCTGAGAAAAGAGAAAGTCCGACTTACAGAACAATTGGTCGAGGTGAAATCTGTGTGCAACAACATCTTCTCGTTG
ATGTCGAGCTTCGTTGAGAATCAATTCAAGAGCAGTTTCAAAGTGAGAGAGAGCGTTTCGACATCTACGAAATCGCTCGATCTTTTCCCGGTGAAGCGCTCTTCCGGCGA
GGACGAAGCGGAGGCCACGGAGGCGGAGAGGAATCAGATCGGCGCGCCCATCGGAGCGAAGCGAGCGAGGGAGCATAGGGAGGGCTATAGCACTCCGGCGGAGGACGATA
CTGCTTTGCGGCTTCAACCGCCGGAAAGATCGGAAGTCAAATCAGAGCAGTTAAATAGTTCTAGTTCAGGCAATCCTGCGATGCAGAGAATCGTTCCCCAAATTGGGGGA
GGGGAGATAGCACGTGCTGTAAATCTTTGGCACGCAAACACAAGCAATCTGAAGAGTTTGAAGCATCATATGCTTCGAATCAGCATTCAGAACTGTCGTGTCCAGAACGT
CCGCAGCCTGACCCTTCTTAATCTTCTGAAACACCCAACCAACCAGATTTCCACCCTCAATCTCTTTGAAGTCAGGTCCAGTTGGTTCCTTCCCAGTTACCAACTCCAGC
AGAATCACACCGAAGCTATAAACGTCTCCTTTCGTAGTAGACCTCCCGCTCTGCCCGTACTCCGGTGGGATGTAACCAAAGGTTCCAGCAATCTCAGTTGTGACATGAGT
CTCACAAGCACTGATCAGTCTTGCCAACCCGAAGTCTGCAACTTTCGGCTCGAAATCTTCATTGAGGAGTATATTGCTTGCTTTAATGTCCCTATGAATGATGTGGGGGA
TGAATCCATGGTGAAGAAATGCCAACCCACGGGCTGCGCCAGAAGCGACTTTGAAGCGTGTCTCCCAGTTAAGGACTTCAAGAGTTCCGGTTCGGTTTCTCAGCCAAAGA
TCCAAGCTACCATTCACCATATACTCATAGACAAAGTTTCCATTTCAGCAGTAAATTCTCTGTGCCCCTGTGTCTTTGCTTCGCTTAGCTTCTTCACAGCAACAGTTTTA
CCACCAGGCAAAGTGGCCTTGTATACAGTCCCGAAACCTCCATCTCCAATTATGTTTGTTTTGCAGAAGTTATTGGTTGCTTCAAGGATATCAACTAATGTTAATTTCAA
AAGGGGCTGCTCGAACATGGCAACATTGATGCTCAAAGCAAGGCTCCAAGCATTCAAGACTGCAGATCTCTCTAAGCTTTTGATCTGGCAATTGAAACCCATGATTCTCC
CACAAAGATCCTTGTTACCGGCGAGTGAACTTTTGGATAGATTCTGGCAGATGCCACTTCTCGGAATTGGCCCTTCTAGACTGTTTTCTGCCAAATTCAGGCCTTGGAGC
TTGAGAGCATCGCCGATCTCTGCGGGAATAGGGCCAGTAAGCATATTGCCAGACAAATCCAAGGTTGTCAGGCACTGTAACTCAGTAAGGTCGGCAAGTCTTTCCGGAAT
CGACCCGTTTAGACTGTTGTTTCCAAGATCCAACGTGGTAAGTGCACTGCAATCTCCAAGCATGGATGGAATAGTTCCTTCAAGCAGATTTGAATTCAAATTTAGAACAG
AAAGGGCTGGAGATGACCCTCCAACCGGTTATTTGCAGCAGAGAATTCCATCAAATCTACTGAGTTCCATATACTTCCCGGTTCCTACACTGCACAAATGTGTCGTCAAT
TGTACCCGAAAGGAAATTACTGTCAAGATCAATCTCCGTCAAGGATGCTGCATTACAGATTTCTTTAGGTATTGGACCTGTCAACAAGTTATTGCTCAAACTAAGGTGCT
TAAGCATCGAGCAATTTCCAATCTCAGGTGGGATTTTCCCGGTAAACCGATTGCTCGAGAGCAAAATAGAATCGACATGATTCCATTTGCCGAGCCAGGAAGTATAGTCA
AGTTCTGCAACTCGCCAATTGATTTCGGGATAGAACAGCCGAGTGGGTTGTATGAAAGGTCCAGTTTGCTCAATGATTTCAACTTGGATAGTTCTTCAGGCAATGGACCC
GTTAGAGAACAAGAAGGCGAGAAAAAGTTCTCGAGCAATGCAAGGTTACCAATTTCAGGTGGGATGGAACCAGAAAACGAGTTGTTTGAAATGTCCAAAGAAGTTAGAGA
TTTAAGCTCAGTAAAGATGGTCAATGGGAGTGAACCTGATAAAAGATTATTGCCGAGGTCTAAGGATACAATCATCGTCAAGTTTCCGATGTGGGCCGGAACATTCCCGA
CGAAGGCATTGCCGGAGAGGTCGAGGGTCCGTAGCAGTTTCAAATTACCAAGCTCCAGCGGGATTCTACCTGCCGAGAGCGAGCACCTTCAAGCTTTGAAGATTGGATAT
TTGAGGTGGGATTGGGCCATAGAGGAAGTTGGAGGAAAGGTCAAGAACCCTCTCAAGCGCAGCTTTGAAAGAAAGCAAGCTTTCTTTCTCCACACTAACCTCATTCTGGT
CTGTAACGCCATCGGAGCTCAAAATGCAGAGCTCGAAGCAGACAATGAAAATGAGAAAGAAACGCTTCAACTCCATCCCCATCTGAAAGAACATGATATCCCACTATTCT
ACATCCGAGCTGCTGAAAAATGA
mRNA sequenceShow/hide mRNA sequence
ATGGTGCTGCAAATTCTTTCGATATACAGGAATTTTCTTCTTCATGTCCTACTGTCTCTCTCGCTTTCTGTTTTTCTCATCTTCTTCAAAGTCCCAACCCTTTTCCTCCA
TGGCATATTCACCTATATTCACCCAGATAACACCAACAGCGGCGTCCGAGCCGCTATTAGAAGACCTGACAACTCTGGCTCCGGTTCTGGACTGGAGGGGTACCGAAACT
TGTCCTCAACTAGTGCCTCTGAGATCAGGAAAAGAACGAAGTCGAAGGACAAGGTTGAGTTCGACGAAAGCAAAGCTCAGATCTTCAGGTTAAAGCTGGATGAGAATCAT
CTGCAAACGCGGATCTATTTCAAAGAATACAGAGATGCTTTCACTTTTTCATTCGTGGGTATTTCTTGTTTACTGCTGCAAAATTTCTTGGGTGTATCTAAAAATTCTGG
GGTTTGGGAAAGTGGGATTTTCGTTCCTCTTCTGTTCGCGATCTTCGCTGGATGTAAGCTGTTTGAATCGCTCGCAAAGGTTGCTTTGGAGAAATCTGCGTCAAGGACGT
TGGATAGGCAATTGAGCTTACTGTTTGGAGTTTCTGGGTTTCTTTTTGGACTTCTAACCTGTTCTGCTCTTAGCCCTCTAATATTGGATTTTGATCTTGGTGAAATTGGC
GGTCTAGGGGCATGCTTCGTAGCTATCTTAATGGGCTGTTTTGCGGGGTTTTTGTTTATACCTGCGACAAAAATCTCTCGATCATTTTGGCTTGGAACCGATCAAATTAG
ATGCAACTTGGAAATGGTTTATTGTGGATGGTTTTCTCGGATGATTTTGTATGCAAGTCAAGTAGCCATGGCTTTCACCACTCTGCTTTGGGTTAACCCATTAGCTGAAA
TTTTCATTAACAAGAATATTGGCGAAAGTGATAGTGAACTTACGTCCAGTGAAATCCGATATGCTGACAGATTGGTTGGAAATATGGGATTCTCAAAGCCAGATTTTGCT
AAGCTTAGGCTCTGGTGTTTGTCACTGTCTGGTCTCTTGCAGATCATCGCCCTAAGGCCAAACTTGCAAATGTATCTGAACGAAGCTCTGTTATCTTGGTACCAAAGACT
GCATGCTGGGAAGGTTCCAGACTTGGATTTCAGTAGAGCAAAGGTTTTTCTGCACAATCACTACTTATGTGTGGCTGCTTTGCAGTTTTTTTCTCCACCAGCCTTAGTTT
TACTTTTTGTTGGGTTATCTCAAATTGGTGTCAACTCTTTGGAAAATACAAGAGAGGGTGGAGGCTTCAAGGAGAGGAGGCTTAGGCCAGGGGATCTACTTTCTCATTTT
CTTTTTGTGTTGGCGGCGACGTCTGAAATCGTTTGTCGAATAAAGATGGCATCAAGTTTGAAGGTGAATTTGGAGAAAGCTGTTGCTCGTGGAATTGATATCAATATGGA
AAGAGTGGTTGCTATGGCTGCAGAATTGGGCTGTTCCTCAGAGGTAAAGAGAAAAATGTTAGTATTGGGAAATTGTGACTTTACCTTTAGCAAAAGGTGGCGTTGGTGTT
GGGAATTTGAAGAAAAAGGATGTGGCTTTGCTTATGAAATGGCTTTAGAGAATTGCAAATCCACCCAACAATTCCCGGGATATTTCTCTCTACTTTTTCCGGTGAAATTC
GCATCACCAAGCTTACTTTTGCGTGCGGGATCTGATTTCCTGATTCTTGTGGCTCCTCTGATGGGGGAGCCGAGCGCCAAAGCGGCGGGTGGTGATTCGCAGAGATCTGC
GCCGACTCCCTTTTTGACGAAAACGTATCAGCTCGTTGACGACGAGACAATCGACCATGTAATTTCGTGGAACGACGATGGATCTACTTTTATCGTCTGGAACACCCTAA
CTTTTGCCAGAGATTTGCTGCCCAAGTATTTCAAGCACAATAACTTCACCAGCTTTCTCCGCCAGCTCAACACTTATGGATTCAGAAAGGTTGTAACGGACCGATGGGAA
TTCGCGAACGAGTGCTTTCTCAAAGGCAAGAAACAACTTCTATGTGAGATTCAACGCCGTAAATTGTTGACTCCGTTGCCGTCGACTGCTACTAACGTGGTTGTCTCCAC
AACGGCGATGACAACCGTCGGTGCTGCTGCAAGTCCGACTGCGCAACCACTAACATTGCCGGGAAATTCCGGTGAAGACCAAGTGATCTCGTCGAGTGCATCTCCGACGA
GAGCTCTTGCGGAGCTGATCGACGAGAATGATAGGCTGAGAAAAGAGAAAGTCCGACTTACAGAACAATTGGTCGAGGTGAAATCTGTGTGCAACAACATCTTCTCGTTG
ATGTCGAGCTTCGTTGAGAATCAATTCAAGAGCAGTTTCAAAGTGAGAGAGAGCGTTTCGACATCTACGAAATCGCTCGATCTTTTCCCGGTGAAGCGCTCTTCCGGCGA
GGACGAAGCGGAGGCCACGGAGGCGGAGAGGAATCAGATCGGCGCGCCCATCGGAGCGAAGCGAGCGAGGGAGCATAGGGAGGGCTATAGCACTCCGGCGGAGGACGATA
CTGCTTTGCGGCTTCAACCGCCGGAAAGATCGGAAGTCAAATCAGAGCAGTTAAATAGTTCTAGTTCAGGCAATCCTGCGATGCAGAGAATCGTTCCCCAAATTGGGGGA
GGGGAGATAGCACGTGCTGTAAATCTTTGGCACGCAAACACAAGCAATCTGAAGAGTTTGAAGCATCATATGCTTCGAATCAGCATTCAGAACTGTCGTGTCCAGAACGT
CCGCAGCCTGACCCTTCTTAATCTTCTGAAACACCCAACCAACCAGATTTCCACCCTCAATCTCTTTGAAGTCAGGTCCAGTTGGTTCCTTCCCAGTTACCAACTCCAGC
AGAATCACACCGAAGCTATAAACGTCTCCTTTCGTAGTAGACCTCCCGCTCTGCCCGTACTCCGGTGGGATGTAACCAAAGGTTCCAGCAATCTCAGTTGTGACATGAGT
CTCACAAGCACTGATCAGTCTTGCCAACCCGAAGTCTGCAACTTTCGGCTCGAAATCTTCATTGAGGAGTATATTGCTTGCTTTAATGTCCCTATGAATGATGTGGGGGA
TGAATCCATGGTGAAGAAATGCCAACCCACGGGCTGCGCCAGAAGCGACTTTGAAGCGTGTCTCCCAGTTAAGGACTTCAAGAGTTCCGGTTCGGTTTCTCAGCCAAAGA
TCCAAGCTACCATTCACCATATACTCATAGACAAAGTTTCCATTTCAGCAGTAAATTCTCTGTGCCCCTGTGTCTTTGCTTCGCTTAGCTTCTTCACAGCAACAGTTTTA
CCACCAGGCAAAGTGGCCTTGTATACAGTCCCGAAACCTCCATCTCCAATTATGTTTGTTTTGCAGAAGTTATTGGTTGCTTCAAGGATATCAACTAATGTTAATTTCAA
AAGGGGCTGCTCGAACATGGCAACATTGATGCTCAAAGCAAGGCTCCAAGCATTCAAGACTGCAGATCTCTCTAAGCTTTTGATCTGGCAATTGAAACCCATGATTCTCC
CACAAAGATCCTTGTTACCGGCGAGTGAACTTTTGGATAGATTCTGGCAGATGCCACTTCTCGGAATTGGCCCTTCTAGACTGTTTTCTGCCAAATTCAGGCCTTGGAGC
TTGAGAGCATCGCCGATCTCTGCGGGAATAGGGCCAGTAAGCATATTGCCAGACAAATCCAAGGTTGTCAGGCACTGTAACTCAGTAAGGTCGGCAAGTCTTTCCGGAAT
CGACCCGTTTAGACTGTTGTTTCCAAGATCCAACGTGGTAAGTGCACTGCAATCTCCAAGCATGGATGGAATAGTTCCTTCAAGCAGATTTGAATTCAAATTTAGAACAG
AAAGGGCTGGAGATGACCCTCCAACCGGTTATTTGCAGCAGAGAATTCCATCAAATCTACTGAGTTCCATATACTTCCCGGTTCCTACACTGCACAAATGTGTCGTCAAT
TGTACCCGAAAGGAAATTACTGTCAAGATCAATCTCCGTCAAGGATGCTGCATTACAGATTTCTTTAGGTATTGGACCTGTCAACAAGTTATTGCTCAAACTAAGGTGCT
TAAGCATCGAGCAATTTCCAATCTCAGGTGGGATTTTCCCGGTAAACCGATTGCTCGAGAGCAAAATAGAATCGACATGATTCCATTTGCCGAGCCAGGAAGTATAGTCA
AGTTCTGCAACTCGCCAATTGATTTCGGGATAGAACAGCCGAGTGGGTTGTATGAAAGGTCCAGTTTGCTCAATGATTTCAACTTGGATAGTTCTTCAGGCAATGGACCC
GTTAGAGAACAAGAAGGCGAGAAAAAGTTCTCGAGCAATGCAAGGTTACCAATTTCAGGTGGGATGGAACCAGAAAACGAGTTGTTTGAAATGTCCAAAGAAGTTAGAGA
TTTAAGCTCAGTAAAGATGGTCAATGGGAGTGAACCTGATAAAAGATTATTGCCGAGGTCTAAGGATACAATCATCGTCAAGTTTCCGATGTGGGCCGGAACATTCCCGA
CGAAGGCATTGCCGGAGAGGTCGAGGGTCCGTAGCAGTTTCAAATTACCAAGCTCCAGCGGGATTCTACCTGCCGAGAGCGAGCACCTTCAAGCTTTGAAGATTGGATAT
TTGAGGTGGGATTGGGCCATAGAGGAAGTTGGAGGAAAGGTCAAGAACCCTCTCAAGCGCAGCTTTGAAAGAAAGCAAGCTTTCTTTCTCCACACTAACCTCATTCTGGT
CTGTAACGCCATCGGAGCTCAAAATGCAGAGCTCGAAGCAGACAATGAAAATGAGAAAGAAACGCTTCAACTCCATCCCCATCTGAAAGAACATGATATCCCACTATTCT
ACATCCGAGCTGCTGAAAAATGA
Protein sequenceShow/hide protein sequence
MVLQILSIYRNFLLHVLLSLSLSVFLIFFKVPTLFLHGIFTYIHPDNTNSGVRAAIRRPDNSGSGSGLEGYRNLSSTSASEIRKRTKSKDKVEFDESKAQIFRLKLDENH
LQTRIYFKEYRDAFTFSFVGISCLLLQNFLGVSKNSGVWESGIFVPLLFAIFAGCKLFESLAKVALEKSASRTLDRQLSLLFGVSGFLFGLLTCSALSPLILDFDLGEIG
GLGACFVAILMGCFAGFLFIPATKISRSFWLGTDQIRCNLEMVYCGWFSRMILYASQVAMAFTTLLWVNPLAEIFINKNIGESDSELTSSEIRYADRLVGNMGFSKPDFA
KLRLWCLSLSGLLQIIALRPNLQMYLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVAALQFFSPPALVLLFVGLSQIGVNSLENTREGGGFKERRLRPGDLLSHF
LFVLAATSEIVCRIKMASSLKVNLEKAVARGIDINMERVVAMAAELGCSSEVKRKMLVLGNCDFTFSKRWRWCWEFEEKGCGFAYEMALENCKSTQQFPGYFSLLFPVKF
ASPSLLLRAGSDFLILVAPLMGEPSAKAAGGDSQRSAPTPFLTKTYQLVDDETIDHVISWNDDGSTFIVWNTLTFARDLLPKYFKHNNFTSFLRQLNTYGFRKVVTDRWE
FANECFLKGKKQLLCEIQRRKLLTPLPSTATNVVVSTTAMTTVGAAASPTAQPLTLPGNSGEDQVISSSASPTRALAELIDENDRLRKEKVRLTEQLVEVKSVCNNIFSL
MSSFVENQFKSSFKVRESVSTSTKSLDLFPVKRSSGEDEAEATEAERNQIGAPIGAKRAREHREGYSTPAEDDTALRLQPPERSEVKSEQLNSSSSGNPAMQRIVPQIGG
GEIARAVNLWHANTSNLKSLKHHMLRISIQNCRVQNVRSLTLLNLLKHPTNQISTLNLFEVRSSWFLPSYQLQQNHTEAINVSFRSRPPALPVLRWDVTKGSSNLSCDMS
LTSTDQSCQPEVCNFRLEIFIEEYIACFNVPMNDVGDESMVKKCQPTGCARSDFEACLPVKDFKSSGSVSQPKIQATIHHILIDKVSISAVNSLCPCVFASLSFFTATVL
PPGKVALYTVPKPPSPIMFVLQKLLVASRISTNVNFKRGCSNMATLMLKARLQAFKTADLSKLLIWQLKPMILPQRSLLPASELLDRFWQMPLLGIGPSRLFSAKFRPWS
LRASPISAGIGPVSILPDKSKVVRHCNSVRSASLSGIDPFRLLFPRSNVVSALQSPSMDGIVPSSRFEFKFRTERAGDDPPTGYLQQRIPSNLLSSIYFPVPTLHKCVVN
CTRKEITVKINLRQGCCITDFFRYWTCQQVIAQTKVLKHRAISNLRWDFPGKPIAREQNRIDMIPFAEPGSIVKFCNSPIDFGIEQPSGLYERSSLLNDFNLDSSSGNGP
VREQEGEKKFSSNARLPISGGMEPENELFEMSKEVRDLSSVKMVNGSEPDKRLLPRSKDTIIVKFPMWAGTFPTKALPERSRVRSSFKLPSSSGILPAESEHLQALKIGY
LRWDWAIEEVGGKVKNPLKRSFERKQAFFLHTNLILVCNAIGAQNAELEADNENEKETLQLHPHLKEHDIPLFYIRAAEK