; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC10g0140 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC10g0140
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionTransmembrane protein 161B
Genome locationMC10:1008818..1018360
RNA-Seq ExpressionMC10g0140
SyntenyMC10g0140
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR019395 - Transmembrane protein 161A/B


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6574167.1 Heat stress transcription factor B-2b, partial [Cucurbita argyrosperma subsp. sororia]1.58e-30861.45Show/hide
Query:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDK--VEFDESK
        M+LQ FS   NLLL V LSLSLS F+IFF+IPTLFLHGIFTYIHPDNASSGVRAAIRRP+ S SGSGL+GYRNLSS  A+EIRKRTKSKDK  VEFDESK
Subjt:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDK--VEFDESK

Query:  AQIFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLL
        AQIFRLKLDENHLQTRIYFKEYRDGFTF+FVGISCLLLQ FLG S+ SG+WGNG+ VPLLF+IFAGCKLF++L KVA+EKSASR+LDRQLSLLFGV G L
Subjt:  AQIFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLL

Query:  FGLLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINK
        FGLLTCS+ +P ILDF+L +I G G  FVA+LMGCF+GFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQ+AM FTTLLWV PL EIFINK
Subjt:  FGLLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINK

Query:  NIGESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFA
        NIG S  EH  +EI NADRLVGN+GFSK DF KLRLWCL+LSG LQIIAVR NLQM+LNEALLSWYQRLHAGKVP+LDFSRAKVFLHNHYLCV++LQFFA
Subjt:  NIGESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFA

Query:  PPALVSLFVGLSQIDVNSLENTPL----------------------------------------------------------------------------
        PPALV LF GLSQI +NSLE T L                                                                            
Subjt:  PPALVSLFVGLSQIDVNSLENTPL----------------------------------------------------------------------------

Query:  ------------------------------------------------------------------------------GFRKVVSDRCEFANECFRRGEK
                                                                                      GFRKVVSDR EFANECFR+G+K
Subjt:  ------------------------------------------------------------------------------GFRKVVSDRCEFANECFRRGEK

Query:  QLLCEIQRRKLATP----------------TAIPTTQVLTLTGNYGEDQVISSNATPARAIAELIDENDRLRKEKVRLTEQLVEVKSLCNNIFSLMSSFV
        QLLCEIQRRKL TP                +AIP+  +LT + + GE+QVISS+ TP + +AELIDEND+L+KEKVRLTEQL EVKSLCNNIFSLMSSFV
Subjt:  QLLCEIQRRKLATP----------------TAIPTTQVLTLTGNYGEDQVISSNATPARAIAELIDENDRLRKEKVRLTEQLVEVKSLCNNIFSLMSSFV

Query:  ENQFESSFKVRESVLTSRTSLNLFPMKQSSGEDEKAERNPIGA-PVGAKRPREHRERAAAAEGDTTSRLQSPDRSEVKSERSHCQNNVDNQNTWLNQVH
        ENQ +SS KVRESVL S  SL+LFP+K+ S +DE AE        +GAKRPRE+RE   AAE DTT RLQ P+RS VKS+R  C+ NVDN+ TW NQVH
Subjt:  ENQFESSFKVRESVLTSRTSLNLFPMKQSSGEDEKAERNPIGA-PVGAKRPREHRERAAAAEGDTTSRLQSPDRSEVKSERSHCQNNVDNQNTWLNQVH

XP_004140782.1 uncharacterized protein LOC101217739 [Cucumis sativus]7.49e-24482.94Show/hide
Query:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ
        M+LQ  S Y NLLL V LSLSLSVF+IFF IP++FLHGIFTYIHPDN +SGVRAAIRRP++S SG+GL GYRNLSS  A+EI+KRTKSKDK EFDESKAQ
Subjt:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG
        IFRLKLDENHLQTRIYFKEYRD FTF+FVGISCLLLQ F+G SK+SG+WGNGI VPLLF IFAGCKLF++L KVA EKSASRTLDRQLSLLFGV G LFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG

Query:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI
        LLTCSA +PLILDF+L EIGG GACFVAILMG  AGFLFIPATKI RSFWLGTDQIRCNL+MVYCGWFSR++LY+SQ AMAFTTLLWVNPL EIFI KNI
Subjt:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI

Query:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
        GE    H  SEIRNADRLVG++GFSK DF KLRLWCL+LSG LQIIAVR NLQM+LNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
Subjt:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP

Query:  ALVSLFVGLSQIDVNSLENTPL
        ALV LFVGLSQID+NS +NT L
Subjt:  ALVSLFVGLSQIDVNSLENTPL

XP_008439191.1 PREDICTED: transmembrane protein 161B [Cucumis melo]5.88e-24383.41Show/hide
Query:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ
        M+LQ  S Y NLLL V LSLSLSVF+IFF+IP+LFLHGIFTYIHPDN +SGVRAAIRRPE S SGSGL GYRNLSS   +EI+KRTKSKDKVEFDESKAQ
Subjt:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG
        IFRLKLDENHLQTRIYFKEYRDGFTF+FVGISCLLLQ F+G SK SG+WGNGI VPLLF IFAGCKLF++L KVA EKSASRTLDRQLSLLFGV G LFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG

Query:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI
        LLTCSA +PLILDF+LGEIGG GACF+AILMG  AGFLFIPATKIARSFWLGTDQIRCNL+MVYCGWFSRM+LY+SQ AMAFTTLLWVNPL EIFI KNI
Subjt:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI

Query:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
        GE    H  S+ RNADRLVG++GFSK DF KLRLWCL+LS  LQI+AVR NLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
Subjt:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP

Query:  ALVSLFVGLSQIDVNSLENTPL
        ALV LFVGLSQID+ S +NT L
Subjt:  ALVSLFVGLSQIDVNSLENTPL

XP_022141264.1 uncharacterized protein LOC111011705 [Momordica charantia]5.33e-296100Show/hide
Query:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ
        MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ
Subjt:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG
        IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG

Query:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI
        LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI
Subjt:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI

Query:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
        GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
Subjt:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP

Query:  ALVSLFVGLSQIDVNSLENTPL
        ALVSLFVGLSQIDVNSLENTPL
Subjt:  ALVSLFVGLSQIDVNSLENTPL

XP_038882659.1 transmembrane protein 161B [Benincasa hispida]7.27e-24984.36Show/hide
Query:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ
        M+LQ  S Y NL+L V LSLSLS F+IFF+IP+LFLHGIFTYIHPDN +SGVRAAI RP+ S S SGL+GYRNLSS  A+EIRKRTKSKDKVEFDESKAQ
Subjt:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG
        IFRLKLDENHLQTRIY+KEYRDGFTFTFVGISCLLLQ FLG SK+SG+WGNGI VPLLF+IFAGCKLF++LAKVA+EKSASRTLDRQLSLLFGV G LFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG

Query:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI
        LLTCSA +PLILDF+LGEIGG GAC VAILMGCF GFLFIPATKIARSFWLGTDQIRCNL+MVYCGWFSRM+LY+SQ+AMA TTLLWVNPL EIFINKNI
Subjt:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI

Query:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
        GE   EH  SEIRNADRLVG++GFS+ DF KL+LWCLSLSG LQIIAVR NLQM+LNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
Subjt:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP

Query:  ALVSLFVGLSQIDVNSLENTPL
        ALV LFVGLSQI +NSL+NT L
Subjt:  ALVSLFVGLSQIDVNSLENTPL

TrEMBL top hitse value%identityAlignment
A0A0A0L8M8 Uncharacterized protein3.62e-24482.94Show/hide
Query:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ
        M+LQ  S Y NLLL V LSLSLSVF+IFF IP++FLHGIFTYIHPDN +SGVRAAIRRP++S SG+GL GYRNLSS  A+EI+KRTKSKDK EFDESKAQ
Subjt:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG
        IFRLKLDENHLQTRIYFKEYRD FTF+FVGISCLLLQ F+G SK+SG+WGNGI VPLLF IFAGCKLF++L KVA EKSASRTLDRQLSLLFGV G LFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG

Query:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI
        LLTCSA +PLILDF+L EIGG GACFVAILMG  AGFLFIPATKI RSFWLGTDQIRCNL+MVYCGWFSR++LY+SQ AMAFTTLLWVNPL EIFI KNI
Subjt:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI

Query:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
        GE    H  SEIRNADRLVG++GFSK DF KLRLWCL+LSG LQIIAVR NLQM+LNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
Subjt:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP

Query:  ALVSLFVGLSQIDVNSLENTPL
        ALV LFVGLSQID+NS +NT L
Subjt:  ALVSLFVGLSQIDVNSLENTPL

A0A1S3AYU7 transmembrane protein 161B2.85e-24383.41Show/hide
Query:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ
        M+LQ  S Y NLLL V LSLSLSVF+IFF+IP+LFLHGIFTYIHPDN +SGVRAAIRRPE S SGSGL GYRNLSS   +EI+KRTKSKDKVEFDESKAQ
Subjt:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG
        IFRLKLDENHLQTRIYFKEYRDGFTF+FVGISCLLLQ F+G SK SG+WGNGI VPLLF IFAGCKLF++L KVA EKSASRTLDRQLSLLFGV G LFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG

Query:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI
        LLTCSA +PLILDF+LGEIGG GACF+AILMG  AGFLFIPATKIARSFWLGTDQIRCNL+MVYCGWFSRM+LY+SQ AMAFTTLLWVNPL EIFI KNI
Subjt:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI

Query:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
        GE    H  S+ RNADRLVG++GFSK DF KLRLWCL+LS  LQI+AVR NLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
Subjt:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP

Query:  ALVSLFVGLSQIDVNSLENTPL
        ALV LFVGLSQID+ S +NT L
Subjt:  ALVSLFVGLSQIDVNSLENTPL

A0A5A7SSR6 Transmembrane protein 161B1.20e-24283.25Show/hide
Query:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ
        M+LQ  S Y NLLL V LSLSLSVF+IFF+IP+LFLHGIFTYIHPDN +SGVRAAIRRPE S SGSGL GYRNLSS   +EI+KRTKSKDKVEFDESKAQ
Subjt:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG
        IFRLKLDENHLQTRIYFKEYRDGFTF+FVGISCLLLQ F+G SK SG+WGNGI VPLLF IFAGCKLF++L KVA EKSASRTLDRQLSLLFGV G LFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG

Query:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI
        LLTCSA +PLILDF+LGEIGG GACF+AILMG  AGFLFIPATKIARSFWLGTDQIRCNL+MVYCGWFSRM+LY+SQ AMAFTTLLWVNPL EIFI KNI
Subjt:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI

Query:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
        GE    H  S+ RNADRLVG++GFSK DF KLRLWCL+LS  LQI+AVR NLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
Subjt:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP

Query:  ALVSLFVGLSQIDVNSLENTPLGF
        ALV LFVGLSQID+ S +NT L F
Subjt:  ALVSLFVGLSQIDVNSLENTPLGF

A0A6J1CI38 uncharacterized protein LOC1110117052.58e-296100Show/hide
Query:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ
        MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ
Subjt:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQ

Query:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG
        IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG
Subjt:  IFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFG

Query:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI
        LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI
Subjt:  LLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNI

Query:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
        GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP
Subjt:  GESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPP

Query:  ALVSLFVGLSQIDVNSLENTPL
        ALVSLFVGLSQIDVNSLENTPL
Subjt:  ALVSLFVGLSQIDVNSLENTPL

A0A6J1G034 uncharacterized protein LOC1114494566.03e-24182.08Show/hide
Query:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDK--VEFDESK
        M+LQ  S   NLLL V LSLSLS F+IFF+IPTLFLHGIFTYIHPDNASSGVRAAIRRP+ S SGSGL+GYRNLSS +A+EIRKRTKSKDK  VEFDESK
Subjt:  MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDK--VEFDESK

Query:  AQIFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLL
        AQIFRLKLDENHLQTRIYFKEYRDGFTF+FVGISCLLLQ FLG S+ SG+WGNG+ VPLLF+IFAGCKLF++L KVA+EKSASR+LDRQLSLLFGV G L
Subjt:  AQIFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLL

Query:  FGLLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINK
        FGLLTCS+ +P ILDF+L +I G G  FVA+LMGCF+GFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQ+AM FTTLLWV PL EIFINK
Subjt:  FGLLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINK

Query:  NIGESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFA
        NIG S  EH  +EI NADRLVGN+GFSK DF KLRLWCL+LSG LQIIAVR NLQM+LNEALLSWYQRLHAGKVP+LDFSRAKVFLHNHYLCV++LQFFA
Subjt:  NIGESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFA

Query:  PPALVSLFVGLSQIDVNSLENTPL
        PPALV LF GLSQI +NSLE T L
Subjt:  PPALVSLFVGLSQIDVNSLENTPL

SwissProt top hitse value%identityAlignment
Q652B0 Heat stress transcription factor B-2c8.9e-1535.62Show/hide
Query:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLAT--------------------------------PTAIPTT-----------QVLTLTGNYGED--Q
        GFRK+V DR EFAN+CFRRGEK+LLC+I RRK+                                  P A+P T           QVL+     GE+  Q
Subjt:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLAT--------------------------------PTAIPTT-----------QVLTLTGNYGED--Q

Query:  VISSNATP------ARAIAELIDENDRLRKEKVRLTEQLVEVKSLCNNIFSLMSSFVENQ
           S + P      + +  ++ +EN+RLR+E  RLT +L  +K LCNNI  LMS +   Q
Subjt:  VISSNATP------ARAIAELIDENDRLRKEKVRLTEQLVEVKSLCNNIFSLMSSFVENQ

Q6Z9C8 Heat stress transcription factor B-2b1.3e-1841.43Show/hide
Query:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLATP----------TAIPTTQVLTLTGN-----YGEDQVISSNATP----------------ARAIAE
        GFRK+V DR EFAN+CFRRGE++LLCEI RRK+  P           AIP    +T T +      GE+QVISS+++P                  A  +
Subjt:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLATP----------TAIPTTQVLTLTGN-----YGEDQVISSNATP----------------ARAIAE

Query:  LIDENDRLRKEKVRLTEQLVEVKSLCNNIFSLMSSFVENQ
        + DEN+RLR+E  +L  +L +++ LCNNI  LMS +   Q
Subjt:  LIDENDRLRKEKVRLTEQLVEVKSLCNNIFSLMSSFVENQ

Q7XRX3 Heat stress transcription factor B-2a8.3e-1334.9Show/hide
Query:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLA------------TPTAIPTTQVLTLTGNYGEDQVISS--------NATPARAIAELIDENDRLRKE
        GF+KVV+DR EFAN+CFRRGEK LL  IQRRK +             PTAIP +   T +G  GE  V SS         A  + A+AEL +EN RLR+E
Subjt:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLA------------TPTAIPTTQVLTLTGNYGEDQVISS--------NATPARAIAELIDENDRLRKE

Query:  KVRLTEQLVEVKSLCNNIFSLMSSFVENQFESSFKVRE---SVLTSRTSLNLFPMKQSSGEDEKAER----NPIGAPVGAKRPREHRERAAA
          RL  +L   + +C+ +  L+S +  +      +  E     +    ++     ++ +GEDE+ E     +  G     +   E RER AA
Subjt:  KVRLTEQLVEVKSLCNNIFSLMSSFVENQFESSFKVRE---SVLTSRTSLNLFPMKQSSGEDEKAER----NPIGAPVGAKRPREHRERAAA

Q9SCW4 Heat stress transcription factor B-2a6.6e-1836.76Show/hide
Query:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLAT-------PTAIPTTQVLTLT-GNYGED----QVISSNATP-----------ARAIAELIDENDRL
        GF+KVV DR EF+N+ F+RGEK+LL EIQRRK+ T       P++    Q + ++  N GED    QV+SS+ +                 EL++EN++L
Subjt:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLAT-------PTAIPTTQVLTLT-GNYGED----QVISSNATP-----------ARAIAELIDENDRL

Query:  RKEKVRLTEQLVEVKSLCNNIFSLMSSFVENQ-FESSFKVRESVLTSRTSLNLFPMKQSS----GEDEKAERNPIGAPVGAKRPR
        R + ++L  +L ++KS+C+NI+SLMS++V +Q  + S+    S   S   +   P K+ S     E+E+A     G P+G KR R
Subjt:  RKEKVRLTEQLVEVKSLCNNIFSLMSSFVENQ-FESSFKVRESVLTSRTSLNLFPMKQSS----GEDEKAERNPIGAPVGAKRPR

Q9T0D3 Heat stress transcription factor B-2b8.3e-2131.92Show/hide
Query:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLATP--------------------TAIPTTQVLTLTGNYGEDQVISSNATPARAIA------------
        GFRKVV DR EF+N+CF+RGEK LL +IQRRK++ P                     A+P    +    N GE+QVISSN++PA A A            
Subjt:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLATP--------------------TAIPTTQVLTLTGNYGEDQVISSNATPARAIA------------

Query:  ---------ELIDENDRLRKEKVRLTEQLVEVKSLCNNIFSLMSSFVENQFESSFKVRESVLTSRTSLNLFPMKQSSGE---------------DEKAER
                 EL++EN+RLRK+  RL +++ ++K L  NI++LM++F   Q + +      +L     L+L P +Q   E                E    
Subjt:  ---------ELIDENDRLRKEKVRLTEQLVEVKSLCNNIFSLMSSFVENQFESSFKVRESVLTSRTSLNLFPMKQSSGE---------------DEKAER

Query:  NPIGAPVGAKRPREHRERAAAAEGDTTSR----LQSPDRSEVKSERSHCQNNVDNQNTWL
           G  +G KR R   E  AA E D   R     +    S+VK+E     N+ ++  +WL
Subjt:  NPIGAPVGAKRPREHRERAAAAEGDTTSR----LQSPDRSEVKSERSHCQNNVDNQNTWL

Arabidopsis top hitse value%identityAlignment
AT1G46264.1 heat shock transcription factor B43.3e-0932.14Show/hide
Query:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLA---------------TPTAIPTT----------QVLTLTGNY--------GEDQVISSNATPARAI
        GFRK+V DR EFANE F+RGEK LLCEI RRK +                P  IP +          +V T   ++           +VI      A  +
Subjt:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLA---------------TPTAIPTT----------QVLTLTGNY--------GEDQVISSNATPARAI

Query:  AELIDENDRLRKEKVRLTEQLVEVKSLCNNIFSLMSSFVE
          L ++N+RLR+    L  +L  +K L N+I   + + V+
Subjt:  AELIDENDRLRKEKVRLTEQLVEVKSLCNNIFSLMSSFVE

AT2G41690.1 heat shock transcription factor B31.1e-0731.93Show/hide
Query:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRK-------LATPTAIPTTQVLTLTG-------NYGEDQVISSNATPARAIAELIDENDRLRKEKVRLTE
        GFRKV + R EF+NE FR+G+++L+  I+RRK        +    +PTT ++   G       ++ EDQ  SS  + +     L+DEN  L+ E   L+ 
Subjt:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRK-------LATPTAIPTTQVLTLTG-------NYGEDQVISSNATPARAIAELIDENDRLRKEKVRLTE

Query:  QLVEVKSLCNNIFSLMSSF
        +L + K  C  +  L+  +
Subjt:  QLVEVKSLCNNIFSLMSSF

AT4G11660.1 winged-helix DNA-binding transcription factor family protein5.9e-2231.92Show/hide
Query:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLATP--------------------TAIPTTQVLTLTGNYGEDQVISSNATPARAIA------------
        GFRKVV DR EF+N+CF+RGEK LL +IQRRK++ P                     A+P    +    N GE+QVISSN++PA A A            
Subjt:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLATP--------------------TAIPTTQVLTLTGNYGEDQVISSNATPARAIA------------

Query:  ---------ELIDENDRLRKEKVRLTEQLVEVKSLCNNIFSLMSSFVENQFESSFKVRESVLTSRTSLNLFPMKQSSGE---------------DEKAER
                 EL++EN+RLRK+  RL +++ ++K L  NI++LM++F   Q + +      +L     L+L P +Q   E                E    
Subjt:  ---------ELIDENDRLRKEKVRLTEQLVEVKSLCNNIFSLMSSFVENQFESSFKVRESVLTSRTSLNLFPMKQSSGE---------------DEKAER

Query:  NPIGAPVGAKRPREHRERAAAAEGDTTSR----LQSPDRSEVKSERSHCQNNVDNQNTWL
           G  +G KR R   E  AA E D   R     +    S+VK+E     N+ ++  +WL
Subjt:  NPIGAPVGAKRPREHRERAAAAEGDTTSR----LQSPDRSEVKSERSHCQNNVDNQNTWL

AT5G52180.1 LOCATED IN: endomembrane system; EXPRESSED IN: 22 plant structures; EXPRESSED DURING: 13 growth stages; CONTAINS InterPro DOMAIN/s: Transmembrane protein 161AB, predicted (InterPro:IPR019395); Has 82 Blast hits to 82 proteins in 35 species: Archae - 0; Bacteria - 0; Metazoa - 47; Fungi - 0; Plants - 33; Viruses - 0; Other Eukaryotes - 2 (source: NCBI BLink).4.0e-11149.05Show/hide
Query:  VLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPD-----NASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDE
        +L+   +Y NL LQ++LSL L++ L F +I  +FLHG+ TYI P+     N  +G+R AIRRP ++          +  SN   E+R+R +SKDK EFDE
Subjt:  VLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPD-----NASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDE

Query:  SKAQIFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGV--SKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGV
        S AQIFR+KLDE+HL++R+YF EY   F  +F+ +SC LL  + G+    S G+  NG+  P++    A CK+F+ L K+++E+SAS+  +++LSL+FGV
Subjt:  SKAQIFRLKLDENHLQTRIYFKEYRDGFTFTFVGISCLLLQSFLGV--SKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGV

Query:  SGLLFGLLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEI
         G +FG++  + + P   DF LG +       ++  M C  GFL++PA + ARSFW+GTDQIR NL ++ CGWF RMILYA+ +   FT+LLW++PL E+
Subjt:  SGLLFGLLTCSALTPLILDFNLGEIGGPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEI

Query:  FINKNIGESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSAL
         + +    S    T    ++   LVGN+G    DF K R+ CL LSGLLQ +AVR NLQMFLNEA+LSWYQRLH  K PDLDFSRAK+FLHNHYLC+ AL
Subjt:  FINKNIGESASEHTFSEIRNADRLVGNMGFSKPDFVKLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSAL

Query:  QFFAPPALVSLFVGLSQIDVNS
        QF AP  LV LF+GLSQID++S
Subjt:  QFFAPPALVSLFVGLSQIDVNS

AT5G62020.1 heat shock transcription factor B2A4.7e-1936.76Show/hide
Query:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLAT-------PTAIPTTQVLTLT-GNYGED----QVISSNATP-----------ARAIAELIDENDRL
        GF+KVV DR EF+N+ F+RGEK+LL EIQRRK+ T       P++    Q + ++  N GED    QV+SS+ +                 EL++EN++L
Subjt:  GFRKVVSDRCEFANECFRRGEKQLLCEIQRRKLAT-------PTAIPTTQVLTLT-GNYGED----QVISSNATP-----------ARAIAELIDENDRL

Query:  RKEKVRLTEQLVEVKSLCNNIFSLMSSFVENQ-FESSFKVRESVLTSRTSLNLFPMKQSS----GEDEKAERNPIGAPVGAKRPR
        R + ++L  +L ++KS+C+NI+SLMS++V +Q  + S+    S   S   +   P K+ S     E+E+A     G P+G KR R
Subjt:  RKEKVRLTEQLVEVKSLCNNIFSLMSSFVENQ-FESSFKVRESVLTSRTSLNLFPMKQSS----GEDEKAERNPIGAPVGAKRPR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTGCTGCAAAATTTTTCAACATACGGGAATTTGCTTCTTCAGGTCGTACTGTCTCTCTCGCTTTCTGTTTTTCTCATCTTCTTCAGAATCCCAACCCTTTTCCTCCA
TGGCATATTTACTTATATTCACCCAGATAACGCCAGCAGTGGCGTCCGCGCCGCAATTAGAAGACCTGAAAATTCTGGCTCCGGTTCTGGACTAGATGGGTACCGAAACT
TGTCCTCAAATACTGCTTCTGAGATCAGGAAAAGAACAAAGTCGAAGGACAAGGTTGAGTTTGACGAAAGCAAAGCGCAGATCTTCAGGTTAAAGCTGGATGAGAATCAT
CTGCAAACGCGGATCTATTTCAAAGAATACAGAGATGGTTTCACTTTTACGTTCGTGGGTATTTCTTGTTTACTACTGCAAAGTTTCTTGGGTGTATCTAAAAGTTCTGG
GATTTGGGGAAATGGGATTTCCGTCCCTCTACTGTTTTCGATCTTCGCTGGATGTAAGCTGTTTATAACGCTCGCAAAGGTTGCTATGGAGAAATCTGCGTCAAGGACGT
TGGATAGGCAATTGAGCTTACTGTTTGGAGTCTCTGGGCTTCTTTTTGGACTTCTAACTTGTTCTGCTCTTACCCCTCTAATATTGGATTTTAATCTTGGTGAGATTGGT
GGTCCGGGGGCATGTTTCGTTGCTATCTTAATGGGCTGTTTTGCGGGGTTTTTGTTTATACCTGCAACAAAAATCGCTCGATCATTTTGGCTTGGAACCGATCAAATTAG
ATGCAATTTGGAAATGGTTTATTGTGGATGGTTCTCTCGGATGATTTTGTACGCAAGCCAAATGGCCATGGCTTTCACCACTTTGCTTTGGGTTAACCCATTAACTGAAA
TTTTCATTAACAAGAATATTGGCGAAAGTGCAAGTGAACATACGTTCAGTGAAATCCGAAATGCTGACAGATTGGTAGGAAATATGGGATTTTCAAAGCCGGATTTTGTT
AAGCTCAGGCTTTGGTGTTTGTCACTGTCTGGTCTCTTGCAGATCATCGCTGTAAGGTCAAACTTACAAATGTTTCTAAACGAAGCTCTGTTATCGTGGTACCAAAGACT
ACATGCTGGGAAGGTTCCAGACTTGGATTTCAGTAGAGCAAAAGTTTTTCTGCACAATCACTACTTATGTGTGTCTGCCTTGCAGTTTTTTGCTCCACCAGCCTTAGTTT
CACTTTTTGTTGGGTTATCTCAAATTGATGTCAACTCTTTGGAAAATACCCCTTTGGGATTCAGAAAGGTTGTATCGGACCGCTGCGAATTCGCGAACGAGTGCTTCCGC
AGAGGCGAGAAACAACTTCTATGTGAGATTCAACGTCGTAAATTGGCGACTCCGACTGCAATTCCGACGACGCAAGTACTAACATTGACGGGAAATTACGGTGAAGACCA
AGTTATTTCGTCGAATGCAACTCCTGCGAGAGCTATTGCAGAGCTGATCGACGAGAATGATCGGCTGAGAAAAGAGAAAGTCCGGCTTACAGAACAATTGGTCGAGGTGA
AATCTCTGTGCAACAATATCTTCTCTCTGATGTCGAGCTTCGTTGAAAACCAATTCGAGAGCAGTTTCAAAGTGAGAGAGAGCGTTTTGACATCAAGGACATCGCTCAAT
CTTTTCCCGATGAAGCAGTCTTCCGGCGAAGACGAAAAGGCGGAGAGAAATCCGATCGGCGCGCCCGTCGGAGCCAAGCGACCGAGGGAACACCGGGAGAGGGCGGCGGC
GGCGGAGGGTGATACTACTTCGCGGCTTCAATCGCCGGATAGATCGGAGGTGAAATCAGAGCGGTCACATTGTCAAAATAACGTCGATAATCAGAATACGTGGCTTAATC
AGGTCCACTAA
mRNA sequenceShow/hide mRNA sequence
TAGCATTTAAGATGTAAAATAATGATTAAAAGTATAAAGGGTCTTTTCATGGGGACATTTGGACAATTACAATATTGGTTTGTGGGGCATAATTGCTAATTATTTTTTTT
CTATGGGCATTATTGATAACCCCTATATTTATAGGGGCACTTGCGAAAAAAATCCATAAAAATACTAGCACCCAATTAGGTGGACCTTGGGCCGACCCGATTCGAGTCCG
TCGCCCAGCACCGCCGCAAGCAGCGAATGAAACCCAGCTCAGTACAACGAGCCCAATGCTCCACAAACGATGGAATGAAACCCAAATCCAATATTCGTTCGACGATCGAC
GACGAGGGAGTGACGGAATCTGCGAGATCTCAAGTTCGAAGGACCGCAGAAATCGCCTGGAACAAACAAGCTTGCGAACTTGATTCGAACGAAACCAGTTCCTTCACGCT
ATGGTGCTGCAAAATTTTTCAACATACGGGAATTTGCTTCTTCAGGTCGTACTGTCTCTCTCGCTTTCTGTTTTTCTCATCTTCTTCAGAATCCCAACCCTTTTCCTCCA
TGGCATATTTACTTATATTCACCCAGATAACGCCAGCAGTGGCGTCCGCGCCGCAATTAGAAGACCTGAAAATTCTGGCTCCGGTTCTGGACTAGATGGGTACCGAAACT
TGTCCTCAAATACTGCTTCTGAGATCAGGAAAAGAACAAAGTCGAAGGACAAGGTTGAGTTTGACGAAAGCAAAGCGCAGATCTTCAGGTTAAAGCTGGATGAGAATCAT
CTGCAAACGCGGATCTATTTCAAAGAATACAGAGATGGTTTCACTTTTACGTTCGTGGGTATTTCTTGTTTACTACTGCAAAGTTTCTTGGGTGTATCTAAAAGTTCTGG
GATTTGGGGAAATGGGATTTCCGTCCCTCTACTGTTTTCGATCTTCGCTGGATGTAAGCTGTTTATAACGCTCGCAAAGGTTGCTATGGAGAAATCTGCGTCAAGGACGT
TGGATAGGCAATTGAGCTTACTGTTTGGAGTCTCTGGGCTTCTTTTTGGACTTCTAACTTGTTCTGCTCTTACCCCTCTAATATTGGATTTTAATCTTGGTGAGATTGGT
GGTCCGGGGGCATGTTTCGTTGCTATCTTAATGGGCTGTTTTGCGGGGTTTTTGTTTATACCTGCAACAAAAATCGCTCGATCATTTTGGCTTGGAACCGATCAAATTAG
ATGCAATTTGGAAATGGTTTATTGTGGATGGTTCTCTCGGATGATTTTGTACGCAAGCCAAATGGCCATGGCTTTCACCACTTTGCTTTGGGTTAACCCATTAACTGAAA
TTTTCATTAACAAGAATATTGGCGAAAGTGCAAGTGAACATACGTTCAGTGAAATCCGAAATGCTGACAGATTGGTAGGAAATATGGGATTTTCAAAGCCGGATTTTGTT
AAGCTCAGGCTTTGGTGTTTGTCACTGTCTGGTCTCTTGCAGATCATCGCTGTAAGGTCAAACTTACAAATGTTTCTAAACGAAGCTCTGTTATCGTGGTACCAAAGACT
ACATGCTGGGAAGGTTCCAGACTTGGATTTCAGTAGAGCAAAAGTTTTTCTGCACAATCACTACTTATGTGTGTCTGCCTTGCAGTTTTTTGCTCCACCAGCCTTAGTTT
CACTTTTTGTTGGGTTATCTCAAATTGATGTCAACTCTTTGGAAAATACCCCTTTGGGATTCAGAAAGGTTGTATCGGACCGCTGCGAATTCGCGAACGAGTGCTTCCGC
AGAGGCGAGAAACAACTTCTATGTGAGATTCAACGTCGTAAATTGGCGACTCCGACTGCAATTCCGACGACGCAAGTACTAACATTGACGGGAAATTACGGTGAAGACCA
AGTTATTTCGTCGAATGCAACTCCTGCGAGAGCTATTGCAGAGCTGATCGACGAGAATGATCGGCTGAGAAAAGAGAAAGTCCGGCTTACAGAACAATTGGTCGAGGTGA
AATCTCTGTGCAACAATATCTTCTCTCTGATGTCGAGCTTCGTTGAAAACCAATTCGAGAGCAGTTTCAAAGTGAGAGAGAGCGTTTTGACATCAAGGACATCGCTCAAT
CTTTTCCCGATGAAGCAGTCTTCCGGCGAAGACGAAAAGGCGGAGAGAAATCCGATCGGCGCGCCCGTCGGAGCCAAGCGACCGAGGGAACACCGGGAGAGGGCGGCGGC
GGCGGAGGGTGATACTACTTCGCGGCTTCAATCGCCGGATAGATCGGAGGTGAAATCAGAGCGGTCACATTGTCAAAATAACGTCGATAATCAGAATACGTGGCTTAATC
AGGTCCACTAAGGGAATCAAATGGATCTGTAATTAACGGCTAGGATAACGTATCAACTATTAAGGCTCTAGTTCAGGATGCAGGGAATCATGTCTCCAGATGGGAGGGGG
AGATAGCACGTGCCGTAAATCTTTGGCACGTGAGCCACCAATTAAGGAAGTTTCAATACATCTTCCCTCTTCTGCATTCTTTCTTCAGTTTTTAGTCTTTCCATTTTTCC
ATCTTCAAAAGCACTAATATTTTACTTTACATCCGATCATATAAATACTGAGAAAACTTATCATTCTTAAATAAATACAATAATAATAAGAATCACTAAACTGAAACTTA
TTTCTGAAAACAATGAATCATACAACTCGAACCAATCGATTAAGTTTTGAGTCGACAATCACGTTGTTCAAACACAATGAACTCGGTTCGATCTATTTTGTTGGTTTGAA
ATTTGACCCATATATCAAATGGACAGATTAGTTCCACACATTCAGCAGAGTGATTACTGTGATGAAGATGATGAGAAACACAAACATGTGATTGATTAGATTAGATTAGC
AGATAAAAACTATTACAACTCTGAACCTTTTCACCATATCCTCTTCCTCACTAAATACTTCTTTTTACAGTGTTATTCAATCTTTTGTTTCCCCCCTACTAAGTCCCAAC
ACTAACAAACTCTAAATTCTACTCATCCTTTATCCCTTTGAGGAACTTCAGTACCTGAAGCATGGAAGGTCTGTTAGCAGGATTTTCTGACAGGCAAATACAAGCAATCT
GAAGAGTTTGAAGCATCATATGCTTCGAATCAGCATTCAGAACTGTCGTGTCGAGGACGTCCGCAGCCTGACCCTTCTTGATCTTCTGAAACACCCAACCAACCAGATTT
CCACCCTCAATCTCTTTGAAGTCAGGTCCTGTTGGTTCCTTCCCAGTTACCAGTTCCAGTAGAATCACACCAAAGCTATAAACGTCTCCTTTCGCGGTAGACCTCCCGCT
CTGCCCGTACTCCGGTGGGATGTAGCCAAAGGTTCCAGCAATCTCAGTTGTGACATGAGTCTCACAAGCACTGATCAGTCTCGCCAACCCGAAGTCAGCAACTTTTGGCT
CGAAGTCTTCGTTGAGGAGTATATTGCTTGCTTTGATATCTCTGTGAATGATGTGGGGGATGAATCCATGGTGAAGAAATGCCAATCCACGGGCTGCACCGGAAGCGACT
TTGAAGCGAGTCTCCCAGTTAAGGACTTCAAGAGTTCCGATTCGGTTTCTCAGCCAAAGATCCAAGCTGCCATTCACCATATACTCGTAGACAAGGAGCTTCTCCTCCCC
AAGAGAGCAGTAGCCAAGCAGTGAAACAAGATTATGATGTTTTACTTTACCTAGAGTTTCCATTTCAGCAATAAATTCTCTGTGCCCCTGCGTTTTTGCTTCGCTAAGCT
TCTTCACAGCAACAATTTTTCCATCAGGCAAAGTGGCCTTGTACACGGTCCCAAATCCTCCATCTCCAATAATGTTCGTTTTACAGAAGTTATTGGTTGCTACGAGGATA
TCAACCAAGGTTAATTTCAAAAGGGGCTGCTCGAACATGGCCACGTTGATGCTTAAAGGCTCTTTCGATCTGCTGCTGCTTAAGAAATAGAGATTTGGATCTATAAAACT
GTTTAGTTTGCTTTCCTCCATTTCCTCTGGATCATTATCTCTGTGGTTTCTAATAATCTGTCTCCGCATAGCGAATGCCACGGTAAGAACAATAAGAACACTTACGACGA
TAATTCCAGCAAGGCTCCAAGCATTCAAGACTGCAGATCTCTCCAAGCTTTTGATCTGGCAATTGAAACCCATGATTCTCCCACAAAGGTCCTTGTTACCTGCAAGTGAA
GTCTTGGACAGATTCTGGCAAATGCCACTTCTTGGAATCGGGCCTTCTAGACTGTTTTCAGCCAAATTCAGGTAAAACATATTGGCCACGCTGCATATCTTCTCTGGAAT
CTCTCCTGAGAGCCTGTTTTTCGAAACATCAAAGTACTCGAGTTGCATAAGATCCCCAAGTTCAGAAGGGATCGGCCCTGTAAACTTATTCCCATGAAGATCCAAAGTTG
TCAAGTATGAAAGATTGCCCAATGTTCGTGGAAGTACACCCTCGAAATAGTTACTACTCAAATTCAAAGTTTCAATCTTCCATGTCATGGAACTTGGGAAAAGTTCAACA
ACCAGACCAGAAAGCCTGTTCTCTTGTACATAAAGCCCCACAAGATTCAACATGCTGGACAGAGAAGAAGGAAGATCACCATCCAGCTCATTGGAACTTAAATCCAAATG
AGTTAATGCTTTCAGATCACCAAGACTTTTTGGAACCGAACCAGATAACTTGTTACCAGTTAGGTTCAACTTTACCAAGCTATTCAAATGACTCAGGCTTTCGGGGATCG
TTCTAGTGAGCTGATTATTCCCAAGATATAGGCCTTGGAGCTTGAGAGCATCCCCGATCTCTGTGGGAATAGGACCAGTAAGCATATTACCAGACAGATCCAAGGTTGTC
AGGTTTGTTAAGTGAGAGAGAGATCTGGGAATTTCTCCAGAAAGCAGATTATTGCTTAGTAAAAGATCAACCACTACCACACATTTCCCCAGTTCATCTGGTATGGTACC
AGACAATCTATTGTGAGACAGATCAAAAACACCATGATGCTGGACAAAGCTCAAATCAGGAATAGTCATCTGTCGAAAATACGCAGACGGCTTGAAAGGTATTGCTCCAG
ATAACTTGTTGTGTGAAAGAACTAGGCACTGTAATTCAGTAAGGTCTGCAAGTCTTTCCGGGATGGACCCGTCCAAACTATTGTTCCCAAGGTCCAATGTGGTAAGTTTA
CTGCAATCTCCAAGCAAGGATGGAATAGTACCTTCAAGCAGATTTGAATTCAAATTTAGAACAGAAAGGTCTGTGAGATTTCCAATCTCATCTGGTATTGTACCTGTCAA
CCTATTGTTGCTGAGAACAAGCCTCTCAAGTGAAGCTGCATAACCGATTTCTGAAGGGAGATGACCCTCCAACCGGTTATTTGCAGCAGAGAATTCCATCAAATCTACTG
AGTTCCATATACTTCTCGGTAAAGAACCAGTAAAATTATTAGAGTCGAGGTCGATTACCAGTAGGGGAAGGTCTGAGAAGTACTCTGGTATTGTACCAACAATCTGATTG
TCTACCAAAACCAACTCTGTAAGGTTTCTACACTGCACAAATGTGTCGTCAATCGTACCTGAAAGGAAGTTACTGTCAAGATCAATCTCCATCAAGGATGCAGCATTACA
GATTTCTTTAGGTATCGGACCTGCCAACAAGTTATTACTCAAACTAAGGTGCTTAAGCATTGAACAGTTTCCAATCTCAGGTGGGATTTTCCCAGTGAACCGATTGCTTG
AGAGCAAAATAGAATCGACATGATTCCATTTGCCGAGCCAGGAGGGTAGTGGCCCAGAAAGCTGATTCTTCTCGGCAGAAAATGTCAACATGGGAAGTTCTGAAAGCTCT
TGTGGCAACACCCCAGATAGAAAGTTGAACGAAACCATCAATGTTTTCAAATTTCTGCACCTTCCAAGTTGCGCAGGAATGGAACCATTAAGTTCTGTGTAAACCAGATT
CAGTATAGTCAGGTTCTGCAACTCGCCGATTGATTTTGGGATAGAGCAGCCAAGTGGGTTGTATGAAAGGTCAAGTTTGCTCAATGATTTCAACTTGGATAGTTCTTCAG
GCAATGGACCCGTGAGAGAACAAGAAGGCGAGAAAAAGTTCTCGAGCAATGCAAGTTTCCCAACTTCAGGAGGCAACTCACCAGAAAAGTGGTTAATGCCGATATAAAGG
TCGGTTAAATGCTGTAGGTTACCAATTTCAGGTGGGATTGAACCAGAAAACGAGTTGTTCGAAATGTCCAAAGAAGTTAGAGATTTAAGCTCAGTAAAGATGGTCAATGG
GAGTGAACCTGATAAGAGATTGTTGCCGAGGTCCAAGGATAAAATCCTCGTCAAGTTTCCGATGTGGGCCGGAACATTCCCGACGAAGGCATTGCCGGAGAGGTCGAGCG
TCTGTAGCAGCTTCAAATTACCAAGCTCCGGCGGGATTTCACCTGTGAATAAATTAGTCCCCAGCTTGAGATTCTCCAACTGGGTCAACTCAGTGAGTTCGACAGGGAAG
TCGCCAGAAAACTGGTTTCCGCCGAGAGCGAGCACCTTCAAGCTCCGGAGATTGGATATCTGAGGTGGGATTGAGCCGTGGAGGGAGTTGGAGGAAAGGTCAAGAACCAT
GAGGCTCAAAATGTTGAAGAGAGAAGGAGAGAGTTGGCCTTTGAGCGAACGAGACGAAAGAGAGAGCTCTGTAACTCGACCGAGTCGGCAAGAAACGCCAGCCCAAGAGC
AATGAGGAACCAATGAGTTCCATGGGAGAATTTCGGAGTTCTCAAGCGCAGCTTTGAAAGCAAGCAAGCTTTCTCTCTCTATACTAACATCATTTGGGTCTGCAACGCCA
TTGGAGCTCGAAATGCAGAGCTGGAAGGAGACAATGAAAATGAGAACGAAGCGTTTCATCTCCACCCCCATCTGAAAAAACACGATGTTCTCAGTTCTCAGTCAAAAAGA
CGGTGATGGGTAATCTCGAATTCCATGGCCGCTGGAGAGAAATCAAACGAAGGGACGAAGAAGAGGAAGAAGAAGAAGAAGGTTGAGCGCCATAGGTATAATGCAAGGGG
AGGACTACGGCAGAAGAAGCAGTGTCAGAAAGAGGAGGCGCAGCTCACGTGACTTGAAGGATGGTAACGGCAATTCTGTTTTTCTTTAATTTAGACCCCTCAAATCCCCT
GTTATTACTTTCGGGTATTTATTTTCAAGTCCAAAAAGTTTCATGCTTTTTGGCAAACAAGGCTCTTTTTTTGGAGGGGGTTTAAAAAAAAAAAAAAAAACTAGTCTTTT
TGTCACTTCCATCCCTTTTTCTTTATTCTTTATTCATTTTTTGATCAAGTGTTTGTGTTTGTGGAATAGGGAATAAAAATGGAAGCTTTTTGATTGTGGGTTTTGCAAAT
ACTCCCACCATTGTTGGCTGTAATTTCAAGCCCTAATAGCCTTATTCCTATATGGGTCTATACTTTTCCCTAAGTTGCATTTATTCTAATCTTTTTTTGAAAAATAATTG
TGGAAAAGAATCCAAATTGGTCAAAAGGGGG
Protein sequenceShow/hide protein sequence
MVLQNFSTYGNLLLQVVLSLSLSVFLIFFRIPTLFLHGIFTYIHPDNASSGVRAAIRRPENSGSGSGLDGYRNLSSNTASEIRKRTKSKDKVEFDESKAQIFRLKLDENH
LQTRIYFKEYRDGFTFTFVGISCLLLQSFLGVSKSSGIWGNGISVPLLFSIFAGCKLFITLAKVAMEKSASRTLDRQLSLLFGVSGLLFGLLTCSALTPLILDFNLGEIG
GPGACFVAILMGCFAGFLFIPATKIARSFWLGTDQIRCNLEMVYCGWFSRMILYASQMAMAFTTLLWVNPLTEIFINKNIGESASEHTFSEIRNADRLVGNMGFSKPDFV
KLRLWCLSLSGLLQIIAVRSNLQMFLNEALLSWYQRLHAGKVPDLDFSRAKVFLHNHYLCVSALQFFAPPALVSLFVGLSQIDVNSLENTPLGFRKVVSDRCEFANECFR
RGEKQLLCEIQRRKLATPTAIPTTQVLTLTGNYGEDQVISSNATPARAIAELIDENDRLRKEKVRLTEQLVEVKSLCNNIFSLMSSFVENQFESSFKVRESVLTSRTSLN
LFPMKQSSGEDEKAERNPIGAPVGAKRPREHRERAAAAEGDTTSRLQSPDRSEVKSERSHCQNNVDNQNTWLNQVH