CuGenDBv2

Sgr014687 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr014687
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: basic helix-loop-helix (bHLH) DNA-binding superfamily protein
Genome location: tig00000892:871082..876996
RNA-Seq Expression: Sgr014687
Synteny: Sgr014687
Gene Ontology terms: GO:0046983 - protein dimerization activity (molecular function)
InterPro domains: IPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
                  IPR036638 - Helix-loop-helix DNA-binding domain superfamily
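
The genome location of this record follows a contig:start..end pattern. A minimal Python sketch for splitting such a string into its parts is given here. The function name parse_location is hypothetical, and treating the coordinates as 1-based and inclusive is an assumption that this page does not state.

```python
import re


def parse_location(location):
    """Split a 'contig:start..end' string into (contig, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    contig, start, end = match.groups()
    return contig, int(start), int(end)


contig, start, end = parse_location("tig00000892:871082..876996")
# Span of the gene model, ASSUMING 1-based inclusive coordinates.
span = end - start + 1
print(contig, start, end, span)
```

Under that assumption the gene model spans 5915 bp on the contig.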


Homology
GenBank top hits (e value, % identity, alignment)
KAG7035841.1 hypothetical protein SDJN02_02640, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 8.4e-198 | % identity: 80.76
Query:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV
        M+D+ A  ENVF+ FPT+VSF+SPVHTPS RRLSS+FT PR PVPAAR+LAWVSLQGRLVNAE+ASSVRSIRG  G DEAIAWQLFSPIERFLIVAVIGV
Subjt:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV

Query:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP
        AVSES KNHQI QL+RAV+LRDQVLLSMQQKLDDLCDQV P+KDQ +TENDM  RKNADLADSGAFG DKIKFVDCGCW+CDQH  +FSGLEQ NTATKP
Subjt:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP

Query:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAA--EIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIIS
        SCGAEMLQYKMPL NEAEQEERRMSDLSDWASSVTSAA  ++QMNTLS+EQDMLFLKKDC+EKD TIKELT+LLHS+EVSGSQR+SELEDIIRRKNMIIS
Subjt:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAA--EIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIIS

Query:  KLKKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQ-PPPTRKQEIHHIQNSEPCLIRTQQSATKKKPA
        KL+KDMVVLEQKV+QLTRLRRP  SSCASN ++QPIP+MTDNL+YDME STSPSSSDSDC PSES Q PPPTRK EIH IQ SEPCL RTQ+SATKK+P+
Subjt:  KLKKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQ-PPPTRKQEIHHIQNSEPCLIRTQQSATKKKPA

Query:  TSDSRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKDTPQRKRNI
        TSDSRS+PQ+ATPLKE+ M NRK E  S S        R+R   VRASGDSTN RRR QT  AKDTPQRKRN+
Subjt:  TSDSRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKDTPQRKRNI

XP_022148706.1 uncharacterized protein LOC111017303 isoform X1 [Momordica charantia] | e value: 5.1e-203 | % identity: 82.8
Query:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV
        MDD    QENVF+ FPT+VSFASPV TPS RRLSSNFT PR PVPAAR+L+WVSLQGRL+NA++ASSVRSI GGLG DEAIAWQLFSPIERFL+VAVIGV
Subjt:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV

Query:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP
        AVSES KNHQI QLK+AV+LRDQVLLSMQQKLDDLCDQVNPVKD+S TE DMAL+KNADLADSGAFGH+KIKFVDCGCWLCD+HL++F+GLEQGNTATKP
Subjt:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP

Query:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAAEIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIISKL
         CGAEMLQYK+PL NEAEQEERRMSDLSDWASSVTSAA+IQMNTLS+EQDMLFLKKDCDEKD TIKELT+LLHSSEVSGSQRISELEDIIRRKNMIISKL
Subjt:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAAEIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIISKL

Query:  KKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQPPPTRKQEIHHIQNSEPCLIRTQQSATKKKPATSD
        KKDMVVLEQKVVQLTRLRRP  SSCASN+D+QPIPHMTDNL+YDME S+SPSSSDSDC        PPTRKQ IHH+QNSEPCLIRTQ+SATKKKP T D
Subjt:  KKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQPPPTRKQEIHHIQNSEPCLIRTQQSATKKKPATSD

Query:  SRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKD-TPQRKRNI
        SRS+ QMA PLKEI M NRK  VTS S      R+R     VRASGDSTNIRRRFQTA AKD TPQRKRNI
Subjt:  SRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKD-TPQRKRNI

XP_022148707.1 uncharacterized protein LOC111017303 isoform X2 [Momordica charantia] | e value: 6.2e-201 | % identity: 82.59
Query:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV
        MDD    QENVF+ FPT+VSFASPV TPS RRLSSNFT PR PVPAAR+L+WVSLQGRL+NA++ASSVRSI GGLG DEAIAWQLFSPIERFL+VAVIGV
Subjt:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV

Query:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP
        AVSES KNHQI QLK+AV+LRDQVLLSMQQKLDDLCDQVNPVKD+S TE DMAL+KNADLADSGAFGH+KIKFVDCGCWLCD+HL++F+GLE GNTATKP
Subjt:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP

Query:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAAEIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIISKL
         CGAEMLQYK+PL NEAEQEERRMSDLSDWASSVTSAA+IQMNTLS+EQDMLFLKKDCDEKD TIKELT+LLHSSEVSGSQRISELEDIIRRKNMIISKL
Subjt:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAAEIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIISKL

Query:  KKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQPPPTRKQEIHHIQNSEPCLIRTQQSATKKKPATSD
        KKDMVVLEQKVVQLTRLRRP  SSCASN+D+QPIPHMTDNL+YDME S+SPSSSDSDC        PPTRKQ IHH+QNSEPCLIRTQ+SATKKKP T D
Subjt:  KKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQPPPTRKQEIHHIQNSEPCLIRTQQSATKKKPATSD

Query:  SRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKD-TPQRKRNI
        SRS+ QMA PLKEI M NRK  VTS S      R+R     VRASGDSTNIRRRFQTA AKD TPQRKRNI
Subjt:  SRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKD-TPQRKRNI

XP_022958664.1 uncharacterized protein LOC111459819 isoform X1 [Cucurbita moschata] | e value: 2.9e-198 | % identity: 80.97
Query:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV
        M+D+ A  ENVF+ FPT+VSF+SPVHTPS RRLSS+FT PR PVPAAR+LAWVSLQGRLVNAE+ASSVRSIRG  G DEAIAWQLFSPIERFLIVAVIGV
Subjt:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV

Query:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP
        AVSES KNHQI QL+RAV+LRDQVLLSMQQKLDDLCDQV P+KDQ +TENDM  RKNADLADSGAFG DKIKFVDCGCW+CDQH  +FSGLEQ NTATKP
Subjt:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP

Query:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAA--EIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIIS
        SCGAEMLQYKMPL NEAEQEERRMSDLSDWASSVTSAA  ++QMNTLS+EQDMLFLKKDC+EKD TIKELT+LLHS+EVSGSQR+SELEDIIRRKNMIIS
Subjt:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAA--EIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIIS

Query:  KLKKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQ-PPPTRKQEIHHIQNSEPCLIRTQQSATKKKPA
        KLKKDMVVLEQKV+QLTRLRRP  SSCASN ++QPIP+MTDNL+YDME STSPSSSDSDC PSESSQ PPPTRK EIHHIQ SEPCL RTQ+SATKK+P+
Subjt:  KLKKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQ-PPPTRKQEIHHIQNSEPCLIRTQQSATKKKPA

Query:  TSDSRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKDTPQRKRNI
        TSDSRS+PQ+ATPLKE+ M NRK E    S        R+R    RASGDSTN RRR QT  AKDTPQRKRN+
Subjt:  TSDSRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKDTPQRKRNI

XP_023534038.1 uncharacterized protein LOC111795710 isoform X1 [Cucurbita pepo subsp. pepo] | e value: 1.4e-197 | % identity: 80.76
Query:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV
        M+D+ A  ENVF  FPT+VSF+SPVHTPS RRLSS+FT PR PVPAAR+LAWVSLQGRLVNAE+ASSVRSIRG  G DEAIAWQLFSPIERFLIVAVIGV
Subjt:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV

Query:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP
        AVSES KNHQI QL+RAV+LRDQVLLSMQQKLDDLCDQV P+KD  +TENDM  RKN DLADSGAFG DKIKFVDCGCW+CDQH  +FSGLEQ NTATKP
Subjt:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP

Query:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAA--EIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIIS
        SCGAEMLQYKMPL NEAEQEERRMSDLSDWASSVTSAA  ++QMNTLS+EQDMLFLKKDC+EKD TIKELT+LLHS+EVSGSQR+SELEDIIRRKNMIIS
Subjt:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAA--EIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIIS

Query:  KLKKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQ-PPPTRKQEIHHIQNSEPCLIRTQQSATKKKPA
        KLKKDMVVLEQKV+QLTRLRRP  SSCASN ++QPIP+MTDNL+YDME STSPSSSDSDC PSESSQ PPPTRK EIH IQ SEPCL RTQ+SATKK+P+
Subjt:  KLKKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQ-PPPTRKQEIHHIQNSEPCLIRTQQSATKKKPA

Query:  TSDSRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKDTPQRKRNI
        TSDSRS+PQ+ATPLKE+ M NRK E  S S        R+R   VRASGDSTN RRR QT  AKDTPQRKRN+
Subjt:  TSDSRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKDTPQRKRNI

TrEMBL top hits (e value, % identity, alignment)
A0A6J1D5T2 uncharacterized protein LOC111017303 isoform X2 | e value: 3.0e-201 | % identity: 82.59
Query:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV
        MDD    QENVF+ FPT+VSFASPV TPS RRLSSNFT PR PVPAAR+L+WVSLQGRL+NA++ASSVRSI GGLG DEAIAWQLFSPIERFL+VAVIGV
Subjt:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV

Query:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP
        AVSES KNHQI QLK+AV+LRDQVLLSMQQKLDDLCDQVNPVKD+S TE DMAL+KNADLADSGAFGH+KIKFVDCGCWLCD+HL++F+GLE GNTATKP
Subjt:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP

Query:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAAEIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIISKL
         CGAEMLQYK+PL NEAEQEERRMSDLSDWASSVTSAA+IQMNTLS+EQDMLFLKKDCDEKD TIKELT+LLHSSEVSGSQRISELEDIIRRKNMIISKL
Subjt:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAAEIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIISKL

Query:  KKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQPPPTRKQEIHHIQNSEPCLIRTQQSATKKKPATSD
        KKDMVVLEQKVVQLTRLRRP  SSCASN+D+QPIPHMTDNL+YDME S+SPSSSDSDC        PPTRKQ IHH+QNSEPCLIRTQ+SATKKKP T D
Subjt:  KKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQPPPTRKQEIHHIQNSEPCLIRTQQSATKKKPATSD

Query:  SRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKD-TPQRKRNI
        SRS+ QMA PLKEI M NRK  VTS S      R+R     VRASGDSTNIRRRFQTA AKD TPQRKRNI
Subjt:  SRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKD-TPQRKRNI

A0A6J1D678 uncharacterized protein LOC111017303 isoform X1 | e value: 2.5e-203 | % identity: 82.8
Query:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV
        MDD    QENVF+ FPT+VSFASPV TPS RRLSSNFT PR PVPAAR+L+WVSLQGRL+NA++ASSVRSI GGLG DEAIAWQLFSPIERFL+VAVIGV
Subjt:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV

Query:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP
        AVSES KNHQI QLK+AV+LRDQVLLSMQQKLDDLCDQVNPVKD+S TE DMAL+KNADLADSGAFGH+KIKFVDCGCWLCD+HL++F+GLEQGNTATKP
Subjt:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP

Query:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAAEIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIISKL
         CGAEMLQYK+PL NEAEQEERRMSDLSDWASSVTSAA+IQMNTLS+EQDMLFLKKDCDEKD TIKELT+LLHSSEVSGSQRISELEDIIRRKNMIISKL
Subjt:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAAEIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIISKL

Query:  KKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQPPPTRKQEIHHIQNSEPCLIRTQQSATKKKPATSD
        KKDMVVLEQKVVQLTRLRRP  SSCASN+D+QPIPHMTDNL+YDME S+SPSSSDSDC        PPTRKQ IHH+QNSEPCLIRTQ+SATKKKP T D
Subjt:  KKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQPPPTRKQEIHHIQNSEPCLIRTQQSATKKKPATSD

Query:  SRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKD-TPQRKRNI
        SRS+ QMA PLKEI M NRK  VTS S      R+R     VRASGDSTNIRRRFQTA AKD TPQRKRNI
Subjt:  SRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKD-TPQRKRNI

A0A6J1H449 uncharacterized protein LOC111459819 isoform X2 | e value: 1.3e-196 | % identity: 80.76
Query:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV
        M+D+ A  ENVF+ FPT+VSF+SPVHTPS RRLSS+FT PR PVPAAR+LAWVSLQGRLVNAE+ASSVRSIRG  G DEAIAWQLFSPIERFLIVAVIGV
Subjt:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV

Query:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP
        AVSES KNHQI QL+RAV+LRDQVLLSMQQKLDDLCDQV P+KDQ +TENDM  RKNADLADSGAFG DKIKFVDCGCW+CDQH  +FSGLE  NTATKP
Subjt:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP

Query:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAA--EIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIIS
        SCGAEMLQYKMPL NEAEQEERRMSDLSDWASSVTSAA  ++QMNTLS+EQDMLFLKKDC+EKD TIKELT+LLHS+EVSGSQR+SELEDIIRRKNMIIS
Subjt:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAA--EIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIIS

Query:  KLKKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQ-PPPTRKQEIHHIQNSEPCLIRTQQSATKKKPA
        KLKKDMVVLEQKV+QLTRLRRP  SSCASN ++QPIP+MTDNL+YDME STSPSSSDSDC PSESSQ PPPTRK EIHHIQ SEPCL RTQ+SATKK+P+
Subjt:  KLKKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQ-PPPTRKQEIHHIQNSEPCLIRTQQSATKKKPA

Query:  TSDSRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKDTPQRKRNI
        TSDSRS+PQ+ATPLKE+ M NRK E    S        R+R    RASGDSTN RRR QT  AKDTPQRKRN+
Subjt:  TSDSRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKDTPQRKRNI

A0A6J1H5R7 uncharacterized protein LOC111459819 isoform X1 | e value: 1.4e-198 | % identity: 80.97
Query:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV
        M+D+ A  ENVF+ FPT+VSF+SPVHTPS RRLSS+FT PR PVPAAR+LAWVSLQGRLVNAE+ASSVRSIRG  G DEAIAWQLFSPIERFLIVAVIGV
Subjt:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV

Query:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP
        AVSES KNHQI QL+RAV+LRDQVLLSMQQKLDDLCDQV P+KDQ +TENDM  RKNADLADSGAFG DKIKFVDCGCW+CDQH  +FSGLEQ NTATKP
Subjt:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP

Query:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAA--EIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIIS
        SCGAEMLQYKMPL NEAEQEERRMSDLSDWASSVTSAA  ++QMNTLS+EQDMLFLKKDC+EKD TIKELT+LLHS+EVSGSQR+SELEDIIRRKNMIIS
Subjt:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAA--EIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIIS

Query:  KLKKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQ-PPPTRKQEIHHIQNSEPCLIRTQQSATKKKPA
        KLKKDMVVLEQKV+QLTRLRRP  SSCASN ++QPIP+MTDNL+YDME STSPSSSDSDC PSESSQ PPPTRK EIHHIQ SEPCL RTQ+SATKK+P+
Subjt:  KLKKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQ-PPPTRKQEIHHIQNSEPCLIRTQQSATKKKPA

Query:  TSDSRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKDTPQRKRNI
        TSDSRS+PQ+ATPLKE+ M NRK E    S        R+R    RASGDSTN RRR QT  AKDTPQRKRN+
Subjt:  TSDSRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKDTPQRKRNI

A0A6J1JXT1 uncharacterized protein LOC111490736 isoform X1 | e value: 1.7e-196 | % identity: 80.34
Query:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV
        M+D+ A  ENVF  FPT+VSF+SPVHTPS RRLSS+FT PR PVPAAR+LAWVSLQGRLVNAE+ASSVRSIRG  G DEAIAWQLFSPIERFLIVAVIGV
Subjt:  MDDEAANQENVFEWFPTVVSFASPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGV

Query:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP
        AVSES KNHQI QL+RAV+LRDQVLLSMQQKLDDLCDQV  +KD  +TENDM  RKN DLADSGAFG DKIKFVDCGCW+CDQH  +FSGLEQ NTATKP
Subjt:  AVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKP

Query:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAA--EIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIIS
        SCGAEMLQYKMPL NEAEQEERRMSDLSDWASSVTSAA  ++QMNTLS EQDMLFLKKDC+EKD TIKELT LLHS+EVSGSQR+SELEDIIRRKNMIIS
Subjt:  SCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAA--EIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIIS

Query:  KLKKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQ-PPPTRKQEIHHIQNSEPCLIRTQQSATKKKPA
        KLKKDMVVLEQKV+QLTRLRRP  SSCASN ++QPIP+MTDNL+YDME STSPSSSDSDC PSESSQ PPPTRK EIHHI+ SEPCL RTQ+SATKK+P+
Subjt:  KLKKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQ-PPPTRKQEIHHIQNSEPCLIRTQQSATKKKPA

Query:  TSDSRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKDTPQRKRNI
        TSDSRS+PQ+ATPLKE+ M NRK E  S S        ++R   VRASGDSTN RRR QT  AKDTPQRKRN+
Subjt:  TSDSRSRPQMATPLKEIWM-NRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKDTPQRKRNI

SwissProt top hits (e value, % identity, alignment)
Q66GR3 Transcription factor bHLH130 | e value: 1.0e-25 | % identity: 69.66
Query:  EDSAMLSDDVKMSEN-QFSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL
        + S+  SD V + +  Q  DSVPCKIRAKRGCATHPRSIAERVRRTRI++R+RKLQ+LVPNMDKQ NTS+MLDLA++YIK LQ+Q + L
Subjt:  EDSAMLSDDVKMSEN-QFSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL

Q8H102 Transcription factor bHLH128 | e value: 9.2e-22 | % identity: 75.71
Query:  DSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL
        DSVPCKIRAKRGCATHPRSIAER RRTRI+ +++KLQ LVPNMDKQ + S+MLDLA+++IKGLQ Q+Q L
Subjt:  DSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL

Q9C690 Transcription factor bHLH122 | e value: 4.7e-34 | % identity: 34.41
Query:  MESDIQEHHRLLHDHQYELHHRQTNSAFTRHQSAPSSYFTSLVDRKLCEQFVSRPSSPETERIFARFMTSGSGTQHTSS--HRRRESPVGEVGMVTAEAE
        MES+ Q+HH LLHDHQ   H R  NS   R+QSAPSSYF+S  +    E+F+ RP+SPETERI + F+ +   + +  S  H    S   E      + E
Subjt:  MESDIQEHHRLLHDHQYELHHRQTNSAFTRHQSAPSSYFTSLVDRKLCEQFVSRPSSPETERIFARFMTSGSGTQHTSS--HRRRESPVGEVGMVTAEAE

Query:  RKTQYLASTTVKSEAADVIQKQSDID---QYRSGLHAFHQNQWRP-------PLPNKSINSGREANY--SHSMEMGWLAPMRT---GGNCSLLRQSSSPA
         +   +  T   +    V+    +I    +   G  A      RP       P+ N + ++   A    S  +E  + A M++    G  +++  S++ A
Subjt:  RKTQYLASTTVKSEAADVIQKQSDID---QYRSGLHAFHQNQWRP-------PLPNKSINSGREANY--SHSMEMGWLAPMRT---GGNCSLLRQSSSPA

Query:  -------ELLPP----------INVESGFPFEC----------------------------------------WEDSA----------MLSDDVKMSENQ
               +LLPP          ++V+ GF                                             EDSA           L   +   E  
Subjt:  -------ELLPP----------INVESGFPFEC----------------------------------------WEDSA----------MLSDDVKMSENQ

Query:  FSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL
         SDS+PCKIRAKRGCATHPRSIAERVRRT+I++R+RKLQ LVPNMD Q NT++MLDLA++YIK LQ+QV+ L
Subjt:  FSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL

Q9C8P8 Transcription factor bHLH80 | e value: 5.8e-24 | % identity: 75
Query:  FSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL
        F DSVPC++RAKRGCATHPRSIAERVRRTRI+ RIR+LQ+LVPNMDKQ NT++ML+ A+EY+K LQ Q+Q L
Subjt:  FSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL

Q9M0R0 Transcription factor bHLH81 | e value: 2.4e-22 | % identity: 72
Query:  ENQFSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL
        EN   DSV  ++RAKRGCATHPRSIAERVRRTRI+ RIRKLQ+LVPNMDKQ NT++ML+ A+EY+K LQ+Q+Q L
Subjt:  ENQFSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL

Arabidopsis top hits (e value, % identity, alignment)
AT1G35460.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 4.1e-25 | % identity: 75
Query:  FSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL
        F DSVPC++RAKRGCATHPRSIAERVRRTRI+ RIR+LQ+LVPNMDKQ NT++ML+ A+EY+K LQ Q+Q L
Subjt:  FSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL

AT1G51140.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 3.3e-35 | % identity: 34.41
Query:  MESDIQEHHRLLHDHQYELHHRQTNSAFTRHQSAPSSYFTSLVDRKLCEQFVSRPSSPETERIFARFMTSGSGTQHTSS--HRRRESPVGEVGMVTAEAE
        MES+ Q+HH LLHDHQ   H R  NS   R+QSAPSSYF+S  +    E+F+ RP+SPETERI + F+ +   + +  S  H    S   E      + E
Subjt:  MESDIQEHHRLLHDHQYELHHRQTNSAFTRHQSAPSSYFTSLVDRKLCEQFVSRPSSPETERIFARFMTSGSGTQHTSS--HRRRESPVGEVGMVTAEAE

Query:  RKTQYLASTTVKSEAADVIQKQSDID---QYRSGLHAFHQNQWRP-------PLPNKSINSGREANY--SHSMEMGWLAPMRT---GGNCSLLRQSSSPA
         +   +  T   +    V+    +I    +   G  A      RP       P+ N + ++   A    S  +E  + A M++    G  +++  S++ A
Subjt:  RKTQYLASTTVKSEAADVIQKQSDID---QYRSGLHAFHQNQWRP-------PLPNKSINSGREANY--SHSMEMGWLAPMRT---GGNCSLLRQSSSPA

Query:  -------ELLPP----------INVESGFPFEC----------------------------------------WEDSA----------MLSDDVKMSENQ
               +LLPP          ++V+ GF                                             EDSA           L   +   E  
Subjt:  -------ELLPP----------INVESGFPFEC----------------------------------------WEDSA----------MLSDDVKMSENQ

Query:  FSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL
         SDS+PCKIRAKRGCATHPRSIAERVRRT+I++R+RKLQ LVPNMD Q NT++MLDLA++YIK LQ+QV+ L
Subjt:  FSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL

AT2G42280.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 7.5e-27 | % identity: 69.66
Query:  EDSAMLSDDVKMSEN-QFSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL
        + S+  SD V + +  Q  DSVPCKIRAKRGCATHPRSIAERVRRTRI++R+RKLQ+LVPNMDKQ NTS+MLDLA++YIK LQ+Q + L
Subjt:  EDSAMLSDDVKMSEN-QFSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL

AT4G09180.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 1.7e-23 | % identity: 72
Query:  ENQFSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL
        EN   DSV  ++RAKRGCATHPRSIAERVRRTRI+ RIRKLQ+LVPNMDKQ NT++ML+ A+EY+K LQ+Q+Q L
Subjt:  ENQFSDSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL

AT5G12930.1 unknown protein | e value: 2.4e-81 | % identity: 43.78
Query:  SPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGVAVSESNKNHQIAQLKRAVQLRD
        +P  +P  RRLS++FT+  +P+ ++  LA++SLQG LVN++ ASS RSI GGL ++E++AW+LF+P +RFL+VAVIGVA ++S KN  I QL+++V LRD
Subjt:  SPVHTPSPRRLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGVAVSESNKNHQIAQLKRAVQLRD

Query:  QVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKPSCGAEMLQYKMP--LTNEAEQE
        Q+L SMQQKLDDLC ++N  KDQS   + ++       A    FG ++I FVDCGCWLCDQH                   +  +Q K P  L  +AE E
Subjt:  QVLLSMQQKLDDLCDQVNPVKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKPSCGAEMLQYKMP--LTNEAEQE

Query:  ERRMSDLSDWASSVTSAAEIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIISKLKKDMVVLEQKVVQLTRLRRP
        ERRMS +SDW SSVTSAAE   ++LS++QDML L+K+C EKD TIK+LTS L  +  +GS+R +ELE+II RK  II KLK+D++VLE KV QLTRLRR 
Subjt:  ERRMSDLSDWASSVTSAAEIQMNTLSMEQDMLFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIISKLKKDMVVLEQKVVQLTRLRRP

Query:  SSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQPPPTRKQEIHHIQNSEPCLIRTQQSATKKKPATSDSRSRPQMATPLKEIWMNRK-
        S S   SNT     P   DNL+YDM+  T+ SSSDS+ +                 +   +  ++     + K++PAT    ++   A     +  + K 
Subjt:  SSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPSESSQPPPTRKQEIHHIQNSEPCLIRTQQSATKKKPATSDSRSRPQMATPLKEIWMNRK-

Query:  PEVTSASRERERERERERGKVVRAS--GDSTNIRRRFQTAGAKDTPQRKR
        P V S S  R+        +V R S  GDS   RR  QT     +   KR
Subjt:  PEVTSASRERERERERERGKVVRAS--GDSTNIRRRFQTAGAKDTPQRKR
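
In BLAST-style alignments such as those above, the unlabeled middle line repeats a letter where query and subject are identical, shows "+" for a conservative substitution, and leaves a space for a mismatch or gap. The % identity value of a hit can therefore be recomputed from that middle line. The Python sketch below does this for the SwissProt hit Q8H102; the function name percent_identity is hypothetical, and the calculation assumes that identity is reported as identical columns divided by the full alignment length.

```python
def percent_identity(midline, alignment_length):
    """Percent identity from a BLAST-style alignment midline.

    Letters in the midline mark identical residues; '+' and spaces
    mark non-identical columns and are not counted.
    """
    identities = sum(1 for ch in midline if ch.isalpha())
    return 100.0 * identities / alignment_length


# Query line and midline of the Q8H102 hit, copied from this page.
query = "DSVPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFL"
midline = "DSVPCKIRAKRGCATHPRSIAER RRTRI+ +++KLQ LVPNMDKQ + S+MLDLA+++IKGLQ Q+Q L"

# The alignment length is the number of columns in the query line.
print(round(percent_identity(midline, len(query)), 2))
```

For this hit the midline holds 53 identical positions over 70 columns, which reproduces the 75.71 listed for Q8H102.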


Sequences
CDS sequence
ATGGAGTCGGATATTCAGGAACATCACCGTCTTCTTCACGATCACCAATACGAACTTCACCACCGACAGACGAACTCTGCCTTCACTCGGCATCAATCGGCGCCGAGCTC
ATATTTCACCAGTCTCGTGGACCGGAAACTCTGCGAACAATTCGTGAGCAGGCCATCGAGTCCCGAGACGGAGCGAATCTTCGCTCGATTTATGACCAGTGGCAGCGGCA
CACAACACACGTCATCACACCGCCGGAGGGAATCTCCGGTGGGCGAAGTGGGGATGGTAACCGCAGAAGCAGAGCGGAAAACTCAGTATTTAGCGTCAACGACGGTGAAA
AGCGAAGCAGCAGACGTAATTCAAAAACAGAGCGACATCGACCAATACAGATCTGGTTTACATGCTTTCCATCAAAATCAATGGAGACCCCCTTTGCCAAATAAGAGCAT
TAACTCGGGGAGGGAAGCTAATTATAGTCATTCAATGGAAATGGGTTGGTTGGCGCCTATGAGAACGGGCGGCAATTGCAGTCTTCTTCGACAGAGCAGCTCACCTGCTG
AGTTGCTGCCCCCCATCAACGTTGAAAGTGGATTCCCATTTGAGTGTTGGGAGGACAGCGCAATGTTATCAGATGATGTGAAGATGTCCGAAAATCAGTTTTCAGATTCT
GTGCCCTGCAAAATTCGTGCCAAGCGGGGCTGTGCAACTCACCCCAGAAGCATTGCCGAGAGAGTTAGACGAACAAGAATTACCCAAAGGATAAGGAAATTACAACAACT
TGTACCCAATATGGATAAGCAAGCAAACACATCAGAAATGTTGGATTTGGCTATTGAGTACATAAAAGGCCTTCAGAAACAAGTGCAGTTCCTTCCTGCAATTGCATTTT
TTTTATTTCTCAACGCCATGGATGATGAAGCTGCCAACCAGGAAAACGTTTTTGAATGGTTCCCCACTGTGGTCTCTTTTGCATCTCCTGTTCATACACCTTCACCGCGC
CGGCTATCCAGCAACTTCACTCAACCTCGGCGACCGGTCCCTGCCGCTCGACAACTAGCTTGGGTTTCTCTCCAGGGACGCCTTGTTAACGCCGAACGCGCCAGCTCGGT
TCGATCTATTAGAGGTGGATTGGGCCAGGACGAAGCTATCGCTTGGCAGTTGTTCAGCCCGATTGAGAGGTTTCTTATAGTTGCCGTCATCGGCGTCGCGGTTTCCGAGT
CCAACAAGAACCATCAGATCGCCCAGCTTAAAAGAGCTGTCCAACTTAGGGATCAAGTGCTTCTAAGCATGCAGCAGAAGCTGGATGATCTATGTGATCAAGTCAATCCC
GTTAAGGATCAATCTAAAACTGAGAACGACATGGCTTTGAGAAAGAATGCTGATTTGGCAGACTCAGGAGCTTTTGGCCACGACAAAATTAAGTTTGTTGATTGTGGTTG
TTGGCTTTGTGATCAGCATCTTGATATGTTCAGTGGTTTGGAGCAGGGTAACACCGCCACAAAACCCTCTTGCGGGGCAGAGATGTTACAATACAAAATGCCACTCACGA
ACGAAGCAGAACAAGAGGAGCGTCGAATGTCTGATTTGTCAGATTGGGCTTCCAGTGTCACGTCTGCTGCTGAAATACAGATGAACACCTTATCAATGGAACAAGATATG
TTATTTCTGAAGAAAGATTGTGACGAGAAAGACGGAACCATCAAGGAATTAACTTCTTTACTCCACTCGTCTGAGGTTTCTGGTTCACAGAGGATTTCAGAGTTGGAAGA
CATCATTCGTCGGAAGAACATGATAATTTCAAAACTAAAGAAGGACATGGTGGTTCTTGAACAGAAGGTTGTTCAACTGACGAGGCTTCGGAGACCCTCTTCGTCTTCGT
GTGCATCAAACACGGACGTCCAGCCAATCCCCCACATGACTGATAACCTAATTTATGACATGGAATGTAGCACCAGTCCTTCATCTTCCGACTCAGATTGCTCCCCATCG
GAAAGTTCACAACCTCCCCCAACAAGAAAGCAGGAGATTCATCATATCCAGAATAGCGAGCCTTGTTTGATAAGAACCCAGCAATCAGCGACTAAGAAGAAACCTGCAAC
TTCAGATTCTCGCTCAAGACCTCAGATGGCAACCCCACTTAAAGAAATATGGATGAATCGAAAACCAGAAGTTACGTCGGCATCCAGGGAAAGGGAAAGGGAAAGGGAAA
GGGAAAGGGGGAAGGTGGTACGTGCGAGTGGCGATTCCACAAATATCAGAAGACGGTTTCAAACTGCTGGGGCCAAGGATACACCTCAACGAAAGAGAAATATCAAGTAA
mRNA sequence
ATGGAGTCGGATATTCAGGAACATCACCGTCTTCTTCACGATCACCAATACGAACTTCACCACCGACAGACGAACTCTGCCTTCACTCGGCATCAATCGGCGCCGAGCTC
ATATTTCACCAGTCTCGTGGACCGGAAACTCTGCGAACAATTCGTGAGCAGGCCATCGAGTCCCGAGACGGAGCGAATCTTCGCTCGATTTATGACCAGTGGCAGCGGCA
CACAACACACGTCATCACACCGCCGGAGGGAATCTCCGGTGGGCGAAGTGGGGATGGTAACCGCAGAAGCAGAGCGGAAAACTCAGTATTTAGCGTCAACGACGGTGAAA
AGCGAAGCAGCAGACGTAATTCAAAAACAGAGCGACATCGACCAATACAGATCTGGTTTACATGCTTTCCATCAAAATCAATGGAGACCCCCTTTGCCAAATAAGAGCAT
TAACTCGGGGAGGGAAGCTAATTATAGTCATTCAATGGAAATGGGTTGGTTGGCGCCTATGAGAACGGGCGGCAATTGCAGTCTTCTTCGACAGAGCAGCTCACCTGCTG
AGTTGCTGCCCCCCATCAACGTTGAAAGTGGATTCCCATTTGAGTGTTGGGAGGACAGCGCAATGTTATCAGATGATGTGAAGATGTCCGAAAATCAGTTTTCAGATTCT
GTGCCCTGCAAAATTCGTGCCAAGCGGGGCTGTGCAACTCACCCCAGAAGCATTGCCGAGAGAGTTAGACGAACAAGAATTACCCAAAGGATAAGGAAATTACAACAACT
TGTACCCAATATGGATAAGCAAGCAAACACATCAGAAATGTTGGATTTGGCTATTGAGTACATAAAAGGCCTTCAGAAACAAGTGCAGTTCCTTCCTGCAATTGCATTTT
TTTTATTTCTCAACGCCATGGATGATGAAGCTGCCAACCAGGAAAACGTTTTTGAATGGTTCCCCACTGTGGTCTCTTTTGCATCTCCTGTTCATACACCTTCACCGCGC
CGGCTATCCAGCAACTTCACTCAACCTCGGCGACCGGTCCCTGCCGCTCGACAACTAGCTTGGGTTTCTCTCCAGGGACGCCTTGTTAACGCCGAACGCGCCAGCTCGGT
TCGATCTATTAGAGGTGGATTGGGCCAGGACGAAGCTATCGCTTGGCAGTTGTTCAGCCCGATTGAGAGGTTTCTTATAGTTGCCGTCATCGGCGTCGCGGTTTCCGAGT
CCAACAAGAACCATCAGATCGCCCAGCTTAAAAGAGCTGTCCAACTTAGGGATCAAGTGCTTCTAAGCATGCAGCAGAAGCTGGATGATCTATGTGATCAAGTCAATCCC
GTTAAGGATCAATCTAAAACTGAGAACGACATGGCTTTGAGAAAGAATGCTGATTTGGCAGACTCAGGAGCTTTTGGCCACGACAAAATTAAGTTTGTTGATTGTGGTTG
TTGGCTTTGTGATCAGCATCTTGATATGTTCAGTGGTTTGGAGCAGGGTAACACCGCCACAAAACCCTCTTGCGGGGCAGAGATGTTACAATACAAAATGCCACTCACGA
ACGAAGCAGAACAAGAGGAGCGTCGAATGTCTGATTTGTCAGATTGGGCTTCCAGTGTCACGTCTGCTGCTGAAATACAGATGAACACCTTATCAATGGAACAAGATATG
TTATTTCTGAAGAAAGATTGTGACGAGAAAGACGGAACCATCAAGGAATTAACTTCTTTACTCCACTCGTCTGAGGTTTCTGGTTCACAGAGGATTTCAGAGTTGGAAGA
CATCATTCGTCGGAAGAACATGATAATTTCAAAACTAAAGAAGGACATGGTGGTTCTTGAACAGAAGGTTGTTCAACTGACGAGGCTTCGGAGACCCTCTTCGTCTTCGT
GTGCATCAAACACGGACGTCCAGCCAATCCCCCACATGACTGATAACCTAATTTATGACATGGAATGTAGCACCAGTCCTTCATCTTCCGACTCAGATTGCTCCCCATCG
GAAAGTTCACAACCTCCCCCAACAAGAAAGCAGGAGATTCATCATATCCAGAATAGCGAGCCTTGTTTGATAAGAACCCAGCAATCAGCGACTAAGAAGAAACCTGCAAC
TTCAGATTCTCGCTCAAGACCTCAGATGGCAACCCCACTTAAAGAAATATGGATGAATCGAAAACCAGAAGTTACGTCGGCATCCAGGGAAAGGGAAAGGGAAAGGGAAA
GGGAAAGGGGGAAGGTGGTACGTGCGAGTGGCGATTCCACAAATATCAGAAGACGGTTTCAAACTGCTGGGGCCAAGGATACACCTCAACGAAAGAGAAATATCAAGTAA
Protein sequence
MESDIQEHHRLLHDHQYELHHRQTNSAFTRHQSAPSSYFTSLVDRKLCEQFVSRPSSPETERIFARFMTSGSGTQHTSSHRRRESPVGEVGMVTAEAERKTQYLASTTVK
SEAADVIQKQSDIDQYRSGLHAFHQNQWRPPLPNKSINSGREANYSHSMEMGWLAPMRTGGNCSLLRQSSSPAELLPPINVESGFPFECWEDSAMLSDDVKMSENQFSDS
VPCKIRAKRGCATHPRSIAERVRRTRITQRIRKLQQLVPNMDKQANTSEMLDLAIEYIKGLQKQVQFLPAIAFFLFLNAMDDEAANQENVFEWFPTVVSFASPVHTPSPR
RLSSNFTQPRRPVPAARQLAWVSLQGRLVNAERASSVRSIRGGLGQDEAIAWQLFSPIERFLIVAVIGVAVSESNKNHQIAQLKRAVQLRDQVLLSMQQKLDDLCDQVNP
VKDQSKTENDMALRKNADLADSGAFGHDKIKFVDCGCWLCDQHLDMFSGLEQGNTATKPSCGAEMLQYKMPLTNEAEQEERRMSDLSDWASSVTSAAEIQMNTLSMEQDM
LFLKKDCDEKDGTIKELTSLLHSSEVSGSQRISELEDIIRRKNMIISKLKKDMVVLEQKVVQLTRLRRPSSSSCASNTDVQPIPHMTDNLIYDMECSTSPSSSDSDCSPS
ESSQPPPTRKQEIHHIQNSEPCLIRTQQSATKKKPATSDSRSRPQMATPLKEIWMNRKPEVTSASRERERERERERGKVVRASGDSTNIRRRFQTAGAKDTPQRKRNIK
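
The protein sequence corresponds to a codon-by-codon translation of the CDS sequence. The Python sketch below shows that translation on the first and last codons of the CDS. It assumes the standard genetic code (NCBI translation table 1), which this page does not state explicitly, and the names CODON_TABLE and translate are hypothetical.

```python
from itertools import product

# Standard genetic code (assumed), with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence; '*' marks a stop codon."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[cds[i:i + 3]] for i in range(0, usable, 3))


# First six codons of the CDS above, then its final two codons.
print(translate("ATGGAGTCGGATATTCAG"))
print(translate("AAGTAA"))
```

The first call gives MESDIQ, matching the start of the protein sequence, and the second gives K followed by the stop marker, matching its final residue.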