CuGenDBv2

Spg002096 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg002096
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: transcription factor UNE12-like
Genome location: scaffold2:26722667..26730337
RNA-Seq Expression: Spg002096
Synteny: Spg002096
Gene Ontology terms:
GO:0006357 - regulation of transcription by RNA polymerase II (biological process)
GO:0005634 - nucleus (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domains:
IPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
IPR036638 - Helix-loop-helix DNA-binding domain superfamily
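The genome location in the record above is written as a sequence name, a colon, and a start..end pair. A minimal sketch of how such a string could be split into its parts and turned into a span length is given here. The names `GenomeLocation` and `parse_location` are hypothetical helpers, not part of any CuGenDB tool, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'seqid:start..end' location."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'scaffold2:26722667..26730337'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    # Normalise so that start <= end, whatever order was given.
    if start > end:
        start, end = end, start
    return GenomeLocation(seqid, start, end)


loc = parse_location("scaffold2:26722667..26730337")
print(loc.seqid, loc.start, loc.end, loc.length)
```

Under the inclusive-coordinate assumption the gene spans 7671 positions on scaffold2.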


Homology
GenBank top hits (e value, % identity, alignment)
XP_004135655.1 transcription factor UNE12 [Cucumis sativus] (e value: 3.7e-148; % identity: 81.39)
Query:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST
        MAGIPSEGLGDDFFEQ+L+VPP+YGG GGGD+VS+PMGLQLGSGGGGG  G       GGM MPLGLNLEQGFLRQERFREE+D  HN+N+ NNASSSST
Subjt:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST

Query:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN
        ASSGITERDSS+QHMTSLFPTFGHLQTQ LRPPPPLHLHQ FHNQT  GTV A+PQPPQVRPRVRARRGQATDPHSIAER                    
Subjt:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN

Query:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK
                       LRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESG+NQQAWEK
Subjt:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK

Query:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP
        WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADP I+VKPEMNTP
Subjt:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP

XP_008450747.1 PREDICTED: transcription factor UNE12-like isoform X1 [Cucumis melo] (e value: 6.4e-148; % identity: 81.39)
Query:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST
        MAGIPSEGLGDDFFEQ+L+VPP+YGG GGGD+VS+PMGLQLGSGGGGG  G       GGM MPLGLNLEQGFLRQERFREE+D  HN+N+ NNASSSST
Subjt:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST

Query:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN
        ASSGITERDSS+QHMT+LFPTFGHLQTQ LRPPPPLHLHQ FHNQT  GTV AVPQPPQVRPRVRARRGQATDPHSIAER                    
Subjt:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN

Query:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK
                       LRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESG+NQQAWEK
Subjt:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK

Query:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP
        WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADP I+VKPEMNTP
Subjt:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP

XP_008450748.1 PREDICTED: transcription factor UNE12-like isoform X2 [Cucumis melo] (e value: 3.6e-143; % identity: 80.28)
Query:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST
        MAGIPSEGLGDDFFEQ+L+VPP+YGG GGGD+VS+PMGLQLGSGGGGG  G       GGM MPLGLNLEQGFLRQERFREE+D  HN+N+ NNASSSST
Subjt:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST

Query:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN
        ASSGITERDSS+QHMT+LFPTFGHLQTQ LRPPPPLHLH     QT  GTV AVPQPPQVRPRVRARRGQATDPHSIAER                    
Subjt:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN

Query:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK
                       LRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESG+NQQAWEK
Subjt:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK

Query:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP
        WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADP I+VKPEMNTP
Subjt:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP

XP_022965926.1 transcription factor UNE12-like isoform X1 [Cucurbita maxima] (e value: 2.1e-143; % identity: 80)
Query:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGM-GLQ-----GGMAMPLGLNLEQGFLRQERFREELDAHN---NNTT
        MAGIPSEGLGDDFFEQIL+V P Y  G  GDVVS+PMGLQLGSGGGGG  GLRG+ GLQ     GGM MPLGLNLEQGFLRQERFR+EL+ +N   NNTT
Subjt:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGM-GLQ-----GGMAMPLGLNLEQGFLRQERFREELDAHN---NNTT

Query:  NNASSSSTASSGITERDSSVQHMTSLFPTFGHL-QTQPLR-PPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNI
        NNASSSSTASSGITERDSSVQHM +LFP+FGHL QTQ LR PPPPLHLHQ FHNQT PG V AVPQP Q+RPRVRARRGQATDPHSIAER          
Subjt:  NNASSSSTASSGITERDSSVQHMTSLFPTFGHL-QTQPLR-PPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNI

Query:  SGDLTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIE
                                 LRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIE
Subjt:  SGDLTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIE

Query:  SGSNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP
        SG+NQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPI+VKPEMNTP
Subjt:  SGSNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP

XP_038880010.1 transcription factor UNE12-like [Benincasa hispida] (e value: 3.4e-149; % identity: 82.73)
Query:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELDAHNNNTTNNASSSSTA
        MAGIPSEGLGDDFFEQIL+VP AYGGGGGGD+VSMPMGLQLGSGGGGG          GGM MPLGLNLEQGFLRQERFREE+D HN    NNASSSSTA
Subjt:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELDAHNNNTTNNASSSSTA

Query:  SSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKING
        SSGITERDSS+QHMT+LFPTFGHLQTQ LRPPPPLHLHQ FHNQTTPGTV AVPQPPQVRPRVRARRGQATDPHSIAER                     
Subjt:  SSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKING

Query:  LLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEKW
                      LRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEKW
Subjt:  LLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEKW

Query:  SSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP
        SSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADP I+VKPEMNTP
Subjt:  SSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LYN0 BHLH domain-containing protein (e value: 1.8e-148; % identity: 81.39)
Query:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST
        MAGIPSEGLGDDFFEQ+L+VPP+YGG GGGD+VS+PMGLQLGSGGGGG  G       GGM MPLGLNLEQGFLRQERFREE+D  HN+N+ NNASSSST
Subjt:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST

Query:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN
        ASSGITERDSS+QHMTSLFPTFGHLQTQ LRPPPPLHLHQ FHNQT  GTV A+PQPPQVRPRVRARRGQATDPHSIAER                    
Subjt:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN

Query:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK
                       LRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESG+NQQAWEK
Subjt:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK

Query:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP
        WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADP I+VKPEMNTP
Subjt:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP

A0A1S3BPW1 transcription factor UNE12-like isoform X2 (e value: 1.7e-143; % identity: 80.28)
Query:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST
        MAGIPSEGLGDDFFEQ+L+VPP+YGG GGGD+VS+PMGLQLGSGGGGG  G       GGM MPLGLNLEQGFLRQERFREE+D  HN+N+ NNASSSST
Subjt:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST

Query:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN
        ASSGITERDSS+QHMT+LFPTFGHLQTQ LRPPPPLHLH     QT  GTV AVPQPPQVRPRVRARRGQATDPHSIAER                    
Subjt:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN

Query:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK
                       LRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESG+NQQAWEK
Subjt:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK

Query:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP
        WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADP I+VKPEMNTP
Subjt:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP

A0A1S3BPY4 transcription factor UNE12-like isoform X1 (e value: 3.1e-148; % identity: 81.39)
Query:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST
        MAGIPSEGLGDDFFEQ+L+VPP+YGG GGGD+VS+PMGLQLGSGGGGG  G       GGM MPLGLNLEQGFLRQERFREE+D  HN+N+ NNASSSST
Subjt:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST

Query:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN
        ASSGITERDSS+QHMT+LFPTFGHLQTQ LRPPPPLHLHQ FHNQT  GTV AVPQPPQVRPRVRARRGQATDPHSIAER                    
Subjt:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN

Query:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK
                       LRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESG+NQQAWEK
Subjt:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK

Query:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP
        WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADP I+VKPEMNTP
Subjt:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP

A0A5D3CG28 Transcription factor UNE12-like isoform X1 (e value: 3.1e-148; % identity: 81.39)
Query:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST
        MAGIPSEGLGDDFFEQ+L+VPP+YGG GGGD+VS+PMGLQLGSGGGGG  G       GGM MPLGLNLEQGFLRQERFREE+D  HN+N+ NNASSSST
Subjt:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELD-AHNNNTTNNASSSST

Query:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN
        ASSGITERDSS+QHMT+LFPTFGHLQTQ LRPPPPLHLHQ FHNQT  GTV AVPQPPQVRPRVRARRGQATDPHSIAER                    
Subjt:  ASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKIN

Query:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK
                       LRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESG+NQQAWEK
Subjt:  GLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEK

Query:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP
        WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADP I+VKPEMNTP
Subjt:  WSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP

A0A6J1HMZ7 transcription factor UNE12-like isoform X1 (e value: 1.0e-143; % identity: 80)
Query:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGM-GLQ-----GGMAMPLGLNLEQGFLRQERFREELDAHN---NNTT
        MAGIPSEGLGDDFFEQIL+V P Y  G  GDVVS+PMGLQLGSGGGGG  GLRG+ GLQ     GGM MPLGLNLEQGFLRQERFR+EL+ +N   NNTT
Subjt:  MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGM-GLQ-----GGMAMPLGLNLEQGFLRQERFREELDAHN---NNTT

Query:  NNASSSSTASSGITERDSSVQHMTSLFPTFGHL-QTQPLR-PPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNI
        NNASSSSTASSGITERDSSVQHM +LFP+FGHL QTQ LR PPPPLHLHQ FHNQT PG V AVPQP Q+RPRVRARRGQATDPHSIAER          
Subjt:  NNASSSSTASSGITERDSSVQHMTSLFPTFGHL-QTQPLR-PPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNI

Query:  SGDLTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIE
                                 LRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIE
Subjt:  SGDLTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIE

Query:  SGSNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP
        SG+NQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPI+VKPEMNTP
Subjt:  SGSNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP

SwissProt top hits (e value, % identity, alignment)
D0PX88 bHLH transcription factor RHL1 (e value: 4.6e-40; % identity: 54.59)
Query:  QVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQV
        Q +PRVRARRGQATDPHSIAER                                   LRRERIAERMKALQELVP+ NKTD+A+MLDEI+DYVKFL+LQV
Subjt:  QVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQV

Query:  KVLSMSRLGGAGAVAQLVADVPLSSVEGEG-------------IESGSNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAI
        KVLSMSRLGGA AVA LVAD+      G G               S +   +    S   TE QVAKLMEED+G+AMQ+LQ K LC+MPISLA+AI
Subjt:  KVLSMSRLGGAGAVAQLVADVPLSSVEGEG-------------IESGSNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAI

O22768 Transcription factor UNE12 (e value: 2.9e-63; % identity: 50.14)
Query:  DDFFEQILSVP-----------PAYGGGGGGDVVSMPMGLQLGSGGGGGD-GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQERFREELDAHNNNTTN
        DDFFEQIL +P              GG GGG   + PM LQLGSG  G   GGL G G  G      PLGL+L+Q    GFLR E         +++  +
Subjt:  DDFFEQILSVP-----------PAYGGGGGGDVVSMPMGLQLGSGGGGGD-GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQERFREELDAHNNNTTN

Query:  NASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGD
        N  S                   S+ P F     QP++ PPP   H                QP  +RPRVRARRGQATDPHSIAER             
Subjt:  NASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGD

Query:  LTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG
                              LRRERIAER++ALQELVP+ NKTDRAAM+DEIVDYVKFLRLQVKVLSMSRLGGAGAVA LV D+PL SSVE E  E G
Subjt:  LTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG

Query:  -SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP
         + Q AWEKWS+DGTE+QVAKLMEE+VGAAMQ LQSKALC+MPISLA AI+ +   D   +VKPE N P
Subjt:  -SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP

Q8S3D5 Transcription factor LRL2 (e value: 1.9e-38; % identity: 48.13)
Query:  PLHLHQSFHNQ-------TTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKINGLLLSLPIGFEMEPYLRRERIAERMKA
        P HL Q    Q       T   T       PQ +P+VRARRGQATDPHSIAER                                   LRRERIAERMK+
Subjt:  PLHLHQSFHNQ-------TTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKINGLLLSLPIGFEMEPYLRRERIAERMKA

Query:  LQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQS
        LQELVP+ NKTD+A+MLDEI+DYVKFL+LQVKVLSMSRLGGA + +  +++             GS++       +  TE QVAKLMEED+G+AMQ+LQ 
Subjt:  LQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQS

Query:  KALCIMPISLASAI
        K LC+MPISLA+ I
Subjt:  KALCIMPISLASAI

Q93Y00 Transcription factor bHLH7 (e value: 2.2e-58; % identity: 47.97)
Query:  DDFFEQILSVPPAYGGGGG-----GDVVSMPMGLQLGSGGGGGD---GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQE----RFREELDAHNNNTTN
        DDFFEQIL +    G  G      G V   PM LQLGSG  G     G + G G  G      PLGL+L+Q    GFL+ +    RF++++         
Subjt:  DDFFEQILSVPPAYGGGGG-----GDVVSMPMGLQLGSGGGGGD---GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQE----RFREELDAHNNNTTN

Query:  NASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGD
                       D+    M  +F   G   +QP  P P        H Q+T            +RPRVRARRGQATDPHSIAER             
Subjt:  NASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGD

Query:  LTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG
                              LRRERIAER+++LQELVP+ NKTDRAAM+DEIVDYVKFLRLQVKVLSMSRLGGAGAVA LV ++PL SSVE E     
Subjt:  LTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG

Query:  SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQAD-PPIIVKPEMNTP
          Q  WEKWS+DGTE+QVAKLMEE+VGAAMQ LQSKALCIMPISLA AI+ +   D    IVKPEMN P
Subjt:  SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQAD-PPIIVKPEMNTP

Q9ZUG9 Transcription factor LRL1 (e value: 1.6e-40; % identity: 51.61)
Query:  QTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKINGLLLSLPIGFEMEPYLRRERIAERM
        QTQP          Q+  +  T GTV A   PPQ R ++RARRGQATDPHSIAER                                   LRRERIAERM
Subjt:  QTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKINGLLLSLPIGFEMEPYLRRERIAERM

Query:  KALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEKWSS-DGTEQQVAKLMEEDVGAAMQF
        KALQELVP+ NKTD+A+MLDEI+DYVKFL+LQVKVLSMSRLGGA +V+  +++   S         G +Q A     S   TE QVAKLMEED+G+AMQ+
Subjt:  KALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEKWSS-DGTEQQVAKLMEEDVGAAMQF

Query:  LQSKALCIMPISLASAI
        LQ K LC+MPISLA+AI
Subjt:  LQSKALCIMPISLASAI

Arabidopsis top hits (e value, % identity, alignment)
AT1G03040.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 1.5e-59; % identity: 47.97)
Query:  DDFFEQILSVPPAYGGGGG-----GDVVSMPMGLQLGSGGGGGD---GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQE----RFREELDAHNNNTTN
        DDFFEQIL +    G  G      G V   PM LQLGSG  G     G + G G  G      PLGL+L+Q    GFL+ +    RF++++         
Subjt:  DDFFEQILSVPPAYGGGGG-----GDVVSMPMGLQLGSGGGGGD---GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQE----RFREELDAHNNNTTN

Query:  NASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGD
                       D+    M  +F   G   +QP  P P        H Q+T            +RPRVRARRGQATDPHSIAER             
Subjt:  NASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGD

Query:  LTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG
                              LRRERIAER+++LQELVP+ NKTDRAAM+DEIVDYVKFLRLQVKVLSMSRLGGAGAVA LV ++PL SSVE E     
Subjt:  LTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG

Query:  SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQAD-PPIIVKPEMNTP
          Q  WEKWS+DGTE+QVAKLMEE+VGAAMQ LQSKALCIMPISLA AI+ +   D    IVKPEMN P
Subjt:  SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQAD-PPIIVKPEMNTP

AT1G03040.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 1.7e-58; % identity: 47.7)
Query:  DDFFEQILSVPPAYGGGGG-----GDVVSMPMGLQLGSGGGGGD---GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQE----RFREELDAHNNNTTN
        DDFFEQIL +    G  G      G V   PM LQLGSG  G     G + G G  G      PLGL+L+Q    GFL+ +    RF++++         
Subjt:  DDFFEQILSVPPAYGGGGG-----GDVVSMPMGLQLGSGGGGGD---GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQE----RFREELDAHNNNTTN

Query:  NASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGD
                   +  R SS++   S             +P PP+      H Q+T            +RPRVRARRGQATDPHSIAER             
Subjt:  NASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGD

Query:  LTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG
                              LRRERIAER+++LQELVP+ NKTDRAAM+DEIVDYVKFLRLQVKVLSMSRLGGAGAVA LV ++PL SSVE E     
Subjt:  LTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG

Query:  SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQAD-PPIIVKPEMNTP
          Q  WEKWS+DGTE+QVAKLMEE+VGAAMQ LQSKALCIMPISLA AI+ +   D    IVKPEMN P
Subjt:  SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQAD-PPIIVKPEMNTP

AT4G02590.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 2.1e-64; % identity: 50.14)
Query:  DDFFEQILSVP-----------PAYGGGGGGDVVSMPMGLQLGSGGGGGD-GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQERFREELDAHNNNTTN
        DDFFEQIL +P              GG GGG   + PM LQLGSG  G   GGL G G  G      PLGL+L+Q    GFLR E         +++  +
Subjt:  DDFFEQILSVP-----------PAYGGGGGGDVVSMPMGLQLGSGGGGGD-GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQERFREELDAHNNNTTN

Query:  NASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGD
        N  S                   S+ P F     QP++ PPP   H                QP  +RPRVRARRGQATDPHSIAER             
Subjt:  NASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGD

Query:  LTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG
                              LRRERIAER++ALQELVP+ NKTDRAAM+DEIVDYVKFLRLQVKVLSMSRLGGAGAVA LV D+PL SSVE E  E G
Subjt:  LTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG

Query:  -SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP
         + Q AWEKWS+DGTE+QVAKLMEE+VGAAMQ LQSKALC+MPISLA AI+ +   D   +VKPE N P
Subjt:  -SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP

AT4G02590.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 2.1e-64; % identity: 50.14)
Query:  DDFFEQILSVP-----------PAYGGGGGGDVVSMPMGLQLGSGGGGGD-GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQERFREELDAHNNNTTN
        DDFFEQIL +P              GG GGG   + PM LQLGSG  G   GGL G G  G      PLGL+L+Q    GFLR E         +++  +
Subjt:  DDFFEQILSVP-----------PAYGGGGGGDVVSMPMGLQLGSGGGGGD-GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQERFREELDAHNNNTTN

Query:  NASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGD
        N  S                   S+ P F     QP++ PPP   H                QP  +RPRVRARRGQATDPHSIAER             
Subjt:  NASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGD

Query:  LTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG
                              LRRERIAER++ALQELVP+ NKTDRAAM+DEIVDYVKFLRLQVKVLSMSRLGGAGAVA LV D+PL SSVE E  E G
Subjt:  LTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG

Query:  -SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP
         + Q AWEKWS+DGTE+QVAKLMEE+VGAAMQ LQSKALC+MPISLA AI+ +   D   +VKPE N P
Subjt:  -SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIFRTHQADPPIIVKPEMNTP

AT4G02590.3 basic helix-loop-helix (bHLH) DNA-binding superfamily protein (e value: 5.3e-60; % identity: 50.94)
Query:  GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQERFREELDAHNNNTTNNASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHN
        GGL G G  G      PLGL+L+Q    GFLR E         +++  +N  S                   S+ P F     QP++ PPP   H     
Subjt:  GGLRGMGLQG--GMAMPLGLNLEQ----GFLRQERFREELDAHNNNTTNNASSSSTASSGITERDSSVQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHN

Query:  QTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAML
                   QP  +RPRVRARRGQATDPHSIAER                                   LRRERIAER++ALQELVP+ NKTDRAAM+
Subjt:  QTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKINGLLLSLPIGFEMEPYLRRERIAERMKALQELVPSCNKTDRAAML

Query:  DEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG-SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIF
        DEIVDYVKFLRLQVKVLSMSRLGGAGAVA LV D+PL SSVE E  E G + Q AWEKWS+DGTE+QVAKLMEE+VGAAMQ LQSKALC+MPISLA AI+
Subjt:  DEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPL-SSVEGEGIESG-SNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALCIMPISLASAIF

Query:  RTHQADPPIIVKPEMNTP
         +   D   +VKPE N P
Subjt:  RTHQADPPIIVKPEMNTP


Sequences
CDS sequence
ATGGCCGGAATCCCCTCTGAGGGGTTGGGGGATGATTTCTTCGAGCAGATTCTGTCGGTGCCGCCGGCTTACGGCGGCGGAGGTGGCGGCGATGTGGTGTCTATGCCGAT
GGGGTTGCAACTTGGGTCCGGCGGCGGCGGCGGCGACGGTGGGTTGAGAGGGATGGGGTTGCAAGGAGGGATGGCTATGCCTTTGGGGCTGAATTTAGAACAAGGGTTTT
TGAGACAAGAGAGGTTTAGAGAAGAGCTTGATGCGCATAATAATAATACCACTAATAATGCTTCCTCTTCTTCAACTGCTTCTTCAGGAATCACTGAGAGAGATTCTTCG
GTGCAGCACATGACCAGCTTGTTTCCGACCTTTGGACATTTGCAGACTCAACCACTCCGGCCACCGCCGCCCTTACATCTCCACCAGTCCTTTCATAATCAGACAACTCC
GGGGACGGTTGTTGCAGTACCACAACCACCGCAAGTTCGTCCAAGAGTTCGAGCAAGACGAGGGCAAGCGACTGATCCTCACAGTATTGCAGAAAGGGTAATCTCATCTT
TGAACTCTAAGAATATAAGTGGAGACCTCACTACTTATAAGATAAATGGGTTACTCCTCTCATTACCAATTGGTTTTGAGATGGAACCCTATTTGCGTCGAGAAAGAATT
GCAGAAAGAATGAAGGCCTTGCAAGAGTTGGTTCCTAGTTGCAATAAGACTGATAGGGCGGCAATGCTCGACGAAATTGTGGATTATGTAAAGTTTCTTAGGCTTCAAGT
TAAGGTTTTAAGCATGAGCAGACTAGGCGGGGCGGGTGCGGTGGCTCAACTTGTGGCCGATGTGCCCCTATCATCAGTCGAGGGAGAAGGCATTGAGAGTGGCAGCAATC
AGCAAGCATGGGAAAAGTGGTCAAGTGATGGCACAGAGCAACAAGTAGCCAAATTGATGGAAGAAGATGTGGGAGCTGCCATGCAATTTCTACAATCTAAAGCTCTATGC
ATTATGCCCATCTCATTAGCCTCAGCAATTTTCAGGACACACCAAGCAGATCCACCCATCATAGTCAAGCCAGAAATGAACACTCCCTAG
mRNA sequence
ATGGCCGGAATCCCCTCTGAGGGGTTGGGGGATGATTTCTTCGAGCAGATTCTGTCGGTGCCGCCGGCTTACGGCGGCGGAGGTGGCGGCGATGTGGTGTCTATGCCGAT
GGGGTTGCAACTTGGGTCCGGCGGCGGCGGCGGCGACGGTGGGTTGAGAGGGATGGGGTTGCAAGGAGGGATGGCTATGCCTTTGGGGCTGAATTTAGAACAAGGGTTTT
TGAGACAAGAGAGGTTTAGAGAAGAGCTTGATGCGCATAATAATAATACCACTAATAATGCTTCCTCTTCTTCAACTGCTTCTTCAGGAATCACTGAGAGAGATTCTTCG
GTGCAGCACATGACCAGCTTGTTTCCGACCTTTGGACATTTGCAGACTCAACCACTCCGGCCACCGCCGCCCTTACATCTCCACCAGTCCTTTCATAATCAGACAACTCC
GGGGACGGTTGTTGCAGTACCACAACCACCGCAAGTTCGTCCAAGAGTTCGAGCAAGACGAGGGCAAGCGACTGATCCTCACAGTATTGCAGAAAGGGTAATCTCATCTT
TGAACTCTAAGAATATAAGTGGAGACCTCACTACTTATAAGATAAATGGGTTACTCCTCTCATTACCAATTGGTTTTGAGATGGAACCCTATTTGCGTCGAGAAAGAATT
GCAGAAAGAATGAAGGCCTTGCAAGAGTTGGTTCCTAGTTGCAATAAGACTGATAGGGCGGCAATGCTCGACGAAATTGTGGATTATGTAAAGTTTCTTAGGCTTCAAGT
TAAGGTTTTAAGCATGAGCAGACTAGGCGGGGCGGGTGCGGTGGCTCAACTTGTGGCCGATGTGCCCCTATCATCAGTCGAGGGAGAAGGCATTGAGAGTGGCAGCAATC
AGCAAGCATGGGAAAAGTGGTCAAGTGATGGCACAGAGCAACAAGTAGCCAAATTGATGGAAGAAGATGTGGGAGCTGCCATGCAATTTCTACAATCTAAAGCTCTATGC
ATTATGCCCATCTCATTAGCCTCAGCAATTTTCAGGACACACCAAGCAGATCCACCCATCATAGTCAAGCCAGAAATGAACACTCCCTAG
Protein sequence
MAGIPSEGLGDDFFEQILSVPPAYGGGGGGDVVSMPMGLQLGSGGGGGDGGLRGMGLQGGMAMPLGLNLEQGFLRQERFREELDAHNNNTTNNASSSSTASSGITERDSS
VQHMTSLFPTFGHLQTQPLRPPPPLHLHQSFHNQTTPGTVVAVPQPPQVRPRVRARRGQATDPHSIAERVISSLNSKNISGDLTTYKINGLLLSLPIGFEMEPYLRRERI
AERMKALQELVPSCNKTDRAAMLDEIVDYVKFLRLQVKVLSMSRLGGAGAVAQLVADVPLSSVEGEGIESGSNQQAWEKWSSDGTEQQVAKLMEEDVGAAMQFLQSKALC
IMPISLASAIFRTHQADPPIIVKPEMNTP