CuGenDBv2

Lag0031388 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0031388
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: transcription factor bHLH68-like isoform X2
Genome location: chr11:7824915..7828596
RNA-Seq Expression: Lag0031388
Synteny: Lag0031388
Gene Ontology terms: GO:0046983 - protein dimerization activity (molecular function)
InterPro domains: IPR011598 - Myc-type, basic helix-loop-helix (bHLH) domain
                  IPR036638 - Helix-loop-helix DNA-binding domain superfamily
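
The genome location field above follows a chrom:start..end pattern. A minimal sketch of how such a string could be split into its parts is shown below; the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:7824915..7828596")
print(chrom, start, end, span_length(start, end))
```

Under that coordinate assumption the gene spans 3682 bp on chr11; with a 0-based, half-open convention the length would be one less.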


Homology
GenBank top hits (e value, % identity, alignment)
XP_022153548.1 transcription factor bHLH68-like isoform X1 [Momordica charantia] | e value: 2.9e-126 | % identity: 71.54
Query:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN---QEQPNSQSWSQLLLGGLQE-GDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF
        MM GNPSNWW+M PP+ LS PQFVLGSS LPL SSMADHHN     +PNSQSWSQLL+GGLQE GD ++RL LNS   NNF+ KK E LEGRILIPFPRF
Subjt:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN---QEQPNSQSWSQLLLGGLQE-GDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF

Query:  GVGDDDDDHNDDDHHQVLKQES-----SKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAAT
        GVG   D         VLKQES     SK+LSFLWN  E SSSSS A A S    ++ S+S  S + SN ILDFSF+KVDSKNQI    YSSECASTAAT
Subjt:  GVGDDDDDHNDDDHHQVLKQES-----SKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAAT

Query:  -GGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEG
         GGVCKK RVQP SGQPP+KVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLG   NS+KPTKKEQ RTLNDGRNLR+VEG
Subjt:  -GGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEG

Query:  ENGRNCVFPEDPGQ-------------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF
        E GR+CVFPEDPGQ                   +  E  KDLRSRGLCLVPVSCTQ VQSDINGADYWAQAYNGSF
Subjt:  ENGRNCVFPEDPGQ-------------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF

XP_022153549.1 transcription factor bHLH68-like isoform X2 [Momordica charantia] | e value: 7.7e-127 | % identity: 72.51
Query:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN---QEQPNSQSWSQLLLGGLQE-GDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF
        MM GNPSNWW+M PP+ LS PQFVLGSS LPL SSMADHHN     +PNSQSWSQLL+GGLQE GD ++RL LNS   NNF+ KK E LEGRILIPFPRF
Subjt:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN---QEQPNSQSWSQLLLGGLQE-GDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF

Query:  GVGDDDDDHNDDDHHQVLKQES-----SKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAAT
        GVG   D         VLKQES     SK+LSFLWN  E SSSSS A A S    ++ S+S  S + SN ILDFSF+KVDSKNQI    YSSECASTAAT
Subjt:  GVGDDDDDHNDDDHHQVLKQES-----SKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAAT

Query:  -GGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEG
         GGVCKK RVQP SGQPP+KVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLG   NS+KPTKKEQ RTLNDGRNLR+VEG
Subjt:  -GGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEG

Query:  ENGRNCVFPEDPGQ--------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF
        E GR+CVFPEDPGQ              +  E  KDLRSRGLCLVPVSCTQ VQSDINGADYWAQAYNGSF
Subjt:  ENGRNCVFPEDPGQ--------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF

XP_022153550.1 transcription factor bHLH68-like isoform X3 [Momordica charantia] | e value: 2.9e-126 | % identity: 71.54
Query:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN---QEQPNSQSWSQLLLGGLQE-GDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF
        MM GNPSNWW+M PP+ LS PQFVLGSS LPL SSMADHHN     +PNSQSWSQLL+GGLQE GD ++RL LNS   NNF+ KK E LEGRILIPFPRF
Subjt:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN---QEQPNSQSWSQLLLGGLQE-GDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF

Query:  GVGDDDDDHNDDDHHQVLKQES-----SKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAAT
        GVG   D         VLKQES     SK+LSFLWN  E SSSSS A A S    ++ S+S  S + SN ILDFSF+KVDSKNQI    YSSECASTAAT
Subjt:  GVGDDDDDHNDDDHHQVLKQES-----SKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAAT

Query:  -GGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEG
         GGVCKK RVQP SGQPP+KVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLG   NS+KPTKKEQ RTLNDGRNLR+VEG
Subjt:  -GGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEG

Query:  ENGRNCVFPEDPGQ-------------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF
        E GR+CVFPEDPGQ                   +  E  KDLRSRGLCLVPVSCTQ VQSDINGADYWAQAYNGSF
Subjt:  ENGRNCVFPEDPGQ-------------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF

XP_023524875.1 transcription factor bHLH68-like isoform X2 [Cucurbita pepo subsp. pepo] | e value: 1.5e-125 | % identity: 72.75
Query:  MMAGNPSNWWNMFPPSQL----------SPPQFVLGSSSLPLSSSMADHHNQEQPNSQSWSQLLLGGLQEGDDQDRLGLNSINNNNFQPKKMENLEGRIL
        MM GNPSNWWNMFPP+ L           PPQFV+GSSSLP  +SMADH NQE PNSQSWSQLLLGGLQEGD  D L LNS N N+FQPKK+ENLEGRIL
Subjt:  MMAGNPSNWWNMFPPSQL----------SPPQFVLGSSSLPLSSSMADHHNQEQPNSQSWSQLLLGGLQEGDDQDRLGLNSINNNNFQPKKMENLEGRIL

Query:  IPFPRFGVGDDDDDHNDDDHHQVLKQ----ESSKSLSFLWNEKESSSSSSAAKACSQTR------PATCSSSPKSSVNSNGILDFSFNKVDSKNQI----
        IPFPRFGVGDDD+D  DDDH  VLKQ    +S KSLSFLWNEKES SSSS +   SQTR      P T SSSPKSSVNSN IL+FSFN++DS NQ     
Subjt:  IPFPRFGVGDDDDDHNDDDHHQVLKQ----ESSKSLSFLWNEKESSSSSSAAKACSQTR------PATCSSSPKSSVNSNGILDFSFNKVDSKNQI----

Query:  YSSECASTAATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLN
        YSSEC STAATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFL +QIEALISPYLG  NNS++PT+ ++L   N
Subjt:  YSSECASTAATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLN

Query:  DGRNLRQVEGENGRNCVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF
        D    R++             P QE EEEA +LRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF
Subjt:  DGRNLRQVEGENGRNCVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF

XP_038895281.1 transcription factor bHLH68-like isoform X1 [Benincasa hispida] | e value: 1.7e-126 | % identity: 73.14
Query:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHNQEQPNSQSWSQLLLGGLQEGD-DQDRLGLNSINNNNFQPKKMENLEGRILIPFPRFGVG
        MM+GNPSNWWNMFPP+    PQFV+GSSSLPL SSMA HH+Q+  NSQSWSQL+LGGLQ+GD DQDRLGL    NN FQPKK+ENLEG+ILIPF RFGV 
Subjt:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHNQEQPNSQSWSQLLLGGLQEGD-DQDRLGLNSINNNNFQPKKMENLEGRILIPFPRFGVG

Query:  DDDDDHNDDDHHQVLKQE-----SSKSLSFLW---NEKESSSSSSAAKACSQTR-PATCSSSPKSSVNS-NGILDFSFNKVDSKNQI----YSSECASTA
        D DD   +D+H   LKQE     +SKSL FLW   NEKE SSSSS++ A  +TR P T +SS KSSVNS N ILDFSFNK+ SKNQI    YSSECASTA
Subjt:  DDDDDHNDDDHHQVLKQE-----SSKSLSFLW---NEKESSSSSSAAKACSQTR-PATCSSSPKSSVNS-NGILDFSFNKVDSKNQI----YSSECASTA

Query:  ATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRN-LRQV
         TGGV KK RVQP SGQPPIKVRKEKVGDRIT LHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGN++ S K  KK+QLRTLNDGRN LR+V
Subjt:  ATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRN-LRQV

Query:  EGE-NGRNCVFPEDPGQ----------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF
        EGE NG +CVF EDPGQ                E EEEAKDLR RGLCLVPVSCTQHVQSDINGADYWAQAYNG+F
Subjt:  EGE-NGRNCVFPEDPGQ----------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF

TrEMBL top hits (e value, % identity, alignment)
A0A6J1DH49 transcription factor bHLH68-like isoform X3 | e value: 1.4e-126 | % identity: 71.54
Query:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN---QEQPNSQSWSQLLLGGLQE-GDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF
        MM GNPSNWW+M PP+ LS PQFVLGSS LPL SSMADHHN     +PNSQSWSQLL+GGLQE GD ++RL LNS   NNF+ KK E LEGRILIPFPRF
Subjt:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN---QEQPNSQSWSQLLLGGLQE-GDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF

Query:  GVGDDDDDHNDDDHHQVLKQES-----SKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAAT
        GVG   D         VLKQES     SK+LSFLWN  E SSSSS A A S    ++ S+S  S + SN ILDFSF+KVDSKNQI    YSSECASTAAT
Subjt:  GVGDDDDDHNDDDHHQVLKQES-----SKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAAT

Query:  -GGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEG
         GGVCKK RVQP SGQPP+KVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLG   NS+KPTKKEQ RTLNDGRNLR+VEG
Subjt:  -GGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEG

Query:  ENGRNCVFPEDPGQ-------------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF
        E GR+CVFPEDPGQ                   +  E  KDLRSRGLCLVPVSCTQ VQSDINGADYWAQAYNGSF
Subjt:  ENGRNCVFPEDPGQ-------------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF

A0A6J1DHR8 transcription factor bHLH68-like isoform X1 | e value: 1.4e-126 | % identity: 71.54
Query:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN---QEQPNSQSWSQLLLGGLQE-GDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF
        MM GNPSNWW+M PP+ LS PQFVLGSS LPL SSMADHHN     +PNSQSWSQLL+GGLQE GD ++RL LNS   NNF+ KK E LEGRILIPFPRF
Subjt:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN---QEQPNSQSWSQLLLGGLQE-GDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF

Query:  GVGDDDDDHNDDDHHQVLKQES-----SKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAAT
        GVG   D         VLKQES     SK+LSFLWN  E SSSSS A A S    ++ S+S  S + SN ILDFSF+KVDSKNQI    YSSECASTAAT
Subjt:  GVGDDDDDHNDDDHHQVLKQES-----SKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAAT

Query:  -GGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEG
         GGVCKK RVQP SGQPP+KVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLG   NS+KPTKKEQ RTLNDGRNLR+VEG
Subjt:  -GGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEG

Query:  ENGRNCVFPEDPGQ-------------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF
        E GR+CVFPEDPGQ                   +  E  KDLRSRGLCLVPVSCTQ VQSDINGADYWAQAYNGSF
Subjt:  ENGRNCVFPEDPGQ-------------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF

A0A6J1DJE0 transcription factor bHLH68-like isoform X2 | e value: 3.7e-127 | % identity: 72.51
Query:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN---QEQPNSQSWSQLLLGGLQE-GDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF
        MM GNPSNWW+M PP+ LS PQFVLGSS LPL SSMADHHN     +PNSQSWSQLL+GGLQE GD ++RL LNS   NNF+ KK E LEGRILIPFPRF
Subjt:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN---QEQPNSQSWSQLLLGGLQE-GDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF

Query:  GVGDDDDDHNDDDHHQVLKQES-----SKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAAT
        GVG   D         VLKQES     SK+LSFLWN  E SSSSS A A S    ++ S+S  S + SN ILDFSF+KVDSKNQI    YSSECASTAAT
Subjt:  GVGDDDDDHNDDDHHQVLKQES-----SKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAAT

Query:  -GGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEG
         GGVCKK RVQP SGQPP+KVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLG   NS+KPTKKEQ RTLNDGRNLR+VEG
Subjt:  -GGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEG

Query:  ENGRNCVFPEDPGQ--------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF
        E GR+CVFPEDPGQ              +  E  KDLRSRGLCLVPVSCTQ VQSDINGADYWAQAYNGSF
Subjt:  ENGRNCVFPEDPGQ--------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF

A0A6J1GA87 transcription factor bHLH68-like isoform X2 | e value: 1.2e-125 | % identity: 74.02
Query:  MMAGNPSNWWNMFPPSQL------SPPQFVLGSSSLPLSSSMADHHNQEQPNSQSWSQLLLGGLQEGDDQDRLGLNSINNNNFQPKKMENLEGRILIPFP
        MM GNPSNWWNMFPP+ L       PPQFV+GSSSLP  +SMADH NQE PNSQSWSQLLLGGLQEGD  D L LNS N N+FQPKK++NLEGRILIPFP
Subjt:  MMAGNPSNWWNMFPPSQL------SPPQFVLGSSSLPLSSSMADHHNQEQPNSQSWSQLLLGGLQEGDDQDRLGLNSINNNNFQPKKMENLEGRILIPFP

Query:  RFGVGDDDDDHNDDDHHQVLKQ----ESSKSLSFLWNEKESSSSSSAAKACSQTR-PATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTA
        RFGVGDDD    DDDH  VLKQ    +S KSLSFLWNEKES SSSS +   SQTR P T SSSPKSSVNSN IL+FSFNK+DS NQ     YSSEC STA
Subjt:  RFGVGDDDDDHNDDDHHQVLKQ----ESSKSLSFLWNEKESSSSSSAAKACSQTR-PATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTA

Query:  ATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVE
        ATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFL +QIEALISPYLG  NNS++PT+ +QL   ND    R++ 
Subjt:  ATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVE

Query:  GENGRNCVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF
                    P QE +EEA  LRSRGLCLVPVSCTQHVQSD+NGADYWAQAYNGSF
Subjt:  GENGRNCVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF

A0A6J1K6J4 transcription factor bHLH68-like isoform X2 | e value: 4.6e-125 | % identity: 74.65
Query:  MMAGNPSNWWNMFPPSQL----SPPQFVLGSSSLPLSSSMADHHNQEQPNSQSWSQLLLGGLQEGDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF
        MM GNPSNWWNMFPP+ L     PPQFV+GSSS P  +SMADH NQE PNSQSWSQLLLGGLQEG   D L LNS N N+FQPKK++NLEGRILIPFPRF
Subjt:  MMAGNPSNWWNMFPPSQL----SPPQFVLGSSSLPLSSSMADHHNQEQPNSQSWSQLLLGGLQEGDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRF

Query:  GVGDDDDDHNDDDHHQVLKQ----ESSKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAATG
        GVGDDD    DDDH  VLKQ    +S KSLSFLW+EKES SSSS +   SQTRP T SSSPKSSVNSN IL+FSFNK+DS NQI    YSSEC STAATG
Subjt:  GVGDDDDDHNDDDHHQVLKQ----ESSKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQI----YSSECASTAATG

Query:  GVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEGEN
        GVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFL +QIEALISPYLG  NNS++PT+ +QL   ND    R++    
Subjt:  GVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEGEN

Query:  GRNCVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF
                 P QE EEEA +LRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF
Subjt:  GRNCVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNGSF

SwissProt top hits (e value, % identity, alignment)
Q7XHI5 Transcription factor bHLH133 | e value: 4.7e-42 | % identity: 37.41
Query:  AGNPSNWWN-----MFPPSQL---SPPQFVLGSSSL-------PLSSSMADHH---NQEQPNSQSW--------------SQLLLGGLQEGDDQDRLGLN
        AGNP NWWN     + PP+ L    PP       SL       P SSS +          PN  SW              SQLLLGGL  G+++    +N
Subjt:  AGNPSNWWN-----MFPPSQL---SPPQFVLGSSSL-------PLSSSMADHH---NQEQPNSQSW--------------SQLLLGGLQEGDDQDRLGLN

Query:  SINNNN----FQPKKMENLEGRILIPFPRFGVGDDDDDHNDDDHHQVLKQESSKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFS
          ++ N    +Q K+++N E ++L                   H   +KQESS + S+      SS +S   K+C+            + +N+N      
Subjt:  SINNNN----FQPKKMENLEGRILIPFPRFGVGDDDDDHNDDDHHQVLKQESSKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFS

Query:  FNKVDSKNQIYS----SECASTAATGG--VCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNN
            D+ N I+S    SEC S+   G     KKP++Q  S Q  +KVRKEK+G RI +LHQLVSPFGKTDTASVLSEAIGY+RFL SQIEAL  PY G  
Subjt:  FNKVDSKNQIYS----SECASTAATGG--VCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNN

Query:  NNSAKPTKKEQLRTLNDGRNLRQVEGENGRNCVFPEDPGQ----------------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQA
        + +                N+     +   N +FPEDPGQ                       NEE  KDLRSRGLCLVP+SCT  V SD NGADYWA A
Subjt:  NNSAKPTKKEQLRTLNDGRNLRQVEGENGRNCVFPEDPGQ----------------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQA

Query:  Y
        +
Subjt:  Y

Q8GXT3 Transcription factor bHLH123 | e value: 1.5e-19 | % identity: 35.98
Query:  ESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQIYSSECASTAATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKT
        ++SS+   A    Q  P +    PK   N + I D S N+V      +              K+ + +  S  P  K RKEK+GDRI AL QLVSPFGKT
Subjt:  ESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQIYSSECASTAATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKT

Query:  DTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEGENGRNCVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSDI
        D ASVLSEAI Y++FL  Q+ AL +PY                  +  G +L+  + ++       E+P         DLRSRGLCLVPVS T  V  D 
Subjt:  DTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEGENGRNCVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSDI

Query:  NGADYWAQAYNGSF
           D+W   + G+F
Subjt:  NGADYWAQAYNGSF

Q8S3D1 Transcription factor bHLH68 | e value: 7.7e-45 | % identity: 37.59
Query:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN-------------------------------QEQPNSQSW-------------SQLLLG
        M AGNP NWWN+     + PP  ++G    PL   M  ++N                                  PN  SW             SQLLLG
Subjt:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN-------------------------------QEQPNSQSW-------------SQLLLG

Query:  GLQEGDDQDRLGLNSINNNN------FQPK-KMENLEGRILIPFPRFGVGDDDDDHNDDDHHQVLKQESSKSLSFLWNEKESSSSSSAAKACSQTRPATC
        GL  G+++    +N  N+++      FQ K ++EN E ++L       V  D            +KQE   +++       SS +S   K+C  T   T 
Subjt:  GLQEGDDQDRLGLNSINNNN------FQPK-KMENLEGRILIPFPRFGVGDDDDDHNDDDHHQVLKQESSKSLSFLWNEKESSSSSSAAKACSQTRPATC

Query:  SSSPKSSV-NSNGILDFSFNKVDSKNQIY-----------SSECASTAATGGVCKKPRVQP-VSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLS
         +S   ++ N+N +LDFS N     N ++           SSEC S    G   KKPR+QP  S Q  +KVRKEK+G RI ALHQLVSPFGKTDTASVLS
Subjt:  SSSPKSSV-NSNGILDFSFNKVDSKNQIY-----------SSECASTAATGGVCKKPRVQP-VSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLS

Query:  EAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEGENGRNCVFPEDPGQ---------------------ENEEEAKDLRSRGLC
        EAIGY+RFLQSQIEAL  PY G               T   G    Q   +  R+C+FPEDPGQ                      +EE  KDLRSRGLC
Subjt:  EAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEGENGRNCVFPEDPGQ---------------------ENEEEAKDLRSRGLC

Query:  LVPVSCTQHVQSDINGADYWAQA
        LVP+SCT  V SD NGADYWA A
Subjt:  LVPVSCTQHVQSDINGADYWAQA

Q9LT67 Transcription factor bHLH113 | e value: 2.2e-23 | % identity: 44.67
Query:  KKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEGENGRN
        K  R Q  S     KVRKE++G+RI AL QLVSP+GKTD ASVL EA+GY++FLQ QI+ L SPYL N++        + +  +                
Subjt:  KKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEGENGRN

Query:  CVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNG
                     +AKDLRSRGLCLVPVS T HV++  NGAD+W+ A  G
Subjt:  CVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNG

Q9SFZ3 Transcription factor bHLH110 | e value: 1.0e-28 | % identity: 52.74
Query:  STAATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLR
        +T A     KKPRV+  S  PP KVRKEK+GDRI AL QLVSPFGKTDTASVL EAIGY++FLQSQIE L  PY+  + N  +P K  QL +       +
Subjt:  STAATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLR

Query:  QVEGENGRNCVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSD
          EG+               EEE +DLRSRGLCLVP+SC  +V  D
Subjt:  QVEGENGRNCVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSD

Arabidopsis top hits (e value, % identity, alignment)
AT1G27660.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 7.2e-30 | % identity: 52.74
Query:  STAATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLR
        +T A     KKPRV+  S  PP KVRKEK+GDRI AL QLVSPFGKTDTASVL EAIGY++FLQSQIE L  PY+  + N  +P K  QL +       +
Subjt:  STAATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLR

Query:  QVEGENGRNCVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSD
          EG+               EEE +DLRSRGLCLVP+SC  +V  D
Subjt:  QVEGENGRNCVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSD

AT2G20100.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 3.3e-43 | % identity: 37.41
Query:  AGNPSNWWN-----MFPPSQL---SPPQFVLGSSSL-------PLSSSMADHH---NQEQPNSQSW--------------SQLLLGGLQEGDDQDRLGLN
        AGNP NWWN     + PP+ L    PP       SL       P SSS +          PN  SW              SQLLLGGL  G+++    +N
Subjt:  AGNPSNWWN-----MFPPSQL---SPPQFVLGSSSL-------PLSSSMADHH---NQEQPNSQSW--------------SQLLLGGLQEGDDQDRLGLN

Query:  SINNNN----FQPKKMENLEGRILIPFPRFGVGDDDDDHNDDDHHQVLKQESSKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFS
          ++ N    +Q K+++N E ++L                   H   +KQESS + S+      SS +S   K+C+            + +N+N      
Subjt:  SINNNN----FQPKKMENLEGRILIPFPRFGVGDDDDDHNDDDHHQVLKQESSKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFS

Query:  FNKVDSKNQIYS----SECASTAATGG--VCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNN
            D+ N I+S    SEC S+   G     KKP++Q  S Q  +KVRKEK+G RI +LHQLVSPFGKTDTASVLSEAIGY+RFL SQIEAL  PY G  
Subjt:  FNKVDSKNQIYS----SECASTAATGG--VCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNN

Query:  NNSAKPTKKEQLRTLNDGRNLRQVEGENGRNCVFPEDPGQ----------------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQA
        + +                N+     +   N +FPEDPGQ                       NEE  KDLRSRGLCLVP+SCT  V SD NGADYWA A
Subjt:  NNSAKPTKKEQLRTLNDGRNLRQVEGENGRNCVFPEDPGQ----------------------ENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQA

Query:  Y
        +
Subjt:  Y

AT2G20100.2 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 5.5e-30 | % identity: 35.59
Query:  AGNPSNWWN-----MFPPSQL---SPPQFVLGSSSL-------PLSSSMADHH---NQEQPNSQSW--------------SQLLLGGLQEGDDQDRLGLN
        AGNP NWWN     + PP+ L    PP       SL       P SSS +          PN  SW              SQLLLGGL  G+++    +N
Subjt:  AGNPSNWWN-----MFPPSQL---SPPQFVLGSSSL-------PLSSSMADHH---NQEQPNSQSW--------------SQLLLGGLQEGDDQDRLGLN

Query:  SINNNN----FQPKKMENLEGRILIPFPRFGVGDDDDDHNDDDHHQVLKQESSKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFS
          ++ N    +Q K+++N E ++L                   H   +KQESS + S+      SS +S   K+C+            + +N+N      
Subjt:  SINNNN----FQPKKMENLEGRILIPFPRFGVGDDDDDHNDDDHHQVLKQESSKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFS

Query:  FNKVDSKNQIYS----SECASTAATGG--VCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNN
            D+ N I+S    SEC S+   G     KKP++Q  S Q  +KVRKEK+G RI +LHQLVSPFGKTDTASVLSEAIGY+RFL SQIEAL  PY G  
Subjt:  FNKVDSKNQIYS----SECASTAATGG--VCKKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNN

Query:  NNSAKPTKKEQLRTLNDGRNLRQVEGENGRNCVFPEDPGQ
        + +                N+     +   N +FPEDPGQ
Subjt:  NNSAKPTKKEQLRTLNDGRNLRQVEGENGRNCVFPEDPGQ

AT3G19500.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 1.5e-24 | % identity: 44.67
Query:  KKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEGENGRN
        K  R Q  S     KVRKE++G+RI AL QLVSP+GKTD ASVL EA+GY++FLQ QI+ L SPYL N++        + +  +                
Subjt:  KKPRVQPVSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEGENGRN

Query:  CVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNG
                     +AKDLRSRGLCLVPVS T HV++  NGAD+W+ A  G
Subjt:  CVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSDINGADYWAQAYNG

AT4G29100.1 basic helix-loop-helix (bHLH) DNA-binding superfamily protein | e value: 5.5e-46 | % identity: 37.59
Query:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN-------------------------------QEQPNSQSW-------------SQLLLG
        M AGNP NWWN+     + PP  ++G    PL   M  ++N                                  PN  SW             SQLLLG
Subjt:  MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHN-------------------------------QEQPNSQSW-------------SQLLLG

Query:  GLQEGDDQDRLGLNSINNNN------FQPK-KMENLEGRILIPFPRFGVGDDDDDHNDDDHHQVLKQESSKSLSFLWNEKESSSSSSAAKACSQTRPATC
        GL  G+++    +N  N+++      FQ K ++EN E ++L       V  D            +KQE   +++       SS +S   K+C  T   T 
Subjt:  GLQEGDDQDRLGLNSINNNN------FQPK-KMENLEGRILIPFPRFGVGDDDDDHNDDDHHQVLKQESSKSLSFLWNEKESSSSSSAAKACSQTRPATC

Query:  SSSPKSSV-NSNGILDFSFNKVDSKNQIY-----------SSECASTAATGGVCKKPRVQP-VSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLS
         +S   ++ N+N +LDFS N     N ++           SSEC S    G   KKPR+QP  S Q  +KVRKEK+G RI ALHQLVSPFGKTDTASVLS
Subjt:  SSSPKSSV-NSNGILDFSFNKVDSKNQIY-----------SSECASTAATGGVCKKPRVQP-VSGQPPIKVRKEKVGDRITALHQLVSPFGKTDTASVLS

Query:  EAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEGENGRNCVFPEDPGQ---------------------ENEEEAKDLRSRGLC
        EAIGY+RFLQSQIEAL  PY G               T   G    Q   +  R+C+FPEDPGQ                      +EE  KDLRSRGLC
Subjt:  EAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEGENGRNCVFPEDPGQ---------------------ENEEEAKDLRSRGLC

Query:  LVPVSCTQHVQSDINGADYWAQA
        LVP+SCT  V SD NGADYWA A
Subjt:  LVPVSCTQHVQSDINGADYWAQA


Sequences
CDS sequence
ATGATGGCTGGAAACCCTAGTAATTGGTGGAACATGTTTCCACCTTCTCAGCTTTCTCCTCCTCAGTTTGTTCTTGGATCTTCTTCACTTCCTTTGAGTTCTTCCATGGC
TGATCATCACAATCAAGAACAACCCAATTCACAATCATGGAGCCAATTACTTTTAGGTGGATTGCAAGAAGGAGATGATCAAGATAGGTTGGGTTTGAATAGTATTAATA
ATAATAATTTTCAACCAAAAAAGATGGAAAATTTGGAGGGGAGAATATTGATTCCATTTCCAAGATTTGGAGTTGGTGATGATGATGATGATCATAATGATGATGATCAT
CATCAAGTGTTGAAGCAAGAAAGTAGTAAGAGTTTGTCATTTTTATGGAATGAAAAGGAATCTTCATCATCTTCTTCAGCAGCTAAAGCTTGCTCTCAAACAAGGCCAGC
TACTTGTTCTTCTTCTCCCAAGTCTTCTGTCAATAGCAATGGCATATTGGATTTCTCTTTCAACAAAGTTGATTCCAAAAATCAAATTTATTCATCTGAGTGTGCTAGCA
CAGCCGCCACTGGTGGAGTGTGCAAGAAGCCTAGGGTTCAGCCCGTCTCCGGCCAGCCTCCGATAAAGGTGAGAAAGGAGAAGGTAGGGGACAGAATCACAGCTCTCCAC
CAGCTGGTTTCTCCATTTGGAAAGACTGACACTGCTTCTGTCTTGTCAGAGGCTATTGGGTATGTCAGATTCCTTCAGAGTCAAATTGAGGCTCTCATCTCTCCATATTT
GGGCAATAATAATAATTCAGCAAAACCCACAAAGAAGGAGCAACTTAGAACATTAAATGATGGGAGAAATTTGAGACAAGTTGAAGGTGAAAATGGAAGAAATTGTGTAT
TTCCTGAAGACCCTGGTCAGGAGAATGAAGAAGAAGCAAAGGACCTAAGGAGTAGAGGGCTTTGTTTGGTACCAGTATCTTGTACACAACATGTTCAAAGTGACATTAAT
GGAGCTGATTATTGGGCTCAAGCTTATAATGGCAGCTTCTAA
mRNA sequence
ATGATGGCTGGAAACCCTAGTAATTGGTGGAACATGTTTCCACCTTCTCAGCTTTCTCCTCCTCAGTTTGTTCTTGGATCTTCTTCACTTCCTTTGAGTTCTTCCATGGC
TGATCATCACAATCAAGAACAACCCAATTCACAATCATGGAGCCAATTACTTTTAGGTGGATTGCAAGAAGGAGATGATCAAGATAGGTTGGGTTTGAATAGTATTAATA
ATAATAATTTTCAACCAAAAAAGATGGAAAATTTGGAGGGGAGAATATTGATTCCATTTCCAAGATTTGGAGTTGGTGATGATGATGATGATCATAATGATGATGATCAT
CATCAAGTGTTGAAGCAAGAAAGTAGTAAGAGTTTGTCATTTTTATGGAATGAAAAGGAATCTTCATCATCTTCTTCAGCAGCTAAAGCTTGCTCTCAAACAAGGCCAGC
TACTTGTTCTTCTTCTCCCAAGTCTTCTGTCAATAGCAATGGCATATTGGATTTCTCTTTCAACAAAGTTGATTCCAAAAATCAAATTTATTCATCTGAGTGTGCTAGCA
CAGCCGCCACTGGTGGAGTGTGCAAGAAGCCTAGGGTTCAGCCCGTCTCCGGCCAGCCTCCGATAAAGGTGAGAAAGGAGAAGGTAGGGGACAGAATCACAGCTCTCCAC
CAGCTGGTTTCTCCATTTGGAAAGACTGACACTGCTTCTGTCTTGTCAGAGGCTATTGGGTATGTCAGATTCCTTCAGAGTCAAATTGAGGCTCTCATCTCTCCATATTT
GGGCAATAATAATAATTCAGCAAAACCCACAAAGAAGGAGCAACTTAGAACATTAAATGATGGGAGAAATTTGAGACAAGTTGAAGGTGAAAATGGAAGAAATTGTGTAT
TTCCTGAAGACCCTGGTCAGGAGAATGAAGAAGAAGCAAAGGACCTAAGGAGTAGAGGGCTTTGTTTGGTACCAGTATCTTGTACACAACATGTTCAAAGTGACATTAAT
GGAGCTGATTATTGGGCTCAAGCTTATAATGGCAGCTTCTAA
Protein sequence
MMAGNPSNWWNMFPPSQLSPPQFVLGSSSLPLSSSMADHHNQEQPNSQSWSQLLLGGLQEGDDQDRLGLNSINNNNFQPKKMENLEGRILIPFPRFGVGDDDDDHNDDDH
HQVLKQESSKSLSFLWNEKESSSSSSAAKACSQTRPATCSSSPKSSVNSNGILDFSFNKVDSKNQIYSSECASTAATGGVCKKPRVQPVSGQPPIKVRKEKVGDRITALH
QLVSPFGKTDTASVLSEAIGYVRFLQSQIEALISPYLGNNNNSAKPTKKEQLRTLNDGRNLRQVEGENGRNCVFPEDPGQENEEEAKDLRSRGLCLVPVSCTQHVQSDIN
GADYWAQAYNGSF
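
As a consistency check, the CDS listed in this record can be translated and compared with the protein sequence. The sketch below is illustrative only: it assumes the standard nuclear genetic code (NCBI translation table 1), and the function name `translate` is hypothetical rather than part of any database tooling.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in T, C, A, G order and the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of three")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 33 bases and the final line of the CDS listed in this record.
print(translate("ATGATGGCTGGAAACCCTAGTAATTGGTGGAAC"))
print(translate("GGAGCTGATTATTGGGCTCAAGCTTATAATGGCAGCTTCTAA"))
```

The two fragments translate to MMAGNPSNWWN and GADYWAQAYNGSF, which agree with the start and end of the protein sequence listed here. Passing the full CDS (with line breaks, which the function strips) would allow the whole protein to be compared the same way.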