; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Moc04g40060 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene IDMoc04g40060
OrganismMomordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Descriptionagamous-like MADS-box protein AGL80
Genome locationchr4:29757920..29759945
RNA-Seq ExpressionMoc04g40060
SyntenyMoc04g40060
Gene Ontology termsGO:0045944 - positive regulation of transcription by RNA polymerase II (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0000978 - RNA polymerase II proximal promoter sequence-specific DNA binding (molecular function)
GO:0000981 - DNA-binding transcription factor activity, RNA polymerase II-specific (molecular function)
GO:0046983 - protein dimerization activity (molecular function)
InterPro domainsIPR002100 - Transcription factor, MADS-box
IPR033897 - MADS SRF-like
IPR036879 - Transcription factor, MADS-box superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6602743.1 hypothetical protein SDJN03_07976, partial [Cucurbita argyrosperma subsp. sororia]3.6e-3181.44Show/hide
Query:  MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPE----DGEEEIGLLQR
        MLATAAAAAS WL SSR+RF  +LLCSPLLVPIFCATFP ICAIELCIRLARHR  I LRDSPE ERLRRCEEGGC +ALPE    DGEE+IGLLQR
Subjt:  MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPE----DGEEEIGLLQR

XP_022152501.1 uncharacterized protein LOC111020213 [Momordica charantia]7.8e-42100Show/hide
Query:  MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPEDGEEEIGLLQR
        MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPEDGEEEIGLLQR
Subjt:  MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPEDGEEEIGLLQR

XP_022718279.1 agamous-like MADS-box protein AGL80 [Durio zibethinus]5.1e-3352.32Show/hide
Query:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE
        R K+KH+LISN ++R+ TLKKRKAGLLK L++LTTLCGV ACA+I +  D+Q ++WPS  +AF V+E+F N P KK+ K MMD   FL R +  L E+LE
Subjt:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE

Query:  KEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERI
        K++ K QE E EL LA H+ G   C+LN L  L EL  ++   I+FV+ +I
Subjt:  KEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERI

XP_022962190.1 uncharacterized protein LOC111462719 [Cucurbita moschata]3.6e-3181.44Show/hide
Query:  MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPE----DGEEEIGLLQR
        MLATAAAAAS WL SSR+RF  +LLCSPLLVPIFCATFP ICAIELCIRLARHR  I LRDSPE ERLRRCEEGGC +ALPE    DGEE+IGLLQR
Subjt:  MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPE----DGEEEIGLLQR

XP_023550754.1 uncharacterized protein LOC111808798 [Cucurbita pepo subsp. pepo]3.6e-3181.44Show/hide
Query:  MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPE----DGEEEIGLLQR
        MLATAAAAAS WL SSR+RF  +LLCSPLLVPIFCATFP ICAIELCIRLARHR  I LRDSPE ERLRRCEEGGC +ALPE    DGEE+IGLLQR
Subjt:  MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPE----DGEEEIGLLQR

TrEMBL top hitse value%identityAlignment
A0A2N9F373 MADS-box domain-containing protein1.3e-3146.59Show/hide
Query:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE
        R KVKH+LISN  +RR T +KRKAGLLK LS+LTTLCG++AC +I++ +D Q ++WP PQ+A  +LERF+N P KKQ KYMMD K FL + ++ L  +LE
Subjt:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE

Query:  KE-KAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIKNLEPTRV-----RKIRPAHG
         E K   +  + EL L   L     C+      L E+  I+D K+ F+++RI FIK   PT++      K R  HG
Subjt:  KE-KAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIKNLEPTRV-----RKIRPAHG

A0A6A6L8T0 MADS-box domain-containing protein5.1e-3146.79Show/hide
Query:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE
        R KVKH+LISN + R+ T KKR+AGLLK L +LTTLCGV+ACA+I + ++S  +IWPS  +A  VLE+F+  P KKQ KYMMD ++FL R ++ L E+LE
Subjt:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE

Query:  KEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIKN
        K++ K +  E EL  A  + G  +  LN L+N+ ++S ++   +E++++RI   K+
Subjt:  KEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIKN

A0A6J1DHX6 uncharacterized protein LOC1110202133.8e-42100Show/hide
Query:  MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPEDGEEEIGLLQR
        MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPEDGEEEIGLLQR
Subjt:  MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPEDGEEEIGLLQR

A0A6J1HEC7 uncharacterized protein LOC1114627191.8e-3181.44Show/hide
Query:  MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPE----DGEEEIGLLQR
        MLATAAAAAS WL SSR+RF  +LLCSPLLVPIFCATFP ICAIELCIRLARHR  I LRDSPE ERLRRCEEGGC +ALPE    DGEE+IGLLQR
Subjt:  MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPE----DGEEEIGLLQR

A0A6P5WQH6 agamous-like MADS-box protein AGL802.5e-3352.32Show/hide
Query:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE
        R K+KH+LISN ++R+ TLKKRKAGLLK L++LTTLCGV ACA+I +  D+Q ++WPS  +AF V+E+F N P KK+ K MMD   FL R +  L E+LE
Subjt:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE

Query:  KEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERI
        K++ K QE E EL LA H+ G   C+LN L  L EL  ++   I+FV+ +I
Subjt:  KEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERI

SwissProt top hitse value%identityAlignment
O80805 MADS-box transcription factor PHERES 12.2e-1535.46Show/hide
Query:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE
        R K+K   I N ++R+ T  KRK G+LK  ++L TLCGV ACAVI + ++S  E WPS +   +V+ +F  F +  + K M+D +TFLR+++    E+L+
Subjt:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE

Query:  K--EKAKIQEFEQELF-------LAHHLEGNGICNLNCLLN
        K  ++ +  +    +F          HL G  + +LN  LN
Subjt:  K--EKAKIQEFEQELF-------LAHHLEGNGICNLNCLLN

Q7XJK6 Agamous-like MADS-box protein AGL361.4e-1433.94Show/hide
Query:  KVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLEKE
        KVK  LI+N   R+ +  KRK G+ K L +L+TLCGV ACA+I++      E WPS + A  V  RF   P   + K MMD +T+L  ++T  KEQL+  
Subjt:  KVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLEKE

Query:  KAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIKNLEPTRVRKIRP
         A+ +E +   F+   +EG          +L +L   I+  ++ ++ RI  IK    + +  + P
Subjt:  KAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIKNLEPTRVRKIRP

Q7XJK8 MADS-box transcription factor PHERES 24.2e-1433.55Show/hide
Query:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE
        + K+K  LI N   R+ T  KRK G+ K L++L TLCGV ACAV+++  +S  E WPS +   DV+ +F    +  + K M+D +TF+ +++   KEQL+
Subjt:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE

Query:  KEKAKIQEFEQELFLAHHLEG-NGICNLNCLLNLIELSCIIDFKIEFVSERIAFI
        K + +    +    +   L+G   + NL+   +L +LS  ID  +  ++ RI  +
Subjt:  KEKAKIQEFEQELFLAHHLEG-NGICNLNCLLNLIELSCIIDFKIEFVSERIAFI

Q9C6V3 Agamous-like MADS-box protein AGL862.4e-1732.95Show/hide
Query:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE
        RSK+K  LI+N T RR T +KRK G+   L +LTTLCGV ACAVI + +++ + +WPS +   + +  F   P  +Q K MM H+T+L+ ++T   ++LE
Subjt:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE

Query:  KEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIKNLEPTRVRKIRPAHGQIA
          + + +E +   F+   +EG    +     +L +LS  ID  I  ++  +  + N   +      P H  +A
Subjt:  KEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIKNLEPTRVRKIRPAHGQIA

Q9FJK3 Agamous-like MADS-box protein AGL802.5e-1935.85Show/hide
Query:  LQRSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQ
        + R KVK   ISN + R+AT KKRK GL+K + +L+TLCG+ ACA+I++ +D+  E+WPS      V+  FR  P   Q K M+D + FL++++    E 
Subjt:  LQRSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQ

Query:  LEKEKAKIQEFEQELFLAHHLEGN-GICNLNCLLNLIELSCIIDFKIEFVSERIAFIKN
        L +++   +E E    +   L GN  + +LN +++L +L  +I+  ++ V+ RI  ++N
Subjt:  LEKEKAKIQEFEQELFLAHHLEGN-GICNLNCLLNLIELSCIIDFKIEFVSERIAFIKN

Arabidopsis top hitse value%identityAlignment
AT1G22590.2 AGAMOUS-like 872.8e-2134.84Show/hide
Query:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE
        R KV H LIS+   RR T +KRK GLLK + +LT LCG+ ACA+I++ +    E+WP+  +   +L R    P++KQ KYMMD K  + + +   +++LE
Subjt:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE

Query:  KEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIK
        KEK   +  +  L        + I + +C   L   + ++D K++ + ERI  ++
Subjt:  KEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIK

AT1G31630.1 AGAMOUS-like 861.7e-1832.95Show/hide
Query:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE
        RSK+K  LI+N T RR T +KRK G+   L +LTTLCGV ACAVI + +++ + +WPS +   + +  F   P  +Q K MM H+T+L+ ++T   ++LE
Subjt:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE

Query:  KEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIKNLEPTRVRKIRPAHGQIA
          + + +E +   F+   +EG    +     +L +LS  ID  I  ++  +  + N   +      P H  +A
Subjt:  KEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIKNLEPTRVRKIRPAHGQIA

AT1G65330.1 MADS-box transcription factor family protein1.6e-1635.46Show/hide
Query:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE
        R K+K   I N ++R+ T  KRK G+LK  ++L TLCGV ACAVI + ++S  E WPS +   +V+ +F  F +  + K M+D +TFLR+++    E+L+
Subjt:  RSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLE

Query:  K--EKAKIQEFEQELF-------LAHHLEGNGICNLNCLLN
        K  ++ +  +    +F          HL G  + +LN  LN
Subjt:  K--EKAKIQEFEQELF-------LAHHLEGNGICNLNCLLN

AT5G26630.1 MADS-box transcription factor family protein2.1e-1631.29Show/hide
Query:  LQRSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQ
        + R KVK   I N T R++T KKRK GLLK   +L  LCGV   AV+++ ++   E+WPS + A  V+ +++   +  + K M++ +TFL++++T   E 
Subjt:  LQRSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQ

Query:  LEKEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIK-NLEPT
         +K + + +E E +  +   L G  + +      L +   +I+ +++ V+ RI  +K N EP+
Subjt:  LEKEKAKIQEFEQELFLAHHLEGNGICNLNCLLNLIELSCIIDFKIEFVSERIAFIK-NLEPT

AT5G48670.1 AGAMOUS-like 801.8e-2035.85Show/hide
Query:  LQRSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQ
        + R KVK   ISN + R+AT KKRK GL+K + +L+TLCG+ ACA+I++ +D+  E+WPS      V+  FR  P   Q K M+D + FL++++    E 
Subjt:  LQRSKVKHDLISNGTLRRATLKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQ

Query:  LEKEKAKIQEFEQELFLAHHLEGN-GICNLNCLLNLIELSCIIDFKIEFVSERIAFIKN
        L +++   +E E    +   L GN  + +LN +++L +L  +I+  ++ V+ RI  ++N
Subjt:  LEKEKAKIQEFEQELFLAHHLEGN-GICNLNCLLNLIELSCIIDFKIEFVSERIAFIKN


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTAGCCACCGCCGCTGCTGCTGCCTCTTCTTGGCTTTGTTCCAGCCGAACCCGCTTTGTCTTTGTCTTGCTTTGTTCCCCCCTCCTAGTTCCCATTTTCTGCGCTAC
TTTCCCCTTCATCTGTGCCATAGAGCTCTGCATCCGCCTAGCTCGTCACAGGCGAAGCATATCTCTTCGTGATTCCCCAGAAATCGAACGGTTGCGGCGATGCGAGGAAG
GCGGCTGCGCAGCAGCGCTCCCGGAGGACGGGGAGGAAGAGATCGGTCTATTACAAAGGAGCAAGGTCAAGCACGATTTGATTAGTAACGGGACCTTGAGAAGAGCAACA
CTGAAGAAAAGGAAGGCGGGATTACTGAAAAATCTGAGCCAACTTACGACACTATGCGGAGTTGTTGCCTGTGCGGTTATTCATAATGTCCACGACTCACAAATCGAGAT
TTGGCCTTCTCCACAACAAGCATTCGATGTGCTGGAAAGGTTTAGGAATTTTCCAATCAAGAAACAACAAAAATACATGATGGACCACAAAACGTTTCTCAGAAGACAAG
TCACCGTACTCAAAGAGCAACTGGAGAAAGAAAAGGCTAAAATCCAAGAATTCGAGCAGGAGCTATTTTTAGCACATCACCTGGAAGGTAACGGTATATGTAATTTGAAT
TGTTTACTAAACTTGATAGAATTAAGTTGCATAATAGACTTTAAGATTGAGTTTGTTAGTGAACGAATTGCGTTCATCAAGAACCTCGAACCAACTCGAGTAAGGAAAAT
TAGGCCGGCACATGGCCAGATTGCTTATTCCCTATGA
mRNA sequenceShow/hide mRNA sequence
ATGTTAGCCACCGCCGCTGCTGCTGCCTCTTCTTGGCTTTGTTCCAGCCGAACCCGCTTTGTCTTTGTCTTGCTTTGTTCCCCCCTCCTAGTTCCCATTTTCTGCGCTAC
TTTCCCCTTCATCTGTGCCATAGAGCTCTGCATCCGCCTAGCTCGTCACAGGCGAAGCATATCTCTTCGTGATTCCCCAGAAATCGAACGGTTGCGGCGATGCGAGGAAG
GCGGCTGCGCAGCAGCGCTCCCGGAGGACGGGGAGGAAGAGATCGGTCTATTACAAAGGAGCAAGGTCAAGCACGATTTGATTAGTAACGGGACCTTGAGAAGAGCAACA
CTGAAGAAAAGGAAGGCGGGATTACTGAAAAATCTGAGCCAACTTACGACACTATGCGGAGTTGTTGCCTGTGCGGTTATTCATAATGTCCACGACTCACAAATCGAGAT
TTGGCCTTCTCCACAACAAGCATTCGATGTGCTGGAAAGGTTTAGGAATTTTCCAATCAAGAAACAACAAAAATACATGATGGACCACAAAACGTTTCTCAGAAGACAAG
TCACCGTACTCAAAGAGCAACTGGAGAAAGAAAAGGCTAAAATCCAAGAATTCGAGCAGGAGCTATTTTTAGCACATCACCTGGAAGGTAACGGTATATGTAATTTGAAT
TGTTTACTAAACTTGATAGAATTAAGTTGCATAATAGACTTTAAGATTGAGTTTGTTAGTGAACGAATTGCGTTCATCAAGAACCTCGAACCAACTCGAGTAAGGAAAAT
TAGGCCGGCACATGGCCAGATTGCTTATTCCCTATGA
Protein sequenceShow/hide protein sequence
MLATAAAAASSWLCSSRTRFVFVLLCSPLLVPIFCATFPFICAIELCIRLARHRRSISLRDSPEIERLRRCEEGGCAAALPEDGEEEIGLLQRSKVKHDLISNGTLRRAT
LKKRKAGLLKNLSQLTTLCGVVACAVIHNVHDSQIEIWPSPQQAFDVLERFRNFPIKKQQKYMMDHKTFLRRQVTVLKEQLEKEKAKIQEFEQELFLAHHLEGNGICNLN
CLLNLIELSCIIDFKIEFVSERIAFIKNLEPTRVRKIRPAHGQIAYSL