; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG09G010160 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG09G010160
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionG-box-binding factor 4-like isoform X2
Genome locationCG_Chr09:9928231..9932015
RNA-Seq ExpressionClCG09G010160
SyntenyClCG09G010160
Gene Ontology termsGO:0045893 - positive regulation of transcription, DNA-templated (biological process)
GO:0016020 - membrane (cellular component)
GO:0003700 - DNA-binding transcription factor activity (molecular function)
InterPro domainsIPR004827 - Basic-leucine zipper domain
IPR043452 - Plant bZIP transcription factors


Homology Show/hide homology
GenBank top hitse value%identityAlignment
TYK31528.1 G-box-binding factor 4-like isoform X2 [Cucumis melo var. makuwa]1.2e-4168.1Show/hide
Query:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK
        VALI NRNP SH     +D +F TS  SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E VGFGN V+I  RGK
Subjt:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK

Query:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS
        RRR AMEPMD+AALQRQRRMIKNRESAARSRERKQAHQ+ELE IASRLEEENE LLKEK + S
Subjt:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS

XP_008461925.1 PREDICTED: G-box-binding factor 4-like isoform X1 [Cucumis melo]4.3e-4268.71Show/hide
Query:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK
        VALI NRNP SH     +D +F TS  SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E VGFGN VDI  RGK
Subjt:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK

Query:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS
        RRR AMEPMD+AALQRQRRMIKNRESAARSRERKQAHQ+ELE IASRLEEENE LLKEK + S
Subjt:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS

XP_008461926.1 PREDICTED: G-box-binding factor 4-like isoform X2 [Cucumis melo]4.3e-4268.71Show/hide
Query:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK
        VALI NRNP SH     +D +F TS  SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E VGFGN VDI  RGK
Subjt:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK

Query:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS
        RRR AMEPMD+AALQRQRRMIKNRESAARSRERKQAHQ+ELE IASRLEEENE LLKEK + S
Subjt:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS

XP_008461927.1 PREDICTED: G-box-binding factor 4-like isoform X3 [Cucumis melo]4.3e-4268.71Show/hide
Query:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK
        VALI NRNP SH     +D +F TS  SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E VGFGN VDI  RGK
Subjt:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK

Query:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS
        RRR AMEPMD+AALQRQRRMIKNRESAARSRERKQAHQ+ELE IASRLEEENE LLKEK + S
Subjt:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS

XP_022981612.1 G-box-binding factor 4-like isoform X1 [Cucurbita maxima]1.0e-3565.13Show/hide
Query:  NPSHFRTVDMDAKFDTSSSASKTVDDLWKELKEEAVGEMI-LEGFLQAKPQDQDVRILNPFSCLKDFDRVYVEEETVGFGNGVDISGR-GKRRRAAMEPM
        NP H   +DMD+K    S+AS+ VDD+W   +E+AV EM+  E F+  K Q +DVRILNP +C   F+    EE  VGFGNG +ISGR GKRRRA MEPM
Subjt:  NPSHFRTVDMDAKFDTSSSASKTVDDLWKELKEEAVGEMI-LEGFLQAKPQDQDVRILNPFSCLKDFDRVYVEEETVGFGNGVDISGR-GKRRRAAMEPM

Query:  DEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQ
        DEAALQRQRRMIKNRESAARSRERK AHQVELELIA+RLEEEN  LLK+K +
Subjt:  DEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQ

TrEMBL top hitse value%identityAlignment
A0A1S3CFQ6 G-box-binding factor 4-like isoform X12.1e-4268.71Show/hide
Query:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK
        VALI NRNP SH     +D +F TS  SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E VGFGN VDI  RGK
Subjt:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK

Query:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS
        RRR AMEPMD+AALQRQRRMIKNRESAARSRERKQAHQ+ELE IASRLEEENE LLKEK + S
Subjt:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS

A0A1S3CGA7 G-box-binding factor 4-like isoform X22.1e-4268.71Show/hide
Query:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK
        VALI NRNP SH     +D +F TS  SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E VGFGN VDI  RGK
Subjt:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK

Query:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS
        RRR AMEPMD+AALQRQRRMIKNRESAARSRERKQAHQ+ELE IASRLEEENE LLKEK + S
Subjt:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS

A0A1S3CH58 G-box-binding factor 4-like isoform X32.1e-4268.71Show/hide
Query:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK
        VALI NRNP SH     +D +F TS  SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E VGFGN VDI  RGK
Subjt:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK

Query:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS
        RRR AMEPMD+AALQRQRRMIKNRESAARSRERKQAHQ+ELE IASRLEEENE LLKEK + S
Subjt:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS

A0A5D3E6S6 G-box-binding factor 4-like isoform X26.0e-4268.1Show/hide
Query:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK
        VALI NRNP SH     +D +F TS  SS SKTVDDLW++LKEE+V ++                ILNP SCLKDFDRVYVE +E VGFGN V+I  RGK
Subjt:  VALIGNRNP-SHFRTVDMDAKFDTS--SSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVE-EETVGFGNGVDISGRGK

Query:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS
        RRR AMEPMD+AALQRQRRMIKNRESAARSRERKQAHQ+ELE IASRLEEENE LLKEK + S
Subjt:  RRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQFS

A0A6J1IUG5 G-box-binding factor 4-like isoform X14.9e-3665.13Show/hide
Query:  NPSHFRTVDMDAKFDTSSSASKTVDDLWKELKEEAVGEMI-LEGFLQAKPQDQDVRILNPFSCLKDFDRVYVEEETVGFGNGVDISGR-GKRRRAAMEPM
        NP H   +DMD+K    S+AS+ VDD+W   +E+AV EM+  E F+  K Q +DVRILNP +C   F+    EE  VGFGNG +ISGR GKRRRA MEPM
Subjt:  NPSHFRTVDMDAKFDTSSSASKTVDDLWKELKEEAVGEMI-LEGFLQAKPQDQDVRILNPFSCLKDFDRVYVEEETVGFGNGVDISGR-GKRRRAAMEPM

Query:  DEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQ
        DEAALQRQRRMIKNRESAARSRERK AHQVELELIA+RLEEEN  LLK+K +
Subjt:  DEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEKVQ

SwissProt top hitse value%identityAlignment
P42777 G-box-binding factor 42.9e-1740.17Show/hide
Query:  LNSKNSQLSPPPPSSSSLS---SFSSHTKSGSISNEETKSVALIGNRNPSHFRT--------VDMDAKFDTSSSAS--KTVDDLWKEL-----------K
        ++S NS LS    SS+S S     S H +    ++    S A  G  N   F +        +D+D      +S +  K+VDD+WKE+           +
Subjt:  LNSKNSQLSPPPPSSSSLS---SFSSHTKSGSISNEETKSVALIGNRNPSHFRT--------VDMDAKFDTSSSAS--KTVDDLWKEL-----------K

Query:  EEAVGEMILEGFLQAKPQDQ------DV-----RILNPFSCLKDFD---RVYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAAR
        EE    M LE FL     D+      DV     R+ N  S   DF        +      G GV    RGKR R  ME MD+AA QRQ+RMIKNRESAAR
Subjt:  EEAVGEMILEGFLQAKPQDQ------DV-----RILNPFSCLKDFD---RVYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAAR

Query:  SRERKQAHQVELELIASRLEEENELLLKE
        SRERKQA+QVELE +A++LEEENE LLKE
Subjt:  SRERKQAHQVELELIASRLEEENELLLKE

Q0JHF1 bZIP transcription factor 122.5e-1345.69Show/hide
Query:  EMILEGFLQAK-PQDQDVRILNPFSCLKDFDRVYVEEETVGFGNGVD----ISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELEL
        EM LE FL  +    +D  ++   S  K        +  +GF NG +    ++G   R+R  M+PMD AA+QRQ+RMIKNRESAARSRERKQA+  ELE 
Subjt:  EMILEGFLQAK-PQDQDVRILNPFSCLKDFDRVYVEEETVGFGNGVD----ISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELEL

Query:  IASRLEEENELLLKEK
        + ++LEEEN  + KE+
Subjt:  IASRLEEENELLLKEK

Q9C5Q2 ABSCISIC ACID-INSENSITIVE 5-like protein 31.9e-0833.74Show/hide
Query:  ASKTVDDLWKEL---------------KEEAVGEMILEGFL--------QAKPQDQDVRILNPFSCLKDFDR-----------VYVEEETVGFGNGVDIS
        + KTVD++W+++               K+  +GE+ LE  L           PQ+  V I +    ++   +           V   ++ V  G   D  
Subjt:  ASKTVDDLWKEL---------------KEEAVGEMILEGFL--------QAKPQDQDVRILNPFSCLKDFDR-----------VYVEEETVGFGNGVDIS

Query:  GRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEK
            R+R A E +++   +RQ+RMIKNRESAARSR RKQA+  ELE+  SRLEEENE L + K
Subjt:  GRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEK

Q9LES3 ABSCISIC ACID-INSENSITIVE 5-like protein 23.2e-0857.58Show/hide
Query:  DISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEK
        D    G++R A+ E +++   +RQ+RMIKNRESAARSR RKQA+  ELE+  SRLEEENE L K+K
Subjt:  DISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEK

Q9SJN0 Protein ABSCISIC ACID-INSENSITIVE 54.9e-0943.01Show/hide
Query:  QDVRILNPFSCLKDFDRVYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELL
        Q + ++ P S +      + + + +G   GVD+ G   R+R    P+++   +RQRRMIKNRESAARSR RKQA+ VELE   ++L+EEN  L
Subjt:  QDVRILNPFSCLKDFDRVYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELL

Arabidopsis top hitse value%identityAlignment
AT1G03970.1 G-box binding factor 42.0e-1840.17Show/hide
Query:  LNSKNSQLSPPPPSSSSLS---SFSSHTKSGSISNEETKSVALIGNRNPSHFRT--------VDMDAKFDTSSSAS--KTVDDLWKEL-----------K
        ++S NS LS    SS+S S     S H +    ++    S A  G  N   F +        +D+D      +S +  K+VDD+WKE+           +
Subjt:  LNSKNSQLSPPPPSSSSLS---SFSSHTKSGSISNEETKSVALIGNRNPSHFRT--------VDMDAKFDTSSSAS--KTVDDLWKEL-----------K

Query:  EEAVGEMILEGFLQAKPQDQ------DV-----RILNPFSCLKDFD---RVYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAAR
        EE    M LE FL     D+      DV     R+ N  S   DF        +      G GV    RGKR R  ME MD+AA QRQ+RMIKNRESAAR
Subjt:  EEAVGEMILEGFLQAKPQDQ------DV-----RILNPFSCLKDFD---RVYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAAR

Query:  SRERKQAHQVELELIASRLEEENELLLKE
        SRERKQA+QVELE +A++LEEENE LLKE
Subjt:  SRERKQAHQVELELIASRLEEENELLLKE

AT2G36270.1 Basic-leucine zipper (bZIP) transcription factor family protein3.5e-1043.01Show/hide
Query:  QDVRILNPFSCLKDFDRVYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELL
        Q + ++ P S +      + + + +G   GVD+ G   R+R    P+++   +RQRRMIKNRESAARSR RKQA+ VELE   ++L+EEN  L
Subjt:  QDVRILNPFSCLKDFDRVYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELL

AT2G41070.1 Basic-leucine zipper (bZIP) transcription factor family protein1.3e-0933.74Show/hide
Query:  ASKTVDDLWKEL---------------KEEAVGEMILEGFL--------QAKPQDQDVRILNPFSCLKDFDR-----------VYVEEETVGFGNGVDIS
        + KTVD++W+++               K+  +GE+ LE  L           PQ+  V I +    ++   +           V   ++ V  G   D  
Subjt:  ASKTVDDLWKEL---------------KEEAVGEMILEGFL--------QAKPQDQDVRILNPFSCLKDFDR-----------VYVEEETVGFGNGVDIS

Query:  GRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEK
            R+R A E +++   +RQ+RMIKNRESAARSR RKQA+  ELE+  SRLEEENE L + K
Subjt:  GRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEK

AT2G41070.3 Basic-leucine zipper (bZIP) transcription factor family protein1.3e-0933.74Show/hide
Query:  ASKTVDDLWKEL---------------KEEAVGEMILEGFL--------QAKPQDQDVRILNPFSCLKDFDR-----------VYVEEETVGFGNGVDIS
        + KTVD++W+++               K+  +GE+ LE  L           PQ+  V I +    ++   +           V   ++ V  G   D  
Subjt:  ASKTVDDLWKEL---------------KEEAVGEMILEGFL--------QAKPQDQDVRILNPFSCLKDFDR-----------VYVEEETVGFGNGVDIS

Query:  GRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEK
            R+R A E +++   +RQ+RMIKNRESAARSR RKQA+  ELE+  SRLEEENE L + K
Subjt:  GRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVELELIASRLEEENELLLKEK

AT5G44080.1 Basic-leucine zipper (bZIP) transcription factor family protein2.3e-2241.74Show/hide
Query:  PPPPSSSSLSSFSSHTKSGSISNEETKSVALIGNRNPSHFRTVDMDAKFDTSSSASKTVDDLWKE--------LKEEAVGE-MILEGFL-----------
        PP P+ SSL   S +    S +  E  +              VD     +T +   K+VD++W+E        +KEE   E M LE FL           
Subjt:  PPPPSSSSLSSFSSHTKSGSISNEETKSVALIGNRNPSHFRTVDMDAKFDTSSSASKTVDDLWKE--------LKEEAVGE-MILEGFL-----------

Query:  QAKPQDQDVRI-------------LNPFSCLKDFDRVYVEEETVGFGNGVDISG---RGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVE
         A  +D DV+I              NPF  +       VE   V FGNG+D+ G   RGKR R  +EP+D+AA QRQRRMIKNRESAARSRERKQA+QVE
Subjt:  QAKPQDQDVRI-------------LNPFSCLKDFDRVYVEEETVGFGNGVDISG---RGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQAHQVE

Query:  LELIASRLEEENELLLKE
        LE +A++LEEENELL KE
Subjt:  LELIASRLEEENELLLKE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCTTGGTCTCGCTCTTCGGCACACGATCTTGCTCTTTGGCACTCAGTTTGGCTTTGCAACACAAATGGAAAAGAGAGAAGAGGAAGTCGGTGGGAGAAGAAACATGTG
GACTCGCCGGAGTTGGTCGCCTCGCCAAAGCTGCGAAATGGCGCCACTGAATTCGAAAAATTCGCAACTGTCGCCTCCTCCTCCTTCTTCTTCTTCACTCTCCTCCTTTT
CTTCACACACGAAATCCGGATCCATCTCCAATGAAGAAACCAAATCAGTAGCTTTAATCGGCAATCGCAATCCCTCTCATTTCCGCACTGTGGACATGGATGCTAAGTTC
GATACTTCATCTTCTGCTTCTAAAACTGTGGACGATCTTTGGAAGGAGTTGAAGGAGGAGGCTGTTGGAGAGATGATCTTGGAGGGTTTTCTTCAAGCCAAACCACAGGA
TCAGGATGTGAGGATTTTGAATCCGTTTAGTTGTTTAAAGGATTTCGATAGGGTTTATGTTGAAGAAGAGACTGTTGGGTTTGGGAATGGAGTTGACATTAGTGGGAGAG
GGAAGAGAAGGCGCGCAGCTATGGAACCAATGGATGAAGCTGCACTGCAAAGACAACGGAGGATGATTAAGAACAGGGAGTCTGCTGCTAGGTCCAGAGAAAGGAAACAA
GCACATCAAGTTGAGTTAGAGTTAATAGCTTCGAGACTTGAGGAAGAGAACGAGCTATTATTGAAAGAGAAGGTTCAGTTTTCCTGCTGA
mRNA sequenceShow/hide mRNA sequence
ATGCTTGGTCTCGCTCTTCGGCACACGATCTTGCTCTTTGGCACTCAGTTTGGCTTTGCAACACAAATGGAAAAGAGAGAAGAGGAAGTCGGTGGGAGAAGAAACATGTG
GACTCGCCGGAGTTGGTCGCCTCGCCAAAGCTGCGAAATGGCGCCACTGAATTCGAAAAATTCGCAACTGTCGCCTCCTCCTCCTTCTTCTTCTTCACTCTCCTCCTTTT
CTTCACACACGAAATCCGGATCCATCTCCAATGAAGAAACCAAATCAGTAGCTTTAATCGGCAATCGCAATCCCTCTCATTTCCGCACTGTGGACATGGATGCTAAGTTC
GATACTTCATCTTCTGCTTCTAAAACTGTGGACGATCTTTGGAAGGAGTTGAAGGAGGAGGCTGTTGGAGAGATGATCTTGGAGGGTTTTCTTCAAGCCAAACCACAGGA
TCAGGATGTGAGGATTTTGAATCCGTTTAGTTGTTTAAAGGATTTCGATAGGGTTTATGTTGAAGAAGAGACTGTTGGGTTTGGGAATGGAGTTGACATTAGTGGGAGAG
GGAAGAGAAGGCGCGCAGCTATGGAACCAATGGATGAAGCTGCACTGCAAAGACAACGGAGGATGATTAAGAACAGGGAGTCTGCTGCTAGGTCCAGAGAAAGGAAACAA
GCACATCAAGTTGAGTTAGAGTTAATAGCTTCGAGACTTGAGGAAGAGAACGAGCTATTATTGAAAGAGAAGGTTCAGTTTTCCTGCTGA
Protein sequenceShow/hide protein sequence
MLGLALRHTILLFGTQFGFATQMEKREEEVGGRRNMWTRRSWSPRQSCEMAPLNSKNSQLSPPPPSSSSLSSFSSHTKSGSISNEETKSVALIGNRNPSHFRTVDMDAKF
DTSSSASKTVDDLWKELKEEAVGEMILEGFLQAKPQDQDVRILNPFSCLKDFDRVYVEEETVGFGNGVDISGRGKRRRAAMEPMDEAALQRQRRMIKNRESAARSRERKQ
AHQVELELIASRLEEENELLLKEKVQFSC