; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

IVF0011019 (gene) of Melon (IVF77) v1 genome

Gene IDIVF0011019
OrganismCucumis melo ssp. agrestis cv. IVF77 (Melon (IVF77) v1)
DescriptionGATA transcription factor 21
Genome locationchr02:1871861..1874008
RNA-Seq ExpressionIVF0011019
SyntenyIVF0011019
Gene Ontology termsGO:0006355 - regulation of transcription, DNA-templated (biological process)
GO:0008270 - zinc ion binding (molecular function)
GO:0043565 - sequence-specific DNA binding (molecular function)
InterPro domainsIPR000679 - Zinc finger, GATA-type
IPR013088 - Zinc finger, NHR/GATA-type


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6588037.1 GATA transcription factor 21, partial [Cucurbita argyrosperma subsp. sororia]1.71e-6367.17Show/hide
Query:  MINSNLQTETTPTRTIDSGRNVQDLNP--PSPSPSSIEQTNKRTSATTLHEGG-AIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAA
        +IN N QTET PT+TID+ RN QDLNP  PSPSPS  +QTNKR    TL++GG AIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAA
Subjt:  MINSNLQTETTPTRTIDSGRNVQDLNP--PSPSPSSIEQTNKRTSATTLHEGG-AIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAA

Query:  AAATNGG---AVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVS--GHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
            NGG   AVVLKTNKA+      KPA TM      KRK+K+ V   +         GGG + KLC E++KMG RLSEISS+YQRVFPQDEREAAI
Subjt:  AAATNGG---AVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVS--GHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI

XP_004135818.1 putative GATA transcription factor 22 [Cucumis sativus]3.59e-9786.84Show/hide
Query:  MINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAA
        MINSN QTETT TRTI+SGRNVQDLN  SPSPSS EQTNKRTS TTLH+GGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAA
Subjt:  MINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAA

Query:  TNGGAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
         NGGAVV+KTNK VQHKITTKPATT      LKRKYKDEVVVV    GGDK GG + KLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
Subjt:  TNGGAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI

XP_008450852.1 PREDICTED: GATA transcription factor 21 [Cucumis melo]1.27e-124100Show/hide
Query:  MINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAA
        MINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAA
Subjt:  MINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAA

Query:  TNGGAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
        TNGGAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
Subjt:  TNGGAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI

XP_023530322.1 GATA transcription factor 21-like [Cucurbita pepo subsp. pepo]4.69e-6567.53Show/hide
Query:  MINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSSIEQTNKRTSATTLHEGG-AIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAA
        +IN N QTET PT+TID+ RN QDLNP  PSPS  +QTNKR    TL++GG AIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAA  
Subjt:  MINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSSIEQTNKRTSATTLHEGG-AIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAA

Query:  ATNGG---AVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
          NGG   AVVLKTNKA+      KPA TM      KRK+K+ V   +       GGG + KLC E++KMG RLSEISS+YQRVFPQDEREAAI
Subjt:  ATNGG---AVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI

XP_038878562.1 GATA transcription factor 21 [Benincasa hispida]3.74e-8380.51Show/hide
Query:  MINSNLQTETTPTRTIDSGRNVQDLNP--PSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAA
        +INSN QTET  TRTIDSGRN QDLN   P+PSPSS +QTNKRTS T L +GGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAA
Subjt:  MINSNLQTETTPTRTIDSGRNVQDLNP--PSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAA

Query:  AATNG---GAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
        AA NG    AVVLK+NKAVQHKI TK A   TTTT LKRK KD VV   G GGGD GGGRK  LCFEEIK+G RLSEISSSYQRVFPQDEREAAI
Subjt:  AATNG---GAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI

TrEMBL top hitse value%identityAlignment
A0A0A0LZE4 GATA-type domain-containing protein4.4e-7786.84Show/hide
Query:  MINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAA
        MINSN QTETT TRTI+SGRNVQDLN  SPSPSS EQTNKRTS TTLH+GGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAA
Subjt:  MINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAA

Query:  TNGGAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
         NGGAVV+KTNK VQHKITTKPATT      LKRKYKDEVVVV    GGDK GG + KLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
Subjt:  TNGGAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI

A0A1S3BPL1 GATA transcription factor 212.0e-98100Show/hide
Query:  MINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAA
        MINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAA
Subjt:  MINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAA

Query:  TNGGAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
        TNGGAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
Subjt:  TNGGAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI

A0A6J1ELP1 GATA transcription factor 21-like2.1e-5066.5Show/hide
Query:  MINSNLQTETTPTRTIDSGRNVQDLNP--PSPSPSSIEQTNKRTSATTLHE-GGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAA
        +IN N QTE TPT+TID+ RN QDLNP  PSPSPS  +QTNKR    TL++ GGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAA
Subjt:  MINSNLQTETTPTRTIDSGRNVQDLNP--PSPSPSSIEQTNKRTSATTLHE-GGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAA

Query:  AAATNGGAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEV----VVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
            N  AVVLKTNKA+      KPA TM      KRK+K+ V       +       GGG + KLC E++KMG RLSEISS+YQRVFPQDEREAAI
Subjt:  AAATNGGAVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEV----VVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI

A0A6J1HT96 GATA transcription factor 21-like isoform X11.5e-4865.13Show/hide
Query:  MINSNLQTETTPTRTIDSGRNVQDLNP--PSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAA
        +IN N QTET   +TID+ RN QDLNP  PSPSPS  +QTNKR +      GGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAA 
Subjt:  MINSNLQTETTPTRTIDSGRNVQDLNP--PSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAA

Query:  AATNGG---AVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
           NGG   AVVLKTNKA+      KPA TM      KRK+K+ V   +       GGG + KLC E++KMG RL+EI+S+YQRVFPQDEREAAI
Subjt:  AATNGG---AVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI

A0A6J1HXZ7 GATA transcription factor 21-like isoform X21.5e-4865.13Show/hide
Query:  MINSNLQTETTPTRTIDSGRNVQDLNP--PSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAA
        +IN N QTET   +TID+ RN QDLNP  PSPSPS  +QTNKR +      GGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAA 
Subjt:  MINSNLQTETTPTRTIDSGRNVQDLNP--PSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAA

Query:  AATNGG---AVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
           NGG   AVVLKTNKA+      KPA TM      KRK+K+ V   +       GGG + KLC E++KMG RL+EI+S+YQRVFPQDEREAAI
Subjt:  AATNGG---AVVLKTNKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI

SwissProt top hitse value%identityAlignment
Q5HZ36 GATA transcription factor 213.9e-2240.43Show/hide
Query:  TNKRTSAT------TLHEGG-----AIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAATNGGAVV------LKTNKAVQHKIT-
        T K T+AT      T++E G      +IR CSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA   AAAAA +    V      L   K +Q+K   
Subjt:  TNKRTSAT------TLHEGG-----AIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAATNGGAVV------LKTNKAVQHKIT-

Query:  ---------TKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKA--------KLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
                 + P         +K + + E+   +  G  +      +        K CF+++ +   +   SS+YQ+VFPQDE+EAA+
Subjt:  ---------TKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKA--------KLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI

Q6YW48 Protein CYTOKININ-RESPONSIVE GATA TRANSCRIPTION FACTOR 13.1e-1956.52Show/hide
Query:  PSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAATNGGAVVLKTNKAVQHKITTKPA
        P    Q ++  S   L +   ++R CSDCNTTKTPLWRSGP GPKSLCNACGIRQRKARRAM    AAA NGGA V          +  KPA
Subjt:  PSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAATNGGAVVLKTNKAVQHKITTKPA

Q8LC59 GATA transcription factor 238.2e-1267.39Show/hide
Query:  LHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA
        + E    IR CS+C TTKTP+WR GP GPKSLCNACGIR RK RR+
Subjt:  LHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA

Q9FJ10 GATA transcription factor 164.1e-1173.17Show/hide
Query:  RTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAE
        +TC+DC T+KTPLWR GP GPKSLCNACGIR RK RR   E
Subjt:  RTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAE

Q9SZI6 Putative GATA transcription factor 222.4e-1941.48Show/hide
Query:  SPSSIEQTNKRTSATTL-------HEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARR-AMAEAAAAATNG-GAVVLKTNKAVQHKITTK
        S SS + TN   S+          +    +IR CSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARR AMA A A A +G    V+K     ++KI+  
Subjt:  SPSSIEQTNKRTSATTL-------HEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARR-AMAEAAAAATNG-GAVVLKTNKAVQHKITTK

Query:  PATTMT----TTTALKRKYKDEVVVVSGHGGGDKGG---GRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
            ++         KR    E   ++                 + F+++ +   L   SS+YQ+VFPQDE+EAAI
Subjt:  PATTMT----TTTALKRKYKDEVVVVSGHGGGDKGG---GRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI

Arabidopsis top hitse value%identityAlignment
AT4G26150.1 cytokinin-responsive gata factor 11.7e-2041.48Show/hide
Query:  SPSSIEQTNKRTSATTL-------HEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARR-AMAEAAAAATNG-GAVVLKTNKAVQHKITTK
        S SS + TN   S+          +    +IR CSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARR AMA A A A +G    V+K     ++KI+  
Subjt:  SPSSIEQTNKRTSATTL-------HEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARR-AMAEAAAAATNG-GAVVLKTNKAVQHKITTK

Query:  PATTMT----TTTALKRKYKDEVVVVSGHGGGDKGG---GRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
            ++         KR    E   ++                 + F+++ +   L   SS+YQ+VFPQDE+EAAI
Subjt:  PATTMT----TTTALKRKYKDEVVVVSGHGGGDKGG---GRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI

AT4G36620.1 GATA transcription factor 194.9e-1256.6Show/hide
Query:  IIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAATNGGA
        + R C++C+TT TPLWR+GPRGPKSLCNACGIR +K  R  + A  + + GG+
Subjt:  IIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAATNGGA

AT5G26930.1 GATA transcription factor 235.8e-1367.39Show/hide
Query:  LHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA
        + E    IR CS+C TTKTP+WR GP GPKSLCNACGIR RK RR+
Subjt:  LHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA

AT5G49300.1 GATA transcription factor 162.9e-1273.17Show/hide
Query:  RTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAE
        +TC+DC T+KTPLWR GP GPKSLCNACGIR RK RR   E
Subjt:  RTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAE

AT5G56860.1 GATA type zinc finger transcription factor family protein2.8e-2340.43Show/hide
Query:  TNKRTSAT------TLHEGG-----AIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAATNGGAVV------LKTNKAVQHKIT-
        T K T+AT      T++E G      +IR CSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRA   AAAAA +    V      L   K +Q+K   
Subjt:  TNKRTSAT------TLHEGG-----AIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAATNGGAVV------LKTNKAVQHKIT-

Query:  ---------TKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKA--------KLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI
                 + P         +K + + E+   +  G  +      +        K CF+++ +   +   SS+YQ+VFPQDE+EAA+
Subjt:  ---------TKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKA--------KLCFEEIKMGGRLSEISSSYQRVFPQDEREAAI


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATAAATTCTAACCTGCAAACTGAGACGACTCCCACCAGAACAATTGACAGCGGTCGAAATGTCCAAGATCTCAACCCACCGTCACCGTCACCATCTTCGATTGAACA
AACGAACAAACGAACAAGCGCAACGACACTACACGAGGGCGGTGCCATAATCAGAACCTGTTCCGATTGTAACACCACAAAAACTCCCCTTTGGAGAAGCGGCCCTAGAG
GTCCTAAGTCACTTTGCAACGCATGTGGAATCCGACAGAGAAAAGCAAGACGAGCAATGGCAGAAGCAGCAGCGGCCGCCACGAACGGTGGGGCCGTAGTTTTGAAGACC
AACAAGGCGGTACAACACAAGATAACGACGAAGCCAGCGACGACGATGACGACAACAACAGCATTGAAGAGAAAATACAAAGACGAGGTCGTGGTAGTCAGTGGCCACGG
TGGCGGAGACAAGGGCGGAGGAAGAAAGGCGAAACTTTGTTTTGAAGAGATAAAAATGGGGGGAAGATTGAGTGAGATTTCTTCATCCTATCAACGAGTTTTCCCACAAG
ATGAAAGAGAAGCTGCCATTTGCTCATGA
mRNA sequenceShow/hide mRNA sequence
TCTTTCTTTCTTTTAACTCATTCTTCTTTGATCTTCTCTTCTTCATTATCATGGCTCCTCCTTATCGGGACTCGTTTCCCTCCGATCACGACGATCTTGATCATCTTCAC
TACTCGTCTTCTCATCATCACCTCTTCTTCCTATCGTCACCCCAGCTCAGGCTTCTTCCTCCTCCTCCTCCTCTCTCTCTTTCACTGCCCTCGATCATTCCATGATCTCC
GACGATCCTCGCTCGGTAGAGCTCAAACACGAGGGTGGTGGGATTATGGGTTGTAACAATGATCAAAGTATTGGAAATCATGAAGATCATATAGAAGAAACTGGACTAAG
GTTTACAATTTGGAAGCAGATTGATAAGAGAGAAACTTCAAGCTGTTGTGAGAATAATAATAACGATAATACTCACAATGATTCGGTGAAGTGGTCTTCTTCTTCTCCTC
CTCCTCCAAGATCAAATTCATGATAAATTCTAACCTGCAAACTGAGACGACTCCCACCAGAACAATTGACAGCGGTCGAAATGTCCAAGATCTCAACCCACCGTCACCGT
CACCATCTTCGATTGAACAAACGAACAAACGAACAAGCGCAACGACACTACACGAGGGCGGTGCCATAATCAGAACCTGTTCCGATTGTAACACCACAAAAACTCCCCTT
TGGAGAAGCGGCCCTAGAGGTCCTAAGTCACTTTGCAACGCATGTGGAATCCGACAGAGAAAAGCAAGACGAGCAATGGCAGAAGCAGCAGCGGCCGCCACGAACGGTGG
GGCCGTAGTTTTGAAGACCAACAAGGCGGTACAACACAAGATAACGACGAAGCCAGCGACGACGATGACGACAACAACAGCATTGAAGAGAAAATACAAAGACGAGGTCG
TGGTAGTCAGTGGCCACGGTGGCGGAGACAAGGGCGGAGGAAGAAAGGCGAAACTTTGTTTTGAAGAGATAAAAATGGGGGGAAGATTGAGTGAGATTTCTTCATCCTAT
CAACGAGTTTTCCCACAAGATGAAAGAGAAGCTGCCATTTGCTCATGACTCTATCTTACGGCC
Protein sequenceShow/hide protein sequence
MINSNLQTETTPTRTIDSGRNVQDLNPPSPSPSSIEQTNKRTSATTLHEGGAIIRTCSDCNTTKTPLWRSGPRGPKSLCNACGIRQRKARRAMAEAAAAATNGGAVVLKT
NKAVQHKITTKPATTMTTTTALKRKYKDEVVVVSGHGGGDKGGGRKAKLCFEEIKMGGRLSEISSSYQRVFPQDEREAAICS