; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC02g0022 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC02g0022
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
DescriptionFHA domain-containing protein
Genome locationMC02:265043..268659
RNA-Seq ExpressionMC02g0022
SyntenyMC02g0022
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR000253 - Forkhead-associated (FHA) domain
IPR008984 - SMAD/FHA domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6605848.1 hypothetical protein SDJN03_03165, partial [Cucurbita argyrosperma subsp. sororia]5.23e-7364.19Show/hide
Query:  MECALHSVS-----VSVSSFKNSSSPASNSFALPSSRSFRP----PPLRLLR---HLSFQPR-TPLHASASASQSQI-PQSSA-RWILLPVGDGDWKHIG
        ME  +HS+S     + +S   +SSSP+++    P +R+       PPL LLR   +L+F+ R   L     ASQS+  PQSS+ RW+L+PVGDGDWKHIG
Subjt:  MECALHSVS-----VSVSSFKNSSSPASNSFALPSSRSFRP----PPLRLLR---HLSFQPR-TPLHASASASQSQI-PQSSA-RWILLPVGDGDWKHIG

Query:  SKVEMPDAFEIVSSEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEA
        SKVEMPDAFEIVS+EVTVGRLP+KADI IPVATVS +HARIK Q+DRLL+ DLDSTNGTFI+DKRL PGVVAAVSSGNCI+FGDIHLAMFQVSKLKT +A
Subjt:  SKVEMPDAFEIVSSEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEA

Query:  ATKLQESAEPTNNSE
        A+K+QES +   NSE
Subjt:  ATKLQESAEPTNNSE

XP_016900711.1 PREDICTED: uncharacterized protein LOC103491680 isoform X1 [Cucumis melo]1.36e-7767.82Show/hide
Query:  MECALHSVSVSVSSFKNSSSPASNSFALPS-SRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSS
        ME A+HS S+S  ++ NSS+P+ +   L   S S  P   RL   + F+PR         S  +  Q+S+RW+L+PVGDG+WKHIGSKVEMPDAFEIVS+
Subjt:  MECALHSVSVSVSSFKNSSSPASNSFALPS-SRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSS

Query:  EVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESAE-PTNN
        EVTVGRLPDKADIVIPVATVSG HARIKNQ+D+LL+ DLDSTNGTFI+DKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAA+K+QE  E P ++
Subjt:  EVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESAE-PTNN

Query:  SE
        SE
Subjt:  SE

XP_022153741.1 zeaxanthin epoxidase, chloroplastic [Momordica charantia]8.02e-137100Show/hide
Query:  MECALHSVSVSVSSFKNSSSPASNSFALPSSRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSSE
        MECALHSVSVSVSSFKNSSSPASNSFALPSSRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSSE
Subjt:  MECALHSVSVSVSSFKNSSSPASNSFALPSSRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSSE

Query:  VTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESAEPTNNSE
        VTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESAEPTNNSE
Subjt:  VTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESAEPTNNSE

Query:  LETS
        LETS
Subjt:  LETS

XP_022957686.1 uncharacterized protein LOC111459151 [Cucurbita moschata]7.42e-7362.67Show/hide
Query:  MECALHSVS-----VSVSSFKNSSSPASNSFALPSSRSFRP----PPLRLLR---HLSFQPRT-----PLHASASASQSQIPQSSARWILLPVGDGDWKH
        ME  +HS+S     + +S    SSSP+++    P +R+       PPL LLR   +L+F+ R      PL AS S  + +   SS RW+L+PVGDGDWKH
Subjt:  MECALHSVS-----VSVSSFKNSSSPASNSFALPSSRSFRP----PPLRLLR---HLSFQPRT-----PLHASASASQSQIPQSSARWILLPVGDGDWKH

Query:  IGSKVEMPDAFEIVSSEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTV
        IGSKVEMPDAFEIVS+EVTVGRLP+KADI IPVATVS +HARIK Q+DRLL+ DLDSTNGTFI+DKRL PGVVAAVSSGNCI+FGDIHLAMFQVSKLKT 
Subjt:  IGSKVEMPDAFEIVSSEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTV

Query:  EAATKLQESAEPTNNSE
        +AA+K+QES +   NSE
Subjt:  EAATKLQESAEPTNNSE

XP_038902005.1 zeaxanthin epoxidase, chloroplastic [Benincasa hispida]2.66e-7868.66Show/hide
Query:  MECALHSVSVSVSSFKNSSSPASNSFALPSSRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSSE
        ME ALHS+     SF N SSP   S           P L + R  SF      H          P  ++RW+LLPVGDG+WKHIGSKVEMPDAFEIVS+E
Subjt:  MECALHSVSVSVSSFKNSSSPASNSFALPSSRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSSE

Query:  VTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQE-SAEPTNNS
        VTVGRLPDKADIVIPVATVS LHARIKNQ+DRLL+ DLDSTNGTFI+DKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATK+QE S EP ++S
Subjt:  VTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQE-SAEPTNNS

Query:  E
        E
Subjt:  E

TrEMBL top hitse value%identityAlignment
A0A0A0KX03 FHA domain-containing protein1.05e-7467.18Show/hide
Query:  MECALHSVSVSVSSFKNSSSPASNSFALPS-SRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSS
        ME A+HS S+S  ++ NS +P+ +   L S S  FRP    LL+ L                    Q+S+RW+L+PVGDG+WKHIGSKVEMPDAFEIVS+
Subjt:  MECALHSVSVSVSSFKNSSSPASNSFALPS-SRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSS

Query:  EVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESAE
        EVTVGRLPDKADIVIPVATVS  HARIKNQ+DRLL+ DLDSTNGTFINDKRLNPGVVAAVSSGN ITFGDIHLAMFQV+KLKTVEAA+K+QE  E
Subjt:  EVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESAE

A0A1S4DXK3 uncharacterized protein LOC103491680 isoform X16.60e-7867.82Show/hide
Query:  MECALHSVSVSVSSFKNSSSPASNSFALPS-SRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSS
        ME A+HS S+S  ++ NSS+P+ +   L   S S  P   RL   + F+PR         S  +  Q+S+RW+L+PVGDG+WKHIGSKVEMPDAFEIVS+
Subjt:  MECALHSVSVSVSSFKNSSSPASNSFALPS-SRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSS

Query:  EVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESAE-PTNN
        EVTVGRLPDKADIVIPVATVSG HARIKNQ+D+LL+ DLDSTNGTFI+DKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAA+K+QE  E P ++
Subjt:  EVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESAE-PTNN

Query:  SE
        SE
Subjt:  SE

A0A6J1DLL4 zeaxanthin epoxidase, chloroplastic3.88e-137100Show/hide
Query:  MECALHSVSVSVSSFKNSSSPASNSFALPSSRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSSE
        MECALHSVSVSVSSFKNSSSPASNSFALPSSRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSSE
Subjt:  MECALHSVSVSVSSFKNSSSPASNSFALPSSRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSSE

Query:  VTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESAEPTNNSE
        VTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESAEPTNNSE
Subjt:  VTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESAEPTNNSE

Query:  LETS
        LETS
Subjt:  LETS

A0A6J1GZX7 uncharacterized protein LOC1114591513.59e-7362.67Show/hide
Query:  MECALHSVS-----VSVSSFKNSSSPASNSFALPSSRSFRP----PPLRLLR---HLSFQPRT-----PLHASASASQSQIPQSSARWILLPVGDGDWKH
        ME  +HS+S     + +S    SSSP+++    P +R+       PPL LLR   +L+F+ R      PL AS S  + +   SS RW+L+PVGDGDWKH
Subjt:  MECALHSVS-----VSVSSFKNSSSPASNSFALPSSRSFRP----PPLRLLR---HLSFQPRT-----PLHASASASQSQIPQSSARWILLPVGDGDWKH

Query:  IGSKVEMPDAFEIVSSEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTV
        IGSKVEMPDAFEIVS+EVTVGRLP+KADI IPVATVS +HARIK Q+DRLL+ DLDSTNGTFI+DKRL PGVVAAVSSGNCI+FGDIHLAMFQVSKLKT 
Subjt:  IGSKVEMPDAFEIVSSEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTV

Query:  EAATKLQESAEPTNNSE
        +AA+K+QES +   NSE
Subjt:  EAATKLQESAEPTNNSE

A0A6J1K1I9 uncharacterized protein LOC1114915705.10e-7362.21Show/hide
Query:  MECALHSVS-----VSVSSFKNSSSPASNSFALPSSRSFRP----PPLRLLR---HLSFQPR-----TPLHASASASQSQIPQSSARWILLPVGDGDWKH
        ME  +HS+S     + +S   +SSSP+++    P +R+       PPL LLR   +L+F+ R      PL A  S  + Q   SS RW+L+PVGDGDWKH
Subjt:  MECALHSVS-----VSVSSFKNSSSPASNSFALPSSRSFRP----PPLRLLR---HLSFQPR-----TPLHASASASQSQIPQSSARWILLPVGDGDWKH

Query:  IGSKVEMPDAFEIVSSEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTV
        IGSKVEMPDAFEIVS+E+TVGRLP+KADI IPVATVS +HARIK Q+DRLL+ DLDSTNGTFI+DKRL PGVVAAVSSGNCI+FGDIHLAMFQVSKLKT 
Subjt:  IGSKVEMPDAFEIVSSEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTV

Query:  EAATKLQESAEPTNNSE
        +AA+K+QES +   NSE
Subjt:  EAATKLQESAEPTNNSE

SwissProt top hitse value%identityAlignment
P46017 Protein FraH1.3e-0429.01Show/hide
Query:  PRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGS--KVEMPDAFEIVSSEVTVGRLPDKADI--VIPVATVSGLHARIKNQQDRLLITDLDSTNGT
        P T + A  S +++Q+ Q +AR +          H+ S  ++E+P +  +V       R+P   D+        VS +HA I+ + D   I D+ S+NGT
Subjt:  PRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGS--KVEMPDAFEIVSSEVTVGRLPDKADI--VIPVATVSGLHARIKNQQDRLLITDLDSTNGT

Query:  FINDKRLNPGVVAAVSSGNCITFGDIHLAMF
        +IN+  L PG    +  G+ I+ G   L  F
Subjt:  FINDKRLNPGVVAAVSSGNCITFGDIHLAMF

P93236 Zeaxanthin epoxidase, chloroplastic2.7e-0527.07Show/hide
Query:  SSARWILLPVGDGDWKHIGSKVEMPDAFEIVSSEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDK-----RLNPGVVAAVSS
        + A W+LLP G+G        +   +        ++   +P K+ IV+P+  VS +HARI  +     +TDL S +GT++ D      R +P        
Subjt:  SSARWILLPVGDGDWKHIGSKVEMPDAFEIVSSEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDK-----RLNPGVVAAVSS

Query:  GNCITFGDIHLAMFQVSKLKTVEAATKLQESAE
         + I FG    A F+V  +K     ++ +E  E
Subjt:  GNCITFGDIHLAMFQVSKLKTVEAATKLQESAE

Q40412 Zeaxanthin epoxidase, chloroplastic7.1e-0629.17Show/hide
Query:  SSARWILLPVGDGDWKHIGSKVEMPDAFEIVSSEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDK-----RLNPGVVAAVSS
        + A W+LLP G+ +       +   +        V+   +P K+ +VIP+  VS +HARI  +     +TDL S +GT+I D      R +P        
Subjt:  SSARWILLPVGDGDWKHIGSKVEMPDAFEIVSSEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDK-----RLNPGVVAAVSS

Query:  GNCITFGDIHLAMFQVSKLK
         + I FG    A F+V  +K
Subjt:  GNCITFGDIHLAMFQVSKLK

Q9FGC7 Zeaxanthin epoxidase, chloroplastic3.0e-0432.41Show/hide
Query:  VGRLPDK----ADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDK-----RLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESA
        VG  PD+      IVIP + VS +HAR+  +     + DL S +GT++ D      R  P   A   S + I FG    A F+V  ++    +T+  ES 
Subjt:  VGRLPDK----ADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDK-----RLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESA

Query:  EPTNNSEL
           NN +L
Subjt:  EPTNNSEL

Arabidopsis top hitse value%identityAlignment
AT1G34355.1 forkhead-associated (FHA) domain-containing protein8.1e-0533.78Show/hide
Query:  VSSEVTVGRLPDKADIVIPVATVSGLHARIK--NQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFG
        V   + VGR PD  DI++   ++S  H  I+  + + +L +TDL S +GT++ D R+ P     V  G+ I  G
Subjt:  VSSEVTVGRLPDKADIVIPVATVSGLHARIK--NQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFG

AT2G21530.1 SMAD/FHA domain-containing protein2.3e-3649.72Show/hide
Query:  SVSVSSFKNSSSPASNSFALPSSRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSA-RWILLPVGDGDWKHIGSKVEMPDAFEIVSSEVTVGRLP
        +VSV+  + +   ++ SF     RS   P +R  R             AS  ++Q P S   RW+L PVGDGD +HIG KV MP  FEI S +VT+GRLP
Subjt:  SVSVSSFKNSSSPASNSFALPSSRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSA-RWILLPVGDGDWKHIGSKVEMPDAFEIVSSEVTVGRLP

Query:  DKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVE
        +KAD+VIPVATVSG+HA I   +  LL+TD++STNGTFI DKRL PGV A    G  ITFGD +LA+F+V KL+  E
Subjt:  DKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVE

AT4G14490.1 SMAD/FHA domain-containing protein6.2e-0533.8Show/hide
Query:  SEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGD
        S + VGR+    +I I  A +S  H RI++     +I DL S+NGT +N   L+P     +  G+ I  G+
Subjt:  SEVTVGRLPDKADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGD

AT5G67030.1 zeaxanthin epoxidase (ZEP) (ABA1)2.1e-0532.41Show/hide
Query:  VGRLPDK----ADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDK-----RLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESA
        VG  PD+      IVIP + VS +HAR+  +     + DL S +GT++ D      R  P   A   S + I FG    A F+V  ++    +T+  ES 
Subjt:  VGRLPDK----ADIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDK-----RLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESA

Query:  EPTNNSEL
           NN +L
Subjt:  EPTNNSEL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAGTGCGCACTGCATTCTGTTTCTGTTTCTGTTTCTTCCTTCAAAAATTCATCCTCTCCCGCTTCCAATTCCTTTGCTCTTCCCTCCTCCCGCAGCTTCCGGCCGCC
GCCGTTAAGGTTATTGCGGCATCTCAGCTTCCAGCCAAGAACACCTCTACACGCTTCCGCTTCCGCTTCCCAGTCTCAAATTCCTCAATCTTCAGCCCGTTGGATCCTCC
TACCCGTTGGCGATGGAGATTGGAAGCATATAGGTTCCAAGGTTGAAATGCCAGATGCTTTCGAAATTGTGTCGAGTGAGGTCACTGTTGGGCGCCTTCCCGATAAAGCT
GACATTGTAATCCCAGTTGCAACAGTTTCTGGTCTTCATGCTCGCATCAAAAACCAACAAGACAGGCTGTTGATCACAGATCTAGACAGCACCAATGGGACTTTTATCAA
TGACAAGAGGCTCAACCCTGGAGTTGTTGCTGCTGTATCATCTGGGAATTGCATTACTTTTGGGGACATCCATCTGGCCATGTTTCAGGTCTCAAAGCTCAAGACTGTAG
AAGCTGCAACCAAACTCCAAGAATCGGCGGAGCCAACCAACAACTCTGAATTGGAAACAAGTTGA
mRNA sequenceShow/hide mRNA sequence
GAAGAATTTGTACAAAGTTGAGAAGAAGCAGCTAGGGGAGACCATAAGTACCAAATCGGCCTCACGAAGCTCCATAAAGGAGACGGAAAACAAGGGGCACTTTTGATATT
TAAATTGAATAGGGAATAGGAATACAAGGTGGATCGAATCTATAAATACTCGAAAGGGGTTAGGGTTGCGCATTATACACTATTTTTCCGCTCCGCCCCCAATACCAATA
CATCACTAATCACTCCTCTCCACATTTCTGCGCAAGCTTTGGTTTTTCGATGCTCCACCCAATGCTGCATTGTTCTCATTGCCTCCTCCACTGCGAAGCCGATCGCCTCG
ATGACTTCGTATGCTGTTCGGGCTGCGGCAAGGTTCTTGGCCAGTTCGTTGGAGGAAAATACATCCATCACAGCCAACTACCTATACTCGATAGGGCCACCAAATTGAAG
AGAAGCAAGAATGAGAATAAGAAGAAGCGGCATGAGAAAATTGATCAGGATGACGATCCCGATGATGGAGGGGGAATCCGAAAAGCTAAGATCCTGAAGGCTGAAGGCTG
AAGGCTGAAGGCTGAAGGCGGAGAGGAGTCTTCCAAAATTTGCCAAGCCATGGAGTGCGCACTGCATTCTGTTTCTGTTTCTGTTTCTTCCTTCAAAAATTCATCCTCTC
CCGCTTCCAATTCCTTTGCTCTTCCCTCCTCCCGCAGCTTCCGGCCGCCGCCGTTAAGGTTATTGCGGCATCTCAGCTTCCAGCCAAGAACACCTCTACACGCTTCCGCT
TCCGCTTCCCAGTCTCAAATTCCTCAATCTTCAGCCCGTTGGATCCTCCTACCCGTTGGCGATGGAGATTGGAAGCATATAGGTTCCAAGGTTGAAATGCCAGATGCTTT
CGAAATTGTGTCGAGTGAGGTCACTGTTGGGCGCCTTCCCGATAAAGCTGACATTGTAATCCCAGTTGCAACAGTTTCTGGTCTTCATGCTCGCATCAAAAACCAACAAG
ACAGGCTGTTGATCACAGATCTAGACAGCACCAATGGGACTTTTATCAATGACAAGAGGCTCAACCCTGGAGTTGTTGCTGCTGTATCATCTGGGAATTGCATTACTTTT
GGGGACATCCATCTGGCCATGTTTCAGGTCTCAAAGCTCAAGACTGTAGAAGCTGCAACCAAACTCCAAGAATCGGCGGAGCCAACCAACAACTCTGAATTGGAAACAAG
TTGATAAAAATGGCACATTTTATATGCCATCTAGAGAGAGAGAGAGACAGCAAATATGTGCAAAAATCTGCACCTTCTGTATTGTTTGTAACAGTAATCTTTGCATGAAT
TTTTCCATATGAGTTGAGGAAAATATTGAAGTGCCAATGAGTCATCTGATAGACCATTTAGTTTAAATTGACAGTAAAGGGTCAAATTGTTAAATTTTTTTTTTTATGGA
CAACGGTGACTGGAGCAGAGGGATTCAAATTTAGGCTGCCTATTTCTAAACTCACTTGTCTTTGCTAATAGAGCTATTTTTATTACACAGATTATCACTAA
Protein sequenceShow/hide protein sequence
MECALHSVSVSVSSFKNSSSPASNSFALPSSRSFRPPPLRLLRHLSFQPRTPLHASASASQSQIPQSSARWILLPVGDGDWKHIGSKVEMPDAFEIVSSEVTVGRLPDKA
DIVIPVATVSGLHARIKNQQDRLLITDLDSTNGTFINDKRLNPGVVAAVSSGNCITFGDIHLAMFQVSKLKTVEAATKLQESAEPTNNSELETS