; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS026036 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS026036
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold325:346470..347708
RNA-Seq ExpressionMS026036
SyntenyMS026036
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG5219090.1 receptor protein [Salix suchowensis]5.5e-4341.09Show/hide
Query:  MEETSLSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKDNIIGGLEGGKRKFSYREMVVGNSEAEMEMDTESPLEEDFSDEKEGQYNSDDVQFFK---
        +E+T  + EE D L RS   ++KVKI   +S +            G +GG    +Y E ++G+S  E E D +    +  SD+ E   + DD    K   
Subjt:  MEETSLSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKDNIIGGLEGGKRKFSYREMVVGNSEAEMEMDTESPLEEDFSDEKEGQYNSDDVQFFK---

Query:  EQWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALW
        E+ +  R PW+  LIIK++G++ GY FL++RL  MW  +G+F L  L N+F++ +    EDR  +L +G W + D YL VR W P F P  A ID+VA+W
Subjt:  EQWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALW

Query:  VQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDF
        V++PD  ME Y+   + +IGD IGKTLKID  T  G  G FAR+CVEVDLTK L + F
Subjt:  VQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDF

KAG6752401.1 hypothetical protein POTOM_044628 [Populus tomentosa]6.4e-4442.52Show/hide
Query:  LSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKDNIIGGLEGGKRKFSYREMVVGNSEAEMEMDTESPLEEDF-SDEKEGQYNSDD---VQFFKEQWR
        +S EE+D L RS KK +   I   S  +  DS+  +N   G       FSY+E ++G S    + D   P +EDF SD+ E     +D   ++   E+ +
Subjt:  LSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKDNIIGGLEGGKRKFSYREMVVGNSEAEMEMDTESPLEEDF-SDEKEGQYNSDD---VQFFKEQWR

Query:  SFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIP
          R+PWR  LIIKVLG++  Y FL++RL  MW  +G+  L+ L NDFF+ R    EDRE  +  G W + D YL +R W P F P+ A I++VA+W+++P
Subjt:  SFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIP

Query:  DCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDF
        D  ME Y+   + +IG+ IGKTLK+D  T  G  GN+ARICVEVDLTK L S F
Subjt:  DCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDF

KAG6772677.1 hypothetical protein POTOM_024095 [Populus tomentosa]6.4e-4442.52Show/hide
Query:  LSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKDNIIGGLEGGKRKFSYREMVVGNSEAEMEMDTESPLEEDF-SDEKEGQYNSDD---VQFFKEQWR
        +S EE+D L RS KK +   I   S  +  DS+  +N   G       FSY+E ++G S    + D   P +EDF SD+ E     +D   ++   E+ +
Subjt:  LSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKDNIIGGLEGGKRKFSYREMVVGNSEAEMEMDTESPLEEDF-SDEKEGQYNSDD---VQFFKEQWR

Query:  SFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIP
          R+PWR  LIIKVLG++  Y FL++RL  MW  +G+  L+ L NDFF+ R    EDRE  +  G W + D YL +R W P F P+ A I++VA+W+++P
Subjt:  SFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIP

Query:  DCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDF
        D  ME Y+   + +IG+ IGKTLK+D  T  G  GN+ARICVEVDLTK L S F
Subjt:  DCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDF

XP_034894449.1 uncharacterized protein LOC118033543 isoform X1 [Populus alba]6.4e-4442.52Show/hide
Query:  LSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKDNIIGGLEGGKRKFSYREMVVGNSEAEMEMDTESPLEEDF-SDEKEGQYNSDD---VQFFKEQWR
        +S EE+D L RS KK +   I   S  +  DS   +N   G       FSY+E ++G S  + + D   P +EDF SD+ E     +D   ++   E+ +
Subjt:  LSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKDNIIGGLEGGKRKFSYREMVVGNSEAEMEMDTESPLEEDF-SDEKEGQYNSDD---VQFFKEQWR

Query:  SFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIP
          R+PWR  LIIKVLG++  Y FL++RL  MW  +G+  L+ L NDFF+ R    EDRE  +  G W + D YL +R W P F P+ A I++VA+W+++P
Subjt:  SFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIP

Query:  DCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDF
        D  ME Y+   + +IG+ IGKTLK+D  T  G  GN+ARICVEVDLTK L S F
Subjt:  DCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDF

XP_034894450.1 uncharacterized protein LOC118033543 isoform X2 [Populus alba]6.4e-4442.52Show/hide
Query:  LSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKDNIIGGLEGGKRKFSYREMVVGNSEAEMEMDTESPLEEDF-SDEKEGQYNSDD---VQFFKEQWR
        +S EE+D L RS KK +   I   S  +  DS   +N   G       FSY+E ++G S  + + D   P +EDF SD+ E     +D   ++   E+ +
Subjt:  LSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKDNIIGGLEGGKRKFSYREMVVGNSEAEMEMDTESPLEEDF-SDEKEGQYNSDD---VQFFKEQWR

Query:  SFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIP
          R+PWR  LIIKVLG++  Y FL++RL  MW  +G+  L+ L NDFF+ R    EDRE  +  G W + D YL +R W P F P+ A I++VA+W+++P
Subjt:  SFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIP

Query:  DCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDF
        D  ME Y+   + +IG+ IGKTLK+D  T  G  GN+ARICVEVDLTK L S F
Subjt:  DCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDF

TrEMBL top hitse value%identityAlignment
A0A2Z6MNN8 CCHC-type domain-containing protein4.4e-3833.55Show/hide
Query:  LEGGKRKFSYREMVVG-NSEAEMEMDTESP-LEEDFSDEKE--------GQYNSDDVQFFKEQWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHK
        +EGGK   SY+ MVVG + E E+  D +   ++E+  DE E        G+Y   +  F K + +    PWR  +I+K+LG++ GY+ L  RL  MW  K
Subjt:  LEGGKRKFSYREMVVG-NSEAEMEMDTESP-LEEDFSDEKE--------GQYNSDDVQFFKEQWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHK

Query:  GEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMG
        G   +I L N +++V     ED+   + DG W I D YL VR W+P F P+   I  VA+WV+I   P+E Y+   +  IG+ +GKT+K+D  T   E G
Subjt:  GEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMG

Query:  NFARICVEVDLTKKLRSDFTVMGE----------------------RLGIGDLIQ----QKTEEQRNS------KISEPSQGHGPWILVDNSRRNRGSQS
         +AR+CVEV+LTK L + F++ G                       R G  + +Q       E+Q  +      K   P +  GPW++V   +RNR  + 
Subjt:  NFARICVEVDLTKKLRSDFTVMGE----------------------RLGIGDLIQ----QKTEEQRNS------KISEPSQGHGPWILVDNSRRNRGSQS

Query:  RPKP
          +P
Subjt:  RPKP

A0A392LX56 CCHC-type domain-containing protein6.7e-3931.34Show/hide
Query:  EETSLSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKD-----NIIGGLEGGKRKFSYREMVVG-NSEAEMEMDTESPLEED----------FSDEKE
        +E S + +  D+ +   +  +K+K       N   S PKD     N+     GG R  SY+ MVVG   E EM  DTE     D            ++K 
Subjt:  EETSLSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKD-----NIIGGLEGGKRKFSYREMVVG-NSEAEMEMDTESPLEED----------FSDEKE

Query:  GQYNSDDVQFFKEQWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFR
        G+Y   +  F K + +    PWR  +I+K+LG++ GY+ L  RL  MW  KG   +I L ND+F+V     ED+   L++G W I D YL V+ W+P F 
Subjt:  GQYNSDDVQFFKEQWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFR

Query:  PSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDFTVMGERLGI------------------GDL
        P+   I +VA+WV+I   P+E Y+   +  IGD IG+T+K+D  T + E G +AR+CVEV+LTK+L + F++   +  +                   + 
Subjt:  PSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDFTVMGERLGI------------------GDL

Query:  IQQKTEEQRNSKISEPS----------QGHGPWILVDNSRRNRGSQSR---PKPDLGVSHFPWVNGD
           K ++Q  +   +P+          +G GPW++V   RR R  + +    + D G +    +NGD
Subjt:  IQQKTEEQRNSKISEPS----------QGHGPWILVDNSRRNRGSQSR---PKPDLGVSHFPWVNGD

A0A392NIW1 DUF4283 domain-containing protein (Fragment)1.7e-3736.92Show/hide
Query:  SYREMVVGN---------SEAEMEMDTESPLEED---FSDEKEGQYNSDDVQFFKEQWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLI
        SYR+MV G          S+ E E   E   EE+     ++  G Y   +  F K + +    PWR  +I+K+LG++ GY+ L  RL  MW  KG   +I
Subjt:  SYREMVVGN---------SEAEMEMDTESPLEED---FSDEKEGQYNSDDVQFFKEQWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLI

Query:  SLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARIC
         LSND+++V     +D+   L+DG W I D YL V+ W+P F P+   I +VA+WV+I   P++ Y+   +  IG+ +GKT+K+D  T + E G +AR+C
Subjt:  SLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARIC

Query:  VEVDLTKKLRSDFTVMGERLGIGDLIQQKTEEQRNSKISEPSQGHGPWILVDNSRRNRGS
        V+VDLTK L + F + G +  I       T    N    + S G GPW +V   RRNR S
Subjt:  VEVDLTKKLRSDFTVMGERLGIGDLIQQKTEEQRNSKISEPSQGHGPWILVDNSRRNRGS

A0A6N2LZK8 CCHC-type domain-containing protein5.0e-4238.91Show/hide
Query:  EETSLSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKDNIIGGLEGGKRKFSYREMVVGNSEAEMEMDTESPLEEDFSDEKEGQYNSDD---VQFFKE
        E   +S EE+D L RS   ++KVKI                      G     SY E ++G S    EMD ++   +  S++++   + DD   ++   E
Subjt:  EETSLSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKDNIIGGLEGGKRKFSYREMVVGNSEAEMEMDTESPLEEDFSDEKEGQYNSDD---VQFFKE

Query:  QWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWV
        + +  R PW+  LIIK++G++ GY F M+RL  MW  +G+F L  L N+F++ +    EDRE +L  G W + D YL +R W P F P  A ID+VA+WV
Subjt:  QWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWV

Query:  QIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDF
        ++P+  +E Y+   + +IGD IGKTLKID  T  G  GNFAR+CVEVDLTK L + F
Subjt:  QIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDF

A0A7J7GEL2 DUF4283 domain-containing protein2.0e-3842.62Show/hide
Query:  DTESPLEEDFSDEKEGQYNSDDVQFFKEQWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKI
        D++S  + D  DE  G +    V   KE+ R  R PWR+ALI+K+LGK   + F+  RL  MW   G+ K++ L +D F+VR+   +D + +L DG W I
Subjt:  DTESPLEEDFSDEKEGQYNSDDVQFFKEQWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKI

Query:  MDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRS
           YL++R+W P FRPS A I  +A WV++P+ P+E ++E  +K++G+ IG+T+K+D+ T   + G FAR+CVE+DL K LRS
Subjt:  MDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding4.9e-2630.26Show/hide
Query:  ESPLEEDFSDEK------EGQYNSDDVQFFKEQWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDG
        E  ++++F  E+      +G+     +   +E   +    W+  +I+KVLG Q     L R+L  +W   G   ++ L   FF++R EL E+    L  G
Subjt:  ESPLEEDFSDEK------EGQYNSDDVQFFKEQWRSFRDPWRSALIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDG

Query:  HWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDFTVMGER
         W+++  YL V+ W+  F P R  I    +WV++ + P   Y+   + EI   +G+ LK+DM T + + G FAR+C+EV+L K L+    + G+R
Subjt:  HWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDFTVMGER

AT2G17920.1 nucleic acid binding;zinc ion binding8.2e-0528.24Show/hide
Query:  WKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLR
        W   + ++A +RW P   P+  F+  + LWVQ+   P    +E    EI   IG  + +D    +     + R+ V V +T  LR
Subjt:  WKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLR

AT2G41590.1 unknown protein1.1e-0427.06Show/hide
Query:  WKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLR
        W   + ++A  RW     P+  F+  + LWVQI   P+   +E  + EI   +G+ L +D    +     + R+ V   +T +LR
Subjt:  WKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLR

AT3G47920.1 unknown protein6.9e-0427Show/hide
Query:  VELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLR
        V+L+  + R L    W   + ++A  RW P   P   F+  + LWVQ+   P+    E    EI   IG+ + +D    +     + R+ V + +T +LR
Subjt:  VELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFIGKTLKIDMKTQSGEMGNFARICVEVDLTKKLR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAGAAACGAGTCTTTCGGTGGAAGAACAAGATCAACTAAATCGAAGTGTGAAGAAAGCCAGGAAAGTAAAAATCAGACCACGTTCTTCTGAAAACTATCCAGATTC
TTCACCAAAGGACAACATTATTGGTGGGCTAGAGGGAGGGAAAAGGAAATTTTCATACAGAGAAATGGTTGTTGGGAATTCAGAAGCGGAGATGGAGATGGACACTGAAT
CACCTCTGGAGGAGGATTTCAGTGATGAAAAGGAGGGCCAGTACAATTCGGATGATGTCCAATTTTTTAAGGAACAATGGAGATCATTTAGGGATCCTTGGAGGTCAGCC
CTAATTATTAAAGTTCTTGGGAAGCAATTTGGATACCAATTCCTTATGCGCCGTCTCACAGCTATGTGGTGCCACAAAGGTGAGTTCAAGTTAATTAGTCTTAGTAATGA
TTTTTTCATAGTAAGGGTGGAATTGATGGAAGATAGAGAAAGAATCCTTCTTGATGGCCATTGGAAAATCATGGATCGTTACTTAGCGGTGCGACGGTGGACTCCTGGGT
TCAGACCTTCAAGAGCATTCATAGATAGGGTAGCATTATGGGTCCAGATTCCAGATTGTCCGATGGAACTCTATAATGAGATCGGGATGAAAGAAATTGGTGACTTTATT
GGCAAAACACTGAAGATAGATATGAAAACTCAATCAGGAGAGATGGGGAATTTTGCTAGAATATGTGTGGAGGTAGACCTAACCAAGAAGTTAAGATCGGATTTCACCGT
AATGGGGGAAAGGCTGGGCATTGGTGACCTCATTCAACAGAAGACTGAGGAACAGAGAAATTCGAAAATTTCAGAGCCTTCTCAAGGCCATGGACCATGGATACTAGTTG
ACAATTCAAGAAGGAATAGGGGTAGCCAATCACGTCCCAAACCGGATCTGGGGGTGAGTCATTTCCCATGGGTGAATGGAGATCTTGATCAGGAAGAAGATGAGTCGAAC
CATGAAGACTCCATTGACCCAAATTCGACTTTCACATTTCTAGCAGAAGGCAGTAAGCAAGGATGGACATAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAGAAACGAGTCTTTCGGTGGAAGAACAAGATCAACTAAATCGAAGTGTGAAGAAAGCCAGGAAAGTAAAAATCAGACCACGTTCTTCTGAAAACTATCCAGATTC
TTCACCAAAGGACAACATTATTGGTGGGCTAGAGGGAGGGAAAAGGAAATTTTCATACAGAGAAATGGTTGTTGGGAATTCAGAAGCGGAGATGGAGATGGACACTGAAT
CACCTCTGGAGGAGGATTTCAGTGATGAAAAGGAGGGCCAGTACAATTCGGATGATGTCCAATTTTTTAAGGAACAATGGAGATCATTTAGGGATCCTTGGAGGTCAGCC
CTAATTATTAAAGTTCTTGGGAAGCAATTTGGATACCAATTCCTTATGCGCCGTCTCACAGCTATGTGGTGCCACAAAGGTGAGTTCAAGTTAATTAGTCTTAGTAATGA
TTTTTTCATAGTAAGGGTGGAATTGATGGAAGATAGAGAAAGAATCCTTCTTGATGGCCATTGGAAAATCATGGATCGTTACTTAGCGGTGCGACGGTGGACTCCTGGGT
TCAGACCTTCAAGAGCATTCATAGATAGGGTAGCATTATGGGTCCAGATTCCAGATTGTCCGATGGAACTCTATAATGAGATCGGGATGAAAGAAATTGGTGACTTTATT
GGCAAAACACTGAAGATAGATATGAAAACTCAATCAGGAGAGATGGGGAATTTTGCTAGAATATGTGTGGAGGTAGACCTAACCAAGAAGTTAAGATCGGATTTCACCGT
AATGGGGGAAAGGCTGGGCATTGGTGACCTCATTCAACAGAAGACTGAGGAACAGAGAAATTCGAAAATTTCAGAGCCTTCTCAAGGCCATGGACCATGGATACTAGTTG
ACAATTCAAGAAGGAATAGGGGTAGCCAATCACGTCCCAAACCGGATCTGGGGGTGAGTCATTTCCCATGGGTGAATGGAGATCTTGATCAGGAAGAAGATGAGTCGAAC
CATGAAGACTCCATTGACCCAAATTCGACTTTCACATTTCTAGCAGAAGGCAGTAAGCAAGGATGGACATAA
Protein sequenceShow/hide protein sequence
MEETSLSVEEQDQLNRSVKKARKVKIRPRSSENYPDSSPKDNIIGGLEGGKRKFSYREMVVGNSEAEMEMDTESPLEEDFSDEKEGQYNSDDVQFFKEQWRSFRDPWRSA
LIIKVLGKQFGYQFLMRRLTAMWCHKGEFKLISLSNDFFIVRVELMEDRERILLDGHWKIMDRYLAVRRWTPGFRPSRAFIDRVALWVQIPDCPMELYNEIGMKEIGDFI
GKTLKIDMKTQSGEMGNFARICVEVDLTKKLRSDFTVMGERLGIGDLIQQKTEEQRNSKISEPSQGHGPWILVDNSRRNRGSQSRPKPDLGVSHFPWVNGDLDQEEDESN
HEDSIDPNSTFTFLAEGSKQGWT