; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MS024016 (gene) of Bitter gourd (TR) v1 genome

Gene IDMS024016
OrganismMomordica charantia cv. TR (Bitter gourd (TR) v1)
DescriptionCCHC-type domain-containing protein
Genome locationscaffold64:526816..527982
RNA-Seq ExpressionMS024016
SyntenyMS024016
Gene Ontology termsGO:0003676 - nucleic acid binding (molecular function)
GO:0008270 - zinc ion binding (molecular function)
InterPro domainsIPR001878 - Zinc finger, CCHC-type
IPR025558 - Domain of unknown function DUF4283
IPR040256 - Uncharacterized protein At4g02000-like


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6752401.1 hypothetical protein POTOM_044628 [Populus tomentosa]1.3e-5340.29Show/hide
Query:  EDASSDEEGDSYPSND---VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDH
        ED +SD++ +     D   ++ S E+ K  R   R  LIIKVLG+   Y FLL+RL  +WK++G   L++L N+FF+AR   KEDR+  +  GPW + DH
Subjt:  EDASSDEEGDSYPSND---VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDH

Query:  YLAVRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVC
        YL +RSW P F P+  TI++ A+W+R+PD  ME Y+   +  IG+ IGKTLK+D  T  G  G++ARI VEVDLTK     F +      I YEG+H++C
Subjt:  YLAVRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVC

Query:  FNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWML
        F CG YGHK + CP   N E + ++      +  G D  VP E               V +P  +  +G WM+
Subjt:  FNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWML

KAG6772677.1 hypothetical protein POTOM_024095 [Populus tomentosa]1.3e-5340.29Show/hide
Query:  EDASSDEEGDSYPSND---VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDH
        ED +SD++ +     D   ++ S E+ K  R   R  LIIKVLG+   Y FLL+RL  +WK++G   L++L N+FF+AR   KEDR+  +  GPW + DH
Subjt:  EDASSDEEGDSYPSND---VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDH

Query:  YLAVRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVC
        YL +RSW P F P+  TI++ A+W+R+PD  ME Y+   +  IG+ IGKTLK+D  T  G  G++ARI VEVDLTK     F +      I YEG+H++C
Subjt:  YLAVRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVC

Query:  FNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWML
        F CG YGHK + CP   N E + ++      +  G D  VP E               V +P  +  +G WM+
Subjt:  FNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWML

XP_022153253.1 uncharacterized protein LOC111020790 [Momordica charantia]6.1e-8888.33Show/hide
Query:  EGIHLVCFNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWMLVDHSKRNGGGKPPRTRSERG
        +GIHLVCFNCGVYGHK EECPLRCNTETK NDNSSGILDSVG DSRVPKESNI EF QKPFIPV VSQ N SPGHGPWMLVDHSKRNGGGKPPRTRS RG
Subjt:  EGIHLVCFNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWMLVDHSKRNGGGKPPRTRSERG

Query:  FVSSKNLNTKWNANLDPNEESTNVLDPIVDPTVERDPIKSFQVKSVERVRLARDQDSWRFKNKIPFAGPGANFSGLPFNY
        F+SSKNLNTKWN N DPNEE  NVLD IVDPT++RDPIKSFQ+KSVERVRLARDQDSWRFKNKIPFAGPGANFS LPF+Y
Subjt:  FVSSKNLNTKWNANLDPNEESTNVLDPIVDPTVERDPIKSFQVKSVERVRLARDQDSWRFKNKIPFAGPGANFSGLPFNY

XP_034894449.1 uncharacterized protein LOC118033543 isoform X1 [Populus alba]1.3e-5340.29Show/hide
Query:  EDASSDEEGDSYPSND---VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDH
        ED +SD++ +     D   ++ S E+ K  R   R  LIIKVLG+   Y FLL+RL  +WK++G   L++L N+FF+AR   KEDR+  +  GPW + DH
Subjt:  EDASSDEEGDSYPSND---VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDH

Query:  YLAVRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVC
        YL +RSW P F P+  TI++ A+W+R+PD  ME Y+   +  IG+ IGKTLK+D  T  G  G++ARI VEVDLTK     F +      I YEG+H++C
Subjt:  YLAVRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVC

Query:  FNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWML
        F CG YGHK + CP   N E + ++      +  G D  VP E               V +P  +  +G WM+
Subjt:  FNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWML

XP_034894450.1 uncharacterized protein LOC118033543 isoform X2 [Populus alba]1.3e-5340.29Show/hide
Query:  EDASSDEEGDSYPSND---VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDH
        ED +SD++ +     D   ++ S E+ K  R   R  LIIKVLG+   Y FLL+RL  +WK++G   L++L N+FF+AR   KEDR+  +  GPW + DH
Subjt:  EDASSDEEGDSYPSND---VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDH

Query:  YLAVRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVC
        YL +RSW P F P+  TI++ A+W+R+PD  ME Y+   +  IG+ IGKTLK+D  T  G  G++ARI VEVDLTK     F +      I YEG+H++C
Subjt:  YLAVRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVC

Query:  FNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWML
        F CG YGHK + CP   N E + ++      +  G D  VP E               V +P  +  +G WM+
Subjt:  FNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWML

TrEMBL top hitse value%identityAlignment
A0A1R3IXK3 Reverse transcriptase5.7e-4734.03Show/hide
Query:  DASSDEEGDSYPSND-VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDHYLA
        DA SD E      +D + FSKE+ +  R   R+ALI+K+LGK  G+  L  R+  +WKL+G +K+ +L +++FI R   K D + VL  GPW I  HYL 
Subjt:  DASSDEEGDSYPSND-VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDHYLA

Query:  VRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVCFNC
        VR W P F PS+  I++  +W+R P+ P+E +N   +K +G  +G+ +KID  T  G  G FAR+ VE+DL+K  +   T+      I+YEG+ L+CF+C
Subjt:  VRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVCFNC

Query:  GVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWMLVDHSKRN-----------GGGKPPRTRSER
        G++GH+  +CPL+  + ++                  P+ S + E  ++P          +   +GPWM+V   KRN            GGK        
Subjt:  GVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWMLVDHSKRN-----------GGGKPPRTRSER

Query:  GF-----VSSKNLNTKWNANLDPNEESTNVLDPIV
        GF     V++ NLN K +AN     E      P++
Subjt:  GF-----VSSKNLNTKWNANLDPNEESTNVLDPIV

A0A2N9GMU3 Uncharacterized protein3.3e-4733.71Show/hide
Query:  SSDEEGDSYPSNDVKFS--KEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDHYLAV
        SSDEE D     +V  S  +E     RA   ++ I+KV G+S GY FL+ RL ++WK  G F  ++L   FF+ +LDL +D DR+L  GPW I +H+L++
Subjt:  SSDEEGDSYPSNDVKFS--KEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDHYLAV

Query:  RSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVCFNCG
        R W P FRPS  +++   +WVR+P+ P+E Y++  +  IGH +G  L++DF T SG  G FAR+ +++DL K   +   +   R  + YEGI L+CF+CG
Subjt:  RSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVCFNCG

Query:  VYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPG--HGPWMLVDHSKRNGGGKP---PRTRSERGFVS---
          GHK ++CP R  T T     +    D+                      P P + P+R      GPWMLV   KR    KP    R+R E   V+   
Subjt:  VYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPG--HGPWMLVDHSKRNGGGKP---PRTRSERGFVS---

Query:  --------SKNLNT------------KWNANLDPNEESTNVLDPIVDPTVERD
                  N +T            +W ++ +PN +   V++P+  P    D
Subjt:  --------SKNLNT------------KWNANLDPNEESTNVLDPIVDPTVERD

A0A2N9HXU9 Uncharacterized protein1.4e-4537.14Show/hide
Query:  EDASSDEEGDSYPSND----VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILD
        E  SSDEE D  P  +    +KF++E     RA   ++LI+KV G+S GY FL+ +L  +W   G+F  ++L   FF+ R D +   + VL  GPW I +
Subjt:  EDASSDEEGDSYPSND----VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILD

Query:  HYLAVRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLV
        H+L++R W P FR S  ++   A+WVR+P+ P+E Y++  +  IG  +G  L++DF T +G  G FARI V++DL K   R   +   R  + YEGI L+
Subjt:  HYLAVRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLV

Query:  CFNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWMLVDHSKR
        CF+CG  GH+ E CP R     +  DN + I      DSR  +E  +  F                   GPWMLV   KR
Subjt:  CFNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWMLVDHSKR

A0A6J1DGZ9 uncharacterized protein LOC1110207903.0e-8888.33Show/hide
Query:  EGIHLVCFNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWMLVDHSKRNGGGKPPRTRSERG
        +GIHLVCFNCGVYGHK EECPLRCNTETK NDNSSGILDSVG DSRVPKESNI EF QKPFIPV VSQ N SPGHGPWMLVDHSKRNGGGKPPRTRS RG
Subjt:  EGIHLVCFNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWMLVDHSKRNGGGKPPRTRSERG

Query:  FVSSKNLNTKWNANLDPNEESTNVLDPIVDPTVERDPIKSFQVKSVERVRLARDQDSWRFKNKIPFAGPGANFSGLPFNY
        F+SSKNLNTKWN N DPNEE  NVLD IVDPT++RDPIKSFQ+KSVERVRLARDQDSWRFKNKIPFAGPGANFS LPF+Y
Subjt:  FVSSKNLNTKWNANLDPNEESTNVLDPIVDPTVERDPIKSFQVKSVERVRLARDQDSWRFKNKIPFAGPGANFSGLPFNY

A0A6N2LZK8 CCHC-type domain-containing protein7.6e-5236.76Show/hide
Query:  VLGDTNEDFEMGVETLLEDASSDEEGDSYPSND---VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLK
        ++G+++   EM  + L  D  S++E D    +D   +K S E+ K  RA  +  LIIK++G+  GY F ++RL  +WKL+G F L +L NEF++A+    
Subjt:  VLGDTNEDFEMGVETLLEDASSDEEGDSYPSND---VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLK

Query:  EDRDRVLMEGPWKILDHYLAVRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFT
        EDR+ VL  GPW + DHYL +R+W P F P   TID+ A+WVR+P+  +E Y+   +  IG  IGKTLKID  T  G  G+FAR+ VEVDLTK     F 
Subjt:  EDRDRVLMEGPWKILDHYLAVRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFT

Query:  ILXERYGIQYEGIHLVCFNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKES--NISEFSQKPFIPVPVSQPNRSPGHGPWMLVDHSKRNG
        +      I YEG+H +CF+CG YGHK E CP                        ++P E+  + SE  +     VP+ +P      G WM+ +  +R  
Subjt:  ILXERYGIQYEGIHLVCFNCGVYGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKES--NISEFSQKPFIPVPVSQPNRSPGHGPWMLVDHSKRNG

Query:  GGK---PPRTRSERGFVSSKN
          +       R+ +G V++ N
Subjt:  GGK---PPRTRSERGFVSSKN

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G01050.1 zinc ion binding;nucleic acid binding7.1e-3433.33Show/hide
Query:  VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDHYLAVRSWSPKFRPSTVTID
        +   +E  +    L +  +I+KVLG       L R+L  +WK  G   +++L  +FF+ R +L+E+    L  GPW++L +YL V+ WS +F P    I 
Subjt:  VKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPWKILDHYLAVRSWSPKFRPSTVTID

Query:  RAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVCFNCGVYGHKFEECP
           +WVR+ + P   Y+   +  I   +G+ LK+D  T +   G FAR+ +EV+L K  +    I  +RY + YEG+  +C +CG+YGH    CP
Subjt:  RAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILXERYGIQYEGIHLVCFNCGVYGHKFEECP


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTTTTGGGAGATACAAATGAAGACTTTGAGATGGGGGTTGAAACTCTCCTGGAAGATGCTTCCAGTGATGAAGAGGGAGATTCATATCCCTCGAATGATGTGAAATT
TTCAAAAGAACAATGGAAGAATTTTAGGGCACTATCGAGGTCTGCCCTAATTATAAAAGTTCTAGGCAAGTCTTTTGGGTACCCATTTCTTCTCCGAAGGTTAACGGCAA
TCTGGAAACTGAAAGGGCATTTCAAGTTGATCAACCTTAGTAATGAGTTCTTTATTGCGCGTCTGGACTTGAAGGAGGATCGAGACAGGGTTCTTATGGAGGGCCCCTGG
AAGATTTTAGACCACTACCTTGCGGTTCGGAGTTGGTCTCCCAAATTTAGGCCTTCAACAGTGACTATTGACAGGGCTGCTTTGTGGGTTCGTATCCCTGATTGCCCGAT
GGAACTTTATAATGAGACAGGAATGAAGGGAATCGGGCATTTCATAGGCAAGACTCTAAAAATTGATTTCAAGACTCAATCCGGGAAGATGGGCCACTTTGCGAGAATCT
ACGTTGAGGTGGACCTTACGAAGAAGCCTCGACGTGACTTTACAATTTTATGAGAAAGGTATGGAATCCAATATGAAGGGATACATTTAGTCTGTTTCAACTGTGGGGTG
TATGGGCACAAATTTGAGGAGTGTCCCTTGAGATGCAACACTGAAACGAAGGCTAACGATAACTCATCTGGAATTCTTGATTCCGTTGGTGGAGATTCGAGGGTTCCAAA
GGAAAGCAATATCTCTGAGTTTTCTCAAAAGCCTTTCATACCAGTGCCAGTTTCTCAGCCAAATCGTTCACCTGGTCATGGGCCTTGGATGTTGGTAGATCATTCCAAGA
GAAATGGGGGAGGTAAACCACCTCGTACAAGGTCAGAGAGAGGTTTCGTATCATCTAAAAACTTGAACACTAAGTGGAATGCAAATCTGGACCCAAATGAAGAGTCGACG
AATGTTCTTGACCCAATTGTTGATCCAACTGTGGAGCGTGATCCTATAAAGTCTTTCCAAGTTAAGTCAGTTGAGAGGGTGAGGTTAGCAAGAGATCAAGACTCTTGGAG
GTTTAAGAACAAGATTCCTTTTGCTGGACCTGGGGCAAATTTTTCAGGTCTTCCTTTCAATTATTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTTTTGGGAGATACAAATGAAGACTTTGAGATGGGGGTTGAAACTCTCCTGGAAGATGCTTCCAGTGATGAAGAGGGAGATTCATATCCCTCGAATGATGTGAAATT
TTCAAAAGAACAATGGAAGAATTTTAGGGCACTATCGAGGTCTGCCCTAATTATAAAAGTTCTAGGCAAGTCTTTTGGGTACCCATTTCTTCTCCGAAGGTTAACGGCAA
TCTGGAAACTGAAAGGGCATTTCAAGTTGATCAACCTTAGTAATGAGTTCTTTATTGCGCGTCTGGACTTGAAGGAGGATCGAGACAGGGTTCTTATGGAGGGCCCCTGG
AAGATTTTAGACCACTACCTTGCGGTTCGGAGTTGGTCTCCCAAATTTAGGCCTTCAACAGTGACTATTGACAGGGCTGCTTTGTGGGTTCGTATCCCTGATTGCCCGAT
GGAACTTTATAATGAGACAGGAATGAAGGGAATCGGGCATTTCATAGGCAAGACTCTAAAAATTGATTTCAAGACTCAATCCGGGAAGATGGGCCACTTTGCGAGAATCT
ACGTTGAGGTGGACCTTACGAAGAAGCCTCGACGTGACTTTACAATTTTATGAGAAAGGTATGGAATCCAATATGAAGGGATACATTTAGTCTGTTTCAACTGTGGGGTG
TATGGGCACAAATTTGAGGAGTGTCCCTTGAGATGCAACACTGAAACGAAGGCTAACGATAACTCATCTGGAATTCTTGATTCCGTTGGTGGAGATTCGAGGGTTCCAAA
GGAAAGCAATATCTCTGAGTTTTCTCAAAAGCCTTTCATACCAGTGCCAGTTTCTCAGCCAAATCGTTCACCTGGTCATGGGCCTTGGATGTTGGTAGATCATTCCAAGA
GAAATGGGGGAGGTAAACCACCTCGTACAAGGTCAGAGAGAGGTTTCGTATCATCTAAAAACTTGAACACTAAGTGGAATGCAAATCTGGACCCAAATGAAGAGTCGACG
AATGTTCTTGACCCAATTGTTGATCCAACTGTGGAGCGTGATCCTATAAAGTCTTTCCAAGTTAAGTCAGTTGAGAGGGTGAGGTTAGCAAGAGATCAAGACTCTTGGAG
GTTTAAGAACAAGATTCCTTTTGCTGGACCTGGGGCAAATTTTTCAGGTCTTCCTTTCAATTATTAG
Protein sequenceShow/hide protein sequence
MVLGDTNEDFEMGVETLLEDASSDEEGDSYPSNDVKFSKEQWKNFRALSRSALIIKVLGKSFGYPFLLRRLTAIWKLKGHFKLINLSNEFFIARLDLKEDRDRVLMEGPW
KILDHYLAVRSWSPKFRPSTVTIDRAALWVRIPDCPMELYNETGMKGIGHFIGKTLKIDFKTQSGKMGHFARIYVEVDLTKKPRRDFTILUERYGIQYEGIHLVCFNCGV
YGHKFEECPLRCNTETKANDNSSGILDSVGGDSRVPKESNISEFSQKPFIPVPVSQPNRSPGHGPWMLVDHSKRNGGGKPPRTRSERGFVSSKNLNTKWNANLDPNEEST
NVLDPIVDPTVERDPIKSFQVKSVERVRLARDQDSWRFKNKIPFAGPGANFSGLPFNY