; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG10G013080 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG10G013080
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
DescriptionDUF4228 domain-containing protein
Genome locationCG_Chr10:27230922..27231683
RNA-Seq ExpressionClCG10G013080
SyntenyClCG10G013080
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037693.1 DUF4228 domain-containing protein [Cucumis melo var. makuwa]2.5e-10687.45Show/hide
Query:  MKNAI-PDPAT--ANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNN
        MKNA+ PDPA+   NIKLI SDGVVRIY RPIYVSEVLLEFPKH VCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP PFFHSVLSFVTIASFFSSSN+N
Subjt:  MKNAI-PDPAT--ANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNN

Query:  NKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETITETKKKR-STF
        NK   RFINNAAACH PFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPLPPQQ+ SLPLGKICTTPQLAKDYTQLVRTR+WKPKLETI ET+KKR S+F
Subjt:  NKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETITETKKKR-STF

Query:  GLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        GLKKA PF S P+RSAYPL HLRSIHFAYK KIRIKSR+
Subjt:  GLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

KAG6581636.1 hypothetical protein SDJN03_21638, partial [Cucurbita argyrosperma subsp. sororia]2.7e-10078.6Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNA---IPDPATANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF
        MGMK LSVQNMVSWSCFHL K     IPDP +ANIKLI S GVVR+Y RPI VS++LLEFPKHLVCRSD+FYIGQKIP LS+ D LQLGHKYFLLPKPFF
Subjt:  MGMKVLSVQNMVSWSCFHLMKNA---IPDPATANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF

Query:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQW
        HSVLSFV+IASFFS  N  N    RF  NAAAC  PFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLP   S  LP GKICTTPQLAK+YTQLVRTR+W
Subjt:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQW

Query:  KPKLETITETKKKR--STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSR
        KPKLETI ETKKK   S FGLKK NPF SPP RS+Y LHHLR+IH  YKPKIRIKSR
Subjt:  KPKLETITETKKKR--STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSR

XP_008465162.1 PREDICTED: uncharacterized protein LOC103502830 [Cucumis melo]1.1e-11787.98Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNAI-PDPAT--ANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF
        MG+KVLSVQNMVSWSCFHLMKNA+ PDPA+   NIKLI SDGVVRIY RPIYVSEVLLEFPKH VCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP PFF
Subjt:  MGMKVLSVQNMVSWSCFHLMKNAI-PDPAT--ANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF

Query:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ
        HSVLSFVTIASFFSSSN+NNK   RFINNAAACH PFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPLPPQQ+ SLPLGKICTTPQLAKDYTQLVRTR+
Subjt:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ

Query:  WKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        WKPKLETI ET+KKR S+FGLKKA PF S P+RSAYPL HLRSIHFAYK KIRIKSR+
Subjt:  WKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

XP_011652332.1 uncharacterized protein LOC105435015 [Cucumis sativus]4.8e-11886.87Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNAIPDPA----TANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPF
        MG+KVLSVQNMVSWSCFHLMKNAIP P     + NIKLI+SDGVVRIY RPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP  F
Subjt:  MGMKVLSVQNMVSWSCFHLMKNAIPDPA----TANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPF

Query:  FHSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTR
        FHSVLSFVTIASFFSSSNNNNK   RFINNAAA H PFDLQRTPSGCL+IRVSD+FISQLLEQG NPKPLPPQQS SLPLGKICTTPQLAKDYTQLVRTR
Subjt:  FHSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTR

Query:  QWKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        +WKPKLETI ET+KKR S+FGLKKANPF S P+RS YPLHHLRSIHFAYK KIRIK+R+
Subjt:  QWKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

XP_038905326.1 uncharacterized protein LOC120091392 [Benincasa hispida]1.2e-12490.62Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNAIPDPATANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSV
        MG+KVLSVQNMVSWSCFHLMKNAIP+PA ANIKLINSDGVVRIY +PIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSV
Subjt:  MGMKVLSVQNMVSWSCFHLMKNAIPDPATANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSV

Query:  LSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQS-TSLPLGKICTTPQLAKDYTQLVRTRQWK
        LSFVTIASFFS+S +N+    RF+NNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPL  QQS +SLPLGKICTTPQLAKDYTQLVRTRQWK
Subjt:  LSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQS-TSLPLGKICTTPQLAKDYTQLVRTRQWK

Query:  PKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        PKLETI ETKKKR STFGLKKANPFHSPPLRS+YPLHHLRSI  AYKPKIRIKSRT
Subjt:  PKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

TrEMBL top hitse value%identityAlignment
A0A0A0LCN9 Uncharacterized protein2.3e-11886.87Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNAIPDPA----TANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPF
        MG+KVLSVQNMVSWSCFHLMKNAIP P     + NIKLI+SDGVVRIY RPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP  F
Subjt:  MGMKVLSVQNMVSWSCFHLMKNAIPDPA----TANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPF

Query:  FHSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTR
        FHSVLSFVTIASFFSSSNNNNK   RFINNAAA H PFDLQRTPSGCL+IRVSD+FISQLLEQG NPKPLPPQQS SLPLGKICTTPQLAKDYTQLVRTR
Subjt:  FHSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTR

Query:  QWKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        +WKPKLETI ET+KKR S+FGLKKANPF S P+RS YPLHHLRSIHFAYK KIRIK+R+
Subjt:  QWKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

A0A1S3CNA0 uncharacterized protein LOC1035028305.2e-11887.98Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNAI-PDPAT--ANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF
        MG+KVLSVQNMVSWSCFHLMKNA+ PDPA+   NIKLI SDGVVRIY RPIYVSEVLLEFPKH VCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP PFF
Subjt:  MGMKVLSVQNMVSWSCFHLMKNAI-PDPAT--ANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF

Query:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ
        HSVLSFVTIASFFSSSN+NNK   RFINNAAACH PFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPLPPQQ+ SLPLGKICTTPQLAKDYTQLVRTR+
Subjt:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ

Query:  WKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        WKPKLETI ET+KKR S+FGLKKA PF S P+RSAYPL HLRSIHFAYK KIRIKSR+
Subjt:  WKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

A0A5A7T3Y8 DUF4228 domain-containing protein1.2e-10687.45Show/hide
Query:  MKNAI-PDPAT--ANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNN
        MKNA+ PDPA+   NIKLI SDGVVRIY RPIYVSEVLLEFPKH VCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP PFFHSVLSFVTIASFFSSSN+N
Subjt:  MKNAI-PDPAT--ANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNN

Query:  NKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETITETKKKR-STF
        NK   RFINNAAACH PFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPLPPQQ+ SLPLGKICTTPQLAKDYTQLVRTR+WKPKLETI ET+KKR S+F
Subjt:  NKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETITETKKKR-STF

Query:  GLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        GLKKA PF S P+RSAYPL HLRSIHFAYK KIRIKSR+
Subjt:  GLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

A0A5D3CH36 DUF4228 domain-containing protein5.2e-11887.98Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNAI-PDPAT--ANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF
        MG+KVLSVQNMVSWSCFHLMKNA+ PDPA+   NIKLI SDGVVRIY RPIYVSEVLLEFPKH VCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP PFF
Subjt:  MGMKVLSVQNMVSWSCFHLMKNAI-PDPAT--ANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF

Query:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ
        HSVLSFVTIASFFSSSN+NNK   RFINNAAACH PFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPLPPQQ+ SLPLGKICTTPQLAKDYTQLVRTR+
Subjt:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ

Query:  WKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        WKPKLETI ET+KKR S+FGLKKA PF S P+RSAYPL HLRSIHFAYK KIRIKSR+
Subjt:  WKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

A0A6J1F4J4 uncharacterized protein LOC1114420994.9e-10078.29Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMK----NAIPDPATANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPF
        MGMK LSVQNMVSWSCFHL K      IPDP +ANIKLI S GVVR+Y RPI VSE+LLEFPKHLVCRSD+FYIGQKIP LS+ D LQLGHKYFLLPKPF
Subjt:  MGMKVLSVQNMVSWSCFHLMK----NAIPDPATANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPF

Query:  FHSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ
        FHSVLSFV+IASFFS  N  N    RF  NAAAC  PFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLP   S  LP GKICTTPQLAK+YTQLVR R+
Subjt:  FHSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ

Query:  WKPKLETITETKKKR--STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSR
        WKPKLETI ETKKK   S FGLKK NPF SPP RS+Y LHHLR+IH  YKPKIRIKSR
Subjt:  WKPKLETITETKKKR--STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G28190.1 unknown protein4.8e-3943.96Show/hide
Query:  SWSCFHLMKNAIPDPAT--ANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFF
        SW C H      PD  +    IKL+ SDG +++YDRP+ VSE+  +FPKH +CRSD  YIGQK P LS+ + L+LG  YFLLP  FF + LSF+TIA+  
Subjt:  SWSCFHLMKNAIPDPAT--ANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFF

Query:  SSSNNNNKTNTRFINNAAACHRPFDLQRTPSG-CLRIRVSDEFISQLLEQGNPKPL----PPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETIT
        +  N         +       +PF +Q+   G  LRIRVS+EF+S+L+ +G    +      ++      G++CTT +L KDY QLV  R+WKPKLETIT
Subjt:  SSSNNNNKTNTRFINNAAACHRPFDLQRTPSG-CLRIRVSDEFISQLLEQGNPKPL----PPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETIT

Query:  ETKKKRS
        ETK  ++
Subjt:  ETKKKRS

AT5G12340.1 unknown protein1.5e-2137.93Show/hide
Query:  EVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNNNKTNTRFIN-NAAACHRPFDLQRTPSGCLRIRVSD
        E++ EFP  +VC +DSFYIG++IP L+  D L  G  YF+LP   F   +   +  S F+S+  N ++NT  +N  A +   PF+  + P+G + I+VS 
Subjt:  EVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNNNKTNTRFIN-NAAACHRPFDLQRTPSGCLRIRVSD

Query:  EFISQLLEQGNPKPLPPQQSTSL-PLGKICTTPQLAKDYTQLVRTRQ--WKPKLETITETKKKRST---FGLKK
        EFI  L+ +   + +  +++TS      IC +P+L K Y QLV TR+  W P L+TI+E K + S    FGL +
Subjt:  EFISQLLEQGNPKPLPPQQSTSL-PLGKICTTPQLAKDYTQLVRTRQ--WKPKLETITETKKKRST---FGLKK


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGATGAAGGTGTTGAGTGTTCAGAATATGGTTTCATGGTCTTGTTTTCATTTGATGAAGAACGCAATTCCTGATCCTGCTACCGCTAATATCAAGCTTATTAATTC
CGATGGTGTTGTTAGAATCTACGACCGACCCATTTATGTTTCTGAGGTTTTGCTTGAATTTCCTAAGCATTTAGTTTGCCGATCTGATTCTTTTTACATCGGTCAGAAAA
TCCCTCCTCTCTCCGATCAAGATCTCCTCCAACTTGGCCATAAGTACTTCCTTCTTCCTAAGCCTTTTTTCCACTCTGTTCTCTCTTTTGTCACAATCGCTTCTTTCTTT
TCCTCTTCCAACAACAACAATAAAACCAACACTAGATTCATCAATAACGCCGCCGCCTGCCACCGCCCTTTTGACCTCCAACGAACCCCTTCCGGCTGCCTCAGAATCCG
AGTCTCCGACGAGTTCATTTCTCAATTACTGGAACAGGGTAATCCAAAACCCCTCCCCCCACAACAATCCACCTCTTTACCGCTCGGAAAAATCTGCACCACGCCTCAGT
TGGCTAAAGATTACACCCAGCTCGTCAGAACCCGGCAATGGAAGCCCAAGCTTGAGACCATTACAGAGACGAAAAAGAAACGTTCAACCTTTGGGTTGAAGAAGGCTAAC
CCATTTCACTCTCCGCCGCTCCGATCCGCCTACCCACTTCACCATCTTCGTTCAATTCACTTTGCTTATAAGCCTAAGATTCGAATCAAATCCAGAACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGATGAAGGTGTTGAGTGTTCAGAATATGGTTTCATGGTCTTGTTTTCATTTGATGAAGAACGCAATTCCTGATCCTGCTACCGCTAATATCAAGCTTATTAATTC
CGATGGTGTTGTTAGAATCTACGACCGACCCATTTATGTTTCTGAGGTTTTGCTTGAATTTCCTAAGCATTTAGTTTGCCGATCTGATTCTTTTTACATCGGTCAGAAAA
TCCCTCCTCTCTCCGATCAAGATCTCCTCCAACTTGGCCATAAGTACTTCCTTCTTCCTAAGCCTTTTTTCCACTCTGTTCTCTCTTTTGTCACAATCGCTTCTTTCTTT
TCCTCTTCCAACAACAACAATAAAACCAACACTAGATTCATCAATAACGCCGCCGCCTGCCACCGCCCTTTTGACCTCCAACGAACCCCTTCCGGCTGCCTCAGAATCCG
AGTCTCCGACGAGTTCATTTCTCAATTACTGGAACAGGGTAATCCAAAACCCCTCCCCCCACAACAATCCACCTCTTTACCGCTCGGAAAAATCTGCACCACGCCTCAGT
TGGCTAAAGATTACACCCAGCTCGTCAGAACCCGGCAATGGAAGCCCAAGCTTGAGACCATTACAGAGACGAAAAAGAAACGTTCAACCTTTGGGTTGAAGAAGGCTAAC
CCATTTCACTCTCCGCCGCTCCGATCCGCCTACCCACTTCACCATCTTCGTTCAATTCACTTTGCTTATAAGCCTAAGATTCGAATCAAATCCAGAACTTAA
Protein sequenceShow/hide protein sequence
MGMKVLSVQNMVSWSCFHLMKNAIPDPATANIKLINSDGVVRIYDRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFF
SSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETITETKKKRSTFGLKKAN
PFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT