; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CaUC10G191640 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene IDCaUC10G191640
OrganismCitrullus amarus (Watermelon (USVL246-FR2) v1)
DescriptionDUF4228 domain-containing protein
Genome locationCiama_Chr10:26006102..26006863
RNA-Seq ExpressionCaUC10G191640
SyntenyCaUC10G191640
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR025322 - Protein of unknown function DUF4228, plant


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0037693.1 DUF4228 domain-containing protein [Cucumis melo var. makuwa]2.9e-10787.87Show/hide
Query:  MKNAI-PDPAT--ANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNN
        MKNA+ PDPA+   NIKLI SDGVVRIYHRPIYVSEVLLEFPKH VCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP PFFHSVLSFVTIASFFSSSN+N
Subjt:  MKNAI-PDPAT--ANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNN

Query:  NKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETITETKKKR-STF
        NK   RFINNAAACH PFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPLPPQQ+ SLPLGKICTTPQLAKDYTQLVRTR+WKPKLETI ET+KKR S+F
Subjt:  NKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETITETKKKR-STF

Query:  GLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        GLKKA PF S P+RSAYPL HLRSIHFAYK KIRIKSR+
Subjt:  GLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

KAG6581636.1 hypothetical protein SDJN03_21638, partial [Cucurbita argyrosperma subsp. sororia]2.4e-10178.99Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNA---IPDPATANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF
        MGMK LSVQNMVSWSCFHL K     IPDP +ANIKLI S GVVR+YHRPI VS++LLEFPKHLVCRSD+FYIGQKIP LS+ D LQLGHKYFLLPKPFF
Subjt:  MGMKVLSVQNMVSWSCFHLMKNA---IPDPATANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF

Query:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQW
        HSVLSFV+IASFFS  N  N    RF  NAAAC  PFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLP   S  LP GKICTTPQLAK+YTQLVRTR+W
Subjt:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQW

Query:  KPKLETITETKKKR--STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSR
        KPKLETI ETKKK   S FGLKK NPF SPP RS+Y LHHLR+IH  YKPKIRIKSR
Subjt:  KPKLETITETKKKR--STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSR

XP_008465162.1 PREDICTED: uncharacterized protein LOC103502830 [Cucumis melo]9.7e-11988.37Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNAI-PDPAT--ANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF
        MG+KVLSVQNMVSWSCFHLMKNA+ PDPA+   NIKLI SDGVVRIYHRPIYVSEVLLEFPKH VCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP PFF
Subjt:  MGMKVLSVQNMVSWSCFHLMKNAI-PDPAT--ANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF

Query:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ
        HSVLSFVTIASFFSSSN+NNK   RFINNAAACH PFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPLPPQQ+ SLPLGKICTTPQLAKDYTQLVRTR+
Subjt:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ

Query:  WKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        WKPKLETI ET+KKR S+FGLKKA PF S P+RSAYPL HLRSIHFAYK KIRIKSR+
Subjt:  WKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

XP_011652332.1 uncharacterized protein LOC105435015 [Cucumis sativus]4.4e-11987.26Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNAIPDPA----TANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPF
        MG+KVLSVQNMVSWSCFHLMKNAIP P     + NIKLI+SDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP  F
Subjt:  MGMKVLSVQNMVSWSCFHLMKNAIPDPA----TANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPF

Query:  FHSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTR
        FHSVLSFVTIASFFSSSNNNNK   RFINNAAA H PFDLQRTPSGCL+IRVSD+FISQLLEQG NPKPLPPQQS SLPLGKICTTPQLAKDYTQLVRTR
Subjt:  FHSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTR

Query:  QWKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        +WKPKLETI ET+KKR S+FGLKKANPF S P+RS YPLHHLRSIHFAYK KIRIK+R+
Subjt:  QWKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

XP_038905326.1 uncharacterized protein LOC120091392 [Benincasa hispida]1.4e-12591.02Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNAIPDPATANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSV
        MG+KVLSVQNMVSWSCFHLMKNAIP+PA ANIKLINSDGVVRIYH+PIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSV
Subjt:  MGMKVLSVQNMVSWSCFHLMKNAIPDPATANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSV

Query:  LSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQS-TSLPLGKICTTPQLAKDYTQLVRTRQWK
        LSFVTIASFFS+S +N+    RF+NNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPL  QQS +SLPLGKICTTPQLAKDYTQLVRTRQWK
Subjt:  LSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQS-TSLPLGKICTTPQLAKDYTQLVRTRQWK

Query:  PKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        PKLETI ETKKKR STFGLKKANPFHSPPLRS+YPLHHLRSI  AYKPKIRIKSRT
Subjt:  PKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

TrEMBL top hitse value%identityAlignment
A0A0A0LCN9 Uncharacterized protein2.1e-11987.26Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNAIPDPA----TANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPF
        MG+KVLSVQNMVSWSCFHLMKNAIP P     + NIKLI+SDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP  F
Subjt:  MGMKVLSVQNMVSWSCFHLMKNAIPDPA----TANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPF

Query:  FHSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTR
        FHSVLSFVTIASFFSSSNNNNK   RFINNAAA H PFDLQRTPSGCL+IRVSD+FISQLLEQG NPKPLPPQQS SLPLGKICTTPQLAKDYTQLVRTR
Subjt:  FHSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTR

Query:  QWKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        +WKPKLETI ET+KKR S+FGLKKANPF S P+RS YPLHHLRSIHFAYK KIRIK+R+
Subjt:  QWKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

A0A1S3CNA0 uncharacterized protein LOC1035028304.7e-11988.37Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNAI-PDPAT--ANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF
        MG+KVLSVQNMVSWSCFHLMKNA+ PDPA+   NIKLI SDGVVRIYHRPIYVSEVLLEFPKH VCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP PFF
Subjt:  MGMKVLSVQNMVSWSCFHLMKNAI-PDPAT--ANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF

Query:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ
        HSVLSFVTIASFFSSSN+NNK   RFINNAAACH PFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPLPPQQ+ SLPLGKICTTPQLAKDYTQLVRTR+
Subjt:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ

Query:  WKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        WKPKLETI ET+KKR S+FGLKKA PF S P+RSAYPL HLRSIHFAYK KIRIKSR+
Subjt:  WKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

A0A5A7T3Y8 DUF4228 domain-containing protein1.4e-10787.87Show/hide
Query:  MKNAI-PDPAT--ANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNN
        MKNA+ PDPA+   NIKLI SDGVVRIYHRPIYVSEVLLEFPKH VCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP PFFHSVLSFVTIASFFSSSN+N
Subjt:  MKNAI-PDPAT--ANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNN

Query:  NKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETITETKKKR-STF
        NK   RFINNAAACH PFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPLPPQQ+ SLPLGKICTTPQLAKDYTQLVRTR+WKPKLETI ET+KKR S+F
Subjt:  NKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETITETKKKR-STF

Query:  GLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        GLKKA PF S P+RSAYPL HLRSIHFAYK KIRIKSR+
Subjt:  GLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

A0A5D3CH36 DUF4228 domain-containing protein4.7e-11988.37Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMKNAI-PDPAT--ANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF
        MG+KVLSVQNMVSWSCFHLMKNA+ PDPA+   NIKLI SDGVVRIYHRPIYVSEVLLEFPKH VCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLP PFF
Subjt:  MGMKVLSVQNMVSWSCFHLMKNAI-PDPAT--ANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFF

Query:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ
        HSVLSFVTIASFFSSSN+NNK   RFINNAAACH PFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPLPPQQ+ SLPLGKICTTPQLAKDYTQLVRTR+
Subjt:  HSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ

Query:  WKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT
        WKPKLETI ET+KKR S+FGLKKA PF S P+RSAYPL HLRSIHFAYK KIRIKSR+
Subjt:  WKPKLETITETKKKR-STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT

A0A6J1F4J4 uncharacterized protein LOC1114420994.4e-10178.68Show/hide
Query:  MGMKVLSVQNMVSWSCFHLMK----NAIPDPATANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPF
        MGMK LSVQNMVSWSCFHL K      IPDP +ANIKLI S GVVR+YHRPI VSE+LLEFPKHLVCRSD+FYIGQKIP LS+ D LQLGHKYFLLPKPF
Subjt:  MGMKVLSVQNMVSWSCFHLMK----NAIPDPATANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPF

Query:  FHSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ
        FHSVLSFV+IASFFS  N  N    RF  NAAAC  PFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLP   S  LP GKICTTPQLAK+YTQLVR R+
Subjt:  FHSVLSFVTIASFFSSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQ

Query:  WKPKLETITETKKKR--STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSR
        WKPKLETI ETKKK   S FGLKK NPF SPP RS+Y LHHLR+IH  YKPKIRIKSR
Subjt:  WKPKLETITETKKKR--STFGLKKANPFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G28190.1 unknown protein4.0e-3843.48Show/hide
Query:  SWSCFHLMKNAIPDPAT--ANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFF
        SW C H      PD  +    IKL+ SDG +++Y RP+ VSE+  +FPKH +CRSD  YIGQK P LS+ + L+LG  YFLLP  FF + LSF+TIA+  
Subjt:  SWSCFHLMKNAIPDPAT--ANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFF

Query:  SSSNNNNKTNTRFINNAAACHRPFDLQRTPSG-CLRIRVSDEFISQLLEQGNPKPL----PPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETIT
        +  N         +       +PF +Q+   G  LRIRVS+EF+S+L+ +G    +      ++      G++CTT +L KDY QLV  R+WKPKLETIT
Subjt:  SSSNNNNKTNTRFINNAAACHRPFDLQRTPSG-CLRIRVSDEFISQLLEQGNPKPL----PPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETIT

Query:  ETKKKRS
        ETK  ++
Subjt:  ETKKKRS

AT5G12340.1 unknown protein2.0e-2137.93Show/hide
Query:  EVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNNNKTNTRFIN-NAAACHRPFDLQRTPSGCLRIRVSD
        E++ EFP  +VC +DSFYIG++IP L+  D L  G  YF+LP   F   +   +  S F+S+  N ++NT  +N  A +   PF+  + P+G + I+VS 
Subjt:  EVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFFSSSNNNNKTNTRFIN-NAAACHRPFDLQRTPSGCLRIRVSD

Query:  EFISQLLEQGNPKPLPPQQSTSL-PLGKICTTPQLAKDYTQLVRTRQ--WKPKLETITETKKKRST---FGLKK
        EFI  L+ +   + +  +++TS      IC +P+L K Y QLV TR+  W P L+TI+E K + S    FGL +
Subjt:  EFISQLLEQGNPKPLPPQQSTSL-PLGKICTTPQLAKDYTQLVRTRQ--WKPKLETITETKKKRST---FGLKK

AT5G17350.1 unknown protein8.5e-0428.85Show/hide
Query:  KLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLG--HKYFLLPKPFFHSVLSFVTIASFFSSSNNNNKTNTRFINNAAA-
        K+I  DG VR  H P+  +E+++E P   +  + S  IG+K  PL+  D LQ+   H Y   P     S  +   +A  F ++    +      +++AA 
Subjt:  KLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLG--HKYFLLPKPFFHSVLSFVTIASFFSSSNNNNKTNTRFINNAAA-

Query:  --CH
          CH
Subjt:  --CH


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGGATGAAGGTGTTGAGTGTTCAGAATATGGTTTCATGGTCTTGTTTTCATTTGATGAAGAACGCTATTCCTGATCCTGCTACCGCTAATATCAAGCTTATTAATTC
CGATGGTGTTGTTAGAATCTACCACCGACCCATTTATGTTTCTGAGGTTTTGCTTGAATTTCCTAAGCATTTAGTTTGCCGATCTGATTCTTTTTACATCGGTCAGAAAA
TCCCTCCTCTCTCCGATCAAGATCTCCTCCAACTTGGCCATAAGTACTTCCTTCTTCCTAAGCCTTTTTTCCACTCTGTTCTCTCTTTTGTCACAATCGCTTCTTTCTTT
TCCTCTTCCAACAACAACAATAAAACCAACACTAGATTCATCAATAACGCCGCCGCCTGCCACCGCCCTTTTGACCTCCAACGAACCCCTTCCGGCTGCCTCAGAATCCG
AGTCTCCGACGAGTTCATTTCTCAATTACTGGAACAGGGTAATCCAAAACCCCTCCCCCCACAACAATCCACCTCTTTACCGCTCGGAAAAATCTGCACCACGCCTCAGT
TGGCTAAAGATTACACCCAACTCGTCAGAACCCGGCAATGGAAGCCCAAGCTTGAGACCATTACAGAGACGAAAAAGAAACGTTCAACCTTTGGGTTGAAGAAGGCTAAC
CCATTTCACTCTCCGCCGCTCCGATCCGCCTACCCACTTCACCATCTTCGTTCAATTCACTTTGCTTATAAGCCTAAGATTCGAATCAAATCCAGAACTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGGATGAAGGTGTTGAGTGTTCAGAATATGGTTTCATGGTCTTGTTTTCATTTGATGAAGAACGCTATTCCTGATCCTGCTACCGCTAATATCAAGCTTATTAATTC
CGATGGTGTTGTTAGAATCTACCACCGACCCATTTATGTTTCTGAGGTTTTGCTTGAATTTCCTAAGCATTTAGTTTGCCGATCTGATTCTTTTTACATCGGTCAGAAAA
TCCCTCCTCTCTCCGATCAAGATCTCCTCCAACTTGGCCATAAGTACTTCCTTCTTCCTAAGCCTTTTTTCCACTCTGTTCTCTCTTTTGTCACAATCGCTTCTTTCTTT
TCCTCTTCCAACAACAACAATAAAACCAACACTAGATTCATCAATAACGCCGCCGCCTGCCACCGCCCTTTTGACCTCCAACGAACCCCTTCCGGCTGCCTCAGAATCCG
AGTCTCCGACGAGTTCATTTCTCAATTACTGGAACAGGGTAATCCAAAACCCCTCCCCCCACAACAATCCACCTCTTTACCGCTCGGAAAAATCTGCACCACGCCTCAGT
TGGCTAAAGATTACACCCAACTCGTCAGAACCCGGCAATGGAAGCCCAAGCTTGAGACCATTACAGAGACGAAAAAGAAACGTTCAACCTTTGGGTTGAAGAAGGCTAAC
CCATTTCACTCTCCGCCGCTCCGATCCGCCTACCCACTTCACCATCTTCGTTCAATTCACTTTGCTTATAAGCCTAAGATTCGAATCAAATCCAGAACTTAA
Protein sequenceShow/hide protein sequence
MGMKVLSVQNMVSWSCFHLMKNAIPDPATANIKLINSDGVVRIYHRPIYVSEVLLEFPKHLVCRSDSFYIGQKIPPLSDQDLLQLGHKYFLLPKPFFHSVLSFVTIASFF
SSSNNNNKTNTRFINNAAACHRPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPPQQSTSLPLGKICTTPQLAKDYTQLVRTRQWKPKLETITETKKKRSTFGLKKAN
PFHSPPLRSAYPLHHLRSIHFAYKPKIRIKSRT