CuGenDBv2

CmoCh14G010830 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene ID: CmoCh14G010830
Organism: Cucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
Description: DUF4228 domain-containing protein
Genome location: Cmo_Chr14:6110506..6111255
RNA-Seq Expression: CmoCh14G010830
Synteny: CmoCh14G010830
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR025322 - Protein of unknown function DUF4228, plant
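The genome location field follows a `seqid:start..end` pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption, not stated in the record) and using hypothetical helper names `parse_location` and `span_length`, the span can be computed like this:

```python
import re

# Hypothetical helpers for illustration; not part of any CuGenDB API.
LOCATION_RE = re.compile(r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)")

def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'seqid:start..end' string into its three parts."""
    match = LOCATION_RE.fullmatch(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["seqid"], int(match["start"]), int(match["end"])

def span_length(location: str) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    _, start, end = parse_location(location)
    return end - start + 1

if __name__ == "__main__":
    print(parse_location("Cmo_Chr14:6110506..6111255"))
    print(span_length("Cmo_Chr14:6110506..6111255"))
```

Under that assumption the span is 750 bp, the same length as the CDS listed in this record (249 codons plus a stop codon), which would be consistent with a gene model without introns.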


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6581636.1 hypothetical protein SDJN03_21638, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.7e-139; % identity: 98.8)
Query:  MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF
        MGMKALSVQNMVSWSCFHLAK TTTTDIPDPSANIKLILSHGVVRVYHRPISVS+ILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF
Subjt:  MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF

Query:  HSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRWKPKLETI
        HSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVR RRWKPKLETI
Subjt:  HSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRWKPKLETI

Query:  EETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKSRD
        EETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKSRD
Subjt:  EETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKSRD

KAG6581637.1 hypothetical protein SDJN03_21639, partial [Cucurbita argyrosperma subsp. sororia] (e value: 3.2e-138; % identity: 98.79)
Query:  MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF
        MGMKALSVQNMVSWSCFHLAK TTTTDIPDPSANIKLILSHGVVRVYHRPISVS+ILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF
Subjt:  MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF

Query:  HSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRWKPKLETI
        HSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVR RRWKPKLETI
Subjt:  HSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRWKPKLETI

Query:  EETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKS
        EETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKS
Subjt:  EETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKS

XP_022935139.1 uncharacterized protein LOC111442099 [Cucurbita moschata] (e value: 3.6e-142; % identity: 100)
Query:  MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF
        MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF
Subjt:  MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF

Query:  HSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRWKPKLETI
        HSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRWKPKLETI
Subjt:  HSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRWKPKLETI

Query:  EETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKSRD
        EETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKSRD
Subjt:  EETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKSRD

XP_023520736.1 uncharacterized protein LOC111784188 [Cucurbita pepo subsp. pepo] (e value: 1.4e-138; % identity: 96.46)
Query:  MGMKALSVQNMVSWSCFHLAKATT-----TTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLL
        MGMKALSVQNMVSWSCFHLAK TT     TTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLL
Subjt:  MGMKALSVQNMVSWSCFHLAKATT-----TTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLL

Query:  PKPFFHSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRWKP
        PKPFFHSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVR RRWKP
Subjt:  PKPFFHSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRWKP

Query:  KLETIEETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKSRD
        KLETIEE++KKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKSRD
Subjt:  KLETIEETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKSRD

XP_023522939.1 uncharacterized protein LOC111786990, partial [Cucurbita pepo subsp. pepo] (e value: 2.1e-129; % identity: 95.44)
Query:  MGMKALSVQNMVSWSCFHLAK-------ATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYF
        MGMKALSVQNMVSWSCFHLAK        TTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYF
Subjt:  MGMKALSVQNMVSWSCFHLAK-------ATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYF

Query:  LLPKPFFHSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRW
        LLPKPFFHSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVR RRW
Subjt:  LLPKPFFHSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRW

Query:  KPKLETIEETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHL
        KPKLETIEE++KKGGISPFGLKKGNPFPSPPPRSSYSLHHL
Subjt:  KPKLETIEETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHL

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0LCN9 Uncharacterized protein (e value: 8.0e-95; % identity: 75.1)
Query:  MGMKALSVQNMVSWSCFHLAKATTTTDIP-DPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPF
        MG+K LSVQNMVSWSCFHL K       P   S NIKLI S GVVR+YHRPI VSE+LLEFPKHLVCRSD+FYIGQKIP LS+ D LQLGHKYFLLP  F
Subjt:  MGMKALSVQNMVSWSCFHLAKATTTTDIP-DPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPF

Query:  FHSVLSFVSIASFFSPP--NKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLP---SPPLPQGKICTTPQLAKEYTQLVRARRW
        FHSVLSFV+IASFFS    N  NRF  NAAA   PFDLQRTPSGCL+IRVSD+FISQLLEQG NPKPLP   SP LP GKICTTPQLAK+YTQLVR R+W
Subjt:  FHSVLSFVSIASFFSPP--NKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLP---SPPLPQGKICTTPQLAKEYTQLVRARRW

Query:  KPKLETIEETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIH--YKPKIRIKSR
        KPKLETI+ET+KK  IS FGLKK NPFPS P RS Y LHHLR+IH  YK KIRIK+R
Subjt:  KPKLETIEETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIH--YKPKIRIKSR

A0A1S3CNA0 uncharacterized protein LOC103502830 (e value: 5.5e-96; % identity: 75.39)
Query:  MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF
        MG+K LSVQNMVSWSCFHL K     D    + NIKLILS GVVR+YHRPI VSE+LLEFPKH VCRSD+FYIGQKIP LS+ D LQLGHKYFLLP PFF
Subjt:  MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF

Query:  HSVLSFVSIASFFSP--PNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLP---SPPLPQGKICTTPQLAKEYTQLVRARRWK
        HSVLSFV+IASFFS    N  NRF  NAAAC  PFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPLP   +P LP GKICTTPQLAK+YTQLVR RRWK
Subjt:  HSVLSFVSIASFFSP--PNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLP---SPPLPQGKICTTPQLAKEYTQLVRARRWK

Query:  PKLETIEETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIH--YKPKIRIKSR
        PKLETI+ET+KK  +S FGLKK  PF S P RS+Y L HLR+IH  YK KIRIKSR
Subjt:  PKLETIEETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIH--YKPKIRIKSR

A0A5A7T3Y8 DUF4228 domain-containing protein (e value: 5.2e-86; % identity: 76.62)
Query:  PDPSA---NIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFFHSVLSFVSIASFFSP--PNKPNRFH
        PDP++   NIKLILS GVVR+YHRPI VSE+LLEFPKH VCRSD+FYIGQKIP LS+ D LQLGHKYFLLP PFFHSVLSFV+IASFFS    N  NRF 
Subjt:  PDPSA---NIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFFHSVLSFVSIASFFSP--PNKPNRFH

Query:  TNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLP---SPPLPQGKICTTPQLAKEYTQLVRARRWKPKLETIEETKKKGGISPFGLKKGNP
         NAAAC  PFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPLP   +P LP GKICTTPQLAK+YTQLVR RRWKPKLETI+ET+KK  +S FGLKK  P
Subjt:  TNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLP---SPPLPQGKICTTPQLAKEYTQLVRARRWKPKLETIEETKKKGGISPFGLKKGNP

Query:  FPSPPPRSSYSLHHLRAIH--YKPKIRIKSR
        F S P RS+Y L HLR+IH  YK KIRIKSR
Subjt:  FPSPPPRSSYSLHHLRAIH--YKPKIRIKSR

A0A5D3CH36 DUF4228 domain-containing protein (e value: 5.5e-96; % identity: 75.39)
Query:  MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF
        MG+K LSVQNMVSWSCFHL K     D    + NIKLILS GVVR+YHRPI VSE+LLEFPKH VCRSD+FYIGQKIP LS+ D LQLGHKYFLLP PFF
Subjt:  MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF

Query:  HSVLSFVSIASFFSP--PNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLP---SPPLPQGKICTTPQLAKEYTQLVRARRWK
        HSVLSFV+IASFFS    N  NRF  NAAAC  PFDLQRTPSGCLRIRVSDEFISQLLEQG NPKPLP   +P LP GKICTTPQLAK+YTQLVR RRWK
Subjt:  HSVLSFVSIASFFSP--PNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQG-NPKPLP---SPPLPQGKICTTPQLAKEYTQLVRARRWK

Query:  PKLETIEETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIH--YKPKIRIKSR
        PKLETI+ET+KK  +S FGLKK  PF S P RS+Y L HLR+IH  YK KIRIKSR
Subjt:  PKLETIEETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIH--YKPKIRIKSR

A0A6J1F4J4 uncharacterized protein LOC111442099 (e value: 1.8e-142; % identity: 100)
Query:  MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF
        MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF
Subjt:  MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFF

Query:  HSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRWKPKLETI
        HSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRWKPKLETI
Subjt:  HSVLSFVSIASFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRWKPKLETI

Query:  EETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKSRD
        EETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKSRD
Subjt:  EETKKKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIKSRD

SwissProt top hits (columns: e value, % identity, alignment)
No hits found
Arabidopsis top hits (columns: e value, % identity, alignment)
AT1G28190.1 unknown protein (e value: 3.7e-36; % identity: 40.33)
Query:  SWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFFHSVLSFVSIASF
        SW C H     T+         IKL+ S G ++VY RP+ VSE+  +FPKH +CRSD  YIGQK P LSE + L+LG  YFLLP  FF + LSF++IA+ 
Subjt:  SWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFFHSVLSFVSIASF

Query:  FSPPNKPNRFHTNAAACLHPFDLQRTPSG-CLRIRVSDEFISQLLEQGNPKPLPSPPLP-------QGKICTTPQLAKEYTQLVRARRWKPKLETIEETK
         +P N              PF +Q+   G  LRIRVS+EF+S+L+ +G    +             +G++CTT +L K+Y QLV  RRWKPKLETI ETK
Subjt:  FSPPNKPNRFHTNAAACLHPFDLQRTPSG-CLRIRVSDEFISQLLEQGNPKPLPSPPLP-------QGKICTTPQLAKEYTQLVRARRWKPKLETIEETK

Query:  -KKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIK
          K        KK   F     +S       R +  K K + K
Subjt:  -KKGGISPFGLKKGNPFPSPPPRSSYSLHHLRAIHYKPKIRIK

AT5G12340.1 unknown protein (e value: 1.2e-18; % identity: 36.21)
Query:  EILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFFHSVLSFVSIASFFSPPNKPNRFHT-----NAAACLHPFDLQRTPSGCLRIRVSD
        EI+ EFP  +VC +D+FYIG++IP L+  D+L  G  YF+LP   F   +   S  S F+   +  R +T      A +   PF+  + P+G + I+VS 
Subjt:  EILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFFHSVLSFVSIASFFSPPNKPNRFHT-----NAAACLHPFDLQRTPSGCLRIRVSD

Query:  EFISQLL-EQGNPKPLPSPPLPQGK-ICTTPQLAKEYTQLV--RARRWKPKLETIEETKKKGGISP---FGLKK
        EFI  L+ +  N +   +    +   IC +P+L K+Y QLV  R   W P L+TI E K +  +SP   FGL +
Subjt:  EFISQLL-EQGNPKPLPSPPLPQGK-ICTTPQLAKEYTQLV--RARRWKPKLETIEETKKKGGISP---FGLKK
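The % identity values reported for the hits above appear to follow from the middle (match) line of each alignment: a letter marks an identical residue, `+` a conservative substitution, and a blank a mismatch or gap. For the first GenBank hit, the match lines contain three non-identical positions over 249 aligned columns, and 246/249 rounds to 98.8. The sketch below shows that calculation on a toy alignment. The function name `percent_identity` and the example strings are hypothetical, and dividing by the full alignment length (gap columns included) is an assumption about how the page computes the value.

```python
def percent_identity(query_rows: list[str], match_rows: list[str]) -> float:
    """Percent identity over all alignment columns.

    query_rows: the 'Query:' rows of one alignment, prefixes removed.
    match_rows: the corresponding middle rows (letters = identical,
                '+' = conservative substitution, ' ' = mismatch or gap).
    The denominator is the total number of alignment columns, taken from
    the query rows so that trailing blanks lost from a match row do not
    shorten it. This denominator choice is an assumption.
    """
    aligned_columns = sum(len(row) for row in query_rows)
    identical = sum(ch.isalpha() for row in match_rows for ch in row)
    return round(100.0 * identical / aligned_columns, 2)

if __name__ == "__main__":
    # Toy two-block alignment, invented for illustration only.
    query = ["MKTAYIAK", "QRQ"]
    match = ["MKT YI+K", "QRQ"]
    print(percent_identity(query, match))
    # Arithmetic for the first GenBank hit: 246 identities in 249 columns.
    print(round(100.0 * 246 / 249, 2))
```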


Sequences
CDS sequence
ATGGGGATGAAGGCGCTGAGTGTTCAGAATATGGTTTCATGGTCTTGTTTCCATTTGGCCAAGGCTACTACTACTACTGACATCCCCGATCCCTCTGCTAACATCAAGCT
CATTCTTTCCCACGGCGTTGTGAGAGTCTACCACCGACCCATCTCTGTCTCTGAGATTCTGCTTGAGTTCCCTAAGCACTTAGTTTGCCGATCCGACGCCTTTTACATCG
GCCAGAAAATCCCTCATCTCTCCGAACACGACTTTCTCCAACTCGGCCATAAGTACTTCCTCCTCCCCAAGCCCTTCTTCCACTCTGTTCTCTCTTTCGTCTCAATCGCC
TCTTTCTTCTCTCCTCCCAACAAACCAAACCGTTTCCACACCAATGCCGCCGCCTGCCTCCACCCTTTCGACCTCCAACGAACCCCCTCCGGCTGCCTCCGAATCCGAGT
CTCCGACGAGTTCATCTCTCAGTTACTCGAACAGGGGAATCCGAAACCCCTGCCTTCACCGCCGTTGCCGCAAGGAAAAATCTGCACCACCCCCCAGTTGGCCAAGGAAT
ACACCCAGCTGGTCCGAGCCCGGCGATGGAAGCCGAAGCTGGAGACGATTGAAGAGACCAAAAAGAAAGGCGGGATTTCTCCATTTGGGTTGAAGAAGGGAAACCCATTT
CCTTCTCCACCGCCACGATCGTCCTACTCACTTCACCATCTCCGTGCCATTCACTACAAGCCGAAGATCCGCATAAAATCCCGGGATTAA
mRNA sequence
ATGGGGATGAAGGCGCTGAGTGTTCAGAATATGGTTTCATGGTCTTGTTTCCATTTGGCCAAGGCTACTACTACTACTGACATCCCCGATCCCTCTGCTAACATCAAGCT
CATTCTTTCCCACGGCGTTGTGAGAGTCTACCACCGACCCATCTCTGTCTCTGAGATTCTGCTTGAGTTCCCTAAGCACTTAGTTTGCCGATCCGACGCCTTTTACATCG
GCCAGAAAATCCCTCATCTCTCCGAACACGACTTTCTCCAACTCGGCCATAAGTACTTCCTCCTCCCCAAGCCCTTCTTCCACTCTGTTCTCTCTTTCGTCTCAATCGCC
TCTTTCTTCTCTCCTCCCAACAAACCAAACCGTTTCCACACCAATGCCGCCGCCTGCCTCCACCCTTTCGACCTCCAACGAACCCCCTCCGGCTGCCTCCGAATCCGAGT
CTCCGACGAGTTCATCTCTCAGTTACTCGAACAGGGGAATCCGAAACCCCTGCCTTCACCGCCGTTGCCGCAAGGAAAAATCTGCACCACCCCCCAGTTGGCCAAGGAAT
ACACCCAGCTGGTCCGAGCCCGGCGATGGAAGCCGAAGCTGGAGACGATTGAAGAGACCAAAAAGAAAGGCGGGATTTCTCCATTTGGGTTGAAGAAGGGAAACCCATTT
CCTTCTCCACCGCCACGATCGTCCTACTCACTTCACCATCTCCGTGCCATTCACTACAAGCCGAAGATCCGCATAAAATCCCGGGATTAA
Protein sequence
MGMKALSVQNMVSWSCFHLAKATTTTDIPDPSANIKLILSHGVVRVYHRPISVSEILLEFPKHLVCRSDAFYIGQKIPHLSEHDFLQLGHKYFLLPKPFFHSVLSFVSIA
SFFSPPNKPNRFHTNAAACLHPFDLQRTPSGCLRIRVSDEFISQLLEQGNPKPLPSPPLPQGKICTTPQLAKEYTQLVRARRWKPKLETIEETKKKGGISPFGLKKGNPF
PSPPPRSSYSLHHLRAIHYKPKIRIKSRD
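The CDS and protein sequences above can be cross-checked by translating the CDS with the standard genetic code. The following is a minimal sketch using only the Python standard library. The `translate` helper is hypothetical, and the input is the final 90 nt of the CDS as listed in this record, which starts in frame because it follows 660 nt (220 codons).

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

def translate(cds: str) -> str:
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    cds = cds.upper()
    residues = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        residues.append(aa)
    return "".join(residues)

# Final 90 nt of the CDS shown in this record.
CDS_TAIL = (
    "CCTTCTCCACCGCCACGATCGTCCTACTCACTTCACCATCTCCGTGCCATTCACTACAAG"
    "CCGAAGATCCGCATAAAATCCCGGGATTAA"
)

if __name__ == "__main__":
    print(translate(CDS_TAIL))
```

The result matches the last 29 residues of the protein sequence, PSPPPRSSYSLHHLRAIHYKPKIRIKSRD, with the terminal TAA read as the stop codon.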