; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh08G000780 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh08G000780
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionProtein of unknown function (DUF1685)
Genome locationCmo_Chr08:407967..410804
RNA-Seq ExpressionCmoCh08G000780
SyntenyCmoCh08G000780
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6592872.1 hypothetical protein SDJN03_12348, partial [Cucurbita argyrosperma subsp. sororia]3.7e-9296.59Show/hide
Query:  MSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLEELKGCVDLGFAFC
        MSPLS RTESQISPESAIGG+PEPQIF YEPEFSSEDDHDE VVLTDWNNTKKKNKKKNQILLEGFVE SDEENLTRTKSLTDDDLEELKGCVDLGFAFC
Subjt:  MSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLEELKGCVDLGFAFC

Query:  YDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
        YDEIPELCNTLPALELCYSMSQKFMDDHQ VPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACT
Subjt:  YDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACT

KAG7025278.1 hypothetical protein SDJN02_11773 [Cucurbita argyrosperma subsp. argyrosperma]3.1e-9996.32Show/hide
Query:  MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE
        MKEENRRIPKLEKMSPLS RTESQISPESAIGG+PEPQIF YEPEFSSEDDHDE VVLTDWNNTKKKNKKKNQILLEGFVE SDEENLTRTKSLTDDDLE
Subjt:  MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE

Query:  ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL
        ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQ VPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACT+
Subjt:  ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL

XP_022959537.1 uncharacterized protein LOC111460587 [Cucurbita moschata]2.7e-10399.47Show/hide
Query:  MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE
        MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE
Subjt:  MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE

Query:  ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL
        ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACT+
Subjt:  ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL

XP_023005082.1 uncharacterized protein LOC111498175 [Cucurbita maxima]1.2e-9895.26Show/hide
Query:  MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE
        M+EENRRIPKLEKMSPLSS+TESQISPESAIGGDPEPQIF YEP+FSSEDDHD SVVLTDWNNTKKKNKKKNQILLEGFVE SD+ENLTRTKSLTDDDLE
Subjt:  MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE

Query:  ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL
        ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQ VPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACT+
Subjt:  ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL

XP_023514829.1 uncharacterized protein LOC111779030 [Cucurbita pepo subsp. pepo]9.0e-9995.79Show/hide
Query:  MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE
        MKEENRRIP+LEKMSPLS +TESQISPESAIGGDPEPQIF YEPEFSSEDDHDE VVLTDWNNTKKKNKKKNQILLEGFVE SDEENLTRTKSLTDDDLE
Subjt:  MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE

Query:  ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL
        ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQ VPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACT+
Subjt:  ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL

TrEMBL top hitse value%identityAlignment
A0A6J1DBH7 uncharacterized protein LOC1110185932.6e-7581.36Show/hide
Query:  MSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLEELKGCVDLGFAFC
        M+P S  ++SQISP + IGGDPE QIF ++ EFSSEDD DES   T+W N     KKKNQILLEGFVE +DEENL RTKSLTDDDLEELKGCVDLGFAFC
Subjt:  MSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLEELKGCVDLGFAFC

Query:  YDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL
        YDEIPELCNTLPALELCYSMSQKFMDDHQ VPE+SPPES DS SSPIPNWKISSPGDHPEDVKARLKYWAQAVACT+
Subjt:  YDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL

A0A6J1EVL6 uncharacterized protein LOC1114363731.2e-7582.12Show/hide
Query:  MSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNT--KKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLEELKGCVDLGFA
        MSP SS+ ESQISPE AIGGD E QI  ++ EF+S DD +ESV  TDWNNT  KKK KKKNQILLEGFVE SDEENLTRTKSLTD+DLEELKGCVDLGFA
Subjt:  MSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNT--KKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLEELKGCVDLGFA

Query:  FCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL
        FCYDEIPELCNTLPALELCYSMSQKF+D+HQ VPEHS P+  DS SSPIPNWKISSPGDHPEDVKARLK+WAQAVACT+
Subjt:  FCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL

A0A6J1H4T7 uncharacterized protein LOC1114605871.3e-10399.47Show/hide
Query:  MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE
        MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE
Subjt:  MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE

Query:  ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL
        ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACT+
Subjt:  ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL

A0A6J1JTH9 uncharacterized protein LOC1114887296.2e-7783.24Show/hide
Query:  MSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNT--KKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLEELKGCVDLGFA
        MSP SS+ ESQISPESAIGGD E QI  ++ EF+S D  DESV  TDWNNT  KKK KKKNQILLEGFVE SDEENLTRTKSLTDDDLEELKGCVDLGFA
Subjt:  MSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNT--KKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLEELKGCVDLGFA

Query:  FCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL
        FCYDEIPELCNTLPALELCYSMSQKF+D+HQ VPEHSPP+  DS SSPIPNWKISSPGDHPE+VKARLK+WAQAVACT+
Subjt:  FCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL

A0A6J1L190 uncharacterized protein LOC1114981755.7e-9995.26Show/hide
Query:  MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE
        M+EENRRIPKLEKMSPLSS+TESQISPESAIGGDPEPQIF YEP+FSSEDDHD SVVLTDWNNTKKKNKKKNQILLEGFVE SD+ENLTRTKSLTDDDLE
Subjt:  MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLE

Query:  ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL
        ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQ VPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACT+
Subjt:  ELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05870.1 Protein of unknown function (DUF1685)1.0e-4454.95Show/hide
Query:  PYEPEFSSEDDHDESVVLTDW-------------NNTKKKNKKKNQILLEGFVEVS-------DEENLTRTKSLTDDDLEELKGCVDLGFAFCYDEIPEL
        P++   ++ +  DES +   W              + KK  +KK+Q+LLEG+VE +        +++LTR+KSLTDDDLE+L+GC+DLGF F YDEIPEL
Subjt:  PYEPEFSSEDDHDESVVLTDW-------------NNTKKKNKKKNQILLEGFVEVS-------DEENLTRTKSLTDDDLEELKGCVDLGFAFCYDEIPEL

Query:  CNTLPALELCYSMSQKFMDDHQN-VPEHSPPESVDS----GSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTLVEWLLCS
        CNTLPALELCYSMSQKF+DD QN  PE S  E   S     ++PI NWKISSPGD+P+DVKARLKYWAQAVACT+    LCS
Subjt:  CNTLPALELCYSMSQKFMDDHQN-VPEHSPPESVDS----GSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTLVEWLLCS

AT1G05870.2 Protein of unknown function (DUF1685)1.0e-4454.95Show/hide
Query:  PYEPEFSSEDDHDESVVLTDW-------------NNTKKKNKKKNQILLEGFVEVS-------DEENLTRTKSLTDDDLEELKGCVDLGFAFCYDEIPEL
        P++   ++ +  DES +   W              + KK  +KK+Q+LLEG+VE +        +++LTR+KSLTDDDLE+L+GC+DLGF F YDEIPEL
Subjt:  PYEPEFSSEDDHDESVVLTDW-------------NNTKKKNKKKNQILLEGFVEVS-------DEENLTRTKSLTDDDLEELKGCVDLGFAFCYDEIPEL

Query:  CNTLPALELCYSMSQKFMDDHQN-VPEHSPPESVDS----GSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTLVEWLLCS
        CNTLPALELCYSMSQKF+DD QN  PE S  E   S     ++PI NWKISSPGD+P+DVKARLKYWAQAVACT+    LCS
Subjt:  CNTLPALELCYSMSQKFMDDHQN-VPEHSPPESVDS----GSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTLVEWLLCS

AT1G05870.3 Protein of unknown function (DUF1685)1.0e-4454.95Show/hide
Query:  PYEPEFSSEDDHDESVVLTDW-------------NNTKKKNKKKNQILLEGFVEVS-------DEENLTRTKSLTDDDLEELKGCVDLGFAFCYDEIPEL
        P++   ++ +  DES +   W              + KK  +KK+Q+LLEG+VE +        +++LTR+KSLTDDDLE+L+GC+DLGF F YDEIPEL
Subjt:  PYEPEFSSEDDHDESVVLTDW-------------NNTKKKNKKKNQILLEGFVEVS-------DEENLTRTKSLTDDDLEELKGCVDLGFAFCYDEIPEL

Query:  CNTLPALELCYSMSQKFMDDHQN-VPEHSPPESVDS----GSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTLVEWLLCS
        CNTLPALELCYSMSQKF+DD QN  PE S  E   S     ++PI NWKISSPGD+P+DVKARLKYWAQAVACT+    LCS
Subjt:  CNTLPALELCYSMSQKFMDDHQN-VPEHSPPESVDS----GSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTLVEWLLCS

AT2G31560.1 Protein of unknown function (DUF1685)2.8e-4568.12Show/hide
Query:  KKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLEELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSG-----SSP
        KK  KKK+Q+LLEG+  + D+++LTR KSLTDDDLEELKGC+DLGF F YDEIPELCNTLPALELCYSMSQKF+DD Q    H   E  DS      ++P
Subjt:  KKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLEELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSG-----SSP

Query:  IPNWKISSPGDHPEDVKARLKYWAQAVACTLVEWLLCS
        I NWKISSPGD P+DVKARLKYWAQ VACT+    LCS
Subjt:  IPNWKISSPGDHPEDVKARLKYWAQAVACTLVEWLLCS

AT2G31560.2 Protein of unknown function (DUF1685)2.8e-4568.12Show/hide
Query:  KKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLEELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSG-----SSP
        KK  KKK+Q+LLEG+  + D+++LTR KSLTDDDLEELKGC+DLGF F YDEIPELCNTLPALELCYSMSQKF+DD Q    H   E  DS      ++P
Subjt:  KKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLEELKGCVDLGFAFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSG-----SSP

Query:  IPNWKISSPGDHPEDVKARLKYWAQAVACTLVEWLLCS
        I NWKISSPGD P+DVKARLKYWAQ VACT+    LCS
Subjt:  IPNWKISSPGDHPEDVKARLKYWAQAVACTLVEWLLCS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAAGGAGGAAAACCGACGTATCCCAAAATTGGAAAAAATGAGTCCCCTTTCGTCTCGAACTGAATCTCAGATTTCTCCTGAATCCGCCATTGGAGGGGACCCAGAACC
TCAAATCTTCCCTTACGAGCCTGAATTCAGTTCCGAAGACGATCATGACGAGTCCGTTGTTCTGACGGACTGGAACAACACGAAGAAAAAGAACAAGAAGAAGAACCAGA
TCTTGCTTGAGGGCTTTGTGGAGGTTTCAGATGAGGAAAATTTGACGAGGACGAAGAGTTTGACGGATGACGATCTCGAGGAGCTCAAGGGGTGTGTGGATCTAGGGTTT
GCGTTTTGCTATGACGAGATTCCTGAGCTCTGTAACACATTGCCGGCGCTCGAGCTCTGTTATTCGATGAGCCAGAAGTTTATGGACGACCACCAGAATGTTCCGGAACA
TTCTCCGCCCGAGTCGGTGGATTCGGGGTCCAGTCCGATTCCGAATTGGAAGATCTCTAGTCCTGGTGATCATCCAGAAGATGTTAAAGCGAGGCTCAAATATTGGGCGC
AAGCGGTGGCTTGTACTTTAGTTGAGTGGCTTTTATGCAGTGGTGGGAGCCCCATCAAAGCTTCACCAACAGCCTTTTGCTTCACACCTCTGCACATGCTGCTGAGATAT
TGTGCTAACTGTTTCTGTTCAGCCCACAAATATGGCTGGATGAATACGGGTTTGCTCGATGTCGCTGGGAGACGTGTGGGAGAACTGCCTACTTCATCTGGGTTTATGAT
TGTTTTGCATCTTGAACTTGGTGGCATTTTCTAG
mRNA sequenceShow/hide mRNA sequence
AGCCAAAAACATATGAAGGAGGAAAACCGACGTATCCCAAAATTGGAAAAAATGAGTCCCCTTTCGTCTCGAACTGAATCTCAGATTTCTCCTGAATCCGCCATTGGAGG
GGACCCAGAACCTCAAATCTTCCCTTACGAGCCTGAATTCAGTTCCGAAGACGATCATGACGAGTCCGTTGTTCTGACGGACTGGAACAACACGAAGAAAAAGAACAAGA
AGAAGAACCAGATCTTGCTTGAGGGCTTTGTGGAGGTTTCAGATGAGGAAAATTTGACGAGGACGAAGAGTTTGACGGATGACGATCTCGAGGAGCTCAAGGGGTGTGTG
GATCTAGGGTTTGCGTTTTGCTATGACGAGATTCCTGAGCTCTGTAACACATTGCCGGCGCTCGAGCTCTGTTATTCGATGAGCCAGAAGTTTATGGACGACCACCAGAA
TGTTCCGGAACATTCTCCGCCCGAGTCGGTGGATTCGGGGTCCAGTCCGATTCCGAATTGGAAGATCTCTAGTCCTGGTGATCATCCAGAAGATGTTAAAGCGAGGCTCA
AATATTGGGCGCAAGCGGTGGCTTGTACTTTAGTTGAGTGGCTTTTATGCAGTGGTGGGAGCCCCATCAAAGCTTCACCAACAGCCTTTTGCTTCACACCTCTGCACATG
CTGCTGAGATATTGTGCTAACTGTTTCTGTTCAGCCCACAAATATGGCTGGATGAATACGGGTTTGCTCGATGTCGCTGGGAGACGTGTGGGAGAACTGCCTACTTCATC
TGGGTTTATGATTGTTTTGCATCTTGAACTTGGTGGCATTTTCTAGCTTTGTATCATGAATGATAGTTTCTTGTTCATTTAGTTGGTTCTCCCTGAGTGCTAACTATTTC
TTTCGATCCACACGCATGTCTGGATGAGTACCCGTTTGCCTCGCCAGATGTAGTTGGATGGGAGGGGAGTTCTTACTTTCGTGGCTTACCGAGTGCTAACTATCTCTCGT
TGGTCCACACATATGGCTGGATGAATGAATACCCATTTGTCCCATGATGTTGGGATTGATGGGGGAAAACTGCCTGTTTCATCTCTACATCAAATGAATTCCATCTTGAA
CTTGAAGATGCTGTCTGGTTTTGTATGCTGACTGATGGTTTCTCGTAATGACATGACTGGAAAACGATCAACCGATCGGTTATTTTTTACTTCTTCCCTCGGTTTGTTCT
TATCTCTGGTTTATGGAAAGAGCAACAGATTCTTCCAAACATGTCCAATGTCTTTTAGGTTCTATCATGTGATACAGGGAAATAGAGAACTGCACAGCGGACAATACCAC
ACATCCCAAGTGAGGCGAGGCTTTTTGTAGGCCATGTAAGAGATTGAGGGGAACAAAGCTGAAAGAAACGGTTCATTTATGCAATCCTTGCTGTGTGATAGTGTACTAAA
AGCTTATTCTTGTCGGGTGTTCTACGTTCTATTTTTAAGACGATTATCTTTCAGTTTCGGGAGCTAGCTTGAATGTCCACCTTTGAACGATGTTCGTTTAATCATTGATT
TATTAGGATATTTTGTAATTTTATTGTCGGGTTATGTTATACGTTTCTGCTCGTTCTATTAGACGGCTTTCAAAAGTTGAAGGTGTTTTGATTGAATGTTGTGAGAGGGA
ATAAGGGAATAATTCTTTGGATAATTCCTTTTTGAGAATTATAATTCATTTCTATTAAGAGTTGGCCAATCTTATTTTGAATGGTAAAGGACAAAAAAGGG
Protein sequenceShow/hide protein sequence
MKEENRRIPKLEKMSPLSSRTESQISPESAIGGDPEPQIFPYEPEFSSEDDHDESVVLTDWNNTKKKNKKKNQILLEGFVEVSDEENLTRTKSLTDDDLEELKGCVDLGF
AFCYDEIPELCNTLPALELCYSMSQKFMDDHQNVPEHSPPESVDSGSSPIPNWKISSPGDHPEDVKARLKYWAQAVACTLVEWLLCSGGSPIKASPTAFCFTPLHMLLRY
CANCFCSAHKYGWMNTGLLDVAGRRVGELPTSSGFMIVLHLELGGIF