CuGenDBv2

Pay0004937 (gene) of Melon (Payzawat) v1 genome

Gene ID: Pay0004937
Organism: Cucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
Description: Phytocyanin domain-containing protein
Genome location: chr11:5368604..5369674
RNA-Seq Expression: Pay0004937
Synteny: Pay0004937
Gene Ontology terms:
  GO:0022900 - electron transport chain (biological process)
  GO:0009055 - electron transfer activity (molecular function)
InterPro domains:
  IPR003245 - Phytocyanin domain
  IPR008972 - Cupredoxin
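
The genome location in this record follows a `chromosome:start..end` pattern. A minimal Python sketch for splitting such a string into its parts is given here; the helper names `parse_location` and `span_length` are hypothetical, and the length calculation assumes 1-based, inclusive coordinates, which the record itself does not state.

```python
import re

# Pattern for strings such as "chr11:5368604..5369674".
_LOCATION_RE = re.compile(r"^(?P<chrom>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)$")


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = _LOCATION_RE.match(location.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {location!r}")
    return match["chrom"], int(match["start"]), int(match["end"])


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING 1-based inclusive coordinates."""
    return end - start + 1


chrom, start, end = parse_location("chr11:5368604..5369674")
print(chrom, start, end, span_length(start, end))
```

If the coordinates turn out to be 0-based or half-open, only `span_length` would need to change.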


Homology
GenBank top hits (e value, % identity, alignment)
KAG7010963.1 hypothetical protein SDJN02_27761, partial [Cucurbita argyrosperma subsp. argyrosperma]; e value 4.7e-72; identity 58.31%
Query:  MAFICSKLIILFIWACISTISSANGGWFWGPNCTVPSKPRHSRSPPPPSRRHPRTPPSQRFPPPPPRSRRPRTPPPSPPMLSSPPPPSRRPRTPPPSRRP
        M FI SKL+ILFIWAC+ST+SSAN  WFW  NC+ P  P H+ SPPPP+    RTPPSQ  PP PPRSRRPRTPP     + SPPPPS           P
Subjt:  MAFICSKLIILFIWACISTISSANGGWFWGPNCTVPSKPRHSRSPPPPSRRHPRTPPSQRFPPPPPRSRRPRTPPPSPPMLSSPPPPSRRPRTPPPSRRP

Query:  QTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSRRPRTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALK
        Q PP                          PPPP+SRR RTPP ++  PPP               PPS        RKIIVGGS  W LGFDY DWALK
Subjt:  QTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSRRPRTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALK

Query:  NGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGD-GFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITPILKG
        NGPF++NDILVFKYDPPN +TP HSVY L NMRSF NCDLGKAKMLAN  QGS + GFEF LK QNPYYFACGE NGFHC+ GSMKFT+TPIL+G
Subjt:  NGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGD-GFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITPILKG

XP_008459215.1 PREDICTED: extensin, partial [Cucumis melo]; e value 1.1e-81; identity 98.71%
Query:  RTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCD
        RTPPPQRIPPPP SLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLL NMRSFANCD
Subjt:  RTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCD

Query:  LGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITPILKG
        LGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITPILKG
Subjt:  LGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITPILKG

XP_011649199.1 leucine-rich repeat extensin-like protein 3 [Cucumis sativus]; e value 1.4e-108; identity 78.95%
Query:  MAFICSKLIILFIWACISTISSA-NGGWFWG-PNCTVPSKPRHSRSPPPPSR-----RHPRTPPSQRFPPPPPRSRRPRTPPPSP-PMLSSPPPPSRRPR
        MAFI SKLIILFIWACIST+ SA NGGWFWG PNCTV S P H  S PPP R     R+PRTPPS+RFPPPP  SRR RTPP    P+L  PPPP R PR
Subjt:  MAFICSKLIILFIWACISTISSA-NGGWFWG-PNCTVPSKPRHSRSPPPPSR-----RHPRTPPSQRFPPPPPRSRRPRTPPPSP-PMLSSPPPPSRRPR

Query:  TPPPSRRPQTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSR---RPRTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQ
        TPPPSRRP+TPPP PPMLSSPPPPSP         + SPPPP+ R   RPR PP Q IPP P SLPSP PPSPSPSLPPSP PTPQSPRKIIVGGSNQWQ
Subjt:  TPPPSRRPQTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSR---RPRTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQ

Query:  LGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTIT
        LGFDYTDWALKNGPFYVNDILVFKYDPPN STPPH+VYLL NMRS ANCD GKAK+LANI QGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFT+T
Subjt:  LGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTIT

Query:  PILK
        PILK
Subjt:  PILK

XP_022944052.1 alpha carbonic anhydrase 8-like [Cucurbita moschata]; e value 2.2e-74; identity 61.69%
Query:  MAFICSKLIILFIWACISTISSANGGWFWGPNCTVPSKPRHSRSPPPPSRRHPRTPPSQRFPPPPPRSRRPRTPPPSPPMLSSPPPPSRRPRTPPPSRRP
        M  I SKL+ILFIWAC+ST+SSAN  WFW  NC+ P  P H  SPPPP+    RTPPSQ  PP PPRSRRPRTPP     + SPPPPS           P
Subjt:  MAFICSKLIILFIWACISTISSANGGWFWGPNCTVPSKPRHSRSPPPPSRRHPRTPPSQRFPPPPPRSRRPRTPPPSPPMLSSPPPPSRRPRTPPPSRRP

Query:  QTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSRRPRTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALK
        Q PPP       PPP S RPRT          PPRSR PRTPP Q+  PPP   P P PPS                RKIIVGGS  W LGFDY DWALK
Subjt:  QTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSRRPRTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALK

Query:  NGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGD-GFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITPILKG
        NGPF++NDILVFKYDPPN +TP HSVY L NMRSF NCDLGKAKMLAN  QGS + GFEF LK QNPYYFACGE NGFHC+ GSMKFT+TPIL+G
Subjt:  NGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGD-GFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITPILKG

XP_038902031.1 extensin-3-like [Benincasa hispida]; e value 9.6e-94; identity 69.72%
Query:  MAFICSKLIILFIWACISTISSANGGWFWGPNCTV------------PSKPRHSRSPPPPSRR-------HPRTPPSQRFPPPPPRSRRPRTPP----PS
        MAFI SK+IILFIWAC ST+SSAN G F G N T+            PS P   R P PPSRR       HPRTPPS++FPPPPP SRRPRTPP    P 
Subjt:  MAFICSKLIILFIWACISTISSANGGWFWGPNCTV------------PSKPRHSRSPPPPSRR-------HPRTPPSQRFPPPPPRSRRPRTPP----PS

Query:  PPMLSSPPPPSRRPRT--------PPPSRRPQTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSRRPRTPPPQRIPPPPWSLPSPLPPSPSPSLP--
        PP     PP  R PRT        PPPSRR +TPP   P    P PPS +P+TPPSP++S  PPP+SRRPRTPP Q+IPPPP S+PS   PSP PSLP  
Subjt:  PPMLSSPPPPSRRPRT--------PPPSRRPQTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSRRPRTPPPQRIPPPPWSLPSPLPPSPSPSLP--

Query:  PSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVLKDQNPY
        PSPLP PQSPRKIIVGGS QWQLGFDY DWALKNGPFYVNDILVFKYDPPN STPPHSVY L NMRSFANCDLGK KM+ANI QGS DGFEFVLKDQNPY
Subjt:  PSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVLKDQNPY

Query:  YFACGEGNGFHCKLGSMKFTITPILKG
        YFACGEGNGFHCKLGSMKFT+TPI+KG
Subjt:  YFACGEGNGFHCKLGSMKFTITPILKG

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LIT9 Phytocyanin domain-containing protein; e value 6.7e-109; identity 78.95%
Query:  MAFICSKLIILFIWACISTISSA-NGGWFWG-PNCTVPSKPRHSRSPPPPSR-----RHPRTPPSQRFPPPPPRSRRPRTPPPSP-PMLSSPPPPSRRPR
        MAFI SKLIILFIWACIST+ SA NGGWFWG PNCTV S P H  S PPP R     R+PRTPPS+RFPPPP  SRR RTPP    P+L  PPPP R PR
Subjt:  MAFICSKLIILFIWACISTISSA-NGGWFWG-PNCTVPSKPRHSRSPPPPSR-----RHPRTPPSQRFPPPPPRSRRPRTPPPSP-PMLSSPPPPSRRPR

Query:  TPPPSRRPQTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSR---RPRTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQ
        TPPPSRRP+TPPP PPMLSSPPPPSP         + SPPPP+ R   RPR PP Q IPP P SLPSP PPSPSPSLPPSP PTPQSPRKIIVGGSNQWQ
Subjt:  TPPPSRRPQTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSR---RPRTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQ

Query:  LGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTIT
        LGFDYTDWALKNGPFYVNDILVFKYDPPN STPPH+VYLL NMRS ANCD GKAK+LANI QGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFT+T
Subjt:  LGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTIT

Query:  PILK
        PILK
Subjt:  PILK

A0A1S3CAV5 extensin; e value 5.4e-82; identity 98.71%
Query:  RTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCD
        RTPPPQRIPPPP SLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLL NMRSFANCD
Subjt:  RTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCD

Query:  LGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITPILKG
        LGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITPILKG
Subjt:  LGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITPILKG

A0A6J1D4K0 leucine-rich repeat extensin-like protein 3; e value 3.6e-62; identity 55.72%
Query:  IILFIWACISTISSANGGWFWGPNCTVPSKPRHSRSPPPPSRRHPRTPP-----SQRFPPPP------PRSRRPRT----PPPSPPMLSSPPPPSRRPRT
        I+LF  A +S ++SA G WF   NC   SK RH RSPPPP  R  R PP     ++RFPPPP      P   +P+T    PPP P     PPPP  + R 
Subjt:  IILFIWACISTISSANGGWFWGPNCTVPSKPRHSRSPPPPSRRHPRTPP-----SQRFPPPP------PRSRRPRT----PPPSPPMLSSPPPPSRRPRT

Query:  -PPPSRRPQT-----PPPS-------PPMLSS-----------PPPPSPRPRTPPSPIVSSPPP-PRSRRPRTPPPQR----IPPPPWSLPSPLPPSPS-
         PPP  +P+T     PPP        PP  SS           PPPP P P  PPSP+ S PPP P++RR   PPPQ      PPPP   P P PP PS 
Subjt:  -PPPSRRPQT-----PPPS-------PPMLSS-----------PPPPSPRPRTPPSPIVSSPPP-PRSRRPRTPPPQR----IPPPPWSLPSPLPPSPS-

Query:  PSLPPSP--LPTPQSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVL
        PSLPP P   P PQSPRKIIVGGS  W LGFDY++WALKNGPF++NDILVFKYDPP  +T PHSVYLL+NM+SF+NCDL +A  LAN  QG+GDGF+FVL
Subjt:  PSLPPSP--LPTPQSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVL

Query:  KDQNPYYFACGEGNGFHCKLGSMKFTITPILK
        K Q  YYFACGEGNGFHCK GSMKF++TPIL+
Subjt:  KDQNPYYFACGEGNGFHCKLGSMKFTITPILK

A0A6J1FUQ3 alpha carbonic anhydrase 8-like; e value 1.1e-74; identity 61.69%
Query:  MAFICSKLIILFIWACISTISSANGGWFWGPNCTVPSKPRHSRSPPPPSRRHPRTPPSQRFPPPPPRSRRPRTPPPSPPMLSSPPPPSRRPRTPPPSRRP
        M  I SKL+ILFIWAC+ST+SSAN  WFW  NC+ P  P H  SPPPP+    RTPPSQ  PP PPRSRRPRTPP     + SPPPPS           P
Subjt:  MAFICSKLIILFIWACISTISSANGGWFWGPNCTVPSKPRHSRSPPPPSRRHPRTPPSQRFPPPPPRSRRPRTPPPSPPMLSSPPPPSRRPRTPPPSRRP

Query:  QTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSRRPRTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALK
        Q PPP       PPP S RPRT          PPRSR PRTPP Q+  PPP   P P PPS                RKIIVGGS  W LGFDY DWALK
Subjt:  QTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSRRPRTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALK

Query:  NGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGD-GFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITPILKG
        NGPF++NDILVFKYDPPN +TP HSVY L NMRSF NCDLGKAKMLAN  QGS + GFEF LK QNPYYFACGE NGFHC+ GSMKFT+TPIL+G
Subjt:  NGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGD-GFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITPILKG

A0A6J1G5T9 extensin-like isoform X4; e value 1.2e-62; identity 55.93%
Query:  MAFICSKLIILFIWACISTISSANGGWFWGPNCTVPSKPRHSRS--PPPPSRRHPRTPPSQRFP--PPPPRSRRPRTPPPSPPMLSSPPPPSRRPRTPPP
        MA I S +I+LF+ AC+ST+SSA+  WF   N T     RH R   PPPPSR  P+ PP +  P  PPP R R+PRTPPPS P         ++PRTPP 
Subjt:  MAFICSKLIILFIWACISTISSANGGWFWGPNCTVPSKPRHSRS--PPPPSRRHPRTPPSQRFP--PPPPRSRRPRTPPPSPPMLSSPPPPSRRPRTPPP

Query:  -SRRPQTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSRRPRTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYT
          R+P+TPPP                               RRPRTPPP+  PP P     P P +P    P SPLPTPQ+PRKIIVGGS  W+LGFDY 
Subjt:  -SRRPQTPPPSPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSRRPRTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYT

Query:  DWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITP
        DW LKNGPFYVNDILVFKYDPPN STPPH+VYLL NM+S   CD  +AK++AN+ QGSG+GF FVLK Q  YYFACGEGNGFHC LGSMKF+ITP
Subjt:  DWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITP

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G15770.1 Cupredoxin superfamily protein; e value 2.1e-30; identity 49.15%
Query:  QSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEG
        ++P+KIIVGGS+ W+ G DY DWA KN PFYVND+LVFKYD    +   ++VYL  +  S+ NCD+  A+ + + ++GS + F F LK   PY+FA GE 
Subjt:  QSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEG

Query:  NGFHCKLGSMKFTITPIL
        +G +C+  +MKFTI P+L
Subjt:  NGFHCKLGSMKFTITPIL

AT2G15780.1 Cupredoxin superfamily protein; e value 1.2e-38; identity 59.17%
Query:  TPQSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVLKDQNPYYFACG
        T   PRKIIVGG  +W  GF+Y DWA K  PF++NDILVFKY+PP   T  HSVYLL N  S+  CD+ K KM+A+ KQG+G GFEFVLK   PYY +CG
Subjt:  TPQSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDILVFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVLKDQNPYYFACG

Query:  EGNGFHCKLGSMKFTITPIL
        E +G HC  G+MKFT+ P+L
Subjt:  EGNGFHCKLGSMKFTITPIL


Sequences
CDS sequence
ATGGCTTTCATATGTTCAAAACTTATCATTCTGTTCATTTGGGCATGCATCTCAACAATAAGCTCTGCCAATGGAGGTTGGTTTTGGGGACCAAATTGCACTGTG
CCTTCCAAACCTAGGCATAGTCGATCTCCGCCACCACCATCACGGCGACATCCTCGAACACCACCATCACAACGGTTTCCACCACCACCGCCACGATCACGACGT
CCTCGAACACCTCCACCATCGCCACCAATGCTATCATCGCCTCCACCACCATCACGAAGGCCTCGAACACCACCACCATCAAGACGTCCTCAAACACCTCCACCA
TCACCACCAATGTTATCATCGCCTCCACCACCATCACCAAGGCCTCGAACACCACCCTCGCCAATAGTGTCGTCACCACCACCACCACGATCACGACGACCACGT
ACACCACCACCCCAAAGAATACCACCACCACCGTGGTCATTGCCATCACCGTTGCCACCATCTCCATCACCATCATTACCACCCTCACCACTACCAACCCCACAA
AGTCCAAGAAAGATCATAGTGGGTGGTTCCAATCAATGGCAGCTTGGCTTCGACTACACCGATTGGGCGCTCAAAAACGGTCCCTTTTATGTCAATGATATTCTT
GTGTTCAAATATGATCCTCCAAACATGTCAACTCCACCTCATAGTGTTTATCTGCTAACAAACATGAGAAGCTTTGCCAATTGTGATTTAGGAAAGGCCAAAATG
TTAGCAAATATAAAACAGGGAAGTGGAGATGGGTTTGAGTTTGTGCTTAAAGACCAAAATCCTTACTATTTTGCTTGTGGAGAAGGCAATGGCTTCCATTGCAAA
CTTGGATCCATGAAGTTCACCATCACACCAATACTTAAGGGTTGA
mRNA sequence
ATGGCTTTCATATGTTCAAAACTTATCATTCTGTTCATTTGGGCATGCATCTCAACAATAAGCTCTGCCAATGGAGGTTGGTTTTGGGGACCAAATTGCACTGTG
CCTTCCAAACCTAGGCATAGTCGATCTCCGCCACCACCATCACGGCGACATCCTCGAACACCACCATCACAACGGTTTCCACCACCACCGCCACGATCACGACGT
CCTCGAACACCTCCACCATCGCCACCAATGCTATCATCGCCTCCACCACCATCACGAAGGCCTCGAACACCACCACCATCAAGACGTCCTCAAACACCTCCACCA
TCACCACCAATGTTATCATCGCCTCCACCACCATCACCAAGGCCTCGAACACCACCCTCGCCAATAGTGTCGTCACCACCACCACCACGATCACGACGACCACGT
ACACCACCACCCCAAAGAATACCACCACCACCGTGGTCATTGCCATCACCGTTGCCACCATCTCCATCACCATCATTACCACCCTCACCACTACCAACCCCACAA
AGTCCAAGAAAGATCATAGTGGGTGGTTCCAATCAATGGCAGCTTGGCTTCGACTACACCGATTGGGCGCTCAAAAACGGTCCCTTTTATGTCAATGATATTCTT
GTGTTCAAATATGATCCTCCAAACATGTCAACTCCACCTCATAGTGTTTATCTGCTAACAAACATGAGAAGCTTTGCCAATTGTGATTTAGGAAAGGCCAAAATG
TTAGCAAATATAAAACAGGGAAGTGGAGATGGGTTTGAGTTTGTGCTTAAAGACCAAAATCCTTACTATTTTGCTTGTGGAGAAGGCAATGGCTTCCATTGCAAA
CTTGGATCCATGAAGTTCACCATCACACCAATACTTAAGGGTTGA
Protein sequence
MAFICSKLIILFIWACISTISSANGGWFWGPNCTVPSKPRHSRSPPPPSRRHPRTPPSQRFPPPPPRSRRPRTPPPSPPMLSSPPPPSRRPRTPPPSRRPQTPPP
SPPMLSSPPPPSPRPRTPPSPIVSSPPPPRSRRPRTPPPQRIPPPPWSLPSPLPPSPSPSLPPSPLPTPQSPRKIIVGGSNQWQLGFDYTDWALKNGPFYVNDIL
VFKYDPPNMSTPPHSVYLLTNMRSFANCDLGKAKMLANIKQGSGDGFEFVLKDQNPYYFACGEGNGFHCKLGSMKFTITPILKG