; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmUC01G007790 (gene) of Watermelon (USVL531) v1 genome

Gene IDCmUC01G007790
OrganismCitrullus mucosospermus (Watermelon (USVL531) v1)
Descriptionprotein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic
Genome locationCmU531Chr01:8445234..8452169
RNA-Seq ExpressionCmUC01G007790
SyntenyCmUC01G007790
Gene Ontology termsGO:0010190 - cytochrome b6f complex assembly (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021919 - Cofactor assembly of complex C subunit B, CCB1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0066087.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1 [Cucumis melo var. makuwa]3.4e-12791.37Show/hide
Query:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
        MAAKLLPSS+ YP PSNSFSAVDL+PRPCF RPRTHF  HRSVTV+VNAEPL+ LQ+H+NSAFLLAE+VGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
Subjt:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS

Query:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV
        KVVKKTFVS+ ESKKEPNQIAGEILSFFTRNNFQVT RGETITFEG MVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFW+SSLSPLAGAYYWV
Subjt:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV

Query:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ
        KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQ++QMRKEL+L+EKGMVYVKGIFEQ
Subjt:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ

XP_004143764.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic [Cucumis sativus]1.8e-12891.76Show/hide
Query:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
        MAAKLLPSS+LYP PSNSFSA+DL+PRPCF RPRTHF  HRSVTVRV+AEPLV LQDH+NSAFLLAE+VGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
Subjt:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS

Query:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV
        KVVKKTFVS+  SKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTIT+PDFGNNWFW+SSLSPLAGAYYWV
Subjt:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV

Query:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ
        KASRKEEIKVKMIVGEDGRLGEI+VQGDDQQ++QMRKEL+L+EKGMVYVKGIFEQ
Subjt:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ

XP_008465491.1 PREDICTED: protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic [Cucumis melo]4.4e-12790.98Show/hide
Query:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
        MAAKLLPSS+ YP PSNSFSA+DL+PRPCF RPRTHF  HRSVTV+VNAEPL+ LQ+H+NSAFLLAE+VGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
Subjt:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS

Query:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV
        KVVKKTFVS+ ESKKEPNQIAGEILSFFTRNNFQVT RGETITFEG MVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFW+SSLSPLAGAYYWV
Subjt:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV

Query:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ
        KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQ++QMRKEL+L+EKGMVYVKGIFEQ
Subjt:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ

XP_022938901.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic [Cucurbita moschata]1.9e-12289.23Show/hide
Query:  MAAKLLPSSNLYPPPS-----NSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIK
        MA K LP S+LYP PS     +SFSA DLTPRPCFGRPRTHF   RSV VRVN EPLV  QDHHNSAFLLAE+VGYS ASYYTSLGLFVISVPGLWSLIK
Subjt:  MAAKLLPSSNLYPPPS-----NSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIK

Query:  RSVKSKVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAG
        RSVKSKVVKKTFVS+GESKK PNQ AGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPD GNNWFW+SSLSPLAG
Subjt:  RSVKSKVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAG

Query:  AYYWVKASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ
        AYYWVKASRKEEIKVKMIVGEDG LGEIIVQGDDQQVEQMRKELQL+EKGMVYVKGIFEQ
Subjt:  AYYWVKASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ

XP_038874658.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic [Benincasa hispida]3.5e-13295.69Show/hide
Query:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
        MAAKLLPSS+L P PSNSFSAVDLTPRPCFGRPRTHF  HRSVTVRVNAEPLVVLQDHHNSAFLLAE+VGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
Subjt:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS

Query:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV
        KVVKKT VS+GESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFW+SSLSPLAGAYYWV
Subjt:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV

Query:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ
        KASRKEEIKVKMIVGEDG LGEIIVQGDDQQVEQMRKELQL+EKGMVYVKGIFEQ
Subjt:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ

TrEMBL top hitse value%identityAlignment
A0A0A0KRU8 Uncharacterized protein8.6e-12991.76Show/hide
Query:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
        MAAKLLPSS+LYP PSNSFSA+DL+PRPCF RPRTHF  HRSVTVRV+AEPLV LQDH+NSAFLLAE+VGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
Subjt:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS

Query:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV
        KVVKKTFVS+  SKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTIT+PDFGNNWFW+SSLSPLAGAYYWV
Subjt:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV

Query:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ
        KASRKEEIKVKMIVGEDGRLGEI+VQGDDQQ++QMRKEL+L+EKGMVYVKGIFEQ
Subjt:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ

A0A1S3CPE0 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic2.1e-12790.98Show/hide
Query:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
        MAAKLLPSS+ YP PSNSFSA+DL+PRPCF RPRTHF  HRSVTV+VNAEPL+ LQ+H+NSAFLLAE+VGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
Subjt:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS

Query:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV
        KVVKKTFVS+ ESKKEPNQIAGEILSFFTRNNFQVT RGETITFEG MVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFW+SSLSPLAGAYYWV
Subjt:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV

Query:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ
        KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQ++QMRKEL+L+EKGMVYVKGIFEQ
Subjt:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ

A0A5A7VG78 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB11.6e-12791.37Show/hide
Query:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
        MAAKLLPSS+ YP PSNSFSAVDL+PRPCF RPRTHF  HRSVTV+VNAEPL+ LQ+H+NSAFLLAE+VGYSMASYYTSLGLFVISVPGLWSLIKRSVKS
Subjt:  MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKS

Query:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV
        KVVKKTFVS+ ESKKEPNQIAGEILSFFTRNNFQVT RGETITFEG MVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFW+SSLSPLAGAYYWV
Subjt:  KVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWV

Query:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ
        KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQ++QMRKEL+L+EKGMVYVKGIFEQ
Subjt:  KASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ

A0A6J1FEF5 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic9.2e-12389.23Show/hide
Query:  MAAKLLPSSNLYPPPS-----NSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIK
        MA K LP S+LYP PS     +SFSA DLTPRPCFGRPRTHF   RSV VRVN EPLV  QDHHNSAFLLAE+VGYS ASYYTSLGLFVISVPGLWSLIK
Subjt:  MAAKLLPSSNLYPPPS-----NSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIK

Query:  RSVKSKVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAG
        RSVKSKVVKKTFVS+GESKK PNQ AGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPD GNNWFW+SSLSPLAG
Subjt:  RSVKSKVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAG

Query:  AYYWVKASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ
        AYYWVKASRKEEIKVKMIVGEDG LGEIIVQGDDQQVEQMRKELQL+EKGMVYVKGIFEQ
Subjt:  AYYWVKASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ

A0A6J1JUD8 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic6.2e-11986.97Show/hide
Query:  MAAKLLPSSNLYP------PPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLI
        MA K LP S+LYP      PPS SFSA DLTPRPCFG PRTHF   RS+ VRVN EPLV  QD HNS FLLAE+VGYS ASYYTSLGLFVISVPGLWSLI
Subjt:  MAAKLLPSSNLYP------PPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLI

Query:  KRSVKSKVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLA
        KRSVKSKVVKKTFVS+GESKK PNQ AGEILSFFTRNNF+VTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPD GNNWFW+SSLSPLA
Subjt:  KRSVKSKVVKKTFVSDGESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLA

Query:  GAYYWVKASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ
        GAYYWVKASRKEEIKVKMIV EDG LGEIIVQGDDQQVEQMRKELQL+EKGMVYVKGIFEQ
Subjt:  GAYYWVKASRKEEIKVKMIVGEDGRLGEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ

SwissProt top hitse value%identityAlignment
Q9LSE4 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic7.4e-8572.96Show/hide
Query:  THFNQHRSVTVRVNAEPLVVL-----------QDHHNSAFLLAE-TVGYSMASYYTSLGLFVISVPGLWSLIKRSVKSKVVKKTFVSDGESKKEPNQIAG
        T  N+  +VT  +  EPL V+           + + NS  L+ E T GYS+ASYYTSLGLFVISVPGLWSLIKRSVKSK+V+KTFV + + KKEP Q+AG
Subjt:  THFNQHRSVTVRVNAEPLVVL-----------QDHHNSAFLLAE-TVGYSMASYYTSLGLFVISVPGLWSLIKRSVKSKVVKKTFVSDGESKKEPNQIAG

Query:  EILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWVKASRKEEIKVKMIVGEDGRLGE
        EILSFFTR NF +TDRGETITFEG MVPSRGQAALLTFCTCISLASVGLVLTIT PDFGNNWF+I  LSPLAG YYW KASRKEEIKVKM+VG  GRL E
Subjt:  EILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWVKASRKEEIKVKMIVGEDGRLGE

Query:  IIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ
        I+VQGDD QVE+MRKELQLNEKGMVYVKG+FE+
Subjt:  IIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ

Arabidopsis top hitse value%identityAlignment
AT3G26710.1 cofactor assembly of complex C5.2e-8672.96Show/hide
Query:  THFNQHRSVTVRVNAEPLVVL-----------QDHHNSAFLLAE-TVGYSMASYYTSLGLFVISVPGLWSLIKRSVKSKVVKKTFVSDGESKKEPNQIAG
        T  N+  +VT  +  EPL V+           + + NS  L+ E T GYS+ASYYTSLGLFVISVPGLWSLIKRSVKSK+V+KTFV + + KKEP Q+AG
Subjt:  THFNQHRSVTVRVNAEPLVVL-----------QDHHNSAFLLAE-TVGYSMASYYTSLGLFVISVPGLWSLIKRSVKSKVVKKTFVSDGESKKEPNQIAG

Query:  EILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWVKASRKEEIKVKMIVGEDGRLGE
        EILSFFTR NF +TDRGETITFEG MVPSRGQAALLTFCTCISLASVGLVLTIT PDFGNNWF+I  LSPLAG YYW KASRKEEIKVKM+VG  GRL E
Subjt:  EILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWVKASRKEEIKVKMIVGEDGRLGE

Query:  IIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ
        I+VQGDD QVE+MRKELQLNEKGMVYVKG+FE+
Subjt:  IIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGCCAAGCTTCTACCATCATCCAATCTCTATCCTCCCCCTTCCAACTCCTTCTCCGCCGTAGACCTAACTCCGCGCCCATGCTTCGGCCGACCTCGCACTCATTT
CAACCAGCACAGATCGGTAACCGTCAGAGTCAATGCGGAACCGCTCGTCGTTCTCCAAGACCACCACAACTCCGCCTTCCTCCTGGCTGAGACAGTCGGCTATTCCATGG
CTAGTTACTACACTTCTCTCGGCCTCTTCGTCATCTCCGTCCCTGGCCTATGGTCTCTCATCAAGCGATCCGTCAAATCCAAGGTTGTGAAGAAGACGTTCGTTAGCGAT
GGAGAATCGAAGAAGGAGCCGAATCAGATTGCTGGAGAGATCTTGTCGTTCTTCACTCGTAATAATTTCCAAGTCACAGACAGAGGTGAAACCATAACGTTTGAAGGAAC
AATGGTGCCGAGTCGAGGGCAAGCGGCATTGTTGACATTCTGTACGTGCATTAGCTTGGCTAGCGTGGGTCTTGTTCTGACCATAACGTTTCCAGATTTTGGCAACAATT
GGTTTTGGATTAGCAGCTTGAGTCCCTTAGCTGGAGCATACTACTGGGTGAAAGCATCAAGAAAGGAAGAGATCAAAGTTAAAATGATAGTTGGAGAAGATGGAAGGCTT
GGAGAGATTATTGTTCAAGGAGATGACCAGCAGGTTGAGCAAATGAGAAAGGAGCTTCAGTTGAATGAAAAAGGCATGGTTTATGTCAAAGGTATTTTTGAGCAATGA
mRNA sequenceShow/hide mRNA sequence
ACGGCAACCACAAGATGCCGCAGTGATTTTCACTTCACAAGCAAAGCAATGCTTTTATCCTTTTCCTCTTTCTTGTGGCTGTGGCTCTCTTCTTCTTCTTCTTCCTCAAA
ATCTCTCATTCCTTTCTCTGGTCGACATTAATGGCGGCCAAGCTTCTACCATCATCCAATCTCTATCCTCCCCCTTCCAACTCCTTCTCCGCCGTAGACCTAACTCCGCG
CCCATGCTTCGGCCGACCTCGCACTCATTTCAACCAGCACAGATCGGTAACCGTCAGAGTCAATGCGGAACCGCTCGTCGTTCTCCAAGACCACCACAACTCCGCCTTCC
TCCTGGCTGAGACAGTCGGCTATTCCATGGCTAGTTACTACACTTCTCTCGGCCTCTTCGTCATCTCCGTCCCTGGCCTATGGTCTCTCATCAAGCGATCCGTCAAATCC
AAGGTTGTGAAGAAGACGTTCGTTAGCGATGGAGAATCGAAGAAGGAGCCGAATCAGATTGCTGGAGAGATCTTGTCGTTCTTCACTCGTAATAATTTCCAAGTCACAGA
CAGAGGTGAAACCATAACGTTTGAAGGAACAATGGTGCCGAGTCGAGGGCAAGCGGCATTGTTGACATTCTGTACGTGCATTAGCTTGGCTAGCGTGGGTCTTGTTCTGA
CCATAACGTTTCCAGATTTTGGCAACAATTGGTTTTGGATTAGCAGCTTGAGTCCCTTAGCTGGAGCATACTACTGGGTGAAAGCATCAAGAAAGGAAGAGATCAAAGTT
AAAATGATAGTTGGAGAAGATGGAAGGCTTGGAGAGATTATTGTTCAAGGAGATGACCAGCAGGTTGAGCAAATGAGAAAGGAGCTTCAGTTGAATGAAAAAGGCATGGT
TTATGTCAAAGGTATTTTTGAGCAATGATTATTACTTTCTCTTTTCTTTTTCCCTTTTTTTCCTCTTTAGCATAGTTGAAGGAATCTTCATGAGTTACAATTCTAGTTCC
CAAATGCAAAATGTTCAATAACTTAATCAATCAGTCCAGATTCTGGAAAAGATTCAAACTTGTACAACAATTTTCAACACATGAATCATACAAATACATCAAAATTTGAA
TATGCCCTCACTTTTTTTGTCTACTACAAAGTTTTCATTATCCTCAATAGGATGGTGGTATCTTCTAATCTATTATCTTTCACTCATTCAGAAAAAATTATTTCAGATTC
ATTCATGCATCAAGCAATGTTACAGTTACAGTGTTCTCACTGCATAAAGATTTCTAAAATCTCAAAGGAGTGAAGAAAGAAACAAAACATGTCTGCAACTTTATCTGCTT
TAAATAAAAATTGAGGGTCTCGAACCTCGGTATTACCTCGAAAAGTAGAGATTGGTTTAACCAATCAAGAGCTTGTTCTCATCCAAAAAGGTGTAGGTGGGTACCTCCAA
ATTGTAGGGTGCTGGCCCTTTACGGATTCTGCCAGAGACGTCGTAATGAGAACCATGGCATGGACAAAACCACCCACCATAATCACCAGCATTTGGCAAGGGAATGCAAC
CAAGATGCGTGCAGACACCGATCACAATAAGCCATTCTGGATCCTTAACTCTCTCTTCATCCTGCTGAGGGTCACGAAGAGATCCAACGTCCACACTATTTGCTAACTTA
ATATCGTCCTCAGTTCGTCGCCTGATGAATACTGGTTTTCCACGCCACTTAACTGTCACCGTAGAACCAGGTTCGATGCTTGAAAGGTCAACCTCAAGTGAGGCCAGGGC
AAGAACATCCTTACTGGCTGACATGCTGAGGACAAACTTGAGAACAAGGAGACGAATCAAAGAAGCATAGACAAAACGACCACCTGATAATACAAAATAAGCAAATGCGC
GCTTGCTGGGGTCACCAGGAGGAAATCGCTCATGATTGTAGTCGTCATAGACTATCTTTGAAGAAGGGTTCTTAATGGCAGCTACAGTTGCAGGAATATCCAGAATTAAG
CCATCATCCTTGGTTGAGGTCAATGCATCAGAAGCAAAACCTCTAGAATTGCGATCAAAGTGAGGTCCGATGGGGAAAGTGTTGAAGAATCCGCCGGATCTGAAATCATT
GGAAGGGGCGGAATCAGTGGAATCAATGATGTTGTGAGAACCGGGCAAAGAAGAGGCAATAGTGGGCCTGAAATGCGAAGAAGAGAGTGACGAGAGCCTTCGAGCTGCAA
TCCTCAACATCTTCGTCAACCTTCAGGTTTTCGATTTGGAGGGATATTGAGAAACTCAGAAGAAGGCGATAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAAGAA
CCGGCCTTTTCCCCGATTCGTAGAGGGTTCATGACCGACCCCTATATTTGACCTCAAACTTGTTTTGCTTCCTTCAGCTATCGTTATGGTTTTTTAATTTTC
Protein sequenceShow/hide protein sequence
MAAKLLPSSNLYPPPSNSFSAVDLTPRPCFGRPRTHFNQHRSVTVRVNAEPLVVLQDHHNSAFLLAETVGYSMASYYTSLGLFVISVPGLWSLIKRSVKSKVVKKTFVSD
GESKKEPNQIAGEILSFFTRNNFQVTDRGETITFEGTMVPSRGQAALLTFCTCISLASVGLVLTITFPDFGNNWFWISSLSPLAGAYYWVKASRKEEIKVKMIVGEDGRL
GEIIVQGDDQQVEQMRKELQLNEKGMVYVKGIFEQ