; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

MC04g1121 (gene) of Bitter gourd (Dali-11) v1 genome

Gene IDMC04g1121
OrganismMomordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Descriptionprotein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic
Genome locationMC04:19266679..19273292
RNA-Seq ExpressionMC04g1121
SyntenyMC04g1121
Gene Ontology termsGO:0010190 - cytochrome b6f complex assembly (biological process)
GO:0009507 - chloroplast (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021919 - Cofactor assembly of complex C subunit B, CCB1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7016624.1 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic [Cucurbita argyrosperma subsp. argyrosperma]1.86e-14484.56Show/hide
Query:  ATKLLPSHLHPLPSPK----SFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR
        A K LPSHL+PLPSP     SFS GDL P  C   PRT  RPQRSV VRVN EPL+A Q DH NSAF LA++VGYS ASYYTSLGLFVISVPGLWSLIKR
Subjt:  ATKLLPSHLHPLPSPK----SFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR

Query:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA
        SVKSKVVKKTFV+EGESKKAP+Q AGEILSFFTRNNFQVTDRGETITFEG MVPSRGQAALLTFCTCISLASV LVLTITFPD GNNWFW+SSLSPLAGA
Subjt:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA

Query:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
        YYWVKASR+EEIKVKMIV E G L EIIVQGDDQQVE MRKELQLSEKGMVYVKGIFEQ
Subjt:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ

XP_022141475.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic [Momordica charantia]4.77e-179100Show/hide
Query:  MALAATKLLPSHLHPLPSPKSFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR
        MALAATKLLPSHLHPLPSPKSFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR
Subjt:  MALAATKLLPSHLHPLPSPKSFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR

Query:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA
        SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA
Subjt:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA

Query:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
        YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
Subjt:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ

XP_022938901.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic [Cucurbita moschata]1.13e-14584.94Show/hide
Query:  ATKLLPSHLHPLPSPK----SFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR
        A K LPSHL+PLPSP     SFS GDL P  C   PRT  RPQRSV VRVN EPL+A Q DHHNSAF LA++VGYS ASYYTSLGLFVISVPGLWSLIKR
Subjt:  ATKLLPSHLHPLPSPK----SFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR

Query:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA
        SVKSKVVKKTFV+EGESKKAP+Q AGEILSFFTRNNFQVTDRGETITFEG MVPSRGQAALLTFCTCISLASV LVLTITFPD GNNWFW+SSLSPLAGA
Subjt:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA

Query:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
        YYWVKASR+EEIKVKMIV E G L EIIVQGDDQQVE MRKELQLSEKGMVYVKGIFEQ
Subjt:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ

XP_022992736.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic [Cucurbita maxima]7.57e-14483.78Show/hide
Query:  ATKLLPSHLHPLPSPK----SFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR
        A K LPSHL+PL SP     SFS GDL P  C  GPRT  RPQRS+ VRVN EPL+A Q D HNS F LA++VGYS ASYYTSLGLFVISVPGLWSLIKR
Subjt:  ATKLLPSHLHPLPSPK----SFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR

Query:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA
        SVKSKVVKKTFV+EGESKKAP+Q AGEILSFFTRNNF+VTDRGETITFEG MVPSRGQAALLTFCTCISLASV LVLTITFPD GNNWFW+SSLSPLAGA
Subjt:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA

Query:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
        YYWVKASR+EEIKVKMIVAE G L EIIVQGDDQQVE MRKELQLSEKGMVYVKGIFEQ
Subjt:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ

XP_023550037.1 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic [Cucurbita pepo subsp. pepo]3.37e-14785.33Show/hide
Query:  ATKLLPSHLHPLPSPK----SFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR
        A K+LPSHL+PLPSP     SFS GDL P  C  GPRT  RPQRSV VRVN EPL+A Q DHHNSAF LA++VGYS ASYYTSLGLFVISVPGLWSLIKR
Subjt:  ATKLLPSHLHPLPSPK----SFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR

Query:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA
        SVKSKVVKKTFV+EGESKKAP+Q AGEILSFFTRNNFQVTDRGETITFEG MVPSRGQAALLTFCTCISLASV LVLTITFPD GNNWFW+SSLSPLAGA
Subjt:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA

Query:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
        YYWVKASR+EEIKVKMIV E G L EIIVQGDDQQVE MRKELQLSEKGMVYVKGIFEQ
Subjt:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ

TrEMBL top hitse value%identityAlignment
A0A1S3CPE0 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic4.11e-14083.59Show/hide
Query:  ATKLLPS-HLHPLPSPKSFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKRSVK
        A KLLPS H +PLPS  SFS  DL P  C   PRT  +P RSVTV+VNAEPL+ALQ +H+NSAF LA++VGYSMASYYTSLGLFVISVPGLWSLIKRSVK
Subjt:  ATKLLPS-HLHPLPSPKSFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKRSVK

Query:  SKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGAYYW
        SKVVKKTFV+E ESKK P+QIAGEILSFFTRNNFQVT RGETITFEGAMVPSRGQAALLTFCTCISLASV LVLTITFPDFGNNWFW+SSLSPLAGAYYW
Subjt:  SKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGAYYW

Query:  VKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
        VKASR+EEIKVKMIV E G+L EIIVQGDDQQ++ MRKEL+LSEKGMVYVKGIFEQ
Subjt:  VKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ

A0A5A7VG78 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB12.89e-14083.59Show/hide
Query:  ATKLLPS-HLHPLPSPKSFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKRSVK
        A KLLPS H +PLPS  SFS  DL P  C   PRT  +P RSVTV+VNAEPL+ALQ +H+NSAF LA++VGYSMASYYTSLGLFVISVPGLWSLIKRSVK
Subjt:  ATKLLPS-HLHPLPSPKSFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKRSVK

Query:  SKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGAYYW
        SKVVKKTFV+E ESKK P+QIAGEILSFFTRNNFQVT RGETITFEGAMVPSRGQAALLTFCTCISLASV LVLTITFPDFGNNWFW+SSLSPLAGAYYW
Subjt:  SKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGAYYW

Query:  VKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
        VKASR+EEIKVKMIV E G+L EIIVQGDDQQ++ MRKEL+LSEKGMVYVKGIFEQ
Subjt:  VKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ

A0A6J1CI70 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic2.31e-179100Show/hide
Query:  MALAATKLLPSHLHPLPSPKSFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR
        MALAATKLLPSHLHPLPSPKSFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR
Subjt:  MALAATKLLPSHLHPLPSPKSFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR

Query:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA
        SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA
Subjt:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA

Query:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
        YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
Subjt:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ

A0A6J1FEF5 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic5.45e-14684.94Show/hide
Query:  ATKLLPSHLHPLPSPK----SFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR
        A K LPSHL+PLPSP     SFS GDL P  C   PRT  RPQRSV VRVN EPL+A Q DHHNSAF LA++VGYS ASYYTSLGLFVISVPGLWSLIKR
Subjt:  ATKLLPSHLHPLPSPK----SFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR

Query:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA
        SVKSKVVKKTFV+EGESKKAP+Q AGEILSFFTRNNFQVTDRGETITFEG MVPSRGQAALLTFCTCISLASV LVLTITFPD GNNWFW+SSLSPLAGA
Subjt:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA

Query:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
        YYWVKASR+EEIKVKMIV E G L EIIVQGDDQQVE MRKELQLSEKGMVYVKGIFEQ
Subjt:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ

A0A6J1JUD8 protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic3.66e-14483.78Show/hide
Query:  ATKLLPSHLHPLPSPK----SFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR
        A K LPSHL+PL SP     SFS GDL P  C  GPRT  RPQRS+ VRVN EPL+A Q D HNS F LA++VGYS ASYYTSLGLFVISVPGLWSLIKR
Subjt:  ATKLLPSHLHPLPSPK----SFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKR

Query:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA
        SVKSKVVKKTFV+EGESKKAP+Q AGEILSFFTRNNF+VTDRGETITFEG MVPSRGQAALLTFCTCISLASV LVLTITFPD GNNWFW+SSLSPLAGA
Subjt:  SVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGA

Query:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
        YYWVKASR+EEIKVKMIVAE G L EIIVQGDDQQVE MRKELQLSEKGMVYVKGIFEQ
Subjt:  YYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ

SwissProt top hitse value%identityAlignment
Q9LSE4 Protein COFACTOR ASSEMBLY OF COMPLEX C SUBUNIT B CCB1, chloroplastic1.6e-7978.17Show/hide
Query:  NSAFFLAD-TVGYSMASYYTSLGLFVISVPGLWSLIKRSVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALL
        NS   + + T GYS+ASYYTSLGLFVISVPGLWSLIKRSVKSK+V+KTFV   + KK P Q+AGEILSFFTR NF +TDRGETITFEG MVPSRGQAALL
Subjt:  NSAFFLAD-TVGYSMASYYTSLGLFVISVPGLWSLIKRSVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALL

Query:  TFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGAYYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
        TFCTCISLASV LVLTIT PDFGNNWF+I  LSPLAG YYW KASR+EEIKVKM+V   G+L EI+VQGDD QVE MRKELQL+EKGMVYVKG+FE+
Subjt:  TFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGAYYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ

Arabidopsis top hitse value%identityAlignment
AT3G26710.1 cofactor assembly of complex C1.1e-8078.17Show/hide
Query:  NSAFFLAD-TVGYSMASYYTSLGLFVISVPGLWSLIKRSVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALL
        NS   + + T GYS+ASYYTSLGLFVISVPGLWSLIKRSVKSK+V+KTFV   + KK P Q+AGEILSFFTR NF +TDRGETITFEG MVPSRGQAALL
Subjt:  NSAFFLAD-TVGYSMASYYTSLGLFVISVPGLWSLIKRSVKSKVVKKTFVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALL

Query:  TFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGAYYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ
        TFCTCISLASV LVLTIT PDFGNNWF+I  LSPLAG YYW KASR+EEIKVKM+V   G+L EI+VQGDD QVE MRKELQL+EKGMVYVKG+FE+
Subjt:  TFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGAYYWVKASRREEIKVKMIVAEGGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTGGCAGCAACCAAGCTACTACCATCTCACCTCCATCCTCTTCCTTCCCCTAAATCCTTCTCCACCGGAGACCTACCTCCGCCGTCATGCCTCCCCGGACCTCG
CACTCTCGTCAGGCCGCAGAGATCGGTAACGGTGAGAGTCAATGCGGAGCCGCTCCTCGCTCTCCAATTGGACCACCACAACTCCGCCTTCTTCCTCGCTGACACCGTCG
GCTACTCCATGGCTAGCTACTACACTTCTCTCGGCCTCTTCGTCATCTCCGTCCCTGGCTTGTGGTCACTCATCAAGCGATCCGTCAAATCCAAGGTTGTGAAGAAGACA
TTCGTCGCCGAAGGAGAATCGAAGAAGGCGCCGAGCCAGATCGCCGGAGAGATCTTGTCATTCTTCACTCGCAACAACTTCCAAGTCACGGACAGAGGCGAAACCATAAC
ATTTGAAGGAGCGATGGTGCCGAGTCGAGGCCAAGCGGCATTGCTGACATTCTGTACTTGCATTAGCCTGGCCAGCGTCGCCCTCGTCCTCACCATAACTTTTCCAGATT
TCGGCAACAACTGGTTCTGGATTAGCAGCCTCAGTCCCCTAGCAGGAGCATATTACTGGGTGAAAGCATCAAGAAGGGAGGAGATAAAGGTCAAAATGATAGTTGCAGAA
GGTGGAAAGCTTGCAGAGATTATTGTTCAAGGAGATGACCAACAAGTAGAGCTTATGAGAAAGGAGCTTCAGTTGAGTGAAAAAGGCATGGTCTATGTCAAAGGCATTTT
TGAGCAATAA
mRNA sequenceShow/hide mRNA sequence
CTGACTTTCAACCATCATAATATGATTTAATGATCAATAAGGGTTCATGTAAATAGCAAATGACTTAGAGGATCAAACCATGATGGCCGCGCCTAACTATTTTTTACTCG
ATACTAATTTTGACCCTAATACCAACCCTTTACCCAACCCTAAATTTTTTTTCCAAAACAAAAAAAAAAAAGAGAGAGAGAGAGAGAAAGGGAGGGAGGGAGAGAGAGAG
ACCGTGTGAGTGAGTTTGGAGTTGGCTCGAGTGAAGCGAGCCTACCTTTGTGTGGGAAGGTAACGGCAACCACAAGATGAGGTAGCCGCGTCAGCCACCTCTCACTTCAC
AAACCAAACCTTTTTATCCTTTTCTTTTCTTCTGTCTTCTGCCTGTGGCTGTGGATGTGGCTGTCCCTCTTCTTCAGAAAAACTTACTCTCCATTTCTCTGCCGGACGGA
CACCTATGGCTTTGGCAGCAACCAAGCTACTACCATCTCACCTCCATCCTCTTCCTTCCCCTAAATCCTTCTCCACCGGAGACCTACCTCCGCCGTCATGCCTCCCCGGA
CCTCGCACTCTCGTCAGGCCGCAGAGATCGGTAACGGTGAGAGTCAATGCGGAGCCGCTCCTCGCTCTCCAATTGGACCACCACAACTCCGCCTTCTTCCTCGCTGACAC
CGTCGGCTACTCCATGGCTAGCTACTACACTTCTCTCGGCCTCTTCGTCATCTCCGTCCCTGGCTTGTGGTCACTCATCAAGCGATCCGTCAAATCCAAGGTTGTGAAGA
AGACATTCGTCGCCGAAGGAGAATCGAAGAAGGCGCCGAGCCAGATCGCCGGAGAGATCTTGTCATTCTTCACTCGCAACAACTTCCAAGTCACGGACAGAGGCGAAACC
ATAACATTTGAAGGAGCGATGGTGCCGAGTCGAGGCCAAGCGGCATTGCTGACATTCTGTACTTGCATTAGCCTGGCCAGCGTCGCCCTCGTCCTCACCATAACTTTTCC
AGATTTCGGCAACAACTGGTTCTGGATTAGCAGCCTCAGTCCCCTAGCAGGAGCATATTACTGGGTGAAAGCATCAAGAAGGGAGGAGATAAAGGTCAAAATGATAGTTG
CAGAAGGTGGAAAGCTTGCAGAGATTATTGTTCAAGGAGATGACCAACAAGTAGAGCTTATGAGAAAGGAGCTTCAGTTGAGTGAAAAAGGCATGGTCTATGTCAAAGGC
ATTTTTGAGCAATAATCATTCCTTGTTACCACTTCTCTACCTTTTTCTAGCAGTTCAAACTCAATTAGTAATCACTCCAACATTCCTCTGGAAAGAAAGCAAACAATTTT
GGAACTTGTACAACAATTTCCAAACACATGAATCATACAGATACGACATCAAAATTGAATGTTAATGTTCCCAGTTTCTCAACAGTATGGGTATCTCATCATCTAATCTA
TACCTTTCACTCATTCAGTCAAATTATTCAGATTCATTCATGCACCAAGCAATGACAGTTACAGTGTTCTCATTGCATAAAGATTTCAAAGGAATTTGAGGAGAAAAGAA
ACAAAACAGGTCTGCAAACAGATTCTTTGCCATTTATCTGCTTTTAAATAAAAATTGGGGGTCGCTCGGACCTCTGTATTACCTCGAAAAGAAGAGCTTGGTTTCTTAAC
CGATCAAAAGCTTGTTGTCCTCCAAAAAGGTGTAGGTGGGTACCTCCAAATTGTAGGGTGCTGGCCCTTTGCGGATCCTACCAGAAACATCGTAATGAGAACCATGGCAG
GGGCAAAACCACCCACCATAGTCTCCAGCATTTGGCAAGGGGATGCAACCGAGATGTGTGCAGACGCCAATCACGATAAGCCATTCTGGATTCTTAACTCTTTCTTCATC
CTGCTGTGGGTCACGAAGAGATCCAATGTCCACGCTATTTGCTAAGTTAATATCGTCCTCAGTTCGTCGCCTGATGAAAACTGGCTTTCCACGCCACTTGACAGTCACGG
TGGAACCAGGCTCGATGCTTGAGAGGTCAACCTCGAGCGAGGCCAGGGCAAGAACATCCTTACTGGCTGACATGCTAAGAACAAACTTGAGAACAAGGAGACGAATCAAG
GAAGCATAGACAAACCGACCACCTGACAAGACAAAATAAGCAAATGCACGCTTGCTAGGATCACCCGGAGGAAATCGCTCGTGATTGTAGTCGTCATAGACTATCTTTGA
AGAAGGGTTCTTAACAGCTGCTACAGTTGCAGGAATATCCATAATTAAGCTGTCCTCCTTCGTTGAGGTGAATGCATCAGAAGCAAAACCTAACAGAGAGTCATTTCCAT
AAAATCATGAAACCACGCCATGGTGGAAAGAAATAATCATGCATAAGGTGTTTAATCACAAATCAAAATTGAAAAGCAATCACCTAATCAATCATCTGGACAGAATGCAC
AAGATTTTGTAGTTTTTTAATTTAAAAAATGGTGCTTTTTTCTCACAATTTCTGTACTATATTTTTTACCTTCCTTAAAGAAACCTTTGAAATCTTCAGCCAAATTACTT
TTTTTTTTATATAAGAAACAATAATATTTCATTCAAAAAGAAGAGGAGCCCAAAAAAGGCAGAAAAACAGCCTAGAGGCAGAGGGTCGAGGTGGAAGTACAAAAACAAAA
TTAAAGCTTTTTAAAACTACTCTTTTTAGTTTTCACTTAAATTTTGAAAACATTTTTGAAAAGTATAAAACAAAACAAAGAAAGTGCTAGGTGGAATTAATATTTATAAG
ATTAATTTTCAAAAGCTAAAAGCTAAAACCAAGTGGTTATCAATCGGTAATTGATAAAGAACGATATAATTGCAATTTTTTGTTTAATAAGCATCAGATACAAAACCATT
TCCAGTGTGTTAATAGATTTCTTGCCACTATTTCTAACATTGTCAACTTTCATACAGAGATAGGAAGTAAAACTCCATAATTAGTAATGTACAAGTACATAGAGGAGAAT
TTACTATAACATTTGATTGTAAAACTAGTTGTGAATAAAGGAGGATCTGCAAAACCATCCATAACCAAAAAGGTCAGCCATAATGGCCAGTCAATTTTGCAAGATTGTAT
ACAAAACAACTGATAAGCAGTTCCACATTGGCATAAGCATCTTAAAAAATTTTGATCTGAATTAGTTGTTGTATTACTTAAACAAGTGGTTTCAATGTATATTACAGGGG
CTTAAAAGGTCAGCTCACAAGATAATAAAAGCATCATGAAATTGATCTTTCTCAATTTATGTTGCCTGATTTTTGGAGCAAATTTAAGGATTGTCTACGTTAGTTTAAGT
AAATTCATCTTCAGGGCCCAAGGAATAGGATTAAAGTTTTCCAATCTTTTAAAGTAGAAATGTCTTTACTTGGATTCTCCTCTTTGTCAAAGTTAATACAGTGGATATTC
TTCAAAAGAAAAACCCTTCTATGGTGTTTTTTCCTTCCATTTGCTCTCTGTCTGGAAAGTCTTGTGAGTTCCAGCAACATATGTTCTTTCGGTGCTCTTTTTCGACTACT
TGCTGGAGGCTTTTCTTCGAGATCTTTGAGGTTTCTTGAGTTTTTGATTTCGAAGTCGCAAAGAACGTTCATTCTCTGTTGATTGGCCTGTGTCTCGCGTCAAAGGCGGG
TCTTCTTTGGTGTAATGGAGTCAAAGCTGTGTTGTCTGATATTTGGTTCGAGAGAAATCAGAGGCTTTTCAAGAGGAAACGTCGAGATCCTCCTAGCAAAGTTTAAGGCT
TCCCAATGGTGTGCTCTTTCCAATGTTTTTGTTAATTACTCTCTGAGTATGATTTGTTCTAATTGGGAGGTTTTTATAACTCCCCCTTAGCTTTTGTTCTTTTTGCTTAG
TTTGTTGTTCTGTTTTCCGTTTTGTTTATTTTCTCACTACTTCGGAGGTTTGTATCATTGAACAATTTTCTGTTCCTTTTCATTAAATCAATGAAAAGTTTGTTTCTTGT
TCAAAAAAAAAGTAGAAACTTTGAAGTTATTTCATAAAGGATTATTTTTTCCTTTTGCATAAGAAATGGCATGATGCAGTTTTTCTAGGGGCACGGTTTGATTTTTTCTT
ATTCTAAATCTAATACTTCATTGTTTCATTCAAGTAAGAGGATCAATTATCCTCATACCTTCTTATACTTAATCTCAAACTAATTCATTAAAAATATTCTGATATAAAAA
TATAAAATACTGCAAGCTAAATCTTGTTTGCTCCCTTGGAAGGGATTCCTCCCACCTCCTGCCCCCTAGGCTGTTCTTTTTTGTGAATACAAAATCTCTTATTTCTTATA
ATAATAATAAAAAGAAAGAAAAATACAAGGGCTTAAACATAGTAGATAACAATTATACTCGCACTTTTATTAAGATGAAATGATGAAATGAAAGAAAAAACAAATGCCAT
ACAAAAAAACAAGCTTGACAAAAGAAGACCTGCCTATAGAAAGGATCTTCAATCCAAAAGAACAACTCTTAACTGATAATTACAAAAAATCTTTGATGCCGACACCACTT
GAAGGAATTAAATGTAATAATATCCCAAACTTTCTCCCAAGGCCTCTCAACCCCCTCTAGAATCTGCTATTCCTCTCAAGCAAAACATCCCACAACAGACATTCTCTATC
CCGAAAAGGAGGAAAAAGCTATAATCTGCCAGAACTATACTCTTTCAACAACAATAAACTATACCTCAACGCCCAACAAATAATCTGTTGATGGGGATTCCATATTGATA
TTGGTGAGTTTTTAAAAGAAACAATACATAACTTTCTAAGCACAGGAACTATGGGAAACGCTAAACTGAATAATTTTAATCTCCTTTAGAAAGGATGAGATATAACCTCA
CTCACGCCAAAACTTCTGTACTAGAAGAATCAGCAACAAAGTAGATGAATAATATCGTATAGTTTTCAGTGATTATAACATAAACGACAAAAAATAAAGCTAAAGTCTGA
AAATAAAACAGTGGAAGATTAAGAAATGAACGAAGAAGAAACAATTGCTTCGGTAAAGTGACATCGTTAAAACGCCCCGACGTAACAGTAATAACTATCACATTTTGAAA
TTATTTTCAAAACGAGAAACTAATATGCAAAAGCTGATCCAAAATAGACGATTCAATGAAGTTATACATCAATCACCATAAACAAGCCAGTCTTCGAATTTGAATCTGGT
AAGATTAGATTCAACATTTTGTTTAATAAAAATCCATATTTATTTATTTACTTCAACAATAGATTTCACGAAAACGCGTCACATATCGACAAAAACAAACCCTAAAAGAC
ACCATAAATCAACTGGGGGCAAGAACGACAAAAGCTCAGGACAAAAAAAAAAGTGACTCTAACCTCTGCAATTACGATCAAAGTGAAGACCGAAAGGGAAGGTGTTGAAG
GAGGCGACGGATCTGAAATCATTGGAAGG
Protein sequenceShow/hide protein sequence
MALAATKLLPSHLHPLPSPKSFSTGDLPPPSCLPGPRTLVRPQRSVTVRVNAEPLLALQLDHHNSAFFLADTVGYSMASYYTSLGLFVISVPGLWSLIKRSVKSKVVKKT
FVAEGESKKAPSQIAGEILSFFTRNNFQVTDRGETITFEGAMVPSRGQAALLTFCTCISLASVALVLTITFPDFGNNWFWISSLSPLAGAYYWVKASRREEIKVKMIVAE
GGKLAEIIVQGDDQQVELMRKELQLSEKGMVYVKGIFEQ