CuGenDBv2

CmaCh00G000340 (gene) of Cucurbita maxima (Rimu) v1.1 genome

Gene ID: CmaCh00G000340
Organism: Cucurbita maxima Rimu (Cucurbita maxima (Rimu) v1.1)
Description: Photosystem I assembly protein Ycf4
Genome location: Cma_Chr00:2610699..2615303
RNA-Seq Expression: CmaCh00G000340
Synteny: CmaCh00G000340
Gene Ontology terms:
GO:0015979 - photosynthesis (biological process)
GO:0009522 - photosystem I (cellular component)
GO:0009535 - chloroplast thylakoid membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
GO:0000287 - magnesium ion binding (molecular function)
GO:0003824 - catalytic activity (molecular function)
InterPro domains:
IPR000685 - Ribulose bisphosphate carboxylase, large subunit, C-terminal
IPR003359 - Photosystem I Ycf4, assembly
IPR036376 - Ribulose bisphosphate carboxylase, large subunit, C-terminal domain superfamily
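
The genome location field follows a "sequence:start..end" pattern. As a minimal sketch, assuming the coordinates are 1-based and inclusive (the page does not state its convention) and using a hypothetical helper named parse_location, the field can be split into its parts and the gene span computed like this:

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumes 1-based, inclusive coordinates (an assumption, not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string (hypothetical helper for illustration)."""
    match = re.fullmatch(
        r"(?P<seqid>[^:]+):(?P<start>\d+)\.\.(?P<end>\d+)", text.strip()
    )
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    return Location(match["seqid"], int(match["start"]), int(match["end"]))


loc = parse_location("Cma_Chr00:2610699..2615303")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the span for this gene comes out at 4605 bp.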


Homology
GenBank top hits (e value, % identity, alignment)
ASY96369.1 photosystem I assembly protein Ycf4 [Cucumis melo subsp. agrestis] (e value: 3.4e-38; % identity: 79.46)
Query:  GIVMSFYGIAGLFISSYLWCTILWN-----------KRRNSVYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT
        GIVMSFYGIAGLFISSYLWCTILWN           +   S++  GF G+N RIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLT TDQNLT
Subjt:  GIVMSFYGIAGLFISSYLWCTILWN-----------KRRNSVYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT

Query:  PREIEQKAAELA
        PREIEQKAAELA
Subjt:  PREIEQKAAELA
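
The % identity column can be checked against the alignment itself. The sketch below assumes the usual BLAST convention: a letter on the middle row marks an identical residue, "+" marks a conservative substitution, and identity is the number of identical positions divided by the alignment length including gap columns. The function name percent_identity is hypothetical. The rows are copied from the ASY96369.1 alignment above.

```python
# Query and midline rows copied from the ASY96369.1 alignment on this page.
QUERY_ROWS = [
    "GIVMSFYGIAGLFISSYLWCTILWN-----------KRRNSVYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT",
    "PREIEQKAAELA",
]
MIDLINE_ROWS = [
    "GIVMSFYGIAGLFISSYLWCTILWN           +   S++  GF G+N RIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLT TDQNLT",
    "PREIEQKAAELA",
]


def percent_identity(query_rows, midline_rows):
    """Return (identical, aligned_length, percent) for a BLAST-style alignment."""
    # Letters on the midline mark identical residues; '+' and blanks do not count.
    identical = sum(ch.isalpha() for row in midline_rows for ch in row)
    # The alignment length includes gap columns, so take it from the query rows.
    aligned_length = sum(len(row) for row in query_rows)
    return identical, aligned_length, 100.0 * identical / aligned_length


identical, length, pct = percent_identity(QUERY_ROWS, MIDLINE_ROWS)
print(f"{identical}/{length} = {pct:.2f}%")
```

For this hit the sketch counts 89 identical positions over 112 alignment columns, which is 79.46 % and agrees with the listed value.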

KAB2074847.1 hypothetical protein ES319_A07G178900v1, partial [Gossypium barbadense] (e value: 1.5e-38; % identity: 80.77)
Query:  LNHGIVMSFYGIAGLFISSYLWCTILWNKRRNSVYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLTPREIEQKA
        L  GIVMSFYGI GLFISSYLWCTI WN+    ++  GF G+N RIFLRFLMKDIQSIRIEVKEGIYARRVLYMEI+GQGA+PLT TD+NLTPREIEQKA
Subjt:  LNHGIVMSFYGIAGLFISSYLWCTILWNKRRNSVYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLTPREIEQKA

Query:  AELA
        AELA
Subjt:  AELA

PWZ03905.1 Ribulose bisphosphate carboxylase large chain [Zea mays] (e value: 7.0e-44; % identity: 49.02)
Query:  LRVNSGTFLNSVSLFYGLFSFLSFDSAFKTLQIQDSFI----FTSIPFCYDWVSLPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------
        LR++ G  ++S ++   L         F  L ++D FI       I F  DWVS+PGV+PVAS GIHVWHMPALTE+     +       FG G      
Subjt:  LRVNSGTFLNSVSLFYGLFSFLSFDSAFKTLQIQDSFI----FTSIPFCYDWVSLPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------

Query:  GNLPGAVANRVALEACVQARNEGRDLAREGANCSESESSRWLTFDKGLFDLNHGIVMSFYGIAGLFISSYLWCTILWNKRRNSVYFGGFRGQNPRIFLRF
        GN PGA ANRVALEACVQARNEGRDLAREG+            +D+  FD   GIV                C   W          GF G   RIFL+F
Subjt:  GNLPGAVANRVALEACVQARNEGRDLAREGANCSESESSRWLTFDKGLFDLNHGIVMSFYGIAGLFISSYLWCTILWNKRRNSVYFGGFRGQNPRIFLRF

Query:  LMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQN-LTPREIEQKAAELA
        L++DIQSIRI+VKEG+Y RR+LYMEIRGQG IPLT TD+   TPREIEQKAAELA
Subjt:  LMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQN-LTPREIEQKAAELA

THU42562.1 hypothetical protein C4D60_Mb00t15120 [Musa balbisiana] (e value: 2.5e-49; % identity: 48.2)
Query:  LPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------GNLPGAVANRVALEACVQARNEGRDLAREG------------------------
        +PGVLPVAS GIHVWHMPALTE+     +       FG G      GN PGAVANRVALEA VQARNEGRDLAR                          
Subjt:  LPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------GNLPGAVANRVALEACVQARNEGRDLAREG------------------------

Query:  --ANCSESESSRWLTFDKG-----------------------------------------LFDLNHGIVMSFYGIAGLFISSYLWCTILWN--------K
           N +      W+    G                                         +     GIVMSFYGIAGLFISSYLWCTI WN         
Subjt:  --ANCSESESSRWLTFDKG-----------------------------------------LFDLNHGIVMSFYGIAGLFISSYLWCTILWN--------K

Query:  RRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLTPREIEQKAAELA
        R+     ++  GF G N RIFLRF M+DIQSIRIEVKEG+Y RRVLYMEIRGQGAIPLT TD+N TPREIEQKAAELA
Subjt:  RRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLTPREIEQKAAELA

TVU46446.1 hypothetical protein EJB05_05986, partial [Eragrostis curvula] (e value: 3.3e-46; % identity: 39.47)
Query:  LRVNSGTFLNSVSLFYGLFSFLSFDSAFKTLQIQDSFI----FTSIPFCYDWVSLPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------
        LR++ G  ++S ++   L         F  L ++D +I       I F  DWVS+PGV+PVAS GIHVWHMPALTE+     +       FG G      
Subjt:  LRVNSGTFLNSVSLFYGLFSFLSFDSAFKTLQIQDSFI----FTSIPFCYDWVSLPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------

Query:  GNLPGAVANRVALEACVQARNEGRDLAREG-----ANCSES---------------------------------------------------------ES
        GN PGA ANRVALEACVQARNEGRDLAREG     A C  S                                                          S
Subjt:  GNLPGAVANRVALEACVQARNEGRDLAREG-----ANCSES---------------------------------------------------------ES

Query:  SR----------------------WLTFDKG------------------------------LFDLNHGIVMSFYGIAGLFISSYLWCTILWN--------
         R                      W+   KG                              +     G+VMSFYGIAGLFISSYLWCTILWN        
Subjt:  SR----------------------WLTFDKG------------------------------LFDLNHGIVMSFYGIAGLFISSYLWCTILWN--------

Query:  KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQN-LTPREIEQKAAELA
         R+     ++  GF G   RIFLRFL++DIQSIRI+VKEG+Y RR+LYMEIRGQG IPLT TD+   TPREIEQKAAELA
Subjt:  KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQN-LTPREIEQKAAELA

TrEMBL top hits (e value, % identity, alignment)
A0A218KG38 Photosystem I assembly protein Ycf4 (e value: 1.6e-38; % identity: 79.46)
Query:  GIVMSFYGIAGLFISSYLWCTILWN-----------KRRNSVYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT
        GIVMSFYGIAGLFISSYLWCTILWN           +   S++  GF G+N RIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLT TDQNLT
Subjt:  GIVMSFYGIAGLFISSYLWCTILWN-----------KRRNSVYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT

Query:  PREIEQKAAELA
        PREIEQKAAELA
Subjt:  PREIEQKAAELA

A0A3L6D5S1 Photosystem I assembly protein Ycf4 (e value: 3.4e-44; % identity: 49.02)
Query:  LRVNSGTFLNSVSLFYGLFSFLSFDSAFKTLQIQDSFI----FTSIPFCYDWVSLPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------
        LR++ G  ++S ++   L         F  L ++D FI       I F  DWVS+PGV+PVAS GIHVWHMPALTE+     +       FG G      
Subjt:  LRVNSGTFLNSVSLFYGLFSFLSFDSAFKTLQIQDSFI----FTSIPFCYDWVSLPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------

Query:  GNLPGAVANRVALEACVQARNEGRDLAREGANCSESESSRWLTFDKGLFDLNHGIVMSFYGIAGLFISSYLWCTILWNKRRNSVYFGGFRGQNPRIFLRF
        GN PGA ANRVALEACVQARNEGRDLAREG+            +D+  FD   GIV                C   W          GF G   RIFL+F
Subjt:  GNLPGAVANRVALEACVQARNEGRDLAREGANCSESESSRWLTFDKGLFDLNHGIVMSFYGIAGLFISSYLWCTILWNKRRNSVYFGGFRGQNPRIFLRF

Query:  LMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQN-LTPREIEQKAAELA
        L++DIQSIRI+VKEG+Y RR+LYMEIRGQG IPLT TD+   TPREIEQKAAELA
Subjt:  LMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQN-LTPREIEQKAAELA

A0A4S8I6E4 Photosystem I assembly protein Ycf4 (e value: 1.2e-49; % identity: 48.2)
Query:  LPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------GNLPGAVANRVALEACVQARNEGRDLAREG------------------------
        +PGVLPVAS GIHVWHMPALTE+     +       FG G      GN PGAVANRVALEA VQARNEGRDLAR                          
Subjt:  LPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------GNLPGAVANRVALEACVQARNEGRDLAREG------------------------

Query:  --ANCSESESSRWLTFDKG-----------------------------------------LFDLNHGIVMSFYGIAGLFISSYLWCTILWN--------K
           N +      W+    G                                         +     GIVMSFYGIAGLFISSYLWCTI WN         
Subjt:  --ANCSESESSRWLTFDKG-----------------------------------------LFDLNHGIVMSFYGIAGLFISSYLWCTILWN--------K

Query:  RRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLTPREIEQKAAELA
        R+     ++  GF G N RIFLRF M+DIQSIRIEVKEG+Y RRVLYMEIRGQGAIPLT TD+N TPREIEQKAAELA
Subjt:  RRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLTPREIEQKAAELA

A0A5J5V5Z9 Photosystem I assembly protein Ycf4 (Fragment) (e value: 7.3e-39; % identity: 80.77)
Query:  LNHGIVMSFYGIAGLFISSYLWCTILWNKRRNSVYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLTPREIEQKA
        L  GIVMSFYGI GLFISSYLWCTI WN+    ++  GF G+N RIFLRFLMKDIQSIRIEVKEGIYARRVLYMEI+GQGA+PLT TD+NLTPREIEQKA
Subjt:  LNHGIVMSFYGIAGLFISSYLWCTILWNKRRNSVYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLTPREIEQKA

Query:  AELA
        AELA
Subjt:  AELA

A0A5J9WER5 Photosystem I assembly protein Ycf4 (Fragment) (e value: 1.6e-46; % identity: 39.47)
Query:  LRVNSGTFLNSVSLFYGLFSFLSFDSAFKTLQIQDSFI----FTSIPFCYDWVSLPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------
        LR++ G  ++S ++   L         F  L ++D +I       I F  DWVS+PGV+PVAS GIHVWHMPALTE+     +       FG G      
Subjt:  LRVNSGTFLNSVSLFYGLFSFLSFDSAFKTLQIQDSFI----FTSIPFCYDWVSLPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------

Query:  GNLPGAVANRVALEACVQARNEGRDLAREG-----ANCSES---------------------------------------------------------ES
        GN PGA ANRVALEACVQARNEGRDLAREG     A C  S                                                          S
Subjt:  GNLPGAVANRVALEACVQARNEGRDLAREG-----ANCSES---------------------------------------------------------ES

Query:  SR----------------------WLTFDKG------------------------------LFDLNHGIVMSFYGIAGLFISSYLWCTILWN--------
         R                      W+   KG                              +     G+VMSFYGIAGLFISSYLWCTILWN        
Subjt:  SR----------------------WLTFDKG------------------------------LFDLNHGIVMSFYGIAGLFISSYLWCTILWN--------

Query:  KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQN-LTPREIEQKAAELA
         R+     ++  GF G   RIFLRFL++DIQSIRI+VKEG+Y RR+LYMEIRGQG IPLT TD+   TPREIEQKAAELA
Subjt:  KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQN-LTPREIEQKAAELA

SwissProt top hits (e value, % identity, alignment)
A0ZZ46 Photosystem I assembly protein Ycf4 (e value: 1.1e-39; % identity: 76.79)
Query:  GIVMSFYGIAGLFISSYLWCTILWN--------KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT
        GIVMSFYGIAGLFISSYLWCTI WN         R+     ++  GF G+N RIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGA+PLT TD+NLT
Subjt:  GIVMSFYGIAGLFISSYLWCTILWN--------KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT

Query:  PREIEQKAAELA
        PREIEQKAAELA
Subjt:  PREIEQKAAELA

Q09G35 Photosystem I assembly protein Ycf4 (e value: 2.7e-38; % identity: 75)
Query:  GIVMSFYGIAGLFISSYLWCTILWN--------KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT
        GIVMSFYGIAGLFISSYLWCTI WN         R+     ++  GF G+N RIFLRF MKDIQSIRIEVKEG+Y RRVLYMEIRGQGAIPLT TD+NLT
Subjt:  GIVMSFYGIAGLFISSYLWCTILWN--------KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT

Query:  PREIEQKAAELA
        PREIEQKAAELA
Subjt:  PREIEQKAAELA

Q2L913 Photosystem I assembly protein Ycf4 (e value: 1.1e-39; % identity: 76.79)
Query:  GIVMSFYGIAGLFISSYLWCTILWN--------KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT
        GIVMSFYGIAGLFISSYLWCTI WN         R+     ++  GF G+N RIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGA+PLT TD+NLT
Subjt:  GIVMSFYGIAGLFISSYLWCTILWN--------KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT

Query:  PREIEQKAAELA
        PREIEQKAAELA
Subjt:  PREIEQKAAELA

Q2MIH6 Photosystem I assembly protein Ycf4 (e value: 2.7e-38; % identity: 74.11)
Query:  GIVMSFYGIAGLFISSYLWCTILWN--------KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT
        GIVMSFYGIAGLFISSYLWCTI WN         R+     ++  GF G+N RIFLRFL+KDIQS+RIEVKEGIYARRVLYM+IRGQG+IPLT TD+NLT
Subjt:  GIVMSFYGIAGLFISSYLWCTILWN--------KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT

Query:  PREIEQKAAELA
        PREIEQKAAELA
Subjt:  PREIEQKAAELA

Q2QD78 Photosystem I assembly protein Ycf4 (e value: 4.4e-41; % identity: 79.46)
Query:  GIVMSFYGIAGLFISSYLWCTILWN-----------KRRNSVYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT
        GIVMSFYGIAGLFISSYLWCTILWN           +   S++  GF G+N RIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLT TDQNLT
Subjt:  GIVMSFYGIAGLFISSYLWCTILWN-----------KRRNSVYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT

Query:  PREIEQKAAELA
        PREIEQKAAELA
Subjt:  PREIEQKAAELA

Arabidopsis top hits (e value, % identity, alignment)
ATCG00490.1 ribulose-bisphosphate carboxylases (e value: 9.5e-23; % identity: 63.83)
Query:  FCYDWVSLPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------GNLPGAVANRVALEACVQARNEGRDLAREGANCSESESSRW
        F  DWVSLPGVLPVAS GIHVWHMPALTE+     +       FG G      GN PGAVANRVALEACVQARNEGRDLA EG N    E+ +W
Subjt:  FCYDWVSLPGVLPVASSGIHVWHMPALTEMEMIPYYTPIRRRNFGAG------GNLPGAVANRVALEACVQARNEGRDLAREGANCSESESSRW

ATCG00520.1 unfolded protein binding (e value: 1.5e-36; % identity: 71.43)
Query:  GIVMSFYGIAGLFISSYLWCTILWN--------KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT
        GIVMSFYGIAGLFIS YLWCTILWN         R+     ++  GF G++ RIFLRF MKDIQSIRIEVKEG+ ARRVLYMEIRGQGAIPL  TD+N T
Subjt:  GIVMSFYGIAGLFISSYLWCTILWN--------KRRNS---VYFGGFRGQNPRIFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLT

Query:  PREIEQKAAELA
         REIEQKAAELA
Subjt:  PREIEQKAAELA


Sequences
CDS sequence
ATGCTACTGTGCGGGTACATGCAAAGAAATGATGAAAAGGGCTTTGCTCGAGAATTTGGAGTTATTCTTTCACGAACTGTAAACAAAGAGAAATACTGCTTATTCCTAAG
AGTGAACTCCGGAACCTTTTTGAACTCGGTTTCACTATTTTATGGTCTCTTCTCTTTTCTTTCCTTTGATTCTGCTTTCAAGACTCTTCAAATTCAAGATTCATTCATCT
TCACTTCTATTCCCTTCTGTTATGATTGGGTCTCTTTACCAGGTGTTCTGCCAGTGGCTTCCAGTGGTATTCATGTTTGGCATATGCCTGCTCTAACCGAGATGGAGATG
ATTCCGTACTACACTCCAATTCGGCGGAGGAACTTTGGGGCTGGGGGGAATTTACCAGGTGCCGTAGCTAACCGAGTAGCTCTAGAAGCATGTGTACAAGCTCGTAATGA
GGGCCGTGATCTTGCTCGTGAGGGTGCAAATTGTAGTGAAAGTGAGAGTTCCAGGTGGCTGACGTTTGATAAGGGCTTATTCGATCTAAACCACGGGATCGTAATGTCTT
TCTATGGGATTGCAGGTCTCTTTATTAGCTCCTATTTGTGGTGCACAATTTTGTGGAATAAAAGAAGGAATAGTGTCTATTTTGGGGGGTTTCGTGGACAAAATCCTAGA
ATCTTCCTCCGATTCCTTATGAAAGACATTCAGTCCATCAGAATAGAAGTGAAAGAGGGTATTTATGCACGTCGTGTCCTTTATATGGAAATCAGAGGCCAAGGGGCCAT
TCCCTTGACTTGTACCGATCAGAATCTAACCCCACGAGAAATTGAGCAAAAGGCTGCCGAATTGGCTGGCCTATTTCTTGCGTGTATAGGATTCCATTCGCCCCATGGTT
GA
mRNA sequence
ATGCTACTGTGCGGGTACATGCAAAGAAATGATGAAAAGGGCTTTGCTCGAGAATTTGGAGTTATTCTTTCACGAACTGTAAACAAAGAGAAATACTGCTTATTCCTAAG
AGTGAACTCCGGAACCTTTTTGAACTCGGTTTCACTATTTTATGGTCTCTTCTCTTTTCTTTCCTTTGATTCTGCTTTCAAGACTCTTCAAATTCAAGATTCATTCATCT
TCACTTCTATTCCCTTCTGTTATGATTGGGTCTCTTTACCAGGTGTTCTGCCAGTGGCTTCCAGTGGTATTCATGTTTGGCATATGCCTGCTCTAACCGAGATGGAGATG
ATTCCGTACTACACTCCAATTCGGCGGAGGAACTTTGGGGCTGGGGGGAATTTACCAGGTGCCGTAGCTAACCGAGTAGCTCTAGAAGCATGTGTACAAGCTCGTAATGA
GGGCCGTGATCTTGCTCGTGAGGGTGCAAATTGTAGTGAAAGTGAGAGTTCCAGGTGGCTGACGTTTGATAAGGGCTTATTCGATCTAAACCACGGGATCGTAATGTCTT
TCTATGGGATTGCAGGTCTCTTTATTAGCTCCTATTTGTGGTGCACAATTTTGTGGAATAAAAGAAGGAATAGTGTCTATTTTGGGGGGTTTCGTGGACAAAATCCTAGA
ATCTTCCTCCGATTCCTTATGAAAGACATTCAGTCCATCAGAATAGAAGTGAAAGAGGGTATTTATGCACGTCGTGTCCTTTATATGGAAATCAGAGGCCAAGGGGCCAT
TCCCTTGACTTGTACCGATCAGAATCTAACCCCACGAGAAATTGAGCAAAAGGCTGCCGAATTGGCTGGCCTATTTCTTGCGTGTATAGGATTCCATTCGCCCCATGGTT
GA
Protein sequence
MLLCGYMQRNDEKGFAREFGVILSRTVNKEKYCLFLRVNSGTFLNSVSLFYGLFSFLSFDSAFKTLQIQDSFIFTSIPFCYDWVSLPGVLPVASSGIHVWHMPALTEMEM
IPYYTPIRRRNFGAGGNLPGAVANRVALEACVQARNEGRDLAREGANCSESESSRWLTFDKGLFDLNHGIVMSFYGIAGLFISSYLWCTILWNKRRNSVYFGGFRGQNPR
IFLRFLMKDIQSIRIEVKEGIYARRVLYMEIRGQGAIPLTCTDQNLTPREIEQKAAELAGLFLACIGFHSPHG
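
The protein sequence is the in-frame translation of the CDS. The sketch below illustrates that relationship on the first 30 and last 45 nucleotides of the CDS listed above. It assumes the standard genetic code (NCBI translation table 1); the page does not say which translation table was used, and the function name translate is hypothetical.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), codons ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for codons with ambiguous bases
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS above.
print(translate("ATGCTACTGTGCGGGTACATGCAAAGAAAT"))
# Last 45 nucleotides of the CDS above, ending in the TGA stop codon.
print(translate("GGCCTATTTCTTGCGTGTATAGGATTCCATTCGCCCCATGGTTGA"))
```

The two fragments translate to MLLCGYMQRN and GLFLACIGFHSPHG, which are the first ten and last fourteen residues of the protein sequence shown above. Passing the full CDS, line breaks included, works the same way.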