; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Clc04G01450 (gene) of Watermelon (cordophanus) v2 genome

Gene IDClc04G01450
OrganismCitrullus lanatus subsp. cordophanus (Watermelon (cordophanus) v2)
DescriptionPeptidyl-prolyl cis-trans isomerase CYP23 isoform X1
Genome locationClcChr04:3945593..3948579
RNA-Seq ExpressionClc04G01450
SyntenyClc04G01450
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0032235.1 peptidyl-prolyl cis-trans isomerase CYP23 isoform X1 [Cucumis melo var. makuwa]7.0e-1927.29Show/hide
Query:  PKIRGRKEFWEELGDLYGICDPRRCVAGDFNV----GGLSFVGKLR-------------------ALEGTL-----------------KTWNREVFGDV-
        P+I GR  FWEELGDLY  C  R CV GDFNV       +  GK+                     +EG                   K W  E FG + 
Subjt:  PKIRGRKEFWEELGDLYGICDPRRCVAGDFNV----GGLSFVGKLR-------------------ALEGTL-----------------KTWNREVFGDV-

Query:  -----------KIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVI--RKENMNWSQNMKEQISR----------LDGGWFLYAFFQDNWNRL
                   K KKKE   RI +ID  E  G ++  L EER +++    E I  R++  +      E I +          L    F  AFFQD+W+ +
Subjt:  -----------KIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVI--RKENMNWSQNMKEQISR----------LDGGWFLYAFFQDNWNRL

Query:  KGELENVFKEFFERGLYKGFQRFKTAGPFISFLIPFDRGRVKSNHSWGWKVALFSPLRWGLRRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWG
        KG+LE VFKEFFERG+       K        LIP      K N +   KV  F P+                     S   + + IL+ +L    R   
Subjt:  KGELENVFKEFFERGLYKGFQRFKTAGPFISFLIPFDRGRVKSNHSWGWKVALFSPLRWGLRRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWG

Query:  LKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLPLGGNLRAISFWDAPLKKIRK--------------SSEKLMRDLKEGKGPHLVSWEVAG
                      V+ +N +D R     +      + +        RA+SFW+   +KI+K                  +   + EG   +L+S E  G
Subjt:  LKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLPLGGNLRAISFWDAPLKKIRK--------------SSEKLMRDLKEGKGPHLVSWEVAG

Query:  KLVN-WGVGARKFKVTGQSPMAKWLWRFALEPEALW
          V+  G+G    ++  ++ +AKWLW F LEP++LW
Subjt:  KLVN-WGVGARKFKVTGQSPMAKWLWRFALEPEALW

RVW63564.1 Transposon TX1 uncharacterized 149 kDa protein [Vitis vinifera]7.0e-1923.71Show/hide
Query:  GLSFVGKLRALEGTLKTWNREVFGDVKIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVIRKENMNWSQNMK--------------------
        G  F+ KL+ ++  LK WN+  FG +K +KK I   I  ID  E EG +   L  +R   +GE  E+I +E ++W Q  K                    
Subjt:  GLSFVGKLRALEGTLKTWNREVFGDVKIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVIRKENMNWSQNMK--------------------

Query:  --------------------------------EQISRLDGGW----------------------FLYAFFQDNWNRLKGELENVFKEFFERG--------
                                        E  SRLD  +                      F  A FQD WN +K +L  VF EF   G        
Subjt:  --------------------------------EQISRLDGGW----------------------FLYAFFQDNWNRLKGELENVFKEFFERG--------

Query:  --------------------------LYKGF-------------------QRFKTAG-PFISFLIPFDRG--RVK-------------SNHSWGWKVALF
                                  LYK                      R K +G   + F I F++    VK             S+    W     
Subjt:  --------------------------LYKGF-------------------QRFKTAG-PFISFLIPFDRG--RVK-------------SNHSWGWKVALF

Query:  SPLRWGL-----------RRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWGLKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLPL
        S + + +           R  ++HLQFA DT+ F +  E     L  +L    +  GLK+       + + + + + +     +D K   +P  YLGLPL
Subjt:  SPLRWGL-----------RRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWGLKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLPL

Query:  GGNLRAISFWDAPLKKIRKSSEKLMR------DLKEGKGPHLVSWEVAGKLVNW-GVGARKFKVTGQSPMAKWLWRFALEPEALW
        GGN  A  FWD  +++I +  +   +       + EGK  HLV WE + +   + G+G  K  +  ++ + KWLWRF  E  +LW
Subjt:  GGNLRAISFWDAPLKKIRKSSEKLMR------DLKEGKGPHLVSWEVAGKLVNW-GVGARKFKVTGQSPMAKWLWRFALEPEALW

RVX09326.1 LINE-1 retrotransposable element ORF2 protein [Vitis vinifera]6.5e-2525.62Show/hide
Query:  WSLWSP-KIRGRKEFWEELGDLYGICDPRRCVAGDFNV---------------------------------------GGLSFVGKLRALEGTLKTWNREV
        W ++ P K   RK+FW EL DL+G+  PR CV GDFNV                                        G  F+ KL+ ++  LK WN  V
Subjt:  WSLWSP-KIRGRKEFWEELGDLYGICDPRRCVAGDFNV---------------------------------------GGLSFVGKLRALEGTLKTWNREV

Query:  FGDVKIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVIRKE---NMNWSQNMKEQISRLDGGWFLYAFFQDNWNRLKGELENVFKEFFERGL
        FGD++ +KK I   +  ID  E EG ++  L  ER   R E  + + KE    M   Q  KE+    D   F  A +Q+ W+ +K +L  VF EF  +G+
Subjt:  FGDVKIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVIRKE---NMNWSQNMKEQISRLDGGWFLYAFFQDNWNRLKGELENVFKEFFERGL

Query:  YKGFQR-------FKTAGPFI-------SFLIPFD-------RGRVKSNHSW-------------------GWKVAL--------FSPLRWGL-------
          G  R       F + G F+       + LI  +       +G  +   SW                   GW  A          SP  + L       
Subjt:  YKGFQR-------FKTAGPFI-------SFLIPFD-------RGRVKSNHSW-------------------GWKVAL--------FSPLRWGL-------

Query:  -------------------RRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWGLKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLP
                           R  +S LQFA DT+FF   + +    L  IL    +  GLK+         +   +   +     ++ +V  +P SYLGL 
Subjt:  -------------------RRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWGLKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLP

Query:  LGGNLRAISFWDAPLKKIRK------------------------------------------SSEKLMRDL-----KEGKGPHLVSWEVAGKLVNW-GVG
        LGGN + I FWD  +++I +                                            EK+ RD      ++GK  HL+ WEV  +     G+G
Subjt:  LGGNLRAISFWDAPLKKIRK------------------------------------------SSEKLMRDL-----KEGKGPHLVSWEVAGKLVNW-GVG

Query:  ARKFKVTGQSPMAKWLWRFALEPEALW
          K  +   + + KWLWRF  E   LW
Subjt:  ARKFKVTGQSPMAKWLWRFALEPEALW

TYK03178.1 hypothetical protein E5676_scaffold11G00830 [Cucumis melo var. makuwa]5.7e-2143.71Show/hide
Query:  KIRGRKEFWEELGDLYGICDPRRCVAGDFNV---------------------------------------GGLSFVGKLRALEGTLKTWNREVFGDVKIK
        +IRGR+  W+ELGDLYG C  R  V GDFNV                                        G  F+ KLRAL   L+ WNR+VFGD++IK
Subjt:  KIRGRKEFWEELGDLYGICDPRRCVAGDFNV---------------------------------------GGLSFVGKLRALEGTLKTWNREVFGDVKIK

Query:  KKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVIRKENMNWSQNMK
        KKE+  RI EID  E EG +D AL+EER + +G+ AE+IRKEN++WSQ  K
Subjt:  KKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVIRKENMNWSQNMK

XP_028056784.1 uncharacterized protein LOC114260796 [Camellia sinensis]1.4e-2229.57Show/hide
Query:  GLSFVGKLRALEGTLKTWNREVFGDVKIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGE------------------YAEVIRKENMNWSQNMKEQ
        G  F+ +LR ++  LK W REVFGD    K E+ + I E+D+KE    +   LR  R    G+                    E+I KE  N   N+   
Subjt:  GLSFVGKLRALEGTLKTWNREVFGDVKIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGE------------------YAEVIRKENMNWSQNMKEQ

Query:  ISRLDGGWFLYAFFQDNWNRL----KGELENVFKEFFERGLYKGFQRFKTAGPFISFLIPFDRGRVKSNHSWGWKVALFSPLRWGLRRWLSHLQFAADTM
            DGG         NW  +       LE +F E   R     ++R K  GP ++ ++    GR+         +  F   R G+    +HLQFA DT+
Subjt:  ISRLDGGWFLYAFFQDNWNRL----KGELENVFKEFFERGLYKGFQRFKTAGPFISFLIPFDRGRVKSNHSWGWKVALFSPLRWGLRRWLSHLQFAADTM

Query:  FFYSGNENSFIILSHILGFSRRCWGLKLTRANVKFWELTVIKINFADGR-----TWVDYKVGSFPSSYLGLPLGGNLRAISFWDAPLKKIRK--------
        F     E+    L  IL       GLK     V   + +V+ IN AD         +   + SFP  YLGLPLGG+ R +SFW+  L KIRK        
Subjt:  FFYSGNENSFIILSHILGFSRRCWGLKLTRANVKFWELTVIKINFADGR-----TWVDYKVGSFPSSYLGLPLGGNLRAISFWDAPLKKIRK--------

Query:  ----------------------------------SSEKLMRDL-----KEGKGPHLVSWE-VAGKLVNWGVGARKFKVTGQSPMAKWLWRFALEPEALW
                                            EKLMRD      +EGKG HLV WE V+      G+          + + KWLWRF LE E+LW
Subjt:  ----------------------------------SSEKLMRDL-----KEGKGPHLVSWE-VAGKLVNWGVGARKFKVTGQSPMAKWLWRFALEPEALW

TrEMBL top hitse value%identityAlignment
A0A438FUE8 Transposon TX1 uncharacterized 149 kDa protein3.4e-1923.71Show/hide
Query:  GLSFVGKLRALEGTLKTWNREVFGDVKIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVIRKENMNWSQNMK--------------------
        G  F+ KL+ ++  LK WN+  FG +K +KK I   I  ID  E EG +   L  +R   +GE  E+I +E ++W Q  K                    
Subjt:  GLSFVGKLRALEGTLKTWNREVFGDVKIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVIRKENMNWSQNMK--------------------

Query:  --------------------------------EQISRLDGGW----------------------FLYAFFQDNWNRLKGELENVFKEFFERG--------
                                        E  SRLD  +                      F  A FQD WN +K +L  VF EF   G        
Subjt:  --------------------------------EQISRLDGGW----------------------FLYAFFQDNWNRLKGELENVFKEFFERG--------

Query:  --------------------------LYKGF-------------------QRFKTAG-PFISFLIPFDRG--RVK-------------SNHSWGWKVALF
                                  LYK                      R K +G   + F I F++    VK             S+    W     
Subjt:  --------------------------LYKGF-------------------QRFKTAG-PFISFLIPFDRG--RVK-------------SNHSWGWKVALF

Query:  SPLRWGL-----------RRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWGLKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLPL
        S + + +           R  ++HLQFA DT+ F +  E     L  +L    +  GLK+       + + + + + +     +D K   +P  YLGLPL
Subjt:  SPLRWGL-----------RRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWGLKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLPL

Query:  GGNLRAISFWDAPLKKIRKSSEKLMR------DLKEGKGPHLVSWEVAGKLVNW-GVGARKFKVTGQSPMAKWLWRFALEPEALW
        GGN  A  FWD  +++I +  +   +       + EGK  HLV WE + +   + G+G  K  +  ++ + KWLWRF  E  +LW
Subjt:  GGNLRAISFWDAPLKKIRKSSEKLMR------DLKEGKGPHLVSWEVAGKLVNW-GVGARKFKVTGQSPMAKWLWRFALEPEALW

A0A438G4S3 Uncharacterized protein3.4e-1925.59Show/hide
Query:  WEGVFYHWSLWSP-KIRGRKEFWEELGDLYGICDPRRCVAGDFNV--------GGLSFVGKLRALEGTLK-------TWNREVFGDVKIKKKEIPKRIEE
        W+ V   ++L+ P K   RK+FW EL DL+G+  PR CV GDFNV        G       +R  +  ++             F    ++   I KR E 
Subjt:  WEGVFYHWSLWSP-KIRGRKEFWEELGDLYGICDPRRCVAGDFNV--------GGLSFVGKLRALEGTLK-------TWNREVFGDVKIKKKEIPKRIEE

Query:  IDLKEGEGLMDPALREERDTLRGEYAEVIRKENMNWSQNMKEQISRLDGGW----------------------FLYAFFQDNWNRLKGELENVFKEFFER
        + L   E  +      +  T+ G   +  + E ++W+   +E    LD  +                      F  A +Q+ W+ +K +L  VF EF  +
Subjt:  IDLKEGEGLMDPALREERDTLRGEYAEVIRKENMNWSQNMKEQISRLDGGW----------------------FLYAFFQDNWNRLKGELENVFKEFFER

Query:  GLYKGFQR-------FKTAGPFI-------SFLIPFDRGRVKSNHSWGWKVALFSPLRWGLRRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWG
        G+  G  R       F + G F+       + LI       K +H    K   FS  +W  R W+     +++     +GN   ++  S          G
Subjt:  GLYKGFQR-------FKTAGPFI-------SFLIPFDRGRVKSNHSWGWKVALFSPLRWGLRRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWG

Query:  LKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLPLGGNLRAISFWDAPLKKIRKSSEKLMRDLKEGKGPHLVSWEVAGKLVNW-GVGARKFK
        L+  R++  F   +  + +          KV  +P SYLGLPLGGN + I FWD  ++K+++  + L    +  +  HL+ WEV  +     G+G  K  
Subjt:  LKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLPLGGNLRAISFWDAPLKKIRKSSEKLMRDLKEGKGPHLVSWEVAGKLVNW-GVGARKFK

Query:  VTGQSPMAKWLWRFALEPEALW
        +   + + KWLWRF  E   LW
Subjt:  VTGQSPMAKWLWRFALEPEALW

A0A438JK40 LINE-1 retrotransposable element ORF2 protein3.2e-2525.62Show/hide
Query:  WSLWSP-KIRGRKEFWEELGDLYGICDPRRCVAGDFNV---------------------------------------GGLSFVGKLRALEGTLKTWNREV
        W ++ P K   RK+FW EL DL+G+  PR CV GDFNV                                        G  F+ KL+ ++  LK WN  V
Subjt:  WSLWSP-KIRGRKEFWEELGDLYGICDPRRCVAGDFNV---------------------------------------GGLSFVGKLRALEGTLKTWNREV

Query:  FGDVKIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVIRKE---NMNWSQNMKEQISRLDGGWFLYAFFQDNWNRLKGELENVFKEFFERGL
        FGD++ +KK I   +  ID  E EG ++  L  ER   R E  + + KE    M   Q  KE+    D   F  A +Q+ W+ +K +L  VF EF  +G+
Subjt:  FGDVKIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVIRKE---NMNWSQNMKEQISRLDGGWFLYAFFQDNWNRLKGELENVFKEFFERGL

Query:  YKGFQR-------FKTAGPFI-------SFLIPFD-------RGRVKSNHSW-------------------GWKVAL--------FSPLRWGL-------
          G  R       F + G F+       + LI  +       +G  +   SW                   GW  A          SP  + L       
Subjt:  YKGFQR-------FKTAGPFI-------SFLIPFD-------RGRVKSNHSW-------------------GWKVAL--------FSPLRWGL-------

Query:  -------------------RRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWGLKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLP
                           R  +S LQFA DT+FF   + +    L  IL    +  GLK+         +   +   +     ++ +V  +P SYLGL 
Subjt:  -------------------RRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWGLKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLP

Query:  LGGNLRAISFWDAPLKKIRK------------------------------------------SSEKLMRDL-----KEGKGPHLVSWEVAGKLVNW-GVG
        LGGN + I FWD  +++I +                                            EK+ RD      ++GK  HL+ WEV  +     G+G
Subjt:  LGGNLRAISFWDAPLKKIRK------------------------------------------SSEKLMRDL-----KEGKGPHLVSWEVAGKLVNW-GVG

Query:  ARKFKVTGQSPMAKWLWRFALEPEALW
          K  +   + + KWLWRF  E   LW
Subjt:  ARKFKVTGQSPMAKWLWRFALEPEALW

A0A5A7SSQ8 Peptidyl-prolyl cis-trans isomerase CYP23 isoform X13.4e-1927.29Show/hide
Query:  PKIRGRKEFWEELGDLYGICDPRRCVAGDFNV----GGLSFVGKLR-------------------ALEGTL-----------------KTWNREVFGDV-
        P+I GR  FWEELGDLY  C  R CV GDFNV       +  GK+                     +EG                   K W  E FG + 
Subjt:  PKIRGRKEFWEELGDLYGICDPRRCVAGDFNV----GGLSFVGKLR-------------------ALEGTL-----------------KTWNREVFGDV-

Query:  -----------KIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVI--RKENMNWSQNMKEQISR----------LDGGWFLYAFFQDNWNRL
                   K KKKE   RI +ID  E  G ++  L EER +++    E I  R++  +      E I +          L    F  AFFQD+W+ +
Subjt:  -----------KIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVI--RKENMNWSQNMKEQISR----------LDGGWFLYAFFQDNWNRL

Query:  KGELENVFKEFFERGLYKGFQRFKTAGPFISFLIPFDRGRVKSNHSWGWKVALFSPLRWGLRRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWG
        KG+LE VFKEFFERG+       K        LIP      K N +   KV  F P+                     S   + + IL+ +L    R   
Subjt:  KGELENVFKEFFERGLYKGFQRFKTAGPFISFLIPFDRGRVKSNHSWGWKVALFSPLRWGLRRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWG

Query:  LKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLPLGGNLRAISFWDAPLKKIRK--------------SSEKLMRDLKEGKGPHLVSWEVAG
                      V+ +N +D R     +      + +        RA+SFW+   +KI+K                  +   + EG   +L+S E  G
Subjt:  LKLTRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLPLGGNLRAISFWDAPLKKIRK--------------SSEKLMRDLKEGKGPHLVSWEVAG

Query:  KLVN-WGVGARKFKVTGQSPMAKWLWRFALEPEALW
          V+  G+G    ++  ++ +AKWLW F LEP++LW
Subjt:  KLVN-WGVGARKFKVTGQSPMAKWLWRFALEPEALW

A0A5D3BW32 Uncharacterized protein2.8e-2143.71Show/hide
Query:  KIRGRKEFWEELGDLYGICDPRRCVAGDFNV---------------------------------------GGLSFVGKLRALEGTLKTWNREVFGDVKIK
        +IRGR+  W+ELGDLYG C  R  V GDFNV                                        G  F+ KLRAL   L+ WNR+VFGD++IK
Subjt:  KIRGRKEFWEELGDLYGICDPRRCVAGDFNV---------------------------------------GGLSFVGKLRALEGTLKTWNREVFGDVKIK

Query:  KKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVIRKENMNWSQNMK
        KKE+  RI EID  E EG +D AL+EER + +G+ AE+IRKEN++WSQ  K
Subjt:  KKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVIRKENMNWSQNMK

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTTTGCTCCGTTGGTTAATGCCATGGGTGACGTGATAAGGGGAGGAGGCTGCTTTTTATCCAACGGGGGAGTTGTTGGTGATCTTTCTTTCTTGAGCTCGAACCTACT
TGGCATCCCAACTCATCATGCTCATGGGGATGTGGGTACTAAGGGGTGGGCAGCTCTAGCCTCTATAGGTGTTCTTTCCGTGGAGGCTAAGGTCCCTCGTTGGCCCTTGT
TGATCAGGATATATGGATGGGAAGGAGTTTTCTATCACTGGAGTCTATGGTCCCCCAAGATCCGAGGGAGGAAGGAGTTTTGGGAGGAGTTGGGGGACCTCTATGGCATT
TGTGACCCTAGACGGTGTGTGGCTGGAGATTTTAATGTGGGAGGGTTATCATTTGTGGGGAAACTAAGAGCTCTAGAAGGGACCCTTAAAACGTGGAATAGAGAGGTTTT
TGGAGACGTCAAAATCAAAAAGAAAGAAATCCCTAAGAGAATTGAGGAGATTGATTTGAAGGAGGGGGAGGGTCTCATGGACCCCGCTTTGAGAGAGGAGAGAGATACTC
TAAGAGGGGAGTATGCTGAAGTGATTAGGAAGGAGAATATGAATTGGAGTCAAAATATGAAAGAGCAAATCTCCCGGTTGGATGGTGGATGGTTTCTCTATGCCTTTTTC
CAGGATAATTGGAATCGGCTAAAGGGGGAGTTAGAAAATGTCTTCAAAGAGTTCTTTGAAAGAGGGCTCTATAAAGGCTTCCAGAGGTTTAAAACAGCAGGACCCTTTAT
CTCCTTTCTTATTCCTTTTGACAGAGGTCGTGTTAAGTCGAATCATTCATGGGGGTGGAAGGTGGCATTATTTAGTCCTTTAAGGTGGGGGCTAAGGAGGTGGCTCTCGC
ACTTACAATTTGCTGCTGACACGATGTTCTTCTATTCCGGCAATGAGAATTCCTTCATTATTCTTAGCCATATCTTGGGTTTTTCGAGGCGATGTTGGGGCTTAAAATTA
ACAAGAGCAAATGTCAAATTTTGGGAATTAACTGTGATCAAGATAAACTTTGCAGATGGGCGAACATGGGTTGACTATAAGGTTGGCTCCTTTCCTTCTTCTTACCTAGG
CCTCCCCCTTGGGGGCAATTTGAGAGCCATCTCGTTTTGGGATGCTCCTCTCAAGAAGATTAGGAAAAGTAGTGAGAAGCTCATGAGAGACTTGAAGGAAGGGAAGGGGC
CCCATCTAGTTAGTTGGGAGGTGGCAGGGAAGCTTGTGAATTGGGGGGTTGGAGCTAGGAAATTTAAGGTTACGGGACAAAGCCCTATGGCTAAATGGCTTTGGCGTTTT
GCTCTAGAGCCCGAAGCTTTGTGGTAG
mRNA sequenceShow/hide mRNA sequence
ATGTTTGCTCCGTTGGTTAATGCCATGGGTGACGTGATAAGGGGAGGAGGCTGCTTTTTATCCAACGGGGGAGTTGTTGGTGATCTTTCTTTCTTGAGCTCGAACCTACT
TGGCATCCCAACTCATCATGCTCATGGGGATGTGGGTACTAAGGGGTGGGCAGCTCTAGCCTCTATAGGTGTTCTTTCCGTGGAGGCTAAGGTCCCTCGTTGGCCCTTGT
TGATCAGGATATATGGATGGGAAGGAGTTTTCTATCACTGGAGTCTATGGTCCCCCAAGATCCGAGGGAGGAAGGAGTTTTGGGAGGAGTTGGGGGACCTCTATGGCATT
TGTGACCCTAGACGGTGTGTGGCTGGAGATTTTAATGTGGGAGGGTTATCATTTGTGGGGAAACTAAGAGCTCTAGAAGGGACCCTTAAAACGTGGAATAGAGAGGTTTT
TGGAGACGTCAAAATCAAAAAGAAAGAAATCCCTAAGAGAATTGAGGAGATTGATTTGAAGGAGGGGGAGGGTCTCATGGACCCCGCTTTGAGAGAGGAGAGAGATACTC
TAAGAGGGGAGTATGCTGAAGTGATTAGGAAGGAGAATATGAATTGGAGTCAAAATATGAAAGAGCAAATCTCCCGGTTGGATGGTGGATGGTTTCTCTATGCCTTTTTC
CAGGATAATTGGAATCGGCTAAAGGGGGAGTTAGAAAATGTCTTCAAAGAGTTCTTTGAAAGAGGGCTCTATAAAGGCTTCCAGAGGTTTAAAACAGCAGGACCCTTTAT
CTCCTTTCTTATTCCTTTTGACAGAGGTCGTGTTAAGTCGAATCATTCATGGGGGTGGAAGGTGGCATTATTTAGTCCTTTAAGGTGGGGGCTAAGGAGGTGGCTCTCGC
ACTTACAATTTGCTGCTGACACGATGTTCTTCTATTCCGGCAATGAGAATTCCTTCATTATTCTTAGCCATATCTTGGGTTTTTCGAGGCGATGTTGGGGCTTAAAATTA
ACAAGAGCAAATGTCAAATTTTGGGAATTAACTGTGATCAAGATAAACTTTGCAGATGGGCGAACATGGGTTGACTATAAGGTTGGCTCCTTTCCTTCTTCTTACCTAGG
CCTCCCCCTTGGGGGCAATTTGAGAGCCATCTCGTTTTGGGATGCTCCTCTCAAGAAGATTAGGAAAAGTAGTGAGAAGCTCATGAGAGACTTGAAGGAAGGGAAGGGGC
CCCATCTAGTTAGTTGGGAGGTGGCAGGGAAGCTTGTGAATTGGGGGGTTGGAGCTAGGAAATTTAAGGTTACGGGACAAAGCCCTATGGCTAAATGGCTTTGGCGTTTT
GCTCTAGAGCCCGAAGCTTTGTGGTAG
Protein sequenceShow/hide protein sequence
MFAPLVNAMGDVIRGGGCFLSNGGVVGDLSFLSSNLLGIPTHHAHGDVGTKGWAALASIGVLSVEAKVPRWPLLIRIYGWEGVFYHWSLWSPKIRGRKEFWEELGDLYGI
CDPRRCVAGDFNVGGLSFVGKLRALEGTLKTWNREVFGDVKIKKKEIPKRIEEIDLKEGEGLMDPALREERDTLRGEYAEVIRKENMNWSQNMKEQISRLDGGWFLYAFF
QDNWNRLKGELENVFKEFFERGLYKGFQRFKTAGPFISFLIPFDRGRVKSNHSWGWKVALFSPLRWGLRRWLSHLQFAADTMFFYSGNENSFIILSHILGFSRRCWGLKL
TRANVKFWELTVIKINFADGRTWVDYKVGSFPSSYLGLPLGGNLRAISFWDAPLKKIRKSSEKLMRDLKEGKGPHLVSWEVAGKLVNWGVGARKFKVTGQSPMAKWLWRF
ALEPEALW