; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI05G09370 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI05G09370
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionProtein of unknown function (DUF674)
Genome locationChr5:7910830..7912662
RNA-Seq ExpressionCSPI05G09370
SyntenyCSPI05G09370
Gene Ontology termsNA
InterPro domainsIPR007750 - Protein of unknown function DUF674


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8647974.1 hypothetical protein Csa_000694 [Cucumis sativus]1.2e-9690.2Show/hide
Query:  MAEVYKVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKTYYLCNNNFRKQS
        MAEVYKVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPMLPGPLSNLYCSMDSLNTINYFEPKR LQNLLNPKTYYLCNNN+RKQS
Subjt:  MAEVYKVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKTYYLCNNNFRKQS

Query:  PCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGGYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIEVKLIRLDIKE-----GSKL
         CHSVSSTHGT+CP CGGYMTINLAYVYVD+E K IEGGYVTGMGKYMVMDDLTVKPMAYSS STISVLNELNVDD+SQIE KLIRLDIKE     GSK 
Subjt:  PCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGGYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIEVKLIRLDIKE-----GSKL

Query:  LKAS
        +  S
Subjt:  LKAS

XP_008461633.1 PREDICTED: uncharacterized protein LOC103500190 isoform X1 [Cucumis melo]5.6e-4150.22Show/hide
Query:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPM---LPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT--------------
        KVSLKLVIDKK KR+LY EADK FIDFLFT+L+LP  T++KL S  P+   +   L NLY S+ +L+ I YF+P    + LLNPK+              
Subjt:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPM---LPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT--------------

Query:  -----YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEG--GYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIE
             YY C  ++         +ST  T+CPKC   M     YVY   E KP EG  GYV G  +YMVMDDLTVKP+ +SS STI VL EL VDD+  I+
Subjt:  -----YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEG--GYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIE

Query:  VKLIRLDIKEGSKLLKASFHTSTVLTEVF
         KL+ LDI EG KLLKAS  + TVLT+VF
Subjt:  VKLIRLDIKEGSKLLKASFHTSTVLTEVF

XP_008461634.1 PREDICTED: uncharacterized protein LOC103500190 isoform X2 [Cucumis melo]5.6e-4150.22Show/hide
Query:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPM---LPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT--------------
        KVSLKLVIDKK KR+LY EADK FIDFLFT+L+LP  T++KL S  P+   +   L NLY S+ +L+ I YF+P    + LLNPK+              
Subjt:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPM---LPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT--------------

Query:  -----YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEG--GYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIE
             YY C  ++         +ST  T+CPKC   M     YVY   E KP EG  GYV G  +YMVMDDLTVKP+ +SS STI VL EL VDD+  I+
Subjt:  -----YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEG--GYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIE

Query:  VKLIRLDIKEGSKLLKASFHTSTVLTEVF
         KL+ LDI EG KLLKAS  + TVLT+VF
Subjt:  VKLIRLDIKEGSKLLKASFHTSTVLTEVF

XP_022139200.1 uncharacterized protein LOC111010169 [Momordica charantia]5.8e-4653.22Show/hide
Query:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQST-EPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPK-----------------
        KVSLKLVID+K KR+LY EADK+FIDFLFTIL+LP+G V+KL ST  P+    + N+Y +  +LN +NYF   R+   LLNP                  
Subjt:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQST-EPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPK-----------------

Query:  ------TYYLCNNNFRKQSPC-HSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIE-GGYVT-GMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLS
              TYY C  +    + C +S S T+G  C +CG  MT N  YVY  KEAKP+E GGYV  GM  +MVMDDLTVKP++ SS STISVL++L+V+D+ 
Subjt:  ------TYYLCNNNFRKQSPC-HSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIE-GGYVT-GMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLS

Query:  QIEVKLIRLDIKEGSKLLKASFHTSTVLTEVFL
        QIE KLI LDI EG KLL+AS  TSTVLT+VFL
Subjt:  QIEVKLIRLDIKEGSKLLKASFHTSTVLTEVFL

XP_031741243.1 uncharacterized protein LOC101207769 [Cucumis sativus]2.8e-9694.24Show/hide
Query:  MAEVYKVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKTYYLCNNNFRKQS
        MAEVYKVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPMLPGPLSNLYCSMDSLNTINYFEPKR LQNLLNPKTYYLCNNN+RKQS
Subjt:  MAEVYKVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKTYYLCNNNFRKQS

Query:  PCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGGYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIEVKLIRLDIKE
         CHSVSSTHGT+CP CGGYMTINLAYVYVD+E K IEGGYVTGMGKYMVMDDLTVKPMAYSS STISVLNELNVDD+SQIE KLIRLDIKE
Subjt:  PCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGGYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIEVKLIRLDIKE

TrEMBL top hitse value%identityAlignment
A0A0A0KNU0 Uncharacterized protein7.3e-8792.05Show/hide
Query:  MLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKTYYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGGYVTGMGKYMVMDD
        MLPGPLSNLYCSMDSLNTINYFEPKR LQNLLNPKTYYLCNNN+RKQS CHSVSSTHGT+CP CGGYMTINLAYVYVD+E K IEGGYVTGMGKYMVMDD
Subjt:  MLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKTYYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGGYVTGMGKYMVMDD

Query:  LTVKPMAYSSTSTISVLNELNVDDLSQIEVKLIRLDIKEGSKLLKASFHTSTVLTEVFLLQTRDGIPTNKQLKEEE
        LTVKPMAYSS STISVLNELNVDD+SQIE KLIRLDIKEGS LLKASFHTSTVLTEVFLLQTRD IPTNKQL+EEE
Subjt:  LTVKPMAYSSTSTISVLNELNVDDLSQIEVKLIRLDIKEGSKLLKASFHTSTVLTEVFLLQTRDGIPTNKQLKEEE

A0A1S3CF25 uncharacterized protein LOC103500190 isoform X22.7e-4150.22Show/hide
Query:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPM---LPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT--------------
        KVSLKLVIDKK KR+LY EADK FIDFLFT+L+LP  T++KL S  P+   +   L NLY S+ +L+ I YF+P    + LLNPK+              
Subjt:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPM---LPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT--------------

Query:  -----YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEG--GYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIE
             YY C  ++         +ST  T+CPKC   M     YVY   E KP EG  GYV G  +YMVMDDLTVKP+ +SS STI VL EL VDD+  I+
Subjt:  -----YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEG--GYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIE

Query:  VKLIRLDIKEGSKLLKASFHTSTVLTEVF
         KL+ LDI EG KLLKAS  + TVLT+VF
Subjt:  VKLIRLDIKEGSKLLKASFHTSTVLTEVF

A0A1S3CF63 uncharacterized protein LOC103500190 isoform X12.7e-4150.22Show/hide
Query:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPM---LPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT--------------
        KVSLKLVIDKK KR+LY EADK FIDFLFT+L+LP  T++KL S  P+   +   L NLY S+ +L+ I YF+P    + LLNPK+              
Subjt:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPM---LPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT--------------

Query:  -----YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEG--GYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIE
             YY C  ++         +ST  T+CPKC   M     YVY   E KP EG  GYV G  +YMVMDDLTVKP+ +SS STI VL EL VDD+  I+
Subjt:  -----YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEG--GYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIE

Query:  VKLIRLDIKEGSKLLKASFHTSTVLTEVF
         KL+ LDI EG KLLKAS  + TVLT+VF
Subjt:  VKLIRLDIKEGSKLLKASFHTSTVLTEVF

A0A6J1CBN0 uncharacterized protein LOC1110101692.8e-4653.22Show/hide
Query:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQST-EPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPK-----------------
        KVSLKLVID+K KR+LY EADK+FIDFLFTIL+LP+G V+KL ST  P+    + N+Y +  +LN +NYF   R+   LLNP                  
Subjt:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQST-EPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPK-----------------

Query:  ------TYYLCNNNFRKQSPC-HSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIE-GGYVT-GMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLS
              TYY C  +    + C +S S T+G  C +CG  MT N  YVY  KEAKP+E GGYV  GM  +MVMDDLTVKP++ SS STISVL++L+V+D+ 
Subjt:  ------TYYLCNNNFRKQSPC-HSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIE-GGYVT-GMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLS

Query:  QIEVKLIRLDIKEGSKLLKASFHTSTVLTEVFL
        QIE KLI LDI EG KLL+AS  TSTVLT+VFL
Subjt:  QIEVKLIRLDIKEGSKLLKASFHTSTVLTEVFL

M5X812 Uncharacterized protein4.1e-3745.29Show/hide
Query:  VSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKTYYL-----------CNNN
        VS+KL++D  R +VL+ EA K+ +DFLFT+L+LPVGTVI+L S + M+ G L  LY S+++L+   Y +P  +   LL PK                ++N
Subjt:  VSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKTYYL-----------CNNN

Query:  FRKQSPC-----HSVSSTHGTRCPKCG-GYMTINLAYVY-VDKEAKPIEGGYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIEVKLIRLD
         +K   C      S+S+ HGTRCP C  G+M+  + YV      A+P EGGYV G+  YMVMDDL VKPM  S+ S+I++LN+ NV ++  +E K++ L 
Subjt:  FRKQSPC-----HSVSSTHGTRCPKCG-GYMTINLAYVY-VDKEAKPIEGGYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIEVKLIRLD

Query:  IKEGSKLLKASFHTSTVLTEVFL
        ++EG KLLKAS  TSTVLT+VFL
Subjt:  IKEGSKLLKASFHTSTVLTEVFL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT3G09110.1 Protein of unknown function (DUF674)2.8e-1426.96Show/hide
Query:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKL----QSTEPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT-------------
        K SL+L+ID+++ RV+  EA K+F+D L ++L LP+GT+++L    Q+ +  + G L NLY S+  ++  N FE +     LL+P++             
Subjt:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKL----QSTEPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT-------------

Query:  -------YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGGYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIE
               +++C  NF     C  + S   T   +CG  M     +  +  E +  +G + +    +++ DDL  K    S    ++VLN+       +++
Subjt:  -------YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGGYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIE

Query:  VKLIRLDIKEGSKLLKASFHTSTVLTEVFL
          LI +  +E   LL   F +   LT+ FL
Subjt:  VKLIRLDIKEGSKLLKASFHTSTVLTEVFL

AT3G09120.1 Protein of unknown function (DUF674)1.7e-1426.97Show/hide
Query:  MAEVYKVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPMLP---GPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT---------
        MA+  K+SLKL++D+K+ +V+  EA ++F+D LF +L  P+GT+ +L      LP   G   NL  S+  +  ++ F+ +     LL+PK+         
Subjt:  MAEVYKVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPMLP---GPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT---------

Query:  -----------YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGG-----YVTGMGKYMVMDDLTVKPMAYSSTSTISVLNEL
                   +Y+C+   + +S     S+ + +RC  CG  M   + +V  D++     G      +V+    +++ DDL  K M  S    + VLN L
Subjt:  -----------YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGG-----YVTGMGKYMVMDDLTVKPMAYSSTSTISVLNEL

Query:  NVDDLSQIEVKLIRLDIKEGSKLLKASFHTSTVLTEVFLLQ
           +++ ++  LI +  +E   LL   F + + LT  FL++
Subjt:  NVDDLSQIEVKLIRLDIKEGSKLLKASFHTSTVLTEVFLLQ

AT3G09140.2 Protein of unknown function (DUF674)1.4e-1325.71Show/hide
Query:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPMLP----GPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT-------------
        K+S++L+ID+ + +V+  E+ K+F+D LF+ LALP+GT+++L       P    G  +NLY S+  ++ +  F+ +   Q LL P++             
Subjt:  KVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPMLP----GPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKT-------------

Query:  -------YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGGYVTG---------------MGKYMVMDDLTVKPMAYSSTSTI
               Y++C+    K+S  H  S ++   C +CG  +   +    +++E K +E   V G                  +++ DDL V+  A S  + +
Subjt:  -------YYLCNNNFRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGGYVTG---------------MGKYMVMDDLTVKPMAYSSTSTI

Query:  SVLNELNVDDLSQIEVKLIRLDIKEGSKLLKASFHTSTVLTEVFL
        + L  L   D S++   L+ + + E   LL+  F +   LT+ FL
Subjt:  SVLNELNVDDLSQIEVKLIRLDIKEGSKLLKASFHTSTVLTEVFL

AT5G01130.1 Protein of unknown function (DUF674)3.3e-1529.39Show/hide
Query:  AEVYKVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKL----QSTEPMLPGPLSNLYCSMDSLNTINY-FEPKRHLQNLLNPKTY-------
        +E  KVSL+L ID+++ +V+  EA K F+D LF++L LP+GT+I+L    + ++P+  G  SNLY S+  +   N+  +  +H+  LL+P++        
Subjt:  AEVYKVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKL----QSTEPMLPGPLSNLYCSMDSLNTINY-FEPKRHLQNLLNPKTY-------

Query:  YLCNNN------FRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGGYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIEVK
         + N N      F+    C+  S    +RC +CG  M        V   A  I+       G +++ DDL  K    S+   ++ L  L   D+S++   
Subjt:  YLCNNN------FRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIEGGYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIEVK

Query:  LIRLDIKEGSKLLKASFHTSTVLTEVFL
        L+ +  +E   LL+  F +   LT  FL
Subjt:  LIRLDIKEGSKLLKASFHTSTVLTEVFL

AT5G01150.1 Protein of unknown function (DUF674)1.9e-1526.72Show/hide
Query:  AEVYKVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKL----QSTEPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKTY-------Y
        +E  K SL+L++D+++ +V+  EA ++F+D LF++L LP+GT+++L    + +EP+  G  +NLY S+  + + + FE +   Q L+ PK+         
Subjt:  AEVYKVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKL----QSTEPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKTY-------Y

Query:  LCNNN-------FRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIE----GGYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQ
          N N       F+  S C   S+   ++C +CG +M   +     +++    +    G +V+G   +++ DDL V     S+   ++ L  L   D+ +
Subjt:  LCNNN-------FRKQSPCHSVSSTHGTRCPKCGGYMTINLAYVYVDKEAKPIE----GGYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQ

Query:  IEVKLIRLDIKEGSKLLKASFHTSTVLTEVFL
        +  +L+ + +KE   LL   F ++  L ++FL
Subjt:  IEVKLIRLDIKEGSKLLKASFHTSTVLTEVFL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGGAAGTATACAAGGTGAGTTTGAAACTTGTTATAGACAAGAAAAGGAAGAGGGTTCTATATGGTGAAGCAGACAAGGAATTCATCGACTTTCTTTTTACTATACT
TGCTCTTCCTGTTGGAACTGTGATTAAGCTTCAATCCACAGAACCCATGCTGCCTGGCCCCCTCTCAAATCTCTACTGTAGTATGGACTCCTTGAACACAATCAACTATT
TTGAACCAAAGCGGCATCTACAAAATCTACTAAACCCAAAGACCTATTATTTATGTAATAACAATTTTAGAAAGCAGTCCCCTTGTCACAGTGTTAGTAGTACACATGGC
ACAAGATGTCCCAAATGCGGAGGATACATGACTATCAATTTAGCTTATGTTTATGTGGATAAGGAAGCAAAGCCTATTGAAGGAGGTTATGTGACAGGGATGGGTAAGTA
TATGGTGATGGATGATCTTACTGTGAAACCCATGGCCTACTCTTCCACGTCCACCATTTCTGTTTTGAATGAGTTGAATGTAGATGATCTTTCCCAGATTGAGGTCAAGC
TTATTCGTTTGGACATCAAAGAGGGTTCGAAGTTGCTGAAAGCTTCCTTCCACACGTCCACTGTGCTCACTGAAGTGTTTCTTCTCCAAACTCGTGATGGTATTCCAACA
AACAAACAATTAAAGGAGGAGGAGTAG
mRNA sequenceShow/hide mRNA sequence
ATGGCGGAAGTATACAAGGTGAGTTTGAAACTTGTTATAGACAAGAAAAGGAAGAGGGTTCTATATGGTGAAGCAGACAAGGAATTCATCGACTTTCTTTTTACTATACT
TGCTCTTCCTGTTGGAACTGTGATTAAGCTTCAATCCACAGAACCCATGCTGCCTGGCCCCCTCTCAAATCTCTACTGTAGTATGGACTCCTTGAACACAATCAACTATT
TTGAACCAAAGCGGCATCTACAAAATCTACTAAACCCAAAGACCTATTATTTATGTAATAACAATTTTAGAAAGCAGTCCCCTTGTCACAGTGTTAGTAGTACACATGGC
ACAAGATGTCCCAAATGCGGAGGATACATGACTATCAATTTAGCTTATGTTTATGTGGATAAGGAAGCAAAGCCTATTGAAGGAGGTTATGTGACAGGGATGGGTAAGTA
TATGGTGATGGATGATCTTACTGTGAAACCCATGGCCTACTCTTCCACGTCCACCATTTCTGTTTTGAATGAGTTGAATGTAGATGATCTTTCCCAGATTGAGGTCAAGC
TTATTCGTTTGGACATCAAAGAGGGTTCGAAGTTGCTGAAAGCTTCCTTCCACACGTCCACTGTGCTCACTGAAGTGTTTCTTCTCCAAACTCGTGATGGTATTCCAACA
AACAAACAATTAAAGGAGGAGGAGTAG
Protein sequenceShow/hide protein sequence
MAEVYKVSLKLVIDKKRKRVLYGEADKEFIDFLFTILALPVGTVIKLQSTEPMLPGPLSNLYCSMDSLNTINYFEPKRHLQNLLNPKTYYLCNNNFRKQSPCHSVSSTHG
TRCPKCGGYMTINLAYVYVDKEAKPIEGGYVTGMGKYMVMDDLTVKPMAYSSTSTISVLNELNVDDLSQIEVKLIRLDIKEGSKLLKASFHTSTVLTEVFLLQTRDGIPT
NKQLKEEE