; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Cucsat.G8407 (gene) of Cucumber (B10) v3 genome

Gene IDCucsat.G8407
OrganismCucumis sativus L. var. sativus cv. B10 (Cucumber (B10) v3)
DescriptionChloroplast, nucleus, chloroplast envelope, putative
Genome locationctg1557:1183714..1186816
RNA-Seq ExpressionCucsat.G8407
SyntenyCucsat.G8407
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KGN65868.1 hypothetical protein Csa_023343 [Cucumis sativus]7.94e-158100Show/hide
Query:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSSG
        QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSSG
Subjt:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSSG

Query:  SEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKYMP
        SEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKYMP
Subjt:  SEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKYMP

Query:  KKAQNQKQNVASDDYDETDYLDSDYGEIDF
        KKAQNQKQNVASDDYDETDYLDSDYGEIDF
Subjt:  KKAQNQKQNVASDDYDETDYLDSDYGEIDF

XP_008444591.1 PREDICTED: uncharacterized protein LOC103487859 [Cucumis melo]3.02e-10672.69Show/hide
Query:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPP--GGGGGSGGGDSGDGGEDS
        QGLP+LKY+HRRVGLK+QHTPIVSL+GSKGKGSDDGGSPWK  DKVVESFK G SVEDVLR+QIEKKEFYDGGDGG+RPP  GGGGG GGG  G G EDS
Subjt:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPP--GGGGGSGGGDSGDGGEDS

Query:  SSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDK
        SSG++D+SL   +DE LQV+LATLG +F+Y Y+L+GEE++RL KDYIKY FGGSKSVRL+RAMY WG+FYQ L  KKKYD++WLEKAI++TPTWWD+PD 
Subjt:  SSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDK

Query:  YMPK-----KAQNQKQNVASDDYDETD
        Y        KA+NQ++N ASDD  ETD
Subjt:  YMPK-----KAQNQKQNVASDDYDETD

XP_022140099.1 uncharacterized protein LOC111010834 [Momordica charantia]3.56e-10170.69Show/hide
Query:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSS
        QGLPVLKY+HRR GL +QHTPIVSL+GSKGK S DGGSPWK  DKVVE+FK GRSVEDVLRQQIEKKEFYDGGDGGKRPP GGGGSG         DSSS
Subjt:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSS

Query:  GSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKY-
        GSED SL GI+DE LQVILAT+G +F+YIYI+SGEEL+RLAKDYIK++FGGSKSVRLKRAMY WG+FYQ L +KK+YD+YWLEKAI++TPTWWD+PDKY 
Subjt:  GSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKY-

Query:  ------MPKKAQNQKQNVASDDYDETDYLDSD
              M  + +NQ      +D  E D  +SD
Subjt:  ------MPKKAQNQKQNVASDDYDETDYLDSD

XP_031742537.1 uncharacterized protein LOC101222813, partial [Cucumis sativus]7.56e-209100Show/hide
Query:  HSCELAQLLLSHNLKSPFLYQSIATYRCSLSLLPSPQQPPPPHPHSTVIHSPTRTTTHHIPSSSAQGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDD
        HSCELAQLLLSHNLKSPFLYQSIATYRCSLSLLPSPQQPPPPHPHSTVIHSPTRTTTHHIPSSSAQGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDD
Subjt:  HSCELAQLLLSHNLKSPFLYQSIATYRCSLSLLPSPQQPPPPHPHSTVIHSPTRTTTHHIPSSSAQGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDD

Query:  GGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEE
        GGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEE
Subjt:  GGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEE

Query:  LSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKYMPKKAQNQKQNVASDDYDETDYLDSDYGEIDF
        LSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKYMPKKAQNQKQNVASDDYDETDYLDSDYGEIDF
Subjt:  LSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKYMPKKAQNQKQNVASDDYDETDYLDSDYGEIDF

XP_038895689.1 uncharacterized protein LOC120083861 [Benincasa hispida]1.35e-10872.27Show/hide
Query:  SSSAQGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGE
        SS  QGLPVLKY+HRRVGLK+QHTPIVSL+GSKGK + DGGSPWK  D+VVE+FK GRSVEDVLRQQIEKKEFYDGG+GGKRPP GGGGSG GD      
Subjt:  SSSAQGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGE

Query:  DSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNP
         SSSGSED SL GI+DE LQV+LATLG +F+YIYI++GEEL+RLAKDYIKYLFGGSKSVRL+R+MY WG+FYQ L +KK+YD+YWLEKAIL+TPTWWD+P
Subjt:  DSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNP

Query:  DKYMPK-----KAQNQKQNVASDDYDETDYLDSDYGEI
        D Y        ++Q+QK+N ASDDY E D  +SD  EI
Subjt:  DKYMPK-----KAQNQKQNVASDDYDETDYLDSDYGEI

TrEMBL top hitse value%identityAlignment
A0A0A0LVP5 Uncharacterized protein3.84e-158100Show/hide
Query:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSSG
        QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSSG
Subjt:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSSG

Query:  SEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKYMP
        SEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKYMP
Subjt:  SEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKYMP

Query:  KKAQNQKQNVASDDYDETDYLDSDYGEIDF
        KKAQNQKQNVASDDYDETDYLDSDYGEIDF
Subjt:  KKAQNQKQNVASDDYDETDYLDSDYGEIDF

A0A1S3BA69 uncharacterized protein LOC1034878591.46e-10672.69Show/hide
Query:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPP--GGGGGSGGGDSGDGGEDS
        QGLP+LKY+HRRVGLK+QHTPIVSL+GSKGKGSDDGGSPWK  DKVVESFK G SVEDVLR+QIEKKEFYDGGDGG+RPP  GGGGG GGG  G G EDS
Subjt:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPP--GGGGGSGGGDSGDGGEDS

Query:  SSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDK
        SSG++D+SL   +DE LQV+LATLG +F+Y Y+L+GEE++RL KDYIKY FGGSKSVRL+RAMY WG+FYQ L  KKKYD++WLEKAI++TPTWWD+PD 
Subjt:  SSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDK

Query:  YMPK-----KAQNQKQNVASDDYDETD
        Y        KA+NQ++N ASDD  ETD
Subjt:  YMPK-----KAQNQKQNVASDDYDETD

A0A6J1CES8 uncharacterized protein LOC1110108341.72e-10170.69Show/hide
Query:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSS
        QGLPVLKY+HRR GL +QHTPIVSL+GSKGK S DGGSPWK  DKVVE+FK GRSVEDVLRQQIEKKEFYDGGDGGKRPP GGGGSG         DSSS
Subjt:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSS

Query:  GSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKY-
        GSED SL GI+DE LQVILAT+G +F+YIYI+SGEEL+RLAKDYIK++FGGSKSVRLKRAMY WG+FYQ L +KK+YD+YWLEKAI++TPTWWD+PDKY 
Subjt:  GSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKY-

Query:  ------MPKKAQNQKQNVASDDYDETDYLDSD
              M  + +NQ      +D  E D  +SD
Subjt:  ------MPKKAQNQKQNVASDDYDETDYLDSD

A0A6J1GSZ4 uncharacterized protein LOC1114572072.68e-8667.94Show/hide
Query:  PSSSA----QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDS
        PS++A    +GLPVLKY+HRRVGLK+++TPI SL+GSKGK + DGGSPWK  DKVVE+FK GRSVED+LRQQIE K+FYDGGDGG+ PPGGGGG G    
Subjt:  PSSSA----QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDS

Query:  GDGGEDSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKK-KYDQYWLEKAILSTP
           G DSSS SED S+ GI++E + V+LAT+GLV VYIYI+ G+EL  LAKDYIKYLFG  +S RLK AMY+WGKFY+   +KK K D+YWLEKAIL+TP
Subjt:  GDGGEDSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKK-KYDQYWLEKAILSTP

Query:  TWWDNPDKY
        TWWD+PDKY
Subjt:  TWWDNPDKY

A0A6J1K2H4 uncharacterized protein LOC1114904568.43e-8861.51Show/hide
Query:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSS
        +GLPVLKY+HRRVGLK+++TPI SL+GSKGK + DGGSPWK  DKVVE+FK GRSVED+LRQQIE K+FYDGGDGG+ PPGGGGGS GGD       SSS
Subjt:  QGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDKVVESFK-GRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSS

Query:  GSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKK-KYDQYWLEKAILSTPTWWDNPDKY
         SED+++ GI++E + V+LAT+GLV VYIYI+ G+EL  LAKDYIKYLFG  +S RLK AMY+WGKFY+   +KK K D+YWLEKAIL+TPTWWD+PDKY
Subjt:  GSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKK-KYDQYWLEKAILSTPTWWDNPDKY

Query:  MPK-----KAQNQKQNVA---------SDDYDETDYLDS
                ++Q Q+++ A         S  YD+ +Y +S
Subjt:  MPK-----KAQNQKQNVA---------SDDYDETDYLDS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G43630.1 FUNCTIONS IN: molecular_function unknown2.8e-4343.78Show/hide
Query:  HSP-TRTTTHHIPSSSAQGLPVLKYEHRRVGLKYQHTPIVSLYGSKGK--GSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGG
        H P T  T     + S Q  P+L +  R    K + +  V L+G K K  GSD+  SPWK ++K +     +SVED+LR+QI+KK+FYD   GG  PP G
Subjt:  HSP-TRTTTHHIPSSSAQGLPVLKYEHRRVGLKYQHTPIVSLYGSKGK--GSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGG

Query:  GGGSGGGDSGDGGEDSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWL
        GG  GGG +G+      SG ED  L GI DE LQV+LATLG +F+Y YI++GEEL +LA+DYI++L G  K+VRL RAM +W  F + + +++ YD+YWL
Subjt:  GGGSGGGDSGDGGEDSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWL

Query:  EKAILSTPTWWDNPDKYMPKKAQNQKQNVASDDYDETDYLDSDYGEIDF
        EKAI++TPTW+D+P+KY       +      D   +  Y++S+  E+ +
Subjt:  EKAILSTPTWWDNPDKYMPKKAQNQKQNVASDDYDETDYLDSDYGEIDF

AT3G59640.1 glycine-rich protein1.2e-3045.21Show/hide
Query:  SSSAQGLPVLKYEHRRVGLKYQHTPIVSLYGSKGK--GSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGG
        S S+Q  P+  +  R      +  P+V L G K K  GS++  S W+ ++K +     +SVED+LR+QI+KK+      GG  P G GGG GG + G+ G
Subjt:  SSSAQGLPVLKYEHRRVGLKYQHTPIVSLYGSKGK--GSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGG

Query:  EDSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLE
           SSG ED  L    DE LQV+LATLG +F+Y YI++GEEL RLA+DYI+YL G  KSVRL R M  W +F++ + +KK Y++YWL+
Subjt:  EDSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLE

AT3G59640.2 glycine-rich protein1.2e-3045.21Show/hide
Query:  SSSAQGLPVLKYEHRRVGLKYQHTPIVSLYGSKGK--GSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGG
        S S+Q  P+  +  R      +  P+V L G K K  GS++  S W+ ++K +     +SVED+LR+QI+KK+      GG  P G GGG GG + G+ G
Subjt:  SSSAQGLPVLKYEHRRVGLKYQHTPIVSLYGSKGK--GSDDGGSPWKGLDKVVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGG

Query:  EDSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLE
           SSG ED  L    DE LQV+LATLG +F+Y YI++GEEL RLA+DYI+YL G  KSVRL R M  W +F++ + +KK Y++YWL+
Subjt:  EDSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVRLKRAMYNWGKFYQSLMKKKKYDQYWLE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
CATAGTTGTGAGTTAGCCCAATTATTACTCTCTCATAACCTAAAAAGCCCATTTCTCTATCAGTCCATTGCCACCTACCGATGCTCATTATCCCTTCTTCCTTCTCCGCA
ACAACCTCCACCGCCACACCCCCATTCCACCGTCATTCACTCGCCGACACGCACCACTACCCACCACATTCCCTCATCTTCCGCCCAGGGTCTACCAGTTTTGAAGTATG
AACATCGGAGGGTTGGATTAAAATATCAGCATACACCAATTGTTTCCTTATACGGTAGCAAGGGAAAGGGCAGTGATGATGGGGGTTCTCCATGGAAAGGTTTGGACAAA
GTTGTTGAAAGTTTTAAGGGACGATCAGTAGAAGATGTCTTGCGACAACAAATTGAAAAGAAAGAGTTCTATGATGGTGGAGATGGTGGCAAAAGACCTCCAGGTGGCGG
CGGCGGCAGTGGTGGTGGCGACAGCGGTGATGGTGGCGAGGATAGCTCTAGTGGATCTGAGGATTATAGTCTCACAGGAATTATGGATGAAATACTGCAAGTCATTCTGG
CGACCCTCGGCTTAGTTTTCGTATACATTTATATACTCAGTGGGGAAGAGCTATCGCGATTAGCGAAGGATTACATAAAGTATCTATTTGGAGGAAGCAAGAGTGTGCGT
TTGAAGCGAGCGATGTACAACTGGGGAAAGTTTTACCAAAGCCTCATGAAAAAGAAGAAATATGATCAATATTGGCTGGAGAAAGCTATTCTTAGCACTCCAACATGGTG
GGATAATCCTGATAAGTATATGCCTAAGAAGGCACAGAATCAGAAACAGAATGTTGCATCAGATGATTATGATGAAACCGATTACCTAGACTCTGATTATGGTGAAATTG
ATTTCTAA
mRNA sequenceShow/hide mRNA sequence
CATAGTTGTGAGTTAGCCCAATTATTACTCTCTCATAACCTAAAAAGCCCATTTCTCTATCAGTCCATTGCCACCTACCGATGCTCATTATCCCTTCTTCCTTCTCCGCA
ACAACCTCCACCGCCACACCCCCATTCCACCGTCATTCACTCGCCGACACGCACCACTACCCACCACATTCCCTCATCTTCCGCCCAGGGTCTACCAGTTTTGAAGTATG
AACATCGGAGGGTTGGATTAAAATATCAGCATACACCAATTGTTTCCTTATACGGTAGCAAGGGAAAGGGCAGTGATGATGGGGGTTCTCCATGGAAAGGTTTGGACAAA
GTTGTTGAAAGTTTTAAGGGACGATCAGTAGAAGATGTCTTGCGACAACAAATTGAAAAGAAAGAGTTCTATGATGGTGGAGATGGTGGCAAAAGACCTCCAGGTGGCGG
CGGCGGCAGTGGTGGTGGCGACAGCGGTGATGGTGGCGAGGATAGCTCTAGTGGATCTGAGGATTATAGTCTCACAGGAATTATGGATGAAATACTGCAAGTCATTCTGG
CGACCCTCGGCTTAGTTTTCGTATACATTTATATACTCAGTGGGGAAGAGCTATCGCGATTAGCGAAGGATTACATAAAGTATCTATTTGGAGGAAGCAAGAGTGTGCGT
TTGAAGCGAGCGATGTACAACTGGGGAAAGTTTTACCAAAGCCTCATGAAAAAGAAGAAATATGATCAATATTGGCTGGAGAAAGCTATTCTTAGCACTCCAACATGGTG
GGATAATCCTGATAAGTATATGCCTAAGAAGGCACAGAATCAGAAACAGAATGTTGCATCAGATGATTATGATGAAACCGATTACCTAGACTCTGATTATGGTGAAATTG
ATTTCTAA
Protein sequenceShow/hide protein sequence
HSCELAQLLLSHNLKSPFLYQSIATYRCSLSLLPSPQQPPPPHPHSTVIHSPTRTTTHHIPSSSAQGLPVLKYEHRRVGLKYQHTPIVSLYGSKGKGSDDGGSPWKGLDK
VVESFKGRSVEDVLRQQIEKKEFYDGGDGGKRPPGGGGGSGGGDSGDGGEDSSSGSEDYSLTGIMDEILQVILATLGLVFVYIYILSGEELSRLAKDYIKYLFGGSKSVR
LKRAMYNWGKFYQSLMKKKKYDQYWLEKAILSTPTWWDNPDKYMPKKAQNQKQNVASDDYDETDYLDSDYGEIDF