; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Pay0020433 (gene) of Melon (Payzawat) v1 genome

Gene IDPay0020433
OrganismCucumis melo var. inodorus cv. Payzawat (Melon (Payzawat) v1)
DescriptionProtein of unknown function (DUF1685)
Genome locationchr01:2403079..2404743
RNA-Seq ExpressionPay0020433
SyntenyPay0020433
Gene Ontology termsNA
InterPro domainsIPR012881 - Protein of unknown function DUF1685


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6572385.1 hypothetical protein SDJN03_29113, partial [Cucurbita argyrosperma subsp. sororia]4.6e-6968Show/hide
Query:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQG------SPVM---KMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKL
        MAAEEILPLFDLFWFQ A+F  KPLL T F+       SPVM   K+RSQSEY L+S   PP     ++NQKL+ +LSG+VTEF G GEGK  K   KK 
Subjt:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQG------SPVM---KMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKL

Query:  EGNENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVP
        EG+E K RR+K+ +GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+L SIIPGL RLG +  EE+  E+GV  RPYLSEAW+A+EEE EK  LMKWRVP
Subjt:  EGNENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVP

Query:  SLGATEMDIKHHLKFWAHTVASTVR
         LGATEMD+K HLKFWAHTVASTVR
Subjt:  SLGATEMDIKHHLKFWAHTVASTVR

KGN43634.1 hypothetical protein Csa_017160 [Cucumis sativus]1.7e-10088.07Show/hide
Query:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQG--SPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKI
        MAAEEILPLFDLFWFQ+AIF RK  LKTCFQ     V+KMRSQSEYLLNSKDFPPP T LNSNQKLET+LSG+VTEFGG+ EG+ATKK+ KKLEGNE+KI
Subjt:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQG--SPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKI

Query:  RRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEM
        RRKKK KGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGP+ TEEKR+ENGVLRRPYLSEAW+AIEEENEKM+LMKWRVPSLGATEM
Subjt:  RRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEM

Query:  DIKHHLKFWAHTVASTVR
        DIKHHLKFWAHTVASTVR
Subjt:  DIKHHLKFWAHTVASTVR

XP_008455295.1 PREDICTED: uncharacterized protein LOC103495495 [Cucumis melo]3.9e-116100Show/hide
Query:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQGSPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKIRR
        MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQGSPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKIRR
Subjt:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQGSPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKIRR

Query:  KKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDI
        KKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDI
Subjt:  KKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDI

Query:  KHHLKFWAHTVASTVR
        KHHLKFWAHTVASTVR
Subjt:  KHHLKFWAHTVASTVR

XP_022952129.1 uncharacterized protein LOC111454895 [Cucurbita moschata]6.0e-6968Show/hide
Query:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQG------SPVM---KMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKL
        MAAEEILPLFDLFWFQ A+F  KPLL T F+       SPVM   K+RSQSEY L+S   PP     ++NQKL+ +LSG+VTEF G GEGK  K   KK 
Subjt:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQG------SPVM---KMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKL

Query:  EGNENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVP
        EG+E K RR+K+ +GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+L SIIPGL RLG +  EE+  E+GV  RPYLSEAW+A+EEE EK  LMKWRVP
Subjt:  EGNENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVP

Query:  SLGATEMDIKHHLKFWAHTVASTVR
         LGATEMD+K HLKFWAHTVASTVR
Subjt:  SLGATEMDIKHHLKFWAHTVASTVR

XP_038887878.1 uncharacterized protein LOC120077867 [Benincasa hispida]1.9e-7571.61Show/hide
Query:  MAAEEILPLFDLFWFQQAIFPRKPLLKT-------CFQG--SPVMKMRSQSEYLLNSKDFPPPVTT-------LNSNQKLETVLSGQVTEFGGSGEGK-A
        MAAEEILPLFDLFWFQ+AIF  KPLL+T        FQ   + VMK RSQSEYLL+SK FPPP T        ++++QKL+T+LSG+V EF G+GEGK A
Subjt:  MAAEEILPLFDLFWFQQAIFPRKPLLKT-------CFQG--SPVMKMRSQSEYLLNSKDFPPPVTT-------LNSNQKLETVLSGQVTEFGGSGEGK-A

Query:  TKKEMKKLEGNENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQI---TEEKRNENGVLRRPYLSEAWEAIEEEN
             KKLEGNENK RRKK+ KGLSKSLSDLEFEELKGFMDLGFVF EEDKNDSNL SIIPGL RLG +     EEKR ENGV  RPYLSEAWEA+EEEN
Subjt:  TKKEMKKLEGNENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQI---TEEKRNENGVLRRPYLSEAWEAIEEEN

Query:  EKMVLMKWRVPSLGATEMDIKHHLKFWAHTVASTVR
        EK +LMKWRVP LGATEMD+K HLKFWAHTVASTVR
Subjt:  EKMVLMKWRVPSLGATEMDIKHHLKFWAHTVASTVR

TrEMBL top hitse value%identityAlignment
A0A0A0K3T7 Uncharacterized protein8.4e-10188.07Show/hide
Query:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQG--SPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKI
        MAAEEILPLFDLFWFQ+AIF RK  LKTCFQ     V+KMRSQSEYLLNSKDFPPP T LNSNQKLET+LSG+VTEFGG+ EG+ATKK+ KKLEGNE+KI
Subjt:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQG--SPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKI

Query:  RRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEM
        RRKKK KGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGP+ TEEKR+ENGVLRRPYLSEAW+AIEEENEKM+LMKWRVPSLGATEM
Subjt:  RRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEM

Query:  DIKHHLKFWAHTVASTVR
        DIKHHLKFWAHTVASTVR
Subjt:  DIKHHLKFWAHTVASTVR

A0A1S3C1U3 uncharacterized protein LOC1034954951.9e-116100Show/hide
Query:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQGSPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKIRR
        MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQGSPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKIRR
Subjt:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQGSPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKIRR

Query:  KKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDI
        KKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDI
Subjt:  KKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDI

Query:  KHHLKFWAHTVASTVR
        KHHLKFWAHTVASTVR
Subjt:  KHHLKFWAHTVASTVR

A0A5D3C775 Uncharacterized protein1.9e-116100Show/hide
Query:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQGSPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKIRR
        MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQGSPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKIRR
Subjt:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQGSPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKIRR

Query:  KKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDI
        KKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDI
Subjt:  KKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDI

Query:  KHHLKFWAHTVASTVR
        KHHLKFWAHTVASTVR
Subjt:  KHHLKFWAHTVASTVR

A0A6J1GKQ8 uncharacterized protein LOC1114548952.9e-6968Show/hide
Query:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQG------SPVM---KMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKL
        MAAEEILPLFDLFWFQ A+F  KPLL T F+       SPVM   K+RSQSEY L+S   PP     ++NQKL+ +LSG+VTEF G GEGK  K   KK 
Subjt:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQG------SPVM---KMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKL

Query:  EGNENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVP
        EG+E K RR+K+ +GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+L SIIPGL RLG +  EE+  E+GV  RPYLSEAW+A+EEE EK  LMKWRVP
Subjt:  EGNENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVP

Query:  SLGATEMDIKHHLKFWAHTVASTVR
         LGATEMD+K HLKFWAHTVASTVR
Subjt:  SLGATEMDIKHHLKFWAHTVASTVR

A0A6J1I0F9 uncharacterized protein LOC1114683114.0e-6665.79Show/hide
Query:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQG------SPVM---KMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKL
        MAAEEIL LFDLFWFQ A+F   PLL T F+       SPVM   K+RSQSEY L+S   PP     ++NQKL+ +LSG+VTEF G G GK  K   KK 
Subjt:  MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQG------SPVM---KMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKL

Query:  EGNENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGV---LRRPYLSEAWEAIEEENEKMVLMKW
        EG+E K RR+K+ +GLSKSLSDLEFEELKGFMDLGFVFSEEDK +S+L SIIPGL RLG + TEE   E G+   + RPYLSEAW+A+EEE EK  LMKW
Subjt:  EGNENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGV---LRRPYLSEAWEAIEEENEKMVLMKW

Query:  RVPSLGATEMDIKHHLKFWAHTVASTVR
        RVP LGATEMD+K HLKFWAHTVASTVR
Subjt:  RVPSLGATEMDIKHHLKFWAHTVASTVR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G05870.1 Protein of unknown function (DUF1685)1.7e-0529.61Show/hide
Query:  SGEGKATKKEMKK----LEG-----NENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGL---HRLGPQITEEKRNENGVLR
        +GEG   K E KK    LEG     + + +  +K     SKSL+D + E+L+G +DLGF FS ++  +  L + +P L   + +  +  ++K+N++    
Subjt:  SGEGKATKKEMKK----LEG-----NENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGL---HRLGPQITEEKRNENGVLR

Query:  RPYLSEAWEAIEEENEKMV-LMKWRVPSLGATEMDIKHHLKFWAHTVASTVR
         P  S   +           +  W++ S G    D+K  LK+WA  VA TV+
Subjt:  RPYLSEAWEAIEEENEKMV-LMKWRVPSLGATEMDIKHHLKFWAHTVASTVR

AT1G05870.2 Protein of unknown function (DUF1685)1.7e-0529.61Show/hide
Query:  SGEGKATKKEMKK----LEG-----NENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGL---HRLGPQITEEKRNENGVLR
        +GEG   K E KK    LEG     + + +  +K     SKSL+D + E+L+G +DLGF FS ++  +  L + +P L   + +  +  ++K+N++    
Subjt:  SGEGKATKKEMKK----LEG-----NENKIRRKKKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGL---HRLGPQITEEKRNENGVLR

Query:  RPYLSEAWEAIEEENEKMV-LMKWRVPSLGATEMDIKHHLKFWAHTVASTVR
         P  S   +           +  W++ S G    D+K  LK+WA  VA TV+
Subjt:  RPYLSEAWEAIEEENEKMV-LMKWRVPSLGATEMDIKHHLKFWAHTVASTVR

AT2G31560.1 Protein of unknown function (DUF1685)5.9e-0627.92Show/hide
Query:  VTEFGGSGEGKATKKEMKKLEGNENKIRRK------KKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGL---HRLGPQITEEKRNENG
        + + GG G G    +  KKLE  ++++  +      +     +KSL+D + EELKG +DLGF FS ++  +  L + +P L   + +  +  ++K+  + 
Subjt:  VTEFGGSGEGKATKKEMKKLEGNENKIRRK------KKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGL---HRLGPQITEEKRNENG

Query:  VLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDIKHHLKFWAHTVASTVR
          +     E  ++         +  W++ S G    D+K  LK+WA TVA TVR
Subjt:  VLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDIKHHLKFWAHTVASTVR

AT2G31560.2 Protein of unknown function (DUF1685)5.9e-0627.92Show/hide
Query:  VTEFGGSGEGKATKKEMKKLEGNENKIRRK------KKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGL---HRLGPQITEEKRNENG
        + + GG G G    +  KKLE  ++++  +      +     +KSL+D + EELKG +DLGF FS ++  +  L + +P L   + +  +  ++K+  + 
Subjt:  VTEFGGSGEGKATKKEMKKLEGNENKIRRK------KKRKGLSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGL---HRLGPQITEEKRNENG

Query:  VLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDIKHHLKFWAHTVASTVR
          +     E  ++         +  W++ S G    D+K  LK+WA TVA TVR
Subjt:  VLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDIKHHLKFWAHTVASTVR

AT2G42760.1 unknown protein4.7e-2735.77Show/hide
Query:  MAAEEILPLFDLFWFQQAI---------------------------------FPRKPLLKTCFQGSPVMKMRSQSEYLLNSKD--FPPPVTTL---NSNQ
        MA EE+L LF+  W ++ I                                 FP   L++       +M   S++    +S D  F  P + L    +  
Subjt:  MAAEEILPLFDLFWFQQAI---------------------------------FPRKPLLKTCFQGSPVMKMRSQSEYLLNSKD--FPPPVTTL---NSNQ

Query:  KLETVLSG-QVTEFGGSGEGKATKKEMKKLEGNENKIRRKKKRKG-----LSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRL-----GP
        KL+T+LSG +V  F        T  E ++L   + + R+KKK+K        KS+SDLE+EELKGFMDLGFVFSE+D  DS+L SI+PGL RL     G 
Subjt:  KLETVLSG-QVTEFGGSGEGKATKKEMKKLEGNENKIRRKKKRKG-----LSKSLSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRL-----GP

Query:  QITEEKRNENGVL-----RRPYLSEAWEAIEEENEKMVL---MKWRVPS-LGATEMDIKHHLKFWAHTVASTVR
           EE+  E   +      RPYLSEAW+       K  +   +KWRVP+   A+E+D+K +L+ WAH VAST+R
Subjt:  QITEEKRNENGVL-----RRPYLSEAWEAIEEENEKMVL---MKWRVPS-LGATEMDIKHHLKFWAHTVASTVR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCAGCCGAAGAAATCCTCCCACTTTTTGATCTCTTCTGGTTTCAACAAGCAATATTCCCCAGAAAACCTCTTTTAAAAACCTGCTTTCAAGGTAGTCCTGTGATGAA
GATGAGATCTCAAAGCGAGTATCTTCTAAACTCCAAGGATTTCCCACCACCCGTAACCACCCTCAACTCCAACCAAAAGCTGGAAACCGTTCTTTCGGGTCAGGTAACGG
AATTTGGGGGAAGCGGAGAAGGAAAAGCGACGAAGAAGGAGATGAAGAAGTTGGAAGGGAATGAAAACAAAATCAGAAGAAAGAAAAAGCGGAAAGGGCTGAGTAAGAGT
TTATCAGACTTGGAATTTGAAGAATTGAAAGGATTTATGGATTTGGGATTTGTGTTTAGTGAAGAAGATAAAAATGATTCAAATTTGGGTTCAATAATTCCAGGGTTACA
CAGATTAGGGCCCCAAATAACAGAAGAAAAAAGGAACGAAAATGGGGTTTTAAGAAGGCCATATTTATCTGAAGCTTGGGAAGCCATTGAAGAAGAAAATGAAAAAATGG
TTTTGATGAAATGGAGAGTTCCAAGTTTAGGAGCAACTGAAATGGATATTAAACATCATCTCAAATTCTGGGCTCATACCGTGGCTTCAACTGTCAGATAA
mRNA sequenceShow/hide mRNA sequence
AAATCATCTTTTTCTAGTTTCTTCTTCCTTATCTATATAACTGTTTACAAACTCGAATTTTCTCTCACCAACACAACAATGGCAGCCGAAGAAATCCTCCCACTTTTTGA
TCTCTTCTGGTTTCAACAAGCAATATTCCCCAGAAAACCTCTTTTAAAAACCTGCTTTCAAGGTAGTCCTGTGATGAAGATGAGATCTCAAAGCGAGTATCTTCTAAACT
CCAAGGATTTCCCACCACCCGTAACCACCCTCAACTCCAACCAAAAGCTGGAAACCGTTCTTTCGGGTCAGGTAACGGAATTTGGGGGAAGCGGAGAAGGAAAAGCGACG
AAGAAGGAGATGAAGAAGTTGGAAGGGAATGAAAACAAAATCAGAAGAAAGAAAAAGCGGAAAGGGCTGAGTAAGAGTTTATCAGACTTGGAATTTGAAGAATTGAAAGG
ATTTATGGATTTGGGATTTGTGTTTAGTGAAGAAGATAAAAATGATTCAAATTTGGGTTCAATAATTCCAGGGTTACACAGATTAGGGCCCCAAATAACAGAAGAAAAAA
GGAACGAAAATGGGGTTTTAAGAAGGCCATATTTATCTGAAGCTTGGGAAGCCATTGAAGAAGAAAATGAAAAAATGGTTTTGATGAAATGGAGAGTTCCAAGTTTAGGA
GCAACTGAAATGGATATTAAACATCATCTCAAATTCTGGGCTCATACCGTGGCTTCAACTGTCAGATAACACCACTTTTCTTACATCTTATGGTAACTGTACTTTTTCTT
TTTTTTCCCTACTTCTGTGAATGATATTATAAATAAATGACAAAATTTGTGTTCTACCAATTTTATAATCATACTAATAAGAGTACTATACTTCAATGTTTAAAGATATT
TTGTATACTCTTAGCTAAAAATATATTTTTCTGTTCTTGACCTTAAATATCATATGGCTAAATAAAAAATTCAAATCTTCAAATTCCTACTTATCATATTACAATAAACA
ATAAATAAAACTTGCGCAGTTAAAATCATCACCCAGTTATTAAGCGACTTATGGTGTGAATCTCTCACTCCAATTGTATTAAAAAATACATGAAAAAGTTTTTAATGTTT
TACAAATATCTATGGAAGAATTCTCTCTCTTAATCATGTTCTTACACACAATAACGAGGTTTGAATTTACACCTATAGTAAGACTTTCTTTACATCAATAAAACTCATAT
TAGGTTAAACATAAAGAATCTAATAATATGGTTAGTAATCCGTATAAGATGCGGTGAATATAATTTGATATGAATTAAAGGCTGTACAGGGTGTGTATTCGTGGAGAATT
TGAAGAATTTCTCTCGAGAAATTGAACAACTTTCTTCACAAAAGTTGAAGAGAACCGAGTAGGAAAGCTCCTCGAGGGCTACCGAAGAAGGAAAAAAGTTCAGGAAGGCC
ATCAAATTTGAAGAAAACATAAGCTTAGGCAGGTTCTTTTTTTTCTTTTGGTTTAACAGAGCAATTGGATTTTCTGAAATTTGGGATTGTAGGACGTAATTTGTGAACTC
TAACCTATTGGGTAATTTAATTTAAAGGTAGTTAATGATTTTTGTGGAAGAAACTTTTTGAGATTGGATTGGATGAAATCAAAGCTTAACATCTTGAAAGAAGCTGCTGT
AGAATAGAGACGCGG
Protein sequenceShow/hide protein sequence
MAAEEILPLFDLFWFQQAIFPRKPLLKTCFQGSPVMKMRSQSEYLLNSKDFPPPVTTLNSNQKLETVLSGQVTEFGGSGEGKATKKEMKKLEGNENKIRRKKKRKGLSKS
LSDLEFEELKGFMDLGFVFSEEDKNDSNLGSIIPGLHRLGPQITEEKRNENGVLRRPYLSEAWEAIEEENEKMVLMKWRVPSLGATEMDIKHHLKFWAHTVASTVR