CuGenDBv2

Spg027120 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg027120
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Protein of unknown function (DUF506)
Genome location: scaffold8:4415995..4420455
RNA-Seq Expression: Spg027120
Synteny: Spg027120
Gene Ontology terms: NA
InterPro domains: IPR006502 - Protein of unknown function PDDEXK-like


Homology
GenBank top hits (e value, % identity, alignment)
KAG7035272.1 hypothetical protein SDJN02_02067, partial [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.3e-74 | % identity: 68.3
Query:  KEILG-SGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVAR
        KEILG +GR AE EV E+VMKH+RRK DAPKTT +KKWLVMKLKMDGY S +LCHTSWVTS+GCP GDYEYI+MK +      KR+IIDIDFKAQFEVAR
Subjt:  KEILG-SGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVAR

Query:  PTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINKWKPPMVKQ
         TE YKQLT+ALPSVFVG+EEK+V+IISILCSAAKQSLKESGLHIPPWRTS+YMQ KW++A QQR +                            P+VK 
Subjt:  PTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINKWKPPMVKQ

Query:  I-RRVWAGGSALSTQFSNMSINCC
        I +RVW GGSALSTQFSNMSINCC
Subjt:  I-RRVWAGGSALSTQFSNMSINCC

XP_022143594.1 uncharacterized protein LOC111013453 [Momordica charantia] | e value: 1.7e-83 | % identity: 72.2
Query:  KEILGSGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVARP
        KEILGSGR+AE EV E+V KHLR+K ++PKTTSLKKWLVMKL+MDGYDS+DLCHTSWVTS+GCPAG+YEYI+ KV+DEFG  KR+IIDI+FKAQFEVARP
Subjt:  KEILGSGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVARP

Query:  TEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLIN-KWKPPMVKQ
        T  YKQLTEALP+VFVGTEE + RII+ILCSAAKQSL+ESGLHIPPWRTS+YMQ K+      + EET          EE+++E G+  N +WKPPMVKQ
Subjt:  TEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLIN-KWKPPMVKQ

Query:  IRRVWAGGSALSTQFSNMSINCC
         RR+W+G SALSTQFSNMSINCC
Subjt:  IRRVWAGGSALSTQFSNMSINCC

XP_022947663.1 uncharacterized protein LOC111451460 isoform X1 [Cucurbita moschata] | e value: 8.7e-75 | % identity: 69.2
Query:  KEILG-SGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVAR
        KEILG +GR AE EV E+VMKH+RRK DAPKTT LKKWLVMKLKMDGY S DLCH+SWVTSMGCP GDYEYI+MK +      KR+IIDIDFKAQFEVAR
Subjt:  KEILG-SGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVAR

Query:  PTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINKWKPPMVKQ
         TE YKQLT+ALPSVFVG+EEK+V+IISILCSAAKQSLKESGLHIPPWRTS+YMQ KW++A QQR +                            P+VK 
Subjt:  PTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINKWKPPMVKQ

Query:  I-RRVWAGGSALSTQFSNMSINCC
        I +RVW GGSALSTQFSNMSINCC
Subjt:  I-RRVWAGGSALSTQFSNMSINCC

XP_023007236.1 uncharacterized protein LOC111499781 [Cucurbita maxima] | e value: 2.7e-76 | % identity: 70.54
Query:  KEILG-SGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVAR
        KEILG SGR AE EV E+VMKH+RRK DAPKTT LKKWLVMKLKMDGY S DLCHTSWVTSMGCP GDYEYI+MKV+      KR+IIDIDFKAQFEVAR
Subjt:  KEILG-SGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVAR

Query:  PTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINKWKPPMVKQ
         TE YKQLT+ALPSVFVG+EEK+V+IISILCSAAKQSLKESGLHIPPWRTS+YMQ KW++A QQR +                            P+VK 
Subjt:  PTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINKWKPPMVKQ

Query:  I-RRVWAGGSALSTQFSNMSINCC
        I +RVW GGSALSTQFSNMSINCC
Subjt:  I-RRVWAGGSALSTQFSNMSINCC

XP_038900827.1 uncharacterized protein LOC120087891 [Benincasa hispida] | e value: 6.4e-86 | % identity: 76.99
Query:  KEILGSGRKAEAEVGESVMKHLRR-KRDAPKTT-SLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVA
        KEILGSG KAE EVGE+VMKHLR  K D+PKTT SLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYI+MKVKDE+GS KR+IIDI+FKAQFEVA
Subjt:  KEILGSGRKAEAEVGESVMKHLRR-KRDAPKTT-SLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVA

Query:  RPTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINK-WKPPMV
        R TE YKQLTEALP+VFVG+EE++ RIIS+LCSAAKQSLKESGLHIPPWRTS+YM CKWL  H+  S   NNN         KE  +  + NK WKPPMV
Subjt:  RPTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINK-WKPPMV

Query:  KQI-RRVWAGGSALSTQFSNMSINCC
        K I RRVW G SALSTQFSNMSINCC
Subjt:  KQI-RRVWAGGSALSTQFSNMSINCC

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LJS3 Uncharacterized protein | e value: 1.5e-69 | % identity: 69.86
Query:  KEILGSGRKAEAEVGESVMKHLRRKR---DAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVK-DEFGSVKRLIIDIDFKAQFE
        KEILG+G K E EVGESVMKHLRR +    + KT SL+KWLVMKLKMDGYDSS LCHTSWVTSMGCPAGDYEYI+M+ K DE GS KRLIIDI+FKAQFE
Subjt:  KEILGSGRKAEAEVGESVMKHLRRKR---DAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVK-DEFGSVKRLIIDIDFKAQFE

Query:  VARPTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNN--------NININFEREEKEKESGNL
        VAR TE YKQLT+ALP+VFVG+EEK+ RIIS+LCSAAKQSL++SGLHIPPWRTS+YM  KWL  H   S  TNN        NININ  R      S N 
Subjt:  VARPTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNN--------NININFEREEKEKESGNL

Query:  INKWKPPMV
           WKPPMV
Subjt:  INKWKPPMV

A0A2N9GFS6 Uncharacterized protein | e value: 2.6e-69 | % identity: 63.11
Query:  KEILGSGRKAEAEVGESVMKHLR-RKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVAR
        +EI+GSG + EAE+ ESV++H+R  KR+A KT+SLKKWLVMK KMDGY++S LCHTSW+TS+GCPAGDYEYID+ +++E G   RLI+DIDFK+QFE+AR
Subjt:  KEILGSGRKAEAEVGESVMKHLR-RKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVAR

Query:  PTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINKW--KPPMV
        PT  YK+LT+ LP +FVGTE+KL +IISILCSAAKQSL+E GLHIPPWRT +YMQ KWLS   + S    N       +E K    G++ +KW   PPMV
Subjt:  PTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINKW--KPPMV

Query:  KQIRRVWAGGSALSTQFSNMSINCC
        K  R   AGGSALS+QFSNMSINCC
Subjt:  KQIRRVWAGGSALSTQFSNMSINCC

A0A6J1CPS4 uncharacterized protein LOC111013453 | e value: 8.4e-84 | % identity: 72.2
Query:  KEILGSGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVARP
        KEILGSGR+AE EV E+V KHLR+K ++PKTTSLKKWLVMKL+MDGYDS+DLCHTSWVTS+GCPAG+YEYI+ KV+DEFG  KR+IIDI+FKAQFEVARP
Subjt:  KEILGSGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVARP

Query:  TEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLIN-KWKPPMVKQ
        T  YKQLTEALP+VFVGTEE + RII+ILCSAAKQSL+ESGLHIPPWRTS+YMQ K+      + EET          EE+++E G+  N +WKPPMVKQ
Subjt:  TEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLIN-KWKPPMVKQ

Query:  IRRVWAGGSALSTQFSNMSINCC
         RR+W+G SALSTQFSNMSINCC
Subjt:  IRRVWAGGSALSTQFSNMSINCC

A0A6J1G7H8 uncharacterized protein LOC111451460 isoform X1 | e value: 4.2e-75 | % identity: 69.2
Query:  KEILG-SGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVAR
        KEILG +GR AE EV E+VMKH+RRK DAPKTT LKKWLVMKLKMDGY S DLCH+SWVTSMGCP GDYEYI+MK +      KR+IIDIDFKAQFEVAR
Subjt:  KEILG-SGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVAR

Query:  PTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINKWKPPMVKQ
         TE YKQLT+ALPSVFVG+EEK+V+IISILCSAAKQSLKESGLHIPPWRTS+YMQ KW++A QQR +                            P+VK 
Subjt:  PTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINKWKPPMVKQ

Query:  I-RRVWAGGSALSTQFSNMSINCC
        I +RVW GGSALSTQFSNMSINCC
Subjt:  I-RRVWAGGSALSTQFSNMSINCC

A0A6J1KZZ7 uncharacterized protein LOC111499781 | e value: 1.3e-76 | % identity: 70.54
Query:  KEILG-SGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVAR
        KEILG SGR AE EV E+VMKH+RRK DAPKTT LKKWLVMKLKMDGY S DLCHTSWVTSMGCP GDYEYI+MKV+      KR+IIDIDFKAQFEVAR
Subjt:  KEILG-SGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVAR

Query:  PTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINKWKPPMVKQ
         TE YKQLT+ALPSVFVG+EEK+V+IISILCSAAKQSLKESGLHIPPWRTS+YMQ KW++A QQR +                            P+VK 
Subjt:  PTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINKWKPPMVKQ

Query:  I-RRVWAGGSALSTQFSNMSINCC
        I +RVW GGSALSTQFSNMSINCC
Subjt:  I-RRVWAGGSALSTQFSNMSINCC

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT1G77145.1 Protein of unknown function (DUF506) | e value: 1.4e-30 | % identity: 43.41
Query:  KEILGSGRKAEAEVGESVMKHLRRKR-----DAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSM----GCP----AGDYEYIDMKVK-----DEFGSV
        KEIL +  + E E+ E +   +   R     D  K   + K +V KL+ +GYD+S L  TSW +S     GC     +  YEYID+ VK     D    +
Subjt:  KEILGSGRKAEAEVGESVMKHLRRKR-----DAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSM----GCP----AGDYEYIDMKVK-----DEFGSV

Query:  KRLIIDIDFKAQFEVARPTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQR
        KR+IID+DFK QFE+AR TE YK +TE LP VFV TE +L R++S++C   K+S+K+ G+  PPWRT+ YMQ KWL  +++R
Subjt:  KRLIIDIDFKAQFEVARPTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQR

AT1G77160.1 Protein of unknown function (DUF506) | e value: 5.3e-30 | % identity: 42.86
Query:  KEILGSGRKAEAEVGESVMKHLRRKR-----DAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSM----GCP----AGDYEYIDMKV-----KDEFGSV
        +EIL +    E E+ E +  ++ R R     D  K   + K +V KL+ +GY++S L  TSW +S     GC     +  YEYID  V     +D    +
Subjt:  KEILGSGRKAEAEVGESVMKHLRRKR-----DAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSM----GCP----AGDYEYIDMKV-----KDEFGSV

Query:  KRLIIDIDFKAQFEVARPTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQR
        KR+IID+DFK QFE+AR TE YK +TE LP+VFV TE +L R++S++C   K+S+K+ G+  PPWRTS YMQ KWL  + +R
Subjt:  KRLIIDIDFKAQFEVARPTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQR

AT2G38820.1 Protein of unknown function (DUF506) | e value: 1.1e-32 | % identity: 47.01
Query:  VMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVARPTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLK
        V K+    YD++ LC + W  S  CPAG+YEY+D+ +K E     RL+IDIDFK++FE+AR T+ YK + + LP +FVG  ++L +II ++C AAKQSLK
Subjt:  VMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVARPTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLK

Query:  ESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNI
        + GLH+PPWR + Y++ KWLS+H +  + +N  +
Subjt:  ESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNI

AT2G38820.2 Protein of unknown function (DUF506) | e value: 1.1e-32 | % identity: 48.82
Query:  GYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVARPTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIP
        GYD++ LC + W  S  CPAG+YEY+D+ +K E     RL+IDIDFK++FE+AR T+ YK + + LP +FVG  ++L +II ++C AAKQSLK+ GLH+P
Subjt:  GYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVARPTEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIP

Query:  PWRTSSYMQCKWLSAHQQRSEETNNNI
        PWR + Y++ KWLS+H +  + +N  +
Subjt:  PWRTSSYMQCKWLSAHQQRSEETNNNI

AT4G14620.1 Protein of unknown function (DUF506) | e value: 3.9e-33 | % identity: 41.92
Query:  KEILGSGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVARP
        K ++  G   E  +     K + + +   +   L+K +V +L   GYDSS +C + W  +   PAG+YEYID+ V  E     RLIIDIDF+++FE+AR 
Subjt:  KEILGSGRKAEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVARP

Query:  TEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEE
        T GYK+L ++LP +FVG  +++ +I+SI+  A+KQSLK+ G+H PPWR + YM+ KWLS++ + S E
Subjt:  TEGYKQLTEALPSVFVGTEEKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEE
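
Note on reading the alignments: in each three-line block above, the unlabelled middle line is the match line. A letter marks an identical residue, "+" marks a conservative substitution, and a space marks a mismatch or gap. The following is a minimal Python sketch, not part of the database, that counts these per block; the function name midline_stats is hypothetical, and it assumes the match line has already been stripped of its leading indentation so that it starts at the same column as the query residues.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, width) for one alignment block.

    `query` is the residue string from a "Query:" line and `midline`
    is the match line beneath it, both without their line prefixes.
    """
    width = len(query)
    # Trailing spaces on the match line are often lost in copy/paste,
    # so pad (or trim) it to the width of the query segment.
    midline = midline.ljust(width)[:width]
    identities = sum(1 for ch in midline if ch.isalpha())
    positives = identities + midline.count("+")
    return identities, positives, width


# First 26 columns of the first GenBank hit shown above.
ident, pos, width = midline_stats(
    "KEILG-SGRKAEAEVGESVMKHLRRK",
    "KEILG +GR AE EV E+VMKH+RRK",
)
print(f"{ident}/{width} identical, {pos}/{width} positive")
```

Summing identities and widths over all blocks of one hit and dividing gives a percent identity for the displayed region; this may differ from the reported value if the page shows only part of the alignment.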


Sequences
CDS sequence
ATGTTAGCTATTTCACTATCCTACTTCCATTCTACAAATAGAATACTTTATAAGGAAGGAAGGCACACAATTCTTCCCCTCCATTACAAATTTACGGTTGATCCAACGAT
ACGAATTCTGCATGCAGCCCTAGGAGTAGTATTAGCCAAGCCACTCAAACCCCCATCTTTCTTTCTTATTCTTCATCCACGAGCAGCCCCTCCCCCTTCTCGATTTCTTC
TCCTTCTTGCTCCGGCGACATTCTCTCCGACTAGTCCGGCGGCGTCTTCTACTCCGGCAAGTGGCGGCTGGTCTCAATTCGGCGGCGTCGTTGCATGGGGAGGCGCGACA
GCAGCTCTTTTTCTTCTTCCTTTTCCTCTGGCGACAAAATCTCCTCCACGACGGCGGCAGCAGCGGTACGTCGACGGAGCAGGAGGGCAGCGGCGGTGGGCGACGCCGTT
TCTTCCGGCAAGAGACAGCGGTGAGTGGTCTCGGTTCAGCAGCGTTTCCGCGCACGTTCAGCAGGCGGTACAGCAGGTGTTCGCGGATCGTTCCGAGCAGCAGTCCGCGA
CGTCTCCAATAGTGGGTACCCATAGCGATAGGAGGTTGGATTTTGGCATTAGAGTGAAATTGGAACTTTCGACTCGCAAGCGAAAGGAGATTCTTGGGAGTGGAAGAAAA
GCAGAAGCAGAGGTGGGTGAGAGTGTGATGAAGCACTTGAGAAGGAAAAGGGATGCTCCCAAAACCACCAGCTTGAAGAAATGGCTTGTGATGAAACTCAAAATGGACGG
CTATGATTCTTCTGATCTCTGTCACACCTCTTGGGTCACTTCCATGGGATGTCCAGCAGGGGATTATGAGTACATAGACATGAAAGTGAAGGATGAGTTTGGGAGTGTAA
AGAGGCTGATAATAGACATAGACTTCAAGGCTCAATTTGAAGTAGCAAGGCCAACAGAAGGGTACAAGCAGCTCACAGAAGCACTTCCATCAGTGTTTGTAGGGACTGAA
GAGAAGCTTGTGAGAATAATCTCAATTCTATGCTCAGCAGCCAAACAGTCCCTTAAGGAGAGTGGGCTCCACATTCCTCCTTGGAGAACTTCCAGTTACATGCAGTGCAA
ATGGCTGTCTGCCCACCAGCAAAGATCAGAAGAAACTAACAATAATATTAATATTAATTTTGAGAGAGAAGAAAAAGAAAAAGAAAGTGGTAATTTAATTAATAAGTGGA
AGCCTCCCATGGTGAAGCAAATTAGGAGGGTTTGGGCTGGTGGCTCTGCCTTGTCCACTCAATTTTCTAACATGAGTATTAATTGTTGTTGA
mRNA sequence
ATGTTAGCTATTTCACTATCCTACTTCCATTCTACAAATAGAATACTTTATAAGGAAGGAAGGCACACAATTCTTCCCCTCCATTACAAATTTACGGTTGATCCAACGAT
ACGAATTCTGCATGCAGCCCTAGGAGTAGTATTAGCCAAGCCACTCAAACCCCCATCTTTCTTTCTTATTCTTCATCCACGAGCAGCCCCTCCCCCTTCTCGATTTCTTC
TCCTTCTTGCTCCGGCGACATTCTCTCCGACTAGTCCGGCGGCGTCTTCTACTCCGGCAAGTGGCGGCTGGTCTCAATTCGGCGGCGTCGTTGCATGGGGAGGCGCGACA
GCAGCTCTTTTTCTTCTTCCTTTTCCTCTGGCGACAAAATCTCCTCCACGACGGCGGCAGCAGCGGTACGTCGACGGAGCAGGAGGGCAGCGGCGGTGGGCGACGCCGTT
TCTTCCGGCAAGAGACAGCGGTGAGTGGTCTCGGTTCAGCAGCGTTTCCGCGCACGTTCAGCAGGCGGTACAGCAGGTGTTCGCGGATCGTTCCGAGCAGCAGTCCGCGA
CGTCTCCAATAGTGGGTACCCATAGCGATAGGAGGTTGGATTTTGGCATTAGAGTGAAATTGGAACTTTCGACTCGCAAGCGAAAGGAGATTCTTGGGAGTGGAAGAAAA
GCAGAAGCAGAGGTGGGTGAGAGTGTGATGAAGCACTTGAGAAGGAAAAGGGATGCTCCCAAAACCACCAGCTTGAAGAAATGGCTTGTGATGAAACTCAAAATGGACGG
CTATGATTCTTCTGATCTCTGTCACACCTCTTGGGTCACTTCCATGGGATGTCCAGCAGGGGATTATGAGTACATAGACATGAAAGTGAAGGATGAGTTTGGGAGTGTAA
AGAGGCTGATAATAGACATAGACTTCAAGGCTCAATTTGAAGTAGCAAGGCCAACAGAAGGGTACAAGCAGCTCACAGAAGCACTTCCATCAGTGTTTGTAGGGACTGAA
GAGAAGCTTGTGAGAATAATCTCAATTCTATGCTCAGCAGCCAAACAGTCCCTTAAGGAGAGTGGGCTCCACATTCCTCCTTGGAGAACTTCCAGTTACATGCAGTGCAA
ATGGCTGTCTGCCCACCAGCAAAGATCAGAAGAAACTAACAATAATATTAATATTAATTTTGAGAGAGAAGAAAAAGAAAAAGAAAGTGGTAATTTAATTAATAAGTGGA
AGCCTCCCATGGTGAAGCAAATTAGGAGGGTTTGGGCTGGTGGCTCTGCCTTGTCCACTCAATTTTCTAACATGAGTATTAATTGTTGTTGA
Protein sequence
MLAISLSYFHSTNRILYKEGRHTILPLHYKFTVDPTIRILHAALGVVLAKPLKPPSFFLILHPRAAPPPSRFLLLLAPATFSPTSPAASSTPASGGWSQFGGVVAWGGAT
AALFLLPFPLATKSPPRRRQQRYVDGAGGQRRWATPFLPARDSGEWSRFSSVSAHVQQAVQQVFADRSEQQSATSPIVGTHSDRRLDFGIRVKLELSTRKRKEILGSGRK
AEAEVGESVMKHLRRKRDAPKTTSLKKWLVMKLKMDGYDSSDLCHTSWVTSMGCPAGDYEYIDMKVKDEFGSVKRLIIDIDFKAQFEVARPTEGYKQLTEALPSVFVGTE
EKLVRIISILCSAAKQSLKESGLHIPPWRTSSYMQCKWLSAHQQRSEETNNNININFEREEKEKESGNLINKWKPPMVKQIRRVWAGGSALSTQFSNMSINCC
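
Note on the sequences: the protein sequence is the translation of the CDS sequence. The following is a minimal Python sketch, not part of the database, for checking that relationship. It assumes the standard genetic code (NCBI translation table 1); the names CODON_TABLE and translate are hypothetical, and the example input is only the first 27 nucleotides of the CDS above.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS up to (not including) the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # X for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First nine codons of the CDS sequence listed above.
print(translate("ATGTTAGCTATTTCACTATCCTACTTC"))  # MLAISLSYF
```

Passing the full CDS text (line breaks included) to translate and comparing the result with the protein sequence, joined into one string, would confirm that the two listings agree.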