CuGenDBv2

PI0016743 (gene) of Melon (PI 482460) v1 genome

Gene ID: PI0016743
Organism: Cucumis metuliferus PI 482460 (Melon (PI 482460) v1)
Description: Protein of unknown function, DUF599
Genome location: chr03:3678157..3680888
RNA-Seq Expression: PI0016743
Synteny: PI0016743
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR006747 - Protein of unknown function DUF599
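The genome location field follows a `chromosome:start..end` pattern. A minimal Python sketch for splitting it into its parts is shown here. The function name `parse_location` is hypothetical, and the span length assumes 1-based inclusive coordinates, which this page does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'chrom:start..end' string into (chrom, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location.strip())
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    chrom, start, end = match.groups()
    return chrom, int(start), int(end)


chrom, start, end = parse_location("chr03:3678157..3680888")
# Assumption: coordinates are 1-based and inclusive, so both ends count.
span = end - start + 1
print(chrom, start, end, span)
```

If the coordinates turn out to be 0-based or half-open, the `+ 1` would need to be dropped.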


Homology
GenBank top hits (e-value, % identity, alignment)
XP_004138780.1 uncharacterized protein LOC101209677 [Cucumis sativus] (e-value: 2.9e-95; identity: 80.17%)
Query:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
        MEWKES+LDLILVPTGFIL+MCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSS++KDI+KKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
Subjt:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI

Query:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-----PQHGRE--SPLLL------GC--------------SSFAWIFGPVLVFL
        KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQ+      PQ      +P  L      GC                  WIFGPVLVFL
Subjt:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-----PQHGRE--SPLLL------GC--------------SSFAWIFGPVLVFL

Query:  CYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV
        CYLSLLPLLYNLDFVSCNAHNKNNTTKVEAN GI+VG+ENFV
Subjt:  CYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV

XP_008445058.1 PREDICTED: uncharacterized protein LOC103488211 [Cucumis melo] (e-value: 1.9e-94; identity: 79.84%)
Query:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
        M WKESNLDLILVPTGFILIMCYHLGLWYKVRTQPF T IGINTSGRRLWVSS+MKDIEKKNILAVQTLRNAIMGSTLMATT+ILISCGLAAILSSTYSI
Subjt:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI

Query:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-----PQHGRESPLL---------LGC--------------SSFAWIFGPVLVF
        KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQ+      PQ    SP+           GC                  WIFGPVLVF
Subjt:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-----PQHGRESPLL---------LGC--------------SSFAWIFGPVLVF

Query:  LCYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV
        LCYLSLLPLLYNLDFVSC+AH KNNTTKVEANNGI+VGNENFV
Subjt:  LCYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV

XP_022131495.1 uncharacterized protein LOC111004681 [Momordica charantia] (e-value: 3.4e-72; identity: 66.53%)
Query:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
        MEWKES LD+ILVP GF+L +CYH  LW+KVRTQPFTTIIGIN+SGRR WVS+MMKD EKKNILAVQTLRNAIMGSTLMATTSIL+S GLAAILSSTYSI
Subjt:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI

Query:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-----PQHGRE-------SPLL--------LGCSSF-------AWIFGPVLVFL
        KKP+NDSV+GAHGEF +SLKYVS+LTIFLFSF CHSLSIRF+NQ+      PQ           S LL        +G   F        WIFGPVLVFL
Subjt:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-----PQHGRE-------SPLL--------LGCSSF-------AWIFGPVLVFL

Query:  CYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV
        C L+++P+LYNLDFV  NAH   N TKV+AN      N +FV
Subjt:  CYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV

XP_038885866.1 uncharacterized protein LOC120076170 isoform X1 [Benincasa hispida] (e-value: 8.7e-76; identity: 73.54%)
Query:  GLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSIKKPLNDSVFGAHGEFMLSLKYVSIL
        GLWYKVRTQPFTTIIGIN+SGR  WVS+MMKDIEKKNILAVQTLRNAIMGSTLMATTSIL+SCGLAAILSSTYSIKKP+N+SVFGAHGEFMLSLKYVSIL
Subjt:  GLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSIKKPLNDSVFGAHGEFMLSLKYVSIL

Query:  TIFLFSFLCHSLSIRFINQLR--------------PQHGRE-------------------SPLLLGCSSFAWIFGPVLVFLCYLSLLPLLYNLDFVSCNA
        TIFLFSFLCHSLSIRFINQ+               P +  E                    PLLL      WIFGPVLVFLCYL+LLPLLY+LDFVS +A
Subjt:  TIFLFSFLCHSLSIRFINQLR--------------PQHGRE-------------------SPLLLGCSSFAWIFGPVLVFLCYLSLLPLLYNLDFVSCNA

Query:  HNKNNTTKVEANNGIIVGNENFV
        HNKNNTTKVEANN II+GNENFV
Subjt:  HNKNNTTKVEANNGIIVGNENFV

XP_038885867.1 uncharacterized protein LOC120076170 isoform X2 [Benincasa hispida] (e-value: 1.4e-89; identity: 75%)
Query:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
        MEWKES LDLILVP GF LIMCYHLGLWYKVRTQPFTTIIGIN+SGR  WVS+MMKDIEKKNILAVQTLRNAIMGSTLMATTSIL+SCGLAAILSSTYSI
Subjt:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI

Query:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR--------------PQHGRE-------------------SPLLLGCSSFAWIFG
        KKP+N+SVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQ+               P +  E                    PLLL      WIFG
Subjt:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR--------------PQHGRE-------------------SPLLLGCSSFAWIFG

Query:  PVLVFLCYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV
        PVLVFLCYL+LLPLLY+LDFVS +AHNKNNTTKVEANN II+GNENFV
Subjt:  PVLVFLCYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV

TrEMBL top hits (e-value, % identity, alignment)
A0A0A0LS59 Uncharacterized protein (e-value: 1.4e-95; identity: 80.17%)
Query:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
        MEWKES+LDLILVPTGFIL+MCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSS++KDI+KKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
Subjt:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI

Query:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-----PQHGRE--SPLLL------GC--------------SSFAWIFGPVLVFL
        KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQ+      PQ      +P  L      GC                  WIFGPVLVFL
Subjt:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-----PQHGRE--SPLLL------GC--------------SSFAWIFGPVLVFL

Query:  CYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV
        CYLSLLPLLYNLDFVSCNAHNKNNTTKVEAN GI+VG+ENFV
Subjt:  CYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV

A0A1S3BCI9 uncharacterized protein LOC103488211 (e-value: 9.0e-95; identity: 79.84%)
Query:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
        M WKESNLDLILVPTGFILIMCYHLGLWYKVRTQPF T IGINTSGRRLWVSS+MKDIEKKNILAVQTLRNAIMGSTLMATT+ILISCGLAAILSSTYSI
Subjt:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI

Query:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-----PQHGRESPLL---------LGC--------------SSFAWIFGPVLVF
        KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQ+      PQ    SP+           GC                  WIFGPVLVF
Subjt:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-----PQHGRESPLL---------LGC--------------SSFAWIFGPVLVF

Query:  LCYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV
        LCYLSLLPLLYNLDFVSC+AH KNNTTKVEANNGI+VGNENFV
Subjt:  LCYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV

A0A6J1BPV5 uncharacterized protein LOC111004681 (e-value: 1.6e-72; identity: 66.53%)
Query:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
        MEWKES LD+ILVP GF+L +CYH  LW+KVRTQPFTTIIGIN+SGRR WVS+MMKD EKKNILAVQTLRNAIMGSTLMATTSIL+S GLAAILSSTYSI
Subjt:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI

Query:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-----PQHGRE-------SPLL--------LGCSSF-------AWIFGPVLVFL
        KKP+NDSV+GAHGEF +SLKYVS+LTIFLFSF CHSLSIRF+NQ+      PQ           S LL        +G   F        WIFGPVLVFL
Subjt:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-----PQHGRE-------SPLL--------LGCSSF-------AWIFGPVLVFL

Query:  CYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV
        C L+++P+LYNLDFV  NAH   N TKV+AN      N +FV
Subjt:  CYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV

A0A6J1GHK8 uncharacterized protein LOC111454252 (e-value: 3.1e-71; identity: 64.96%)
Query:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
        M WK+S LD+ILVP+ F+L + YH  LWYKVRTQPFTTI+GIN+SGRR WVS+MMKD EKKNILAVQTLRNAIMGSTLMATTSIL+S GLAA+LSSTYSI
Subjt:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI

Query:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQ--------------LRPQHGRE-------------SPLLLGCSSFAWIFGPVLVFL
        KKP+NDSVFGAHGE M+SLKYVS+L+IFLFSFLCHSLSIRFINQ              ++P++  E                 L      WIFGPVLVF+
Subjt:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQ--------------LRPQHGRE-------------SPLLLGCSSFAWIFGPVLVFL

Query:  CYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGI
        C L+++ LLYNLDFV  +AHNK   TKVEAN  I
Subjt:  CYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGI

A0A6J1KLF7 uncharacterized protein LOC111496310 (e-value: 1.0e-69; identity: 64.53%)
Query:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
        M WK+S LD+ILVP+ F+L + YH  L YKVRTQPFTTI+GIN+SGRR WVS+MMKD EKKNILAVQTLRNAIMGSTLMATTSIL+S GLAA+LSSTYSI
Subjt:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI

Query:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQ--------------LRPQHGRE-------------SPLLLGCSSFAWIFGPVLVFL
        KKP+NDSVFGAHGEFM+SLKYVS+L+IFLFSFLCHSLSIRFINQ              ++P++  E                 L      WIFGPVLVF+
Subjt:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQ--------------LRPQHGRE-------------SPLLLGCSSFAWIFGPVLVFL

Query:  CYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGI
        C L+++ LLYNLD V  +AHNK   TKVEAN  I
Subjt:  CYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGI

SwissProt top hits (e-value, % identity, alignment)
No hits found

Arabidopsis top hits (e-value, % identity, alignment)
AT3G18215.1 Protein of unknown function, DUF599 (e-value: 7.9e-27; identity: 34.12%)
Query:  WKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSIKK
        W E +LDL+LVPTG ++++ YH+ L Y +  +P  T+I +N   RR WV SMM +  K   LAVQT+RN IM STL+ATT+I +   +   +S++ S K 
Subjt:  WKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSIKK

Query:  PLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRF-----------INQLRPQHGRESPLLLGCSS----------------FAWIFGPVLVFLCY
           + ++G+    + S K  +IL  FL +FLC+  SIR+           +++ + +H       L  +S                F W FGP+ +F+C 
Subjt:  PLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRF-----------INQLRPQHGRESPLLLGCSS----------------FAWIFGPVLVFLCY

Query:  LSLLPLLYNLD
          +  +LY LD
Subjt:  LSLLPLLYNLD

AT4G31330.1 Protein of unknown function, DUF599 (e-value: 9.9e-62; identity: 56.82%)
Query:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
        MEW+E  LD+ILVP G ++   YH+ LW+K+RTQP TTIIG N   RR WV+S++KD +KKNILAVQTLRN IMGSTLMATTSIL+  GLAA+LSSTY++
Subjt:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI

Query:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-------PQHGRESPLLLGCSSFA--------------------------WIFG
        KKPLND+VFGA GEFM++LKYV+ILTIFLFSF  HSLSIRFINQ+        P    E  +++    +                           WIFG
Subjt:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR-------PQHGRESPLLLGCSSFA--------------------------WIFG

Query:  PVLVFLCYLSLLPLLYNLDF
        PVLVFLC + ++PLLYNLDF
Subjt:  PVLVFLCYLSLLPLLYNLDF

AT5G10580.1 Protein of unknown function, DUF599 (e-value: 2.3e-58; identity: 51.45%)
Query:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
        MEW++  LD +LVP+  +++  YH+ LWYKVRT PF TI+G N+  RR WV+++MKD EKKNILAVQTLRN IMG TLMATT IL+  GLAA+LSSTYSI
Subjt:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI

Query:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR----------------------PQHGRE-------------SPLLLGCSSFAWI
        KKPLND+V+GAHG+F ++LKYV+ILTIFLF+F  HSLSIRFINQ+                       P++  E                 +G     WI
Subjt:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLR----------------------PQHGRE-------------SPLLLGCSSFAWI

Query:  FGPVLVFLCYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNG
        FGPVLVFL    ++P+LYNLDFV   ++ +    KV+ N G
Subjt:  FGPVLVFLCYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNG

AT5G10580.2 Protein of unknown function, DUF599 (e-value: 3.2e-52; identity: 68.28%)
Query:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
        MEW++  LD +LVP+  +++  YH+ LWYKVRT PF TI+G N+  RR WV+++MKD EKKNILAVQTLRN IMG TLMATT IL+  GLAA+LSSTYSI
Subjt:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI

Query:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQL
        KKPLND+V+GAHG+F ++LKYV+ILTIFLF+F  HSLSIRFINQ+
Subjt:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQL

AT5G24790.1 Protein of unknown function, DUF599 (e-value: 6.2e-56; identity: 51.07%)
Query:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI
        MEWK+  LD ILVP   ++++CYH+ L + VRT PF+T++GIN+ GRR+W+S+M+KD +K NILAVQTLRN +MG+TLMATT +L+  GLAA+LSSTYSI
Subjt:  MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSI

Query:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQ-------------------LRPQHGRE-------------SPLLLGCSSFAWIFGP
        KKPLND+VFGAHG+F +S+KY++ILTIF+FSF  HSLSIRF+NQ                   L  +H  E                  G S   WIFGP
Subjt:  KKPLNDSVFGAHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQ-------------------LRPQHGRE-------------SPLLLGCSSFAWIFGP

Query:  VLVFLCYLSLLPLLYNLDFVSCNAHNKNNTTKV
        +LVF   L ++ +L +LDFVS N    NN  K+
Subjt:  VLVFLCYLSLLPLLYNLDFVSCNAHNKNNTTKV
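In the alignments above, the middle line of each block marks identical residues with the residue letter, conservative substitutions with `+`, and mismatches or gaps with a space. The following Python sketch tallies these for one segment, using the final segment of the first GenBank hit (XP_004138780.1) as input. The helper name `midline_stats` is hypothetical. The reported % identity covers the whole alignment, so a single segment is not expected to reproduce it.

```python
def midline_stats(midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, length) for one BLAST midline segment."""
    length = len(midline)
    # Letters in the midline mark positions where query and subject agree.
    identities = sum(1 for ch in midline if ch.isalpha())
    # '+' marks a conservative substitution; positives include identities.
    conservative = midline.count("+")
    return identities, identities + conservative, length


query = "CYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV"
midline = "CYLSLLPLLYNLDFVSCNAHNKNNTTKVEAN GI+VG+ENFV"
assert len(query) == len(midline)

identities, positives, length = midline_stats(midline)
print(f"{identities}/{length} identical, {positives}/{length} positive")
```

Trailing spaces in a midline are easy to lose when copying text, so comparing its length with the query line, as done here, is a cheap sanity check.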


Sequences
CDS sequence
ATGGAATGGAAAGAGAGCAATTTGGATTTGATATTAGTTCCTACGGGATTTATATTGATTATGTGTTATCATTTAGGATTATGGTATAAAGTTCGAACTCAACCTTTCAC
CACTATCATTGGAATTAATACCTCTGGTCGTCGTCTCTGGGTTTCTTCCATGATGAAGGATATTGAGAAGAAGAACATTCTTGCTGTTCAAACTCTACGAAATGCCATAA
TGGGATCAACTCTTATGGCTACAACCTCGATCCTCATCTCATGCGGCCTCGCCGCGATTTTGAGCAGCACCTACAGCATCAAAAAGCCACTGAACGACTCCGTGTTCGGG
GCGCACGGCGAGTTCATGTTGTCTCTCAAATATGTCTCCATTCTCACCATTTTCCTCTTCTCCTTCTTATGTCATTCCCTTTCCATCAGATTTATCAACCAGTTGCGTCC
TCAACACGGTCGGGAATCGCCTCTTCTACTCGGCTGTTCCTCTTTTGCTTGGATATTTGGACCTGTGCTTGTGTTTTTATGCTATCTCTCTTTGCTGCCTTTGCTTTATA
ATTTGGATTTTGTTTCTTGTAATGCTCATAACAAAAACAATACCACCAAAGTTGAAGCCAATAATGGTATTATTGTTGGTAATGAAAACTTTGTGTAG
mRNA sequence
ATGGAATGGAAAGAGAGCAATTTGGATTTGATATTAGTTCCTACGGGATTTATATTGATTATGTGTTATCATTTAGGATTATGGTATAAAGTTCGAACTCAACCTTTCAC
CACTATCATTGGAATTAATACCTCTGGTCGTCGTCTCTGGGTTTCTTCCATGATGAAGGATATTGAGAAGAAGAACATTCTTGCTGTTCAAACTCTACGAAATGCCATAA
TGGGATCAACTCTTATGGCTACAACCTCGATCCTCATCTCATGCGGCCTCGCCGCGATTTTGAGCAGCACCTACAGCATCAAAAAGCCACTGAACGACTCCGTGTTCGGG
GCGCACGGCGAGTTCATGTTGTCTCTCAAATATGTCTCCATTCTCACCATTTTCCTCTTCTCCTTCTTATGTCATTCCCTTTCCATCAGATTTATCAACCAGTTGCGTCC
TCAACACGGTCGGGAATCGCCTCTTCTACTCGGCTGTTCCTCTTTTGCTTGGATATTTGGACCTGTGCTTGTGTTTTTATGCTATCTCTCTTTGCTGCCTTTGCTTTATA
ATTTGGATTTTGTTTCTTGTAATGCTCATAACAAAAACAATACCACCAAAGTTGAAGCCAATAATGGTATTATTGTTGGTAATGAAAACTTTGTGTAG
Protein sequence
MEWKESNLDLILVPTGFILIMCYHLGLWYKVRTQPFTTIIGINTSGRRLWVSSMMKDIEKKNILAVQTLRNAIMGSTLMATTSILISCGLAAILSSTYSIKKPLNDSVFG
AHGEFMLSLKYVSILTIFLFSFLCHSLSIRFINQLRPQHGRESPLLLGCSSFAWIFGPVLVFLCYLSLLPLLYNLDFVSCNAHNKNNTTKVEANNGIIVGNENFV
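The protein sequence can be related back to the CDS by codon-wise translation. The sketch below assumes the standard nuclear genetic code (the page does not say which table was used) and uses a hypothetical helper named `translate`. It is applied only to the first 30 and last 12 nucleotides of the CDS listed above, not to the full sequence.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


print(translate("ATGGAATGGAAAGAGAGCAATTTGGATTTG"))  # start of the CDS
print(translate("AACTTTGTGTAG"))                    # end of the CDS
```

Joining the wrapped CDS lines and passing the result to `translate` gives a simple consistency check against the listed protein sequence.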