; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg010592 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg010592
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold5:12216703..12230641
RNA-Seq ExpressionSpg010592
SyntenySpg010592
Gene Ontology termsNA
InterPro domainsIPR025558 - Domain of unknown function DUF4283


Homology Show/hide homology
GenBank top hitse value%identityAlignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]1.1e-2735.48Show/hide
Query:  PRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQVIVRGVPVDWSPRAINSLFNLQDF--PHAGFNETVVAPSNDQLNAAVREAGIEGAQWR
        P F+   I  HGW Q C  P      +VREFYAN+ D     V V+ V V ++ RAINS+F L++    +  F   V   +++QL   + E  IEGA W+
Subjt:  PRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQVIVRGVPVDWSPRAINSLFNLQDF--PHAGFNETVVAPSNDQLNAAVREAGIEGAQWR

Query:  LSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRVLLVFAILRSMSIDVGKIISNEIYDC-WRKKVGKLFFPNIITMLCQRAGVPMNTDDVT
        +S     T     LK  A  W  F+  R + +TH   V++DRVLL+++IL  +S+++ +I   EI  C   +K G L+FP++IT L  +A VP + D+  
Subjt:  LSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRVLLVFAILRSMSIDVGKIISNEIYDC-WRKKVGKLFFPNIITMLCQRAGVPMNTDDVT

Query:  LMDKGIIDTPNLARLQR
        + + G I T +++R+ +
Subjt:  LMDKGIIDTPNLARLQR

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]5.7e-3236.36Show/hide
Query:  GIAEAPAEAEDIDTEETRLPYNRFINNLARAKYIEMLKRDFLFERGFGDD-------LPRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQ
        GI     +A      ET     R+ NN+          R    E+GF  D       LP F+   I  H W Q CA PE     +VREFYAN+ D     
Subjt:  GIAEAPAEAEDIDTEETRLPYNRFINNLARAKYIEMLKRDFLFERGFGDD-------LPRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQ

Query:  VIVRGVPVDWSPRAINSLFNLQDFPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRVL
        V VRGV V WS  AIN++F L D P    +E +   +   L   +    + GA+W +S     T     L   A  W  F+K  LL TTH   VS+DR+L
Subjt:  VIVRGVPVDWSPRAINSLFNLQDFPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRVL

Query:  LVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKGIIDTPNLARLQRTQE
        L+ ++L   SI+VG++I +EI  C  +K G LFFP++IT LC+ A  P   ++  L + G ID   +AR+  TQE
Subjt:  LVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]2.2e-4435.05Show/hide
Query:  GIAEAPAEAEDIDTEETRLPYNRFINNLARAKYIEMLKRDFLFERGFGDD-------LPRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQ
        GI     +A      ET     R+ NN+          R    E+GF  D       LP F+   I  H W Q CA PE     +VREFYAN+ D E   
Subjt:  GIAEAPAEAEDIDTEETRLPYNRFINNLARAKYIEMLKRDFLFERGFGDD-------LPRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQ

Query:  VIVRGVPVDWSPRAINSLFNLQDFPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRVL
        V VRGV V WS  AIN++F L D P    +E +   +   L   +      GA+W +S     T     L   A  W  F+K RLL TTH   VS+DR+L
Subjt:  VIVRGVPVDWSPRAINSLFNLQDFPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRVL

Query:  LVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKGIIDTPNLARLQR---TQEAHQ---------------GGLV
        L+ ++L   SI+VG++I +EI  C  +K G LFFP++IT LC+ A  P   ++  L + G ID   +AR+ +   T+   Q               G ++
Subjt:  LVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKGIIDTPNLARLQR---TQEAHQ---------------GGLV

Query:  CGIHQMQEQL------QMH-SSKMEFPERQFQTFWNYVKRRDAALREALQSNFSKPYPTFPIFPDDLL
          +  ++++L      Q H  S ++   +Q Q FW Y K RD AL++ALQ+NF++P PTFP FP ++L
Subjt:  CGIHQMQEQL------QMH-SSKMEFPERQFQTFWNYVKRRDAALREALQSNFSKPYPTFPIFPDDLL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]1.9e-2731.42Show/hide
Query:  RENKGKGIAEAPAEAEDIDTEETRLPYNRFINNLARAKYIEMLKRDFLFERGFGDDLPRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQV
        RE   + +A   ++A   +++   + Y   I N   +     ++++F+++     + P F+   I  H W   CA PE     +VREFY N+ + +   V
Subjt:  RENKGKGIAEAPAEAEDIDTEETRLPYNRFINNLARAKYIEMLKRDFLFERGFGDDLPRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQV

Query:  IVRGVPVDWSPRAINSLFNLQD--FPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRV
         +RGV V  S  AIN++F+L D    H+ F E +  P   +L   +    I GA+W +S     T     L   A  W  F+K RLL TTH   VS++ V
Subjt:  IVRGVPVDWSPRAINSLFNLQD--FPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRV

Query:  LLVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKG
         L++++L   SI+VG++I  EI  C  +K G LFFP++IT +C+    P   ++  L + G
Subjt:  LLVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKG

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]3.7e-3938.18Show/hide
Query:  IVREFYANIDDQEGFQVIVRGVPVDWSPRAINSLFNLQD--FPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIK
        +VREFYAN+ D E   + VRGV V WS  AIN++F L D    H+ F E +  P   +L   +      GA+W +S     T     L   A  W  F+K
Subjt:  IVREFYANIDDQEGFQVIVRGVPVDWSPRAINSLFNLQD--FPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIK

Query:  LRLLLTTHDSMVSRDRVLLVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKGIIDTPNLARL------------
         RLL TTH  +VS+DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP++IT LC+ A  P   ++  L + G ID   +AR+            
Subjt:  LRLLLTTHDSMVSRDRVLLVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKGIIDTPNLARL------------

Query:  --QRTQEAHQGGLVCGIHQMQEQLQMHSSKMEFPERQFQTFWNYVKRRDAALREALQSNFSKPYPTFPIFPDDLL
           R   A        + Q  + L+   S+ E   +Q Q FW Y K RD AL++ALQ+NF++P PTFP FP ++L
Subjt:  --QRTQEAHQGGLVCGIHQMQEQLQMHSSKMEFPERQFQTFWNYVKRRDAALREALQSNFSKPYPTFPIFPDDLL

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)2.7e-3236.36Show/hide
Query:  GIAEAPAEAEDIDTEETRLPYNRFINNLARAKYIEMLKRDFLFERGFGDD-------LPRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQ
        GI     +A      ET     R+ NN+          R    E+GF  D       LP F+   I  H W Q CA PE     +VREFYAN+ D     
Subjt:  GIAEAPAEAEDIDTEETRLPYNRFINNLARAKYIEMLKRDFLFERGFGDD-------LPRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQ

Query:  VIVRGVPVDWSPRAINSLFNLQDFPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRVL
        V VRGV V WS  AIN++F L D P    +E +   +   L   +    + GA+W +S     T     L   A  W  F+K  LL TTH   VS+DR+L
Subjt:  VIVRGVPVDWSPRAINSLFNLQDFPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRVL

Query:  LVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKGIIDTPNLARLQRTQE
        L+ ++L   SI+VG++I +EI  C  +K G LFFP++IT LC+ A  P   ++  L + G ID   +AR+  TQE
Subjt:  LVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)1.1e-4435.05Show/hide
Query:  GIAEAPAEAEDIDTEETRLPYNRFINNLARAKYIEMLKRDFLFERGFGDD-------LPRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQ
        GI     +A      ET     R+ NN+          R    E+GF  D       LP F+   I  H W Q CA PE     +VREFYAN+ D E   
Subjt:  GIAEAPAEAEDIDTEETRLPYNRFINNLARAKYIEMLKRDFLFERGFGDD-------LPRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQ

Query:  VIVRGVPVDWSPRAINSLFNLQDFPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRVL
        V VRGV V WS  AIN++F L D P    +E +   +   L   +      GA+W +S     T     L   A  W  F+K RLL TTH   VS+DR+L
Subjt:  VIVRGVPVDWSPRAINSLFNLQDFPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRVL

Query:  LVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKGIIDTPNLARLQR---TQEAHQ---------------GGLV
        L+ ++L   SI+VG++I +EI  C  +K G LFFP++IT LC+ A  P   ++  L + G ID   +AR+ +   T+   Q               G ++
Subjt:  LVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKGIIDTPNLARLQR---TQEAHQ---------------GGLV

Query:  CGIHQMQEQL------QMH-SSKMEFPERQFQTFWNYVKRRDAALREALQSNFSKPYPTFPIFPDDLL
          +  ++++L      Q H  S ++   +Q Q FW Y K RD AL++ALQ+NF++P PTFP FP ++L
Subjt:  CGIHQMQEQL------QMH-SSKMEFPERQFQTFWNYVKRRDAALREALQSNFSKPYPTFPIFPDDLL

A0A2P5DAQ2 Uncharacterized protein9.1e-2831.42Show/hide
Query:  RENKGKGIAEAPAEAEDIDTEETRLPYNRFINNLARAKYIEMLKRDFLFERGFGDDLPRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQV
        RE   + +A   ++A   +++   + Y   I N   +     ++++F+++     + P F+   I  H W   CA PE     +VREFY N+ + +   V
Subjt:  RENKGKGIAEAPAEAEDIDTEETRLPYNRFINNLARAKYIEMLKRDFLFERGFGDDLPRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQV

Query:  IVRGVPVDWSPRAINSLFNLQD--FPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRV
         +RGV V  S  AIN++F+L D    H+ F E +  P   +L   +    I GA+W +S     T     L   A  W  F+K RLL TTH   VS++ V
Subjt:  IVRGVPVDWSPRAINSLFNLQD--FPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRV

Query:  LLVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKG
         L++++L   SI+VG++I  EI  C  +K G LFFP++IT +C+    P   ++  L + G
Subjt:  LLVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKG

A0A2P5DXM3 Uncharacterized protein1.8e-3938.18Show/hide
Query:  IVREFYANIDDQEGFQVIVRGVPVDWSPRAINSLFNLQD--FPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIK
        +VREFYAN+ D E   + VRGV V WS  AIN++F L D    H+ F E +  P   +L   +      GA+W +S     T     L   A  W  F+K
Subjt:  IVREFYANIDDQEGFQVIVRGVPVDWSPRAINSLFNLQD--FPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIK

Query:  LRLLLTTHDSMVSRDRVLLVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKGIIDTPNLARL------------
         RLL TTH  +VS+DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP++IT LC+ A  P   ++  L + G ID   +AR+            
Subjt:  LRLLLTTHDSMVSRDRVLLVFAILRSMSIDVGKIISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKGIIDTPNLARL------------

Query:  --QRTQEAHQGGLVCGIHQMQEQLQMHSSKMEFPERQFQTFWNYVKRRDAALREALQSNFSKPYPTFPIFPDDLL
           R   A        + Q  + L+   S+ E   +Q Q FW Y K RD AL++ALQ+NF++P PTFP FP ++L
Subjt:  --QRTQEAHQGGLVCGIHQMQEQLQMHSSKMEFPERQFQTFWNYVKRRDAALREALQSNFSKPYPTFPIFPDDLL

W9QTD9 Uncharacterized protein5.4e-2835.48Show/hide
Query:  PRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQVIVRGVPVDWSPRAINSLFNLQDF--PHAGFNETVVAPSNDQLNAAVREAGIEGAQWR
        P F+   I  HGW Q C  P      +VREFYAN+ D     V V+ V V ++ RAINS+F L++    +  F   V   +++QL   + E  IEGA W+
Subjt:  PRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQVIVRGVPVDWSPRAINSLFNLQDF--PHAGFNETVVAPSNDQLNAAVREAGIEGAQWR

Query:  LSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRVLLVFAILRSMSIDVGKIISNEIYDC-WRKKVGKLFFPNIITMLCQRAGVPMNTDDVT
        +S     T     LK  A  W  F+  R + +TH   V++DRVLL+++IL  +S+++ +I   EI  C   +K G L+FP++IT L  +A VP + D+  
Subjt:  LSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRVLLVFAILRSMSIDVGKIISNEIYDC-WRKKVGKLFFPNIITMLCQRAGVPMNTDDVT

Query:  LMDKGIIDTPNLARLQR
        + + G I T +++R+ +
Subjt:  LMDKGIIDTPNLARLQR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGATCAAGAATGCCCTCATTGGAGCATGGAGAACAAGAAAGCAGTTCTGTACTGATACAGTGGGAAAGAATACATTCCTTTTCAAATTCGAAGACCTTGATGACAG
GGAATGGGTAATCAATAATGGGCTAGGGCTTTTTTATAAGAAGCTGATAGTCTTGGAAGCACCAGGGGCAAATCAGAGGACAGCAGACTTGGCTTTCAAGAAAGCAGATT
TCTGGGTGAGATTATTCAATCTTCCTTTGGGTTATAGAAACAAAGTCACAGCAGAGAAAATTGGTAATATCTTAGGAGATTTTGTAGCAGCCGACATGGATGAAAATTTC
TGGGGAAATAGCGTTAGACTCAAAAAACCAGAATCTCCGGAAAAAGCCAACGATGAGAATTATGTTGGTGGTAATGTGAATAATGTTGACCTGGAAAATGTAGTTGAGTT
GGGAGAAGACATAGGGAGGAATGATAATCTGACTGAGGCAAAGTCAATAGAAACTGGAAGTCCGGATAGTCAAAAGGAAGAAGGGGAAATGGGGTGCGATTCCATCTATG
TCGAAGACCAGCGCCTAAATATTGTATATACAGACATGGAAATTCACCTAGAAAATGGGGGAAATCAGCTTAATAATGCTAGTAGTCAGAATAACTCGGAGAACATAGCT
ATTCATGGTAAGACTGGGAAAAAGAAGGCCACCTGGAAAAGAAAGGCAAGAAGCGTCACGAATATTGATTTGAATAAAGAAGCTTCTACACCGAAAAAGCGGAAGGGCGA
TTTTGTGGAAGAACATGGAGGCAAGAACCACATGGAAGCTCGAAATTTAGATTGGATCTATTCGGATCATAGACCTATTGAATTGGAGTTAGCTAACCGAAGGATCAATC
ATTCAAAGAGATGGAAAAGATCGTTCAAATTCGAGGAATTGTGGACCATATATGAAAATTGTGCAGAGATTATCTCAAAGAATGGAGAATGGTTAGGTGATATCTCTTCT
TTTTATCCTCTAAATCATAATCTTTCTAAATGTTCGGAGTCTCTAACTAAATGGGGTAAAGGGTTAAACAACCAACGAAAAAACAGAATTCTTGATTGTAAAAGAGCTCT
TAAAGAAGCTTACAGTAGCATTCCTCATGTTGTTTTTGATAAGATCCATGCAATTGAATTCGAGTTAGAAAATCTTTTAGAGGAAGAGGAAATTTACTGGAAACAAAGGT
TTAGGGAGGAATGGTTAAAATGGGGAGATAGAAATTCCAGGGTATCATATTTTATTTTACCGTCGGGGGCTTGGGATTCCGAGAGGCTTAGGGAGGCAGTTATGGGGGAG
GATGTTGAGCTTATTAAGGGGATCCCAATCAATCAAAAATTAGAAGATAGGTTAGTTTGGCATTACGATAAATTGGGTAAATACTCTGTCAAAAGCGGTGTGAGTTTGGT
GCATGAGCGATCCGCCTGGGGTAATTTGCATGGGAGATCTGATGTTGCCACATGTCGCGCAGGGGAAATCCAATGGTTTAGCCGTTGGACAGTGCATCGGCATTTACCAT
GCGCCGCTAGAACTTCGATTTATTACCGTTTGATGGATTTAATTTTTCCGATTGCATGGGATTTAAACGTGGAAGACATTGCTGCCGAAGTGGTTGAGGAGGAAAATCCG
AAGGATCCAGAGGAACAGAATCCTGGACAGAATGATCCGATAGTCGAGAATCCGCAAGAAGTTCAAGAAAAACAAGCGGAGGATGTGCAAGAAATAGGAAACAAGCCGGA
GGTTCAAGAGCAAGAAGCTCAAGTCGAAGTTATTGTGCCAGAGGTTCCCCGTCGTCGCCATCGGAAGCAGAAAGCCGACCGCGTCAAGGTAATCCGAATAGATACTCCGT
CGCCGCCGACAACTGATTCCGAGAAAGAGAATGCGGAAAAGGAAGAACAGGAGAAAAAGGAGGCAAAAGACCAAGCAAGAGAAGAAACAGAAAAGAAAGCAGAGCAGGAA
ATTTTGCCCAAGCAGAGAGAAAACAAGGGCAAAGGTATTGCTGAGGCACCGGCTGAAGCTGAGGATATTGATACTGAGGAAACACGGTTGCCGTACAATCGCTTCATCAA
TAATCTTGCCCGAGCAAAGTATATAGAAATGTTGAAACGAGACTTTTTGTTTGAGCGGGGATTTGGTGATGATTTGCCACGTTTCTTGAGGACTGGAATAGCGAATCATG
GGTGGAGTCAGTTATGCGCAAAGCCGGAGCCGGTCAATTCCAATATTGTCCGTGAGTTTTATGCAAATATAGATGATCAAGAAGGATTTCAGGTTATCGTCCGAGGGGTG
CCCGTTGACTGGAGCCCAAGAGCAATCAATTCTCTCTTCAACCTTCAAGACTTTCCGCACGCAGGATTCAATGAGACGGTGGTAGCACCGTCTAACGATCAGTTAAATGC
GGCTGTCAGGGAGGCTGGCATCGAAGGGGCTCAATGGAGGCTGTCGAAGACGGAAAAGCGCACGTTCCAAACAGTTTATCTGAAAAGCGAAGCCAATACATGGATGGGCT
TCATTAAGTTGCGCTTGCTTCTGACAACTCACGATTCAATGGTGTCTCGAGATCGGGTGCTGCTGGTATTTGCTATTCTTCGTTCTATGAGTATTGATGTGGGCAAAATT
ATTTCCAATGAGATTTATGATTGCTGGCGGAAGAAGGTAGGGAAGCTGTTTTTCCCAAACATCATAACGATGCTATGTCAAAGGGCAGGGGTTCCTATGAATACAGACGA
TGTCACTTTAATGGACAAGGGAATAATCGACACACCAAACCTGGCTAGGCTTCAAAGGACACAAGAGGCACACCAAGGGGGTTTGGTGTGCGGCATCCATCAAATGCAGG
AGCAATTACAGATGCATTCCAGCAAGATGGAGTTTCCCGAAAGGCAATTCCAAACCTTCTGGAATTATGTGAAGAGAAGGGATGCCGCGTTGAGGGAGGCCTTGCAGTCT
AACTTTTCTAAACCATATCCGACCTTCCCAATATTCCCTGATGACCTACTGAACCCTTGGATTCCGCCACCACCAGTCGAGAGAGAGGGAGATGAAAAAGAAGATCCTGG
TCAGGAGGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCGATCAAGAATGCCCTCATTGGAGCATGGAGAACAAGAAAGCAGTTCTGTACTGATACAGTGGGAAAGAATACATTCCTTTTCAAATTCGAAGACCTTGATGACAG
GGAATGGGTAATCAATAATGGGCTAGGGCTTTTTTATAAGAAGCTGATAGTCTTGGAAGCACCAGGGGCAAATCAGAGGACAGCAGACTTGGCTTTCAAGAAAGCAGATT
TCTGGGTGAGATTATTCAATCTTCCTTTGGGTTATAGAAACAAAGTCACAGCAGAGAAAATTGGTAATATCTTAGGAGATTTTGTAGCAGCCGACATGGATGAAAATTTC
TGGGGAAATAGCGTTAGACTCAAAAAACCAGAATCTCCGGAAAAAGCCAACGATGAGAATTATGTTGGTGGTAATGTGAATAATGTTGACCTGGAAAATGTAGTTGAGTT
GGGAGAAGACATAGGGAGGAATGATAATCTGACTGAGGCAAAGTCAATAGAAACTGGAAGTCCGGATAGTCAAAAGGAAGAAGGGGAAATGGGGTGCGATTCCATCTATG
TCGAAGACCAGCGCCTAAATATTGTATATACAGACATGGAAATTCACCTAGAAAATGGGGGAAATCAGCTTAATAATGCTAGTAGTCAGAATAACTCGGAGAACATAGCT
ATTCATGGTAAGACTGGGAAAAAGAAGGCCACCTGGAAAAGAAAGGCAAGAAGCGTCACGAATATTGATTTGAATAAAGAAGCTTCTACACCGAAAAAGCGGAAGGGCGA
TTTTGTGGAAGAACATGGAGGCAAGAACCACATGGAAGCTCGAAATTTAGATTGGATCTATTCGGATCATAGACCTATTGAATTGGAGTTAGCTAACCGAAGGATCAATC
ATTCAAAGAGATGGAAAAGATCGTTCAAATTCGAGGAATTGTGGACCATATATGAAAATTGTGCAGAGATTATCTCAAAGAATGGAGAATGGTTAGGTGATATCTCTTCT
TTTTATCCTCTAAATCATAATCTTTCTAAATGTTCGGAGTCTCTAACTAAATGGGGTAAAGGGTTAAACAACCAACGAAAAAACAGAATTCTTGATTGTAAAAGAGCTCT
TAAAGAAGCTTACAGTAGCATTCCTCATGTTGTTTTTGATAAGATCCATGCAATTGAATTCGAGTTAGAAAATCTTTTAGAGGAAGAGGAAATTTACTGGAAACAAAGGT
TTAGGGAGGAATGGTTAAAATGGGGAGATAGAAATTCCAGGGTATCATATTTTATTTTACCGTCGGGGGCTTGGGATTCCGAGAGGCTTAGGGAGGCAGTTATGGGGGAG
GATGTTGAGCTTATTAAGGGGATCCCAATCAATCAAAAATTAGAAGATAGGTTAGTTTGGCATTACGATAAATTGGGTAAATACTCTGTCAAAAGCGGTGTGAGTTTGGT
GCATGAGCGATCCGCCTGGGGTAATTTGCATGGGAGATCTGATGTTGCCACATGTCGCGCAGGGGAAATCCAATGGTTTAGCCGTTGGACAGTGCATCGGCATTTACCAT
GCGCCGCTAGAACTTCGATTTATTACCGTTTGATGGATTTAATTTTTCCGATTGCATGGGATTTAAACGTGGAAGACATTGCTGCCGAAGTGGTTGAGGAGGAAAATCCG
AAGGATCCAGAGGAACAGAATCCTGGACAGAATGATCCGATAGTCGAGAATCCGCAAGAAGTTCAAGAAAAACAAGCGGAGGATGTGCAAGAAATAGGAAACAAGCCGGA
GGTTCAAGAGCAAGAAGCTCAAGTCGAAGTTATTGTGCCAGAGGTTCCCCGTCGTCGCCATCGGAAGCAGAAAGCCGACCGCGTCAAGGTAATCCGAATAGATACTCCGT
CGCCGCCGACAACTGATTCCGAGAAAGAGAATGCGGAAAAGGAAGAACAGGAGAAAAAGGAGGCAAAAGACCAAGCAAGAGAAGAAACAGAAAAGAAAGCAGAGCAGGAA
ATTTTGCCCAAGCAGAGAGAAAACAAGGGCAAAGGTATTGCTGAGGCACCGGCTGAAGCTGAGGATATTGATACTGAGGAAACACGGTTGCCGTACAATCGCTTCATCAA
TAATCTTGCCCGAGCAAAGTATATAGAAATGTTGAAACGAGACTTTTTGTTTGAGCGGGGATTTGGTGATGATTTGCCACGTTTCTTGAGGACTGGAATAGCGAATCATG
GGTGGAGTCAGTTATGCGCAAAGCCGGAGCCGGTCAATTCCAATATTGTCCGTGAGTTTTATGCAAATATAGATGATCAAGAAGGATTTCAGGTTATCGTCCGAGGGGTG
CCCGTTGACTGGAGCCCAAGAGCAATCAATTCTCTCTTCAACCTTCAAGACTTTCCGCACGCAGGATTCAATGAGACGGTGGTAGCACCGTCTAACGATCAGTTAAATGC
GGCTGTCAGGGAGGCTGGCATCGAAGGGGCTCAATGGAGGCTGTCGAAGACGGAAAAGCGCACGTTCCAAACAGTTTATCTGAAAAGCGAAGCCAATACATGGATGGGCT
TCATTAAGTTGCGCTTGCTTCTGACAACTCACGATTCAATGGTGTCTCGAGATCGGGTGCTGCTGGTATTTGCTATTCTTCGTTCTATGAGTATTGATGTGGGCAAAATT
ATTTCCAATGAGATTTATGATTGCTGGCGGAAGAAGGTAGGGAAGCTGTTTTTCCCAAACATCATAACGATGCTATGTCAAAGGGCAGGGGTTCCTATGAATACAGACGA
TGTCACTTTAATGGACAAGGGAATAATCGACACACCAAACCTGGCTAGGCTTCAAAGGACACAAGAGGCACACCAAGGGGGTTTGGTGTGCGGCATCCATCAAATGCAGG
AGCAATTACAGATGCATTCCAGCAAGATGGAGTTTCCCGAAAGGCAATTCCAAACCTTCTGGAATTATGTGAAGAGAAGGGATGCCGCGTTGAGGGAGGCCTTGCAGTCT
AACTTTTCTAAACCATATCCGACCTTCCCAATATTCCCTGATGACCTACTGAACCCTTGGATTCCGCCACCACCAGTCGAGAGAGAGGGAGATGAAAAAGAAGATCCTGG
TCAGGAGGATTAA
Protein sequenceShow/hide protein sequence
MAIKNALIGAWRTRKQFCTDTVGKNTFLFKFEDLDDREWVINNGLGLFYKKLIVLEAPGANQRTADLAFKKADFWVRLFNLPLGYRNKVTAEKIGNILGDFVAADMDENF
WGNSVRLKKPESPEKANDENYVGGNVNNVDLENVVELGEDIGRNDNLTEAKSIETGSPDSQKEEGEMGCDSIYVEDQRLNIVYTDMEIHLENGGNQLNNASSQNNSENIA
IHGKTGKKKATWKRKARSVTNIDLNKEASTPKKRKGDFVEEHGGKNHMEARNLDWIYSDHRPIELELANRRINHSKRWKRSFKFEELWTIYENCAEIISKNGEWLGDISS
FYPLNHNLSKCSESLTKWGKGLNNQRKNRILDCKRALKEAYSSIPHVVFDKIHAIEFELENLLEEEEIYWKQRFREEWLKWGDRNSRVSYFILPSGAWDSERLREAVMGE
DVELIKGIPINQKLEDRLVWHYDKLGKYSVKSGVSLVHERSAWGNLHGRSDVATCRAGEIQWFSRWTVHRHLPCAARTSIYYRLMDLIFPIAWDLNVEDIAAEVVEEENP
KDPEEQNPGQNDPIVENPQEVQEKQAEDVQEIGNKPEVQEQEAQVEVIVPEVPRRRHRKQKADRVKVIRIDTPSPPTTDSEKENAEKEEQEKKEAKDQAREETEKKAEQE
ILPKQRENKGKGIAEAPAEAEDIDTEETRLPYNRFINNLARAKYIEMLKRDFLFERGFGDDLPRFLRTGIANHGWSQLCAKPEPVNSNIVREFYANIDDQEGFQVIVRGV
PVDWSPRAINSLFNLQDFPHAGFNETVVAPSNDQLNAAVREAGIEGAQWRLSKTEKRTFQTVYLKSEANTWMGFIKLRLLLTTHDSMVSRDRVLLVFAILRSMSIDVGKI
ISNEIYDCWRKKVGKLFFPNIITMLCQRAGVPMNTDDVTLMDKGIIDTPNLARLQRTQEAHQGGLVCGIHQMQEQLQMHSSKMEFPERQFQTFWNYVKRRDAALREALQS
NFSKPYPTFPIFPDDLLNPWIPPPPVEREGDEKEDPGQED