; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg031070 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg031070
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionPolyglutamine tract-binding protein 1
Genome locationscaffold11:10028507..10044913
RNA-Seq ExpressionSpg031070
SyntenySpg031070
Gene Ontology termsGO:0005515 - protein binding (molecular function)
InterPro domainsIPR001202 - WW domain
IPR036020 - WW domain superfamily
IPR036691 - Endonuclease/exonuclease/phosphatase superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0053859.1 WW domain-containing protein [Cucumis melo var. makuwa]1.0e-6249.09Show/hide
Query:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF
        VEAKDP+SGVSYYYNE++GKSQWERPSE   ++QL SAVSLPEDWMEA+D+T                                                
Subjt:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF

Query:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS
                                                                       +GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG 
Subjt:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS

Query:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR
        WNDQTLEQ+KCITCG G+TLVQGSRYCN C+S VSTSST G WQDQSSE NKCMGCGGWGLGLVQAWGYCNHCTR
Subjt:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR

TYK25544.1 uncharacterized protein E5676_scaffold352G006960 [Cucumis melo var. makuwa]1.0e-6249.09Show/hide
Query:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF
        VEAKDP+SGVSYYYNE++GKSQWERPSE   ++QL SAVSLPEDWMEA+D+T                                                
Subjt:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF

Query:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS
                                                                       +GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG 
Subjt:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS

Query:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR
        WNDQTLEQ+KCITCG G+TLVQGSRYCN C+S VSTSST G WQDQSSE NKCMGCGGWGLGLVQAWGYCNHCTR
Subjt:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR

XP_016899708.1 PREDICTED: uncharacterized protein LOC103486911 isoform X1 [Cucumis melo]1.0e-6249.09Show/hide
Query:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF
        VEAKDP+SGVSYYYNE++GKSQWERPSE   ++QL SAVSLPEDWMEA+D+T                                                
Subjt:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF

Query:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS
                                                                       +GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG 
Subjt:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS

Query:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR
        WNDQTLEQ+KCITCG G+TLVQGSRYCN C+S VSTSST G WQDQSSE NKCMGCGGWGLGLVQAWGYCNHCTR
Subjt:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR

XP_016899711.1 PREDICTED: uncharacterized protein LOC103486911 isoform X2 [Cucumis melo]1.0e-6249.09Show/hide
Query:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF
        VEAKDP+SGVSYYYNE++GKSQWERPSE   ++QL SAVSLPEDWMEA+D+T                                                
Subjt:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF

Query:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS
                                                                       +GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG 
Subjt:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS

Query:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR
        WNDQTLEQ+KCITCG G+TLVQGSRYCN C+S VSTSST G WQDQSSE NKCMGCGGWGLGLVQAWGYCNHCTR
Subjt:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR

XP_016899712.1 PREDICTED: uncharacterized protein LOC103486911 isoform X3 [Cucumis melo]1.0e-6249.09Show/hide
Query:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF
        VEAKDP+SGVSYYYNE++GKSQWERPSE   ++QL SAVSLPEDWMEA+D+T                                                
Subjt:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF

Query:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS
                                                                       +GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG 
Subjt:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS

Query:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR
        WNDQTLEQ+KCITCG G+TLVQGSRYCN C+S VSTSST G WQDQSSE NKCMGCGGWGLGLVQAWGYCNHCTR
Subjt:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR

TrEMBL top hitse value%identityAlignment
A0A1S4DUP7 uncharacterized protein LOC103486911 isoform X34.9e-6349.09Show/hide
Query:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF
        VEAKDP+SGVSYYYNE++GKSQWERPSE   ++QL SAVSLPEDWMEA+D+T                                                
Subjt:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF

Query:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS
                                                                       +GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG 
Subjt:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS

Query:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR
        WNDQTLEQ+KCITCG G+TLVQGSRYCN C+S VSTSST G WQDQSSE NKCMGCGGWGLGLVQAWGYCNHCTR
Subjt:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR

A0A1S4DUQ3 uncharacterized protein LOC103486911 isoform X24.9e-6349.09Show/hide
Query:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF
        VEAKDP+SGVSYYYNE++GKSQWERPSE   ++QL SAVSLPEDWMEA+D+T                                                
Subjt:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF

Query:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS
                                                                       +GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG 
Subjt:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS

Query:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR
        WNDQTLEQ+KCITCG G+TLVQGSRYCN C+S VSTSST G WQDQSSE NKCMGCGGWGLGLVQAWGYCNHCTR
Subjt:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR

A0A1S4DVH1 uncharacterized protein LOC103486911 isoform X14.9e-6349.09Show/hide
Query:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF
        VEAKDP+SGVSYYYNE++GKSQWERPSE   ++QL SAVSLPEDWMEA+D+T                                                
Subjt:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF

Query:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS
                                                                       +GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG 
Subjt:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS

Query:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR
        WNDQTLEQ+KCITCG G+TLVQGSRYCN C+S VSTSST G WQDQSSE NKCMGCGGWGLGLVQAWGYCNHCTR
Subjt:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR

A0A5A7UK56 Polyglutamine tract-binding protein 14.9e-6349.09Show/hide
Query:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF
        VEAKDP+SGVSYYYNE++GKSQWERPSE   ++QL SAVSLPEDWMEA+D+T                                                
Subjt:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF

Query:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS
                                                                       +GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG 
Subjt:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS

Query:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR
        WNDQTLEQ+KCITCG G+TLVQGSRYCN C+S VSTSST G WQDQSSE NKCMGCGGWGLGLVQAWGYCNHCTR
Subjt:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR

A0A5D3DPP7 Polyglutamine tract-binding protein 14.9e-6349.09Show/hide
Query:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF
        VEAKDP+SGVSYYYNE++GKSQWERPSE   ++QL SAVSLPEDWMEA+D+T                                                
Subjt:  VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFF

Query:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS
                                                                       +GLKYYYN+R+ +TQWE PVASHQ TLTHSND VPG 
Subjt:  FCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGS

Query:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR
        WNDQTLEQ+KCITCG G+TLVQGSRYCN C+S VSTSST G WQDQSSE NKCMGCGGWGLGLVQAWGYCNHCTR
Subjt:  WNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G41020.1 WW domain-containing protein7.5e-1623.41Show/hide
Query:  SLKSWNRE----------------VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFI
        S  SWNR+                V+AKDP SG +YYYN+ TG  QWERP E  + +     V   E+W+E  DE                         
Subjt:  SLKSWNRE----------------VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFI

Query:  NWLRLRLPSSSSLCSYPLAPFFFFFCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQV
                                                                                               +G KY+YN R+ V
Subjt:  NWLRLRLPSSSSLCSYPLAPFFFFFCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQV

Query:  TQWEPPVASHQVTLTHSNDNVPGSWNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR
        +QWEPP +  +   T+SN                                     + V+ S+  GK +   S+L +C GCGGWG+GLVQ WGYC HCTR
Subjt:  TQWEPPVASHQVTLTHSNDNVPGSWNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR

AT2G41020.2 WW domain-containing protein7.5e-1623.41Show/hide
Query:  SLKSWNRE----------------VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFI
        S  SWNR+                V+AKDP SG +YYYN+ TG  QWERP E  + +     V   E+W+E  DE                         
Subjt:  SLKSWNRE----------------VEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATLRLLQLAPTSRASSSFVFVFI

Query:  NWLRLRLPSSSSLCSYPLAPFFFFFCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQV
                                                                                               +G KY+YN R+ V
Subjt:  NWLRLRLPSSSSLCSYPLAPFFFFFCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGLKYYYNVRSQV

Query:  TQWEPPVASHQVTLTHSNDNVPGSWNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR
        +QWEPP +  +   T+SN                                     + V+ S+  GK +   S+L +C GCGGWG+GLVQ WGYC HCTR
Subjt:  TQWEPPVASHQVTLTHSNDNVPGSWNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCATCTTCGAGAGCTAAGAATATCGAGCAAACTCGAGGGAAAGAAGATGACAAGCAGAAGCCGAAAGTGGGTGCTTTTCGATGTCCATATCCTTTTGGTCAAGAGAT
GATTGATTTGAGGTGGGTGTCAAGGGTGTATGGCCCTCCTAGGCCGAAAGGTAAAAGGCCTTTTTGGGAAGAGTTGGGGGATCTCTATGGCCTTTGTGGCTCGTTGTGGT
GTGTGGCTGGGGACTTCAATGTCATCTGTTCTCCTTTATCTTCGGGAGGGAGGGTGACCAAATCCATGAAAGTGTTTAATGACTTCATTGAAGGTAACATTAAATCAGGA
CCCTGCCCCTTTAGGTTTGAGAACATGTGGATGGACCATCCTTCATTCAAATCTTTCATTTCTGCATGGTGGAATATGGGAGTCCAGGGGAATTGGGTAGGGTTTCAATT
CATGGGGAAGCTTAGGGCGCTTAAGAATTCATTGAAATCTTGGAACCGTGAGGTGGAGGCTAAAGACCCTAATAGTGGTGTTTCATATTATTATAATGAAACTACTGGGA
AGAGTCAATGGGAAAGGCCCTCTGAATCCCCCTTCGAATCGCAACTTCCATCAGCTGTATCTCTTCCAGAAGATTGGATGGAGGCGCTTGATGAAACAACAGCCACGCTT
CGTCTTCTTCAACTTGCTCCAACCAGCCGCGCTTCGTCTAGCTTCGTCTTCGTCTTCATCAACTGGCTTCGTCTTCGTTTGCCATCTTCGTCTTCTCTTTGTAGTTATCC
GCTTGCTCCTTTCTTCTTCTTCTTCTGCTTCATCTTCGTTCGGAATGGCATGGATCTCCACGACCATTCCATTGCAACGAGGAAAGCATTCCTTCTGAAGCCTGGTTATG
ACCATCCACTTCCTGTCTGTCCTAAGTTCTTTGCTGAGATTAAGGCCAAGAAGGCGAGCTTTGTACTCAGTTTTGTTCCGTTAACAACTTTTCATTTGATTGCAGGCCTT
AAATACTACTACAATGTGAGAAGTCAGGTAACCCAGTGGGAGCCGCCTGTTGCATCTCATCAGGTAACTTTGACACACTCAAATGATAATGTTCCTGGGTCTTGGAACGA
TCAAACTTTGGAGCAAAATAAATGCATCACATGTGGAAGGGGAATCACCCTCGTGCAGGGTTCAAGATACTGCAATGGTTGTTCAAGTGAGGTTTCTACAAGTTCAACCA
TTGGGAAATGGCAGGATCAATCGTCTGAGCTAAATAAATGCATGGGATGTGGTGGTTGGGGACTTGGCCTTGTGCAAGCTTGGGGTTATTGCAATCATTGTACACGGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGCATCTTCGAGAGCTAAGAATATCGAGCAAACTCGAGGGAAAGAAGATGACAAGCAGAAGCCGAAAGTGGGTGCTTTTCGATGTCCATATCCTTTTGGTCAAGAGAT
GATTGATTTGAGGTGGGTGTCAAGGGTGTATGGCCCTCCTAGGCCGAAAGGTAAAAGGCCTTTTTGGGAAGAGTTGGGGGATCTCTATGGCCTTTGTGGCTCGTTGTGGT
GTGTGGCTGGGGACTTCAATGTCATCTGTTCTCCTTTATCTTCGGGAGGGAGGGTGACCAAATCCATGAAAGTGTTTAATGACTTCATTGAAGGTAACATTAAATCAGGA
CCCTGCCCCTTTAGGTTTGAGAACATGTGGATGGACCATCCTTCATTCAAATCTTTCATTTCTGCATGGTGGAATATGGGAGTCCAGGGGAATTGGGTAGGGTTTCAATT
CATGGGGAAGCTTAGGGCGCTTAAGAATTCATTGAAATCTTGGAACCGTGAGGTGGAGGCTAAAGACCCTAATAGTGGTGTTTCATATTATTATAATGAAACTACTGGGA
AGAGTCAATGGGAAAGGCCCTCTGAATCCCCCTTCGAATCGCAACTTCCATCAGCTGTATCTCTTCCAGAAGATTGGATGGAGGCGCTTGATGAAACAACAGCCACGCTT
CGTCTTCTTCAACTTGCTCCAACCAGCCGCGCTTCGTCTAGCTTCGTCTTCGTCTTCATCAACTGGCTTCGTCTTCGTTTGCCATCTTCGTCTTCTCTTTGTAGTTATCC
GCTTGCTCCTTTCTTCTTCTTCTTCTGCTTCATCTTCGTTCGGAATGGCATGGATCTCCACGACCATTCCATTGCAACGAGGAAAGCATTCCTTCTGAAGCCTGGTTATG
ACCATCCACTTCCTGTCTGTCCTAAGTTCTTTGCTGAGATTAAGGCCAAGAAGGCGAGCTTTGTACTCAGTTTTGTTCCGTTAACAACTTTTCATTTGATTGCAGGCCTT
AAATACTACTACAATGTGAGAAGTCAGGTAACCCAGTGGGAGCCGCCTGTTGCATCTCATCAGGTAACTTTGACACACTCAAATGATAATGTTCCTGGGTCTTGGAACGA
TCAAACTTTGGAGCAAAATAAATGCATCACATGTGGAAGGGGAATCACCCTCGTGCAGGGTTCAAGATACTGCAATGGTTGTTCAAGTGAGGTTTCTACAAGTTCAACCA
TTGGGAAATGGCAGGATCAATCGTCTGAGCTAAATAAATGCATGGGATGTGGTGGTTGGGGACTTGGCCTTGTGCAAGCTTGGGGTTATTGCAATCATTGTACACGGTAA
Protein sequenceShow/hide protein sequence
MASSRAKNIEQTRGKEDDKQKPKVGAFRCPYPFGQEMIDLRWVSRVYGPPRPKGKRPFWEELGDLYGLCGSLWCVAGDFNVICSPLSSGGRVTKSMKVFNDFIEGNIKSG
PCPFRFENMWMDHPSFKSFISAWWNMGVQGNWVGFQFMGKLRALKNSLKSWNREVEAKDPNSGVSYYYNETTGKSQWERPSESPFESQLPSAVSLPEDWMEALDETTATL
RLLQLAPTSRASSSFVFVFINWLRLRLPSSSSLCSYPLAPFFFFFCFIFVRNGMDLHDHSIATRKAFLLKPGYDHPLPVCPKFFAEIKAKKASFVLSFVPLTTFHLIAGL
KYYYNVRSQVTQWEPPVASHQVTLTHSNDNVPGSWNDQTLEQNKCITCGRGITLVQGSRYCNGCSSEVSTSSTIGKWQDQSSELNKCMGCGGWGLGLVQAWGYCNHCTR