CuGenDBv2

Spg020203 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg020203
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: LOW QUALITY PROTEIN: uncharacterized protein LOC110414781
Genome location: scaffold1:22914481..22916302
RNA-Seq Expression: Spg020203
Synteny: Spg020203
Gene Ontology terms:
GO:0031326 - regulation of cellular biosynthetic process (biological process)
GO:0072593 - reactive oxygen species metabolic process (biological process)
GO:0005737 - cytoplasm (cellular component)
GO:0016667 - oxidoreductase activity, acting on a sulfur group of donors (molecular function)
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAG7027627.1 hypothetical protein SDJN02_11642, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.2e-102; % identity: 75.43)
Query:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD
        MAEFPCSLERTVASALLLLSTSPPPPPSPP          SV QD+WLSEE  +G  F REI+   DYSKSCSS+LT+SDESSET A+EP LFST AYRD
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD

Query:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETSCLSSSSSVVTSAPIHRLVTRAEKKLE
        +L LH                                 VVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSE+SCLSSSSSVVTSAPIHRLVTRAEKKLE
Subjt:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETSCLSSSSSVVTSAPIHRLVTRAEKKLE

Query:  MIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTVQDTLNSQLPLYLM
        MIRH WRK+ VA+AHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYIYTV DTLNSQLPLYL+
Subjt:  MIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTVQDTLNSQLPLYLM

XP_022924988.1 uncharacterized protein LOC111432371 [Cucurbita moschata] (e value: 3.9e-96; % identity: 74.64)
Query:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD
        MAEFPCSLERTVASALLLLSTSPPPPPSPP          SV QD+WLSEE  +G  F REI+   DYSKSCSS+LT+SDESSET A+EP LFST AYRD
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD

Query:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETSCLSSSSSVVTSAPIHRLVTRAEKKLE
        +L LH                                 VVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSE+SCLSSSSSVVTSAPIHRLVTRAEKKLE
Subjt:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETSCLSSSSSVVTSAPIHRLVTRAEKKLE

Query:  MIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV
        MIRH WRK+ VA+AHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYIYT+
Subjt:  MIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV

XP_022967938.1 uncharacterized protein LOC111467304 [Cucurbita maxima] (e value: 2.6e-92; % identity: 74.01)
Query:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD
        MAEFPC+LERTVASALLLLSTSPPPP SPPPSP +      +SQD+WL EEKI+G K S E+S FCD SKSCSS+LT+SDESSET AQE  LFSTSAYRD
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD

Query:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETS-CLSSSSSVVTSAPIHRLVTRAEKKL
        EL L+                                 VVRKSRS+ +RIS NRNLT TDDVTLSSGS SSET+ CLSSSSSV TSAPI RLVTRAEKKL
Subjt:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETS-CLSSSSSVVTSAPIHRLVTRAEKKL

Query:  EMIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV
        EMIRHAWRKK VASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYT+
Subjt:  EMIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV

XP_023518044.1 uncharacterized protein LOC111781591 [Cucurbita pepo subsp. pepo] (e value: 2.0e-95; % identity: 74.28)
Query:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD
        MAEFPCSLERTVASALLLLSTSPPPPPSPP          SV QD+WLSEE  +G  F REI+   DYSKSCSS+LT+SDESSET A+EP L ST AYRD
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD

Query:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETSCLSSSSSVVTSAPIHRLVTRAEKKLE
        +L LH                                 VVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSE+SCLSSSSSVVTSAPIHRLVTRAEKKLE
Subjt:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETSCLSSSSSVVTSAPIHRLVTRAEKKLE

Query:  MIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV
        MIRH WRK+ VA+AHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYIYT+
Subjt:  MIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV

XP_023544086.1 uncharacterized protein LOC111803781 [Cucurbita pepo subsp. pepo] (e value: 2.4e-93; % identity: 74.37)
Query:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD
        MAEFPC+LERTVASALLLLSTSPPPPPSPPPSP +      +SQD+WL EEKI+G K S E+S FCD SKSCSS+LT+SDESSET AQE  LFSTSAYRD
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD

Query:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETS-CLSSSSSVVTSAPIHRLVTRAEKKL
        EL L+                                 VVRKSRS+ +RIS NRNLT TDDVTLSSGS SSET+ CLSSSSSV TSAPI RLVTRAEKKL
Subjt:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETS-CLSSSSSVVTSAPIHRLVTRAEKKL

Query:  EMIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV
        EMIRHAWRKK VASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYT+
Subjt:  EMIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV

TrEMBL top hits (e value, % identity, alignment)
A0A1S4E302 uncharacterized protein LOC103499533 (e value: 1.9e-80; % identity: 67.38)
Query:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD
        MAEFP  LERTVASALLLLS SP PP +P            +S+D+WL E+ I G K SREISAFCDYS + SSILT SD SS T   E  LF T   R 
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD

Query:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSE-TSCLSSSSSVVTSAPIHRLVTRAEKKL
        +L L+                                 VVRKSRSKL+RISENRNL+STD+VTLSSGS SSE TSCLSSSSSVVTSAPIHRLVTRAEKKL
Subjt:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSE-TSCLSSSSSVVTSAPIHRLVTRAEKKL

Query:  EMIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTVQDTLN
        EMIRHAWRKKQ+ASAHMRRRAEAILSYLS GCSSEVKIRQV+GDSPDTSKALR+LLKLEEIKRSGTGGRQDPY+Y VQ  LN
Subjt:  EMIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTVQDTLN

A0A6J1DFG0 uncharacterized protein LOC111020004 (e value: 1.3e-89; % identity: 73.19)
Query:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD
        MAEFPCSLER+VASALLLLSTSPPPPP  PPS        SVS+D+WL E KI G K SRE+ AFCDYSKSCSSILT  DESS+T  QEP LFSTSAY D
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD

Query:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETSCLSSSSSVVTSAPIHRLVTRAEKKLE
        EL L+                                 VVRKSRSKLIRISENRN +S DD TLSSGS SSETSCLSSSS+VVTSAP  RLVTRAEKKLE
Subjt:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETSCLSSSSSVVTSAPIHRLVTRAEKKLE

Query:  MIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV
        MIRH WRKKQVASAHMRRRAEAIL YLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIY +
Subjt:  MIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV

A0A6J1EE02 uncharacterized protein LOC111432371 (e value: 1.9e-96; % identity: 74.64)
Query:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD
        MAEFPCSLERTVASALLLLSTSPPPPPSPP          SV QD+WLSEE  +G  F REI+   DYSKSCSS+LT+SDESSET A+EP LFST AYRD
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD

Query:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETSCLSSSSSVVTSAPIHRLVTRAEKKLE
        +L LH                                 VVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSE+SCLSSSSSVVTSAPIHRLVTRAEKKLE
Subjt:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETSCLSSSSSVVTSAPIHRLVTRAEKKLE

Query:  MIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV
        MIRH WRK+ VA+AHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALR+LLKLEEIKRSGTGGRQDPYIYT+
Subjt:  MIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV

A0A6J1EL58 uncharacterized protein LOC111435559 (e value: 1.3e-92; % identity: 74.01)
Query:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD
        MAEFPC+LERTVASALLLLSTSPPPPPSP PSP +      +SQD+WL EEKI+G K S E+S FCD SKSCSS+LT+SDESSET AQE  LFSTSAYRD
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD

Query:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETS-CLSSSSSVVTSAPIHRLVTRAEKKL
        EL L+                                 VVRKSRS+ +RIS NRNLT TDDVTLSSGS SSET+ CLSSSSSV TSAPI RLVTRAEKKL
Subjt:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETS-CLSSSSSVVTSAPIHRLVTRAEKKL

Query:  EMIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV
        EMIRHAWRKK VASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYT+
Subjt:  EMIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV

A0A6J1HWL4 uncharacterized protein LOC111467304 (e value: 1.3e-92; % identity: 74.01)
Query:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD
        MAEFPC+LERTVASALLLLSTSPPPP SPPPSP +      +SQD+WL EEKI+G K S E+S FCD SKSCSS+LT+SDESSET AQE  LFSTSAYRD
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRD

Query:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETS-CLSSSSSVVTSAPIHRLVTRAEKKL
        EL L+                                 VVRKSRS+ +RIS NRNLT TDDVTLSSGS SSET+ CLSSSSSV TSAPI RLVTRAEKKL
Subjt:  ELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETS-CLSSSSSVVTSAPIHRLVTRAEKKL

Query:  EMIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV
        EMIRHAWRKK VASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYT+
Subjt:  EMIRHAWRKKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT3G57440.1 unknown protein (e value: 2.9e-20; % identity: 33.68)
Query:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSIL--TKSDESSETIAQEPFLFSTSAY
        MA +P  +ERTVAS+LLLLS  P                       + S ++   V+ S  +  +C    S  S++  +    S  +       F  S  
Subjt:  MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSIL--TKSDESSETIAQEPFLFSTSAY

Query:  RDELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISEN-----RNLTSTDDVTLSSGSVSSETSCLSSSSSVVTSAPIHRLVT
        R+           +     F    F              +  RK RS++I  S N       +    DV  +    S + SCLS+ SS V+S    R+  
Subjt:  RDELNLHVIVLSFSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISEN-----RNLTSTDDVTLSSGSVSSETSCLSSSSSVVTSAPIHRLVT

Query:  RAEKKLEMIRHAWR--KKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV
        R +K  E +R   +  K+   S+ +RRRA+ IL +LS   SSEV IRQ+LGDSPDTSKALRMLLK+EE+KR GTGGR DP+IY +
Subjt:  RAEKKLEMIRHAWR--KKQVASAHMRRRAEAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTV


Sequences
CDS sequence
ATGGCAGAGTTCCCTTGCTCTCTAGAACGCACCGTCGCTTCTGCTCTGCTCCTCCTCTCCACTTCGCCGCCTCCTCCACCTTCTCCTCCGCCATCTCCGCCTATGTATGA
TATTGGATCATCGGTTTCTCAAGACCAGTGGTTGTCTGAGGAGAAAATTGTCGGAGTAAAATTCTCCAGAGAGATATCGGCGTTTTGTGATTATTCGAAGTCTTGCTCTT
CGATACTCACTAAATCAGATGAATCGTCCGAGACTATAGCGCAGGAGCCGTTCTTGTTCTCTACTTCGGCTTATCGCGACGAGCTAAATCTTCATGTAATTGTTCTATCC
TTCTCGTCCGTTTTAGCTGATTTCTGCTTCTTTGAGTTTGTTCTTCTTGTATGTTGTTTGATGTGTCTAAGAAACTCCGCACAGGTCGTGAGAAAGAGTCGTTCGAAGTT
AATACGGATATCCGAGAACCGGAATCTCACTTCTACAGACGACGTTACCCTGTCTTCAGGCTCCGTATCCTCGGAGACTTCTTGTTTGTCAAGCAGCTCAAGCGTGGTCA
CAAGTGCACCAATCCATCGCCTGGTTACCAGAGCAGAGAAGAAGTTAGAAATGATTCGTCACGCGTGGAGGAAAAAGCAGGTCGCATCGGCTCATATGCGCCGGCGGGCC
GAAGCCATTCTTAGCTACCTCTCTGGTGGATGTTCCTCCGAAGTGAAGATACGCCAAGTGCTTGGCGACAGCCCTGACACAAGCAAGGCTCTCAGAATGCTGTTGAAACT
GGAAGAGATCAAAAGATCCGGAACAGGTGGGCGCCAAGATCCCTATATTTACACGGTACAAGACACTCTCAACTCCCAGCTTCCTCTCTACTTGATGGCTTCAAAGCAAG
CAAGTCCAACAAAACCCATTAATTTACACAGATTTACTTACCAAAATTTTCCGGGAAAATTGAACATGTTTCTTTTCTTTCCATTTCCCCTTTTGCAGATTGCTTGA
mRNA sequence
ATGGCAGAGTTCCCTTGCTCTCTAGAACGCACCGTCGCTTCTGCTCTGCTCCTCCTCTCCACTTCGCCGCCTCCTCCACCTTCTCCTCCGCCATCTCCGCCTATGTATGA
TATTGGATCATCGGTTTCTCAAGACCAGTGGTTGTCTGAGGAGAAAATTGTCGGAGTAAAATTCTCCAGAGAGATATCGGCGTTTTGTGATTATTCGAAGTCTTGCTCTT
CGATACTCACTAAATCAGATGAATCGTCCGAGACTATAGCGCAGGAGCCGTTCTTGTTCTCTACTTCGGCTTATCGCGACGAGCTAAATCTTCATGTAATTGTTCTATCC
TTCTCGTCCGTTTTAGCTGATTTCTGCTTCTTTGAGTTTGTTCTTCTTGTATGTTGTTTGATGTGTCTAAGAAACTCCGCACAGGTCGTGAGAAAGAGTCGTTCGAAGTT
AATACGGATATCCGAGAACCGGAATCTCACTTCTACAGACGACGTTACCCTGTCTTCAGGCTCCGTATCCTCGGAGACTTCTTGTTTGTCAAGCAGCTCAAGCGTGGTCA
CAAGTGCACCAATCCATCGCCTGGTTACCAGAGCAGAGAAGAAGTTAGAAATGATTCGTCACGCGTGGAGGAAAAAGCAGGTCGCATCGGCTCATATGCGCCGGCGGGCC
GAAGCCATTCTTAGCTACCTCTCTGGTGGATGTTCCTCCGAAGTGAAGATACGCCAAGTGCTTGGCGACAGCCCTGACACAAGCAAGGCTCTCAGAATGCTGTTGAAACT
GGAAGAGATCAAAAGATCCGGAACAGGTGGGCGCCAAGATCCCTATATTTACACGGTACAAGACACTCTCAACTCCCAGCTTCCTCTCTACTTGATGGCTTCAAAGCAAG
CAAGTCCAACAAAACCCATTAATTTACACAGATTTACTTACCAAAATTTTCCGGGAAAATTGAACATGTTTCTTTTCTTTCCATTTCCCCTTTTGCAGATTGCTTGA
Protein sequence
MAEFPCSLERTVASALLLLSTSPPPPPSPPPSPPMYDIGSSVSQDQWLSEEKIVGVKFSREISAFCDYSKSCSSILTKSDESSETIAQEPFLFSTSAYRDELNLHVIVLS
FSSVLADFCFFEFVLLVCCLMCLRNSAQVVRKSRSKLIRISENRNLTSTDDVTLSSGSVSSETSCLSSSSSVVTSAPIHRLVTRAEKKLEMIRHAWRKKQVASAHMRRRA
EAILSYLSGGCSSEVKIRQVLGDSPDTSKALRMLLKLEEIKRSGTGGRQDPYIYTVQDTLNSQLPLYLMASKQASPTKPINLHRFTYQNFPGKLNMFLFFPFPLLQIA