; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg012405 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg012405
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionTransmembrane protein
Genome locationscaffold1:9656472..9658537
RNA-Seq ExpressionSpg012405
SyntenySpg012405
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG7034392.1 hypothetical protein SDJN02_04119, partial [Cucurbita argyrosperma subsp. argyrosperma]3.4e-9984.21Show/hide
Query:  RTAEPMFRALELPPPCPAAKHNPIHALPSDVKLCR-LPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNV
        RT E MFRALEL PPCPAAK + +HA PSDVKL R  PYNL+LPNR+LSLLS+RAQS+S+ PSDPSTS RYTETI HSSPAF+QFSQ TLTQRH+LVLNV
Subjt:  RTAEPMFRALELPPPCPAAKHNPIHALPSDVKLCR-LPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNV

Query:  VACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPS--VQ
        VACATAI+ATWLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTM AIRLSGMEISDLTMELSDLGQ+ITQGVRSSTRAVRVAEERLR LTNM P+  VQ
Subjt:  VACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPS--VQ

Query:  EMTVTNL-EVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL
        EMTV NL  V+AA PVLAK ARDIKEGIVKGRSIFQLFLSLTR  RL
Subjt:  EMTVTNL-EVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL

XP_008440148.1 PREDICTED: uncharacterized protein LOC103484701 isoform X1 [Cucumis melo]7.6e-9984.43Show/hide
Query:  TAEPMFRALELPPPCPAAKHNPIHALPSDVKLCRLPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVA
        TAE MF ALE+ PPCPAAK N + ALPS+ K  RLPYNL LPNRRLSLLS+RAQSL    SDPSTSSRYT+TI +SSPAFLQF + TLTQRHILVLNVVA
Subjt:  TAEPMFRALELPPPCPAAKHNPIHALPSDVKLCRLPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVA

Query:  CATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNP--SVQEM
        CATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTM AIRLSGMEISDLTMELSDLGQ+ITQGVRSSTRAVRVAEERLRRLTNM+P  SVQEM
Subjt:  CATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNP--SVQEM

Query:  TVTNLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL
        T+TNL VK A PVLAK ARDIKEGIVKGRSIFQLFLS+TR  RL
Subjt:  TVTNLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL

XP_008440149.1 PREDICTED: uncharacterized protein LOC103484701 isoform X2 [Cucumis melo]5.3e-10084.71Show/hide
Query:  TAEPMFRALELPPPCPAAKHNPIHALPSDVKLCRLPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVA
        TAE MF ALE+ PPCPAAK N + ALPS+ K  RLPYNL LPNRRLSLLS+RAQSL    SDPSTSSRYT+TI +SSPAFLQF + TLTQRHILVLNVVA
Subjt:  TAEPMFRALELPPPCPAAKHNPIHALPSDVKLCRLPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVA

Query:  CATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPSVQEMTV
        CATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTM AIRLSGMEISDLTMELSDLGQ+ITQGVRSSTRAVRVAEERLRRLTNM+P+VQEMT+
Subjt:  CATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPSVQEMTV

Query:  TNLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL
        TNL VK A PVLAK ARDIKEGIVKGRSIFQLFLS+TR  RL
Subjt:  TNLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL

XP_022977945.1 uncharacterized protein LOC111478086 isoform X1 [Cucurbita maxima]9.7e-10287.14Show/hide
Query:  MFRALELPPPCPAAKHNPIHALPSDVKLCR-LPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACAT
        MFRALEL PPCPAAKH  +HA PSDVKLCR  P+NLRLPNRRLSLLS+RAQSLS+ PSDPSTS RYTETI HSSPAF+QFSQ TLTQRHILVLNVVACAT
Subjt:  MFRALELPPPCPAAKHNPIHALPSDVKLCR-LPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACAT

Query:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPS--VQEMTVT
        AI+ATWLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTM AIRLSGMEISDLTMELSDLGQ ITQGVRSSTRAVRVAEERLR LTNM P+  VQEMTV 
Subjt:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPS--VQEMTVT

Query:  NLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL
        NL V+AA PVLAK ARDIKEGIVKGRSIFQLFLSLTR  RL
Subjt:  NLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL

XP_038880851.1 uncharacterized protein LOC120072535 [Benincasa hispida]8.2e-10187.08Show/hide
Query:  MFRALELPPPCPAAKHNPIHALPSDVKLCRLPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACATA
        MFRALELPPPCPAAK N +HALPSDVK CRLPY+L LPNRRLSLL +RAQSL    SDPSTSSRYTETI HSSPAFLQFSQ TLTQ HI VLNVVACATA
Subjt:  MFRALELPPPCPAAKHNPIHALPSDVKLCRLPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACATA

Query:  ISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNP--SVQEMTVTN
        ISATWLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTM AIRLSGMEISDLTMEL+DLGQ+ITQGVRSSTRAVRVAE+RLRRLTNM P  SVQEMTVTN
Subjt:  ISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNP--SVQEMTVTN

Query:  LEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL
        L V+ A PVLAK ARDIKEGIVKGRSIFQLFLSLTR  RL
Subjt:  LEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL

TrEMBL top hitse value%identityAlignment
A0A1S3B011 uncharacterized protein LOC103484701 isoform X22.6e-10084.71Show/hide
Query:  TAEPMFRALELPPPCPAAKHNPIHALPSDVKLCRLPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVA
        TAE MF ALE+ PPCPAAK N + ALPS+ K  RLPYNL LPNRRLSLLS+RAQSL    SDPSTSSRYT+TI +SSPAFLQF + TLTQRHILVLNVVA
Subjt:  TAEPMFRALELPPPCPAAKHNPIHALPSDVKLCRLPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVA

Query:  CATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPSVQEMTV
        CATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTM AIRLSGMEISDLTMELSDLGQ+ITQGVRSSTRAVRVAEERLRRLTNM+P+VQEMT+
Subjt:  CATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPSVQEMTV

Query:  TNLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL
        TNL VK A PVLAK ARDIKEGIVKGRSIFQLFLS+TR  RL
Subjt:  TNLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL

A0A1S3B164 uncharacterized protein LOC103484701 isoform X13.7e-9984.43Show/hide
Query:  TAEPMFRALELPPPCPAAKHNPIHALPSDVKLCRLPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVA
        TAE MF ALE+ PPCPAAK N + ALPS+ K  RLPYNL LPNRRLSLLS+RAQSL    SDPSTSSRYT+TI +SSPAFLQF + TLTQRHILVLNVVA
Subjt:  TAEPMFRALELPPPCPAAKHNPIHALPSDVKLCRLPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVA

Query:  CATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNP--SVQEM
        CATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTM AIRLSGMEISDLTMELSDLGQ+ITQGVRSSTRAVRVAEERLRRLTNM+P  SVQEM
Subjt:  CATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNP--SVQEM

Query:  TVTNLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL
        T+TNL VK A PVLAK ARDIKEGIVKGRSIFQLFLS+TR  RL
Subjt:  TVTNLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL

A0A6J1BT62 uncharacterized protein LOC111005326 isoform X23.1e-9881.78Show/hide
Query:  MFRALELPPPCPAAKHNPIHALPSDVKLCRLPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACATA
        MF AL+LPPPC AAKHN   ALPS VKLCRL YNLR PNRRL+L S+R+QSLS+ PSDPSTSS YTETI HSSPA LQ SQW LTQRHILVLNVVACATA
Subjt:  MFRALELPPPCPAAKHNPIHALPSDVKLCRLPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACATA

Query:  ISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPSVQEMTVTNLE
        ISATWLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTM AIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRV EERLR LTNM P+VQEMTVT+ +
Subjt:  ISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPSVQEMTVTNLE

Query:  VKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERLIASAGEGKG
        V AA PVLAK AR IKEGIVKGRSIF+LFL+LTR   L  +   G+G
Subjt:  VKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERLIASAGEGKG

A0A6J1GDG4 uncharacterized protein LOC111452981 isoform X18.2e-9985.54Show/hide
Query:  MFRALELPPPCPAAKHNPIHALPSDVKLCR-LPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACAT
        MFRALEL PPCPAAKH+ +HA PSDVKL R  PYNLRLPNRRLSLLS+RAQSLS+ PSDPSTS RYTETI HSSPA++QFSQ TLTQRH+LVLNVVACAT
Subjt:  MFRALELPPPCPAAKHNPIHALPSDVKLCR-LPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACAT

Query:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPS--VQEMTVT
        AI+ATWLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTM AIRLSGMEISDLTMELSDLGQ+ITQGVRSSTRAVRVAEERLR LTNM P+  VQEMTV 
Subjt:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPS--VQEMTVT

Query:  NL-EVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL
        NL  V+AA PVLAK ARDIK GIVKGRSIFQLFLSLTR  RL
Subjt:  NL-EVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL

A0A6J1INQ2 uncharacterized protein LOC111478086 isoform X14.7e-10287.14Show/hide
Query:  MFRALELPPPCPAAKHNPIHALPSDVKLCR-LPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACAT
        MFRALEL PPCPAAKH  +HA PSDVKLCR  P+NLRLPNRRLSLLS+RAQSLS+ PSDPSTS RYTETI HSSPAF+QFSQ TLTQRHILVLNVVACAT
Subjt:  MFRALELPPPCPAAKHNPIHALPSDVKLCR-LPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACAT

Query:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPS--VQEMTVT
        AI+ATWLFCSAIPTLLAFKRAAESLEKLMDVTREE+PGTM AIRLSGMEISDLTMELSDLGQ ITQGVRSSTRAVRVAEERLR LTNM P+  VQEMTV 
Subjt:  AISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPS--VQEMTVT

Query:  NLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL
        NL V+AA PVLAK ARDIKEGIVKGRSIFQLFLSLTR  RL
Subjt:  NLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERL

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G08530.1 unknown protein3.1e-2143.31Show/hide
Query:  TLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLR
        +L+ +  L+L  + C T+++ T L  +AIPTL+A  RAA S  KL D  R+E+P T+ A+RLSGMEISDLT+ELSDL Q+IT G+  S +AV+ AE  ++
Subjt:  TLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLR

Query:  RLTNMNPSVQEMTVTNLEVKAAGPVLA
        ++  +    Q+ T++ +E +A  P ++
Subjt:  RLTNMNPSVQEMTVTNLEVKAAGPVLA

AT5G09995.1 unknown protein5.0e-4867.53Show/hide
Query:  RLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEI
        RL     S+ S    +L + P +PS SS+ T ++       LQ SQWT TQ+H ++LNVVAC TAISA+WLF +AIPTLLAFK+AAESLEKL+DVTREE+
Subjt:  RLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEI

Query:  PGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNP
        P TM A+RLSGMEISDLTMELSDLGQ ITQGV+SSTRA+RVAE+RLRRLTNMNP
Subjt:  PGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNP

AT5G09995.2 unknown protein3.1e-5860.58Show/hide
Query:  RLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEI
        RL     S+ S    +L + P +PS SS+ T ++       LQ SQWT TQ+H ++LNVVAC TAISA+WLF +AIPTLLAFK+AAESLEKL+DVTREE+
Subjt:  RLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEI

Query:  PGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNP--SVQEMTVTNLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLT
        P TM A+RLSGMEISDLTMELSDLGQ ITQGV+SSTRA+RVAE+RLRRLTNMNP  S+QE+ +   +     P+LAK AR  +EG+VKGRS++QLF ++T
Subjt:  PGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNP--SVQEMTVTNLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLT

Query:  RQERLIAS
        R  ++  S
Subjt:  RQERLIAS

AT5G09995.3 unknown protein3.7e-5959.71Show/hide
Query:  RLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEI
        RL     S+ S    +L + P +PS SS+ T ++       LQ SQWT TQ+H ++LNVVAC TAISA+WLF +AIPTLLAFK+AAESLEKL+DVTREE+
Subjt:  RLPNRRLSLLSLRAQSLSTLPSDPSTSSRYTETIAHSSPAFLQFSQWTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEI

Query:  PGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPSVQEMTVTNLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQ
        P TM A+RLSGMEISDLTMELSDLGQ ITQGV+SSTRA+RVAE+RLRRLTNMNP+  +  +   +     P+LAK AR  +EG+VKGRS++QLF ++TR 
Subjt:  PGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRVAEERLRRLTNMNPSVQEMTVTNLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQ

Query:  ERLIAS
         ++  S
Subjt:  ERLIAS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
TATTTTGGTTCTGGGCCTCTTCACACACTGGATTACAAGCAGCAGGTCTCTTGGGAGGCCCACAATCTTCACATCCTCTCGGCTGAGTCCCAGCTTCCGAGTCGCTTTGC
TGTAGCAAGAACAGCAGAACCAATGTTCAGAGCTTTGGAACTACCGCCACCGTGCCCGGCGGCGAAGCATAATCCCATTCACGCACTGCCAAGTGACGTTAAACTCTGCC
GACTACCTTACAATCTCAGGCTGCCAAATCGACGACTTTCTTTGCTTTCGCTACGTGCACAATCGCTGTCGACATTGCCGTCTGATCCATCGACTTCATCGCGTTATACG
GAGACTATTGCGCATTCTTCTCCTGCATTTCTTCAATTCTCTCAGTGGACGCTAACTCAACGCCACATCCTTGTTCTTAATGTCGTTGCCTGCGCGACGGCTATTTCTGC
AACCTGGCTCTTTTGTTCTGCAATCCCCACTCTTCTGGCATTCAAGAGAGCAGCCGAATCATTAGAGAAGCTCATGGATGTAACAAGGGAGGAAATTCCAGGCACTATGG
TAGCCATTCGGTTATCTGGCATGGAAATCAGTGATCTGACCATGGAACTCAGTGATCTTGGCCAGGAAATCACCCAAGGTGTGAGAAGTTCAACCAGAGCTGTTCGAGTA
GCCGAAGAGAGATTGCGTCGCTTGACAAACATGAATCCATCAGTGCAGGAAATGACAGTAACCAATCTGGAAGTGAAGGCAGCAGGACCAGTTCTGGCTAAAACGGCAAG
GGACATTAAGGAGGGGATTGTGAAAGGCCGTTCCATCTTCCAATTATTTCTGTCTCTTACAAGGCAGGAGAGGCTAATAGCTTCTGCGGGCGAGGGCAAAGGAATTTTTA
AGTTGTGA
mRNA sequenceShow/hide mRNA sequence
TATTTTGGTTCTGGGCCTCTTCACACACTGGATTACAAGCAGCAGGTCTCTTGGGAGGCCCACAATCTTCACATCCTCTCGGCTGAGTCCCAGCTTCCGAGTCGCTTTGC
TGTAGCAAGAACAGCAGAACCAATGTTCAGAGCTTTGGAACTACCGCCACCGTGCCCGGCGGCGAAGCATAATCCCATTCACGCACTGCCAAGTGACGTTAAACTCTGCC
GACTACCTTACAATCTCAGGCTGCCAAATCGACGACTTTCTTTGCTTTCGCTACGTGCACAATCGCTGTCGACATTGCCGTCTGATCCATCGACTTCATCGCGTTATACG
GAGACTATTGCGCATTCTTCTCCTGCATTTCTTCAATTCTCTCAGTGGACGCTAACTCAACGCCACATCCTTGTTCTTAATGTCGTTGCCTGCGCGACGGCTATTTCTGC
AACCTGGCTCTTTTGTTCTGCAATCCCCACTCTTCTGGCATTCAAGAGAGCAGCCGAATCATTAGAGAAGCTCATGGATGTAACAAGGGAGGAAATTCCAGGCACTATGG
TAGCCATTCGGTTATCTGGCATGGAAATCAGTGATCTGACCATGGAACTCAGTGATCTTGGCCAGGAAATCACCCAAGGTGTGAGAAGTTCAACCAGAGCTGTTCGAGTA
GCCGAAGAGAGATTGCGTCGCTTGACAAACATGAATCCATCAGTGCAGGAAATGACAGTAACCAATCTGGAAGTGAAGGCAGCAGGACCAGTTCTGGCTAAAACGGCAAG
GGACATTAAGGAGGGGATTGTGAAAGGCCGTTCCATCTTCCAATTATTTCTGTCTCTTACAAGGCAGGAGAGGCTAATAGCTTCTGCGGGCGAGGGCAAAGGAATTTTTA
AGTTGTGA
Protein sequenceShow/hide protein sequence
YFGSGPLHTLDYKQQVSWEAHNLHILSAESQLPSRFAVARTAEPMFRALELPPPCPAAKHNPIHALPSDVKLCRLPYNLRLPNRRLSLLSLRAQSLSTLPSDPSTSSRYT
ETIAHSSPAFLQFSQWTLTQRHILVLNVVACATAISATWLFCSAIPTLLAFKRAAESLEKLMDVTREEIPGTMVAIRLSGMEISDLTMELSDLGQEITQGVRSSTRAVRV
AEERLRRLTNMNPSVQEMTVTNLEVKAAGPVLAKTARDIKEGIVKGRSIFQLFLSLTRQERLIASAGEGKGIFKL