; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg025559 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg025559
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold13:28566802..28575388
RNA-Seq ExpressionSpg025559
SyntenySpg025559
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus]2.2e-2529.12Show/hide
Query:  RFINDRARAKY-LDMVKIDFLFERGF---SDDLPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVRGVAVDWSPGAINSLFNLQDFP
        RF++  A  +Y   +V    + ERGF    +     +   +   +W  F A PES    +V +FYAN  E +  + +VRG  V +    IN L+N+    
Subjt:  RFINDRARAKY-LDMVKIDFLFERGF---SDDLPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVRGVAVDWSPGAINSLFNLQDFP

Query:  LAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFAILRSLGIDVGKVISNEIFNSW
        L  +       +        + +   GAQW+++K E  +F++  L   A  WL FI  R+LPT H   V+ DR LL++ I+     DVGK+IS+ I  S 
Subjt:  LAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFAILRSLGIDVGKVISNEIFNSW

Query:  RKTVRKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRLQRTQEFRQGGLVCGIHQILEQLQLSASRQEY---AERQAQTYWTY--AKWRD-AT
          +   L+FP+ IT LC RAGV     + ++  +  ID   ++R+        GG    + + +  L    S QE+    ER+      Y  A  R    
Subjt:  RKTVRKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRLQRTQEFRQGGLVCGIHQILEQLQLSASRQEY---AERQAQTYWTY--AKWRD-AT

Query:  LRRALQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEE
        L R + ++        P F D    P  PPP  +  E+EE
Subjt:  LRRALQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEE

PIN07564.1 hypothetical protein CDL12_19862 [Handroanthus impetiginosus]2.6e-2629.46Show/hide
Query:  NDRARAKYLDMVKIDFLFERGF---SDDLPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVRGVAVDWSPGAINSLFNLQDFPLAGY
        N RAR    D +    + ERGF    +     +   +   +W  F A+PES    +  +FYAN  E + F+ +VRG  V +    IN L+N+    L  +
Subjt:  NDRARAKYLDMVKIDFLFERGF---SDDLPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVRGVAVDWSPGAINSLFNLQDFPLAGY

Query:  NEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFAILRSLGIDVGKVISNEIFNSWRKTV
               +        + +   GAQW+++K E  +F++  L   A  WL FI  ++LPT+H   V+ D+ LL++ I+     DVGK+ISN I  S   + 
Subjt:  NEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFAILRSLGIDVGKVISNEIFNSWRKTV

Query:  RKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRLQRTQEFRQGGLVCGIHQILEQLQLSASRQEY---AERQAQTYWTY--AKWRD-ATLRRA
          L+FP+ IT LC RAGV     + ++  +  ID   ++R+        GG    + + +  L    S QE+    ER+      Y  A  R    L R 
Subjt:  RKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRLQRTQEFRQGGLVCGIHQILEQLQLSASRQEY---AERQAQTYWTY--AKWRD-ATLRRA

Query:  LQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEE
        + ++        P F  D  +P  PPPP   E E+E
Subjt:  LQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEE

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]5.7e-2935.06Show/hide
Query:  INIKIPLRKGHKTQQYMRFINDRARAKYLDMVKIDFL-FERGFSDD-------LPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVR
        + IK   RK HK    ++F  + A  +Y + ++   L  E+GF  D       LP F+   IT H W QFCA PE     +V +FYAN+ +       VR
Subjt:  INIKIPLRKGHKTQQYMRFINDRARAKYLDMVKIDFL-FERGFSDD-------LPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVR

Query:  GVAVDWSPGAINSLFNLQDFPLAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFA
        GV V WS  AIN++F L D P+  ++E +   +   L   ++ V + GA+W +S     T   + L   A  W  F+K  LLPTTH   VS+DR+LL+ +
Subjt:  GVAVDWSPGAINSLFNLQDFPLAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFA

Query:  ILRSLGIDVGKVISNEIFNSWRKTVRKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRLQRTQE
        +L    I+VG++I +EI     +    LFFP+ IT LC  A  P   ++  L +   ID   + R+  TQE
Subjt:  ILRSLGIDVGKVISNEIFNSWRKTVRKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]6.0e-3932.82Show/hide
Query:  INIKIPLRKGHKTQQYMRFINDRARAKYLDMVKIDFL-FERGFSDD-------LPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVR
        + IK   RK HK    ++F  + A  +Y + ++   L  E+GF  D       LP F+   IT H W QFCA PE     +V +FYAN+ + E     VR
Subjt:  INIKIPLRKGHKTQQYMRFINDRARAKYLDMVKIDFL-FERGFSDD-------LPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVR

Query:  GVAVDWSPGAINSLFNLQDFPLAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFA
        GV V WS  AIN++F L D P+  ++E +   +   L   ++ V   GA+W +S     T   + L   A  W  F+K RLLPTTH   VS+DR+LL+ +
Subjt:  GVAVDWSPGAINSLFNLQDFPLAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFA

Query:  ILRSLGIDVGKVISNEIFNSWRKTVRKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRL------QRTQE-----------FRQGGLVCGIHQ
        +L    I+VG++I +EI     +    LFFP+ IT LC  A  P   ++  L +   ID   + R+      + TQ+            R  G    I Q
Subjt:  ILRSLGIDVGKVISNEIFNSWRKTVRKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRL------QRTQE-----------FRQGGLVCGIHQ

Query:  ILEQLQLSASRQE-----------YAERQAQTYWTYAKWRDATLRRALQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEENDQAWSS
         L+ L+   S+QE           +  +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++         ++ E E E+D+  S+
Subjt:  ILEQLQLSASRQE-----------YAERQAQTYWTYAKWRDATLRRALQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEENDQAWSS

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.1e-3233Show/hide
Query:  IVHKFYANIDEEEGFQAMVRGVAVDWSPGAINSLFNLQDFPLAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLR
        +V +FYAN+ + E     VRGV V WS  AIN++F L D P+  ++E +   +  +L   ++ V   GA+W +S     T   + L   A  W  F+K R
Subjt:  IVHKFYANIDEEEGFQAMVRGVAVDWSPGAINSLFNLQDFPLAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLR

Query:  LLPTTHDSMVSRDRVLLVFAILRSLGIDVGKVISNEIFNSWRKTVRKLFFPNTITMLCNRAGVPTTPD--------DVILLDQ-------EIIDTPNLVR
        LLPTTH  +VS+DR+LL+ ++L    I+VG++I +EI     +    LFFP+ IT LC  A      +        D I + +       E    P+  R
Subjt:  LLPTTHDSMVSRDRVLLVFAILRSLGIDVGKVISNEIFNSWRKTVRKLFFPNTITMLCNRAGVPTTPD--------DVILLDQ-------EIIDTPNLVR

Query:  LQRTQEFRQGGLVCGIHQILEQLQLSASRQEYAERQAQTYWTYAKWRDATLRRALQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEENDQAWSS
               R  G    + Q L+ L+   S+QE+  +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++         ++ E E E+D+  S+
Subjt:  LQRTQEFRQGGLVCGIHQILEQLQLSASRQEYAERQAQTYWTYAKWRDATLRRALQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEENDQAWSS

TrEMBL top hitse value%identityAlignment
A0A2G9G807 Uncharacterized protein1.1e-2529.12Show/hide
Query:  RFINDRARAKY-LDMVKIDFLFERGF---SDDLPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVRGVAVDWSPGAINSLFNLQDFP
        RF++  A  +Y   +V    + ERGF    +     +   +   +W  F A PES    +V +FYAN  E +  + +VRG  V +    IN L+N+    
Subjt:  RFINDRARAKY-LDMVKIDFLFERGF---SDDLPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVRGVAVDWSPGAINSLFNLQDFP

Query:  LAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFAILRSLGIDVGKVISNEIFNSW
        L  +       +        + +   GAQW+++K E  +F++  L   A  WL FI  R+LPT H   V+ DR LL++ I+     DVGK+IS+ I  S 
Subjt:  LAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFAILRSLGIDVGKVISNEIFNSW

Query:  RKTVRKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRLQRTQEFRQGGLVCGIHQILEQLQLSASRQEY---AERQAQTYWTY--AKWRD-AT
          +   L+FP+ IT LC RAGV     + ++  +  ID   ++R+        GG    + + +  L    S QE+    ER+      Y  A  R    
Subjt:  RKTVRKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRLQRTQEFRQGGLVCGIHQILEQLQLSASRQEY---AERQAQTYWTY--AKWRD-AT

Query:  LRRALQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEE
        L R + ++        P F D    P  PPP  +  E+EE
Subjt:  LRRALQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEE

A0A2G9GQI5 Uncharacterized protein1.3e-2629.46Show/hide
Query:  NDRARAKYLDMVKIDFLFERGF---SDDLPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVRGVAVDWSPGAINSLFNLQDFPLAGY
        N RAR    D +    + ERGF    +     +   +   +W  F A+PES    +  +FYAN  E + F+ +VRG  V +    IN L+N+    L  +
Subjt:  NDRARAKYLDMVKIDFLFERGF---SDDLPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVRGVAVDWSPGAINSLFNLQDFPLAGY

Query:  NEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFAILRSLGIDVGKVISNEIFNSWRKTV
               +        + +   GAQW+++K E  +F++  L   A  WL FI  ++LPT+H   V+ D+ LL++ I+     DVGK+ISN I  S   + 
Subjt:  NEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFAILRSLGIDVGKVISNEIFNSWRKTV

Query:  RKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRLQRTQEFRQGGLVCGIHQILEQLQLSASRQEY---AERQAQTYWTY--AKWRD-ATLRRA
          L+FP+ IT LC RAGV     + ++  +  ID   ++R+        GG    + + +  L    S QE+    ER+      Y  A  R    L R 
Subjt:  RKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRLQRTQEFRQGGLVCGIHQILEQLQLSASRQEY---AERQAQTYWTY--AKWRD-ATLRRA

Query:  LQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEE
        + ++        P F  D  +P  PPPP   E E+E
Subjt:  LQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEE

A0A2P5AGA5 Uncharacterized protein (Fragment)2.7e-2935.06Show/hide
Query:  INIKIPLRKGHKTQQYMRFINDRARAKYLDMVKIDFL-FERGFSDD-------LPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVR
        + IK   RK HK    ++F  + A  +Y + ++   L  E+GF  D       LP F+   IT H W QFCA PE     +V +FYAN+ +       VR
Subjt:  INIKIPLRKGHKTQQYMRFINDRARAKYLDMVKIDFL-FERGFSDD-------LPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVR

Query:  GVAVDWSPGAINSLFNLQDFPLAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFA
        GV V WS  AIN++F L D P+  ++E +   +   L   ++ V + GA+W +S     T   + L   A  W  F+K  LLPTTH   VS+DR+LL+ +
Subjt:  GVAVDWSPGAINSLFNLQDFPLAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFA

Query:  ILRSLGIDVGKVISNEIFNSWRKTVRKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRLQRTQE
        +L    I+VG++I +EI     +    LFFP+ IT LC  A  P   ++  L +   ID   + R+  TQE
Subjt:  ILRSLGIDVGKVISNEIFNSWRKTVRKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)2.9e-3932.82Show/hide
Query:  INIKIPLRKGHKTQQYMRFINDRARAKYLDMVKIDFL-FERGFSDD-------LPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVR
        + IK   RK HK    ++F  + A  +Y + ++   L  E+GF  D       LP F+   IT H W QFCA PE     +V +FYAN+ + E     VR
Subjt:  INIKIPLRKGHKTQQYMRFINDRARAKYLDMVKIDFL-FERGFSDD-------LPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVR

Query:  GVAVDWSPGAINSLFNLQDFPLAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFA
        GV V WS  AIN++F L D P+  ++E +   +   L   ++ V   GA+W +S     T   + L   A  W  F+K RLLPTTH   VS+DR+LL+ +
Subjt:  GVAVDWSPGAINSLFNLQDFPLAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFA

Query:  ILRSLGIDVGKVISNEIFNSWRKTVRKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRL------QRTQE-----------FRQGGLVCGIHQ
        +L    I+VG++I +EI     +    LFFP+ IT LC  A  P   ++  L +   ID   + R+      + TQ+            R  G    I Q
Subjt:  ILRSLGIDVGKVISNEIFNSWRKTVRKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRL------QRTQE-----------FRQGGLVCGIHQ

Query:  ILEQLQLSASRQE-----------YAERQAQTYWTYAKWRDATLRRALQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEENDQAWSS
         L+ L+   S+QE           +  +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++         ++ E E E+D+  S+
Subjt:  ILEQLQLSASRQE-----------YAERQAQTYWTYAKWRDATLRRALQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEENDQAWSS

A0A2P5DXM3 Uncharacterized protein5.3e-3333Show/hide
Query:  IVHKFYANIDEEEGFQAMVRGVAVDWSPGAINSLFNLQDFPLAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLR
        +V +FYAN+ + E     VRGV V WS  AIN++F L D P+  ++E +   +  +L   ++ V   GA+W +S     T   + L   A  W  F+K R
Subjt:  IVHKFYANIDEEEGFQAMVRGVAVDWSPGAINSLFNLQDFPLAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLR

Query:  LLPTTHDSMVSRDRVLLVFAILRSLGIDVGKVISNEIFNSWRKTVRKLFFPNTITMLCNRAGVPTTPD--------DVILLDQ-------EIIDTPNLVR
        LLPTTH  +VS+DR+LL+ ++L    I+VG++I +EI     +    LFFP+ IT LC  A      +        D I + +       E    P+  R
Subjt:  LLPTTHDSMVSRDRVLLVFAILRSLGIDVGKVISNEIFNSWRKTVRKLFFPNTITMLCNRAGVPTTPD--------DVILLDQ-------EIIDTPNLVR

Query:  LQRTQEFRQGGLVCGIHQILEQLQLSASRQEYAERQAQTYWTYAKWRDATLRRALQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEENDQAWSS
               R  G    + Q L+ L+   S+QE+  +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++         ++ E E E+D+  S+
Subjt:  LQRTQEFRQGGLVCGIHQILEQLQLSASRQEYAERQAQTYWTYAKWRDATLRRALQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEENDQAWSS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGTCGGCCTCGGTCTTGGGCCGAGGCCGAGTATGATGGTCGGCCTGCTTGCACGGGCCGAGCTCGTTCGCCTCCATTCGGTCCCTGCTGCCTCTGGTCGCCTCGGTTC
CACCTGGTTCGTCCCGAAACGCCTCCGAATTCCTAAAAACCCTAGGAGCATGAGCAGCATCGGAGGCGGTGTGGCAAGCACCACACCGGTGTGCATGTTTACTATTTTGC
AGGCCACGTCTTCCCCCTCTCATACAAATTTATCGTTGGTGGCACATGAAGGTCAGGAAATATATTGTACGTCTGAAATGGCTAAAACAAGAGGAAGAAGGGAAAGAGAT
ACTGACGAAGGAGAGGTTCCAGTTACACCCGAGGCACCGAAGACAAGAACGAACAGAAGGAAAACACCAGAAGAAAGAGAAGCCAAGAGGCGGAGAAGGCAACAACGAGC
TGAAGCAACAGAGGTAGTTAGGAAGGTGATCGAGGACATTACTGAAGGAGTGGCTAGAGAAGAGCAGCCAAAACAACCTGAAGAAGCGGCTAGGGGAGAACAACTGACAG
TAACTGAGGAAGGGAAAGATTCAGAACAGGGTGATCAACCTGCCGAAACTCAGCAAGAAGTTCAGGAAAAGCACGCAGAAGATGTGCTGGAACAAGGGAATGATCAAGGA
GCCCAAGAGCAGGAGGATCAAGAGAACGAGGAAACTGAAAAGAAGGCTGAAGAAGAGACCCTGACGAAGAAACAAGAAGACCGGGGCAAAGGTGTTGCTGAAGCAGCAAT
AGAAGCAGAGGAGGCTGAGATTGAAGAGCAAAGAATGTCGTATGTGAAGGGGATAAAAGCCCCTACGCAGCGGAAGCGCATCGATTGGACCTTACGCCGTATATTAATTA
ACATTAAAATACCGTTACGAAAAGGGCATAAAACCCAACAGTATATGCGCTTCATCAACGACCGCGCCAGAGCAAAATATCTGGACATGGTGAAAATTGACTTCTTGTTT
GAAAGGGGATTCAGTGATGACCTACCACATTTCTTGCGTGCTGGGATTACAAACCACAGATGGGATCAGTTCTGCGCGAAACCAGAGTCGATAAACTCGAATATTGTCCA
CAAATTCTATGCGAATATAGATGAAGAGGAAGGTTTTCAAGCCATGGTCCGAGGGGTTGCTGTAGATTGGAGCCCAGGTGCGATAAACTCGTTATTCAATCTCCAGGATT
TTCCACTCGCCGGATACAATGAAATGGTGGTAGCACCATCTAATGATCAACTGAACGCGACTGTGAAAGAAGTTGGAATTGAAGGGGCCCAGTGGAGGTTGTCAAAGACT
GAGAAGCGCACATTTCAGGCAGCCTATTTGAAGAGTGAGGCCAACACCTGGTTGGGCTTCATCAAATTGCGTCTTCTACCGACAACCCACGATTCTATGGTTTCCCGAGA
TCGAGTGCTCCTGGTTTTTGCGATCCTGAGGTCCCTGGGTATTGATGTTGGTAAAGTTATTTCCAATGAAATCTTCAACTCCTGGCGCAAAACGGTGCGTAAATTATTCT
TCCCAAATACAATCACTATGCTGTGTAATAGGGCAGGGGTGCCTACAACTCCAGACGATGTCATTCTGCTTGACCAGGAGATTATCGACACGCCCAACCTAGTGAGACTT
CAACGGACTCAAGAATTCCGACAAGGAGGGTTGGTATGTGGCATCCATCAAATTCTAGAGCAACTTCAACTTTCGGCCAGTAGGCAAGAGTATGCTGAGAGACAAGCTCA
GACCTACTGGACCTATGCTAAGTGGAGGGATGCCACCCTAAGGAGGGCACTGCAATCAAATTTTTCAAAACCATATCAGGCCTTCCCCGTTTTCCCCGATGACTTGTTTA
ATCCCTGGATTCCACCCCCGCCAGTCGAAAGAGAAGAGGAGGAAGAAAATGATCAGGCCTGGTCATCGCTGCGGCAAGAAGATTCTGAGGTAGTGTTGACTTCTTTGATC
CACCTTAAGCTTAATCTTACAGTGCTTGGTTTTGCAGAATGCGCAGGCACTCTAGCGTGCGCTCTCTCAGTTCAGGACGGTGTTGATCTTATGATCAGGCACGTTGTTCG
TGCTCCAGTTGGCACCAGACATGTTTACATGCGTCAGTTGCTCCACCGTATTTTATGTTACATTCAGGGGTGTTGTGTTGAGATAGGTGTAGTTCTGCTAGAGGTGGTAC
TTGAGGACATCAGGCGAGTGGCGAGGGCAATGACTTGGCGAGACTATGAGATGCAGCTGCAAATCATGTTCCTAGAAGTCTAG
mRNA sequenceShow/hide mRNA sequence
ATGGTCGGCCTCGGTCTTGGGCCGAGGCCGAGTATGATGGTCGGCCTGCTTGCACGGGCCGAGCTCGTTCGCCTCCATTCGGTCCCTGCTGCCTCTGGTCGCCTCGGTTC
CACCTGGTTCGTCCCGAAACGCCTCCGAATTCCTAAAAACCCTAGGAGCATGAGCAGCATCGGAGGCGGTGTGGCAAGCACCACACCGGTGTGCATGTTTACTATTTTGC
AGGCCACGTCTTCCCCCTCTCATACAAATTTATCGTTGGTGGCACATGAAGGTCAGGAAATATATTGTACGTCTGAAATGGCTAAAACAAGAGGAAGAAGGGAAAGAGAT
ACTGACGAAGGAGAGGTTCCAGTTACACCCGAGGCACCGAAGACAAGAACGAACAGAAGGAAAACACCAGAAGAAAGAGAAGCCAAGAGGCGGAGAAGGCAACAACGAGC
TGAAGCAACAGAGGTAGTTAGGAAGGTGATCGAGGACATTACTGAAGGAGTGGCTAGAGAAGAGCAGCCAAAACAACCTGAAGAAGCGGCTAGGGGAGAACAACTGACAG
TAACTGAGGAAGGGAAAGATTCAGAACAGGGTGATCAACCTGCCGAAACTCAGCAAGAAGTTCAGGAAAAGCACGCAGAAGATGTGCTGGAACAAGGGAATGATCAAGGA
GCCCAAGAGCAGGAGGATCAAGAGAACGAGGAAACTGAAAAGAAGGCTGAAGAAGAGACCCTGACGAAGAAACAAGAAGACCGGGGCAAAGGTGTTGCTGAAGCAGCAAT
AGAAGCAGAGGAGGCTGAGATTGAAGAGCAAAGAATGTCGTATGTGAAGGGGATAAAAGCCCCTACGCAGCGGAAGCGCATCGATTGGACCTTACGCCGTATATTAATTA
ACATTAAAATACCGTTACGAAAAGGGCATAAAACCCAACAGTATATGCGCTTCATCAACGACCGCGCCAGAGCAAAATATCTGGACATGGTGAAAATTGACTTCTTGTTT
GAAAGGGGATTCAGTGATGACCTACCACATTTCTTGCGTGCTGGGATTACAAACCACAGATGGGATCAGTTCTGCGCGAAACCAGAGTCGATAAACTCGAATATTGTCCA
CAAATTCTATGCGAATATAGATGAAGAGGAAGGTTTTCAAGCCATGGTCCGAGGGGTTGCTGTAGATTGGAGCCCAGGTGCGATAAACTCGTTATTCAATCTCCAGGATT
TTCCACTCGCCGGATACAATGAAATGGTGGTAGCACCATCTAATGATCAACTGAACGCGACTGTGAAAGAAGTTGGAATTGAAGGGGCCCAGTGGAGGTTGTCAAAGACT
GAGAAGCGCACATTTCAGGCAGCCTATTTGAAGAGTGAGGCCAACACCTGGTTGGGCTTCATCAAATTGCGTCTTCTACCGACAACCCACGATTCTATGGTTTCCCGAGA
TCGAGTGCTCCTGGTTTTTGCGATCCTGAGGTCCCTGGGTATTGATGTTGGTAAAGTTATTTCCAATGAAATCTTCAACTCCTGGCGCAAAACGGTGCGTAAATTATTCT
TCCCAAATACAATCACTATGCTGTGTAATAGGGCAGGGGTGCCTACAACTCCAGACGATGTCATTCTGCTTGACCAGGAGATTATCGACACGCCCAACCTAGTGAGACTT
CAACGGACTCAAGAATTCCGACAAGGAGGGTTGGTATGTGGCATCCATCAAATTCTAGAGCAACTTCAACTTTCGGCCAGTAGGCAAGAGTATGCTGAGAGACAAGCTCA
GACCTACTGGACCTATGCTAAGTGGAGGGATGCCACCCTAAGGAGGGCACTGCAATCAAATTTTTCAAAACCATATCAGGCCTTCCCCGTTTTCCCCGATGACTTGTTTA
ATCCCTGGATTCCACCCCCGCCAGTCGAAAGAGAAGAGGAGGAAGAAAATGATCAGGCCTGGTCATCGCTGCGGCAAGAAGATTCTGAGGTAGTGTTGACTTCTTTGATC
CACCTTAAGCTTAATCTTACAGTGCTTGGTTTTGCAGAATGCGCAGGCACTCTAGCGTGCGCTCTCTCAGTTCAGGACGGTGTTGATCTTATGATCAGGCACGTTGTTCG
TGCTCCAGTTGGCACCAGACATGTTTACATGCGTCAGTTGCTCCACCGTATTTTATGTTACATTCAGGGGTGTTGTGTTGAGATAGGTGTAGTTCTGCTAGAGGTGGTAC
TTGAGGACATCAGGCGAGTGGCGAGGGCAATGACTTGGCGAGACTATGAGATGCAGCTGCAAATCATGTTCCTAGAAGTCTAG
Protein sequenceShow/hide protein sequence
MVGLGLGPRPSMMVGLLARAELVRLHSVPAASGRLGSTWFVPKRLRIPKNPRSMSSIGGGVASTTPVCMFTILQATSSPSHTNLSLVAHEGQEIYCTSEMAKTRGRRERD
TDEGEVPVTPEAPKTRTNRRKTPEEREAKRRRRQQRAEATEVVRKVIEDITEGVAREEQPKQPEEAARGEQLTVTEEGKDSEQGDQPAETQQEVQEKHAEDVLEQGNDQG
AQEQEDQENEETEKKAEEETLTKKQEDRGKGVAEAAIEAEEAEIEEQRMSYVKGIKAPTQRKRIDWTLRRILINIKIPLRKGHKTQQYMRFINDRARAKYLDMVKIDFLF
ERGFSDDLPHFLRAGITNHRWDQFCAKPESINSNIVHKFYANIDEEEGFQAMVRGVAVDWSPGAINSLFNLQDFPLAGYNEMVVAPSNDQLNATVKEVGIEGAQWRLSKT
EKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSMVSRDRVLLVFAILRSLGIDVGKVISNEIFNSWRKTVRKLFFPNTITMLCNRAGVPTTPDDVILLDQEIIDTPNLVRL
QRTQEFRQGGLVCGIHQILEQLQLSASRQEYAERQAQTYWTYAKWRDATLRRALQSNFSKPYQAFPVFPDDLFNPWIPPPPVEREEEEENDQAWSSLRQEDSEVVLTSLI
HLKLNLTVLGFAECAGTLACALSVQDGVDLMIRHVVRAPVGTRHVYMRQLLHRILCYIQGCCVEIGVVLLEVVLEDIRRVARAMTWRDYEMQLQIMFLEV