; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0000806 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0000806
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionUnknown protein
Genome locationchr4:17051890..17058530
RNA-Seq ExpressionLag0000806
SyntenyLag0000806
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus]1.5e-3029.78Show/hide
Query:  VPYDRFVNNFARAKYTEILKRDFLFERGF------SGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALY
        + + +F N+ A+A++     R+  FE GF       G   P +   ++   W  F   P +VNA +V EFYANI K     + VRG +I ++  AIN  +
Subjt:  VPYDRFVNNFARAKYTEILKRDFLFERGF------SGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALY

Query:  HLQNF--PHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKII
        HLQ     HA F E A      +    + ++  E  +W   +T + +     L   A  W  F++ +L+PT+H++TVS  R+LL  +++ S  IDVG+II
Subjt:  HLQNF--PHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKII

Query:  CNEISCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQRTQEIRQGGLIY-------GINTILEQLAL-SATRQEFAERQTL--
          ++  C  KK   L FPN IT LCR+  V     D IL     I    L  L   +  +    ++         N  +  LAL  A  Q  A+   L  
Subjt:  CNEISCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQRTQEIRQGGLIY-------GINTILEQLAL-SATRQEFAERQTL--

Query:  ---TFWNYVKNRDASLRRALQENFSKPYPALPTFPEDLLNPWIPPPPAEKEDEEED
            F+ YVK+RD  +    QE         P FP+++L  +      E E +  D
Subjt:  ---TFWNYVKNRDASLRRALQENFSKPYPALPTFPEDLLNPWIPPPPAEKEDEEED

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]8.2e-3237.01Show/hide
Query:  RFVNNFARAKY-TEILKRDFLFERGF-------SGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL
        +F    A  +Y   I  R    E+GF        G L PF+   I  H W+ FCA PE     +V EFYAN+       V VRGV++ WS  AINA++ L
Subjt:  RFVNNFARAKY-TEILKRDFLFERGF-------SGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL

Query:  QNFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEI
         + P    +E     +E  L   +  V +  A+W +S     T   + L   A  W  F++  LLPTTH  TVS++R+LL  ++L   SI+VG++I +EI
Subjt:  QNFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEI

Query:  SCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  SCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.3e-4233.43Show/hide
Query:  RFVNNFARAKY-TEILKRDFLFERGF-------SGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL
        +F    A  +Y   I  R    E+GF        G L PF+   I  H W+ FCA PE     +V EFYAN+   E   V VRGV++ WS  AINA++ L
Subjt:  RFVNNFARAKY-TEILKRDFLFERGF-------SGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL

Query:  QNFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEI
         + P    +E     +++ L   +  V    A+W +S     T   + L   A  W  F++ RLLPTTH  TVS++R+LL  ++L   SI+VG++I +EI
Subjt:  QNFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEI

Query:  SCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQR---TQEIRQ---------------GGLIYGINTILEQLALSATRQ----
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q    
Subjt:  SCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQR---TQEIRQ---------------GGLIYGINTILEQLALSATRQ----

Query:  ---EFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPTFPEDLL
           +   +Q   FW Y K RD +L++ALQ NF++P P  P FP+++L
Subjt:  ---EFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPTFPEDLL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]4.5e-3033.19Show/hide
Query:  RFVNNFARAKYTE-------ILKRDFLFERGFSGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQ
        +F +  A  +Y E        ++++F+++     + PPF+   I+ H W+LFCA PE     +V EFY N+   +   V +RGV++  S  AIN ++ L 
Subjt:  RFVNNFARAKYTE-------ILKRDFLFERGFSGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQ

Query:  NFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEIS
        + P    +E     ++ +L   +  V I  A+W +S     T   + LN  A  W  F++ RLLPTTH  TVS+E + L +++L   SI+VG++I  EI 
Subjt:  NFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEIS

Query:  CCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKG
         C  +K G LFFP+ IT +CR    P   ++  L + G
Subjt:  CCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKG

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]1.8e-3435.02Show/hide
Query:  VVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQNFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQR
        +V EFYAN+   E   + VRGV++ WS  AINA++ L + P    +E     +E +L   +  V    A+W +S     T   + L   A  W  F++ R
Subjt:  VVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQNFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQR

Query:  LLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEISCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQR---TQEIRQ---
        LLPTTH   VS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LCR A   V E    L + G ID   +AR+ +   T+  +Q   
Subjt:  LLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEISCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQR---TQEIRQ---

Query:  ------------GGLIYGINTILEQLALSATRQEFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPTFPEDLL
                    G ++  +  + ++L    ++QE   +Q   FW Y K RD +L++ALQ NF++P P  P FP+++L
Subjt:  ------------GGLIYGINTILEQLALSATRQEFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPTFPEDLL

TrEMBL top hitse value%identityAlignment
A0A2P5AGA5 Uncharacterized protein (Fragment)4.0e-3237.01Show/hide
Query:  RFVNNFARAKY-TEILKRDFLFERGF-------SGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL
        +F    A  +Y   I  R    E+GF        G L PF+   I  H W+ FCA PE     +V EFYAN+       V VRGV++ WS  AINA++ L
Subjt:  RFVNNFARAKY-TEILKRDFLFERGF-------SGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL

Query:  QNFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEI
         + P    +E     +E  L   +  V +  A+W +S     T   + L   A  W  F++  LLPTTH  TVS++R+LL  ++L   SI+VG++I +EI
Subjt:  QNFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEI

Query:  SCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+  TQE
Subjt:  SCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)6.5e-4333.43Show/hide
Query:  RFVNNFARAKY-TEILKRDFLFERGF-------SGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL
        +F    A  +Y   I  R    E+GF        G L PF+   I  H W+ FCA PE     +V EFYAN+   E   V VRGV++ WS  AINA++ L
Subjt:  RFVNNFARAKY-TEILKRDFLFERGF-------SGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHL

Query:  QNFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEI
         + P    +E     +++ L   +  V    A+W +S     T   + L   A  W  F++ RLLPTTH  TVS++R+LL  ++L   SI+VG++I +EI
Subjt:  QNFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEI

Query:  SCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQR---TQEIRQ---------------GGLIYGINTILEQLALSATRQ----
          C  +K G LFFP+ IT LCR A  P   ++  L + G ID   +AR+ +   T+  +Q               G ++  +  + ++L+    +Q    
Subjt:  SCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQR---TQEIRQ---------------GGLIYGINTILEQLALSATRQ----

Query:  ---EFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPTFPEDLL
           +   +Q   FW Y K RD +L++ALQ NF++P P  P FP+++L
Subjt:  ---EFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPTFPEDLL

A0A2P5DAQ2 Uncharacterized protein2.2e-3033.19Show/hide
Query:  RFVNNFARAKYTE-------ILKRDFLFERGFSGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQ
        +F +  A  +Y E        ++++F+++     + PPF+   I+ H W+LFCA PE     +V EFY N+   +   V +RGV++  S  AIN ++ L 
Subjt:  RFVNNFARAKYTE-------ILKRDFLFERGFSGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQ

Query:  NFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEIS
        + P    +E     ++ +L   +  V I  A+W +S     T   + LN  A  W  F++ RLLPTTH  TVS+E + L +++L   SI+VG++I  EI 
Subjt:  NFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEIS

Query:  CCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKG
         C  +K G LFFP+ IT +CR    P   ++  L + G
Subjt:  CCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKG

A0A2P5DXM3 Uncharacterized protein8.5e-3535.02Show/hide
Query:  VVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQNFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQR
        +V EFYAN+   E   + VRGV++ WS  AINA++ L + P    +E     +E +L   +  V    A+W +S     T   + L   A  W  F++ R
Subjt:  VVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQNFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQR

Query:  LLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEISCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQR---TQEIRQ---
        LLPTTH   VS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LCR A   V E    L + G ID   +AR+ +   T+  +Q   
Subjt:  LLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEISCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQR---TQEIRQ---

Query:  ------------GGLIYGINTILEQLALSATRQEFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPTFPEDLL
                    G ++  +  + ++L    ++QE   +Q   FW Y K RD +L++ALQ NF++P P  P FP+++L
Subjt:  ------------GGLIYGINTILEQLALSATRQEFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPTFPEDLL

A0A6A3BU96 Uncharacterized protein7.5e-3129.78Show/hide
Query:  VPYDRFVNNFARAKYTEILKRDFLFERGF------SGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALY
        + + +F N+ A+A++     R+  FE GF       G   P +   ++   W  F   P +VNA +V EFYANI K     + VRG +I ++  AIN  +
Subjt:  VPYDRFVNNFARAKYTEILKRDFLFERGF------SGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALY

Query:  HLQNF--PHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKII
        HLQ     HA F E A      +    + ++  E  +W   +T + +     L   A  W  F++ +L+PT+H++TVS  R+LL  +++ S  IDVG+II
Subjt:  HLQNF--PHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQLSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKII

Query:  CNEISCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQRTQEIRQGGLIY-------GINTILEQLAL-SATRQEFAERQTL--
          ++  C  KK   L FPN IT LCR+  V     D IL     I    L  L   +  +    ++         N  +  LAL  A  Q  A+   L  
Subjt:  CNEISCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPNLARLQRTQEIRQGGLIY-------GINTILEQLAL-SATRQEFAERQTL--

Query:  ---TFWNYVKNRDASLRRALQENFSKPYPALPTFPEDLLNPWIPPPPAEKEDEEED
            F+ YVK+RD  +    QE         P FP+++L  +      E E +  D
Subjt:  ---TFWNYVKNRDASLRRALQENFSKPYPALPTFPEDLLNPWIPPPPAEKEDEEED

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGCCCTACCCTCTTTATGGCACGAGAGGGATTTCTGTTTGCTGGTTGGACCACAAACAAGTTGTTCATTAGAGGAGCACTGATACTTAAGGACCAAGAGGTAGCCCA
GGGAAATATATCTGCAGTGAGAAGAGTGCAGCTGTGGTTCTTTAGTGGAGTGAACCACAGTCCATCAGGTCCCACCGGTAACTCTATAAGGGCGTTGAGCGAAAACAAGA
TCTCTTGCTGCTGTATTTTTGGTTTACTAAAGGTTTGGCGAAGCGGTTCAAGAATTTCTTCAAGGGATGACCTAGGCAACTTGGCCTTCATCCTGAGTGGATTATGGACT
CTGTCCATGAGGGATTGTCCTTTGATTTGTACGGGTGAGAGTGGTCTGTTCGCCAACTCAATAAGCCTACCATTTTGGGGACAAGACCGAATGGGGAGCTGGGAACGTAG
TCTTACAAGATGGAATTCACTTCTTCCCAGTATTAGGGTAAGTAGAGGTCCATCAGGTCCCACCGGTAGCTCTTTAAGGGCATTGAGGTTAAAGAAGGAGTCAGGAGGCA
GTTTTGCGAGTCTCCCTCATTCACGCCTCCCTCACTCTCGCCCTCCCGTTCTCTCTCGTTTTTCTCTCCTCCATCTGAGATACAGAGAAGAACTCCAAGAGAAAAAATCC
TTAGACTATTCAGCGAAAACAAAGATCTCTTGCTGCTGTATTTTTGGTTTACCAAAGGGTTTGGCGAAGCGGTTCAAGAATTTCTTCAAGGGTGAGAGTGGCCTGTTCGC
CGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAATAGGGAGCTGGGAACGTAGTCTTACAAGATGGAATTCACTCCTTCCCAGTATTAGGGTAAGTAGAGGTTCAG
TAGATTGTTGCGGCAAAGATATTGCTGGAGCAAAATATTCCGAGTTAGAAGGGTTTGCATTTTTAATTGCAACGGTAATATTAGTTATGGCAGGGCAAATCTTTTCGGTT
AGGATTCCTTGTTTGCTGCAGCTCTATTTATTCGCAATCTCTGAGAGGAGAGTGAAGAGGAGGAGGTGCCGGTCACACCGGAAGTACAAAAAGGGAAAAACTAAGAAAAA
GAGAACGCCAGAAGAGAAGGAGCTAAGCGAAGAAGAAGGCAACAAAGAGTTGCAGAACAAGAAAGAGCGAGAGAGGAGGAGGTTGTTGCAGAGGAGACGAAGACCCACAA
GAATCTGCAAACCGAATCAAGAAGAGGATGGACGGGAAGTGGCGACTATAGAAAAGGCTCGAGAGGGAATTCAGACGAAACAGAGTGAGGTTATGCAGACGGAGGCTGTA
GAGGAAGAAGACAGAGAGTCAGTTCAGGAGGATCGTGTAGAGGTTATCATACCTGAACCGCCGAAGCGACGTCGCATTAAGCGGAAGGCTGGGCGCATTCCAAGAGAAGG
CCGAGCAAGAGGAACGAGAGAAAAAAGAGCTGAGGAAAGAAGGCGAGAAGAAGAGGAGAAAACGACAGAGGAGGAACAGTTGCTGAAGCGGAGAGAAGACAAGGGCAAAA
AGATTGCTGAAGCATCAGAGGAGCACGATGAAATAGCAGAGCGACAGGTGCCATACGATCGCTTCGTCAATAATTTCGCTAGAGCAAAATACACTGAGATCTTGAAAAGA
GATTTTCTGTTCGAAAGAGGTTTCAGCGGTGATCTTCCGCCATTTCTAAGGACTGGCATAGTTGACCACGGCTGGGAGTTGTTTTGTGCTAAGCCAGAGGCTGTGAACGC
ACAGGTGGTGTGTGAATTCTATGCCAACATTGATAAAGAAGAAGGTTTCCTGGTAATTGTTAGAGGAGTCGAGATAGATTGGAGTCCAAGTGCGATCAACGCACTGTACC
ATCTTCAAAACTTCCCCCACGCGGCATTCAATGAAATGGCAGTCGCGCCATCTGAGGAGCAGTTAAGTAATGCTGTCCGGGAGGTAGGAATTGAGGAGGCGCAGTGGCAA
CTATCAAAGACTCAGAAAAGGACATTCCAATCAGCTTACTTGAACAAGGAAGCAAACACGTGGATGGGTTTCATTAGGCAGAGGTTGCTTCCGACAACGCACGACTCGAC
AGTATCCAGGGAACGAATTCTTTTGGCTTTTGCTATCTTAAGGTCTCTCAGTATTGACGTAGGAAAAATTATTTGTAATGAAATTTCCTGCTGCTGGAAGAAAAAGGTGG
GGAAGTTGTTCTTCCCGAACACAATTACTATGCTATGCAGACAAGCAGGGGTTCCAGTGGAGGAGAGTGATGCAATTCTATTTGATAAAGGGATCATTGATACGCCCAAT
TTGGCGCGGCTCCAGCGCACGCAGGAGATACGTCAAGGTGGGCTTATCTACGGCATCAATACGATTCTGGAACAACTGGCACTTTCAGCCACTAGGCAGGAGTTTGCTGA
AAGGCAAACTTTAACCTTCTGGAACTATGTTAAGAATCGAGATGCCAGCTTAAGAAGGGCACTGCAAGAGAATTTTTCCAAGCCGTATCCAGCCCTTCCTACATTCCCTG
AAGATCTGTTGAACCCTTGGATTCCACCGCCGCCTGCTGAGAAAGAAGATGAAGAGGAAGATCTCGGTCAGGAAGATTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGCCCTACCCTCTTTATGGCACGAGAGGGATTTCTGTTTGCTGGTTGGACCACAAACAAGTTGTTCATTAGAGGAGCACTGATACTTAAGGACCAAGAGGTAGCCCA
GGGAAATATATCTGCAGTGAGAAGAGTGCAGCTGTGGTTCTTTAGTGGAGTGAACCACAGTCCATCAGGTCCCACCGGTAACTCTATAAGGGCGTTGAGCGAAAACAAGA
TCTCTTGCTGCTGTATTTTTGGTTTACTAAAGGTTTGGCGAAGCGGTTCAAGAATTTCTTCAAGGGATGACCTAGGCAACTTGGCCTTCATCCTGAGTGGATTATGGACT
CTGTCCATGAGGGATTGTCCTTTGATTTGTACGGGTGAGAGTGGTCTGTTCGCCAACTCAATAAGCCTACCATTTTGGGGACAAGACCGAATGGGGAGCTGGGAACGTAG
TCTTACAAGATGGAATTCACTTCTTCCCAGTATTAGGGTAAGTAGAGGTCCATCAGGTCCCACCGGTAGCTCTTTAAGGGCATTGAGGTTAAAGAAGGAGTCAGGAGGCA
GTTTTGCGAGTCTCCCTCATTCACGCCTCCCTCACTCTCGCCCTCCCGTTCTCTCTCGTTTTTCTCTCCTCCATCTGAGATACAGAGAAGAACTCCAAGAGAAAAAATCC
TTAGACTATTCAGCGAAAACAAAGATCTCTTGCTGCTGTATTTTTGGTTTACCAAAGGGTTTGGCGAAGCGGTTCAAGAATTTCTTCAAGGGTGAGAGTGGCCTGTTCGC
CGACTCAATAAGCCTACCATTTTGGGGACAAGACCGAATAGGGAGCTGGGAACGTAGTCTTACAAGATGGAATTCACTCCTTCCCAGTATTAGGGTAAGTAGAGGTTCAG
TAGATTGTTGCGGCAAAGATATTGCTGGAGCAAAATATTCCGAGTTAGAAGGGTTTGCATTTTTAATTGCAACGGTAATATTAGTTATGGCAGGGCAAATCTTTTCGGTT
AGGATTCCTTGTTTGCTGCAGCTCTATTTATTCGCAATCTCTGAGAGGAGAGTGAAGAGGAGGAGGTGCCGGTCACACCGGAAGTACAAAAAGGGAAAAACTAAGAAAAA
GAGAACGCCAGAAGAGAAGGAGCTAAGCGAAGAAGAAGGCAACAAAGAGTTGCAGAACAAGAAAGAGCGAGAGAGGAGGAGGTTGTTGCAGAGGAGACGAAGACCCACAA
GAATCTGCAAACCGAATCAAGAAGAGGATGGACGGGAAGTGGCGACTATAGAAAAGGCTCGAGAGGGAATTCAGACGAAACAGAGTGAGGTTATGCAGACGGAGGCTGTA
GAGGAAGAAGACAGAGAGTCAGTTCAGGAGGATCGTGTAGAGGTTATCATACCTGAACCGCCGAAGCGACGTCGCATTAAGCGGAAGGCTGGGCGCATTCCAAGAGAAGG
CCGAGCAAGAGGAACGAGAGAAAAAAGAGCTGAGGAAAGAAGGCGAGAAGAAGAGGAGAAAACGACAGAGGAGGAACAGTTGCTGAAGCGGAGAGAAGACAAGGGCAAAA
AGATTGCTGAAGCATCAGAGGAGCACGATGAAATAGCAGAGCGACAGGTGCCATACGATCGCTTCGTCAATAATTTCGCTAGAGCAAAATACACTGAGATCTTGAAAAGA
GATTTTCTGTTCGAAAGAGGTTTCAGCGGTGATCTTCCGCCATTTCTAAGGACTGGCATAGTTGACCACGGCTGGGAGTTGTTTTGTGCTAAGCCAGAGGCTGTGAACGC
ACAGGTGGTGTGTGAATTCTATGCCAACATTGATAAAGAAGAAGGTTTCCTGGTAATTGTTAGAGGAGTCGAGATAGATTGGAGTCCAAGTGCGATCAACGCACTGTACC
ATCTTCAAAACTTCCCCCACGCGGCATTCAATGAAATGGCAGTCGCGCCATCTGAGGAGCAGTTAAGTAATGCTGTCCGGGAGGTAGGAATTGAGGAGGCGCAGTGGCAA
CTATCAAAGACTCAGAAAAGGACATTCCAATCAGCTTACTTGAACAAGGAAGCAAACACGTGGATGGGTTTCATTAGGCAGAGGTTGCTTCCGACAACGCACGACTCGAC
AGTATCCAGGGAACGAATTCTTTTGGCTTTTGCTATCTTAAGGTCTCTCAGTATTGACGTAGGAAAAATTATTTGTAATGAAATTTCCTGCTGCTGGAAGAAAAAGGTGG
GGAAGTTGTTCTTCCCGAACACAATTACTATGCTATGCAGACAAGCAGGGGTTCCAGTGGAGGAGAGTGATGCAATTCTATTTGATAAAGGGATCATTGATACGCCCAAT
TTGGCGCGGCTCCAGCGCACGCAGGAGATACGTCAAGGTGGGCTTATCTACGGCATCAATACGATTCTGGAACAACTGGCACTTTCAGCCACTAGGCAGGAGTTTGCTGA
AAGGCAAACTTTAACCTTCTGGAACTATGTTAAGAATCGAGATGCCAGCTTAAGAAGGGCACTGCAAGAGAATTTTTCCAAGCCGTATCCAGCCCTTCCTACATTCCCTG
AAGATCTGTTGAACCCTTGGATTCCACCGCCGCCTGCTGAGAAAGAAGATGAAGAGGAAGATCTCGGTCAGGAAGATTAA
Protein sequenceShow/hide protein sequence
MGPTLFMAREGFLFAGWTTNKLFIRGALILKDQEVAQGNISAVRRVQLWFFSGVNHSPSGPTGNSIRALSENKISCCCIFGLLKVWRSGSRISSRDDLGNLAFILSGLWT
LSMRDCPLICTGESGLFANSISLPFWGQDRMGSWERSLTRWNSLLPSIRVSRGPSGPTGSSLRALRLKKESGGSFASLPHSRLPHSRPPVLSRFSLLHLRYREELQEKKS
LDYSAKTKISCCCIFGLPKGLAKRFKNFFKGESGLFADSISLPFWGQDRIGSWERSLTRWNSLLPSIRVSRGSVDCCGKDIAGAKYSELEGFAFLIATVILVMAGQIFSV
RIPCLLQLYLFAISERRVKRRRCRSHRKYKKGKTKKKRTPEEKELSEEEGNKELQNKKERERRRLLQRRRRPTRICKPNQEEDGREVATIEKAREGIQTKQSEVMQTEAV
EEEDRESVQEDRVEVIIPEPPKRRRIKRKAGRIPREGRARGTREKRAEERRREEEEKTTEEEQLLKRREDKGKKIAEASEEHDEIAERQVPYDRFVNNFARAKYTEILKR
DFLFERGFSGDLPPFLRTGIVDHGWELFCAKPEAVNAQVVCEFYANIDKEEGFLVIVRGVEIDWSPSAINALYHLQNFPHAAFNEMAVAPSEEQLSNAVREVGIEEAQWQ
LSKTQKRTFQSAYLNKEANTWMGFIRQRLLPTTHDSTVSRERILLAFAILRSLSIDVGKIICNEISCCWKKKVGKLFFPNTITMLCRQAGVPVEESDAILFDKGIIDTPN
LARLQRTQEIRQGGLIYGINTILEQLALSATRQEFAERQTLTFWNYVKNRDASLRRALQENFSKPYPALPTFPEDLLNPWIPPPPAEKEDEEEDLGQED