CuGenDBv2

Spg030881 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg030881
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold11:30663537..30672166
RNA-Seq Expression: Spg030881
Synteny: Spg030881
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (e value, % identity, alignment)
KAE8695166.1 hypothetical protein F3Y22_tig00110733pilonHSYRG00282 [Hibiscus syriacus] (e value: 2.9e-27; identity: 30.14%)
Query:  IANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQ--DFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKTEKR
        +  H W +F   P PVN+ IV+EFY+N+    +   +VRG+++ ++P AIN  F LQ  D  +  F + V     +     + ++ + G +W   + +++
Subjt:  IANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQ--DFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKTEKR

Query:  TFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVIL-----LD
        T     L      W  F+K +L+PT+H++TVSC R+LL+ +IL   +ID+GKII      C +++   L FPN IT LC +  V     D IL     L+
Subjt:  TFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVIL-----LD

Query:  KGIIDTPNLVWLQRTQEARQGGLVCGIHQILEQLALSASRQEFAERQAQ----------TYWAYAKRRDDTLRRALQSNFSK
        K  I  P L+  +  +  +       +       A S   ++  +R  Q           Y+AYAKRRD  L  AL  +  +
Subjt:  KGIIDTPNLVWLQRTQEARQGGLVCGIHQILEQLALSASRQEFAERQAQ----------TYWAYAKRRDDTLRRALQSNFSK

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e value: 4.3e-31; identity: 40.39%)
Query:  FLRVGIANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKT
        F+   I  H W QFCA P+     +VREFYAN+ +  E    VRGV V WS  AIN++F L D P    +E +   +   L   +  V + GA+W +S  
Subjt:  FLRVGIANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKT

Query:  EKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDKG
           T   + L   A  W  F+K  LLPTTH  TVS DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A  P +  +  L + G
Subjt:  EKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDKG

Query:  IID
         ID
Subjt:  IID

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e value: 2.8e-38; identity: 36.15%)
Query:  FLRVGIANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKT
        F+   I  H W QFCA P+     +VREFYAN+ + EE    VRGV V WS  AIN++F L D P    +E +   +   L   +  V   GA+W +S  
Subjt:  FLRVGIANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKT

Query:  EKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDKG
           T   + L   A  W  F+K RLLPTTH  TVS DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A  P +  +  L + G
Subjt:  EKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDKG

Query:  IIDTPNLVWL------QRTQEA-----------RQGGLVCGIHQILEQLALSASRQEF--------AERQAQTYWAYAKRRDDTLRRALQSNFSKP
         ID   +  +      + TQ+            R  G +    + LEQ       Q++          +Q Q +WAY+K RD  L++ALQ+NF++P
Subjt:  IIDTPNLVWL------QRTQEA-----------RQGGLVCGIHQILEQLALSASRQEF--------AERQAQTYWAYAKRRDDTLRRALQSNFSKP

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] (e value: 7.6e-28; identity: 37.62%)
Query:  PHFLRVGIANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLS
        P F+   I  H W  FCA P+     +VREFY N+ N ++    +RGV V  S  AIN++F+L D P    +E V   +  +L   +  V I GA+W +S
Subjt:  PHFLRVGIANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLS

Query:  KTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLD
             T   + L   A  W  F+K RLLPTTH  TVS + V L++++L   SI+VG++I  EI  C  +K G LFFP+ IT +C     P +  +  L +
Subjt:  KTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLD

Query:  KG
         G
Subjt:  KG

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] (e value: 4.6e-33; identity: 37.64%)
Query:  IVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQD--FPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIK
        +VREFYAN+ + EE    VRGV V WS  AIN++F L D    H+ F E +  P   +L   +  V   GA+W +S     T   + L   A  W  F+K
Subjt:  IVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQD--FPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIK

Query:  LRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDKGIIDTPNLVWL------------
         RLLPTTH   VS DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A  P +  +  L + G ID   +  +            
Subjt:  LRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDKGIIDTPNLVWL------------

Query:  --QRTQEARQGGLVCGIHQILEQLALSASRQEFAERQAQTYWAYAKRRDDTLRRALQSNFSKP
           R   A        + Q L+ L    S+QE   +Q Q +WAY+K RD  L++ALQ+NF++P
Subjt:  --QRTQEARQGGLVCGIHQILEQLALSASRQEFAERQAQTYWAYAKRRDDTLRRALQSNFSKP

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) (e value: 2.1e-31; identity: 40.39%)
Query:  FLRVGIANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKT
        F+   I  H W QFCA P+     +VREFYAN+ +  E    VRGV V WS  AIN++F L D P    +E +   +   L   +  V + GA+W +S  
Subjt:  FLRVGIANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKT

Query:  EKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDKG
           T   + L   A  W  F+K  LLPTTH  TVS DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A  P +  +  L + G
Subjt:  EKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDKG

Query:  IID
         ID
Subjt:  IID

A0A2P5BCG4 Uncharacterized protein (Fragment) (e value: 1.3e-38; identity: 36.15%)
Query:  FLRVGIANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKT
        F+   I  H W QFCA P+     +VREFYAN+ + EE    VRGV V WS  AIN++F L D P    +E +   +   L   +  V   GA+W +S  
Subjt:  FLRVGIANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKT

Query:  EKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDKG
           T   + L   A  W  F+K RLLPTTH  TVS DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A  P +  +  L + G
Subjt:  EKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDKG

Query:  IIDTPNLVWL------QRTQEA-----------RQGGLVCGIHQILEQLALSASRQEF--------AERQAQTYWAYAKRRDDTLRRALQSNFSKP
         ID   +  +      + TQ+            R  G +    + LEQ       Q++          +Q Q +WAY+K RD  L++ALQ+NF++P
Subjt:  IIDTPNLVWL------QRTQEA-----------RQGGLVCGIHQILEQLALSASRQEF--------AERQAQTYWAYAKRRDDTLRRALQSNFSKP

A0A2P5DAQ2 Uncharacterized protein (e value: 3.7e-28; identity: 37.62%)
Query:  PHFLRVGIANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLS
        P F+   I  H W  FCA P+     +VREFY N+ N ++    +RGV V  S  AIN++F+L D P    +E V   +  +L   +  V I GA+W +S
Subjt:  PHFLRVGIANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLS

Query:  KTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLD
             T   + L   A  W  F+K RLLPTTH  TVS + V L++++L   SI+VG++I  EI  C  +K G LFFP+ IT +C     P +  +  L +
Subjt:  KTEKRTFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLD

Query:  KG
         G
Subjt:  KG

A0A2P5DXM3 Uncharacterized protein (e value: 2.2e-33; identity: 37.64%)
Query:  IVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQD--FPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIK
        +VREFYAN+ + EE    VRGV V WS  AIN++F L D    H+ F E +  P   +L   +  V   GA+W +S     T   + L   A  W  F+K
Subjt:  IVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQD--FPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIK

Query:  LRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDKGIIDTPNLVWL------------
         RLLPTTH   VS DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A  P +  +  L + G ID   +  +            
Subjt:  LRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDKGIIDTPNLVWL------------

Query:  --QRTQEARQGGLVCGIHQILEQLALSASRQEFAERQAQTYWAYAKRRDDTLRRALQSNFSKP
           R   A        + Q L+ L    S+QE   +Q Q +WAY+K RD  L++ALQ+NF++P
Subjt:  --QRTQEARQGGLVCGIHQILEQLALSASRQEFAERQAQTYWAYAKRRDDTLRRALQSNFSKP

A0A6A2ZUE4 Uncharacterized protein (e value: 1.4e-27; identity: 30.14%)
Query:  IANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQ--DFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKTEKR
        +  H W +F   P PVN+ IV+EFY+N+    +   +VRG+++ ++P AIN  F LQ  D  +  F + V     +     + ++ + G +W   + +++
Subjt:  IANHGWSQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQ--DFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKTEKR

Query:  TFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVIL-----LD
        T     L      W  F+K +L+PT+H++TVSC R+LL+ +IL   +ID+GKII      C +++   L FPN IT LC +  V     D IL     L+
Subjt:  TFQAAYLKSEANTWLGFIKLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVIL-----LD

Query:  KGIIDTPNLVWLQRTQEARQGGLVCGIHQILEQLALSASRQEFAERQAQ----------TYWAYAKRRDDTLRRALQSNFSK
        K  I  P L+  +  +  +       +       A S   ++  +R  Q           Y+AYAKRRD  L  AL  +  +
Subjt:  KGIIDTPNLVWLQRTQEARQGGLVCGIHQILEQLALSASRQEFAERQAQ----------TYWAYAKRRDDTLRRALQSNFSK

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGCCAGTGTCTCGACACCGACGGTGATTTTTCTGGTCCCTCAAATCCGAAGCGTTGAGACGTCATGGGGTGGCGTCGAGACGCCATCACTCTTGTCTTCCTTGTTTGG
CTGCTTCAAAACTTGGTCCTTTGACGTTGTTTTCGAGTCTCGAGTGCTCCACTTTGATGTTGCTGCTACGATAGTTGAAGAAGGAAATCCGAAGGAACCAGAGGGACAGA
ACACTAAGCTGAGTGACCCAGTAGTTGCAGATACGGAGGGAGTTCAAGAAGAACAAACAGAGGAAGTTCAAGAAAAACAGGCCAAAGATACGCAAGAAGGTAGGATAGAG
GATGTTCAAGAAACAGGTAATGAGCAGGTGGAGCAAGAGCAAGAGGCTCATGTTGAGGTTATCATGCCAGAAGTACCAAAACGTCGCCGTGTGAAGCGAAAAGCTAGACG
CGTCAAGAAAAAAGAGGTTGAGGACAGAGAGAGAGAAGAAGCAGGAAAGAAAGCAGCGGAAGAAACTTTGACAAAGCATCAAGAAGACAGAGGCAAAGGAATTGCTAAAG
CATCGGATGAACCTATAGAAGAAGCAGAAAAAGGACCATTCATCCGCTTCATCAATGAACTTGCTGAAGGATTGGAATTCCGTAATTCCGTAAAGCGGAAGCGTGATTGG
AACCGATTCTGCGAACACCACCACTATGGCTACACCGGTATGACTCTGAGACTTCTAGAGACAGGAGACTTGTGGGAGCCTTTGGGAGAATTCTCTGAGATGGGACCTAA
TGGACCTACAGATCAGAAGCTCCAACGATACGAGACTAATTGGCCAAACTCATTGACCAAGTTTAGTCAACATTCGTTACCTGTGGGTCACTCCACTAAAGACCCACAGC
TGCACTCTTCTCACTATAGAATATTTCTGTGTCCACGGATATCGACCAAGCAAGGGTTTGGTGACGATCTGCCACATTTCTTAAGGGTAGGGATCGCGAATCACGGCTGG
AGTCAGTTTTGTGCGAAACCAGACCCAGTGAATTCGAACATTGTTCGAGAATTTTATGCGAATGTTGATAATGTAGAGGAATTTCAGGCCATAGTCCGAGGAGTGACTGT
TGACTGGAGCCCAGGAGCTATTAATTCACTATTCAACCTTCAGGATTTCCCACACGCAGGCTTTAATGAGATGGTGGTGGCACCATCGAGTGACCAGTTAAATGCGGCGG
TCCGCGAGGTTGGCATTGAGGGGGCTCAATGGAGGCTATCAAAGACGGAGAAGCGAACTTTTCAAGCTGCCTATCTAAAGAGTGAAGCCAATACTTGGTTGGGCTTCATC
AAGCTGCGTTTGCTTCCAACTACGCATGATTCAACGGTGTCTTGCGACCGAGTGCTTCTGATATTCGCAATTCTTCGATCCTTAAGTATTGATGTTGGAAAAATCATTTC
GAATGAAATCTTTAATTGTTGGCGCAAGAAGGTGGGGAAGCTGTTTTTCCCAAATACGATCACTATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCCAGAGGATGTAA
TTTTGCTTGACAAGGGAATCATAGATACGCCTAATCTGGTGTGGCTTCAGCGAACGCAAGAGGCACGCCAGGGTGGGCTTGTGTGTGGAATTCATCAAATCCTAGAGCAA
CTGGCACTGTCGGCCAGTAGGCAAGAGTTTGCTGAAAGGCAAGCTCAAACCTATTGGGCCTATGCTAAAAGGAGAGATGACACACTTCGGAGGGCCTTGCAGTCCAATTT
CTCCAAACCATATCAGCCTGGTCACAGCTGTGGCAAAGAAGATTCTGAGAGGAATTATATTGCTGCAGCAGTCCTTGGTTTTGCAGAATGCTCAGGTCCATTAGGTTCCA
CTGCCGATTGGTGGTGTTCGCACAAGAGGGTTGCTGCGTTTTCAATCTTATTGGCAAGAAAAGGCAAATTGACCAAGGCTTTCTACAAAATAGTGCGGAGCAACTTGAGG
GAGCAAATCCAGTGCTGCAGCAAAGCTGGGAGCAAAACTGTCACGTCACAGCTCGTTAGCCAATTTGATGAACTGAATTCTGTTAAGTTATTCTCTGGTTTATGGAGCAA
GGAGAGCCGTCCACGTGTCTAG
mRNA sequence
ATGGCCAGTGTCTCGACACCGACGGTGATTTTTCTGGTCCCTCAAATCCGAAGCGTTGAGACGTCATGGGGTGGCGTCGAGACGCCATCACTCTTGTCTTCCTTGTTTGG
CTGCTTCAAAACTTGGTCCTTTGACGTTGTTTTCGAGTCTCGAGTGCTCCACTTTGATGTTGCTGCTACGATAGTTGAAGAAGGAAATCCGAAGGAACCAGAGGGACAGA
ACACTAAGCTGAGTGACCCAGTAGTTGCAGATACGGAGGGAGTTCAAGAAGAACAAACAGAGGAAGTTCAAGAAAAACAGGCCAAAGATACGCAAGAAGGTAGGATAGAG
GATGTTCAAGAAACAGGTAATGAGCAGGTGGAGCAAGAGCAAGAGGCTCATGTTGAGGTTATCATGCCAGAAGTACCAAAACGTCGCCGTGTGAAGCGAAAAGCTAGACG
CGTCAAGAAAAAAGAGGTTGAGGACAGAGAGAGAGAAGAAGCAGGAAAGAAAGCAGCGGAAGAAACTTTGACAAAGCATCAAGAAGACAGAGGCAAAGGAATTGCTAAAG
CATCGGATGAACCTATAGAAGAAGCAGAAAAAGGACCATTCATCCGCTTCATCAATGAACTTGCTGAAGGATTGGAATTCCGTAATTCCGTAAAGCGGAAGCGTGATTGG
AACCGATTCTGCGAACACCACCACTATGGCTACACCGGTATGACTCTGAGACTTCTAGAGACAGGAGACTTGTGGGAGCCTTTGGGAGAATTCTCTGAGATGGGACCTAA
TGGACCTACAGATCAGAAGCTCCAACGATACGAGACTAATTGGCCAAACTCATTGACCAAGTTTAGTCAACATTCGTTACCTGTGGGTCACTCCACTAAAGACCCACAGC
TGCACTCTTCTCACTATAGAATATTTCTGTGTCCACGGATATCGACCAAGCAAGGGTTTGGTGACGATCTGCCACATTTCTTAAGGGTAGGGATCGCGAATCACGGCTGG
AGTCAGTTTTGTGCGAAACCAGACCCAGTGAATTCGAACATTGTTCGAGAATTTTATGCGAATGTTGATAATGTAGAGGAATTTCAGGCCATAGTCCGAGGAGTGACTGT
TGACTGGAGCCCAGGAGCTATTAATTCACTATTCAACCTTCAGGATTTCCCACACGCAGGCTTTAATGAGATGGTGGTGGCACCATCGAGTGACCAGTTAAATGCGGCGG
TCCGCGAGGTTGGCATTGAGGGGGCTCAATGGAGGCTATCAAAGACGGAGAAGCGAACTTTTCAAGCTGCCTATCTAAAGAGTGAAGCCAATACTTGGTTGGGCTTCATC
AAGCTGCGTTTGCTTCCAACTACGCATGATTCAACGGTGTCTTGCGACCGAGTGCTTCTGATATTCGCAATTCTTCGATCCTTAAGTATTGATGTTGGAAAAATCATTTC
GAATGAAATCTTTAATTGTTGGCGCAAGAAGGTGGGGAAGCTGTTTTTCCCAAATACGATCACTATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCCAGAGGATGTAA
TTTTGCTTGACAAGGGAATCATAGATACGCCTAATCTGGTGTGGCTTCAGCGAACGCAAGAGGCACGCCAGGGTGGGCTTGTGTGTGGAATTCATCAAATCCTAGAGCAA
CTGGCACTGTCGGCCAGTAGGCAAGAGTTTGCTGAAAGGCAAGCTCAAACCTATTGGGCCTATGCTAAAAGGAGAGATGACACACTTCGGAGGGCCTTGCAGTCCAATTT
CTCCAAACCATATCAGCCTGGTCACAGCTGTGGCAAAGAAGATTCTGAGAGGAATTATATTGCTGCAGCAGTCCTTGGTTTTGCAGAATGCTCAGGTCCATTAGGTTCCA
CTGCCGATTGGTGGTGTTCGCACAAGAGGGTTGCTGCGTTTTCAATCTTATTGGCAAGAAAAGGCAAATTGACCAAGGCTTTCTACAAAATAGTGCGGAGCAACTTGAGG
GAGCAAATCCAGTGCTGCAGCAAAGCTGGGAGCAAAACTGTCACGTCACAGCTCGTTAGCCAATTTGATGAACTGAATTCTGTTAAGTTATTCTCTGGTTTATGGAGCAA
GGAGAGCCGTCCACGTGTCTAG
Protein sequence
MASVSTPTVIFLVPQIRSVETSWGGVETPSLLSSLFGCFKTWSFDVVFESRVLHFDVAATIVEEGNPKEPEGQNTKLSDPVVADTEGVQEEQTEEVQEKQAKDTQEGRIE
DVQETGNEQVEQEQEAHVEVIMPEVPKRRRVKRKARRVKKKEVEDREREEAGKKAAEETLTKHQEDRGKGIAKASDEPIEEAEKGPFIRFINELAEGLEFRNSVKRKRDW
NRFCEHHHYGYTGMTLRLLETGDLWEPLGEFSEMGPNGPTDQKLQRYETNWPNSLTKFSQHSLPVGHSTKDPQLHSSHYRIFLCPRISTKQGFGDDLPHFLRVGIANHGW
SQFCAKPDPVNSNIVREFYANVDNVEEFQAIVRGVTVDWSPGAINSLFNLQDFPHAGFNEMVVAPSSDQLNAAVREVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFI
KLRLLPTTHDSTVSCDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDKGIIDTPNLVWLQRTQEARQGGLVCGIHQILEQ
LALSASRQEFAERQAQTYWAYAKRRDDTLRRALQSNFSKPYQPGHSCGKEDSERNYIAAAVLGFAECSGPLGSTADWWCSHKRVAAFSILLARKGKLTKAFYKIVRSNLR
EQIQCCSKAGSKTVTSQLVSQFDELNSVKLFSGLWSKESRPRV
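
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal Python sketch, not part of CuGenDB: it assumes the standard nuclear genetic code, and the helper name `translate` is hypothetical. The example input is only the first 24 bases of the CDS shown above.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order (assumption:
# the standard nuclear code applies to this gene model).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(cds: str) -> str:
    """Translate a CDS into protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for i in range(0, len(cds), 3):
        amino_acid = CODON_TABLE[cds[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 24 bases of the CDS sequence listed above.
cds_start = "ATGGCCAGTGTCTCGACACCGACG"
print(translate(cds_start))  # MASVSTPT
```

Passing the full CDS text (line breaks included) to `translate` and comparing the result with the protein sequence is one way to confirm that the two records agree.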