CuGenDBv2

Spg029613 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg029613
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold2:20930460..20933633
RNA-Seq Expression: Spg029613
Synteny: Spg029613
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits | e value | % identity
KAE8718449.1 hypothetical protein F3Y22_tig00110013pilonHSYRG00240 [Hibiscus syriacus] | 2.9e-28 | 27.7
Query:  LSYDRFVSNHARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALY
        +++ +F ++ A+A++     R+  FE GF       G     +   +    W +F   P SVNA +V+EFYANI K       VRG ++ ++  AIN  +
Subjt:  LSYDRFVSNHARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALY

Query:  NLRTL--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKII
        +L+ +   HA + E A    + +    + ++  E  +W   +T + +     L+  A  W  F+K +L+PT+H++ VS  R+LL  +++ S  IDVG+II
Subjt:  NLRTL--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKII

Query:  ANEISGCWKKKVGKLFFPNTITMLCKR---------------AGRTQEARQGGLVYGI-------------------NTILEQLAL-SASRQEFAERQAL
          ++  C  KK   L FPN IT LC++               +G T++     L+ G+                   N  +  LAL  A  Q  A+  AL
Subjt:  ANEISGCWKKKVGKLFFPNTITMLCKR---------------AGRTQEARQGGLVYGI-------------------NTILEQLAL-SASRQEFAERQAL

Query:  -----TFWDYVRNRDVNLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPA
              F+ YV++RDV ++   QE         P FP+++L  +      E E D  + PA
Subjt:  -----TFWDYVRNRDVNLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPA

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] | 4.5e-29 | 36.16
Query:  RFVSNHARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNL
        +F +  A  +Y   +  R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+         VRG++V WS  AINA++ L
Subjt:  RFVSNHARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNL

Query:  RTLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEI
           P   ++E     +   L   +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH   VS++R+LL  ++L   SI+VG++I +EI
Subjt:  RTLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEI

Query:  SGCWKKKVGKLFFPNTITMLCKRA
          C  +K G LFFP+ IT LC+ A
Subjt:  SGCWKKKVGKLFFPNTITMLCKRA

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | 2.8e-39 | 31.06
Query:  RFVSNHARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNL
        +F +  A  +Y   +  R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+   E     VRG++V WS  AINA++ L
Subjt:  RFVSNHARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNL

Query:  RTLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEI
           P   ++E     + + L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH   VS++R+LL  ++L   SI+VG++I +EI
Subjt:  RTLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEI

Query:  SGCWKKKVGKLFFPNTITMLCKRA----------------------------GRTQEARQ---------------GGLVYGINTILEQLALSASRQ----
          C  +K G LFFP+ IT LC+ A                            G T+  +Q               G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCKRA----------------------------GRTQEARQ---------------GGLVYGINTILEQLALSASRQ----

Query:  ---EFAERQALTFWDYVRNRDVNLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPA
           +   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E + DG  + A
Subjt:  ---EFAERQALTFWDYVRNRDVNLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPA

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] | 1.5e-27 | 33.48
Query:  RFVSNHARAKYAE-------LLKRDFLFERGFSGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNLR
        +F S  A  +Y E        ++++F+++     + P F+   I  H W+ FC+ PE     +VREFY N+   +     +RG++V  S  AIN +++L 
Subjt:  RFVSNHARAKYAE-------LLKRDFLFERGFSGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNLR

Query:  TLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEIS
          P   ++E     +  +L   +  V I GA+W +S     T   + L   A  W  F+K RLLPTTH   VS+E V L +++L   SI+VG++I  EI 
Subjt:  TLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEIS

Query:  GCWKKKVGKLFFPNTITMLCK
         C  +K G LFFP+ IT +C+
Subjt:  GCWKKKVGKLFFPNTITMLCK

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] | 1.4e-33 | 33.22
Query:  VVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNLRTLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQR
        +VREFYAN+   E     VRG++V WS  AINA++ L   P   ++E     +  +L   +  V   GA+W +S     T   + L   A  W  F+K R
Subjt:  VVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNLRTLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQR

Query:  LLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEISGCWKKKVGKLFFPNTITMLCKRA--------------------------GRTQEARQ-----
        LLPTTH   VS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC+ A                          G T+  +Q     
Subjt:  LLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEISGCWKKKVGKLFFPNTITMLCKRA--------------------------GRTQEARQ-----

Query:  ----------GGLVYGINTILEQLALSASRQEFAERQALTFWDYVRNRDVNLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPA
                  G ++  +  + ++L    S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E + DG  + A
Subjt:  ----------GGLVYGINTILEQLALSASRQEFAERQALTFWDYVRNRDVNLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPA

TrEMBL top hits | e value | % identity
A0A2P5AGA5 Uncharacterized protein (Fragment) | 2.2e-29 | 36.16
Query:  RFVSNHARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNL
        +F +  A  +Y   +  R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+         VRG++V WS  AINA++ L
Subjt:  RFVSNHARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNL

Query:  RTLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEI
           P   ++E     +   L   +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH   VS++R+LL  ++L   SI+VG++I +EI
Subjt:  RTLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEI

Query:  SGCWKKKVGKLFFPNTITMLCKRA
          C  +K G LFFP+ IT LC+ A
Subjt:  SGCWKKKVGKLFFPNTITMLCKRA

A0A2P5BCG4 Uncharacterized protein (Fragment) | 1.4e-39 | 31.06
Query:  RFVSNHARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNL
        +F +  A  +Y   +  R    E+GF        G LP F+   I  H W++FC+ PE     +VREFYAN+   E     VRG++V WS  AINA++ L
Subjt:  RFVSNHARAKYA-ELLKRDFLFERGF-------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNL

Query:  RTLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEI
           P   ++E     + + L   +  V   GA+W +S     T   + L   A  W  F+K RLLPTTH   VS++R+LL  ++L   SI+VG++I +EI
Subjt:  RTLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEI

Query:  SGCWKKKVGKLFFPNTITMLCKRA----------------------------GRTQEARQ---------------GGLVYGINTILEQLALSASRQ----
          C  +K G LFFP+ IT LC+ A                            G T+  +Q               G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCKRA----------------------------GRTQEARQ---------------GGLVYGINTILEQLALSASRQ----

Query:  ---EFAERQALTFWDYVRNRDVNLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPA
           +   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E + DG  + A
Subjt:  ---EFAERQALTFWDYVRNRDVNLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPA

A0A2P5DAQ2 Uncharacterized protein | 7.0e-28 | 33.48
Query:  RFVSNHARAKYAE-------LLKRDFLFERGFSGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNLR
        +F S  A  +Y E        ++++F+++     + P F+   I  H W+ FC+ PE     +VREFY N+   +     +RG++V  S  AIN +++L 
Subjt:  RFVSNHARAKYAE-------LLKRDFLFERGFSGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNLR

Query:  TLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEIS
          P   ++E     +  +L   +  V I GA+W +S     T   + L   A  W  F+K RLLPTTH   VS+E V L +++L   SI+VG++I  EI 
Subjt:  TLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEIS

Query:  GCWKKKVGKLFFPNTITMLCK
         C  +K G LFFP+ IT +C+
Subjt:  GCWKKKVGKLFFPNTITMLCK

A0A2P5DXM3 Uncharacterized protein | 6.6e-34 | 33.22
Query:  VVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNLRTLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQR
        +VREFYAN+   E     VRG++V WS  AINA++ L   P   ++E     +  +L   +  V   GA+W +S     T   + L   A  W  F+K R
Subjt:  VVREFYANIDKEEGFLAIVRGIEVDWSPCAINALYNLRTLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQR

Query:  LLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEISGCWKKKVGKLFFPNTITMLCKRA--------------------------GRTQEARQ-----
        LLPTTH   VS++R+LL  ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC+ A                          G T+  +Q     
Subjt:  LLPTTHDSAVSRERVLLAFAILRSLSIDVGKIIANEISGCWKKKVGKLFFPNTITMLCKRA--------------------------GRTQEARQ-----

Query:  ----------GGLVYGINTILEQLALSASRQEFAERQALTFWDYVRNRDVNLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPA
                  G ++  +  + ++L    S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E + DG  + A
Subjt:  ----------GGLVYGINTILEQLALSASRQEFAERQALTFWDYVRNRDVNLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPA

A0A6A3BU96 Uncharacterized protein | 1.4e-28 | 27.7
Query:  LSYDRFVSNHARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALY
        +++ +F ++ A+A++     R+  FE GF       G     +   +    W +F   P SVNA +V+EFYANI K       VRG ++ ++  AIN  +
Subjt:  LSYDRFVSNHARAKYAELLKRDFLFERGF------SGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEVDWSPCAINALY

Query:  NLRTL--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKII
        +L+ +   HA + E A    + +    + ++  E  +W   +T + +     L+  A  W  F+K +L+PT+H++ VS  R+LL  +++ S  IDVG+II
Subjt:  NLRTL--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDVGKII

Query:  ANEISGCWKKKVGKLFFPNTITMLCKR---------------AGRTQEARQGGLVYGI-------------------NTILEQLAL-SASRQEFAERQAL
          ++  C  KK   L FPN IT LC++               +G T++     L+ G+                   N  +  LAL  A  Q  A+  AL
Subjt:  ANEISGCWKKKVGKLFFPNTITMLCKR---------------AGRTQEARQGGLVYGI-------------------NTILEQLAL-SASRQEFAERQAL

Query:  -----TFWDYVRNRDVNLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPA
              F+ YV++RDV ++   QE         P FP+++L  +      E E D  + PA
Subjt:  -----TFWDYVRNRDVNLKKALQENFSKPFPALPAFPEDLLNPWIPPPPVEREGDGEEDPA

SwissProt top hits
No hits found

Arabidopsis top hits
No hits found

Sequences
CDS sequence
ATGGCTAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACCCGTGACCCCCGAAGCACCGAAAGTAAAGGCAAAGAAGAAGAAGACACCAGAA
GAAACAGAAGCTAAAAGAAGAAGAAGACAGCAGAGGGCTGAAGATCAAGAAGCTGTTCAGAAAGCGGCGGAGGATGTTATTGTGGAAAAAGATCCGAAAGAACCA
GAAGGACAGAATCCAGAGCAGACTGACCCGATAGTTGCGGATACAGAGGAAGTTCAAGAAGAAAATACAGAGGGAGTTCGAGAAGAAAATGCAGAGGAAGTTCGA
GAAGAAAATACAGAGGAAGTTCAAGAAAAGCAGGCCGAGGATGTGCAAGAAGAACAGGCAGAGGTTGCGCCTGAAGAAGTTAGTGAGCAAGAACGGGAGGCTCGG
GTGGAGGTGATCATGCCGGAAGTGCCCAAACGCCGCCGTATAAAGCGAAAAGCGGGCCGTGTTAAGGTAGTCCGAACTGATACCCCCTCGCCTCCAACTACTGAT
TCTGAAAGAGAGAATGCAGAAAGAGAAGAACGTGAGAAGAAGGAGGAGGATAAAGCAAGAGAGGAAGCAGAGAAAAAGGCTGAAGAAGAAAGATTGCTCAAGCAA
AGGGCAGACAGGGGCAAGAGTGTTGCTGCGGCATCAGAGGAACCTGATGAAATAGAAGAGTCACAATTGTCGTATGATCGCTTTGTCAGCAATCATGCCAGAGCA
AAGTATGCAGAGTTGCTGAAAAGAGACTTCCTGTTTGAAAGGGGATTTAGTGGTGATCTTCCACATTTTCTGAGGACCGGTATTGCAGACCACGGTTGGGAACGG
TTTTGTTCAAAGCCTGAATCTGTAAACGCACAGGTGGTGCGTGAGTTTTATGCAAATATTGACAAAGAAGAAGGTTTCCTAGCAATTGTTCGAGGTATTGAGGTC
GACTGGAGTCCTTGTGCTATTAATGCACTGTATAACCTTCGAACTTTACCCCACGCAGCATATAATGAGATGGCTGTAGCGCCATCCAATGAGCAACTGAGTGAC
GCTGTAAGGGAAGTTGGTATTGAAGGGGCGCAGTGGCGGCTTTCGAAAACAGAGAAGAGGACGTTCCAATTAGCCTATTTGAAGAGGGAAGCAAATACTTGGATG
GGATTTATCAAACAAAGGTTGCTTCCAACGACTCATGACTCGGCGGTTTCTAGGGAACGAGTGCTTCTGGCTTTCGCTATTTTGAGGTCTCTCAGTATTGATGTG
GGAAAAATTATTGCTAATGAAATATCTGGATGTTGGAAGAAGAAAGTAGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTGCAAGCGAGCAGGGCGTACG
CAAGAGGCACGTCAGGGTGGGCTGGTCTACGGCATCAACACGATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAGGAGTTTGCCGAGCGGCAAGCTTTAACC
TTCTGGGACTATGTTAGAAATCGTGATGTCAATCTAAAGAAGGCACTACAGGAGAATTTTTCCAAACCATTTCCAGCCCTTCCAGCATTCCCTGAAGATTTATTG
AACCCCTGGATTCCGCCACCGCCTGTCGAGAGAGAAGGAGATGGAGAAGAAGATCCTGCTAGGTTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATT
TGTCCATGCCGGAAGAATTATTTTGCTGAAGCCGAGCTTGGTTTTGCAGAATGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAACTCTGTGCTGGAGCAA
AGCTGGGAACAAAACTGCCACGTCACAGCTCGTTAG
mRNA sequence
ATGGCTAAAACGAGAGCAAGAAAAGAGAGAGATAATGAGGAAGAGGAGGTACCCGTGACCCCCGAAGCACCGAAAGTAAAGGCAAAGAAGAAGAAGACACCAGAA
GAAACAGAAGCTAAAAGAAGAAGAAGACAGCAGAGGGCTGAAGATCAAGAAGCTGTTCAGAAAGCGGCGGAGGATGTTATTGTGGAAAAAGATCCGAAAGAACCA
GAAGGACAGAATCCAGAGCAGACTGACCCGATAGTTGCGGATACAGAGGAAGTTCAAGAAGAAAATACAGAGGGAGTTCGAGAAGAAAATGCAGAGGAAGTTCGA
GAAGAAAATACAGAGGAAGTTCAAGAAAAGCAGGCCGAGGATGTGCAAGAAGAACAGGCAGAGGTTGCGCCTGAAGAAGTTAGTGAGCAAGAACGGGAGGCTCGG
GTGGAGGTGATCATGCCGGAAGTGCCCAAACGCCGCCGTATAAAGCGAAAAGCGGGCCGTGTTAAGGTAGTCCGAACTGATACCCCCTCGCCTCCAACTACTGAT
TCTGAAAGAGAGAATGCAGAAAGAGAAGAACGTGAGAAGAAGGAGGAGGATAAAGCAAGAGAGGAAGCAGAGAAAAAGGCTGAAGAAGAAAGATTGCTCAAGCAA
AGGGCAGACAGGGGCAAGAGTGTTGCTGCGGCATCAGAGGAACCTGATGAAATAGAAGAGTCACAATTGTCGTATGATCGCTTTGTCAGCAATCATGCCAGAGCA
AAGTATGCAGAGTTGCTGAAAAGAGACTTCCTGTTTGAAAGGGGATTTAGTGGTGATCTTCCACATTTTCTGAGGACCGGTATTGCAGACCACGGTTGGGAACGG
TTTTGTTCAAAGCCTGAATCTGTAAACGCACAGGTGGTGCGTGAGTTTTATGCAAATATTGACAAAGAAGAAGGTTTCCTAGCAATTGTTCGAGGTATTGAGGTC
GACTGGAGTCCTTGTGCTATTAATGCACTGTATAACCTTCGAACTTTACCCCACGCAGCATATAATGAGATGGCTGTAGCGCCATCCAATGAGCAACTGAGTGAC
GCTGTAAGGGAAGTTGGTATTGAAGGGGCGCAGTGGCGGCTTTCGAAAACAGAGAAGAGGACGTTCCAATTAGCCTATTTGAAGAGGGAAGCAAATACTTGGATG
GGATTTATCAAACAAAGGTTGCTTCCAACGACTCATGACTCGGCGGTTTCTAGGGAACGAGTGCTTCTGGCTTTCGCTATTTTGAGGTCTCTCAGTATTGATGTG
GGAAAAATTATTGCTAATGAAATATCTGGATGTTGGAAGAAGAAAGTAGGGAAGCTGTTTTTCCCGAATACCATTACCATGCTTTGCAAGCGAGCAGGGCGTACG
CAAGAGGCACGTCAGGGTGGGCTGGTCTACGGCATCAACACGATTTTAGAACAACTCGCACTTTCGGCCAGCAGGCAGGAGTTTGCCGAGCGGCAAGCTTTAACC
TTCTGGGACTATGTTAGAAATCGTGATGTCAATCTAAAGAAGGCACTACAGGAGAATTTTTCCAAACCATTTCCAGCCCTTCCAGCATTCCCTGAAGATTTATTG
AACCCCTGGATTCCGCCACCGCCTGTCGAGAGAGAAGGAGATGGAGAAGAAGATCCTGCTAGGTTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATT
TGTCCATGCCGGAAGAATTATTTTGCTGAAGCCGAGCTTGGTTTTGCAGAATGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAACTCTGTGCTGGAGCAA
AGCTGGGAACAAAACTGCCACGTCACAGCTCGTTAG
Protein sequence
MAKTRARKERDNEEEEVPVTPEAPKVKAKKKKTPEETEAKRRRRQQRAEDQEAVQKAAEDVIVEKDPKEPEGQNPEQTDPIVADTEEVQEENTEGVREENAEEVR
EENTEEVQEKQAEDVQEEQAEVAPEEVSEQEREARVEVIMPEVPKRRRIKRKAGRVKVVRTDTPSPPTTDSERENAEREEREKKEEDKAREEAEKKAEEERLLKQ
RADRGKSVAAASEEPDEIEESQLSYDRFVSNHARAKYAELLKRDFLFERGFSGDLPHFLRTGIADHGWERFCSKPESVNAQVVREFYANIDKEEGFLAIVRGIEV
DWSPCAINALYNLRTLPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWRLSKTEKRTFQLAYLKREANTWMGFIKQRLLPTTHDSAVSRERVLLAFAILRSLSIDV
GKIIANEISGCWKKKVGKLFFPNTITMLCKRAGRTQEARQGGLVYGINTILEQLALSASRQEFAERQALTFWDYVRNRDVNLKKALQENFSKPFPALPAFPEDLL
NPWIPPPPVEREGDGEEDPARLWQVLRIELKVVIICPCRKNYFAEAELGFAECSESVAGRLEGANSVLEQSWEQNCHVTAR