CuGenDBv2

Spg014154 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg014154
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold3:28602934..28614881
RNA-Seq Expression: Spg014154
Synteny: Spg014154
Gene Ontology terms: NA
InterPro domains: NA
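
The genome location above follows a `seqid:start..end` pattern. A minimal sketch of splitting it into its parts is shown here; the helper name `parse_location` is hypothetical, and the span length assumes the coordinates are 1-based and inclusive, which the record itself does not state.

```python
import re

def parse_location(text):
    """Split 'seqid:start..end' into (seqid, start, end, span).

    Assumption: coordinates are 1-based and inclusive, so span = end - start + 1.
    """
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return seqid, start, end, end - start + 1

seqid, start, end, span = parse_location("scaffold3:28602934..28614881")
print(seqid, start, end, span)
```

Under that assumption the gene spans 11948 bp on scaffold3.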


Homology
GenBank top hits (e value, % identity, alignment)
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]  e value: 1.9e-30  % identity: 35.35
Query:  PHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLS
        P F+   I  HGW  FC  P +    +VREFYAN+   +   V V+ V+V ++  AIN+++ L+      Y + A   ++EQL  V+ EV IEGA WQ+S
Subjt:  PHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLS

Query:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILF
          G  T     LKR A  W  F+  R +P+TH  T +++RVLL ++ L  +S+++ ++ + EI  C   +K G L+FP+ IT L  +A VP  + + I+ 
Subjt:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILF

Query:  DKGIIDTPNLAQLQR
        + G I T +++++ +
Subjt:  DKGIIDTPNLAQLQR

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]  e value: 1.6e-32  % identity: 36.33
Query:  LKRRAEKGKSVAEASEEPDEIEEHGRFINNFARAKYAELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKED
        +KR A K     +   E  E     R+ NN        +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+    
Subjt:  LKRRAEKGKSVAEASEEPDEIEEHGRFINNFARAKYAELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKED

Query:  GFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRE
           V VRGV+V WS  AINA++ L + P   ++E     +   L  V+  V + GA+W +S  G  T   + L   A  W  F++  +LPTTH  T S++
Subjt:  GFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRE

Query:  RVLLAFATLRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLAQLQRTQE
        R+LL  + L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +A++  TQE
Subjt:  RVLLAFATLRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLAQLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]  e value: 2.3e-44  % identity: 33.97
Query:  RFINNFARAKYA-ELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F    A  +Y   +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFINNFARAKYA-ELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEI
         + P   ++E     + + L  V+  V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH  T S++R+LL  + L   SI+VG+MI +EI
Subjt:  QNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEI

Query:  SGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIID---TPNLAQLQRTQEARQ---------------GGLIYGINTVLEQLALSASRQ----
          C  +K G LFFP+ IT LCR A  P    +  L + G ID      +AQ   T+  +Q               G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIID---TPNLAQLQRTQEARQ---------------GGLIYGINTVLEQLALSASRQ----

Query:  ---EFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
           +   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  ---EFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]  e value: 2.3e-31  % identity: 37.56
Query:  LKRDFLFERGFSGNLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSD
        ++++F+++       P F+   I  H W+LFCA PE     +VREFY N+   D   V +RGV+V  S  AIN +++L + P   ++E     +  +L  
Subjt:  LKRDFLFERGFSGNLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSD

Query:  VVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR
        V+  V I GA+W +S  G  T   + L   A  W  F++ R+LPTTH  T S+E V L ++ L   SI+VG+MI  EI  C  +K G LFFP+ IT +CR
Subjt:  VVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR

Query:  RAGVP
            P
Subjt:  RAGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]  e value: 1.6e-37  % identity: 36.73
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQR
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E     +  +L  V+  V   GA+W +S  G  T   + L   A  W  F++ R
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQR

Query:  MLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLAQL------QRTQE---
        +LPTTH    S++R+LL  + L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A   V+E    L + G ID   +A++      + TQ+   
Subjt:  MLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLAQL------QRTQE---

Query:  --------ARQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
                +R  G +      LEQ     S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  --------ARQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment)  e value: 7.6e-33  % identity: 36.33
Query:  LKRRAEKGKSVAEASEEPDEIEEHGRFINNFARAKYAELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKED
        +KR A K     +   E  E     R+ NN        +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+    
Subjt:  LKRRAEKGKSVAEASEEPDEIEEHGRFINNFARAKYAELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKED

Query:  GFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRE
           V VRGV+V WS  AINA++ L + P   ++E     +   L  V+  V + GA+W +S  G  T   + L   A  W  F++  +LPTTH  T S++
Subjt:  GFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRE

Query:  RVLLAFATLRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLAQLQRTQE
        R+LL  + L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +A++  TQE
Subjt:  RVLLAFATLRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLAQLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)  e value: 1.1e-44  % identity: 33.97
Query:  RFINNFARAKYA-ELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F    A  +Y   +  R    E+GF        G LP F+   I  H W+ FCA PE     +VREFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFINNFARAKYA-ELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEI
         + P   ++E     + + L  V+  V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH  T S++R+LL  + L   SI+VG+MI +EI
Subjt:  QNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEI

Query:  SGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIID---TPNLAQLQRTQEARQ---------------GGLIYGINTVLEQLALSASRQ----
          C  +K G LFFP+ IT LCR A  P    +  L + G ID      +AQ   T+  +Q               G ++  +  + ++L+    +Q    
Subjt:  SGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIID---TPNLAQLQRTQEARQ---------------GGLIYGINTVLEQLALSASRQ----

Query:  ---EFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
           +   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  ---EFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

A0A2P5DAQ2 Uncharacterized protein  e value: 1.1e-31  % identity: 37.56
Query:  LKRDFLFERGFSGNLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSD
        ++++F+++       P F+   I  H W+LFCA PE     +VREFY N+   D   V +RGV+V  S  AIN +++L + P   ++E     +  +L  
Subjt:  LKRDFLFERGFSGNLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSD

Query:  VVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR
        V+  V I GA+W +S  G  T   + L   A  W  F++ R+LPTTH  T S+E V L ++ L   SI+VG+MI  EI  C  +K G LFFP+ IT +CR
Subjt:  VVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR

Query:  RAGVP
            P
Subjt:  RAGVP

A0A2P5DXM3 Uncharacterized protein  e value: 7.8e-38  % identity: 36.73
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQR
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E     +  +L  V+  V   GA+W +S  G  T   + L   A  W  F++ R
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQR

Query:  MLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLAQL------QRTQE---
        +LPTTH    S++R+LL  + L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A   V+E    L + G ID   +A++      + TQ+   
Subjt:  MLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLAQL------QRTQE---

Query:  --------ARQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
                +R  G +      LEQ     S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  --------ARQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

W9QTD9 Uncharacterized protein  e value: 9.2e-31  % identity: 35.35
Query:  PHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLS
        P F+   I  HGW  FC  P +    +VREFYAN+   +   V V+ V+V ++  AIN+++ L+      Y + A   ++EQL  V+ EV IEGA WQ+S
Subjt:  PHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLS

Query:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILF
          G  T     LKR A  W  F+  R +P+TH  T +++RVLL ++ L  +S+++ ++ + EI  C   +K G L+FP+ IT L  +A VP  + + I+ 
Subjt:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILF

Query:  DKGIIDTPNLAQLQR
        + G I T +++++ +
Subjt:  DKGIIDTPNLAQLQR

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGAGGTACTTGTTACCCCTGAGGTGCAGAAGAACCCAGAGGAGGCTGAGCAAAGAGTCGCGGATAC
AGAAAAAGTTCAAGAGGAGCAAACAGAGGAAGTTCGAGAGGAAAATACAGAGGAAGTTCGAGAGGAAAATACCGAGGAAGTTCGAGAAGAAATTTCAGAAGAAGTTCAAG
AAAAGCAGGCCGAGGATGTGCAAGAGCAACAGGCAGAAGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGAGGCTCGAGTGGAGGTGATCATGCCAGAAGCACTGAAA
CGTCGCCGCGTTAAGAGGAATGCAGGCCGCGTTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCAACCACGGATTCTGAAAGAGAGAATGCGGAGAGAGTAGAGCGTGA
GAAGAAGAAAGCCGAAGAAAGAGCAAGAGAAGAGGCAGAGAAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAGTGTTGCTGAAGCATCGG
AGGAACCTGATGAAATAGAAGAACATGGTCGCTTCATCAACAATTTTGCCAGAGCAAAATACGCTGAGCTGCTGAAAAGAGACTTCCTGTTTGAAAGAGGATTTAGCGGT
AATCTTCCACATTTTCTGAGGACCGGCATTGCAGACCATGGCTGGGAGTTGTTTTGTGCGAAGCCTGAGTCTGTAAACGCACAGGTGGTGCGTGAATTTTATGCTAATAT
TGATAAAGAAGATGGTTTCCAGGTGATTGTTCGAGGAGTCGAGGTAGACTGGAGTCCTAGTGCTATCAATGCACTGTATAACCTTCAGAATTTCCCCCATGCAGCTTATA
ATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGTTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACAGGGAAAAGGACATTTCAG
TCCGCTTATCTGAAGAGGGAAGCGAACACGTGGATGGGATTTATCAGACAGAGGATGCTTCCAACGACTCATGACTCGACGGCCTCGAGGGAACGGGTTCTTCTGGCTTT
CGCGACTTTGCGGTCTCTTAGCATTGATGTAGGGAAGATGATTGTTAATGAGATTTCTGGTTGTTGGAAAAAGAAAGTGGGGAAACTGTTCTTTCCGAACACAATCACGA
TGCTTTGCAGAAGAGCAGGGGTTCCAGTGGATGAGGGAGATGTTATCCTGTTTGACAAGGGAATTATAGATACGCCTAACTTGGCGCAGCTTCAGCGTACGCAGGAGGCA
CGTCAAGGTGGCCTTATCTACGGCATCAACACGGTTTTAGAACAACTGGCACTTTCAGCCAGCAGGCAAGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTGGAACTATGT
TAGAAATCGTGATGCCAATCTGAAGAAGGCGCTGCAAGAGAATTTTTCCAAGCCGTATCCAGCCCTTCCAGCATTCCCTGAGGATCTGTTGAACCCTTGGATTCCACCCC
CACCTGTTGAAAGAGAAGAGGAGGATGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGGTAGTGTTG
ACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCACGCTTACTAAGCTATGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTA
TTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAAATCTGTGCTGCAGCAAAGCTGGGAGCAAAACTGCCACGTAG
AGATCGAGCTCCCAGTGCCTAATACACTGCCAACGTCTGCTGAAAGCTCCAGATCAAGCTCCAGTACGTTCAACGTTCAGGAGGGTCAGCGGTTAGATGTTGGGTCAGTT
CAGATAGAACTAGTAGAGTATGGTCCGAGTATTATTGAGAGTGAGTCCAAGCACAGGATTTTCGAGCAAACCAGGAGGATCTCTGGTGTTCTCTGGTGTTTTGAGCATTC
TGGGGTGTACAAGTCGAATCAGAAGCTCTATTTGGTAAGTTGTAGAGTGTTCGAAAATGAAGAGTTTGATGTGTATTCCATTAATTGTGGTTTGGTATTGAGGAAGTTAT
GGGTGTTGGAAGTTAGGGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGCTCCAGATCAAGCTCCAGTACGTGGGTGAAGTTGTATATTGAG
TCTGTTCATATTGTAAATGTTTTGTTCAACGTACAGGAGGGTCAGCGGGTATCGTTAGAGGAGGACGATGTCCGTTGGCGTCACGCCATCTTTCGGGCTAAGCTAGTAGG
TGGTCCGGGAGGGGGTGTGACAGGAGCTGAATGGAGTACTTATTCTTTTTCTTAG
mRNA sequence
ATGGCAAAAACAAGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGAGGTACTTGTTACCCCTGAGGTGCAGAAGAACCCAGAGGAGGCTGAGCAAAGAGTCGCGGATAC
AGAAAAAGTTCAAGAGGAGCAAACAGAGGAAGTTCGAGAGGAAAATACAGAGGAAGTTCGAGAGGAAAATACCGAGGAAGTTCGAGAAGAAATTTCAGAAGAAGTTCAAG
AAAAGCAGGCCGAGGATGTGCAAGAGCAACAGGCAGAAGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGAGGCTCGAGTGGAGGTGATCATGCCAGAAGCACTGAAA
CGTCGCCGCGTTAAGAGGAATGCAGGCCGCGTTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCAACCACGGATTCTGAAAGAGAGAATGCGGAGAGAGTAGAGCGTGA
GAAGAAGAAAGCCGAAGAAAGAGCAAGAGAAGAGGCAGAGAAAAAGGCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAGTGTTGCTGAAGCATCGG
AGGAACCTGATGAAATAGAAGAACATGGTCGCTTCATCAACAATTTTGCCAGAGCAAAATACGCTGAGCTGCTGAAAAGAGACTTCCTGTTTGAAAGAGGATTTAGCGGT
AATCTTCCACATTTTCTGAGGACCGGCATTGCAGACCATGGCTGGGAGTTGTTTTGTGCGAAGCCTGAGTCTGTAAACGCACAGGTGGTGCGTGAATTTTATGCTAATAT
TGATAAAGAAGATGGTTTCCAGGTGATTGTTCGAGGAGTCGAGGTAGACTGGAGTCCTAGTGCTATCAATGCACTGTATAACCTTCAGAATTTCCCCCATGCAGCTTATA
ATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGTTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACAGGGAAAAGGACATTTCAG
TCCGCTTATCTGAAGAGGGAAGCGAACACGTGGATGGGATTTATCAGACAGAGGATGCTTCCAACGACTCATGACTCGACGGCCTCGAGGGAACGGGTTCTTCTGGCTTT
CGCGACTTTGCGGTCTCTTAGCATTGATGTAGGGAAGATGATTGTTAATGAGATTTCTGGTTGTTGGAAAAAGAAAGTGGGGAAACTGTTCTTTCCGAACACAATCACGA
TGCTTTGCAGAAGAGCAGGGGTTCCAGTGGATGAGGGAGATGTTATCCTGTTTGACAAGGGAATTATAGATACGCCTAACTTGGCGCAGCTTCAGCGTACGCAGGAGGCA
CGTCAAGGTGGCCTTATCTACGGCATCAACACGGTTTTAGAACAACTGGCACTTTCAGCCAGCAGGCAAGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTGGAACTATGT
TAGAAATCGTGATGCCAATCTGAAGAAGGCGCTGCAAGAGAATTTTTCCAAGCCGTATCCAGCCCTTCCAGCATTCCCTGAGGATCTGTTGAACCCTTGGATTCCACCCC
CACCTGTTGAAAGAGAAGAGGAGGATGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCTGCGGCAAAGAAAATTCTGGAGGTAGTGTTG
ACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCACGCTTACTAAGCTATGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTA
TTTTGCTGCAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAAATCTGTGCTGCAGCAAAGCTGGGAGCAAAACTGCCACGTAG
AGATCGAGCTCCCAGTGCCTAATACACTGCCAACGTCTGCTGAAAGCTCCAGATCAAGCTCCAGTACGTTCAACGTTCAGGAGGGTCAGCGGTTAGATGTTGGGTCAGTT
CAGATAGAACTAGTAGAGTATGGTCCGAGTATTATTGAGAGTGAGTCCAAGCACAGGATTTTCGAGCAAACCAGGAGGATCTCTGGTGTTCTCTGGTGTTTTGAGCATTC
TGGGGTGTACAAGTCGAATCAGAAGCTCTATTTGGTAAGTTGTAGAGTGTTCGAAAATGAAGAGTTTGATGTGTATTCCATTAATTGTGGTTTGGTATTGAGGAAGTTAT
GGGTGTTGGAAGTTAGGGTAGAGATCGAGCTCCCGGTGCCTGATACACTGCCAACGTCTGCTGAAAGCTCCAGATCAAGCTCCAGTACGTGGGTGAAGTTGTATATTGAG
TCTGTTCATATTGTAAATGTTTTGTTCAACGTACAGGAGGGTCAGCGGGTATCGTTAGAGGAGGACGATGTCCGTTGGCGTCACGCCATCTTTCGGGCTAAGCTAGTAGG
TGGTCCGGGAGGGGGTGTGACAGGAGCTGAATGGAGTACTTATTCTTTTTCTTAG
Protein sequence
MAKTRARKERENEEEEVLVTPEVQKNPEEAEQRVADTEKVQEEQTEEVREENTEEVREENTEEVREEISEEVQEKQAEDVQEQQAEDVQVTDNEPVQEARVEVIMPEALK
RRRVKRNAGRVRVVRTDTPSPPTTDSERENAERVEREKKKAEERAREEAEKKAEEERLLKRRAEKGKSVAEASEEPDEIEEHGRFINNFARAKYAELLKRDFLFERGFSG
NLPHFLRTGIADHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDVVREVGIEGAQWQLSKTGKRTFQ
SAYLKREANTWMGFIRQRMLPTTHDSTASRERVLLAFATLRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLAQLQRTQEA
RQGGLIYGINTVLEQLALSASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQETFCLSIFSGLVVAAAKKILEVVL
TYVIRFKLRSSPTLTKLWQVLRIELKVVIICPCRKNYFAAAELGFAECSESVAGRLEGAKSVLQQSWEQNCHVEIELPVPNTLPTSAESSRSSSSTFNVQEGQRLDVGSV
QIELVEYGPSIIESESKHRIFEQTRRISGVLWCFEHSGVYKSNQKLYLVSCRVFENEEFDVYSINCGLVLRKLWVLEVRVEIELPVPDTLPTSAESSRSSSSTWVKLYIE
SVHIVNVLFNVQEGQRVSLEEDDVRWRHAIFRAKLVGGPGGGVTGAEWSTYSFS
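
The CDS and protein sequences in this record can be checked against each other by translating the CDS. The sketch below assumes the standard nuclear genetic code (NCBI translation table 1) and a reading frame starting at the first base; the record does not say which table the database used. The names `CODON_TABLE` and `translate` are illustrative, and the demo translates only the first 30 bases and the last 15 bases of the CDS shown above.

```python
import itertools

# Standard genetic code (NCBI table 1), codons enumerated in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Codons containing characters outside TCAG are rendered as 'X';
    a trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First 30 bases of the CDS and the final 15 bases (including the stop codon).
print(translate("ATGGCAAAAACAAGAGCGAGGAAAGAAAGA"))
print(translate("TATTCTTTTTCTTAG"))
```

Joining the CDS lines into one string and passing it to `translate` gives a full-length translation to compare with the protein sequence listed in the record.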