CuGenDBv2

Spg034451 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg034451
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold4:12448391..12457845
RNA-Seq Expression: Spg034451
Synteny: Spg034451
Gene Ontology terms: NA
InterPro domains: NA
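
The Genome location field packs a sequence name and a coordinate range into one string. A minimal sketch of splitting it apart follows; the names `Location` and `parse_location` are hypothetical helpers, not part of CuGenDB, and the span calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    """A parsed 'seqid:start..end' location (hypothetical helper type)."""
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a string such as 'scaffold4:12448391..12457845'."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    seqid, start, end = match.group(1), int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError(f"start {start} is greater than end {end}")
    return Location(seqid, start, end)


loc = parse_location("scaffold4:12448391..12457845")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption the gene spans 9455 bp on scaffold4, which is longer than the CDS listed further down and would be consistent with the gene containing introns.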


Homology
GenBank top hits | e value | % identity | Alignment
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis] | 1.3e-30 | 36.71
Query:  HDHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQ
        H HGW  FC  P +    +VREFYAN+   +   V V+ V+V ++  AIN+++ L+      Y + A   ++EQL   + EV IEGA WQ+S  G  T  
Subjt:  HDHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQ

Query:  SAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTP
           LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +S+++ ++ + EI  C   +K G L+FP+ IT L  +A VP  + + I+ + G I T 
Subjt:  SAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTP

Query:  NLARLQR
        +++R+ +
Subjt:  NLARLQR

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] | 2.1e-28 | 36.8
Query:  KEAEERAREETE-KKAKEERLLKRR---AEKGKSVAEASEEPDEIE-----EHDHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSA
        ++A +  + ETE  + + E  ++ R   AEKG  V + SE   ++         H W+ FCA PE     +VREFYAN+       V VRGV+V WS  A
Subjt:  KEAEERAREETE-KKAKEERLLKRR---AEKGKSVAEASEEPDEIE-----EHDHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSA

Query:  INALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVG
        INA++ L + P   ++E     +   L   +  V + GA+W +S  G  T   + L   A  W  F++  +LPTTH  TVS++R+LL  ++L   SI+VG
Subjt:  INALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVG

Query:  KMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRTQEARQASRQE
        +MI +EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+  TQE    S Q+
Subjt:  KMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRTQEARQASRQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] | 3.6e-36 | 32.73
Query:  SERENAERVECDKKEAEERAREETEKKA-KEERLLKRR---AEKGKSVAEASEEPDEIE-----EHDHGWELFCAKPESVNAQVVREFYANIDKEDGFQV
        S   N   ++   ++A +  + ETE  A + E  ++ R   AEKG  V + SE   ++         H W+ FCA PE     +VREFYAN+   +   V
Subjt:  SERENAERVECDKKEAEERAREETEKKA-KEERLLKRR---AEKGKSVAEASEEPDEIE-----EHDHGWELFCAKPESVNAQVVREFYANIDKEDGFQV

Query:  IVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLL
         VRGV+V WS  AINA++ L + P   ++E     + + L   +  V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS++R+LL
Subjt:  IVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLL

Query:  AFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL-------------------------------
          ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+                               
Subjt:  AFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL-------------------------------

Query:  -----------QRTQEARQASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDQE
                   Q  Q+    S  +   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  -----------QRTQEARQASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDQE

PON50458.1 hypothetical protein PanWU01x14_223230, partial [Parasponia andersonii] | 3.3e-26 | 35.06
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNF--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIR
        +VREFY N+   D   V VRGV+V  S  AIN +Y L +    H+ + E    P   +L+  +  V I GA+W +S  G  T   + L   A  W  F++
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNF--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIR

Query:  QRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRTQEARQASRQ
         R+LPTTH   VS+ERVLL +++L   SI++G+MI  EI  C  +K G LFFP+ I  +CR A  P    +  L + G ID   +AR+ +   A + S Q
Subjt:  QRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRTQEARQASRQ

Query:  EFAERQALTFWNYVRNRDANLKKALQENFSK
          + R      +   +      K+L+++ S+
Subjt:  EFAERQALTFWNYVRNRDANLKKALQENFSK

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii] | 2.1e-36 | 36.08
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIRQR
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E     +  +L   +  V   GA+W +S  G  T   + L   A  W  F++ R
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIRQR

Query:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL---------QRTQE
        +LPTTH   VS++R+LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A   V+E    L + G ID   +AR+         Q+   
Subjt:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL---------QRTQE

Query:  ARQA----------------------SRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDQE
        +R A                      S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  ARQA----------------------SRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDQE

TrEMBL top hits | e value | % identity | Alignment
A0A2P5AGA5 Uncharacterized protein (Fragment) | 1.0e-28 | 36.8
Query:  KEAEERAREETE-KKAKEERLLKRR---AEKGKSVAEASEEPDEIE-----EHDHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSA
        ++A +  + ETE  + + E  ++ R   AEKG  V + SE   ++         H W+ FCA PE     +VREFYAN+       V VRGV+V WS  A
Subjt:  KEAEERAREETE-KKAKEERLLKRR---AEKGKSVAEASEEPDEIE-----EHDHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSA

Query:  INALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVG
        INA++ L + P   ++E     +   L   +  V + GA+W +S  G  T   + L   A  W  F++  +LPTTH  TVS++R+LL  ++L   SI+VG
Subjt:  INALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVG

Query:  KMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRTQEARQASRQE
        +MI +EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+  TQE    S Q+
Subjt:  KMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRTQEARQASRQE

A0A2P5BCG4 Uncharacterized protein (Fragment) | 1.7e-36 | 32.73
Query:  SERENAERVECDKKEAEERAREETEKKA-KEERLLKRR---AEKGKSVAEASEEPDEIE-----EHDHGWELFCAKPESVNAQVVREFYANIDKEDGFQV
        S   N   ++   ++A +  + ETE  A + E  ++ R   AEKG  V + SE   ++         H W+ FCA PE     +VREFYAN+   +   V
Subjt:  SERENAERVECDKKEAEERAREETEKKA-KEERLLKRR---AEKGKSVAEASEEPDEIE-----EHDHGWELFCAKPESVNAQVVREFYANIDKEDGFQV

Query:  IVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLL
         VRGV+V WS  AINA++ L + P   ++E     + + L   +  V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS++R+LL
Subjt:  IVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLL

Query:  AFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL-------------------------------
          ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+                               
Subjt:  AFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL-------------------------------

Query:  -----------QRTQEARQASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDQE
                   Q  Q+    S  +   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  -----------QRTQEARQASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDQE

A0A2P5BNT0 Uncharacterized protein (Fragment) | 1.6e-26 | 35.06
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNF--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIR
        +VREFY N+   D   V VRGV+V  S  AIN +Y L +    H+ + E    P   +L+  +  V I GA+W +S  G  T   + L   A  W  F++
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNF--PHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIR

Query:  QRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRTQEARQASRQ
         R+LPTTH   VS+ERVLL +++L   SI++G+MI  EI  C  +K G LFFP+ I  +CR A  P    +  L + G ID   +AR+ +   A + S Q
Subjt:  QRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARLQRTQEARQASRQ

Query:  EFAERQALTFWNYVRNRDANLKKALQENFSK
          + R      +   +      K+L+++ S+
Subjt:  EFAERQALTFWNYVRNRDANLKKALQENFSK

A0A2P5DXM3 Uncharacterized protein | 1.0e-36 | 36.08
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIRQR
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E     +  +L   +  V   GA+W +S  G  T   + L   A  W  F++ R
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIRQR

Query:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL---------QRTQE
        +LPTTH   VS++R+LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A   V+E    L + G ID   +AR+         Q+   
Subjt:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL---------QRTQE

Query:  ARQA----------------------SRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDQE
        +R A                      S+QE   +Q   FW Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  ARQA----------------------SRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDQE

W9QTD9 Uncharacterized protein | 6.4e-31 | 36.71
Query:  HDHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQ
        H HGW  FC  P +    +VREFYAN+   +   V V+ V+V ++  AIN+++ L+      Y + A   ++EQL   + EV IEGA WQ+S  G  T  
Subjt:  HDHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDAVREVGIEGAQWQLSKTGKKTFQ

Query:  SAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTP
           LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +S+++ ++ + EI  C   +K G L+FP+ IT L  +A VP  + + I+ + G I T 
Subjt:  SAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTP

Query:  NLARLQR
        +++R+ +
Subjt:  NLARLQR

SwissProt top hits | e value | % identity | Alignment
No hits found

Arabidopsis top hits | e value | % identity | Alignment
No hits found

Sequences
CDS sequence
ATGGCTGACGTGCCGCATCTACCACAGGGTCCTGAAGGTTTAGCGAACCCCCAGCAGAATCGTGTGCTGCAGCAAAACCCGCCGCTGGAGCAAAATGAGCAGCAAAATAA
TCAGGCTGAGAATCCTATCTTGCAGCCGCCAGTTGTGGAGCCTGCTGCAGTGGTGAACCAAGTTGCAGAGGAAGCTTGTGTCTATTGTGATAATAATTGTGATAGAAATG
TTGTTGTTGTTGAGAAAGAGTTGGAGTCTGGTCAGGGTGTCGGAGGCAGCAATAAAGATGCTGGAGCATCTGGAATGATCCAACGGTCTGGCTATTTGACAACGCATAAT
TATTTACCATGGGCATTTGAATTCCAGATTTTACCATTGGATGGTTTAATTCTACCGGTTGCATGCGATTTAAAGCGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGA
GGTACCTGTTACCCCTGAGGTGCAGAAGGTAAAGGCGAAGAAGAAAAAAACACCAGAGGAGAAAGAAGCCAAGAGACGAAGACGACAACAGAGGGTTGAGGAACAAGAAA
AGGCAACAGAAGTTGTTACTGCCACAGTAGAAGAAGGAGACCCGCAAGAACTTGATGTACAGAACCCAAAGGAGGCTGAGCAGAGAGTCGCGGATACAGAAATAGTTCAA
GAGGAGAAAACAGAGGAAGTTCGAGAGGAAAATACCGAGGAAGTTCGAGAAGAAATTTCAGAAGAAGTTCAAGAAAAGCAGGCCGAGGATGTGCAAGAGCAACAGGCAGA
AGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGGGGCTCGAGTGGAGGTGATCATGCTAGAAGCACCGAAACGTCGCCGCATTAAGAAGAAAGCAGGCCGCGTTAGGG
TTGTCCGAACTGATACTCCTTCGCCTCCAACCACGGATTCTGAAAGAGAGAATGCAGAGAGAGTAGAGTGTGATAAGAAGGAAGCCGAGGAAAGAGCAAGAGAAGAGACA
GAGAAAAAGGCTAAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAGTGTTGCTGAAGCATCGGAGGAACCTGATGAAATAGAAGAACATGACCATGGCTG
GGAGTTGTTTTGTGCGAAGCCTGAGTCTGTAAATGCACAGGTGGTGCGTGAATTTTATGCTAATATTGATAAAGAAGATGGTTTCCAAGTGATTGTTCGAGGAGTCGAGG
TAGACTGGAGTCCTAGTGCTATCAATGCACTGTATAACCTTCAGAATTTCCCCCATGCAGCTTATAATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCT
GTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACAGGGAAAAAGACATTTCAGTCAGCTTATCTGAAGAGGGAAGCGAACACGTGGATGGGATTTAT
CAGACAGAGGATGCTTCCAACGACTCATGACTCGACAGTCTCGAGGGAACGGGTTCTCTTGGCTTTCGCGATTTTGCGGTCTCTTAGCATTGATGTAGGGAAGATGATTG
TTAATGAAATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAACTGTTCTTTCCGAACACAATCACGATGCTTTGCAGAAGAGCAGGGGTTCCAGTGGATGAGGGAGATGTT
ATCCTGTTTGACAAGGGAATTATAGATACGCCTAACTTGGCGCGGCTTCAGCGTACGCAGGAGGCACGTCAAGCCAGCAGGCAAGAGTTTGCCGAGAGGCAAGCTTTAAC
CTTCTGGAACTATGTTAGAAATCGTGATGCCAATCTGAAGAAGGCGCTGCAAGAGAATTTTTCCAAGCCGTATCCAGCCCTTCCAGCATTCCCTGAGGATCTGTTGAACC
CTTGGATTCCACCCCCACCTGTTGAAAGAGAAGAGGAGGATGATCAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCTGCGGCAAAGAAAATT
CTGGAGGTAGTGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCACGCTTACTAAGCTGTGGCAAGTTCTTAGAATTGAGTTTAAAATGGTGATTATTTGTCC
ATGCCGGAAGAATTATTTTGCTGCAGCAGTCCTTGGTTTTGCAGAATGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCGAATTCTGTGCTGCAGCAAAACTGGGAGC
AAAACTGCCACGTCACAGCTCGTTAG
mRNA sequence
ATGGCTGACGTGCCGCATCTACCACAGGGTCCTGAAGGTTTAGCGAACCCCCAGCAGAATCGTGTGCTGCAGCAAAACCCGCCGCTGGAGCAAAATGAGCAGCAAAATAA
TCAGGCTGAGAATCCTATCTTGCAGCCGCCAGTTGTGGAGCCTGCTGCAGTGGTGAACCAAGTTGCAGAGGAAGCTTGTGTCTATTGTGATAATAATTGTGATAGAAATG
TTGTTGTTGTTGAGAAAGAGTTGGAGTCTGGTCAGGGTGTCGGAGGCAGCAATAAAGATGCTGGAGCATCTGGAATGATCCAACGGTCTGGCTATTTGACAACGCATAAT
TATTTACCATGGGCATTTGAATTCCAGATTTTACCATTGGATGGTTTAATTCTACCGGTTGCATGCGATTTAAAGCGAGCGAGGAAAGAAAGAGAGAATGAGGAGGAAGA
GGTACCTGTTACCCCTGAGGTGCAGAAGGTAAAGGCGAAGAAGAAAAAAACACCAGAGGAGAAAGAAGCCAAGAGACGAAGACGACAACAGAGGGTTGAGGAACAAGAAA
AGGCAACAGAAGTTGTTACTGCCACAGTAGAAGAAGGAGACCCGCAAGAACTTGATGTACAGAACCCAAAGGAGGCTGAGCAGAGAGTCGCGGATACAGAAATAGTTCAA
GAGGAGAAAACAGAGGAAGTTCGAGAGGAAAATACCGAGGAAGTTCGAGAAGAAATTTCAGAAGAAGTTCAAGAAAAGCAGGCCGAGGATGTGCAAGAGCAACAGGCAGA
AGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGGGGCTCGAGTGGAGGTGATCATGCTAGAAGCACCGAAACGTCGCCGCATTAAGAAGAAAGCAGGCCGCGTTAGGG
TTGTCCGAACTGATACTCCTTCGCCTCCAACCACGGATTCTGAAAGAGAGAATGCAGAGAGAGTAGAGTGTGATAAGAAGGAAGCCGAGGAAAGAGCAAGAGAAGAGACA
GAGAAAAAGGCTAAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAGTGTTGCTGAAGCATCGGAGGAACCTGATGAAATAGAAGAACATGACCATGGCTG
GGAGTTGTTTTGTGCGAAGCCTGAGTCTGTAAATGCACAGGTGGTGCGTGAATTTTATGCTAATATTGATAAAGAAGATGGTTTCCAAGTGATTGTTCGAGGAGTCGAGG
TAGACTGGAGTCCTAGTGCTATCAATGCACTGTATAACCTTCAGAATTTCCCCCATGCAGCTTATAATGAGATGGCTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCT
GTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACAGGGAAAAAGACATTTCAGTCAGCTTATCTGAAGAGGGAAGCGAACACGTGGATGGGATTTAT
CAGACAGAGGATGCTTCCAACGACTCATGACTCGACAGTCTCGAGGGAACGGGTTCTCTTGGCTTTCGCGATTTTGCGGTCTCTTAGCATTGATGTAGGGAAGATGATTG
TTAATGAAATTTCTGGTTGTTGGAAGAAGAAGGTGGGGAAACTGTTCTTTCCGAACACAATCACGATGCTTTGCAGAAGAGCAGGGGTTCCAGTGGATGAGGGAGATGTT
ATCCTGTTTGACAAGGGAATTATAGATACGCCTAACTTGGCGCGGCTTCAGCGTACGCAGGAGGCACGTCAAGCCAGCAGGCAAGAGTTTGCCGAGAGGCAAGCTTTAAC
CTTCTGGAACTATGTTAGAAATCGTGATGCCAATCTGAAGAAGGCGCTGCAAGAGAATTTTTCCAAGCCGTATCCAGCCCTTCCAGCATTCCCTGAGGATCTGTTGAACC
CTTGGATTCCACCCCCACCTGTTGAAAGAGAAGAGGAGGATGATCAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCTGCGGCAAAGAAAATT
CTGGAGGTAGTGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCACGCTTACTAAGCTGTGGCAAGTTCTTAGAATTGAGTTTAAAATGGTGATTATTTGTCC
ATGCCGGAAGAATTATTTTGCTGCAGCAGTCCTTGGTTTTGCAGAATGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCGAATTCTGTGCTGCAGCAAAACTGGGAGC
AAAACTGCCACGTCACAGCTCGTTAG
Protein sequence
MADVPHLPQGPEGLANPQQNRVLQQNPPLEQNEQQNNQAENPILQPPVVEPAAVVNQVAEEACVYCDNNCDRNVVVVEKELESGQGVGGSNKDAGASGMIQRSGYLTTHN
YLPWAFEFQILPLDGLILPVACDLKRARKERENEEEEVPVTPEVQKVKAKKKKTPEEKEAKRRRRQQRVEEQEKATEVVTATVEEGDPQELDVQNPKEAEQRVADTEIVQ
EEKTEEVREENTEEVREEISEEVQEKQAEDVQEQQAEDVQVTDNEPVQGARVEVIMLEAPKRRRIKKKAGRVRVVRTDTPSPPTTDSERENAERVECDKKEAEERAREET
EKKAKEERLLKRRAEKGKSVAEASEEPDEIEEHDHGWELFCAKPESVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVAPSNEQLSDA
VREVGIEGAQWQLSKTGKKTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDV
ILFDKGIIDTPNLARLQRTQEARQASRQEFAERQALTFWNYVRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDQEQETFCLSIFSGLVVAAAKKI
LEVVLTYVIRFKLRSSPTLTKLWQVLRIEFKMVIICPCRKNYFAAAVLGFAECSESVAGRLEGANSVLQQNWEQNCHVTAR
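
The CDS and protein sequences above can be cross-checked by translating one into the other. The sketch below is a minimal illustration, assuming the standard nuclear genetic code (NCBI translation table 1); the page does not say which table was used, and `translate` is a hypothetical helper rather than a CuGenDB tool. Codons containing characters other than A, C, G, T are reported as `X`.

```python
# Standard genetic code (NCBI translation table 1), with bases ordered T, C, A, G.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 1; '*' marks a stop codon."""
    # Remove line breaks and other whitespace so wrapped sequences can be pasted in.
    cds = "".join(cds.split()).upper()
    if len(cds) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return "".join(
        CODON_TABLE.get(cds[pos:pos + 3], "X") for pos in range(0, len(cds), 3)
    )


# First 30 and last 18 nucleotides of the CDS listed above.
print(translate("ATGGCTGACGTGCCGCATCTACCACAGGGT"))
print(translate("CACGTCACAGCTCGTTAG"))
```

The two fragments translate to `MADVPHLPQG` and `HVTAR*`, matching the start and end of the listed protein sequence. Passing the full CDS through `translate` and stripping the trailing `*` gives a complete comparison.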