CuGenDBv2

Spg020162 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg020162
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold1:28419158..28434649
RNA-Seq Expression: Spg020162
Synteny: Spg020162
Gene Ontology terms: NA
InterPro domains: NA
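
The genome location field above follows a `scaffold:start..end` pattern. As a minimal sketch, the helper below splits such a string and reports the span length. The function names `parse_location` and `span_length` are hypothetical, and the span calculation assumes both coordinates are inclusive, which this record does not state.

```python
import re


def parse_location(location: str) -> tuple[str, int, int]:
    """Split a 'scaffold:start..end' string into (scaffold, start, end)."""
    match = re.fullmatch(r"([^:]+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    scaffold, start, end = match.groups()
    return scaffold, int(start), int(end)


def span_length(start: int, end: int) -> int:
    """Length of the interval, ASSUMING both coordinates are inclusive."""
    return end - start + 1


scaffold, start, end = parse_location("scaffold1:28419158..28434649")
print(scaffold, start, end, span_length(start, end))
# prints: scaffold1 28419158 28434649 15492
```

If the coordinates turn out to be half-open rather than inclusive, the `+ 1` in `span_length` should be dropped.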


Homology
GenBank top hits (e value, % identity, alignment)
EXB53755.1 hypothetical protein L484_022412 [Morus notabilis]; e value 8.7e-32; identity 36.62%
Query:  PHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQWQLS
        P F+   I  HGW +FC  P      +VREFYAN+   +   V V+ V+V ++  AIN+++ L+      Y + A  +++EQL   + EV IEGA WQ+S
Subjt:  PHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQWQLS

Query:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILF
          G  T     LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +S+++ ++ + EI  C   +K G L+FP+ IT L  +A VP  + + I+ 
Subjt:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILF

Query:  DKGIIDTPNLARL
        + G I T +++R+
Subjt:  DKGIIDTPNLARL

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]; e value 3.2e-34; identity 40%
Query:  GNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQW
        G LP F+   I  H W++FC+ PE     +VREFYAN+       V VRGV+V WS  AINA++ L + P   ++E    ++   L   +  V + GA+W
Subjt:  GNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQW

Query:  QLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVI
         +S  G  T   + L   A  W  F++  +LPTTH  TVS++R+LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A  P    +  
Subjt:  QLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVI

Query:  LFDKGIIDTPNLARLQCTQE
        L + G ID   +AR+  TQE
Subjt:  LFDKGIIDTPNLARLQCTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]; e value 1.9e-39; identity 33.53%
Query:  GNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQW
        G LP F+   I  H W++FC+ PE     +VREFYAN+   +   V VRGV+V WS  AINA++ L + P   ++E    ++ + L   +  V   GA+W
Subjt:  GNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQW

Query:  QLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVI
         +S  G  T   + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A  P    +  
Subjt:  QLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVI

Query:  LFDKGIIDTPNLARL------QCTQE------------ARQGGLIYDINT------------------------------NYCRNRDANLKKALQENFSK
        L + G ID   +AR+      + TQ+               G ++  +                                 Y + RD  LKKALQ NF++
Subjt:  LFDKGIIDTPNLARL------QCTQE------------ARQGGLIYDINT------------------------------NYCRNRDANLKKALQENFSK

Query:  PYPALPAFPEDLLNPWIPPPPVEREEEDDEE
        P P  PAFP+++L         E E E D++
Subjt:  PYPALPAFPEDLLNPWIPPPPVEREEEDDEE

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii]; e value 9.6e-31; identity 36.59%
Query:  LKRDFLFERGFSGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSD
        ++++F+++       P F+   I  H W+ FC+ PE     +VREFY N+   D   V +RGV+V  S  AIN +++L + P   ++E    ++  +L  
Subjt:  LKRDFLFERGFSGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSD

Query:  AVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR
         +  V I GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS+E V L +++L   SI+VG+MI  EI  C  +K G LFFP+ IT +CR
Subjt:  AVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR

Query:  RAGVP
            P
Subjt:  RAGVP

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]; e value 1.7e-32; identity 34.02%
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQR
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E    ++  +L   +  V   GA+W +S  G  T   + L   A  W  F++ R
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQR

Query:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL------QCTQE---
        +LPTTH   VS++R+LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A   V+E    L + G ID   +AR+      + TQ+   
Subjt:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL------QCTQE---

Query:  ---------ARQGGLIYDINT-------------------NYCRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
                    G ++  +                      Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  ---------ARQGGLIYDINT-------------------NYCRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

TrEMBL top hits (e value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment); e value 1.5e-34; identity 40%
Query:  GNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQW
        G LP F+   I  H W++FC+ PE     +VREFYAN+       V VRGV+V WS  AINA++ L + P   ++E    ++   L   +  V + GA+W
Subjt:  GNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQW

Query:  QLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVI
         +S  G  T   + L   A  W  F++  +LPTTH  TVS++R+LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A  P    +  
Subjt:  QLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVI

Query:  LFDKGIIDTPNLARLQCTQE
        L + G ID   +AR+  TQE
Subjt:  LFDKGIIDTPNLARLQCTQE

A0A2P5BCG4 Uncharacterized protein (Fragment); e value 9.3e-40; identity 33.53%
Query:  GNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQW
        G LP F+   I  H W++FC+ PE     +VREFYAN+   +   V VRGV+V WS  AINA++ L + P   ++E    ++ + L   +  V   GA+W
Subjt:  GNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQW

Query:  QLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVI
         +S  G  T   + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A  P    +  
Subjt:  QLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVI

Query:  LFDKGIIDTPNLARL------QCTQE------------ARQGGLIYDINT------------------------------NYCRNRDANLKKALQENFSK
        L + G ID   +AR+      + TQ+               G ++  +                                 Y + RD  LKKALQ NF++
Subjt:  LFDKGIIDTPNLARL------QCTQE------------ARQGGLIYDINT------------------------------NYCRNRDANLKKALQENFSK

Query:  PYPALPAFPEDLLNPWIPPPPVEREEEDDEE
        P P  PAFP+++L         E E E D++
Subjt:  PYPALPAFPEDLLNPWIPPPPVEREEEDDEE

A0A2P5DAQ2 Uncharacterized protein; e value 4.6e-31; identity 36.59%
Query:  LKRDFLFERGFSGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSD
        ++++F+++       P F+   I  H W+ FC+ PE     +VREFY N+   D   V +RGV+V  S  AIN +++L + P   ++E    ++  +L  
Subjt:  LKRDFLFERGFSGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSD

Query:  AVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR
         +  V I GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS+E V L +++L   SI+VG+MI  EI  C  +K G LFFP+ IT +CR
Subjt:  AVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR

Query:  RAGVP
            P
Subjt:  RAGVP

A0A2P5DXM3 Uncharacterized protein; e value 8.4e-33; identity 34.02%
Query:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQR
        +VREFYAN+   +   + VRGV+V WS  AINA++ L + P   ++E    ++  +L   +  V   GA+W +S  G  T   + L   A  W  F++ R
Subjt:  VVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQR

Query:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL------QCTQE---
        +LPTTH   VS++R+LL  ++L   SI+VG+MI +EI  C  +K G LFFP+ IT LCR A   V+E    L + G ID   +AR+      + TQ+   
Subjt:  MLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL------QCTQE---

Query:  ---------ARQGGLIYDINT-------------------NYCRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE
                    G ++  +                      Y + RD  LKKALQ NF++P P  PAFP+++L         E E E D++
Subjt:  ---------ARQGGLIYDINT-------------------NYCRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEE

W9QTD9 Uncharacterized protein; e value 4.2e-32; identity 36.62%
Query:  PHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQWQLS
        P F+   I  HGW +FC  P      +VREFYAN+   +   V V+ V+V ++  AIN+++ L+      Y + A  +++EQL   + EV IEGA WQ+S
Subjt:  PHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMAVALSNEQLSDAVREVGIEGAQWQLS

Query:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILF
          G  T     LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +S+++ ++ + EI  C   +K G L+FP+ IT L  +A VP  + + I+ 
Subjt:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILF

Query:  DKGIIDTPNLARL
        + G I T +++R+
Subjt:  DKGIIDTPNLARL

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGGGCCCTACCCTCTTTATGGCACGAGAGGGATTTCTGTTTGTTGGTTGGACCTCAAACAGGTTGTTCATTAGAGGAGCACTGGTACTTAAGGACCAAGAGCGAGATAT
CGTTCCTTTTGCTGCTGTTTTTCAGAGGATCTCGGGGAAAAACTTGGTGAAACGATCAAGAAACTCTTCAAGGGTAAGCGTCGGGACGCTGTTCCTTAAGCGTCTCGACG
CTGTCACGATTTCATGCGGCAGTTCGTTTTTACGTGGAGACGTGGGAGAAACCCTAACTGCTGGTTCGGTTCGCGTGAATGGGTTGAGCTGGGACCGATTTGGTCCGGTT
CAGCCCATTTTTGGTTCGGTTCGGGTTCTTCAAGGTCGGTTCGAAGCGGTTCGGGTCGAATCTGTTGCTGGGCGACTTGAGGGAGCAAAATCTGCGCTGCAGCAAAGCTG
GGAACAAAACTGCCACGTCACAACTCGTTATCCAAATTGCCAAACTGAATTCTGTGTGAATTTGGTGCATGAACGATCCGCCTGGGGTGAGGTTCAGAAAGTTGTTGCGG
CAAAGTTATGGCTGAAGCGAATCGTCCGAGTAGATTGCATGGATGCTGCCACGTGTCATGCAGGAATGATCCAATGGTCTGGCTATTTGACAACGCATAGTTATTTACCA
TGGGCGTTTGAAATTCCAGATTTTACCGTTGGATGGTTTAATTCTACCGGTTGCATGCGATTTAAAACGAGGGCTAAGGAACAAGAAAAGGCAACAGAAGTTGTTGCTGC
AACAGTAGAAGAAGGAGACCCGCAAGAACCTGATGTACAGAACCCAGAGGAAGCTGAGCAGAGAGTCGCGGATACAGAAAAAGTTCAAGAGGAGCAAACAGAGGAAGTTC
GAGAGGAAAATACAGAGGAAGTTCGAGAGGACAATACAGAGGAAGTTCGAGAGGAAAATACCGAGGAAGTTCGAGAAGAAATTTCAGAAGAAGTTCAAGAAGAGCAGGCC
GAGGTTGTGCAAGAGCAACAGGCAGAAGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGAGGCTCGAGTGGAGGTGATCATGCCAGAAGCACCGAAACGTCGCCGCGT
AAAGAGGAAAGCAGGCCGCGTTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCGACCACAGATTCTGAAAGAGAGAATGCGGAGAGAGTAGAGCGTGAGAAGAAGGAAG
CCGAGGAAAGAGCAAGAGAAGAGGAAGAGAAAAAGCCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAGTGTTGCTGAAGCATCGGAGGAACCTGAT
GAAATAGAAGAACATGGTTGCTTCATCAACAATTTTGCCAGAGCAAAATACGCTGAGCTGCTGAAAAGAGACTTCCTGTTTGAAAGAGGATTTAGCGGTAATCTTCCACA
TTTTCTGAGGACCAGCATTGCAGACCACGGCTGGGAGCGGTTTTGTTCAAAGCCTGAGGCTGTAAACGCACAGGTGGTGCGTGAATTTTATGCTAATATTGACAAGGAAG
ATGGTTTCCAGGTGATTGTTCGAGGAGTCGAGGTAGACTGGAGTCCTAGTGCTATCAATGCACTGTATAACCTTCAGAATTTCCCACATGCAGCTTATAATGAGATGGCT
GTAGCGCTATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACAGGGAAAAGGACATTTCAGTCCGCTTATCT
GAAGAGGGAAGCGAACACGTGGATGGGATTTATAAGACAGCGGATGCTTCCAACGACTCATGACTCGACGGTCTCGAGGGAACGGGTTCTTCTGGCTTTCGCGATTTTGC
GGTCTCTTAGCATTGATGTAGGGAAGATGATTGTTAATGAGATTTCTGGTTGTTGGAAAAAGAAAGTGGGGAAACTGTTCTTTCCGAACACAATCACGATGCTTTGCAGA
AGAGCAGGGGTTCCAGTGGATGAGGGAGATGTTATCCTGTTTGACAAGGGAATTATAGATACGCCTAACTTGGCGCGGCTTCAGTGTACGCAGGAGGCACGTCAAGGTGG
GCTTATCTACGACATCAACACGAACTATTGTAGAAATCGTGATGCCAATCTGAAGAAGGCGCTGCAAGAGAATTTTTCCAAGCCATATCCAGCCCTTCCAGCATTCCCTG
AGGATCTGTTGAACCCTTGGATTCCACCCCCACCTGTTGAAAGAGAAGAGGAGGATGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCT
GCGGCAAAGAAAATTCTGGAGGTAGTGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCACGCTTAGTGATGAGTTTGAGGCAAGGGTATACTGCACCATAAA
GTGGGTTATCCCATGCTTAAGGGCTTATGACTCTAAGTTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTATTTTGCTGCAG
CAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAAATCTGCGCTGCAGCAAAGCTGGGAACAAAACTGCCACGTCACAGCTCGTTAT
CCAAATTGCCAAACTGAATTCTTGACTGAGCTAGGGATAGCCAGTTGCCTAGTGCTTGATCGGTCGTACTTCATTGGGGTTGAAACACATTCCAAAAGATTCGGAGTTCC
TGGAGGAATCCGCTAA
mRNA sequence
ATGGGCCCTACCCTCTTTATGGCACGAGAGGGATTTCTGTTTGTTGGTTGGACCTCAAACAGGTTGTTCATTAGAGGAGCACTGGTACTTAAGGACCAAGAGCGAGATAT
CGTTCCTTTTGCTGCTGTTTTTCAGAGGATCTCGGGGAAAAACTTGGTGAAACGATCAAGAAACTCTTCAAGGGTAAGCGTCGGGACGCTGTTCCTTAAGCGTCTCGACG
CTGTCACGATTTCATGCGGCAGTTCGTTTTTACGTGGAGACGTGGGAGAAACCCTAACTGCTGGTTCGGTTCGCGTGAATGGGTTGAGCTGGGACCGATTTGGTCCGGTT
CAGCCCATTTTTGGTTCGGTTCGGGTTCTTCAAGGTCGGTTCGAAGCGGTTCGGGTCGAATCTGTTGCTGGGCGACTTGAGGGAGCAAAATCTGCGCTGCAGCAAAGCTG
GGAACAAAACTGCCACGTCACAACTCGTTATCCAAATTGCCAAACTGAATTCTGTGTGAATTTGGTGCATGAACGATCCGCCTGGGGTGAGGTTCAGAAAGTTGTTGCGG
CAAAGTTATGGCTGAAGCGAATCGTCCGAGTAGATTGCATGGATGCTGCCACGTGTCATGCAGGAATGATCCAATGGTCTGGCTATTTGACAACGCATAGTTATTTACCA
TGGGCGTTTGAAATTCCAGATTTTACCGTTGGATGGTTTAATTCTACCGGTTGCATGCGATTTAAAACGAGGGCTAAGGAACAAGAAAAGGCAACAGAAGTTGTTGCTGC
AACAGTAGAAGAAGGAGACCCGCAAGAACCTGATGTACAGAACCCAGAGGAAGCTGAGCAGAGAGTCGCGGATACAGAAAAAGTTCAAGAGGAGCAAACAGAGGAAGTTC
GAGAGGAAAATACAGAGGAAGTTCGAGAGGACAATACAGAGGAAGTTCGAGAGGAAAATACCGAGGAAGTTCGAGAAGAAATTTCAGAAGAAGTTCAAGAAGAGCAGGCC
GAGGTTGTGCAAGAGCAACAGGCAGAAGATGTTCAGGTAACGGATAATGAGCCAGTGCAGGAGGCTCGAGTGGAGGTGATCATGCCAGAAGCACCGAAACGTCGCCGCGT
AAAGAGGAAAGCAGGCCGCGTTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCGACCACAGATTCTGAAAGAGAGAATGCGGAGAGAGTAGAGCGTGAGAAGAAGGAAG
CCGAGGAAAGAGCAAGAGAAGAGGAAGAGAAAAAGCCTGAGGAAGAGCGGTTGCTCAAGCGAAGGGCGGAAAAGGGCAAAAGTGTTGCTGAAGCATCGGAGGAACCTGAT
GAAATAGAAGAACATGGTTGCTTCATCAACAATTTTGCCAGAGCAAAATACGCTGAGCTGCTGAAAAGAGACTTCCTGTTTGAAAGAGGATTTAGCGGTAATCTTCCACA
TTTTCTGAGGACCAGCATTGCAGACCACGGCTGGGAGCGGTTTTGTTCAAAGCCTGAGGCTGTAAACGCACAGGTGGTGCGTGAATTTTATGCTAATATTGACAAGGAAG
ATGGTTTCCAGGTGATTGTTCGAGGAGTCGAGGTAGACTGGAGTCCTAGTGCTATCAATGCACTGTATAACCTTCAGAATTTCCCACATGCAGCTTATAATGAGATGGCT
GTAGCGCTATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACAGGGAAAAGGACATTTCAGTCCGCTTATCT
GAAGAGGGAAGCGAACACGTGGATGGGATTTATAAGACAGCGGATGCTTCCAACGACTCATGACTCGACGGTCTCGAGGGAACGGGTTCTTCTGGCTTTCGCGATTTTGC
GGTCTCTTAGCATTGATGTAGGGAAGATGATTGTTAATGAGATTTCTGGTTGTTGGAAAAAGAAAGTGGGGAAACTGTTCTTTCCGAACACAATCACGATGCTTTGCAGA
AGAGCAGGGGTTCCAGTGGATGAGGGAGATGTTATCCTGTTTGACAAGGGAATTATAGATACGCCTAACTTGGCGCGGCTTCAGTGTACGCAGGAGGCACGTCAAGGTGG
GCTTATCTACGACATCAACACGAACTATTGTAGAAATCGTGATGCCAATCTGAAGAAGGCGCTGCAAGAGAATTTTTCCAAGCCATATCCAGCCCTTCCAGCATTCCCTG
AGGATCTGTTGAACCCTTGGATTCCACCCCCACCTGTTGAAAGAGAAGAGGAGGATGATGAAGAGCAGGAAACCTTTTGCTTGAGCATTTTCTCTGGCCTGGTCGTTGCT
GCGGCAAAGAAAATTCTGGAGGTAGTGTTGACTTATGTGATCCGCTTTAAGCTTAGGTCTAGTCCCACGCTTAGTGATGAGTTTGAGGCAAGGGTATACTGCACCATAAA
GTGGGTTATCCCATGCTTAAGGGCTTATGACTCTAAGTTGTGGCAAGTTCTTAGAATTGAGTTAAAAGTGGTGATTATTTGTCCATGCCGGAAGAATTATTTTGCTGCAG
CAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAAATCTGCGCTGCAGCAAAGCTGGGAACAAAACTGCCACGTCACAGCTCGTTAT
CCAAATTGCCAAACTGAATTCTTGACTGAGCTAGGGATAGCCAGTTGCCTAGTGCTTGATCGGTCGTACTTCATTGGGGTTGAAACACATTCCAAAAGATTCGGAGTTCC
TGGAGGAATCCGCTAA
Protein sequence
MGPTLFMAREGFLFVGWTSNRLFIRGALVLKDQERDIVPFAAVFQRISGKNLVKRSRNSSRVSVGTLFLKRLDAVTISCGSSFLRGDVGETLTAGSVRVNGLSWDRFGPV
QPIFGSVRVLQGRFEAVRVESVAGRLEGAKSALQQSWEQNCHVTTRYPNCQTEFCVNLVHERSAWGEVQKVVAAKLWLKRIVRVDCMDAATCHAGMIQWSGYLTTHSYLP
WAFEIPDFTVGWFNSTGCMRFKTRAKEQEKATEVVAATVEEGDPQEPDVQNPEEAEQRVADTEKVQEEQTEEVREENTEEVREDNTEEVREENTEEVREEISEEVQEEQA
EVVQEQQAEDVQVTDNEPVQEARVEVIMPEAPKRRRVKRKAGRVRVVRTDTPSPPTTDSERENAERVEREKKEAEERAREEEEKKPEEERLLKRRAEKGKSVAEASEEPD
EIEEHGCFINNFARAKYAELLKRDFLFERGFSGNLPHFLRTSIADHGWERFCSKPEAVNAQVVREFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMA
VALSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISGCWKKKVGKLFFPNTITMLCR
RAGVPVDEGDVILFDKGIIDTPNLARLQCTQEARQGGLIYDINTNYCRNRDANLKKALQENFSKPYPALPAFPEDLLNPWIPPPPVEREEEDDEEQETFCLSIFSGLVVA
AAKKILEVVLTYVIRFKLRSSPTLSDEFEARVYCTIKWVIPCLRAYDSKLWQVLRIELKVVIICPCRKNYFAAAELGFAECSESVAGRLEGAKSALQQSWEQNCHVTARY
PNCQTEFLTELGIASCLVLDRSYFIGVETHSKRFGVPGGIR
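
The CDS and protein sequences above can be cross-checked by translating the CDS codon by codon. The following is a minimal sketch, assuming the standard genetic code applies to this nuclear gene (the record does not name a translation table). The names `CODON_TABLE` and `translate` are hypothetical helpers, not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
# (TTT, TTC, TTA, TTG, TCT, ...). '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE[cds[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five codons and last four codons of the CDS listed above.
print(translate("ATGGGCCCTACCCTC"))  # prints: MGPTL
print(translate("GGAATCCGCTAA"))     # prints: GIR
```

Passing the full CDS text, line breaks included, to `translate` and comparing the result with the protein sequence gives a quick consistency check. The sketch raises `KeyError` on ambiguous bases such as `N`.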