CuGenDBv2

Spg015439 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg015439
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: Unknown protein
Genome location: scaffold10:13890400..13896539
RNA-Seq Expression: Spg015439
Synteny: Spg015439
Gene Ontology terms: NA
InterPro domains: NA


Homology
GenBank top hits (hit, e-value, % identity, alignment)
EXB49850.1 hypothetical protein L484_000844 [Morus notabilis] (e-value: 8.2e-28; identity: 27.72%)
Query:  SVAEASEEPDEIEKHGRFINNFARAKYAE-LLKRDFLFERGF------SGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGV
        + A+ S    ++  + +F++N A  +Y E +  R+ + E+GF      +   P F+   I   GW++F   P      +V EFYAN+  +    V V  +
Subjt:  SVAEASEEPDEIEKHGRFINNFARAKYAE-LLKRDFLFERGF------SGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGV

Query:  EVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAIL
        ++ ++ + IN +  + N     + E++     EQL + ++ + I GAQW LS  G  T     L+  A  W  F+  R+L +TH  T+SR R +L +A+L
Subjt:  EVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAIL

Query:  RSLSIDVGKMIVNEISSCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL----QRTLEARQAELGFAECSESVAGRLEGANSVLQ
            I+VG++I+++I +C +K  G L+FP+ I+ LC ++ V  +  +  L + G +D   + R+        E  + E    E S       E A++   
Subjt:  RSLSIDVGKMIVNEISSCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL----QRTLEARQAELGFAECSESVAGRLEGANSVLQ

Query:  QNW
        Q W
Subjt:  QNW

EXB53755.1 hypothetical protein L484_022412 [Morus notabilis] (e-value: 1.3e-28; identity: 34.67%)
Query:  PHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLS
        P F+   I  HGW  F   P      +V EFYAN+   +   V V+ V+V ++  AIN+++ L+      Y +     ++EQL   + EV IEGA WQ+S
Subjt:  PHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLS

Query:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISSC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILF
          G  T     LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +S+++ ++ + EI +C   +K G L+FP+ IT L  +A VP  + + I+ 
Subjt:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISSC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILF

Query:  DKGIIDTPNLARLQ--RTLEARQAE
        + G I T +++R+   R + A + E
Subjt:  DKGIIDTPNLARLQ--RTLEARQAE

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii] (e-value: 3.6e-31; identity: 35.53%)
Query:  LKRRAEKGKSVAEASEEPDEIEKHGRFINNFARAKYAELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKED
        +KR A K     +   E  E     R+ NN        +  R    E+GF        G LP F+   I  H W+ F A PE     +V EFYAN+    
Subjt:  LKRRAEKGKSVAEASEEPDEIEKHGRFINNFARAKYAELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKED

Query:  GFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRE
           V VRGV+V WS  AINA++ L + P   ++E +   +   L   +  V + GA+W +S  G  T   + L   A  W  F++  +LPTTH  TVS++
Subjt:  GFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRE

Query:  RVLLAFAILRSLSIDVGKMIVNEISSCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL
        R+LL  ++L   SI+VG+MI +EI +C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  RVLLAFAILRSLSIDVGKMIVNEISSCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii] (e-value: 6.5e-33; identity: 36.95%)
Query:  RFINNFARAKYA-ELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F    A  +Y   +  R    E+GF        G LP F+   I  H W+ F A PE     +V EFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFINNFARAKYA-ELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI
         + P   ++E +   + + L   +  V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L   SI+VG+MI +EI
Subjt:  QNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI

Query:  SSCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL
         +C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  SSCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL

PON70375.1 hypothetical protein PanWU01x14_080440 [Parasponia andersonii] (e-value: 6.7e-30; identity: 37.07%)
Query:  LKRDFLFERGFSGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSD
        ++++F+++       P F+   I  H W+LF A PE     +V EFY N+   D   V +RGV+V  S  AIN +++L + P   ++E V   +  +L  
Subjt:  LKRDFLFERGFSGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSD

Query:  AVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISSCWKKKVGKLFFPNTITMLCR
         +  V I GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS+E V L +++L   SI+VG+MI  EI +C  +K G LFFP+ IT +CR
Subjt:  AVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISSCWKKKVGKLFFPNTITMLCR

Query:  RAGVP
            P
Subjt:  RAGVP

TrEMBL top hits (hit, e-value, % identity, alignment)
A0A2P5AGA5 Uncharacterized protein (Fragment) (e-value: 1.7e-31; identity: 35.53%)
Query:  LKRRAEKGKSVAEASEEPDEIEKHGRFINNFARAKYAELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKED
        +KR A K     +   E  E     R+ NN        +  R    E+GF        G LP F+   I  H W+ F A PE     +V EFYAN+    
Subjt:  LKRRAEKGKSVAEASEEPDEIEKHGRFINNFARAKYAELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKED

Query:  GFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRE
           V VRGV+V WS  AINA++ L + P   ++E +   +   L   +  V + GA+W +S  G  T   + L   A  W  F++  +LPTTH  TVS++
Subjt:  GFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRE

Query:  RVLLAFAILRSLSIDVGKMIVNEISSCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL
        R+LL  ++L   SI+VG+MI +EI +C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  RVLLAFAILRSLSIDVGKMIVNEISSCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL

A0A2P5BCG4 Uncharacterized protein (Fragment) (e-value: 3.1e-33; identity: 36.95%)
Query:  RFINNFARAKYA-ELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL
        +F    A  +Y   +  R    E+GF        G LP F+   I  H W+ F A PE     +V EFYAN+   +   V VRGV+V WS  AINA++ L
Subjt:  RFINNFARAKYA-ELLKRDFLFERGF-------SGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGVEVDWSPSAINALYNL

Query:  QNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI
         + P   ++E +   + + L   +  V   GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS++R+LL  ++L   SI+VG+MI +EI
Subjt:  QNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEI

Query:  SSCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL
         +C  +K G LFFP+ IT LCR A  P    +  L + G ID   +AR+
Subjt:  SSCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL

A0A2P5DAQ2 Uncharacterized protein (e-value: 3.3e-30; identity: 37.07%)
Query:  LKRDFLFERGFSGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSD
        ++++F+++       P F+   I  H W+LF A PE     +V EFY N+   D   V +RGV+V  S  AIN +++L + P   ++E V   +  +L  
Subjt:  LKRDFLFERGFSGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSD

Query:  AVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISSCWKKKVGKLFFPNTITMLCR
         +  V I GA+W +S  G  T   + L   A  W  F++ R+LPTTH  TVS+E V L +++L   SI+VG+MI  EI +C  +K G LFFP+ IT +CR
Subjt:  AVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISSCWKKKVGKLFFPNTITMLCR

Query:  RAGVP
            P
Subjt:  RAGVP

W9QTD9 Uncharacterized protein (e-value: 6.1e-29; identity: 34.67%)
Query:  PHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLS
        P F+   I  HGW  F   P      +V EFYAN+   +   V V+ V+V ++  AIN+++ L+      Y +     ++EQL   + EV IEGA WQ+S
Subjt:  PHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLS

Query:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISSC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILF
          G  T     LKR A  W  F+  R +P+TH  TV+++RVLL ++IL  +S+++ ++ + EI +C   +K G L+FP+ IT L  +A VP  + + I+ 
Subjt:  KTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISSC-WKKKVGKLFFPNTITMLCRRAGVPVDEGDVILF

Query:  DKGIIDTPNLARLQ--RTLEARQAE
        + G I T +++R+   R + A + E
Subjt:  DKGIIDTPNLARLQ--RTLEARQAE

W9RBS1 Uncharacterized protein (e-value: 4.0e-28; identity: 27.72%)
Query:  SVAEASEEPDEIEKHGRFINNFARAKYAE-LLKRDFLFERGF------SGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGV
        + A+ S    ++  + +F++N A  +Y E +  R+ + E+GF      +   P F+   I   GW++F   P      +V EFYAN+  +    V V  +
Subjt:  SVAEASEEPDEIEKHGRFINNFARAKYAE-LLKRDFLFERGF------SGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGV

Query:  EVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAIL
        ++ ++ + IN +  + N     + E++     EQL + ++ + I GAQW LS  G  T     L+  A  W  F+  R+L +TH  T+SR R +L +A+L
Subjt:  EVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKTGKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAIL

Query:  RSLSIDVGKMIVNEISSCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL----QRTLEARQAELGFAECSESVAGRLEGANSVLQ
            I+VG++I+++I +C +K  G L+FP+ I+ LC ++ V  +  +  L + G +D   + R+        E  + E    E S       E A++   
Subjt:  RSLSIDVGKMIVNEISSCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL----QRTLEARQAELGFAECSESVAGRLEGANSVLQ

Query:  QNW
        Q W
Subjt:  QNW

SwissProt top hits (hit, e-value, % identity, alignment)
No hits found

Arabidopsis top hits (hit, e-value, % identity, alignment)
No hits found

Sequences
CDS sequence
ATGAGCGATCCGCCTGGGGTGAGGTTTGAACTTGATCCAGAAATCGAGAGGACATTCAGGATAAGAAGGAGAGAGCAGCGAAGACAGCAAAATCAAATGGCTGACGTGCC
GCATCTACCGCAGGTAGCTCATATTAAGGCAGTGAAAACACCTTGGACTCTTCAGGTTCAGAAAGTTGTTGCGGCAAAGTTATGGCTGAAGCGAATCGTCCGAATAAGAA
GGGTCCTTCATGGCAAAAACAAGAGCGAGAAAGAAAGAGAGAATGAGGAGGAAGAGGTACCTGTTACCCCTGAGGTGCAGAAGGTAAAGGCGAAGAAGAAAAAAACACCA
GAGGAGAAAGAAGCCAAGAGACGAAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGTTGCTGCCACAGTTGAAGAAGGAGACTCGCAAGAACCTGA
TGTACAGAACCCAGAGGAGGCTAAGCAGAGAGTCGCGGATACAGGAAAAGTTCAAGAGGAGCAAACAGAGGAAGTTCGAGAGGAAAATACCGAGGAAGTTCGAGAAGAAA
TTTCAGAAGAAGTTCAAGAAAAGCAGGCCGAGGATGTGCAAGAGCAACAGGCAGAAGATGTTCAGGTAGCGGATAATGAGCCGGTGCAGGAGGCTCGAGTGGAGGTGATC
ATGCCAGAAGCACCGAAACGTCGCTGCATTAAGAGGAAAGCAGGCCGCGTTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCAACCACGGATTCTGAAAGAGAGAATGT
AGAGAGAGAAGAGCGTGAGAAGAAGGAAGCCGAAGAAAGAGCAAGAGAAGAGGCAGAGAAAAAGGCTGAGGAAGAGCGGGTTCTCAAGCGAAGGGCGGAAAAGGGCAAAA
GTGTTGCTGAAGCATCGGAGGAACCTGATGAAATAGAAAAACATGGTCGCTTCATCAACAATTTTGCCAGAGCAAAATACGCTGAGCTGCTGAAAAGAGACTTCCTGTTT
GAAAGAGGATTTAGCGGTAATCTTCCACATTTTCTGAGGACCGGCATTGCAGACCATGGCTGGGAGTTGTTTTATGCGAAGCCTGAGCCTGTAAACGCACAGGTGGTGTG
TGAATTTTATGCTAATATTGATAAAGAAGATGGTTTCCAGGTGATTGTTCGAGGAGTCGAGGTAGACTGGAGTCCTAGTGCTATCAATGCACTGTATAACCTTCAGAATT
TCCCCCATGCAGCTTATAATGAGATGGTTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACA
GGGAAAAGGACATTTCAGTCAGCTTATCTGAAGAGGGAAGCGAACACGTGGATGGGATTTATCAGACAGAGGATGCTTCCAACGACTCATGACTCAACGGTCTCGAGGGA
ACGGGTTCTTCTGGCTTTCGCGATTTTGCGGTCTCTTAGCATTGATGTAGGGAAGATGATTGTTAATGAGATTTCTAGTTGTTGGAAAAAGAAAGTGGGGAAACTGTTCT
TTCCGAACACAATCACGATGCTTTGTAGAAGAGCAGGGGTTCCAGTGGATGAGGGAGATGTTATCCTGTTTGACAAGGGAATTATAGATACGCCTAACTTGGCGCGGCTT
CAGCGTACGTTGGAGGCACGTCAAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGTA
G
mRNA sequence
ATGAGCGATCCGCCTGGGGTGAGGTTTGAACTTGATCCAGAAATCGAGAGGACATTCAGGATAAGAAGGAGAGAGCAGCGAAGACAGCAAAATCAAATGGCTGACGTGCC
GCATCTACCGCAGGTAGCTCATATTAAGGCAGTGAAAACACCTTGGACTCTTCAGGTTCAGAAAGTTGTTGCGGCAAAGTTATGGCTGAAGCGAATCGTCCGAATAAGAA
GGGTCCTTCATGGCAAAAACAAGAGCGAGAAAGAAAGAGAGAATGAGGAGGAAGAGGTACCTGTTACCCCTGAGGTGCAGAAGGTAAAGGCGAAGAAGAAAAAAACACCA
GAGGAGAAAGAAGCCAAGAGACGAAGACGACAACAGAGGGCTGAGGAACAAGAAAAGGCAACAGAGGTTGTTGCTGCCACAGTTGAAGAAGGAGACTCGCAAGAACCTGA
TGTACAGAACCCAGAGGAGGCTAAGCAGAGAGTCGCGGATACAGGAAAAGTTCAAGAGGAGCAAACAGAGGAAGTTCGAGAGGAAAATACCGAGGAAGTTCGAGAAGAAA
TTTCAGAAGAAGTTCAAGAAAAGCAGGCCGAGGATGTGCAAGAGCAACAGGCAGAAGATGTTCAGGTAGCGGATAATGAGCCGGTGCAGGAGGCTCGAGTGGAGGTGATC
ATGCCAGAAGCACCGAAACGTCGCTGCATTAAGAGGAAAGCAGGCCGCGTTAGGGTTGTCCGAACTGATACTCCTTCGCCTCCAACCACGGATTCTGAAAGAGAGAATGT
AGAGAGAGAAGAGCGTGAGAAGAAGGAAGCCGAAGAAAGAGCAAGAGAAGAGGCAGAGAAAAAGGCTGAGGAAGAGCGGGTTCTCAAGCGAAGGGCGGAAAAGGGCAAAA
GTGTTGCTGAAGCATCGGAGGAACCTGATGAAATAGAAAAACATGGTCGCTTCATCAACAATTTTGCCAGAGCAAAATACGCTGAGCTGCTGAAAAGAGACTTCCTGTTT
GAAAGAGGATTTAGCGGTAATCTTCCACATTTTCTGAGGACCGGCATTGCAGACCATGGCTGGGAGTTGTTTTATGCGAAGCCTGAGCCTGTAAACGCACAGGTGGTGTG
TGAATTTTATGCTAATATTGATAAAGAAGATGGTTTCCAGGTGATTGTTCGAGGAGTCGAGGTAGACTGGAGTCCTAGTGCTATCAATGCACTGTATAACCTTCAGAATT
TCCCCCATGCAGCTTATAATGAGATGGTTGTAGCGCCATCTAATGAGCAGTTAAGTGATGCTGTGCGGGAGGTGGGTATTGAAGGGGCACAGTGGCAGCTGTCCAAGACA
GGGAAAAGGACATTTCAGTCAGCTTATCTGAAGAGGGAAGCGAACACGTGGATGGGATTTATCAGACAGAGGATGCTTCCAACGACTCATGACTCAACGGTCTCGAGGGA
ACGGGTTCTTCTGGCTTTCGCGATTTTGCGGTCTCTTAGCATTGATGTAGGGAAGATGATTGTTAATGAGATTTCTAGTTGTTGGAAAAAGAAAGTGGGGAAACTGTTCT
TTCCGAACACAATCACGATGCTTTGTAGAAGAGCAGGGGTTCCAGTGGATGAGGGAGATGTTATCCTGTTTGACAAGGGAATTATAGATACGCCTAACTTGGCGCGGCTT
CAGCGTACGTTGGAGGCACGTCAAGCAGAGCTTGGTTTTGCAGAGTGCTCAGAATCTGTTGCTGGGCGACTTGAGGGAGCAAATTCTGTGCTGCAGCAAAACTGGGAGTA
G
Protein sequence
MSDPPGVRFELDPEIERTFRIRRREQRRQQNQMADVPHLPQVAHIKAVKTPWTLQVQKVVAAKLWLKRIVRIRRVLHGKNKSEKERENEEEEVPVTPEVQKVKAKKKKTP
EEKEAKRRRRQQRAEEQEKATEVVAATVEEGDSQEPDVQNPEEAKQRVADTGKVQEEQTEEVREENTEEVREEISEEVQEKQAEDVQEQQAEDVQVADNEPVQEARVEVI
MPEAPKRRCIKRKAGRVRVVRTDTPSPPTTDSERENVEREEREKKEAEERAREEAEKKAEEERVLKRRAEKGKSVAEASEEPDEIEKHGRFINNFARAKYAELLKRDFLF
ERGFSGNLPHFLRTGIADHGWELFYAKPEPVNAQVVCEFYANIDKEDGFQVIVRGVEVDWSPSAINALYNLQNFPHAAYNEMVVAPSNEQLSDAVREVGIEGAQWQLSKT
GKRTFQSAYLKREANTWMGFIRQRMLPTTHDSTVSRERVLLAFAILRSLSIDVGKMIVNEISSCWKKKVGKLFFPNTITMLCRRAGVPVDEGDVILFDKGIIDTPNLARL
QRTLEARQAELGFAECSESVAGRLEGANSVLQQNWE