; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg032567 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg032567
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold3:23085438..23092217
RNA-Seq ExpressionSpg032567
SyntenySpg032567
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN01433.1 hypothetical protein CDL12_26059 [Handroanthus impetiginosus]1.1e-2231.43Show/hide
Query:  LREFYANIDEEEGFQVIVRGVEVDWSPSVINTLYNLQNFPHAGYNEMAVAPSNEQLSDAVREVGVEGAQWRLQKTEKRTFQSAYLKKEANTWMGFIKQRL
        +REFYAN  E +  + +VRG EV +    IN LYN+       +       +     +  R +   GAQW++ K E  +F+S  L K A  W+ FI  R+
Subjt:  LREFYANIDEEEGFQVIVRGVEVDWSPSVINTLYNLQNFPHAGYNEMAVAPSNEQLSDAVREVGVEGAQWRLQKTEKRTFQSAYLKKEANTWMGFIKQRL

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMFCKRAGVLENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVYGI
        LPT H   V+ +R LL + I+   + DVGKII+  I          L+FP+ IT  C RAGV  +E + ++F +  ID   + R+        GG    +
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMFCKRAGVLENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVYGI

Query:  NTILEQLALSASRQEF---AERQALTFWNYVGSQTFCLSIPSSLV
           +  L    S QE     ER+     +Y+G+    L   S +V
Subjt:  NTILEQLALSASRQEF---AERQALTFWNYVGSQTFCLSIPSSLV

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]2.3e-2339.58Show/hide
Query:  LREFYANIDEEEGFQVIVRGVEVDWSPSVINTLYNLQNFPHAGYNEMAVAPSNEQLSDAVREVGVEGAQWRLQKTEKRTFQSAYLKKEANTWMGFIKQRL
        +REFYAN+ +     V VRGV+V WS   IN ++ L + P   ++E     +   L   +  V V GA+W +      T   + L   A  W  F+K  L
Subjt:  LREFYANIDEEEGFQVIVRGVEVDWSPSVINTLYNLQNFPHAGYNEMAVAPSNEQLSDAVREVGVEGAQWRLQKTEKRTFQSAYLKKEANTWMGFIKQRL

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMFCK--RAGVLENEGDVILFDKGIIDTPNLARLQRTQE
        LPTTH  TVS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT  C+  RA  L NE    L + G ID   +AR+  TQE
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMFCK--RAGVLENEGDVILFDKGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]1.2e-2730.43Show/hide
Query:  RFINNLARAKYIEMLE-RDFLFERGFGDD-------LPHFLRVGITDHGWSQFCTKPELVNSNVVREFYANIDDQEGFNEMV----VAPSNDQLNA----
        +F    A  +Y   ++ R    E+GF  D       LP   +V IT H W QFC  PE     +VREFYAN+ D E     V    V+ S + +NA    
Subjt:  RFINNLARAKYIEMLE-RDFLFERGFGDD-------LPHFLRVGITDHGWSQFCTKPELVNSNVVREFYANIDDQEGFNEMV----VAPSNDQLNA----

Query:  ------------------------AVRELG---------------------------FVKLRLLPTTHDSTVSRDQVLLVFAILRSLSIDVGKIISSEIY
                                 V   G                           F+K RLLPTTH  TVS+D++LL+ ++L   SI+VG++I SEI 
Subjt:  ------------------------AVRELG---------------------------FVKLRLLPTTHDSTVSRDQVLLVFAILRSLSIDVGKIISSEIY

Query:  DCWRKKVGKLFFPNTITMLCQRVGVPMNADDVTLMDKGIIDTPNLARL---------QRTQESR---------QGGLVCGIHQMQEKL------QMH-SS
         C  +K G LFFP+ IT LC+    P   ++  L + G ID   +AR+         Q+   SR          G ++  +  ++++L      Q H  S
Subjt:  DCWRKKVGKLFFPNTITMLCQRVGVPMNADDVTLMDKGIIDTPNLARL---------QRTQESR---------QGGLVCGIHQMQEKL------QMH-SS

Query:  RMEFAERQSQTFWNYVKRRDAALRRTLQSNFSKPYPAFPVFPDDLLNPWIPPLQIEREGDEEEDPGQE
         ++   +Q Q FW Y K RD AL++ LQ+NF++P P FP FP ++L      L  E E + ++D   E
Subjt:  RMEFAERQSQTFWNYVKRRDAALRRTLQSNFSKPYPAFPVFPDDLLNPWIPPLQIEREGDEEEDPGQE

PON59596.1 hypothetical protein PanWU01x14_158080 [Parasponia andersonii]1.7e-2336.49Show/hide
Query:  FVKLRLLPTTHDSTVSRDQVLLVFAILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCQRVGVPMNADDVTLMDKGIIDTPNLARL---------
        F+K RLLPTTH  TVS+D++LL++++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+    P   ++  L   G ID   +AR+         
Subjt:  FVKLRLLPTTHDSTVSRDQVLLVFAILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCQRVGVPMNADDVTLMDKGIIDTPNLARL---------

Query:  QRTQESR---------QGGLVCGIHQMQEKL------QMH-SSRMEFAERQSQTFWNYVKRRDAALRRTLQSNFSKPYPAFPVFPDDLLNPWIPPLQIER
        Q+   SR          G ++  +  ++++L      Q H  S ++   +Q Q FW Y K RD AL++ LQ+NF++P P FP FP +LL      L  E 
Subjt:  QRTQESR---------QGGLVCGIHQMQEKL------QMH-SSRMEFAERQSQTFWNYVKRRDAALRRTLQSNFSKPYPAFPVFPDDLLNPWIPPLQIER

Query:  EGDEEEDPGQE
        E + ++D   E
Subjt:  EGDEEEDPGQE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]6.6e-2331.41Show/hide
Query:  LREFYANIDEEEGFQVIVRGVEVDWSPSVINTLYNLQNFPHAGYNEMAVAPSNEQLSDAVREVGVEGAQWRLQKTEKRTFQSAYLKKEANTWMGFIKQRL
        +REFYAN+ + E   + VRGV+V WS   IN ++ L + P   ++E     +  +L   +  V   GA+W +      T   + L   A  W  F+K RL
Subjt:  LREFYANIDEEEGFQVIVRGVEVDWSPSVINTLYNLQNFPHAGYNEMAVAPSNEQLSDAVREVGVEGAQWRLQKTEKRTFQSAYLKKEANTWMGFIKQRL

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMFCKRAGVLENEGDVILFDKGIIDTPNLARL------QRTQE----
        LPTTH   VS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT  C+ A  L NE    L + G ID   +AR+      + TQ+    
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMFCKRAGVLENEGDVILFDKGIIDTPNLARL------QRTQE----

Query:  -------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVGSQTFCLSIPSSLVVAAAKKILRGRRERDNEEEEVPVTPEAPKTKVKKRKTPEER
               +R  G V      LEQ     S+QE   +Q   FW Y   +             A KK L     ++N    +P  P  P+  ++      E 
Subjt:  -------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVGSQTFCLSIPSSLVVAAAKKILRGRRERDNEEEEVPVTPEAPKTKVKKRKTPEER

Query:  EAKRRRRQQRAE
        E+ +    + AE
Subjt:  EAKRRRRQQRAE

TrEMBL top hitse value%identityAlignment
A0A2G9G807 Uncharacterized protein5.5e-2331.43Show/hide
Query:  LREFYANIDEEEGFQVIVRGVEVDWSPSVINTLYNLQNFPHAGYNEMAVAPSNEQLSDAVREVGVEGAQWRLQKTEKRTFQSAYLKKEANTWMGFIKQRL
        +REFYAN  E +  + +VRG EV +    IN LYN+       +       +     +  R +   GAQW++ K E  +F+S  L K A  W+ FI  R+
Subjt:  LREFYANIDEEEGFQVIVRGVEVDWSPSVINTLYNLQNFPHAGYNEMAVAPSNEQLSDAVREVGVEGAQWRLQKTEKRTFQSAYLKKEANTWMGFIKQRL

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMFCKRAGVLENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVYGI
        LPT H   V+ +R LL + I+   + DVGKII+  I          L+FP+ IT  C RAGV  +E + ++F +  ID   + R+        GG    +
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMFCKRAGVLENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVYGI

Query:  NTILEQLALSASRQEF---AERQALTFWNYVGSQTFCLSIPSSLV
           +  L    S QE     ER+     +Y+G+    L   S +V
Subjt:  NTILEQLALSASRQEF---AERQALTFWNYVGSQTFCLSIPSSLV

A0A2P5AGA5 Uncharacterized protein (Fragment)1.1e-2339.58Show/hide
Query:  LREFYANIDEEEGFQVIVRGVEVDWSPSVINTLYNLQNFPHAGYNEMAVAPSNEQLSDAVREVGVEGAQWRLQKTEKRTFQSAYLKKEANTWMGFIKQRL
        +REFYAN+ +     V VRGV+V WS   IN ++ L + P   ++E     +   L   +  V V GA+W +      T   + L   A  W  F+K  L
Subjt:  LREFYANIDEEEGFQVIVRGVEVDWSPSVINTLYNLQNFPHAGYNEMAVAPSNEQLSDAVREVGVEGAQWRLQKTEKRTFQSAYLKKEANTWMGFIKQRL

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMFCK--RAGVLENEGDVILFDKGIIDTPNLARLQRTQE
        LPTTH  TVS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT  C+  RA  L NE    L + G ID   +AR+  TQE
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMFCK--RAGVLENEGDVILFDKGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)5.6e-2830.43Show/hide
Query:  RFINNLARAKYIEMLE-RDFLFERGFGDD-------LPHFLRVGITDHGWSQFCTKPELVNSNVVREFYANIDDQEGFNEMV----VAPSNDQLNA----
        +F    A  +Y   ++ R    E+GF  D       LP   +V IT H W QFC  PE     +VREFYAN+ D E     V    V+ S + +NA    
Subjt:  RFINNLARAKYIEMLE-RDFLFERGFGDD-------LPHFLRVGITDHGWSQFCTKPELVNSNVVREFYANIDDQEGFNEMV----VAPSNDQLNA----

Query:  ------------------------AVRELG---------------------------FVKLRLLPTTHDSTVSRDQVLLVFAILRSLSIDVGKIISSEIY
                                 V   G                           F+K RLLPTTH  TVS+D++LL+ ++L   SI+VG++I SEI 
Subjt:  ------------------------AVRELG---------------------------FVKLRLLPTTHDSTVSRDQVLLVFAILRSLSIDVGKIISSEIY

Query:  DCWRKKVGKLFFPNTITMLCQRVGVPMNADDVTLMDKGIIDTPNLARL---------QRTQESR---------QGGLVCGIHQMQEKL------QMH-SS
         C  +K G LFFP+ IT LC+    P   ++  L + G ID   +AR+         Q+   SR          G ++  +  ++++L      Q H  S
Subjt:  DCWRKKVGKLFFPNTITMLCQRVGVPMNADDVTLMDKGIIDTPNLARL---------QRTQESR---------QGGLVCGIHQMQEKL------QMH-SS

Query:  RMEFAERQSQTFWNYVKRRDAALRRTLQSNFSKPYPAFPVFPDDLLNPWIPPLQIEREGDEEEDPGQE
         ++   +Q Q FW Y K RD AL++ LQ+NF++P P FP FP ++L      L  E E + ++D   E
Subjt:  RMEFAERQSQTFWNYVKRRDAALRRTLQSNFSKPYPAFPVFPDDLLNPWIPPLQIEREGDEEEDPGQE

A0A2P5CEY2 Uncharacterized protein8.4e-2436.49Show/hide
Query:  FVKLRLLPTTHDSTVSRDQVLLVFAILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCQRVGVPMNADDVTLMDKGIIDTPNLARL---------
        F+K RLLPTTH  TVS+D++LL++++L   SI+VG++I SEI  C  +K G LFFP+ IT LC+    P   ++  L   G ID   +AR+         
Subjt:  FVKLRLLPTTHDSTVSRDQVLLVFAILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCQRVGVPMNADDVTLMDKGIIDTPNLARL---------

Query:  QRTQESR---------QGGLVCGIHQMQEKL------QMH-SSRMEFAERQSQTFWNYVKRRDAALRRTLQSNFSKPYPAFPVFPDDLLNPWIPPLQIER
        Q+   SR          G ++  +  ++++L      Q H  S ++   +Q Q FW Y K RD AL++ LQ+NF++P P FP FP +LL      L  E 
Subjt:  QRTQESR---------QGGLVCGIHQMQEKL------QMH-SSRMEFAERQSQTFWNYVKRRDAALRRTLQSNFSKPYPAFPVFPDDLLNPWIPPLQIER

Query:  EGDEEEDPGQE
        E + ++D   E
Subjt:  EGDEEEDPGQE

A0A2P5DXM3 Uncharacterized protein3.2e-2331.41Show/hide
Query:  LREFYANIDEEEGFQVIVRGVEVDWSPSVINTLYNLQNFPHAGYNEMAVAPSNEQLSDAVREVGVEGAQWRLQKTEKRTFQSAYLKKEANTWMGFIKQRL
        +REFYAN+ + E   + VRGV+V WS   IN ++ L + P   ++E     +  +L   +  V   GA+W +      T   + L   A  W  F+K RL
Subjt:  LREFYANIDEEEGFQVIVRGVEVDWSPSVINTLYNLQNFPHAGYNEMAVAPSNEQLSDAVREVGVEGAQWRLQKTEKRTFQSAYLKKEANTWMGFIKQRL

Query:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMFCKRAGVLENEGDVILFDKGIIDTPNLARL------QRTQE----
        LPTTH   VS++R+LL  ++L   SI+VG++I SEI  C  +K G LFFP+ IT  C+ A  L NE    L + G ID   +AR+      + TQ+    
Subjt:  LPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMFCKRAGVLENEGDVILFDKGIIDTPNLARL------QRTQE----

Query:  -------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVGSQTFCLSIPSSLVVAAAKKILRGRRERDNEEEEVPVTPEAPKTKVKKRKTPEER
               +R  G V      LEQ     S+QE   +Q   FW Y   +             A KK L     ++N    +P  P  P+  ++      E 
Subjt:  -------ARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVGSQTFCLSIPSSLVVAAAKKILRGRRERDNEEEEVPVTPEAPKTKVKKRKTPEER

Query:  EAKRRRRQQRAE
        E+ +    + AE
Subjt:  EAKRRRRQQRAE

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCGAAAACGAGAGTAAGAAAAGAGAGAGACAACGAGGAAGAAGAGGTACCTGTTACCCCCGAGGCACAGAAAGTGAAAACGAAGAAGAAGAAGACGCTAGAGGAAAA
AGAAGCCAAAAGAAGGAGAAGGCAACAACGGGCTAAGGATCAAGAAGTTGTACAGAAGGTTGTGGAAGATGTTGCTGCCACAGTGGTTGAAGAAGACCCGAAGGAACCAG
AAGAACAAAATCCAGAGCAGACTGAGCCGAGAGTTGCAGATACAGAGGAAGTTAGAGAAGAAAATACAGAGGAGGTTCAAGAAAAGCAGACTAAGGATGTGCATGAAGAA
CAGGCAGAGGTTGCGCCTGAAAAAGAAGAGAGGAGCAGGAGAAGAAAGAAGCCGAGGAAAAAGGAAGAGAGGAAGCAGAAAAGAAGGCTGAAGAAGAAAGGTTGCTTAAG
CGAAGAGCAAACAGGGGCAAAAGTGTTGCAGCAGCACCGGAGGAACCTGACGAAACAGAAGAGCCACAGTTGCCTGCGCGAATTTTATGCGAATATTGACGAAGAGGAAG
GTTTCCAAGTTATCGTTCGAGGTGTCGAGGTCGACTGGAGTCCTAGTGTTATTAACACCCTGTACAACCTTCAAAATTTCCCCCACGCAGGATACAATGAGATGGCTGTG
GCGCCATCTAATGAGCAGCTGAGTGATGCTGTGAGGGAAGTTGGTGTTGAAGGGGCGCAGTGGAGACTTCAAAAAACTGAGAAAAGGACATTTCAGTCAGCCTATCTAAA
GAAGGAAGCAAACACATGGATGGGGTTTATTAAACAGAGATTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGTGTTCTGCTGGCGTTTGCGATTTTAAGGT
CTCTCAGTATTGATGTAGGAAAGATTATTGCTAGTGAAATATCTGGGTGTTGGAAGAAGAAAGTGGGGAAGTTGTTTTTCCCGAATACCATTACCATGTTTTGCAAGCGA
GCAGGGGTTCTAGAGAATGAAGGAGATGTTATTTTATTTGACAAAGGGATCATTGATACGCCTAACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCT
TGTCTATGGCATCAACACGATTTTAGAACAACTTGCACTGTCGGCCAGCAGACAAGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTGGAACTATGTTGGAAGTCAAACCT
TTTGCTTGAGCATTCCTTCTAGCCTGGTCGTTGCTGCGGCAAAGAAGATTCTGAGAGGAAGAAGGGAAAGAGACAATGAGGAGGAAGAGGTGCCAGTTACCCCTGAGGCA
CCCAAGACAAAGGTGAAGAAAAGAAAAACGCCGGAAGAGAGGGAGGCTAAGCGAAGAAGACGTCAACAAAGGGCTGAAGTCGTAAGAAAAGTAGTAGAAGACGTTGCTGA
TGTTGTAGTTGAGGAAGAAAACCCAAAGGAACCAGAGGAAAAGAATCCTGAGCAAGAAGAGCAAGAGAAAAAAGGGACTGAGGATCAGGTGAGAGAAGAAACTGAAAAGA
AGGCCCAGGAAGAAATTTTGGTGAAGCAAACCGAAGACAAGGGCAAAGGAGTTGCTGAAGCATCGGGAGAGACAGAGGAGGTCGATCCTGAGGAACCAAGGTTGCCATAT
AATCGCTTCATCAACAATCTTGCCCGAGCAAAGTATATAGAGATGCTGGAAAGAGATTTTCTGTTTGAAAGGGGATTTGGTGATGATCTGCCGCATTTCTTAAGAGTTGG
GATAACAGATCACGGATGGAGCCAATTCTGCACAAAACCGGAGCTGGTAAATTCAAATGTTGTTCGAGAATTTTACGCGAACATTGATGATCAAGAAGGATTCAATGAGA
TGGTGGTAGCACCATCTAACGATCAATTGAACGCGGCTGTCAGGGAGTTGGGCTTCGTCAAGCTGCGTTTGCTACCAACAACTCATGATTCAACGGTGTCTCGAGACCAA
GTGCTCCTGGTATTTGCTATTCTTCGTTCGTTGAGTATCGATGTTGGGAAAATAATTTCAAGTGAAATTTATGATTGCTGGCGGAAGAAGGTAGGGAAACTGTTTTTCCC
GAACACAATAACCATGCTGTGTCAACGAGTAGGGGTTCCCATGAATGCAGACGATGTCACTCTAATGGACAAGGGAATAATAGACACACCGAACCTGGCAAGGCTTCAGA
GGACTCAAGAATCGCGCCAAGGTGGTTTGGTGTGTGGCATCCATCAAATGCAAGAGAAGTTGCAAATGCATTCCAGTCGGATGGAGTTTGCCGAAAGGCAATCCCAAACC
TTTTGGAATTATGTGAAGAGAAGGGATGCCGCGTTGAGGAGGACCTTGCAGTCTAATTTTTCTAAGCCTTATCCAGCCTTCCCAGTATTCCCTGATGACCTGTTGAACCC
CTGGATACCACCCCTGCAGATAGAAAGAGAAGGAGATGAGGAGGAAGACCCTGGTCAGGAGGATTGA
mRNA sequenceShow/hide mRNA sequence
ATGGCGAAAACGAGAGTAAGAAAAGAGAGAGACAACGAGGAAGAAGAGGTACCTGTTACCCCCGAGGCACAGAAAGTGAAAACGAAGAAGAAGAAGACGCTAGAGGAAAA
AGAAGCCAAAAGAAGGAGAAGGCAACAACGGGCTAAGGATCAAGAAGTTGTACAGAAGGTTGTGGAAGATGTTGCTGCCACAGTGGTTGAAGAAGACCCGAAGGAACCAG
AAGAACAAAATCCAGAGCAGACTGAGCCGAGAGTTGCAGATACAGAGGAAGTTAGAGAAGAAAATACAGAGGAGGTTCAAGAAAAGCAGACTAAGGATGTGCATGAAGAA
CAGGCAGAGGTTGCGCCTGAAAAAGAAGAGAGGAGCAGGAGAAGAAAGAAGCCGAGGAAAAAGGAAGAGAGGAAGCAGAAAAGAAGGCTGAAGAAGAAAGGTTGCTTAAG
CGAAGAGCAAACAGGGGCAAAAGTGTTGCAGCAGCACCGGAGGAACCTGACGAAACAGAAGAGCCACAGTTGCCTGCGCGAATTTTATGCGAATATTGACGAAGAGGAAG
GTTTCCAAGTTATCGTTCGAGGTGTCGAGGTCGACTGGAGTCCTAGTGTTATTAACACCCTGTACAACCTTCAAAATTTCCCCCACGCAGGATACAATGAGATGGCTGTG
GCGCCATCTAATGAGCAGCTGAGTGATGCTGTGAGGGAAGTTGGTGTTGAAGGGGCGCAGTGGAGACTTCAAAAAACTGAGAAAAGGACATTTCAGTCAGCCTATCTAAA
GAAGGAAGCAAACACATGGATGGGGTTTATTAAACAGAGATTGCTTCCAACGACTCATGACTCGACGGTTTCTAGGGAACGTGTTCTGCTGGCGTTTGCGATTTTAAGGT
CTCTCAGTATTGATGTAGGAAAGATTATTGCTAGTGAAATATCTGGGTGTTGGAAGAAGAAAGTGGGGAAGTTGTTTTTCCCGAATACCATTACCATGTTTTGCAAGCGA
GCAGGGGTTCTAGAGAATGAAGGAGATGTTATTTTATTTGACAAAGGGATCATTGATACGCCTAACTTGGCGCGGCTTCAGCGTACGCAAGAGGCACGTCAGGGTGGGCT
TGTCTATGGCATCAACACGATTTTAGAACAACTTGCACTGTCGGCCAGCAGACAAGAGTTTGCCGAGAGGCAAGCTTTAACCTTCTGGAACTATGTTGGAAGTCAAACCT
TTTGCTTGAGCATTCCTTCTAGCCTGGTCGTTGCTGCGGCAAAGAAGATTCTGAGAGGAAGAAGGGAAAGAGACAATGAGGAGGAAGAGGTGCCAGTTACCCCTGAGGCA
CCCAAGACAAAGGTGAAGAAAAGAAAAACGCCGGAAGAGAGGGAGGCTAAGCGAAGAAGACGTCAACAAAGGGCTGAAGTCGTAAGAAAAGTAGTAGAAGACGTTGCTGA
TGTTGTAGTTGAGGAAGAAAACCCAAAGGAACCAGAGGAAAAGAATCCTGAGCAAGAAGAGCAAGAGAAAAAAGGGACTGAGGATCAGGTGAGAGAAGAAACTGAAAAGA
AGGCCCAGGAAGAAATTTTGGTGAAGCAAACCGAAGACAAGGGCAAAGGAGTTGCTGAAGCATCGGGAGAGACAGAGGAGGTCGATCCTGAGGAACCAAGGTTGCCATAT
AATCGCTTCATCAACAATCTTGCCCGAGCAAAGTATATAGAGATGCTGGAAAGAGATTTTCTGTTTGAAAGGGGATTTGGTGATGATCTGCCGCATTTCTTAAGAGTTGG
GATAACAGATCACGGATGGAGCCAATTCTGCACAAAACCGGAGCTGGTAAATTCAAATGTTGTTCGAGAATTTTACGCGAACATTGATGATCAAGAAGGATTCAATGAGA
TGGTGGTAGCACCATCTAACGATCAATTGAACGCGGCTGTCAGGGAGTTGGGCTTCGTCAAGCTGCGTTTGCTACCAACAACTCATGATTCAACGGTGTCTCGAGACCAA
GTGCTCCTGGTATTTGCTATTCTTCGTTCGTTGAGTATCGATGTTGGGAAAATAATTTCAAGTGAAATTTATGATTGCTGGCGGAAGAAGGTAGGGAAACTGTTTTTCCC
GAACACAATAACCATGCTGTGTCAACGAGTAGGGGTTCCCATGAATGCAGACGATGTCACTCTAATGGACAAGGGAATAATAGACACACCGAACCTGGCAAGGCTTCAGA
GGACTCAAGAATCGCGCCAAGGTGGTTTGGTGTGTGGCATCCATCAAATGCAAGAGAAGTTGCAAATGCATTCCAGTCGGATGGAGTTTGCCGAAAGGCAATCCCAAACC
TTTTGGAATTATGTGAAGAGAAGGGATGCCGCGTTGAGGAGGACCTTGCAGTCTAATTTTTCTAAGCCTTATCCAGCCTTCCCAGTATTCCCTGATGACCTGTTGAACCC
CTGGATACCACCCCTGCAGATAGAAAGAGAAGGAGATGAGGAGGAAGACCCTGGTCAGGAGGATTGA
Protein sequenceShow/hide protein sequence
MAKTRVRKERDNEEEEVPVTPEAQKVKTKKKKTLEEKEAKRRRRQQRAKDQEVVQKVVEDVAATVVEEDPKEPEEQNPEQTEPRVADTEEVREENTEEVQEKQTKDVHEE
QAEVAPEKEERSRRRKKPRKKEERKQKRRLKKKGCLSEEQTGAKVLQQHRRNLTKQKSHSCLREFYANIDEEEGFQVIVRGVEVDWSPSVINTLYNLQNFPHAGYNEMAV
APSNEQLSDAVREVGVEGAQWRLQKTEKRTFQSAYLKKEANTWMGFIKQRLLPTTHDSTVSRERVLLAFAILRSLSIDVGKIIASEISGCWKKKVGKLFFPNTITMFCKR
AGVLENEGDVILFDKGIIDTPNLARLQRTQEARQGGLVYGINTILEQLALSASRQEFAERQALTFWNYVGSQTFCLSIPSSLVVAAAKKILRGRRERDNEEEEVPVTPEA
PKTKVKKRKTPEEREAKRRRRQQRAEVVRKVVEDVADVVVEEENPKEPEEKNPEQEEQEKKGTEDQVREETEKKAQEEILVKQTEDKGKGVAEASGETEEVDPEEPRLPY
NRFINNLARAKYIEMLERDFLFERGFGDDLPHFLRVGITDHGWSQFCTKPELVNSNVVREFYANIDDQEGFNEMVVAPSNDQLNAAVRELGFVKLRLLPTTHDSTVSRDQ
VLLVFAILRSLSIDVGKIISSEIYDCWRKKVGKLFFPNTITMLCQRVGVPMNADDVTLMDKGIIDTPNLARLQRTQESRQGGLVCGIHQMQEKLQMHSSRMEFAERQSQT
FWNYVKRRDAALRRTLQSNFSKPYPAFPVFPDDLLNPWIPPLQIEREGDEEEDPGQED