; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg003171 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg003171
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionUnknown protein
Genome locationscaffold4:33368478..33370486
RNA-Seq ExpressionSpg003171
SyntenySpg003171
Gene Ontology termsNA
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
PIN07564.1 hypothetical protein CDL12_19862 [Handroanthus impetiginosus]1.3e-2028.06Show/hide
Query:  NELARAKYREMLKRDFLFERGF---GDDLPHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRG---------LNAAVRVVGIE----
        N+ AR    + L +  + ERGF   G+     +   +    W    A+P+     + REFYAN    + F+ +VRG         +N    +  IE    
Subjt:  NELARAKYREMLKRDFLFERGF---GDDLPHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRG---------LNAAVRVVGIE----

Query:  --------------------GAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGK
                            GAQW+++K E  +F++  L   A  WL FI   +LPT+H   ++ D+ LL++ I+   + DVGKIISN I          
Subjt:  --------------------GAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGK

Query:  LFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARLQRTQEARQGGLVCGIHQILEQLALSASRQE---FAERQAQTYWTYA---KRRDDTLRRALQ
        L+FP+ IT LC+RAGV    ++ ++  R  ID   + R+        GG    + + +  L    S QE     ER+      Y     R    L R + 
Subjt:  LFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARLQRTQEARQGGLVCGIHQILEQLALSASRQE---FAERQAQTYWTYA---KRRDDTLRRALQ

Query:  SNFSKPYQVFPMFPDDLFNPWIPPPPVEREEEDDE
        ++        P F  D  +P  PPPP   E ED+E
Subjt:  SNFSKPYQVFPMFPDDLFNPWIPPPPVEREEEDDE

PON35554.1 hypothetical protein PanWU01x14_335450, partial [Parasponia andersonii]3.5e-2332.68Show/hide
Query:  IRFINELARAKYREMLK-RDFLFERGFGDD-------LPHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRGLNAA-----------
        ++F  E A  +Y   ++ R    E+GF  D       LP F+   I+ H W Q CA P+     +VREFYAN+ +  E    VRG+  +           
Subjt:  IRFINELARAKYREMLK-RDFLFERGFGDD-------LPHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRGLNAA-----------

Query:  -----------------------VRVVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEI
                               +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  T+S+DR+LL+ ++L   SI+VG++I +EI
Subjt:  -----------------------VRVVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEI

Query:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LC  A  P +  +  L + G ID   +AR+  TQE
Subjt:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARLQRTQE

PON46472.1 hypothetical protein PanWU01x14_251180, partial [Parasponia andersonii]3.5e-3130.35Show/hide
Query:  IRFINELARAKYREMLK-RDFLFERGFGDD-------LPHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRGLNAA-----------
        ++F  E A  +Y   ++ R    E+GF  D       LP F+   I+ H W Q CA P+     +VREFYAN+ + EE    VRG+  +           
Subjt:  IRFINELARAKYREMLK-RDFLFERGFGDD-------LPHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRGLNAA-----------

Query:  -----------------------VRVVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEI
                               +  V   GA+W +S     T   + L   A  W  F+K  LLPTTH  T+S+DR+LL+ ++L   SI+VG++I +EI
Subjt:  -----------------------VRVVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEI

Query:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARL------QRTQEA-----------RQGGLVCGIHQILEQLALSASRQEF---
          C  +K G LFFP+ IT LC  A  P +  +  L + G ID   +AR+      + TQ+            R  G +    + LEQ       Q++   
Subjt:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARL------QRTQEA-----------RQGGLVCGIHQILEQLALSASRQEF---

Query:  -----AERQAQTYWTYAKRRDDTLRRALQSNFSKPYQVFPMFPDDLFNPWIPPPPVEREEEDDEEQGQE
               +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++    +     E E E D++   E
Subjt:  -----AERQAQTYWTYAKRRDDTLRRALQSNFSKPYQVFPMFPDDLFNPWIPPPPVEREEEDDEEQGQE

PON78020.1 hypothetical protein PanWU01x14_023740 [Parasponia andersonii]6.5e-2531.29Show/hide
Query:  IVREFYANVDNAEEFQAIVRGLNAA----------------------------------VRVVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCL
        +VREFYAN+ + EE    VRG+  +                                  +  V   GA+W +S     T   + L   A  W  F+K  L
Subjt:  IVREFYANVDNAEEFQAIVRGLNAA----------------------------------VRVVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCL

Query:  LPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARL--------------Q
        LPTTH   +S+DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A  P +  +  L + G ID   +AR+               
Subjt:  LPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARL--------------Q

Query:  RTQEARQGGLVCGIHQILEQLALSASRQEFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQVFPMFPDDLFNPWIPPPPVEREEEDDEEQGQE
        R   A        + Q L+ L    S+QE   +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++    +     E E E D++   E
Subjt:  RTQEARQGGLVCGIHQILEQLALSASRQEFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQVFPMFPDDLFNPWIPPPPVEREEEDDEEQGQE

TYG52543.1 hypothetical protein ES288_D09G036700v1 [Gossypium darwinii]1.6e-2033.19Show/hide
Query:  INELARAKYREMLK-RDFLFERGFG---DDL---PHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRG--------LNAAVRVVGIE
        I+E  + ++  + K +  + E+GFG   +DL   P  +R +I+   W + C      +  +VREFYA++   +  + IVR         L   + VV   
Subjt:  INELARAKYREMLK-RDFLFERGFG---DDL---PHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRG--------LNAAVRVVGIE

Query:  GAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVP
        G+QW +      + Q  YLK  A  W  F++   +P +H ST+S + +LL++AIL   SI+VGKII  EI NC +KK    +FP+ IT LC +A V    
Subjt:  GAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVP

Query:  EDVILLDRGIIDTPNLARL-QRTQEARQG
               +G I   +L RL +R  E  QG
Subjt:  EDVILLDRGIIDTPNLARL-QRTQEARQG

TrEMBL top hitse value%identityAlignment
A0A2G9GQI5 Uncharacterized protein6.1e-2128.06Show/hide
Query:  NELARAKYREMLKRDFLFERGF---GDDLPHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRG---------LNAAVRVVGIE----
        N+ AR    + L +  + ERGF   G+     +   +    W    A+P+     + REFYAN    + F+ +VRG         +N    +  IE    
Subjt:  NELARAKYREMLKRDFLFERGF---GDDLPHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRG---------LNAAVRVVGIE----

Query:  --------------------GAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGK
                            GAQW+++K E  +F++  L   A  WL FI   +LPT+H   ++ D+ LL++ I+   + DVGKIISN I          
Subjt:  --------------------GAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGK

Query:  LFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARLQRTQEARQGGLVCGIHQILEQLALSASRQE---FAERQAQTYWTYA---KRRDDTLRRALQ
        L+FP+ IT LC+RAGV    ++ ++  R  ID   + R+        GG    + + +  L    S QE     ER+      Y     R    L R + 
Subjt:  LFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARLQRTQEARQGGLVCGIHQILEQLALSASRQE---FAERQAQTYWTYA---KRRDDTLRRALQ

Query:  SNFSKPYQVFPMFPDDLFNPWIPPPPVEREEEDDE
        ++        P F  D  +P  PPPP   E ED+E
Subjt:  SNFSKPYQVFPMFPDDLFNPWIPPPPVEREEEDDE

A0A2P5AGA5 Uncharacterized protein (Fragment)1.7e-2332.68Show/hide
Query:  IRFINELARAKYREMLK-RDFLFERGFGDD-------LPHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRGLNAA-----------
        ++F  E A  +Y   ++ R    E+GF  D       LP F+   I+ H W Q CA P+     +VREFYAN+ +  E    VRG+  +           
Subjt:  IRFINELARAKYREMLK-RDFLFERGFGDD-------LPHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRGLNAA-----------

Query:  -----------------------VRVVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEI
                               +  V + GA+W +S     T   + L   A  W  F+K  LLPTTH  T+S+DR+LL+ ++L   SI+VG++I +EI
Subjt:  -----------------------VRVVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEI

Query:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARLQRTQE
          C  +K G LFFP+ IT LC  A  P +  +  L + G ID   +AR+  TQE
Subjt:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARLQRTQE

A0A2P5BCG4 Uncharacterized protein (Fragment)1.7e-3130.35Show/hide
Query:  IRFINELARAKYREMLK-RDFLFERGFGDD-------LPHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRGLNAA-----------
        ++F  E A  +Y   ++ R    E+GF  D       LP F+   I+ H W Q CA P+     +VREFYAN+ + EE    VRG+  +           
Subjt:  IRFINELARAKYREMLK-RDFLFERGFGDD-------LPHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRGLNAA-----------

Query:  -----------------------VRVVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEI
                               +  V   GA+W +S     T   + L   A  W  F+K  LLPTTH  T+S+DR+LL+ ++L   SI+VG++I +EI
Subjt:  -----------------------VRVVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEI

Query:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARL------QRTQEA-----------RQGGLVCGIHQILEQLALSASRQEF---
          C  +K G LFFP+ IT LC  A  P +  +  L + G ID   +AR+      + TQ+            R  G +    + LEQ       Q++   
Subjt:  FNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARL------QRTQEA-----------RQGGLVCGIHQILEQLALSASRQEF---

Query:  -----AERQAQTYWTYAKRRDDTLRRALQSNFSKPYQVFPMFPDDLFNPWIPPPPVEREEEDDEEQGQE
               +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++    +     E E E D++   E
Subjt:  -----AERQAQTYWTYAKRRDDTLRRALQSNFSKPYQVFPMFPDDLFNPWIPPPPVEREEEDDEEQGQE

A0A2P5DXM3 Uncharacterized protein3.1e-2531.29Show/hide
Query:  IVREFYANVDNAEEFQAIVRGLNAA----------------------------------VRVVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCL
        +VREFYAN+ + EE    VRG+  +                                  +  V   GA+W +S     T   + L   A  W  F+K  L
Subjt:  IVREFYANVDNAEEFQAIVRGLNAA----------------------------------VRVVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCL

Query:  LPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARL--------------Q
        LPTTH   +S+DR+LL+ ++L   SI+VG++I +EI  C  +K G LFFP+ IT LC  A  P +  +  L + G ID   +AR+               
Subjt:  LPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARL--------------Q

Query:  RTQEARQGGLVCGIHQILEQLALSASRQEFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQVFPMFPDDLFNPWIPPPPVEREEEDDEEQGQE
        R   A        + Q L+ L    S+QE   +Q Q +W Y+K RD  L++ALQ+NF++P   FP FP ++    +     E E E D++   E
Subjt:  RTQEARQGGLVCGIHQILEQLALSASRQEFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQVFPMFPDDLFNPWIPPPPVEREEEDDEEQGQE

A0A5D2B8V0 Uncharacterized protein8.0e-2133.19Show/hide
Query:  INELARAKYREMLK-RDFLFERGFG---DDL---PHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRG--------LNAAVRVVGIE
        I+E  + ++  + K +  + E+GFG   +DL   P  +R +I+   W + C      +  +VREFYA++   +  + IVR         L   + VV   
Subjt:  INELARAKYREMLK-RDFLFERGFG---DDL---PHFLRARISNHGWNQLCAKPDPVNSNIVREFYANVDNAEEFQAIVRG--------LNAAVRVVGIE

Query:  GAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVP
        G+QW +      + Q  YLK  A  W  F++   +P +H ST+S + +LL++AIL   SI+VGKII  EI NC +KK    +FP+ IT LC +A V    
Subjt:  GAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEIFNCWRKKVGKLFFPNTITMLCSRAGVPTVP

Query:  EDVILLDRGIIDTPNLARL-QRTQEARQG
               +G I   +L RL +R  E  QG
Subjt:  EDVILLDRGIIDTPNLARL-QRTQEARQG

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
No hits found

Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGCCATCTGATGGAGCCACGTGTCACACAGGGAAGATCAAAGGGCTTAGCCGTTGGACAGCGCATTGTTGTCTTCACTTCATTTTCTTGCTTTCGATTTTTATTGTT
TCTGTTTTCCTTTTCGCTTTTATTTGCTGCAAATTCCTTTGTGTCTCTAATGGCGAAAACAAGAGCAAGAAAAGAGCGAGACAACGAGGAAGAAGAGGTACCTGTTACCC
CCGAAGTACAGAAAGTGAAAACAAAGAAAAAGAAAACGCCGGAGGAAAAAGAAGCCAAAAGAAGGAGAAGGCAACAGAGGGCTGAGGATCAAGAAAGTGTACAGAAGGTG
GTAGAAGATGTGGTTGCCACAGTGGTTGAAGACCCGAAGGAACCAGAGGGACAGAACACTGAGCTGATTGACCCAGTAGTTGCAGATATGGAGGGAGTTCAAGAAGAACA
AACAGAGGAAGTTCAAGAAAAACAGGCCGAAGATACGCAAGAAGGTAGGACAGAGGATGTTCAGGAAACAAGTAATGAGCAGGTGGAGCAAGAGCAAGAGGCTCGTGTTG
AGGTTATCATTCTGGAAGTACCAAAATGTCGCCGCGTGAAGCGGAAAGCTGGACGCGTCAAGGTAGTCCGAGCTGATACCCCATCACCACCATCGACGGATTCTGAGAAA
GAGAATGCAAAGAGAGAGGAACGGGAGAAAAAGGAGGCTGAGGACAGAGAGAAAGAAGAAGTAGGAAAGAAAGCAGCGGAAGAAACTTTGACAAAGCATCAAGAAGACAG
GGGCAAAGGAATTGCGGAAGCATCGGATGAACCTATAGAAGAAGCAGAAGAAAGACCATTCATCCGCTTCATCAATGAACTTGCCCGAGCAAAATACCGGGAGATGCTAA
AAAGGGATTTCTTATTCGAAAGAGGGTTTGGTGACGATCTGCCACATTTCTTAAGGGCAAGGATCTCGAATCACGGCTGGAATCAGTTATGTGCGAAACCGGACCCAGTG
AATTCGAACATTGTTCGAGAGTTTTATGCGAATGTTGATAATGCAGAGGAATTTCAGGCCATAGTCCGAGGATTAAATGCGGCGGTCCGAGTGGTTGGCATTGAGGGGGC
TCAATGGAGGCTATCAAAGACGGAGAAGCGAACATTTCAAGCTGCCTATTTAAAGAGTGAAGCCAATACTTGGTTGGGCTTCATCAAGCTGTGTTTGCTTCCAACTACGC
ATGATTCAACAATGTCTCGCGACCGAGTGCTTTTGATATTCGCAATTCTTCGATCCTTAAGTATTGATGTTGGAAAAATCATTTCGAATGAAATCTTTAATTGCTGGCGC
AAAAAGGTGGGGAAGTTGTTTTTCCCGAACACGATCACTATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCCAGAGGATGTGATTTTGCTTGACAGGGGAATCATCGA
TACGCCTAATCTGGCGCGGCTTCAGCGTACGCAGGAGGCACGCCAGGGTGGGCTAGTGTGTGGGATTCATCAAATCCTAGAGCAATTGGCACTTTCGGCCAGTAGGCAAG
AGTTTGCTGAGAGGCAAGCTCAAACCTATTGGACCTATGCTAAAAGGAGAGATGACACACTCAGGAGGGCCTTGCAATCCAATTTCTCCAAACCATATCAAGTCTTCCCT
ATGTTTCCCGATGATTTATTTAACCCTTGGATACCGCCCCCACCTGTCGAAAGAGAAGAAGAGGATGATGAAGAGCAGGGTCAGGAAGACTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGCCATCTGATGGAGCCACGTGTCACACAGGGAAGATCAAAGGGCTTAGCCGTTGGACAGCGCATTGTTGTCTTCACTTCATTTTCTTGCTTTCGATTTTTATTGTT
TCTGTTTTCCTTTTCGCTTTTATTTGCTGCAAATTCCTTTGTGTCTCTAATGGCGAAAACAAGAGCAAGAAAAGAGCGAGACAACGAGGAAGAAGAGGTACCTGTTACCC
CCGAAGTACAGAAAGTGAAAACAAAGAAAAAGAAAACGCCGGAGGAAAAAGAAGCCAAAAGAAGGAGAAGGCAACAGAGGGCTGAGGATCAAGAAAGTGTACAGAAGGTG
GTAGAAGATGTGGTTGCCACAGTGGTTGAAGACCCGAAGGAACCAGAGGGACAGAACACTGAGCTGATTGACCCAGTAGTTGCAGATATGGAGGGAGTTCAAGAAGAACA
AACAGAGGAAGTTCAAGAAAAACAGGCCGAAGATACGCAAGAAGGTAGGACAGAGGATGTTCAGGAAACAAGTAATGAGCAGGTGGAGCAAGAGCAAGAGGCTCGTGTTG
AGGTTATCATTCTGGAAGTACCAAAATGTCGCCGCGTGAAGCGGAAAGCTGGACGCGTCAAGGTAGTCCGAGCTGATACCCCATCACCACCATCGACGGATTCTGAGAAA
GAGAATGCAAAGAGAGAGGAACGGGAGAAAAAGGAGGCTGAGGACAGAGAGAAAGAAGAAGTAGGAAAGAAAGCAGCGGAAGAAACTTTGACAAAGCATCAAGAAGACAG
GGGCAAAGGAATTGCGGAAGCATCGGATGAACCTATAGAAGAAGCAGAAGAAAGACCATTCATCCGCTTCATCAATGAACTTGCCCGAGCAAAATACCGGGAGATGCTAA
AAAGGGATTTCTTATTCGAAAGAGGGTTTGGTGACGATCTGCCACATTTCTTAAGGGCAAGGATCTCGAATCACGGCTGGAATCAGTTATGTGCGAAACCGGACCCAGTG
AATTCGAACATTGTTCGAGAGTTTTATGCGAATGTTGATAATGCAGAGGAATTTCAGGCCATAGTCCGAGGATTAAATGCGGCGGTCCGAGTGGTTGGCATTGAGGGGGC
TCAATGGAGGCTATCAAAGACGGAGAAGCGAACATTTCAAGCTGCCTATTTAAAGAGTGAAGCCAATACTTGGTTGGGCTTCATCAAGCTGTGTTTGCTTCCAACTACGC
ATGATTCAACAATGTCTCGCGACCGAGTGCTTTTGATATTCGCAATTCTTCGATCCTTAAGTATTGATGTTGGAAAAATCATTTCGAATGAAATCTTTAATTGCTGGCGC
AAAAAGGTGGGGAAGTTGTTTTTCCCGAACACGATCACTATGTTATGCAGCAGGGCAGGAGTGCCCACGGTTCCAGAGGATGTGATTTTGCTTGACAGGGGAATCATCGA
TACGCCTAATCTGGCGCGGCTTCAGCGTACGCAGGAGGCACGCCAGGGTGGGCTAGTGTGTGGGATTCATCAAATCCTAGAGCAATTGGCACTTTCGGCCAGTAGGCAAG
AGTTTGCTGAGAGGCAAGCTCAAACCTATTGGACCTATGCTAAAAGGAGAGATGACACACTCAGGAGGGCCTTGCAATCCAATTTCTCCAAACCATATCAAGTCTTCCCT
ATGTTTCCCGATGATTTATTTAACCCTTGGATACCGCCCCCACCTGTCGAAAGAGAAGAAGAGGATGATGAAGAGCAGGGTCAGGAAGACTGA
Protein sequenceShow/hide protein sequence
MSHLMEPRVTQGRSKGLAVGQRIVVFTSFSCFRFLLFLFSFSLLFAANSFVSLMAKTRARKERDNEEEEVPVTPEVQKVKTKKKKTPEEKEAKRRRRQQRAEDQESVQKV
VEDVVATVVEDPKEPEGQNTELIDPVVADMEGVQEEQTEEVQEKQAEDTQEGRTEDVQETSNEQVEQEQEARVEVIILEVPKCRRVKRKAGRVKVVRADTPSPPSTDSEK
ENAKREEREKKEAEDREKEEVGKKAAEETLTKHQEDRGKGIAEASDEPIEEAEERPFIRFINELARAKYREMLKRDFLFERGFGDDLPHFLRARISNHGWNQLCAKPDPV
NSNIVREFYANVDNAEEFQAIVRGLNAAVRVVGIEGAQWRLSKTEKRTFQAAYLKSEANTWLGFIKLCLLPTTHDSTMSRDRVLLIFAILRSLSIDVGKIISNEIFNCWR
KKVGKLFFPNTITMLCSRAGVPTVPEDVILLDRGIIDTPNLARLQRTQEARQGGLVCGIHQILEQLALSASRQEFAERQAQTYWTYAKRRDDTLRRALQSNFSKPYQVFP
MFPDDLFNPWIPPPPVEREEEDDEEQGQED