CuGenDBv2

Lag0030097 (gene) of Sponge gourd (AG-4) v1 genome

Gene ID: Lag0030097
Organism: Luffa acutangula AG-4 (Sponge gourd (AG-4) v1)
Description: Protein of unknown function (DUF1997)
Genome location: chr8:44557588..44558946
RNA-Seq Expression: Lag0030097
Synteny: Lag0030097
Gene Ontology terms: NA
InterPro domains: IPR018971 - Protein of unknown function DUF1997


Homology
GenBank top hits: E-value, % identity, alignment
KAG7033185.1 hypothetical protein SDJN02_07239, partial [Cucurbita argyrosperma subsp. argyrosperma]; E-value 6.2e-94; identity 71.26%
Query:  MLYSSTQL-GLHVENGVLPIQRESLKKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQL
        ML+S TQ  GL V+NGV+      L+KSHLK QK + WKCF+V K+QQK RKDQN LSVSLR FSD+P YE  GKASFDQYLEDK RLV ATFPGKS+QL
Subjt:  MLYSSTQL-GLHVENGVLPIQRESLKKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQL

Query:  NQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLV
        NQEEWRIETPKIE LFLKIWPTID+KI+SK++GEGYP DVP  ITKVL+L++T WE+NGI +DYRPSSAN+CSR A+YSEK G RSRLKFQL +NLSF +
Subjt:  NQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLV

Query:  PNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK
        P+AL+FVP DVF+ IVE  L+ M+ED+K K +D+LVEDYC FRKEKK
Subjt:  PNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK

XP_022961709.1 uncharacterized protein LOC111462397 [Cucurbita moschata]; E-value 7.4e-95; identity 71.66%
Query:  MLYSSTQL-GLHVENGVLPIQRESLKKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQL
        ML+S TQ  GL V+NGV+      L+KSHLK QK + WKCF+V K+QQK RKDQN LSVSLR FSD+P YE  GKASFDQYLEDK RLV ATFPGKS+QL
Subjt:  MLYSSTQL-GLHVENGVLPIQRESLKKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQL

Query:  NQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLV
        NQEEWRIETPKIE LFLKIWPTID+KI+SK++GEGYP DVP  ITKVL+L++T WE+NGIH+DYRPSSAN+CSR A+YSEK G RSRLKFQL +NLSF +
Subjt:  NQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLV

Query:  PNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK
        P+AL+FVP DVF+ IVE  L+ M+ED+K K +D+LVEDYC FRKEKK
Subjt:  PNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK

XP_023517004.1 uncharacterized protein LOC111780797 isoform X1 [Cucurbita pepo subsp. pepo]; E-value 4.3e-95; identity 71.66%
Query:  MLYSSTQL-GLHVENGVLPIQRESLKKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQL
        MLYS TQ  GL V+NGV+      L+KSHLK QK + WKCF+V K+QQK RKDQN LSVSLR FSD+P YE  GKASFDQYLEDK R+V ATFPGKS+QL
Subjt:  MLYSSTQL-GLHVENGVLPIQRESLKKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQL

Query:  NQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLV
        NQEEWRIETPKIE LFLKIWPTID+KI+SK++GEGYP DVP  ITKVL+L++T WE+NGIH+DYRPSSAN+CSR A+YS+K G RSRLKFQL +NLSF +
Subjt:  NQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLV

Query:  PNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK
        P+AL+FVP DVF+ IVE  L+AM+ED+K K +D+LVEDYC FRKEKK
Subjt:  PNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK

XP_023517005.1 uncharacterized protein LOC111780797 isoform X2 [Cucurbita pepo subsp. pepo]; E-value 6.4e-91; identity 70.45%
Query:  MLYSSTQL-GLHVENGVLPIQRESLKKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQL
        MLYS TQ  GL V+NGV+      L+KSHLK QK + WKCF+V K+QQK RKDQN LSVSLR FSD+P YE  GKASFDQYLEDK R+V ATFPGKS+QL
Subjt:  MLYSSTQL-GLHVENGVLPIQRESLKKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQL

Query:  NQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLV
        NQEEWRIETPKIE LFLKIWPTID+KI+SK++GEGYP DVP  ITK      T WE+NGIH+DYRPSSAN+CSR A+YS+K G RSRLKFQL +NLSF +
Subjt:  NQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLV

Query:  PNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK
        P+AL+FVP DVF+ IVE  L+AM+ED+K K +D+LVEDYC FRKEKK
Subjt:  PNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK

XP_038891182.1 uncharacterized protein LOC120080556 [Benincasa hispida]; E-value 9.3e-90; identity 72.8%
Query:  LHVENGVLPIQRESLKKS-HLKKQKSTSWKCFSVPKAQQKL-RKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQLNQEEWRIE
        L VENGVL +QRES + S +LKKQK   WKCF+V K Q+       N LSVSL FFSD+P Y+SPGKASFD+YLEDK RLV ATFPGK QQLNQEEWRIE
Subjt:  LHVENGVLPIQRESLKKS-HLKKQKSTSWKCFSVPKAQQKL-RKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQLNQEEWRIE

Query:  TPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLVPNALSFVP
         PKIELLFLKIWPT+D+KI  K+NGE YP DVP YITKVL LE+T WEINGIHKDYRPS AN+CSR A+YSEKIGTRS LKF+LL+NLSFLVP  L+FV 
Subjt:  TPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLVPNALSFVP

Query:  NDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK
        NDV + IV+T L+AMIEDLKHK+I KLVEDY +FRKE K
Subjt:  NDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK

TrEMBL top hits: E-value, % identity, alignment
A0A0A0KSD5 Uncharacterized protein; E-value 1.1e-85; identity 70.4%
Query:  KKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQLNQEEWRIETPKIELLFLKIWPTIDM
        ++S LKKQK  +WKCF++    QK+    N LSVS   FSD+P YESPGKASFD+YLEDK RLV ATFPGK+QQLNQEEWRIETPKI+LLFLKI PTIDM
Subjt:  KKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQLNQEEWRIETPKIELLFLKIWPTIDM

Query:  KIMSKSN-GEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLVPNALSFVPNDVFRGIVETVLRAMI
        KI+SK+N GE YP  VP YI K+L  ++T WEINGIHK+YRPSSAN+CS   +Y +KIGTRSRLKFQL+++LSFLVP+AL FVPNDV RGI+ETV++AM+
Subjt:  KIMSKSN-GEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLVPNALSFVPNDVFRGIVETVLRAMI

Query:  EDLKHKTIDKLVEDYCKFRKEKK
        EDLKHKT+ KLVEDY KFR EK+
Subjt:  EDLKHKTIDKLVEDYCKFRKEKK

A0A1S4E357 uncharacterized protein LOC103498744; E-value 9.4e-72; identity 68.72%
Query:  KKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQLNQEEWRIETPKIELLFLKIWPTIDM
        ++S LKKQK   W+CF++P++Q+ +  D N LSVS   FSD+  +ESPGKASFD+YLEDK RL+ ATFPGK QQLNQEEWRIETPKI+LLFLKIWPT+DM
Subjt:  KKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQLNQEEWRIETPKIELLFLKIWPTIDM

Query:  KIMSKSN-GEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLVPNALSFVPNDVFRGIVETV
        KI+SK+N GE YP DVP YI KVL  E+T WEINGI+KDYRPSSAN+CS   +Y EKIGTRS LKF+L+++LSFLVP+AL FVPNDV RG++ TV
Subjt:  KIMSKSN-GEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLVPNALSFVPNDVFRGIVETV

A0A6J1C174 uncharacterized protein LOC111006493 isoform X1; E-value 1.1e-80; identity 64.11%
Query:  LYSSTQLGLHVENGVLPIQRESLKKSHLKKQKS---TSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQ
        L+S+  L  HVENG    QR + + + +KK+K     S K  +V K QQ   + QN LS S+ FFSD+P  ESPGKASFDQYLEDK R++ ATFPGKSQQ
Subjt:  LYSSTQLGLHVENGVLPIQRESLKKSHLKKQKS---TSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQ

Query:  LNQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFL
        LNQEEWRIETPK+ELL LKIWP IDMKI+SK++G+ YP  VP +ITK+L LE+T WEINGIH++YRPSSAN+ S+ A+YSEK GT SRLKFQ  MN +F+
Subjt:  LNQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFL

Query:  VPNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK
        VP ALSF+P D+FR I ETVL+ M+EDL +K IDKLVEDY KFRKEKK
Subjt:  VPNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK

A0A6J1HEU2 uncharacterized protein LOC111462397; E-value 3.6e-95; identity 71.66%
Query:  MLYSSTQL-GLHVENGVLPIQRESLKKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQL
        ML+S TQ  GL V+NGV+      L+KSHLK QK + WKCF+V K+QQK RKDQN LSVSLR FSD+P YE  GKASFDQYLEDK RLV ATFPGKS+QL
Subjt:  MLYSSTQL-GLHVENGVLPIQRESLKKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQL

Query:  NQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLV
        NQEEWRIETPKIE LFLKIWPTID+KI+SK++GEGYP DVP  ITKVL+L++T WE+NGIH+DYRPSSAN+CSR A+YSEK G RSRLKFQL +NLSF +
Subjt:  NQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLV

Query:  PNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK
        P+AL+FVP DVF+ IVE  L+ M+ED+K K +D+LVEDYC FRKEKK
Subjt:  PNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK

A0A6J1JUJ3 uncharacterized protein LOC111487627; E-value 1.2e-87; identity 68.02%
Query:  MLYSSTQL-GLHVENGVLPIQRESLKKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQL
        MLYS TQ  GL V+NGV+      L+KSHLK QK + WKCF+V KA          LSVSLR FSD+P YE  GKASFDQYLEDK RLV A FPGKS+QL
Subjt:  MLYSSTQL-GLHVENGVLPIQRESLKKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQL

Query:  NQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLV
        NQEEWRIETPKIE LFLKIWPTID+KI+SK++GEGYP DVP  IT+VL+L++T WE+NGI +DY PSSAN+CSR A+YSEK G RSRLKFQL +NLSF +
Subjt:  NQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLV

Query:  PNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK
        P+AL+F+P DVF+ IVET L+AM+ED+K K +D+LVEDYC FRKEKK
Subjt:  PNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK

SwissProt top hits: E-value, % identity, alignment
No hits found
Arabidopsis top hits: E-value, % identity, alignment
AT5G39520.1 Protein of unknown function (DUF1997); E-value 5.9e-42; identity 43.16%
Query:  SLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPG--KSQQLNQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEI
        S +  +D+  +ESP +A FD+YLEDKSR+  A FP   K+ +LN+EEWRI+   I+  FL   P + M+I  KSNG+ YP DVP +ITKVLEL +TKWE+
Subjt:  SLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPG--KSQQLNQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEI

Query:  NGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLVPNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK
         G+ +   P+   +  + A+Y ++ G  +RLK +L   +SF++P+ L+ VP DV R +   +L  +++++KH+ I+ LV DY KF+ E+K
Subjt:  NGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLVPNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKK

AT5G39530.1 Protein of unknown function (DUF1997); E-value 7.4e-45; identity 44.79%
Query:  SLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGK--SQQLNQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEI
        S R  +D+P  ESP +A FD+YLEDKSR+  A FP K  S +LN+EEWRI+   I  LFL +WP +DM++  KSNG+ YP DVP  ITKVLEL + +W++
Subjt:  SLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGK--SQQLNQEEWRIETPKIELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEI

Query:  NGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLVPNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKKIK
         G+ +   P+  ++  + A+Y ++ G  +RL+ QL MN+SF++P  L  VP DV R +   VL  ++E++KHK    L+ DY +F+ E+K++
Subjt:  NGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLVPNALSFVPNDVFRGIVETVLRAMIEDLKHKTIDKLVEDYCKFRKEKKIK


Sequences
CDS sequence
ATGCTTTATTCAAGTACACAGCTAGGTTTGCATGTCGAAAATGGTGTTCTTCCAATTCAAAGAGAAAGCTTGAAGAAATCCCATCTGAAGAAGCAGAAAAGTACGAGCTG
GAAGTGCTTTTCAGTGCCAAAAGCACAGCAGAAACTAAGAAAAGATCAAAACTTTTTATCTGTTTCTTTGAGATTTTTCAGTGACGTACCATTTTATGAGTCTCCAGGGA
AAGCTTCTTTTGATCAGTACTTGGAAGATAAATCCAGATTGGTCAATGCAACATTTCCAGGAAAAAGTCAACAGCTCAACCAGGAAGAGTGGAGAATCGAAACGCCAAAA
ATCGAATTGCTGTTCCTCAAGATATGGCCAACGATTGATATGAAAATAATGAGCAAATCCAATGGAGAAGGTTACCCATTTGATGTTCCTGAATATATCACTAAAGTTCT
GGAGCTTGAATTGACAAAGTGGGAGATCAATGGCATTCATAAAGACTACAGGCCATCTTCAGCCAATATTTGTTCTCGAGCAGCTGTTTACAGTGAGAAAATAGGAACTA
GAAGCCGTCTTAAGTTTCAACTTCTAATGAATCTCAGCTTTCTTGTCCCCAATGCTTTGAGTTTCGTTCCGAACGACGTTTTTCGGGGCATCGTCGAGACGGTTTTGAGG
GCAATGATTGAGGACTTGAAGCATAAAACTATAGATAAATTGGTTGAGGATTATTGTAAATTCAGGAAGGAGAAGAAGATCAAGTAA
mRNA sequence
ATGCTTTATTCAAGTACACAGCTAGGTTTGCATGTCGAAAATGGTGTTCTTCCAATTCAAAGAGAAAGCTTGAAGAAATCCCATCTGAAGAAGCAGAAAAGTACGAGCTG
GAAGTGCTTTTCAGTGCCAAAAGCACAGCAGAAACTAAGAAAAGATCAAAACTTTTTATCTGTTTCTTTGAGATTTTTCAGTGACGTACCATTTTATGAGTCTCCAGGGA
AAGCTTCTTTTGATCAGTACTTGGAAGATAAATCCAGATTGGTCAATGCAACATTTCCAGGAAAAAGTCAACAGCTCAACCAGGAAGAGTGGAGAATCGAAACGCCAAAA
ATCGAATTGCTGTTCCTCAAGATATGGCCAACGATTGATATGAAAATAATGAGCAAATCCAATGGAGAAGGTTACCCATTTGATGTTCCTGAATATATCACTAAAGTTCT
GGAGCTTGAATTGACAAAGTGGGAGATCAATGGCATTCATAAAGACTACAGGCCATCTTCAGCCAATATTTGTTCTCGAGCAGCTGTTTACAGTGAGAAAATAGGAACTA
GAAGCCGTCTTAAGTTTCAACTTCTAATGAATCTCAGCTTTCTTGTCCCCAATGCTTTGAGTTTCGTTCCGAACGACGTTTTTCGGGGCATCGTCGAGACGGTTTTGAGG
GCAATGATTGAGGACTTGAAGCATAAAACTATAGATAAATTGGTTGAGGATTATTGTAAATTCAGGAAGGAGAAGAAGATCAAGTAA
Protein sequence
MLYSSTQLGLHVENGVLPIQRESLKKSHLKKQKSTSWKCFSVPKAQQKLRKDQNFLSVSLRFFSDVPFYESPGKASFDQYLEDKSRLVNATFPGKSQQLNQEEWRIETPK
IELLFLKIWPTIDMKIMSKSNGEGYPFDVPEYITKVLELELTKWEINGIHKDYRPSSANICSRAAVYSEKIGTRSRLKFQLLMNLSFLVPNALSFVPNDVFRGIVETVLR
AMIEDLKHKTIDKLVEDYCKFRKEKKIK
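
The CDS and protein sequences listed above can be checked against each other by translating the CDS. The following is a minimal sketch, not part of the database record: it assumes the standard genetic code (codons enumerated in TCAG order) and reading frame 0, and the names `translate`, `CODON_TABLE`, and `cds_start` are hypothetical helpers introduced only for illustration. It translates the first 108 nucleotides of the CDS, which should reproduce the start of the protein sequence.

```python
# Standard genetic code (an assumption of this sketch), with codons
# enumerated in T, C, A, G order for each of the three positions.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 0, stopping at the first stop codon.

    Whitespace and line breaks are ignored; unknown codons become 'X'.
    """
    cds = "".join(cds.split()).upper()
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[pos:pos + 3], "X")
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 108 nucleotides (36 codons) of the Lag0030097 CDS listed above.
cds_start = (
    "ATGCTTTATTCAAGTACACAGCTAGGTTTGCATGTCGAAAATGGTGTT"
    "CTTCCAATTCAAAGAGAAAGCTTGAAGAAATCCCATCTGAAGAAGCAGAAAAGTACGAGC"
)
print(translate(cds_start))  # MLYSSTQLGLHVENGVLPIQRESLKKSHLKKQKSTS
```

Passing the full CDS (all lines joined) to `translate` in the same way would allow a comparison with the complete protein sequence.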