; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Spg019597 (gene) of Sponge gourd (cylindrica) v1 genome

Gene IDSpg019597
OrganismLuffa cylindrica (Sponge gourd (cylindrica) v1)
DescriptionProtein of unknown function (DUF3537)
Genome locationscaffold5:35121325..35125007
RNA-Seq ExpressionSpg019597
SyntenySpg019597
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_008458982.1 PREDICTED: uncharacterized protein LOC103498231 [Cucumis melo]3.1e-19785.22Show/hide
Query:  EEKKSPSQIDLYQPLDSRKAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAV
        EEKKS  Q+D  +   S+ AESESESE EAAELRR ESFLKWICI+D SN Y ASLSC VFFVF  AVPIASHFALSCSDCDEDHQRPFHVVVQ+SLSAV
Subjt:  EEKKSPSQIDLYQPLDSRKAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAV

Query:  ATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRT
        ATLSF CLS+WLRLFGLNRFLFLDKL EASPK+R EY +QL+ SME++SFFLLPCFMAEA YK+WWY+SAA EIPYY  NMYISY  SCTLELCSWLYRT
Subjt:  ATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRT

Query:  SIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLV
        SIFFFVCILFRLIC LQMIRLEDFASIFR E EVGTIL+QHLGLRRT TIISHRFRVFMLLSLI VTASQFI+LLMTTRS AHVNLSKAGQLALCSISLV
Subjt:  SIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLV

Query:  TGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGET-PTASTVA-IIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNKAGI
        TG+FICLRSAAKITHKAQSITCLAAKWHVSAVIN+FDDLD ET PTAS V  I+ESNSDDEDGDEDDLDD KLMPVFAHTI+FQKRQALV YL+NNKAGI
Subjt:  TGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGET-PTASTVA-IIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNKAGI

Query:  TVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI
        TVYGFMVDRTWLKSIFA+ELAL LWLLNKT+G+
Subjt:  TVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI

XP_011660297.1 uncharacterized protein LOC101203162 [Cucumis sativus]4.8e-19081.32Show/hide
Query:  MENEEKKSPSQIDLYQPLDSRKAES-ESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVS
        ME  EKKS  QI      DS+K++S E ESE EA ELR+LESFL+WICI+D SN Y AS+SC +FFVF  AVPIASHF LSCSDCDEDHQRPFHVVVQ+S
Subjt:  MENEEKKSPSQIDLYQPLDSRKAES-ESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVS

Query:  LSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSW
        LSAVATLSF CLS+WLR+FGLNRFLFLDKLCEASPK+R EY +QL+ SM+++SFFLLPCFMAEA YK+WWY+SAA EIPYY  NMYISY  SCTLELCSW
Subjt:  LSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSW

Query:  LYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCS
        LYRTSIFFFVCI FRLIC LQMIRLEDFAS FR E EVGTIL+QHLGLRRT T+ISHRFRVFMLLSLI VTASQFI+LLMTTRS AH NLSK+GQLALCS
Subjt:  LYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCS

Query:  ISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGE-TPTASTVA-IIESNSDDEDG--DEDDLDDAKLMPVFAHTITFQKRQALVVYLK
        ISLVTG+FICLRSAAKITHKAQSITCLAAKWHVSAVIN+FD+LD E TPTAS V  ++ESNSDDEDG  DEDDLDDAKLMPVFAHTI+FQKRQALV YL+
Subjt:  ISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGE-TPTASTVA-IIESNSDDEDG--DEDDLDDAKLMPVFAHTITFQKRQALVVYLK

Query:  NNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI
        NNKAGITVYGFMVDRTWLKSIFA+ELAL LWLLNKT+G+
Subjt:  NNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI

XP_022954540.1 uncharacterized protein LOC111456780 [Cucurbita moschata]5.6e-19180.95Show/hide
Query:  MENEEKKSPSQIDLYQP--LDSRKA----ESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV
        ME+ EKKSPS      P  ++SRK+    ESESESES++AELRR ESFLKWICI+D SNP++A+LSCF+F  FA AVPIASHFALSCSDCDEDH+RPFHV
Subjt:  MENEEKKSPSQIDLYQP--LDSRKA----ESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV

Query:  VVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTL
        VVQ+SLSAVATLSF CLS WLR  GL+RFLFLDKLCE+S K RDEYSKQLK SME+ISFFLLPCFMAEAAYK+WWYVSAA EIPYYG NMY+SY  SCTL
Subjt:  VVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTL

Query:  ELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQ
        EL SWLYRTSIFFFVCILFRL+CRLQMIRLEDF S+F RE++VGTILMQHLGLRRTLT ISHRFRVFM LSLI VTASQFI+LLMTTRS+A  NLSK GQ
Subjt:  ELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQ

Query:  LALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGETPTASTVAIIESNS-DDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVY
        LALCSISLVTG+FICLRSAAKITHKAQSITCLAAKWHVSA IN+FDDLD ETPTAS +A  E NS DDE+ DEDD DD KLMPVFAHTI+FQKRQALV Y
Subjt:  LALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGETPTASTVAIIESNS-DDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVY

Query:  LKNNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI
        LKNNK GITVYGF+VDRTWLKS+FA+ELAL+LWLLNKT+GI
Subjt:  LKNNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI

XP_022994504.1 uncharacterized protein LOC111490209 [Cucurbita maxima]1.3e-19280.9Show/hide
Query:  MENEEKKSPSQIDLYQP------LDSR----KAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQR
        ME+ EKKSPS      P      ++SR    K+ESESESES++AELRR ESFLKWICI+D SNP++A+LSCF+F  FA AVPIASHFALSCSDCDEDH+R
Subjt:  MENEEKKSPSQIDLYQP------LDSR----KAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQR

Query:  PFHVVVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTI
        PFHVVVQ+SLSAVATLSF CLS WLR FGL+RFLFLDKLCE+S K RDEYSKQLK SME+ISFFLLPCFMAEAAYK+WWYVSAAYEIPYYG NMY+SY  
Subjt:  PFHVVVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTI

Query:  SCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLS
        SCTLEL SWLYRTSIFFFVCILFRL+CRLQMIRLEDF S+F RE++VGTILMQHLGLRRTLTIISHRFRVFM LSLI VTASQFI LLMTTRS+A  NLS
Subjt:  SCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLS

Query:  KAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGETPTASTVAIIESNS-DDEDGDEDDLDDAKLMPVFAHTITFQKRQA
        K GQLALCSISLVTG+FICLRSAAKI+HKAQSITCLAAKWHVSA IN+FDDLD ETPT S +A  E NS DDED DEDD DD KLMPVFAHTI+FQKRQA
Subjt:  KAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGETPTASTVAIIESNS-DDEDGDEDDLDDAKLMPVFAHTITFQKRQA

Query:  LVVYLKNNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI
        LV YL+NNKAGITVYGF+VDRTWLKS+FA+ELALVLWLLNKT+GI
Subjt:  LVVYLKNNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI

XP_038893800.1 uncharacterized protein LOC120082620 [Benincasa hispida]1.7e-19283.49Show/hide
Query:  MENEEKKSPSQIDLYQPLDSRKAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSL
        ME EEKKS  QID  Q L+S   E ESE ESEAAELRR +S LKWICI D SNPY ASLSC VFFVFA AVP+ASHFALSCSDCDEDHQRPFHVVVQ+SL
Subjt:  MENEEKKSPSQIDLYQPLDSRKAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSL

Query:  SAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWL
        SAVATLSF CLS+WLR FGLNRFLFLDKL EASP+VR EY +QL+ SM ++SFFLLPCFMAEA YK+WWY+SAA EIPYY  N+YISY ISCTLELCSWL
Subjt:  SAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWL

Query:  YRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSI
        YRTSIFFFVCILFRLIC LQMIRLEDFASIF REAEVGTILMQHL LRRT TIISHRFR F+LLSLI VTASQFI+LLMTTRS AHVNLSKAGQLALCSI
Subjt:  YRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSI

Query:  SLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGET-PTASTVA-IIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNK
        SLVTG+FICLRSAAKITHKAQSITCLAAKWHVSAVIN+FDDLD ET PTAS ++ I+ESNS DE+ DEDDLDDAKLMPVFAHTI+FQKRQALV YL+NNK
Subjt:  SLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGET-PTASTVA-IIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNK

Query:  AGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI
        AGITVYGF VDRTWLKSIFA+ELAL LWLLNKT+G+
Subjt:  AGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI

TrEMBL top hitse value%identityAlignment
A0A0A0M3M6 Uncharacterized protein2.3e-19081.32Show/hide
Query:  MENEEKKSPSQIDLYQPLDSRKAES-ESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVS
        ME  EKKS  QI      DS+K++S E ESE EA ELR+LESFL+WICI+D SN Y AS+SC +FFVF  AVPIASHF LSCSDCDEDHQRPFHVVVQ+S
Subjt:  MENEEKKSPSQIDLYQPLDSRKAES-ESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVS

Query:  LSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSW
        LSAVATLSF CLS+WLR+FGLNRFLFLDKLCEASPK+R EY +QL+ SM+++SFFLLPCFMAEA YK+WWY+SAA EIPYY  NMYISY  SCTLELCSW
Subjt:  LSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSW

Query:  LYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCS
        LYRTSIFFFVCI FRLIC LQMIRLEDFAS FR E EVGTIL+QHLGLRRT T+ISHRFRVFMLLSLI VTASQFI+LLMTTRS AH NLSK+GQLALCS
Subjt:  LYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCS

Query:  ISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGE-TPTASTVA-IIESNSDDEDG--DEDDLDDAKLMPVFAHTITFQKRQALVVYLK
        ISLVTG+FICLRSAAKITHKAQSITCLAAKWHVSAVIN+FD+LD E TPTAS V  ++ESNSDDEDG  DEDDLDDAKLMPVFAHTI+FQKRQALV YL+
Subjt:  ISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGE-TPTASTVA-IIESNSDDEDG--DEDDLDDAKLMPVFAHTITFQKRQALVVYLK

Query:  NNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI
        NNKAGITVYGFMVDRTWLKSIFA+ELAL LWLLNKT+G+
Subjt:  NNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI

A0A1S3C949 uncharacterized protein LOC1034982311.5e-19785.22Show/hide
Query:  EEKKSPSQIDLYQPLDSRKAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAV
        EEKKS  Q+D  +   S+ AESESESE EAAELRR ESFLKWICI+D SN Y ASLSC VFFVF  AVPIASHFALSCSDCDEDHQRPFHVVVQ+SLSAV
Subjt:  EEKKSPSQIDLYQPLDSRKAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAV

Query:  ATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRT
        ATLSF CLS+WLRLFGLNRFLFLDKL EASPK+R EY +QL+ SME++SFFLLPCFMAEA YK+WWY+SAA EIPYY  NMYISY  SCTLELCSWLYRT
Subjt:  ATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRT

Query:  SIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLV
        SIFFFVCILFRLIC LQMIRLEDFASIFR E EVGTIL+QHLGLRRT TIISHRFRVFMLLSLI VTASQFI+LLMTTRS AHVNLSKAGQLALCSISLV
Subjt:  SIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLV

Query:  TGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGET-PTASTVA-IIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNKAGI
        TG+FICLRSAAKITHKAQSITCLAAKWHVSAVIN+FDDLD ET PTAS V  I+ESNSDDEDGDEDDLDD KLMPVFAHTI+FQKRQALV YL+NNKAGI
Subjt:  TGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGET-PTASTVA-IIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNKAGI

Query:  TVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI
        TVYGFMVDRTWLKSIFA+ELAL LWLLNKT+G+
Subjt:  TVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI

A0A6J1GSP8 uncharacterized protein LOC1114567802.7e-19180.95Show/hide
Query:  MENEEKKSPSQIDLYQP--LDSRKA----ESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV
        ME+ EKKSPS      P  ++SRK+    ESESESES++AELRR ESFLKWICI+D SNP++A+LSCF+F  FA AVPIASHFALSCSDCDEDH+RPFHV
Subjt:  MENEEKKSPSQIDLYQP--LDSRKA----ESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV

Query:  VVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTL
        VVQ+SLSAVATLSF CLS WLR  GL+RFLFLDKLCE+S K RDEYSKQLK SME+ISFFLLPCFMAEAAYK+WWYVSAA EIPYYG NMY+SY  SCTL
Subjt:  VVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTL

Query:  ELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQ
        EL SWLYRTSIFFFVCILFRL+CRLQMIRLEDF S+F RE++VGTILMQHLGLRRTLT ISHRFRVFM LSLI VTASQFI+LLMTTRS+A  NLSK GQ
Subjt:  ELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQ

Query:  LALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGETPTASTVAIIESNS-DDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVY
        LALCSISLVTG+FICLRSAAKITHKAQSITCLAAKWHVSA IN+FDDLD ETPTAS +A  E NS DDE+ DEDD DD KLMPVFAHTI+FQKRQALV Y
Subjt:  LALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGETPTASTVAIIESNS-DDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVY

Query:  LKNNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI
        LKNNK GITVYGF+VDRTWLKS+FA+ELAL+LWLLNKT+GI
Subjt:  LKNNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI

A0A6J1K314 uncharacterized protein LOC1114902096.5e-19380.9Show/hide
Query:  MENEEKKSPSQIDLYQP------LDSR----KAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQR
        ME+ EKKSPS      P      ++SR    K+ESESESES++AELRR ESFLKWICI+D SNP++A+LSCF+F  FA AVPIASHFALSCSDCDEDH+R
Subjt:  MENEEKKSPSQIDLYQP------LDSR----KAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQR

Query:  PFHVVVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTI
        PFHVVVQ+SLSAVATLSF CLS WLR FGL+RFLFLDKLCE+S K RDEYSKQLK SME+ISFFLLPCFMAEAAYK+WWYVSAAYEIPYYG NMY+SY  
Subjt:  PFHVVVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTI

Query:  SCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLS
        SCTLEL SWLYRTSIFFFVCILFRL+CRLQMIRLEDF S+F RE++VGTILMQHLGLRRTLTIISHRFRVFM LSLI VTASQFI LLMTTRS+A  NLS
Subjt:  SCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLS

Query:  KAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGETPTASTVAIIESNS-DDEDGDEDDLDDAKLMPVFAHTITFQKRQA
        K GQLALCSISLVTG+FICLRSAAKI+HKAQSITCLAAKWHVSA IN+FDDLD ETPT S +A  E NS DDED DEDD DD KLMPVFAHTI+FQKRQA
Subjt:  KAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGETPTASTVAIIESNS-DDEDGDEDDLDDAKLMPVFAHTITFQKRQA

Query:  LVVYLKNNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI
        LV YL+NNKAGITVYGF+VDRTWLKS+FA+ELALVLWLLNKT+GI
Subjt:  LVVYLKNNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI

A0A6J1KGZ0 uncharacterized protein LOC1114956691.4e-18780Show/hide
Query:  MENEEKKSPSQIDLYQPLDSRKAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSL
        M+N +KKSPS      P DS+    ESESE EA ELRRLESFLKWIC+ DQSNPY ASLSC +FF+FA AVP+ASHFALSCSDCDEDHQRPFHVVVQ+SL
Subjt:  MENEEKKSPSQIDLYQPLDSRKAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSL

Query:  SAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWL
        SAVA LSF  LS+WLRLFG NRFLFLDKL +ASP+V+ EYS+QL+ SME+IS F++PCFMAEAAYKMWWY++AA +IPYY  NMY+SY  SCTLELCSWL
Subjt:  SAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWL

Query:  YRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSI
        YRTSIFFFVC+LFRLIC LQMIRLEDFAS+F RE +VGTIL+ HLGLRRT TIISHRFR F+LLSLI VTASQFI+LLMTT + AHVNLSKAGQLALCSI
Subjt:  YRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSI

Query:  SLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGE-TPTASTVAIIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNKA
        SLVTG+FICLRSAAKITHKAQSITCLAAKWHVSAV+++FDDLD + TPTA+T   IE NSDDEDGDEDDLDDAKLMPVFA TI+FQKRQALV+YL+NNKA
Subjt:  SLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSFDDLDGE-TPTASTVAIIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNKA

Query:  GITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI
        GITVYGFMVDRTWLKSIFA+ELAL LWLLNKT+GI
Subjt:  GITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)5.2e-11853.46Show/hide
Query:  ELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASP
        EL     +L+W+C +D S+P++A LS  +F VF   VP  SHF L+C+DCD  H RP+  VVQ+SLS+VAT+SF CL+ ++  +GL RFLF DKL + S 
Subjt:  ELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASP

Query:  KVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRRE
         VR  Y+ QL  S+ I+S+F++PCF A +AYK+WWY S    IP+ G N  +S T++C +ELCSWLYRT++ F VC+LFRLIC LQ++RL+DFA +F+ +
Subjt:  KVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRRE

Query:  AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSA
        ++VG+IL +HL +RR L IISHR+R F+L  LI VT SQF +LL+TT++   VN+ +AG+LALCS++LVT + I LRSA+KITHKAQ++TCLAAKWHV A
Subjt:  AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSA

Query:  VINSFD------DLDGETPTA------------STVAIIESNSDDEDGDEDDLDDAKLMPVFA-HTITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKS
         + SFD      D   ETPT               V + ES+SD+   +EDDLD+  ++PV+A  T++FQKRQALV Y +NN AGITVYGF +DR  L +
Subjt:  VINSFD------DLDGETPTA------------STVAIIESNSDDEDGDEDDLDDAKLMPVFA-HTITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKS

Query:  IFAVELALVLWLLNKTIGI
        IF +EL+LVLWLL KTIGI
Subjt:  IFAVELALVLWLLNKTIGI

AT3G20300.1 Protein of unknown function (DUF3537)2.2e-12154.61Show/hide
Query:  ELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASP
        EL     +L+W+C +DQS+P++A LS  +F VF   VP  SHF L+CSDCD  H RP+  VVQ+SLS+ A LSF CLS ++  +GL RFLF DKL + S 
Subjt:  ELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASP

Query:  KVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRRE
         VR  Y+ QL  S++I+S+F+ PCF+A ++YK+WWY S A +IP+ G N+ +S T++C +ELCSWLYRT++ F VC+LFRLIC LQ++RL+DFA +F+ +
Subjt:  KVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRRE

Query:  AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSA
        ++VG+IL +HL +RR L IISHR+R F+LLSLI VT SQF +LL+TT++ A +N+ +AG+LALCS++LVT + I LRSA+KITHKAQ++TCLAAKWHV A
Subjt:  AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSA

Query:  VINSFDDLDGETPTASTVAI-----------IESNSDDEDGDEDDLDDAKLMPVFAH-TITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKSIFAVELA
         I SF+ +DGETP     A             ES+S+D   +EDD D+  L+P +A+ TI+FQKRQALV Y +NN++GITV+GF +DR+ L +IF +E++
Subjt:  VINSFDDLDGETPTASTVAI-----------IESNSDDEDGDEDDLDDAKLMPVFAH-TITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKSIFAVELA

Query:  LVLWLLNKTIGI
        LVLWLL KTIGI
Subjt:  LVLWLLNKTIGI

AT4G03820.1 Protein of unknown function (DUF3537)6.6e-10548.95Show/hide
Query:  DSRKAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSVWLRLF
        ++R    +   ES A +L    SF +     DQSN     LS  +FF+ A  VP+ SHF L C+DCD  H+RP+  +VQ+SLS  A +SF  LS W + +
Subjt:  DSRKAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSVWLRLF

Query:  GLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICR
        G+ RFLF DKL + S KVR  Y  +++ SM++++ F+LP    +A Y++WWY S   +IPY   N  +S+ ++CTL+L SWLYRTS+F   CIL++ IC 
Subjt:  GLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICR

Query:  LQMIRLEDFASIFRRE-AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKIT
        LQ++RL++FA  F  E  +  +IL +HL +RR L I+SHRFR F+LLSL FVTA+QF+ LL T R++   N+ + G+LALCS SLV+G+FICL+SA ++T
Subjt:  LQMIRLEDFASIFRRE-AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKIT

Query:  HKAQSITCLAAKWHVSAVINSFDDL-DGETPTASTVA-----------IIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNKAGITVYG
        HKAQS+T +A KW+V A +++FD L DGETP   T             +++S+ DDE+G+ DD +D ++ P+FA  I+ QKRQALV YL+NN+AGITVYG
Subjt:  HKAQSITCLAAKWHVSAVINSFDDL-DGETPTASTVA-----------IIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNKAGITVYG

Query:  FMVDRTWLKSIFAVELALVLWLLNKTI
        F+VD+TWL+ IF++ELAL+LWLL KTI
Subjt:  FMVDRTWLKSIFAVELALVLWLLNKTI

AT4G03820.2 Protein of unknown function (DUF3537)6.6e-10548.95Show/hide
Query:  DSRKAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSVWLRLF
        ++R    +   ES A +L    SF +     DQSN     LS  +FF+ A  VP+ SHF L C+DCD  H+RP+  +VQ+SLS  A +SF  LS W + +
Subjt:  DSRKAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSVWLRLF

Query:  GLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICR
        G+ RFLF DKL + S KVR  Y  +++ SM++++ F+LP    +A Y++WWY S   +IPY   N  +S+ ++CTL+L SWLYRTS+F   CIL++ IC 
Subjt:  GLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICR

Query:  LQMIRLEDFASIFRRE-AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKIT
        LQ++RL++FA  F  E  +  +IL +HL +RR L I+SHRFR F+LLSL FVTA+QF+ LL T R++   N+ + G+LALCS SLV+G+FICL+SA ++T
Subjt:  LQMIRLEDFASIFRRE-AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKIT

Query:  HKAQSITCLAAKWHVSAVINSFDDL-DGETPTASTVA-----------IIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNKAGITVYG
        HKAQS+T +A KW+V A +++FD L DGETP   T             +++S+ DDE+G+ DD +D ++ P+FA  I+ QKRQALV YL+NN+AGITVYG
Subjt:  HKAQSITCLAAKWHVSAVINSFDDL-DGETPTASTVA-----------IIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNKAGITVYG

Query:  FMVDRTWLKSIFAVELALVLWLLNKTI
        F+VD+TWL+ IF++ELAL+LWLL KTI
Subjt:  FMVDRTWLKSIFAVELALVLWLLNKTI

AT4G22270.1 Protein of unknown function (DUF3537)1.8e-11855.36Show/hide
Query:  SFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEY
        +F+  +   DQSN  +A LS  VFF+    VP+ SHF L CSDCD  H+RP+ V+VQ+SLS  A +SF  LS+W R FG+ RFLFLDKL + S KVR EY
Subjt:  SFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSVWLRLFGLNRFLFLDKLCEASPKVRDEY

Query:  SKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRRE-AEVGT
          +++ S++ +  F+LP    EA Y++WWY+S   +IPY   N  +S+ ++CTL+L SWLYR S+F  VCIL+++ C LQ +RL+DFA  F  E  +V +
Subjt:  SKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRRE-AEVGT

Query:  ILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSF
         L +H  +RR L I+SHRFR F+LLSLI VTA+QF+ LL TTR++  VN+ + G+LALCS+SLVTG+FICLRSA KITHKAQS+T LAAKW+V A ++SF
Subjt:  ILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINSF

Query:  DDLDGETPTASTVA--------IIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKT
        D LDGETPT S +          IE++ D+E   +DDLD+ K+ P++A+TI++QKRQALV YL+NNKAGITVYGF+VDR+WL +IF +ELAL+LWLLNKT
Subjt:  DDLDGETPTASTVA--------IIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKT

Query:  I
        I
Subjt:  I


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAACGAGGAGAAGAAATCTCCCTCTCAAATCGATCTGTACCAACCACTCGATTCGCGAAAAGCAGAATCTGAATCTGAATCTGAATCTGAAGCGGCCGAATTGAG
AAGGCTCGAATCATTCCTGAAATGGATTTGCATAATCGATCAATCCAATCCATACAGCGCTTCGCTCTCCTGCTTCGTCTTCTTCGTCTTCGCATTCGCCGTCCCTATCG
CATCGCACTTCGCTCTCTCTTGCTCCGATTGCGACGAAGATCACCAGAGGCCTTTCCATGTCGTCGTCCAGGTTTCTCTCTCCGCCGTTGCCACGCTTTCATTCGCTTGC
CTCTCTGTTTGGCTCCGTCTCTTCGGATTGAACCGATTTCTGTTCCTCGATAAGCTCTGTGAAGCAAGTCCGAAGGTTCGGGATGAGTATTCCAAGCAATTGAAGATATC
AATGGAGATCATCTCCTTCTTCCTGCTGCCATGTTTCATGGCAGAAGCAGCGTACAAAATGTGGTGGTACGTGTCAGCAGCGTACGAAATCCCATACTACGGCAAGAACA
TGTACATAAGCTACACCATCTCCTGCACATTGGAGCTGTGCTCATGGCTTTACAGAACTTCCATCTTCTTCTTCGTGTGCATTCTGTTTCGTCTAATCTGCCGCCTGCAA
ATGATCAGACTTGAAGATTTTGCTTCTATCTTCCGTCGGGAAGCCGAGGTCGGCACCATCCTCATGCAGCATTTGGGCCTCAGAAGAACCTTGACCATCATCAGCCATCG
CTTCAGAGTCTTCATGTTGCTCTCCTTGATTTTCGTCACTGCCAGTCAGTTCATCACTCTCTTGATGACTACTAGATCTAATGCTCATGTTAACCTCTCCAAGGCTGGAC
AACTTGCGCTATGCTCCATCAGCCTGGTCACAGGCATGTTCATATGCCTCCGCAGTGCAGCAAAGATCACCCACAAAGCACAGTCCATCACATGCCTGGCAGCAAAGTGG
CACGTCTCCGCCGTCATAAACAGCTTCGACGACCTCGATGGCGAGACGCCAACAGCATCTACGGTTGCGATCATCGAGTCGAACTCTGATGATGAAGACGGCGACGAGGA
CGACTTGGATGATGCCAAACTAATGCCAGTTTTTGCCCACACAATCACATTCCAAAAGAGGCAGGCACTAGTGGTGTATTTGAAGAATAATAAAGCAGGAATTACAGTGT
ATGGATTCATGGTGGACAGAACATGGCTGAAATCCATTTTTGCTGTTGAACTTGCACTTGTGCTGTGGCTGCTCAACAAGACTATTGGTATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAACGAGGAGAAGAAATCTCCCTCTCAAATCGATCTGTACCAACCACTCGATTCGCGAAAAGCAGAATCTGAATCTGAATCTGAATCTGAAGCGGCCGAATTGAG
AAGGCTCGAATCATTCCTGAAATGGATTTGCATAATCGATCAATCCAATCCATACAGCGCTTCGCTCTCCTGCTTCGTCTTCTTCGTCTTCGCATTCGCCGTCCCTATCG
CATCGCACTTCGCTCTCTCTTGCTCCGATTGCGACGAAGATCACCAGAGGCCTTTCCATGTCGTCGTCCAGGTTTCTCTCTCCGCCGTTGCCACGCTTTCATTCGCTTGC
CTCTCTGTTTGGCTCCGTCTCTTCGGATTGAACCGATTTCTGTTCCTCGATAAGCTCTGTGAAGCAAGTCCGAAGGTTCGGGATGAGTATTCCAAGCAATTGAAGATATC
AATGGAGATCATCTCCTTCTTCCTGCTGCCATGTTTCATGGCAGAAGCAGCGTACAAAATGTGGTGGTACGTGTCAGCAGCGTACGAAATCCCATACTACGGCAAGAACA
TGTACATAAGCTACACCATCTCCTGCACATTGGAGCTGTGCTCATGGCTTTACAGAACTTCCATCTTCTTCTTCGTGTGCATTCTGTTTCGTCTAATCTGCCGCCTGCAA
ATGATCAGACTTGAAGATTTTGCTTCTATCTTCCGTCGGGAAGCCGAGGTCGGCACCATCCTCATGCAGCATTTGGGCCTCAGAAGAACCTTGACCATCATCAGCCATCG
CTTCAGAGTCTTCATGTTGCTCTCCTTGATTTTCGTCACTGCCAGTCAGTTCATCACTCTCTTGATGACTACTAGATCTAATGCTCATGTTAACCTCTCCAAGGCTGGAC
AACTTGCGCTATGCTCCATCAGCCTGGTCACAGGCATGTTCATATGCCTCCGCAGTGCAGCAAAGATCACCCACAAAGCACAGTCCATCACATGCCTGGCAGCAAAGTGG
CACGTCTCCGCCGTCATAAACAGCTTCGACGACCTCGATGGCGAGACGCCAACAGCATCTACGGTTGCGATCATCGAGTCGAACTCTGATGATGAAGACGGCGACGAGGA
CGACTTGGATGATGCCAAACTAATGCCAGTTTTTGCCCACACAATCACATTCCAAAAGAGGCAGGCACTAGTGGTGTATTTGAAGAATAATAAAGCAGGAATTACAGTGT
ATGGATTCATGGTGGACAGAACATGGCTGAAATCCATTTTTGCTGTTGAACTTGCACTTGTGCTGTGGCTGCTCAACAAGACTATTGGTATTTAA
Protein sequenceShow/hide protein sequence
MENEEKKSPSQIDLYQPLDSRKAESESESESEAAELRRLESFLKWICIIDQSNPYSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFAC
LSVWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQ
MIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKW
HVSAVINSFDDLDGETPTASTVAIIESNSDDEDGDEDDLDDAKLMPVFAHTITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKSIFAVELALVLWLLNKTIGI