; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lag0025474 (gene) of Sponge gourd (AG-4) v1 genome

Gene IDLag0025474
OrganismLuffa acutangula AG-4 (Sponge gourd (AG-4) v1)
DescriptionProtein of unknown function (DUF3537)
Genome locationchr10:13552820..13555937
RNA-Seq ExpressionLag0025474
SyntenyLag0025474
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR021924 - Protein of unknown function DUF3537


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6573486.1 hypothetical protein SDJN03_27373, partial [Cucurbita argyrosperma subsp. sororia]1.3e-19080.73Show/hide
Query:  MENEEKKSPSQIDLYQP------LDSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV
        ME+ EKKSPS      P      ++S K+E + ESES++AEL+RFESFLKWICI+D SNPF+A+LSCF+F  FA AVPIASHFALSCSDCDEDH+RPFHV
Subjt:  MENEEKKSPSQIDLYQP------LDSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV

Query:  VVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTL
        VVQ+SLSAVATLSF CLS WLR  GL+RFLFLDKLCE+S K RDEYSKQLK SME+ISFFLLPCFMAEAAYK+WWYVSAA EIPYYG NMY+SY  SCTL
Subjt:  VVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTL

Query:  ELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQ
        EL SWLYRTSIFFFVCILFRL+CRLQMIRLEDF S+F RE++VGTILMQHLGLRRTLT+ISHRFRVFM LSLI VTASQFI+LLMTTRS+A  NLSK GQ
Subjt:  ELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQ

Query:  LALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGETPTASTVAVIESNS-DDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVY
        LALCSISLVTG+FICLRSAAKITHKAQSITCLAAKWHVSA INTFDDLD ETPTAS +A  E NS DDE+ DEDD DD KLMPVFAHTI+FQKRQALV Y
Subjt:  LALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGETPTASTVAVIESNS-DDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVY

Query:  LKNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI
        LKNNK GITVYGF+VDRTWLKS+FAIELAL LWLLNKT+GI
Subjt:  LKNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI

XP_008458982.1 PREDICTED: uncharacterized protein LOC103498231 [Cucumis melo]1.6e-19885.45Show/hide
Query:  EEKKSPSQIDLYQPLDSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAV
        EEKKS  Q+D  +   S+ AESESESE EAAEL+RFESFLKWICI+D SN + ASLSC VFFVF  AVPIASHFALSCSDCDEDHQRPFHVVVQ+SLSAV
Subjt:  EEKKSPSQIDLYQPLDSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAV

Query:  ATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRT
        ATLSF CLS+WLRLFGLNRFLFLDKL EASPK+R EY +QL+ SME++SFFLLPCFMAEA YK+WWY+SAA EIPYY  NMYISY  SCTLELCSWLYRT
Subjt:  ATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRT

Query:  SIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLV
        SIFFFVCILFRLIC LQMIRLEDFASIFR E EVGTIL+QHLGLRRT TIISHRFRVFMLLSLI VTASQFI+LLMTTRS AHVNLSKAGQLALCSISLV
Subjt:  SIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLV

Query:  TGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGET-PTASTVA-VIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKAGI
        TG+FICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLD ET PTAS V  ++ESNSDDEDGDEDDLDDPKLMPVFAHTI+FQKRQALV YL+NNKAGI
Subjt:  TGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGET-PTASTVA-VIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKAGI

Query:  TVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI
        TVYGFMVDRTWLKSIFAIELAL LWLLNKT+G+
Subjt:  TVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI

XP_022954540.1 uncharacterized protein LOC111456780 [Cucurbita moschata]6.6e-19281.63Show/hide
Query:  MENEEKKSPSQIDLYQP--LDSRKA----ESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV
        ME+ EKKSPS      P  ++SRK+    ESESESES++AEL+RFESFLKWICI+D SNPF+A+LSCF+F  FA AVPIASHFALSCSDCDEDH+RPFHV
Subjt:  MENEEKKSPSQIDLYQP--LDSRKA----ESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV

Query:  VVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTL
        VVQ+SLSAVATLSF CLS WLR  GL+RFLFLDKLCE+S K RDEYSKQLK SME+ISFFLLPCFMAEAAYK+WWYVSAA EIPYYG NMY+SY  SCTL
Subjt:  VVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTL

Query:  ELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQ
        EL SWLYRTSIFFFVCILFRL+CRLQMIRLEDF S+F RE++VGTILMQHLGLRRTLT ISHRFRVFM LSLI VTASQFI+LLMTTRS+A  NLSK GQ
Subjt:  ELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQ

Query:  LALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGETPTASTVAVIESNS-DDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVY
        LALCSISLVTG+FICLRSAAKITHKAQSITCLAAKWHVSA INTFDDLD ETPTAS +A  E NS DDE+ DEDD DD KLMPVFAHTI+FQKRQALV Y
Subjt:  LALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGETPTASTVAVIESNS-DDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVY

Query:  LKNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI
        LKNNK GITVYGF+VDRTWLKS+FAIELAL+LWLLNKT+GI
Subjt:  LKNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI

XP_022994504.1 uncharacterized protein LOC111490209 [Cucurbita maxima]1.6e-19381.57Show/hide
Query:  MENEEKKSPSQIDLYQP------LDSR----KAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQR
        ME+ EKKSPS      P      ++SR    K+ESESESES++AEL+RFESFLKWICI+D SNPF+A+LSCF+F  FA AVPIASHFALSCSDCDEDH+R
Subjt:  MENEEKKSPSQIDLYQP------LDSR----KAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQR

Query:  PFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTI
        PFHVVVQ+SLSAVATLSF CLS WLR FGL+RFLFLDKLCE+S K RDEYSKQLK SME+ISFFLLPCFMAEAAYK+WWYVSAAYEIPYYG NMY+SY  
Subjt:  PFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTI

Query:  SCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLS
        SCTLEL SWLYRTSIFFFVCILFRL+CRLQMIRLEDF S+F RE++VGTILMQHLGLRRTLTIISHRFRVFM LSLI VTASQFI LLMTTRS+A  NLS
Subjt:  SCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLS

Query:  KAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGETPTASTVAVIESNS-DDEDGDEDDLDDPKLMPVFAHTITFQKRQA
        K GQLALCSISLVTG+FICLRSAAKI+HKAQSITCLAAKWHVSA INTFDDLD ETPT S +A  E NS DDED DEDD DD KLMPVFAHTI+FQKRQA
Subjt:  KAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGETPTASTVAVIESNS-DDEDGDEDDLDDPKLMPVFAHTITFQKRQA

Query:  LVVYLKNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI
        LV YL+NNKAGITVYGF+VDRTWLKS+FAIELALVLWLLNKT+GI
Subjt:  LVVYLKNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI

XP_038893800.1 uncharacterized protein LOC120082620 [Benincasa hispida]5.1e-19282.76Show/hide
Query:  MENEEKKSPSQIDLYQPLDSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSL
        ME EEKKS  QID  Q L+S   E ESE ESEAAEL+RF+S LKWICI D SNP+ ASLSC VFFVFA AVP+ASHFALSCSDCDEDHQRPFHVVVQ+SL
Subjt:  MENEEKKSPSQIDLYQPLDSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSL

Query:  SAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWL
        SAVATLSF CLS+WLR FGLNRFLFLDKL EASP+VR EY +QL+ SM ++SFFLLPCFMAEA YK+WWY+SAA EIPYY  N+YISY ISCTLELCSWL
Subjt:  SAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWL

Query:  YRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSI
        YRTSIFFFVCILFRLIC LQMIRLEDFASIF REAEVGTILMQHL LRRT TIISHRFR F+LLSLI VTASQFI+LLMTTRS AHVNLSKAGQLALCSI
Subjt:  YRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSI

Query:  SLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGET-PTASTVAVIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKA
        SLVTG+FICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLD ET PTAS ++ I  ++ DE+ DEDDLDD KLMPVFAHTI+FQKRQALV YL+NNKA
Subjt:  SLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGET-PTASTVAVIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKA

Query:  GITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI
        GITVYGF VDRTWLKSIFAIELAL LWLLNKT+G+
Subjt:  GITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI

TrEMBL top hitse value%identityAlignment
A0A0A0M3M6 Uncharacterized protein3.3e-18981.09Show/hide
Query:  MENEEKKSPSQIDLYQPLDSRKAES-ESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVS
        ME  EKKS  QI      DS+K++S E ESE EA EL++ ESFL+WICI+D SN + AS+SC +FFVF  AVPIASHF LSCSDCDEDHQRPFHVVVQ+S
Subjt:  MENEEKKSPSQIDLYQPLDSRKAES-ESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVS

Query:  LSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSW
        LSAVATLSF CLS+WLR+FGLNRFLFLDKLCEASPK+R EY +QL+ SM+++SFFLLPCFMAEA YK+WWY+SAA EIPYY  NMYISY  SCTLELCSW
Subjt:  LSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSW

Query:  LYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCS
        LYRTSIFFFVCI FRLIC LQMIRLEDFAS FR E EVGTIL+QHLGLRRT T+ISHRFRVFMLLSLI VTASQFI+LLMTTRS AH NLSK+GQLALCS
Subjt:  LYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCS

Query:  ISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGE-TPTASTVA-VIESNSDDEDG--DEDDLDDPKLMPVFAHTITFQKRQALVVYLK
        ISLVTG+FICLRSAAKITHKAQSITCLAAKWHVSAVINTFD+LD E TPTAS V  V+ESNSDDEDG  DEDDLDD KLMPVFAHTI+FQKRQALV YL+
Subjt:  ISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGE-TPTASTVA-VIESNSDDEDG--DEDDLDDPKLMPVFAHTITFQKRQALVVYLK

Query:  NNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI
        NNKAGITVYGFMVDRTWLKSIFAIELAL LWLLNKT+G+
Subjt:  NNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI

A0A1S3C949 uncharacterized protein LOC1034982317.9e-19985.45Show/hide
Query:  EEKKSPSQIDLYQPLDSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAV
        EEKKS  Q+D  +   S+ AESESESE EAAEL+RFESFLKWICI+D SN + ASLSC VFFVF  AVPIASHFALSCSDCDEDHQRPFHVVVQ+SLSAV
Subjt:  EEKKSPSQIDLYQPLDSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAV

Query:  ATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRT
        ATLSF CLS+WLRLFGLNRFLFLDKL EASPK+R EY +QL+ SME++SFFLLPCFMAEA YK+WWY+SAA EIPYY  NMYISY  SCTLELCSWLYRT
Subjt:  ATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRT

Query:  SIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLV
        SIFFFVCILFRLIC LQMIRLEDFASIFR E EVGTIL+QHLGLRRT TIISHRFRVFMLLSLI VTASQFI+LLMTTRS AHVNLSKAGQLALCSISLV
Subjt:  SIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLV

Query:  TGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGET-PTASTVA-VIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKAGI
        TG+FICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLD ET PTAS V  ++ESNSDDEDGDEDDLDDPKLMPVFAHTI+FQKRQALV YL+NNKAGI
Subjt:  TGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGET-PTASTVA-VIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKAGI

Query:  TVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI
        TVYGFMVDRTWLKSIFAIELAL LWLLNKT+G+
Subjt:  TVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI

A0A6J1GSP8 uncharacterized protein LOC1114567803.2e-19281.63Show/hide
Query:  MENEEKKSPSQIDLYQP--LDSRKA----ESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV
        ME+ EKKSPS      P  ++SRK+    ESESESES++AEL+RFESFLKWICI+D SNPF+A+LSCF+F  FA AVPIASHFALSCSDCDEDH+RPFHV
Subjt:  MENEEKKSPSQIDLYQP--LDSRKA----ESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHV

Query:  VVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTL
        VVQ+SLSAVATLSF CLS WLR  GL+RFLFLDKLCE+S K RDEYSKQLK SME+ISFFLLPCFMAEAAYK+WWYVSAA EIPYYG NMY+SY  SCTL
Subjt:  VVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTL

Query:  ELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQ
        EL SWLYRTSIFFFVCILFRL+CRLQMIRLEDF S+F RE++VGTILMQHLGLRRTLT ISHRFRVFM LSLI VTASQFI+LLMTTRS+A  NLSK GQ
Subjt:  ELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQ

Query:  LALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGETPTASTVAVIESNS-DDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVY
        LALCSISLVTG+FICLRSAAKITHKAQSITCLAAKWHVSA INTFDDLD ETPTAS +A  E NS DDE+ DEDD DD KLMPVFAHTI+FQKRQALV Y
Subjt:  LALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGETPTASTVAVIESNS-DDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVY

Query:  LKNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI
        LKNNK GITVYGF+VDRTWLKS+FAIELAL+LWLLNKT+GI
Subjt:  LKNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI

A0A6J1K314 uncharacterized protein LOC1114902097.7e-19481.57Show/hide
Query:  MENEEKKSPSQIDLYQP------LDSR----KAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQR
        ME+ EKKSPS      P      ++SR    K+ESESESES++AEL+RFESFLKWICI+D SNPF+A+LSCF+F  FA AVPIASHFALSCSDCDEDH+R
Subjt:  MENEEKKSPSQIDLYQP------LDSR----KAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQR

Query:  PFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTI
        PFHVVVQ+SLSAVATLSF CLS WLR FGL+RFLFLDKLCE+S K RDEYSKQLK SME+ISFFLLPCFMAEAAYK+WWYVSAAYEIPYYG NMY+SY  
Subjt:  PFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTI

Query:  SCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLS
        SCTLEL SWLYRTSIFFFVCILFRL+CRLQMIRLEDF S+F RE++VGTILMQHLGLRRTLTIISHRFRVFM LSLI VTASQFI LLMTTRS+A  NLS
Subjt:  SCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLS

Query:  KAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGETPTASTVAVIESNS-DDEDGDEDDLDDPKLMPVFAHTITFQKRQA
        K GQLALCSISLVTG+FICLRSAAKI+HKAQSITCLAAKWHVSA INTFDDLD ETPT S +A  E NS DDED DEDD DD KLMPVFAHTI+FQKRQA
Subjt:  KAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGETPTASTVAVIESNS-DDEDGDEDDLDDPKLMPVFAHTITFQKRQA

Query:  LVVYLKNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI
        LV YL+NNKAGITVYGF+VDRTWLKS+FAIELALVLWLLNKT+GI
Subjt:  LVVYLKNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI

A0A6J1KGZ0 uncharacterized protein LOC1114956692.6e-18679.54Show/hide
Query:  MENEEKKSPSQIDLYQPLDSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSL
        M+N +KKSPS      P DS+    ESESE EA EL+R ESFLKWIC+ DQSNP+ ASLSC +FF+FA AVP+ASHFALSCSDCDEDHQRPFHVVVQ+SL
Subjt:  MENEEKKSPSQIDLYQPLDSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSL

Query:  SAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWL
        SAVA LSF  LS+WLRLFG NRFLFLDKL +ASP+V+ EYS+QL+ SME+IS F++PCFMAEAAYKMWWY++AA +IPYY  NMY+SY  SCTLELCSWL
Subjt:  SAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWL

Query:  YRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSI
        YRTSIFFFVC+LFRLIC LQMIRLEDFAS+F RE +VGTIL+ HLGLRRT TIISHRFR F+LLSLI VTASQFI+LLMTT + AHVNLSKAGQLALCSI
Subjt:  YRTSIFFFVCILFRLICRLQMIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSI

Query:  SLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGE-TPTASTVAVIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKA
        SLVTG+FICLRSAAKITHKAQSITCLAAKWHVSAV++TFDDLD + TPTA+T   IE NSDDEDGDEDDLDD KLMPVFA TI+FQKRQALV+YL+NNKA
Subjt:  SLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTFDDLDGE-TPTASTVAVIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKA

Query:  GITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI
        GITVYGFMVDRTWLKSIFAIELAL LWLLNKT+GI
Subjt:  GITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G50630.1 Protein of unknown function (DUF3537)5.2e-11853.46Show/hide
Query:  ELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASP
        EL  F  +L+W+C +D S+P++A LS  +F VF   VP  SHF L+C+DCD  H RP+  VVQ+SLS+VAT+SF CL+ ++  +GL RFLF DKL + S 
Subjt:  ELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASP

Query:  KVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRRE
         VR  Y+ QL  S+ I+S+F++PCF A +AYK+WWY S    IP+ G N  +S T++C +ELCSWLYRT++ F VC+LFRLIC LQ++RL+DFA +F+ +
Subjt:  KVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRRE

Query:  AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSA
        ++VG+IL +HL +RR L IISHR+R F+L  LI VT SQF +LL+TT++   VN+ +AG+LALCS++LVT + I LRSA+KITHKAQ++TCLAAKWHV A
Subjt:  AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSA

Query:  VINTFD------DLDGETPTA------------STVAVIESNSDDEDGDEDDLDDPKLMPVFA-HTITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKS
         + +FD      D   ETPT               V + ES+SD+   +EDDLD+  ++PV+A  T++FQKRQALV Y +NN AGITVYGF +DR  L +
Subjt:  VINTFD------DLDGETPTA------------STVAVIESNSDDEDGDEDDLDDPKLMPVFA-HTITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKS

Query:  IFAIELALVLWLLNKTIGI
        IF +EL+LVLWLL KTIGI
Subjt:  IFAIELALVLWLLNKTIGI

AT3G20300.1 Protein of unknown function (DUF3537)1.7e-12154.85Show/hide
Query:  ELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASP
        EL  F  +L+W+C +DQS+P++A LS  +F VF   VP  SHF L+CSDCD  H RP+  VVQ+SLS+ A LSF CLS ++  +GL RFLF DKL + S 
Subjt:  ELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASP

Query:  KVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRRE
         VR  Y+ QL  S++I+S+F+ PCF+A ++YK+WWY S A +IP+ G N+ +S T++C +ELCSWLYRT++ F VC+LFRLIC LQ++RL+DFA +F+ +
Subjt:  KVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRRE

Query:  AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSA
        ++VG+IL +HL +RR L IISHR+R F+LLSLI VT SQF +LL+TT++ A +N+ +AG+LALCS++LVT + I LRSA+KITHKAQ++TCLAAKWHV A
Subjt:  AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSA

Query:  VINTFDDLDGETPTASTVAV-----------IESNSDDEDGDEDDLDDPKLMPVFAH-TITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKSIFAIELA
         I +F+ +DGETP     A             ES+S+D   +EDD D+  L+P +A+ TI+FQKRQALV Y +NN++GITV+GF +DR+ L +IF IE++
Subjt:  VINTFDDLDGETPTASTVAV-----------IESNSDDEDGDEDDLDDPKLMPVFAH-TITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKSIFAIELA

Query:  LVLWLLNKTIGI
        LVLWLL KTIGI
Subjt:  LVLWLLNKTIGI

AT4G03820.1 Protein of unknown function (DUF3537)2.9e-10549.65Show/hide
Query:  DSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLF
        ++R    +   ES A +L    SF +     DQSN     LS  +FF+ A  VP+ SHF L C+DCD  H+RP+  +VQ+SLS  A +SF  LS W + +
Subjt:  DSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLF

Query:  GLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICR
        G+ RFLF DKL + S KVR  Y  +++ SM++++ F+LP    +A Y++WWY S   +IPY   N  +S+ ++CTL+L SWLYRTS+F   CIL++ IC 
Subjt:  GLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICR

Query:  LQMIRLEDFASIFRRE-AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKIT
        LQ++RL++FA  F  E  +  +IL +HL +RR L I+SHRFR F+LLSL FVTA+QF+ LL T R++   N+ + G+LALCS SLV+G+FICL+SA ++T
Subjt:  LQMIRLEDFASIFRRE-AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKIT

Query:  HKAQSITCLAAKWHVSAVINTFDDL-DGETPTASTVA-----------VIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKAGITVYG
        HKAQS+T +A KW+V A ++TFD L DGETP   T             V++S+ DDE+G+ DD +D ++ P+FA  I+ QKRQALV YL+NN+AGITVYG
Subjt:  HKAQSITCLAAKWHVSAVINTFDDL-DGETPTASTVA-----------VIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKAGITVYG

Query:  FMVDRTWLKSIFAIELALVLWLLNKTI
        F+VD+TWL+ IF+IELAL+LWLL KTI
Subjt:  FMVDRTWLKSIFAIELALVLWLLNKTI

AT4G03820.2 Protein of unknown function (DUF3537)2.9e-10549.65Show/hide
Query:  DSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLF
        ++R    +   ES A +L    SF +     DQSN     LS  +FF+ A  VP+ SHF L C+DCD  H+RP+  +VQ+SLS  A +SF  LS W + +
Subjt:  DSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLF

Query:  GLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICR
        G+ RFLF DKL + S KVR  Y  +++ SM++++ F+LP    +A Y++WWY S   +IPY   N  +S+ ++CTL+L SWLYRTS+F   CIL++ IC 
Subjt:  GLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICR

Query:  LQMIRLEDFASIFRRE-AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKIT
        LQ++RL++FA  F  E  +  +IL +HL +RR L I+SHRFR F+LLSL FVTA+QF+ LL T R++   N+ + G+LALCS SLV+G+FICL+SA ++T
Subjt:  LQMIRLEDFASIFRRE-AEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKIT

Query:  HKAQSITCLAAKWHVSAVINTFDDL-DGETPTASTVA-----------VIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKAGITVYG
        HKAQS+T +A KW+V A ++TFD L DGETP   T             V++S+ DDE+G+ DD +D ++ P+FA  I+ QKRQALV YL+NN+AGITVYG
Subjt:  HKAQSITCLAAKWHVSAVINTFDDL-DGETPTASTVA-----------VIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKAGITVYG

Query:  FMVDRTWLKSIFAIELALVLWLLNKTI
        F+VD+TWL+ IF+IELAL+LWLL KTI
Subjt:  FMVDRTWLKSIFAIELALVLWLLNKTI

AT4G22270.1 Protein of unknown function (DUF3537)3.0e-11855.61Show/hide
Query:  SFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEY
        +F+  +   DQSN  +A LS  VFF+    VP+ SHF L CSDCD  H+RP+ V+VQ+SLS  A +SF  LSIW R FG+ RFLFLDKL + S KVR EY
Subjt:  SFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFACLSIWLRLFGLNRFLFLDKLCEASPKVRDEY

Query:  SKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRRE-AEVGT
          +++ S++ +  F+LP    EA Y++WWY+S   +IPY   N  +S+ ++CTL+L SWLYR S+F  VCIL+++ C LQ +RL+DFA  F  E  +V +
Subjt:  SKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQMIRLEDFASIFRRE-AEVGT

Query:  ILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTF
         L +H  +RR L I+SHRFR F+LLSLI VTA+QF+ LL TTR++  VN+ + G+LALCS+SLVTG+FICLRSA KITHKAQS+T LAAKW+V A +++F
Subjt:  ILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKWHVSAVINTF

Query:  DDLDGETPTASTVA--------VIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT
        D LDGETPT S +          IE++ D+E   +DDLD+ K+ P++A+TI++QKRQALV YL+NNKAGITVYGF+VDR+WL +IF IELAL+LWLLNKT
Subjt:  DDLDGETPTASTVA--------VIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKT

Query:  I
        I
Subjt:  I


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGAAAACGAGGAGAAGAAATCTCCCTCTCAAATCGATCTGTACCAACCACTCGATTCGCGAAAAGCAGAATCTGAATCTGAATCTGAATCTGAAGCGGCCGAATTGAA
GAGGTTCGAATCATTCCTGAAATGGATTTGCATAATCGATCAATCCAATCCATTCAGCGCTTCGCTCTCCTGCTTCGTCTTCTTCGTCTTCGCATTCGCCGTCCCTATCG
CATCGCACTTCGCTCTCTCTTGCTCCGATTGCGACGAAGATCACCAGAGGCCTTTTCATGTCGTCGTTCAGGTTTCTCTCTCCGCCGTTGCCACGCTTTCATTCGCTTGC
CTCTCTATTTGGCTCCGTCTCTTCGGATTAAACCGATTTCTGTTCCTCGATAAGCTTTGTGAAGCAAGTCCGAAGGTTCGGGATGAGTATTCCAAGCAATTGAAGATATC
AATGGAGATCATCTCCTTCTTCCTGCTGCCGTGTTTCATGGCAGAAGCAGCGTACAAAATGTGGTGGTACGTATCAGCAGCGTACGAAATCCCATACTACGGCAAGAACA
TGTACATAAGCTACACCATCTCGTGCACATTGGAGCTGTGCTCATGGCTTTACAGAACTTCCATCTTCTTCTTCGTGTGCATTCTGTTTCGTCTAATCTGCCGCCTGCAA
ATGATCAGACTTGAAGATTTTGCTTCTATCTTCCGTCGGGAAGCCGAGGTCGGCACCATCCTCATGCAGCATTTGGGCCTCAGAAGAACCTTGACCATCATCAGCCATCG
CTTCAGAGTCTTCATGTTGCTCTCCTTGATTTTCGTCACTGCCAGTCAGTTCATCACTCTCTTGATGACTACTAGATCTAATGCTCATGTTAACCTCTCCAAGGCTGGAC
AACTTGCGCTATGCTCCATCAGCCTGGTCACAGGCATGTTCATATGCCTCCGCAGTGCTGCAAAGATCACCCACAAAGCACAGTCCATCACATGCCTGGCAGCAAAGTGG
CACGTCTCCGCCGTCATAAACACCTTCGACGACCTTGATGGCGAGACGCCAACAGCATCTACGGTTGCGGTCATCGAATCAAACTCCGATGATGAAGACGGCGACGAGGA
CGACTTGGATGATCCCAAACTAATGCCAGTTTTTGCCCACACAATCACATTCCAAAAGAGGCAGGCACTAGTGGTGTATTTGAAGAATAATAAAGCAGGAATTACAGTGT
ATGGATTCATGGTGGACAGAACATGGCTGAAATCCATTTTTGCTATTGAACTTGCACTTGTGCTGTGGCTGCTCAACAAGACTATTGGTATTTAA
mRNA sequenceShow/hide mRNA sequence
ATGGAAAACGAGGAGAAGAAATCTCCCTCTCAAATCGATCTGTACCAACCACTCGATTCGCGAAAAGCAGAATCTGAATCTGAATCTGAATCTGAAGCGGCCGAATTGAA
GAGGTTCGAATCATTCCTGAAATGGATTTGCATAATCGATCAATCCAATCCATTCAGCGCTTCGCTCTCCTGCTTCGTCTTCTTCGTCTTCGCATTCGCCGTCCCTATCG
CATCGCACTTCGCTCTCTCTTGCTCCGATTGCGACGAAGATCACCAGAGGCCTTTTCATGTCGTCGTTCAGGTTTCTCTCTCCGCCGTTGCCACGCTTTCATTCGCTTGC
CTCTCTATTTGGCTCCGTCTCTTCGGATTAAACCGATTTCTGTTCCTCGATAAGCTTTGTGAAGCAAGTCCGAAGGTTCGGGATGAGTATTCCAAGCAATTGAAGATATC
AATGGAGATCATCTCCTTCTTCCTGCTGCCGTGTTTCATGGCAGAAGCAGCGTACAAAATGTGGTGGTACGTATCAGCAGCGTACGAAATCCCATACTACGGCAAGAACA
TGTACATAAGCTACACCATCTCGTGCACATTGGAGCTGTGCTCATGGCTTTACAGAACTTCCATCTTCTTCTTCGTGTGCATTCTGTTTCGTCTAATCTGCCGCCTGCAA
ATGATCAGACTTGAAGATTTTGCTTCTATCTTCCGTCGGGAAGCCGAGGTCGGCACCATCCTCATGCAGCATTTGGGCCTCAGAAGAACCTTGACCATCATCAGCCATCG
CTTCAGAGTCTTCATGTTGCTCTCCTTGATTTTCGTCACTGCCAGTCAGTTCATCACTCTCTTGATGACTACTAGATCTAATGCTCATGTTAACCTCTCCAAGGCTGGAC
AACTTGCGCTATGCTCCATCAGCCTGGTCACAGGCATGTTCATATGCCTCCGCAGTGCTGCAAAGATCACCCACAAAGCACAGTCCATCACATGCCTGGCAGCAAAGTGG
CACGTCTCCGCCGTCATAAACACCTTCGACGACCTTGATGGCGAGACGCCAACAGCATCTACGGTTGCGGTCATCGAATCAAACTCCGATGATGAAGACGGCGACGAGGA
CGACTTGGATGATCCCAAACTAATGCCAGTTTTTGCCCACACAATCACATTCCAAAAGAGGCAGGCACTAGTGGTGTATTTGAAGAATAATAAAGCAGGAATTACAGTGT
ATGGATTCATGGTGGACAGAACATGGCTGAAATCCATTTTTGCTATTGAACTTGCACTTGTGCTGTGGCTGCTCAACAAGACTATTGGTATTTAA
Protein sequenceShow/hide protein sequence
MENEEKKSPSQIDLYQPLDSRKAESESESESEAAELKRFESFLKWICIIDQSNPFSASLSCFVFFVFAFAVPIASHFALSCSDCDEDHQRPFHVVVQVSLSAVATLSFAC
LSIWLRLFGLNRFLFLDKLCEASPKVRDEYSKQLKISMEIISFFLLPCFMAEAAYKMWWYVSAAYEIPYYGKNMYISYTISCTLELCSWLYRTSIFFFVCILFRLICRLQ
MIRLEDFASIFRREAEVGTILMQHLGLRRTLTIISHRFRVFMLLSLIFVTASQFITLLMTTRSNAHVNLSKAGQLALCSISLVTGMFICLRSAAKITHKAQSITCLAAKW
HVSAVINTFDDLDGETPTASTVAVIESNSDDEDGDEDDLDDPKLMPVFAHTITFQKRQALVVYLKNNKAGITVYGFMVDRTWLKSIFAIELALVLWLLNKTIGI