; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

ClCG05G005770 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene IDClCG05G005770
OrganismCitrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Descriptionvesicle-associated protein 1-2-like
Genome locationCG_Chr05:5652798..5670848
RNA-Seq ExpressionClCG05G005770
SyntenyClCG05G005770
Gene Ontology termsGO:0000124 - SAGA complex (cellular component)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
InterPro domainsIPR000535 - Major sperm protein (MSP) domain
IPR008962 - PapD-like superfamily
IPR013783 - Immunoglobulin-like fold
IPR016763 - Vesicle-associated membrane-protein-associated protein


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6597489.1 Vesicle-associated protein 2-2, partial [Cucurbita argyrosperma subsp. sororia]8.6e-30576.44Show/hide
Query:  MTMELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE
        MTMELFE++P ELKFTFELK QSSCLIQLINKS+QH+AFKVKTTSPKKYCVRPNTG+IKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVI  GTSEE
Subjt:  MTMELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE

Query:  DITSDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPRDRMQTGVENIPPPHKVAEDSNGLDTCKHIDELRTVDTPELLSPPYKVA
        DIT D+FAK SGKHIEEKKLKVFLVSATPPPVLLPINGELKLD N ETS+P+DRMQTGVENIPPP KVAED+NGLDT KHIDELR VD P  LSPPYKVA
Subjt:  DITSDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPRDRMQTGVENIPPPHKVAEDSNGLDTCKHIDELRTVDTPELLSPPYKVA

Query:  EGVEKIDSCKDSGENRAAVDVSTRQNEDVVAKPSENIETTPAEGIEESKLAKDLPELNLTKDFQELKSKLTLMDAELLEAEATIMRLKKERMITTQEREM
        EGVEK+D+CKDS +NR A DV+TRQNED VA+P  N+ETTP EGIEESK  KDLPELNLTKD  ELKSKLTLMD EL+EAEATIMRLK+ERM+TTQEREM
Subjt:  EGVEKIDSCKDSGENRAAVDVSTRQNEDVVAKPSENIETTPAEGIEESKLAKDLPELNLTKDFQELKSKLTLMDAELLEAEATIMRLKKERMITTQEREM

Query:  LKRDLS--------SSWILHPSLAEA--EQINGGS--------ITAVDSNFSAFSLVSCAWMIGGAFVLRCRTSSFPYIIHLLSLNVGNGRMAVMTRLLA
        LKRDL         SS +  P +  +    I+  +        ++A+ S  SAFS VS      G +  R    SF   + + S  VG+ RMAVMTRLLA
Subjt:  LKRDLS--------SSWILHPSLAEA--EQINGGS--------ITAVDSNFSAFSLVSCAWMIGGAFVLRCRTSSFPYIIHLLSLNVGNGRMAVMTRLLA

Query:  AGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GFGQGTIMDLDGGMGHRKHSR
        AGSFSRTIAEE GHQK ASEF+CRELRDADEANLIDEEDMHVFGLKPM DPLNLVCCNICKKPVKASQYIIHS      G GQG IMDLDGGMGHRKHSR
Subjt:  AGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GFGQGTIMDLDGGMGHRKHSR

Query:  KEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTKRSKLITGEGLLLASDLEPS
        KE KKLL ADANISA+EKEGSES +ADYSA+PAFPTNNQ EMVKLTKRN T  VAPI DD TGVC GVVDH+ASL HPSTKRSKLITGEGLLL SDLEPS
Subjt:  KEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTKRSKLITGEGLLLASDLEPS

Query:  SAKTKIRNVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK------------MDSQSLTS
        SAK KIR VP PLASKIYYSQRNN LRSALGYLYWEAVASSKEICN VDH++TKE++KQF N+SQEESSQEQT++IIGKK            MDS SLTS
Subjt:  SAKTKIRNVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK------------MDSQSLTS

Query:  AWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRLDNSQV
        A K D+NLAIFSSGKCLPAGGASN+FV+GSSVAWPQIAP  L   ++
Subjt:  AWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRLDNSQV

XP_008438795.1 PREDICTED: uncharacterized protein LOC103483792 isoform X2 [Cucumis melo]1.2e-16587.68Show/hide
Query:  NVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GF
        +VGNGRMAVMTRL+AAGSFSRTIA    EEVG QKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS      GF
Subjt:  NVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GF

Query:  GQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTK
        GQGTIMDLDGGMGHRKHSRKEKKKLL +DANIS VEKEGSEST AD+SAAPA P NNQ EM+KLTKRNSTC VAPILDDGTG CSGV   AAS IHPSTK
Subjt:  GQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTK

Query:  RSKLITGEGLLLASDLEPSSAKTKIRNVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKM
        RSKLITGEGLLLASDLEPSSAKTKIRNVP PLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNM+D Q+TKE+IK FH+TS+EE SQEQTSD+IG KM
Subjt:  RSKLITGEGLLLASDLEPSSAKTKIRNVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKM

Query:  DSQSLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL
        D+QSLTSAWKSDHNLA+FSSGKCLPAGGASNKFVIGSSVAWPQIAP  L
Subjt:  DSQSLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL

XP_008438798.1 PREDICTED: uncharacterized protein LOC103483792 isoform X3 [Cucumis melo]2.1e-16588.15Show/hide
Query:  NVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GFGQGT
        +VGNGRMAVMTRL+AAGSFSRTIAEEVG QKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS      GFGQGT
Subjt:  NVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GFGQGT

Query:  IMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTKRSKL
        IMDLDGGMGHRKHSRKEKKKLL +DANIS VEKEGSEST AD+SAAPA P NNQ EM+KLTKRNSTC VAPILDDGTG CSGV   AAS IHPSTKRSKL
Subjt:  IMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTKRSKL

Query:  ITGEGLLLASDLEPSSAKTKIR-NVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKMDSQ
        ITGEGLLLASDLEPSSAKTKIR +VP PLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNM+D Q+TKE+IK FH+TS+EE SQEQTSD+IG KMD+Q
Subjt:  ITGEGLLLASDLEPSSAKTKIR-NVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKMDSQ

Query:  SLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL
        SLTSAWKSDHNLA+FSSGKCLPAGGASNKFVIGSSVAWPQIAP  L
Subjt:  SLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL

XP_038900665.1 uncharacterized protein LOC120087804 isoform X2 [Benincasa hispida]1.1e-16989.77Show/hide
Query:  NVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GF
        +VGNGRMAVMTRLLAAGSFSR+IA    EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS      GF
Subjt:  NVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GF

Query:  GQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTK
        GQGTIMDLDGGMGHRKHSRKEKKK+L  DANISAVEKEGSEST+A+YS APAFP NNQ EMVKLTKRNSTCTVA ILDD TGVCS VVDHAASLIHPSTK
Subjt:  GQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTK

Query:  RSKLITGEGLLLASDLEPSSAKTKIRNVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKM
        RSKLITGEGLLLASDLEPSSAKTKIRNVP PLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQ+TKE+IK FHNTSQEESSQEQTSDIIG KM
Subjt:  RSKLITGEGLLLASDLEPSSAKTKIRNVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKM

Query:  DS---QSLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL
        DS   Q LTSAWKSDHNL IFSSGKCLPA GASNKFVIGSSVAWPQIAP  L
Subjt:  DS---QSLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL

XP_038900699.1 uncharacterized protein LOC120087804 isoform X3 [Benincasa hispida]1.9e-17190.8Show/hide
Query:  NVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GFGQGT
        +VGNGRMAVMTRLLAAGSFSR+IAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS      GFGQGT
Subjt:  NVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GFGQGT

Query:  IMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTKRSKL
        IMDLDGGMGHRKHSRKEKKK+L  DANISAVEKEGSEST+A+YS APAFP NNQ EMVKLTKRNSTCTVA ILDD TGVCS VVDHAASLIHPSTKRSKL
Subjt:  IMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTKRSKL

Query:  ITGEGLLLASDLEPSSAKTKIRNVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKMDS--
        ITGEGLLLASDLEPSSAKTKIRNVP PLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQ+TKE+IK FHNTSQEESSQEQTSDIIG KMDS  
Subjt:  ITGEGLLLASDLEPSSAKTKIRNVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKMDS--

Query:  -QSLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL
         Q LTSAWKSDHNL IFSSGKCLPA GASNKFVIGSSVAWPQIAP  L
Subjt:  -QSLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL

TrEMBL top hitse value%identityAlignment
A0A0A0L7T6 MSP domain-containing protein1.1e-16480.34Show/hide
Query:  AGGKIVQTTERERGRETRVAIVTILRNQKHFYSIFSLLSLLPFSASISTHFPLPIFSSDLSNKLLTHPPLFPLLCLLAQVRIQSPNRFNWKSIVNSAVRV
        +G KI+Q       RETRVAIVTILR QK     FSLLSLLPFS      F   I S   +N  +THP  F L  + +   IQS  RFNWK IV+SAVR+
Subjt:  AGGKIVQTTERERGRETRVAIVTILRNQKHFYSIFSLLSLLPFSASISTHFPLPIFSSDLSNKLLTHPPLFPLLCLLAQVRIQSPNRFNWKSIVNSAVRV

Query:  MTMELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE
        MTMELFE+QPAELKFTFELKKQSSCLIQLINKSEQH+AFKVKTTSPKKYCVRPNTGIIKPK+TCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE
Subjt:  MTMELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE

Query:  DITSDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPRDRMQTGVENIPPPHKVAEDSNGLDTCKHIDELRTVDTPELLSPPYKVA
        DITSDVFAKDSGKHIEEKKLKVFL SATP PVLLPINGELKLDSNHETSMPRDRMQTGVENIPPP KVAEDSNGLDT KHIDELR VDTP  LSPPYKVA
Subjt:  DITSDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPRDRMQTGVENIPPPHKVAEDSNGLDTCKHIDELRTVDTPELLSPPYKVA

Query:  EGVEKIDSCKDSGENRAAVDVSTRQNEDVVAKPSENIETTPAEGIEESKLAKDLPELNLTKDFQELKSKLTLMDAELLEAEATIMRLKKERMITTQEREM
        EGVEKID+CKD+ EN AA +V TR++E  VA+  ENIET PAEGIEESKL+KDLPELNLTKDFQELKSKL LMDAELLEAEATIMRLKKER +TTQEREM
Subjt:  EGVEKIDSCKDSGENRAAVDVSTRQNEDVVAKPSENIETTPAEGIEESKLAKDLPELNLTKDFQELKSKLTLMDAELLEAEATIMRLKKERMITTQEREM

Query:  LKRDLSS
        LKRDL +
Subjt:  LKRDLSS

A0A1S3AWW9 uncharacterized protein LOC103483792 isoform X15.5e-16487.14Show/hide
Query:  NVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GF
        +VGNGRMAVMTRL+AAGSFSRTIA    EEVG QKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS      GF
Subjt:  NVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GF

Query:  GQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTK
        GQGTIMDLDGGMGHRKHSRKEKKKLL +DANIS VEKEGSEST AD+SAAPA P NNQ EM+KLTKRNSTC VAPILDDGTG CSGV   AAS IHPSTK
Subjt:  GQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTK

Query:  RSKLITGEGLLLASDLEPSSAKTKIR-NVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK
        RSKLITGEGLLLASDLEPSSAKTKIR +VP PLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNM+D Q+TKE+IK FH+TS+EE SQEQTSD+IG K
Subjt:  RSKLITGEGLLLASDLEPSSAKTKIR-NVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKK

Query:  MDSQSLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL
        MD+QSLTSAWKSDHNLA+FSSGKCLPAGGASNKFVIGSSVAWPQIAP  L
Subjt:  MDSQSLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL

A0A1S3AX95 uncharacterized protein LOC103483792 isoform X25.9e-16687.68Show/hide
Query:  NVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GF
        +VGNGRMAVMTRL+AAGSFSRTIA    EEVG QKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS      GF
Subjt:  NVGNGRMAVMTRLLAAGSFSRTIA----EEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GF

Query:  GQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTK
        GQGTIMDLDGGMGHRKHSRKEKKKLL +DANIS VEKEGSEST AD+SAAPA P NNQ EM+KLTKRNSTC VAPILDDGTG CSGV   AAS IHPSTK
Subjt:  GQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTK

Query:  RSKLITGEGLLLASDLEPSSAKTKIRNVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKM
        RSKLITGEGLLLASDLEPSSAKTKIRNVP PLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNM+D Q+TKE+IK FH+TS+EE SQEQTSD+IG KM
Subjt:  RSKLITGEGLLLASDLEPSSAKTKIRNVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKM

Query:  DSQSLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL
        D+QSLTSAWKSDHNLA+FSSGKCLPAGGASNKFVIGSSVAWPQIAP  L
Subjt:  DSQSLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL

A0A1S3AXZ3 uncharacterized protein LOC103483792 isoform X31.0e-16588.15Show/hide
Query:  NVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GFGQGT
        +VGNGRMAVMTRL+AAGSFSRTIAEEVG QKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS      GFGQGT
Subjt:  NVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GFGQGT

Query:  IMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTKRSKL
        IMDLDGGMGHRKHSRKEKKKLL +DANIS VEKEGSEST AD+SAAPA P NNQ EM+KLTKRNSTC VAPILDDGTG CSGV   AAS IHPSTKRSKL
Subjt:  IMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTKRSKL

Query:  ITGEGLLLASDLEPSSAKTKIR-NVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKMDSQ
        ITGEGLLLASDLEPSSAKTKIR +VP PLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNM+D Q+TKE+IK FH+TS+EE SQEQTSD+IG KMD+Q
Subjt:  ITGEGLLLASDLEPSSAKTKIR-NVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKMDSQ

Query:  SLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL
        SLTSAWKSDHNLA+FSSGKCLPAGGASNKFVIGSSVAWPQIAP  L
Subjt:  SLTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRL

A0A6J1IRX4 uncharacterized protein LOC111479416 isoform X31.4e-16285.14Show/hide
Query:  NVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GFGQGT
        +VGNGRMAVMTRLLAAGSFSRTIAEEVGHQK ASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLV CNICKKPVKASQYIIHS      G GQGT
Subjt:  NVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKASQYIIHS------GFGQGT

Query:  IMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTKRSKL
        IMDLD GMGHRKHSRKEKKKLL ADAN SAVEKEGSEST+ADYS+A  FP +N+ EMVKLTKRNSTCTVAPILDD  GVC GVVDH+ S IHPSTKRSKL
Subjt:  IMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTKRSKL

Query:  ITGEGLLLASDLEPSSAKTKIRNVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKMDSQS
        ITGEGLLLASDLEPSS+KTKI+N P PLASKIYYSQRNNRLRS L YLYWEAV+SSKEICNMVDH +TKE+IKQFH+TSQEE SQEQ+SD+IGKKMDS S
Subjt:  ITGEGLLLASDLEPSSAKTKIRNVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKMDSQS

Query:  LTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRLDNSQV
        LTSAWKSD NLAIFSSGKCLPAGGAS KFV GSSVAWPQIAP  L   ++
Subjt:  LTSAWKSDHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRLDNSQV

SwissProt top hitse value%identityAlignment
B9DHD7 Vesicle-associated protein 2-26.0e-5136.62Show/hide
Query:  MTMELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE
        M M L ++QP  L+F  +LKKQ+SC++QL N +  +VAFKVKTTSPKKYCVRPN G++ PK TC+FTV M A +  PPDM CKDKFL+Q T +S  T++E
Subjt:  MTMELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE

Query:  DITSDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPRDRMQTGVENIPPPHKVAEDSN-----GLDTCKHIDELRTVDTPELLSP
        DIT+ +F+K  GKHIEE KL+V LV  +  P L PIN   K  +  E S+ +DR+ +  E + PP    E        G D  K  D  + + TP++ + 
Subjt:  DITSDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPRDRMQTGVENIPPPHKVAEDSN-----GLDTCKHIDELRTVDTPELLSP

Query:  PY--------------------KVAE--GVEKIDSCKDSGENRA-------------AVDVSTRQ---------NEDVVAKPSENIETTPAEGIEESKLA
         +                    ++A+  G + I S KD+ + RA             A+D+   Q         +E  ++K  + ++    +G    +  
Subjt:  PY--------------------KVAE--GVEKIDSCKDSGENRA-------------AVDVSTRQ---------NEDVVAKPSENIETTPAEGIEESKLA

Query:  KDLPELNLTKDFQELKSKLTLMDAELLEAEATIMRLKKERMITTQEREMLKRDLS
        + L EL L KD +E+K K+  ++++L +A++TI +L +ER I++Q R+ L+ +L+
Subjt:  KDLPELNLTKDFQELKSKLTLMDAELLEAEATIMRLKKERMITTQEREMLKRDLS

Q84WW5 Vesicle-associated protein 1-38.5e-3758.46Show/hide
Query:  TMELFEVQPAELKFTFELKKQSSCLIQLINK-SEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE
        T +L  + P ELKF FELKKQSSC +QL NK + Q VAFKVKTT+P+KYCVRPNTG++ P D+C+ TVTM AQ+ AP DMQCKDKFLVQ  V+S GT+ +
Subjt:  TMELFEVQPAELKFTFELKKQSSCLIQLINK-SEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE

Query:  DITSDVFAKDSGKHIEEKKLKVFLVSATPP
        ++ +++F K++G+ IE+ KL+V  + A PP
Subjt:  DITSDVFAKDSGKHIEEKKLKVFLVSATPP

Q8VZ95 Vesicle-associated protein 1-11.0e-3751.68Show/hide
Query:  ELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEEDIT
        EL  V+P +L+F FELKKQ SC + L NK++ +VAFKVKTT+PKKYCVRPNTG++ P+ TC+  VTM AQ+ AP DMQCKDKFL+QG + SPG + +++T
Subjt:  ELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEEDIT

Query:  SDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPR
         ++F+K++G  +EE KL+V  V+   PP   P++     + + E S PR
Subjt:  SDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPR

Q9LVU1 Vesicle-associated protein 2-13.6e-3558.54Show/hide
Query:  ELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEEDIT
        +L  +QP ELKF FEL+KQS C +++ NK+E +VAFKVKTTSPKKY VRPNTG+I+P D+C   VT+ AQR  PPDMQCKDKFL+Q T++ P T  +++ 
Subjt:  ELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEEDIT

Query:  SDVFAKDSGKHIEEKKLKVFLVS
         D F KDSGK + E KLKV  ++
Subjt:  SDVFAKDSGKHIEEKKLKVFLVS

Q9SHC8 Vesicle-associated protein 1-22.1e-3553.44Show/hide
Query:  MTMELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE
        M+ EL  + P +L+F FELKKQ SC + L NK++ +VAFKVKTT+PKKYCVRPNTG++ P+ + +  VTM AQ+ AP D+QCKDKFL+Q  V SPG + +
Subjt:  MTMELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE

Query:  DITSDVFAKDSGKHIEEKKLKVFLVSATPPP
        D+T ++F+K++G  +EE KL+V  V+   PP
Subjt:  DITSDVFAKDSGKHIEEKKLKVFLVSATPPP

Arabidopsis top hitse value%identityAlignment
AT1G08820.1 vamp/synaptobrevin-associated protein 27-24.3e-5236.62Show/hide
Query:  MTMELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE
        M M L ++QP  L+F  +LKKQ+SC++QL N +  +VAFKVKTTSPKKYCVRPN G++ PK TC+FTV M A +  PPDM CKDKFL+Q T +S  T++E
Subjt:  MTMELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE

Query:  DITSDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPRDRMQTGVENIPPPHKVAEDSN-----GLDTCKHIDELRTVDTPELLSP
        DIT+ +F+K  GKHIEE KL+V LV  +  P L PIN   K  +  E S+ +DR+ +  E + PP    E        G D  K  D  + + TP++ + 
Subjt:  DITSDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPRDRMQTGVENIPPPHKVAEDSN-----GLDTCKHIDELRTVDTPELLSP

Query:  PY--------------------KVAE--GVEKIDSCKDSGENRA-------------AVDVSTRQ---------NEDVVAKPSENIETTPAEGIEESKLA
         +                    ++A+  G + I S KD+ + RA             A+D+   Q         +E  ++K  + ++    +G    +  
Subjt:  PY--------------------KVAE--GVEKIDSCKDSGENRA-------------AVDVSTRQ---------NEDVVAKPSENIETTPAEGIEESKLA

Query:  KDLPELNLTKDFQELKSKLTLMDAELLEAEATIMRLKKERMITTQEREMLKRDLS
        + L EL L KD +E+K K+  ++++L +A++TI +L +ER I++Q R+ L+ +L+
Subjt:  KDLPELNLTKDFQELKSKLTLMDAELLEAEATIMRLKKERMITTQEREMLKRDLS

AT1G08820.2 vamp/synaptobrevin-associated protein 27-24.3e-5236.62Show/hide
Query:  MTMELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE
        M M L ++QP  L+F  +LKKQ+SC++QL N +  +VAFKVKTTSPKKYCVRPN G++ PK TC+FTV M A +  PPDM CKDKFL+Q T +S  T++E
Subjt:  MTMELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEE

Query:  DITSDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPRDRMQTGVENIPPPHKVAEDSN-----GLDTCKHIDELRTVDTPELLSP
        DIT+ +F+K  GKHIEE KL+V LV  +  P L PIN   K  +  E S+ +DR+ +  E + PP    E        G D  K  D  + + TP++ + 
Subjt:  DITSDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPRDRMQTGVENIPPPHKVAEDSN-----GLDTCKHIDELRTVDTPELLSP

Query:  PY--------------------KVAE--GVEKIDSCKDSGENRA-------------AVDVSTRQ---------NEDVVAKPSENIETTPAEGIEESKLA
         +                    ++A+  G + I S KD+ + RA             A+D+   Q         +E  ++K  + ++    +G    +  
Subjt:  PY--------------------KVAE--GVEKIDSCKDSGENRA-------------AVDVSTRQ---------NEDVVAKPSENIETTPAEGIEESKLA

Query:  KDLPELNLTKDFQELKSKLTLMDAELLEAEATIMRLKKERMITTQEREMLKRDLS
        + L EL L KD +E+K K+  ++++L +A++TI +L +ER I++Q R+ L+ +L+
Subjt:  KDLPELNLTKDFQELKSKLTLMDAELLEAEATIMRLKKERMITTQEREMLKRDLS

AT3G60600.1 vesicle associated protein7.1e-3951.68Show/hide
Query:  ELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEEDIT
        EL  V+P +L+F FELKKQ SC + L NK++ +VAFKVKTT+PKKYCVRPNTG++ P+ TC+  VTM AQ+ AP DMQCKDKFL+QG + SPG + +++T
Subjt:  ELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEEDIT

Query:  SDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPR
         ++F+K++G  +EE KL+V  V+   PP   P++     + + E S PR
Subjt:  SDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPR

AT3G60600.2 vesicle associated protein7.1e-3951.68Show/hide
Query:  ELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEEDIT
        EL  V+P +L+F FELKKQ SC + L NK++ +VAFKVKTT+PKKYCVRPNTG++ P+ TC+  VTM AQ+ AP DMQCKDKFL+QG + SPG + +++T
Subjt:  ELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEEDIT

Query:  SDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPR
         ++F+K++G  +EE KL+V  V+   PP   P++     + + E S PR
Subjt:  SDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPR

AT3G60600.3 vesicle associated protein7.1e-3951.68Show/hide
Query:  ELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEEDIT
        EL  V+P +L+F FELKKQ SC + L NK++ +VAFKVKTT+PKKYCVRPNTG++ P+ TC+  VTM AQ+ AP DMQCKDKFL+QG + SPG + +++T
Subjt:  ELFEVQPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEEDIT

Query:  SDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPR
         ++F+K++G  +EE KL+V  V+   PP   P++     + + E S PR
Subjt:  SDVFAKDSGKHIEEKKLKVFLVSATPPPVLLPINGELKLDSNHETSMPR


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGAGAGCGGGGGGAAAGATAGTACAGACTACGGAGAGAGAGAGAGGGAGAGAGACCCGCGTAGCCATCGTTACCATTTTGCGAAACCAAAAACATTTTTACTCCATTTT
TTCTTTGCTTTCACTTCTTCCCTTCTCTGCTTCCATTTCCACTCATTTTCCTCTTCCAATTTTCAGCTCAGATCTCTCCAACAAACTCCTCACTCACCCACCCCTTTTCC
CTCTCCTCTGTTTGCTCGCGCAAGTTCGAATTCAATCCCCTAATCGCTTCAACTGGAAGTCTATCGTTAATTCAGCGGTTAGAGTCATGACTATGGAGCTTTTCGAAGTT
CAACCGGCCGAACTTAAATTCACTTTTGAGTTGAAGAAGCAAAGTTCATGCTTGATACAACTTATCAATAAGTCCGAGCAACATGTTGCTTTCAAGGTAAAAACTACATC
TCCCAAGAAGTACTGTGTGCGACCCAATACTGGCATCATTAAGCCCAAGGATACGTGTGACTTTACAGTCACCATGCTAGCTCAGCGTACGGCTCCACCTGATATGCAAT
GCAAAGACAAATTCCTGGTCCAAGGCACAGTCATCTCCCCTGGAACATCTGAAGAGGACATCACATCTGACGTGTTTGCAAAAGATAGTGGCAAGCACATTGAAGAGAAG
AAGCTGAAGGTTTTTCTAGTGAGCGCAACACCACCTCCTGTTTTATTACCAATTAATGGTGAGTTGAAACTAGATTCAAATCATGAAACTTCCATGCCAAGAGATAGAAT
GCAGACAGGAGTTGAGAACATACCTCCACCTCATAAGGTTGCAGAAGACTCCAATGGACTTGATACCTGCAAACACATAGATGAACTTAGAACAGTTGATACCCCAGAAT
TGTTATCTCCACCTTATAAGGTGGCTGAGGGAGTTGAGAAAATTGATAGTTGTAAAGATTCAGGTGAGAATAGAGCAGCTGTGGATGTTTCAACAAGACAAAATGAGGAT
GTGGTGGCCAAACCAAGTGAAAATATTGAAACCACGCCAGCTGAGGGAATTGAGGAATCAAAACTGGCGAAGGATCTACCAGAGTTAAACTTAACTAAAGACTTTCAAGA
GCTGAAATCAAAGCTTACTCTCATGGATGCAGAACTACTAGAGGCTGAAGCTACCATAATGAGGCTGAAAAAGGAGAGGATGATAACAACTCAGGAAAGGGAAATGCTCA
AGCGTGACTTGTCTTCTTCTTGGATACTTCATCCATCCCTAGCAGAAGCAGAACAAATCAATGGGGGTTCCATCACAGCAGTTGATTCGAATTTCTCCGCTTTCTCGCTG
GTTTCATGCGCCTGGATGATTGGAGGAGCTTTCGTGCTGCGGTGTAGAACCTCTTCCTTCCCGTACATAATACATTTGCTTAGTCTCAATGTTGGAAATGGGAGAATGGC
AGTGATGACAAGGCTTCTGGCTGCTGGGAGTTTCTCTCGTACCATTGCAGAGGAAGTTGGTCACCAGAAATTTGCTTCTGAATTTATCTGCCGAGAACTTCGTGATGCAG
ATGAAGCAAATTTAATTGATGAGGAAGATATGCACGTTTTTGGTTTGAAGCCTATGGTTGATCCTCTGAACTTGGTTTGCTGCAATATTTGTAAGAAGCCAGTAAAGGCC
AGTCAATATATCATTCATTCAGGTTTTGGACAAGGAACTATAATGGACCTTGATGGTGGGATGGGTCATAGAAAACACTCAAGGAAGGAGAAGAAAAAGTTACTACTTGC
TGATGCTAATATATCAGCTGTGGAGAAAGAAGGGTCTGAATCAACATTTGCTGACTATTCTGCTGCACCTGCATTTCCAACTAATAACCAACTTGAAATGGTCAAGTTGA
CAAAAAGAAATTCAACTTGTACTGTGGCACCTATACTGGATGATGGTACAGGAGTCTGTTCTGGTGTTGTAGACCATGCAGCTAGTCTCATACATCCTTCGACAAAGCGG
TCCAAATTGATAACTGGTGAAGGGCTGTTACTGGCATCTGATTTAGAACCATCGTCAGCTAAAACAAAAATTAGAAATGTTCCGATTCCCCTTGCAAGTAAAATATATTA
CTCTCAGAGAAATAATCGTCTGCGCTCGGCTCTTGGTTATCTTTACTGGGAGGCTGTTGCATCTAGCAAGGAAATTTGTAATATGGTGGATCATCAAATAACAAAGGAAG
ATATAAAACAATTTCACAATACTTCCCAGGAGGAGTCGTCTCAAGAACAAACAAGTGACATTATTGGAAAGAAGATGGATAGTCAGTCCTTAACCTCTGCATGGAAATCT
GACCATAATCTGGCCATATTCTCATCTGGCAAATGTCTGCCTGCCGGTGGTGCCTCAAATAAGTTTGTTATTGGCAGCAGTGTCGCATGGCCGCAGATTGCTCCAGATAG
ACTAGACAACAGCCAAGTGGAAGTATTCCTGTTGTATAGAAGTCTTGGTGGCACTTTTTTTGTAGGCACAGTGGTACATGTAATTAACCCTAAAACTTGA
mRNA sequenceShow/hide mRNA sequence
ATGAGAGCGGGGGGAAAGATAGTACAGACTACGGAGAGAGAGAGAGGGAGAGAGACCCGCGTAGCCATCGTTACCATTTTGCGAAACCAAAAACATTTTTACTCCATTTT
TTCTTTGCTTTCACTTCTTCCCTTCTCTGCTTCCATTTCCACTCATTTTCCTCTTCCAATTTTCAGCTCAGATCTCTCCAACAAACTCCTCACTCACCCACCCCTTTTCC
CTCTCCTCTGTTTGCTCGCGCAAGTTCGAATTCAATCCCCTAATCGCTTCAACTGGAAGTCTATCGTTAATTCAGCGGTTAGAGTCATGACTATGGAGCTTTTCGAAGTT
CAACCGGCCGAACTTAAATTCACTTTTGAGTTGAAGAAGCAAAGTTCATGCTTGATACAACTTATCAATAAGTCCGAGCAACATGTTGCTTTCAAGGTAAAAACTACATC
TCCCAAGAAGTACTGTGTGCGACCCAATACTGGCATCATTAAGCCCAAGGATACGTGTGACTTTACAGTCACCATGCTAGCTCAGCGTACGGCTCCACCTGATATGCAAT
GCAAAGACAAATTCCTGGTCCAAGGCACAGTCATCTCCCCTGGAACATCTGAAGAGGACATCACATCTGACGTGTTTGCAAAAGATAGTGGCAAGCACATTGAAGAGAAG
AAGCTGAAGGTTTTTCTAGTGAGCGCAACACCACCTCCTGTTTTATTACCAATTAATGGTGAGTTGAAACTAGATTCAAATCATGAAACTTCCATGCCAAGAGATAGAAT
GCAGACAGGAGTTGAGAACATACCTCCACCTCATAAGGTTGCAGAAGACTCCAATGGACTTGATACCTGCAAACACATAGATGAACTTAGAACAGTTGATACCCCAGAAT
TGTTATCTCCACCTTATAAGGTGGCTGAGGGAGTTGAGAAAATTGATAGTTGTAAAGATTCAGGTGAGAATAGAGCAGCTGTGGATGTTTCAACAAGACAAAATGAGGAT
GTGGTGGCCAAACCAAGTGAAAATATTGAAACCACGCCAGCTGAGGGAATTGAGGAATCAAAACTGGCGAAGGATCTACCAGAGTTAAACTTAACTAAAGACTTTCAAGA
GCTGAAATCAAAGCTTACTCTCATGGATGCAGAACTACTAGAGGCTGAAGCTACCATAATGAGGCTGAAAAAGGAGAGGATGATAACAACTCAGGAAAGGGAAATGCTCA
AGCGTGACTTGTCTTCTTCTTGGATACTTCATCCATCCCTAGCAGAAGCAGAACAAATCAATGGGGGTTCCATCACAGCAGTTGATTCGAATTTCTCCGCTTTCTCGCTG
GTTTCATGCGCCTGGATGATTGGAGGAGCTTTCGTGCTGCGGTGTAGAACCTCTTCCTTCCCGTACATAATACATTTGCTTAGTCTCAATGTTGGAAATGGGAGAATGGC
AGTGATGACAAGGCTTCTGGCTGCTGGGAGTTTCTCTCGTACCATTGCAGAGGAAGTTGGTCACCAGAAATTTGCTTCTGAATTTATCTGCCGAGAACTTCGTGATGCAG
ATGAAGCAAATTTAATTGATGAGGAAGATATGCACGTTTTTGGTTTGAAGCCTATGGTTGATCCTCTGAACTTGGTTTGCTGCAATATTTGTAAGAAGCCAGTAAAGGCC
AGTCAATATATCATTCATTCAGGTTTTGGACAAGGAACTATAATGGACCTTGATGGTGGGATGGGTCATAGAAAACACTCAAGGAAGGAGAAGAAAAAGTTACTACTTGC
TGATGCTAATATATCAGCTGTGGAGAAAGAAGGGTCTGAATCAACATTTGCTGACTATTCTGCTGCACCTGCATTTCCAACTAATAACCAACTTGAAATGGTCAAGTTGA
CAAAAAGAAATTCAACTTGTACTGTGGCACCTATACTGGATGATGGTACAGGAGTCTGTTCTGGTGTTGTAGACCATGCAGCTAGTCTCATACATCCTTCGACAAAGCGG
TCCAAATTGATAACTGGTGAAGGGCTGTTACTGGCATCTGATTTAGAACCATCGTCAGCTAAAACAAAAATTAGAAATGTTCCGATTCCCCTTGCAAGTAAAATATATTA
CTCTCAGAGAAATAATCGTCTGCGCTCGGCTCTTGGTTATCTTTACTGGGAGGCTGTTGCATCTAGCAAGGAAATTTGTAATATGGTGGATCATCAAATAACAAAGGAAG
ATATAAAACAATTTCACAATACTTCCCAGGAGGAGTCGTCTCAAGAACAAACAAGTGACATTATTGGAAAGAAGATGGATAGTCAGTCCTTAACCTCTGCATGGAAATCT
GACCATAATCTGGCCATATTCTCATCTGGCAAATGTCTGCCTGCCGGTGGTGCCTCAAATAAGTTTGTTATTGGCAGCAGTGTCGCATGGCCGCAGATTGCTCCAGATAG
ACTAGACAACAGCCAAGTGGAAGTATTCCTGTTGTATAGAAGTCTTGGTGGCACTTTTTTTGTAGGCACAGTGGTACATGTAATTAACCCTAAAACTTGAGGTTAGGACG
AAGACTCTGTATTTTGATATTGGG
Protein sequenceShow/hide protein sequence
MRAGGKIVQTTERERGRETRVAIVTILRNQKHFYSIFSLLSLLPFSASISTHFPLPIFSSDLSNKLLTHPPLFPLLCLLAQVRIQSPNRFNWKSIVNSAVRVMTMELFEV
QPAELKFTFELKKQSSCLIQLINKSEQHVAFKVKTTSPKKYCVRPNTGIIKPKDTCDFTVTMLAQRTAPPDMQCKDKFLVQGTVISPGTSEEDITSDVFAKDSGKHIEEK
KLKVFLVSATPPPVLLPINGELKLDSNHETSMPRDRMQTGVENIPPPHKVAEDSNGLDTCKHIDELRTVDTPELLSPPYKVAEGVEKIDSCKDSGENRAAVDVSTRQNED
VVAKPSENIETTPAEGIEESKLAKDLPELNLTKDFQELKSKLTLMDAELLEAEATIMRLKKERMITTQEREMLKRDLSSSWILHPSLAEAEQINGGSITAVDSNFSAFSL
VSCAWMIGGAFVLRCRTSSFPYIIHLLSLNVGNGRMAVMTRLLAAGSFSRTIAEEVGHQKFASEFICRELRDADEANLIDEEDMHVFGLKPMVDPLNLVCCNICKKPVKA
SQYIIHSGFGQGTIMDLDGGMGHRKHSRKEKKKLLLADANISAVEKEGSESTFADYSAAPAFPTNNQLEMVKLTKRNSTCTVAPILDDGTGVCSGVVDHAASLIHPSTKR
SKLITGEGLLLASDLEPSSAKTKIRNVPIPLASKIYYSQRNNRLRSALGYLYWEAVASSKEICNMVDHQITKEDIKQFHNTSQEESSQEQTSDIIGKKMDSQSLTSAWKS
DHNLAIFSSGKCLPAGGASNKFVIGSSVAWPQIAPDRLDNSQVEVFLLYRSLGGTFFVGTVVHVINPKT