CuGenDBv2

ClCG01G007130 (gene) of Watermelon (Charleston Gray) v2.5 genome

Gene ID: ClCG01G007130
Organism: Citrullus lanatus subsp. vulgaris cv. Charleston Gray (Watermelon (Charleston Gray) v2.5)
Description: charged multivesicular body protein 7
Genome location: CG_Chr01:8200016..8204245
RNA-Seq Expression: ClCG01G007130
Synteny: ClCG01G007130
Gene Ontology terms:
GO:0006900 - vesicle budding from membrane (biological process)
GO:0032511 - late endosome to vacuole transport via multivesicular body sorting pathway (biological process)
GO:0000815 - ESCRT III complex (cellular component)
GO:0005771 - multivesicular body (cellular component)
GO:0009898 - cytoplasmic side of plasma membrane (cellular component)
InterPro domains: IPR005024 - Snf7 family
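
The genome location in the record above is written as `seqid:start..end`. As a minimal sketch of how such a string could be split and the span of the gene computed, the following helper may be useful. The function names are hypothetical, and the length calculation assumes the coordinates are 1-based and inclusive, which the page itself does not state.

```python
import re


def parse_location(location):
    """Split a 'seqid:start..end' string into (seqid, start, end)."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", location)
    if match is None:
        raise ValueError(f"unrecognized location: {location!r}")
    return match.group(1), int(match.group(2)), int(match.group(3))


def span_length(start, end):
    """Length of the interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


seqid, start, end = parse_location("CG_Chr01:8200016..8204245")
print(seqid, start, end, span_length(start, end))
```

Under that assumption the gene spans 4230 bp on CG_Chr01. If the coordinates were 0-based or half-open, the result would differ by one.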


Homology
GenBank top hits (e value, % identity, alignment)
KAG6579154.1 Charged multivesicular body protein 7, partial [Cucurbita argyrosperma subsp. sororia]; e value: 1.3e-206; % identity: 87.21
Query:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESK   +R+FIREKVPDWD+E+V+TARFKAFSGQKSDWEPRYLFWR LILTIA QFNF+F+KPSEI NQWFSRGGLAPLCLDHVLHLM IEGDIIRR
Subjt:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPR GQLSYLFKKLSNLMG SKKN D  L DDY+VLACVLQDRAAEV+ CLS SNWTSSC+ITMVKFQNICGGPDEAT ILSYL  CGKARYLSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        +KEL+EGVK+SLSA  V GITTLDYDILHLIWTTE+LQ+QLDVIDQRYDV +QSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV
        AESTKTVSEAIQIGARVMKEHEV+WD LQHSLQELEASIDIQKQVAS IDSAPSGSI E+EDIEEEFKKLELEV AGQ L+ASTS++GVNIA G  VATV
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV

Query:  SDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMEL
        SDDSLSAALSNLKLVEETGKET  QK N KSKSK+MEL
Subjt:  SDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMEL

KGN50975.2 hypothetical protein Csa_017819 [Cucumis sativus]; e value: 1.8e-208; % identity: 87.67
Query:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESK S +R+FIREKVPDWDDEVV+TARFKAFSGQKSDWEPRYLFWR LILT+ARQFNF+ IKPSEI NQWF RGGL PLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPRSGQLSY+FKKLSNLMG SKKNPDSLLRDDY+VLACVLQDRAAEVI CLSLS+WTSSCIITMVKFQNICGGPDEATVILSYLI CGKA++LSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        KKELLEGVKVSLSATTV GIT+LDYDILHL+WT EKLQQQLDVIDQRYDV KQSAL SLKSGN+KTALKHARELKITTESREKVASL NRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV
        AE TKTVSEAIQIGARVMKEHEVNWDQLQ SLQELEAS+DIQKQVA+AIDS PS SIP+DEDIEEEFKKLELE+TAGQ L+ASTSESGVNIA GETVA V
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV

Query:  SDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMEL
         DDSLS ALSNLKLVEET KE  N   + K KSK+MEL
Subjt:  SDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMEL

XP_011654554.1 charged multivesicular body protein 7 [Cucumis sativus]; e value: 9.4e-210; % identity: 87.53
Query:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESK S +R+FIREKVPDWDDEVV+TARFKAFSGQKSDWEPRYLFWR LILT+ARQFNF+ IKPSEI NQWF RGGL PLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPRSGQLSY+FKKLSNLMG SKKNPDSLLRDDY+VLACVLQDRAAEVI CLSLS+WTSSCIITMVKFQNICGGPDEATVILSYLI CGKA++LSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        KKELLEGVKVSLSATTV GIT+LDYDILHL+WT EKLQQQLDVIDQRYDV KQSAL SLKSGN+KTALKHARELKITTESREKVASL NRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV
        AE TKTVSEAIQIGARVMKEHEVNWDQLQ SLQELEAS+DIQKQVA+AIDS PS SIP+DEDIEEEFKKLELE+TAGQ L+ASTSESGVNIA GETVA V
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV

Query:  SDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMELGIS
         DDSLS ALSNLKLVEET KE  N   + K KSK+ME+GIS
Subjt:  SDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMELGIS

XP_038875993.1 uncharacterized protein LOC120068336 isoform X1 [Benincasa hispida]; e value: 1.4e-213; % identity: 88.5
Query:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESK SR+R+FIREKVPDWDDEVV+TARFKAFSGQKSDWEPRY  WR LI+TIAR+FNF+FIKPSEI NQWFSRGGL+PLCLDHVLH+MYIEGDIIRR
Subjt:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQ---------DRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC
         DMLDPRSGQLSYLFKKLSNLMG SKKNPDSLLRDDYVVLACVLQ         DRAAEVI CLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIG 
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQ---------DRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC

Query:  GKARYLSKEKKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRV
        GKARYLSKEKKELLEGVK+SL+A TV GITTLDYDILHLIWTTEKLQQQLDVIDQRYDV +QSALASLKSGNKKTALKHARELKITTESREKVASLLNRV
Subjt:  GKARYLSKEKKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRV

Query:  EEVLNAIADAESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAI--DSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGV
        EEVLNAIADAESTKTVSEAIQIGARVMKEHEV+WDQLQ+SLQE+E SID+QKQVASAI  DSAPSGSIPEDEDIEEEFKKLELEVTAGQ L+ STSES V
Subjt:  EEVLNAIADAESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAI--DSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGV

Query:  NIAAGETVATVSDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMELGIS
        NIA GETVATVSDD LS ALSNLKLVEETG  TA QK N KSKSK+MELGIS
Subjt:  NIAAGETVATVSDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMELGIS

XP_038875996.1 uncharacterized protein LOC120068336 isoform X2 [Benincasa hispida]; e value: 6.7e-216; % identity: 90.29
Query:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESK SR+R+FIREKVPDWDDEVV+TARFKAFSGQKSDWEPRY  WR LI+TIAR+FNF+FIKPSEI NQWFSRGGL+PLCLDHVLH+MYIEGDIIRR
Subjt:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
         DMLDPRSGQLSYLFKKLSNLMG SKKNPDSLLRDDYVVLACVLQDRAAEVI CLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIG GKARYLSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        KKELLEGVK+SL+A TV GITTLDYDILHLIWTTEKLQQQLDVIDQRYDV +QSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAI--DSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVA
        AESTKTVSEAIQIGARVMKEHEV+WDQLQ+SLQE+E SID+QKQVASAI  DSAPSGSIPEDEDIEEEFKKLELEVTAGQ L+ STSES VNIA GETVA
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAI--DSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVA

Query:  TVSDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMELGIS
        TVSDD LS ALSNLKLVEETG  TA QK N KSKSK+MELGIS
Subjt:  TVSDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMELGIS

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KMY2 Uncharacterized protein; e value: 4.5e-210; % identity: 87.53
Query:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESK S +R+FIREKVPDWDDEVV+TARFKAFSGQKSDWEPRYLFWR LILT+ARQFNF+ IKPSEI NQWF RGGL PLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPRSGQLSY+FKKLSNLMG SKKNPDSLLRDDY+VLACVLQDRAAEVI CLSLS+WTSSCIITMVKFQNICGGPDEATVILSYLI CGKA++LSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        KKELLEGVKVSLSATTV GIT+LDYDILHL+WT EKLQQQLDVIDQRYDV KQSAL SLKSGN+KTALKHARELKITTESREKVASL NRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV
        AE TKTVSEAIQIGARVMKEHEVNWDQLQ SLQELEAS+DIQKQVA+AIDS PS SIP+DEDIEEEFKKLELE+TAGQ L+ASTSESGVNIA GETVA V
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV

Query:  SDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMELGIS
         DDSLS ALSNLKLVEET KE  N   + K KSK+ME+GIS
Subjt:  SDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMELGIS

A0A1S3CRA4 charged multivesicular body protein 7 isoform X1; e value: 1.1e-200; % identity: 84.16
Query:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESK S +R+FIREKV DWDDEVV+TARFKAFSGQKSDWEPRYLFWR LILT+ARQ NF+ IKPSEI NQWFSRGGL PLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPRSGQLSY+FK+LSNLMG SKKNP+SLLRDDY++LACVLQDRA EVI CLSLSNWTSS IITMVKFQNICGGPDEATVILSYLI CGKA++LSK 
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        K +LLEGVKVS SATTV GITTLDYDILHL+WT EKLQQQLD I+QRYDV KQSAL SLKSGNKK ALKHARELKITTESREKVASL NRVEEVLNAI D
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPED-EDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVAT
        AE TK+VSEAIQIGARVMKEHEVNWDQLQHSLQELE SIDIQKQVA+ IDS PS SIP D EDIEE FKKLELE+TA Q L+ASTSES VNIA GETV  
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPED-EDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVAT

Query:  VSDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMELGIS
        V DDSLS+ LSNLKLVEE  KE ANQK N K  SK+MELGIS
Subjt:  VSDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMELGIS

A0A1S3CRC5 charged multivesicular body protein 7 isoform X2; e value: 2.6e-197; % identity: 83.48
Query:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESK S +R+FIREKV DWDDEVV+TARFKAFSGQKSDWEPRYLFWR LILT+ARQ NF+ IKPSEI NQWFSRGGL PLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPRSGQLSY+FK+LSNLMG SKKNP+SLLRDDY++LACVLQDRA EVI CLSLSNWTSS IITMVKFQNICGGPDEATVILSYLI CGKA++LSK 
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        K +LLE   VS SATTV GITTLDYDILHL+WT EKLQQQLD I+QRYDV KQSAL SLKSGNKK ALKHARELKITTESREKVASL NRVEEVLNAI D
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPED-EDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVAT
        AE TK+VSEAIQIGARVMKEHEVNWDQLQHSLQELE SIDIQKQVA+ IDS PS SIP D EDIEE FKKLELE+TA Q L+ASTSES VNIA GETV  
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPED-EDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVAT

Query:  VSDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMELGIS
        V DDSLS+ LSNLKLVEE  KE ANQK N K  SK+MELGIS
Subjt:  VSDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMELGIS

A0A6J1FF90 charged multivesicular body protein 7; e value: 3.4e-205; % identity: 86.53
Query:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESK   +R+FIREKVPDWD+E+V+TARFKAFSGQKSDWEPRYLFWR LILTIA QFNF+F+KPSEI NQWFSRGGLAPLCLDHVLHLM IEGDIIRR
Subjt:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPR GQLSYLFKKLSNLMG SKKN D LL DDY+VLACVLQDRAAEV+ CLS SNWTSSC+ITMVKFQNICGGPDEAT  LSYL  CGKARYLSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        +KEL+EGVK+SLSA  V GITTLDYDILHLIWTTE+LQ+QLDVIDQRYDV +QSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV
        AESTKTVSEAIQIGAR MKEHEV+WD LQHSLQELEASIDIQKQVAS IDSAPSG I E+EDIEEEFKKLELEV AGQ L+ASTS++G NIA G  VATV
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV

Query:  SDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMEL
        SDDSLSAALSNLKLVEETGKET  QK N KSKSK+MEL
Subjt:  SDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMEL

A0A6J1K252 charged multivesicular body protein 7; e value: 1.7e-204; % identity: 86.3
Query:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESK   +R+FIREKVPDWD+EVV+TA FKAFSGQKSDWEPRYLFWR LIL I+ QFNF+FIKPSEI NQWFSRGGLAPLCLDHVLHLM IEGDIIRR
Subjt:  MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPR GQLSYLFKKLSN+MG SKKNPD LL DDY+VLACVLQDRAAEV+ CLS SNWTSSC+ITMVKFQNICGGPDEAT ILSYL  CGKARYLSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        +KEL+EGVK+SLSA  V GITTLDYDILHLIWTTE+LQ+QLDVIDQRYDV +QSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV
        AESTKTVSEAIQIGARVMKEHEV+WDQLQHSL ELEASIDIQKQV S IDSAPSGSI E+EDIEEEFKKLELEV AGQ L+A+TS++GVNIA G  VATV
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV

Query:  SDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMEL
        SDDSLSAALSNLKLV ET KET  QK N KSKSK+MEL
Subjt:  SDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMEL

SwissProt top hits (e value, % identity, alignment)
Q5ZJB7 Charged multivesicular body protein 7; e value: 1.1e-16; % identity: 24.71
Query:  PDWD-DEVVSTARFKAFSGQK----SDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSD-MLDPRSGQLS
        P+W+ D+      F AF   +    ++W+ +  FW GL+L   R+   V     E+ N  F R G  PL L  VL  +   G + R SD M    S  +S
Subjt:  PDWD-DEVVSTARFKAFSGQK----SDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSD-MLDPRSGQLS

Query:  Y---------LFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGG--PDEATVILSYLIGCGKARYLSKEK
        +         L   LS+++G SK   +    ++ ++   +LQ++A EV      S  +S  ++ + + +++C G  PDE T  L  L        L KEK
Subjt:  Y---------LFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGG--PDEATVILSYLIGCGKARYLSKEK

Query:  KELL---EGVKVSLSA----TTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEV
        K  +    G K+   A      V+ +  +D  +  L+ + + L Q+++ + Q  +  K  A ++ ++G K+ AL+  +  + T    E++ S L+ V+ +
Subjt:  KELL---EGVKVSLSA----TTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEV

Query:  LNAIADAESTKTVSEAIQ--IGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPED-EDIEEEFKKLELEVTAGQKLNASTSESGVNI
        L+ I  +++ + V  A Q  +GA  +   +V  ++ ++ + +++   D Q +VA  +  A    +  D E++E+E   L L+ +A + ++        + 
Subjt:  LNAIADAESTKTVSEAIQ--IGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPED-EDIEEEFKKLELEVTAGQKLNASTSESGVNI

Query:  AAGETVATVSDDSLSAALSNLKLVE
         AG     +SD  L A L  L + +
Subjt:  AAGETVATVSDDSLSAALSNLKLVE

Q6PBQ2 Charged multivesicular body protein 7; e value: 2.4e-19; % identity: 24.25
Query:  PDWDDEVVSTARFKAFSGQK----SDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDM-LDPRSGQLSY
        PDWDD+   +  F AF   +    +DW+ +  FW  LI+   R+   V +   ++ N+ F R G  PL L  V+  M   G + + SD   +  SG LS+
Subjt:  PDWDDEVVSTARFKAFSGQK----SDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDM-LDPRSGQLSY

Query:  -----LFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGG--PDEATVILSYLIGCGKARYLSKEKKELLE
             L + L   + A   +    L + +VV+  V +++AAE++     S  ++  +++  + +++     PDE+T+ ++ L+   + ++++    E  +
Subjt:  -----LFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGG--PDEATVILSYLIGCGKARYLSKEKKELLE

Query:  GVKVSLSAT-TVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTK
         VK S +    V+ ++ +D  I  L  + + L+++++ +    +  KQ A + LK G K  AL+  R  K   +  +++ + L  V+ +L+ IA++++ +
Subjt:  GVKVSLSAT-TVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTK

Query:  TVSEAIQIGARVMK--EHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKL
         V +A Q G   ++     V  ++ ++ + +++   D Q +V   + S    +  + ED+EEE K L
Subjt:  TVSEAIQIGARVMK--EHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKL

Arabidopsis top hits (e value, % identity, alignment)
AT3G62080.1 SNF7 family protein; e value: 2.8e-111; % identity: 51.05
Query:  IRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDMLDPRSG
        +++FIR +VPDWDDEVV+ ARFKAFSGQ+SDWE ++ FWR LI+ ++RQF    I P ++   WF RGG+ PLC+D V+ LM+ EGD++R SD+ DP SG
Subjt:  IRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDMLDPRSG

Query:  QLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKEKKELLEGVK
        +++ L + + NLM       + +L ++ +VL  +L+++AA+V+  LS  +WTS+C++T+ KF+N+C G +EA+ +LS+L GCGKA  +S  + EL+EGVK
Subjt:  QLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKEKKELLEGVK

Query:  VSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTKTVSE
        VS S T + GI+TLD DILHL+ TTEKLQ QL+V+DQR +  K+SALASLKSG++K AL+HARELK+ TESREK  SLLNRVEEVLN IAD+ESTK VSE
Subjt:  VSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTKTVSE

Query:  AIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATVSDDSLSAAL
        AI+ GARVMK+ +++ D +   L+ELE +I+ QKQV  A++SAP   I +DEDIEEE  +LE+++          SES   + A    A    DSL+   
Subjt:  AIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATVSDDSLSAAL

Query:  SNLKL--VEETGKETANQKLNFKSKSK
        S LKL   ++T +E A +    K   K
Subjt:  SNLKL--VEETGKETANQKLNFKSKSK

AT3G62080.2 SNF7 family protein; e value: 2.8e-111; % identity: 51.05
Query:  IRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDMLDPRSG
        +++FIR +VPDWDDEVV+ ARFKAFSGQ+SDWE ++ FWR LI+ ++RQF    I P ++   WF RGG+ PLC+D V+ LM+ EGD++R SD+ DP SG
Subjt:  IRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDMLDPRSG

Query:  QLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKEKKELLEGVK
        +++ L + + NLM       + +L ++ +VL  +L+++AA+V+  LS  +WTS+C++T+ KF+N+C G +EA+ +LS+L GCGKA  +S  + EL+EGVK
Subjt:  QLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKEKKELLEGVK

Query:  VSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTKTVSE
        VS S T + GI+TLD DILHL+ TTEKLQ QL+V+DQR +  K+SALASLKSG++K AL+HARELK+ TESREK  SLLNRVEEVLN IAD+ESTK VSE
Subjt:  VSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTKTVSE

Query:  AIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATVSDDSLSAAL
        AI+ GARVMK+ +++ D +   L+ELE +I+ QKQV  A++SAP   I +DEDIEEE  +LE+++          SES   + A    A    DSL+   
Subjt:  AIQIGARVMKEHEVNWDQLQHSLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATVSDDSLSAAL

Query:  SNLKL--VEETGKETANQKLNFKSKSK
        S LKL   ++T +E A +    K   K
Subjt:  SNLKL--VEETGKETANQKLNFKSKSK
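
In the alignment blocks above, the unlabeled middle line repeats a residue where the two sequences agree, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts identities and positives for a single block from that middle line. The function name is hypothetical. The reported % identity values on this page cover the full alignment and may have been computed differently, so a single block need not reproduce them.

```python
def alignment_stats(query, midline):
    """Return (identical, positives, columns) for one alignment block.

    `query` and `midline` must be the same length, with the leading
    'Query:' label and padding already stripped from both.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must have the same length")
    # A column is identical when the midline repeats the query residue.
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    # '+' marks a conservative substitution; positives include identities.
    positives = identical + midline.count("+")
    return identical, positives, len(query)


# Last block of the first GenBank hit (KAG6579154.1) on this page.
query = "SDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMEL"
midline = "SDDSLSAALSNLKLVEETGKET  QK N KSKSK+MEL"

identical, positives, columns = alignment_stats(query, midline)
print(f"{identical}/{columns} identical, {positives}/{columns} positives")
```

Summing the three counts over every block of a hit before dividing gives a whole-alignment figure. Gap columns count toward the total here, which is one common convention but not the only one.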


Sequences
CDS sequence
ATGGAAAAGGAATCAAAGAGGTCTCGTATTAGAGACTTCATCAGGGAAAAAGTCCCAGACTGGGATGACGAAGTGGTGTCTACAGCTCGGTTCAAGGCATTTAGTGGGCA
GAAATCTGATTGGGAACCCAGATACCTATTTTGGAGGGGTTTGATCCTCACAATTGCCCGCCAATTCAACTTCGTCTTCATTAAACCTTCTGAAATAACGAATCAATGGT
TTTCTCGAGGAGGGTTGGCCCCATTGTGCCTTGACCATGTTCTGCATCTAATGTATATTGAGGGCGACATTATAAGACGTAGTGACATGCTGGATCCAAGGAGTGGCCAA
CTCTCCTACTTGTTTAAAAAACTAAGCAATTTGATGGGTGCATCTAAAAAGAACCCCGACAGTTTGCTTCGTGATGATTATGTAGTTCTTGCCTGTGTATTACAGGATAG
AGCAGCTGAAGTTATCAACTGTTTATCTCTTAGTAATTGGACCTCGTCCTGCATTATTACAATGGTGAAGTTCCAAAACATCTGTGGAGGACCTGATGAAGCGACTGTTA
TCTTGAGTTACTTGATTGGATGTGGTAAAGCAAGGTATCTCTCTAAGGAAAAAAAGGAACTTCTAGAGGGTGTAAAGGTCTCTCTTTCGGCAACAACAGTTGCTGGCATC
ACAACTCTCGATTATGACATTTTGCACTTGATTTGGACAACAGAAAAGCTTCAGCAACAACTTGATGTGATTGACCAGCGCTACGATGTGTTGAAACAATCCGCACTGGC
TTCTTTGAAGTCTGGAAACAAAAAAACTGCATTGAAACATGCAAGAGAGTTGAAGATCACCACAGAAAGTCGGGAAAAAGTTGCATCTCTCTTAAACAGAGTGGAGGAAG
TCCTAAATGCTATTGCAGATGCTGAATCGACAAAAACGGTTTCTGAGGCTATTCAAATTGGTGCTCGAGTAATGAAAGAACACGAGGTTAATTGGGATCAACTCCAGCAT
AGTTTGCAGGAACTAGAAGCAAGCATTGATATACAAAAACAAGTTGCAAGTGCTATAGATTCAGCTCCATCTGGCTCAATTCCGGAAGATGAAGATATTGAGGAGGAGTT
TAAGAAGCTCGAGTTGGAAGTAACAGCAGGCCAAAAACTCAATGCGTCAACATCAGAATCTGGGGTTAATATTGCAGCTGGTGAAACAGTGGCTACAGTTTCCGACGATT
CATTGAGTGCTGCGTTATCAAATCTAAAGCTTGTTGAAGAAACAGGAAAGGAGACAGCGAACCAGAAGTTGAATTTTAAGAGCAAGTCGAAACTTATGGAGCTTGGCATT
TCTTAG
mRNA sequence
GAAATCCCCCTTTTTGGTTTCTTCTGTTCTTTTGAGTTCTGAGGCTTCGCCAAATATCGTCTTCATTTTCGACACCAGCCCTTTGTGATTGATTATGAAGTCTTCGATGT
GCCGCGTGCTTGATTCCATGCAGATTGTAGTGAGAAGAAGAGGATTACAGTTCCTCTAATTGAAGAAGCCTAGGTAATTTTACTTATCATCTACTTAAAAACTCTGCTCA
TTTTCTTTGATTGTGCTATCTCTTTTCTAGTTGATGAAAGATGAACTTTGAATTTGTATACCAGTTAATTTCCAGGCGGTTTCCTGTTAGCCCCAGGTTACAATATACAA
AACAATGGTAAAATTTGTCCCTATCTGATTCTGAGTATGCACTCAGGATATATAATGATCAAATTTGCAAATACATTAAAAACTTTAGTGACCTTGTTTGAATAGGATCC
ATTCTATTAGTTGTATGTCTGTCATTGGGTTCTAAGAATGTATTAAATGATTTATATATTGGTTGAACACTCTGGGATGGGTCAGTAATCTCCCTTTTTCATGGGGGAAA
CTAGGTAGGCGTCTCTATATTCCTTTTTGTCTGTCTGTCTCTCTCTCTCTCTCTGTAGGAATTTCTTAGATTTGAATCTTCTATATGTCTTCTCCGTTGGGTATTATCAA
CTTACGCCATTAAGTAGTGGAGAATTTTTCTCAATGTAAGGACTTCTGTCTGTATAGTCCTCAGAATTATCTTTTGCTGCGCACAGTGGAATTATTAGTTGATCAAATGA
TGTAATTTATCCCCTTTGGGTGGTATGACGTGTAAAGCATCTTATATAAGTGATTAGAAGGTCGAGGATGCTTGATGATGGACTAATAGGAAGGCCATGATCTTTTTTTG
TACTAACCTACAATTTAAAATACTCGAAAACATGTAAAGGAAGTATCCAGAATTATAATACATGTTTTTGGTAACTCATGGTTTACATAGTTAGATTGGGCTGAAAGTAG
GTTGCAGTTGCAGCTCTATTCATAGGTCTGAACAGTGCAAACATAATGGAAAAGGAATCAAAGAGGTCTCGTATTAGAGACTTCATCAGGGAAAAAGTCCCAGACTGGGA
TGACGAAGTGGTGTCTACAGCTCGGTTCAAGGCATTTAGTGGGCAGAAATCTGATTGGGAACCCAGATACCTATTTTGGAGGGGTTTGATCCTCACAATTGCCCGCCAAT
TCAACTTCGTCTTCATTAAACCTTCTGAAATAACGAATCAATGGTTTTCTCGAGGAGGGTTGGCCCCATTGTGCCTTGACCATGTTCTGCATCTAATGTATATTGAGGGC
GACATTATAAGACGTAGTGACATGCTGGATCCAAGGAGTGGCCAACTCTCCTACTTGTTTAAAAAACTAAGCAATTTGATGGGTGCATCTAAAAAGAACCCCGACAGTTT
GCTTCGTGATGATTATGTAGTTCTTGCCTGTGTATTACAGGATAGAGCAGCTGAAGTTATCAACTGTTTATCTCTTAGTAATTGGACCTCGTCCTGCATTATTACAATGG
TGAAGTTCCAAAACATCTGTGGAGGACCTGATGAAGCGACTGTTATCTTGAGTTACTTGATTGGATGTGGTAAAGCAAGGTATCTCTCTAAGGAAAAAAAGGAACTTCTA
GAGGGTGTAAAGGTCTCTCTTTCGGCAACAACAGTTGCTGGCATCACAACTCTCGATTATGACATTTTGCACTTGATTTGGACAACAGAAAAGCTTCAGCAACAACTTGA
TGTGATTGACCAGCGCTACGATGTGTTGAAACAATCCGCACTGGCTTCTTTGAAGTCTGGAAACAAAAAAACTGCATTGAAACATGCAAGAGAGTTGAAGATCACCACAG
AAAGTCGGGAAAAAGTTGCATCTCTCTTAAACAGAGTGGAGGAAGTCCTAAATGCTATTGCAGATGCTGAATCGACAAAAACGGTTTCTGAGGCTATTCAAATTGGTGCT
CGAGTAATGAAAGAACACGAGGTTAATTGGGATCAACTCCAGCATAGTTTGCAGGAACTAGAAGCAAGCATTGATATACAAAAACAAGTTGCAAGTGCTATAGATTCAGC
TCCATCTGGCTCAATTCCGGAAGATGAAGATATTGAGGAGGAGTTTAAGAAGCTCGAGTTGGAAGTAACAGCAGGCCAAAAACTCAATGCGTCAACATCAGAATCTGGGG
TTAATATTGCAGCTGGTGAAACAGTGGCTACAGTTTCCGACGATTCATTGAGTGCTGCGTTATCAAATCTAAAGCTTGTTGAAGAAACAGGAAAGGAGACAGCGAACCAG
AAGTTGAATTTTAAGAGCAAGTCGAAACTTATGGAGCTTGGCATTTCTTAGGTGCATTATCATGGAGTAGTTTACAATGGCAATGTCAGCCCAATGTAACCTTCTTCTTT
TGGTTGCCCTCTGTACTTTGTCAGTAAAAGAAAGTGAAATTGACGTACAAAATTGCTACAAATTTGCTGCTCTTTGTTCCTGAGTCAGCTATTGTATTACTGTTATTGGC
GTTTGAAGTTGAAAATGTAAGATTGAGATTGTTGTATAAACTTCAAACCGTGATTAGTTTGAAACCTCCCTTCTAAAAACAAAATGTGGTTAAATTACCAAGGGACGTGG
AAATTTGTGTAATCGAAATGAGATTTCATTGATGGATGAATTGTAATAAGCCTGAATCGC
Protein sequence
MEKESKRSRIRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDMLDPRSGQ
LSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKEKKELLEGVKVSLSATTVAGI
TTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTKTVSEAIQIGARVMKEHEVNWDQLQH
SLQELEASIDIQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATVSDDSLSAALSNLKLVEETGKETANQKLNFKSKSKLMELGI
S
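
The CDS and protein sequences above are related by translation. As a minimal sketch, assuming the standard nuclear genetic code applies to this gene, the CDS can be translated codon by codon and compared with the listed protein. The names `CODON_TABLE` and `translate` are illustrative and not part of CuGenDB.

```python
from itertools import product

# Standard genetic code, codons enumerated in TCAG order (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 36 nt and last 21 nt of the CDS shown above.
print(translate("ATGGAAAAGGAATCAAAGAGGTCTCGTATTAGAGAC"))
print(translate("ATGGAGCTTGGCATTTCTTAG"))
```

The two fragments translate to the start (`MEKESKRSRIRD`) and end (`MELGIS`) of the listed protein. Passing the whole CDS to `translate` would allow a full comparison.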