CuGenDBv2

CaUC01G007880 (gene) of Watermelon (USVL246-FR2) v1 genome

Gene ID: CaUC01G007880
Organism: Citrullus amarus (Watermelon (USVL246-FR2) v1)
Description: charged multivesicular body protein 7
Genome location: Ciama_Chr01:8902564..8906783
RNA-Seq Expression: CaUC01G007880
Synteny: CaUC01G007880
Gene Ontology terms:
GO:0006900 - vesicle budding from membrane (biological process)
GO:0032511 - late endosome to vacuole transport via multivesicular body sorting pathway (biological process)
GO:0000815 - ESCRT III complex (cellular component)
GO:0005771 - multivesicular body (cellular component)
GO:0009898 - cytoplasmic side of plasma membrane (cellular component)
InterPro domains: IPR005024 - Snf7 family
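
The genome location above follows a `sequence:start..end` pattern. As a minimal sketch, such a string could be split into its parts as follows; the names `Location` and `parse_location` are hypothetical, and the span calculation assumes 1-based inclusive coordinates, which this page does not state.

```python
import re
from typing import NamedTuple


class Location(NamedTuple):
    seqid: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive (not stated on the page).
        return self.end - self.start + 1


def parse_location(text: str) -> Location:
    """Parse a 'seqid:start..end' string into a Location."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location: {text!r}")
    seqid, start, end = match.groups()
    return Location(seqid, int(start), int(end))


loc = parse_location("Ciama_Chr01:8902564..8906783")
print(loc.seqid, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene model spans 4220 bp on Ciama_Chr01.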


Homology
GenBank top hits (columns: e value, % identity, alignment)
KAG6579154.1 Charged multivesicular body protein 7, partial [Cucurbita argyrosperma subsp. sororia]; e value: 6.3e-206; % identity: 87.21
Query:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESK   VR+FIREKVPDWD+E+V+TARFKAFSGQKSDWEPRYLFWR LILTIA QFNF+F+KPSEI NQWFSRGGLAPLCLDHVLHLM IEGDIIRR
Subjt:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPR GQLSYLFKKLSNLMG SKKN D  L DDY+VLACVLQDRAAEV+ CLS SNWTSSC+ITMVKFQNICGGPDEAT ILSYL  CGKARYLSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        +KEL+EGVK+SLSA  V GITTLDYDILHLIWTTE+LQ+QLDVIDQRYDV +QSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV
        AESTKTVSEAIQIGARVMKEHEV+WD LQHSLQELEASID QKQVAS IDSAPSGSI E+EDIEEEFKKLELEV AGQ L+ASTS++GVNIA G  VATV
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV

Query:  SDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMEL
        SDDSLSAALSNLKLVEETGKET   KSN KSKSK+MEL
Subjt:  SDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMEL

KGN50975.2 hypothetical protein Csa_017819 [Cucumis sativus]; e value: 2.7e-209; % identity: 88.13
Query:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESKGS VR+FIREKVPDWDDEVV+TARFKAFSGQKSDWEPRYLFWR LILT+ARQFNF+ IKPSEI NQWF RGGL PLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPRSGQLSY+FKKLSNLMG SKKNPDSLLRDDY+VLACVLQDRAAEVI CLSLS+WTSSCIITMVKFQNICGGPDEATVILSYLI CGKA++LSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        KKELLEGVKVSLSATTV GIT+LDYDILHL+WT EKLQQQLDVIDQRYDV KQSAL SLKSGN+KTALKHARELKITTESREKVASL NRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV
        AE TKTVSEAIQIGARVMKEHEVNWDQLQ SLQELEAS+D QKQVA+AIDS PS SIP+DEDIEEEFKKLELE+TAGQ L+ASTSESGVNIA GETVA V
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV

Query:  SDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMEL
         DDSLS ALSNLKLVEET KE  N  S+ K KSK+MEL
Subjt:  SDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMEL

XP_011654554.1 charged multivesicular body protein 7 [Cucumis sativus]; e value: 1.4e-210; % identity: 87.98
Query:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESKGS VR+FIREKVPDWDDEVV+TARFKAFSGQKSDWEPRYLFWR LILT+ARQFNF+ IKPSEI NQWF RGGL PLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPRSGQLSY+FKKLSNLMG SKKNPDSLLRDDY+VLACVLQDRAAEVI CLSLS+WTSSCIITMVKFQNICGGPDEATVILSYLI CGKA++LSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        KKELLEGVKVSLSATTV GIT+LDYDILHL+WT EKLQQQLDVIDQRYDV KQSAL SLKSGN+KTALKHARELKITTESREKVASL NRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV
        AE TKTVSEAIQIGARVMKEHEVNWDQLQ SLQELEAS+D QKQVA+AIDS PS SIP+DEDIEEEFKKLELE+TAGQ L+ASTSESGVNIA GETVA V
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV

Query:  SDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMELGIS
         DDSLS ALSNLKLVEET KE  N  S+ K KSK+ME+GIS
Subjt:  SDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMELGIS

XP_038875993.1 uncharacterized protein LOC120068336 isoform X1 [Benincasa hispida]; e value: 2.2e-214; % identity: 88.94
Query:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESKGSRVR+FIREKVPDWDDEVV+TARFKAFSGQKSDWEPRY  WR LI+TIAR+FNF+FIKPSEI NQWFSRGGL+PLCLDHVLH+MYIEGDIIRR
Subjt:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQ---------DRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC
         DMLDPRSGQLSYLFKKLSNLMG SKKNPDSLLRDDYVVLACVLQ         DRAAEVI CLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIG 
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQ---------DRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGC

Query:  GKARYLSKEKKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRV
        GKARYLSKEKKELLEGVK+SL+A TV GITTLDYDILHLIWTTEKLQQQLDVIDQRYDV +QSALASLKSGNKKTALKHARELKITTESREKVASLLNRV
Subjt:  GKARYLSKEKKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRV

Query:  EEVLNAIADAESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAI--DSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGV
        EEVLNAIADAESTKTVSEAIQIGARVMKEHEV+WDQLQ+SLQE+E SID QKQVASAI  DSAPSGSIPEDEDIEEEFKKLELEVTAGQ L+ STSES V
Subjt:  EEVLNAIADAESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAI--DSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGV

Query:  NIAAGETVATVSDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMELGIS
        NIA GETVATVSDD LS ALSNLKLVEETG  TA  KSN KSKSK+MELGIS
Subjt:  NIAAGETVATVSDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMELGIS

XP_038875996.1 uncharacterized protein LOC120068336 isoform X2 [Benincasa hispida]; e value: 1.0e-216; % identity: 90.74
Query:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESKGSRVR+FIREKVPDWDDEVV+TARFKAFSGQKSDWEPRY  WR LI+TIAR+FNF+FIKPSEI NQWFSRGGL+PLCLDHVLH+MYIEGDIIRR
Subjt:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
         DMLDPRSGQLSYLFKKLSNLMG SKKNPDSLLRDDYVVLACVLQDRAAEVI CLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIG GKARYLSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        KKELLEGVK+SL+A TV GITTLDYDILHLIWTTEKLQQQLDVIDQRYDV +QSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAI--DSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVA
        AESTKTVSEAIQIGARVMKEHEV+WDQLQ+SLQE+E SID QKQVASAI  DSAPSGSIPEDEDIEEEFKKLELEVTAGQ L+ STSES VNIA GETVA
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAI--DSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVA

Query:  TVSDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMELGIS
        TVSDD LS ALSNLKLVEETG  TA  KSN KSKSK+MELGIS
Subjt:  TVSDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMELGIS

TrEMBL top hits (columns: e value, % identity, alignment)
A0A0A0KMY2 Uncharacterized protein; e value: 7.0e-211; % identity: 87.98
Query:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESKGS VR+FIREKVPDWDDEVV+TARFKAFSGQKSDWEPRYLFWR LILT+ARQFNF+ IKPSEI NQWF RGGL PLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPRSGQLSY+FKKLSNLMG SKKNPDSLLRDDY+VLACVLQDRAAEVI CLSLS+WTSSCIITMVKFQNICGGPDEATVILSYLI CGKA++LSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        KKELLEGVKVSLSATTV GIT+LDYDILHL+WT EKLQQQLDVIDQRYDV KQSAL SLKSGN+KTALKHARELKITTESREKVASL NRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV
        AE TKTVSEAIQIGARVMKEHEVNWDQLQ SLQELEAS+D QKQVA+AIDS PS SIP+DEDIEEEFKKLELE+TAGQ L+ASTSESGVNIA GETVA V
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV

Query:  SDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMELGIS
         DDSLS ALSNLKLVEET KE  N  S+ K KSK+ME+GIS
Subjt:  SDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMELGIS

A0A1S3CRA4 charged multivesicular body protein 7 isoform X1; e value: 3.8e-201; % identity: 84.39
Query:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESKGS VR+FIREKV DWDDEVV+TARFKAFSGQKSDWEPRYLFWR LILT+ARQ NF+ IKPSEI NQWFSRGGL PLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPRSGQLSY+FK+LSNLMG SKKNP+SLLRDDY++LACVLQDRA EVI CLSLSNWTSS IITMVKFQNICGGPDEATVILSYLI CGKA++LSK 
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        K +LLEGVKVS SATTV GITTLDYDILHL+WT EKLQQQLD I+QRYDV KQSAL SLKSGNKK ALKHARELKITTESREKVASL NRVEEVLNAI D
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPED-EDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVAT
        AE TK+VSEAIQIGARVMKEHEVNWDQLQHSLQELE SID QKQVA+ IDS PS SIP D EDIEE FKKLELE+TA Q L+ASTSES VNIA GETV  
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPED-EDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVAT

Query:  VSDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMELGIS
        V DDSLS+ LSNLKLVEE  KE AN KSN K  SK+MELGIS
Subjt:  VSDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMELGIS

A0A1S3CRC5 charged multivesicular body protein 7 isoform X2; e value: 8.9e-198; % identity: 83.71
Query:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESKGS VR+FIREKV DWDDEVV+TARFKAFSGQKSDWEPRYLFWR LILT+ARQ NF+ IKPSEI NQWFSRGGL PLCLDHVLHLMY  GDIIRR
Subjt:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPRSGQLSY+FK+LSNLMG SKKNP+SLLRDDY++LACVLQDRA EVI CLSLSNWTSS IITMVKFQNICGGPDEATVILSYLI CGKA++LSK 
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        K +LLE   VS SATTV GITTLDYDILHL+WT EKLQQQLD I+QRYDV KQSAL SLKSGNKK ALKHARELKITTESREKVASL NRVEEVLNAI D
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPED-EDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVAT
        AE TK+VSEAIQIGARVMKEHEVNWDQLQHSLQELE SID QKQVA+ IDS PS SIP D EDIEE FKKLELE+TA Q L+ASTSES VNIA GETV  
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPED-EDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVAT

Query:  VSDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMELGIS
        V DDSLS+ LSNLKLVEE  KE AN KSN K  SK+MELGIS
Subjt:  VSDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMELGIS

A0A6J1FF90 charged multivesicular body protein 7; e value: 1.7e-204; % identity: 86.53
Query:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESK   VR+FIREKVPDWD+E+V+TARFKAFSGQKSDWEPRYLFWR LILTIA QFNF+F+KPSEI NQWFSRGGLAPLCLDHVLHLM IEGDIIRR
Subjt:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPR GQLSYLFKKLSNLMG SKKN D LL DDY+VLACVLQDRAAEV+ CLS SNWTSSC+ITMVKFQNICGGPDEAT  LSYL  CGKARYLSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        +KEL+EGVK+SLSA  V GITTLDYDILHLIWTTE+LQ+QLDVIDQRYDV +QSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV
        AESTKTVSEAIQIGAR MKEHEV+WD LQHSLQELEASID QKQVAS IDSAPSG I E+EDIEEEFKKLELEV AGQ L+ASTS++G NIA G  VATV
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV

Query:  SDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMEL
        SDDSLSAALSNLKLVEETGKET   KSN KSKSK+MEL
Subjt:  SDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMEL

A0A6J1K252 charged multivesicular body protein 7; e value: 8.3e-204; % identity: 86.3
Query:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR
        MEKESK   VR+FIREKVPDWD+EVV+TA FKAFSGQKSDWEPRYLFWR LIL I+ QFNF+FIKPSEI NQWFSRGGLAPLCLDHVLHLM IEGDIIRR
Subjt:  MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRR

Query:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE
        SDMLDPR GQLSYLFKKLSN+MG SKKNPD LL DDY+VLACVLQDRAAEV+ CLS SNWTSSC+ITMVKFQNICGGPDEAT ILSYL  CGKARYLSKE
Subjt:  SDMLDPRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKE

Query:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
        +KEL+EGVK+SLSA  V GITTLDYDILHLIWTTE+LQ+QLDVIDQRYDV +QSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD
Subjt:  KKELLEGVKVSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIAD

Query:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV
        AESTKTVSEAIQIGARVMKEHEV+WDQLQHSL ELEASID QKQV S IDSAPSGSI E+EDIEEEFKKLELEV AGQ L+A+TS++GVNIA G  VATV
Subjt:  AESTKTVSEAIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATV

Query:  SDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMEL
        SDDSLSAALSNLKLV ET KET   KSN KSKSK+MEL
Subjt:  SDDSLSAALSNLKLVEETGKETANPKSNFKSKSKLMEL

SwissProt top hits (columns: e value, % identity, alignment)
Q5ZJB7 Charged multivesicular body protein 7; e value: 1.5e-16; % identity: 24.49
Query:  PDWD-DEVVSTARFKAFSGQK----SDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSD-MLDPRSGQLS
        P+W+ D+      F AF   +    ++W+ +  FW GL+L   R+   V     E+ N  F R G  PL L  VL  +   G + R SD M    S  +S
Subjt:  PDWD-DEVVSTARFKAFSGQK----SDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSD-MLDPRSGQLS

Query:  Y---------LFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGG--PDEATVILSYLIGCGKARYLSKEK
        +         L   LS+++G SK   +    ++ ++   +LQ++A EV      S  +S  ++ + + +++C G  PDE T  L  L        L KEK
Subjt:  Y---------LFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGG--PDEATVILSYLIGCGKARYLSKEK

Query:  KELL---EGVKVSLSA----TTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEV
        K  +    G K+   A      V+ +  +D  +  L+ + + L Q+++ + Q  +  K  A ++ ++G K+ AL+  +  + T    E++ S L+ V+ +
Subjt:  KELL---EGVKVSLSA----TTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEV

Query:  LNAIADAESTKTVSEAIQ--IGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPED-EDIEEEFKKLELEVTAGQKLNASTSESGVNI
        L+ I  +++ + V  A Q  +GA  +   +V  ++ ++ + +++   D Q +VA  +  A    +  D E++E+E   L L+ +A + ++        + 
Subjt:  LNAIADAESTKTVSEAIQ--IGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPED-EDIEEEFKKLELEVTAGQKLNASTSESGVNI

Query:  AAGETVATVSDDSLSAALSNLK-----LVEETGKETANPKS
         AG     +SD  L A L  L      L ++T   ++ P++
Subjt:  AAGETVATVSDDSLSAALSNLK-----LVEETGKETANPKS

Q6PBQ2 Charged multivesicular body protein 7; e value: 3.2e-19; % identity: 24.25
Query:  PDWDDEVVSTARFKAFSGQK----SDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDM-LDPRSGQLSY
        PDWDD+   +  F AF   +    +DW+ +  FW  LI+   R+   V +   ++ N+ F R G  PL L  V+  M   G + + SD   +  SG LS+
Subjt:  PDWDDEVVSTARFKAFSGQK----SDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDM-LDPRSGQLSY

Query:  -----LFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGG--PDEATVILSYLIGCGKARYLSKEKKELLE
             L + L   + A   +    L + +VV+  V +++AAE++     S  ++  +++  + +++     PDE+T+ ++ L+   + ++++    E  +
Subjt:  -----LFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGG--PDEATVILSYLIGCGKARYLSKEKKELLE

Query:  GVKVSLSAT-TVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTK
         VK S +    V+ ++ +D  I  L  + + L+++++ +    +  KQ A + LK G K  AL+  R  K   +  +++ + L  V+ +L+ IA++++ +
Subjt:  GVKVSLSAT-TVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTK

Query:  TVSEAIQIGARVMK--EHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKL
         V +A Q G   ++     V  ++ ++ + +++   D Q +V   + S    +  + ED+EEE K L
Subjt:  TVSEAIQIGARVMK--EHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKL

Arabidopsis top hits (columns: e value, % identity, alignment)
AT3G62080.1 SNF7 family protein; e value: 2.1e-111; % identity: 51.29
Query:  VRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDMLDPRSG
        V++FIR +VPDWDDEVV+ ARFKAFSGQ+SDWE ++ FWR LI+ ++RQF    I P ++   WF RGG+ PLC+D V+ LM+ EGD++R SD+ DP SG
Subjt:  VRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDMLDPRSG

Query:  QLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKEKKELLEGVK
        +++ L + + NLM       + +L ++ +VL  +L+++AA+V+  LS  +WTS+C++T+ KF+N+C G +EA+ +LS+L GCGKA  +S  + EL+EGVK
Subjt:  QLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKEKKELLEGVK

Query:  VSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTKTVSE
        VS S T + GI+TLD DILHL+ TTEKLQ QL+V+DQR +  K+SALASLKSG++K AL+HARELK+ TESREK  SLLNRVEEVLN IAD+ESTK VSE
Subjt:  VSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTKTVSE

Query:  AIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATVSDDSLSAAL
        AI+ GARVMK+ +++ D +   L+ELE +I+ QKQV  A++SAP   I +DEDIEEE  +LE+++          SES   + A    A    DSL+   
Subjt:  AIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATVSDDSLSAAL

Query:  SNLKL--VEETGKETANPKSNFKSKSK
        S LKL   ++T +E A   +  K   K
Subjt:  SNLKL--VEETGKETANPKSNFKSKSK

AT3G62080.2 SNF7 family protein; e value: 2.1e-111; % identity: 51.29
Query:  VRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDMLDPRSG
        V++FIR +VPDWDDEVV+ ARFKAFSGQ+SDWE ++ FWR LI+ ++RQF    I P ++   WF RGG+ PLC+D V+ LM+ EGD++R SD+ DP SG
Subjt:  VRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDMLDPRSG

Query:  QLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKEKKELLEGVK
        +++ L + + NLM       + +L ++ +VL  +L+++AA+V+  LS  +WTS+C++T+ KF+N+C G +EA+ +LS+L GCGKA  +S  + EL+EGVK
Subjt:  QLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKEKKELLEGVK

Query:  VSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTKTVSE
        VS S T + GI+TLD DILHL+ TTEKLQ QL+V+DQR +  K+SALASLKSG++K AL+HARELK+ TESREK  SLLNRVEEVLN IAD+ESTK VSE
Subjt:  VSLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTKTVSE

Query:  AIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATVSDDSLSAAL
        AI+ GARVMK+ +++ D +   L+ELE +I+ QKQV  A++SAP   I +DEDIEEE  +LE+++          SES   + A    A    DSL+   
Subjt:  AIQIGARVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATVSDDSLSAAL

Query:  SNLKL--VEETGKETANPKSNFKSKSK
        S LKL   ++T +E A   +  K   K
Subjt:  SNLKL--VEETGKETANPKSNFKSKSK
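
In BLAST-style alignments such as those above, the unlabeled middle line repeats a residue letter where query and subject are identical, shows `+` for a conservative substitution, and leaves a space otherwise. The following is a minimal sketch for tallying one such segment; the helper name `midline_stats` is hypothetical, and it assumes the midline string has the same length as the aligned row (trailing spaces not stripped).

```python
def midline_stats(midline: str) -> dict:
    """Count identities and positives in one BLAST-style midline segment."""
    # Letters mark identical residues; '+' marks conservative substitutions.
    identities = sum(1 for ch in midline if ch.isalpha())
    conservative = midline.count("+")
    return {
        "length": len(midline),
        "identities": identities,
        "positives": identities + conservative,
    }


# Final segment of the first GenBank hit (KAG6579154.1) listed above.
segment = "SDDSLSAALSNLKLVEETGKET   KSN KSKSK+MEL"
stats = midline_stats(segment)
print(stats)
```

Note that the % identity values reported for each hit cover the whole alignment, so a single segment will generally not reproduce them.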


Sequences
CDS sequence
ATGGAAAAGGAATCAAAGGGGTCTCGTGTTAGAGACTTCATCAGGGAAAAAGTCCCAGACTGGGATGACGAAGTGGTGTCTACAGCTCGGTTCAAGGCATTTAGT
GGGCAGAAATCTGATTGGGAACCCAGATACCTATTTTGGAGGGGTTTGATCCTCACAATTGCCCGCCAATTCAACTTCGTCTTCATTAAACCTTCTGAAATAACG
AATCAATGGTTTTCTCGAGGAGGGTTGGCCCCATTGTGCCTTGACCATGTTCTGCATCTAATGTATATTGAGGGCGACATTATAAGACGTAGTGACATGCTGGAT
CCAAGGAGTGGCCAACTCTCCTACTTGTTTAAAAAACTAAGCAATTTGATGGGTGCATCTAAAAAGAACCCCGACAGTTTGCTTCGTGATGATTATGTAGTTCTT
GCCTGTGTATTACAGGATAGAGCAGCTGAAGTTATCAACTGTTTATCTCTTAGTAATTGGACCTCGTCCTGCATTATTACAATGGTGAAGTTCCAAAACATCTGT
GGAGGACCTGATGAAGCGACTGTTATCTTGAGTTACTTGATTGGATGTGGTAAAGCAAGGTATCTCTCTAAGGAAAAAAAGGAACTTCTAGAGGGTGTAAAGGTC
TCTCTTTCGGCAACAACAGTTGCTGGCATCACAACTCTCGATTATGACATTTTGCACTTGATTTGGACAACAGAAAAGCTTCAGCAACAACTTGATGTGATTGAC
CAGCGCTACGATGTGTTGAAACAATCCGCACTGGCTTCTTTGAAGTCTGGAAACAAAAAAACTGCATTGAAACATGCAAGAGAGTTGAAGATCACCACAGAAAGT
CGGGAAAAAGTTGCATCTCTCTTAAACAGAGTGGAGGAAGTCCTAAATGCTATTGCAGATGCCGAATCGACAAAAACGGTTTCTGAGGCTATTCAAATTGGTGCT
CGAGTAATGAAAGAACACGAGGTTAATTGGGATCAACTCCAGCATAGTTTGCAGGAACTAGAAGCAAGCATTGATAGACAAAAACAAGTTGCAAGTGCTATAGAT
TCAGCTCCATCTGGCTCAATTCCGGAAGATGAAGATATTGAGGAGGAGTTTAAGAAGCTCGAGTTGGAAGTAACAGCAGGCCAAAAACTCAATGCGTCAACATCA
GAATCTGGGGTTAATATTGCAGCTGGTGAAACAGTGGCTACAGTTTCCGACGATTCATTGAGTGCTGCGTTATCAAATCTAAAGCTTGTTGAAGAAACAGGAAAG
GAGACAGCGAACCCGAAGTCGAATTTTAAGAGCAAGTCGAAACTTATGGAGCTTGGCATTTCTTAG
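
The CDS above can be checked against the protein sequence listed on this page by translating it. This is a minimal sketch using the standard genetic code, built from the conventional TCAG ordering; the function name `translate` is hypothetical, and the choice to stop at the first stop codon and to emit `X` for unrecognised codons is an assumption of this sketch.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, with codons enumerated in TCAG order (TTT, TTC, TTA, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS to protein, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First and last codons of the CDS shown above.
start = translate("ATGGAAAAGGAATCAAAGGGGTCTCGTGTTAGAGACTTC")
end = translate("GGCATTTCTTAG")
print(start, end)
```

The two fragments translate to the beginning (`MEKESKGSRVRDF`) and end (`GIS`) of the protein sequence given on this page.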
mRNA sequence
CTTTTTGGTTTCTTCTGTTCTTTTGAGTTCTGAGGCTTCGCCAAATATCGTCTTCATTTTCGACACCAGCTCTTTGTGATTGATTATGAAGTCTTCGATGTGCCG
CGTGCTTGATTCCATGCAGATTGTAGTGAGAAGAAGAGGATTACAGTTCCTCTAATTGAAGAAGCCTAGGTTGCAGTTGCAGCTCTATTCATAGGTGTGAACAGT
GCAAACATAATGGAAAAGGAATCAAAGGGGTCTCGTGTTAGAGACTTCATCAGGGAAAAAGTCCCAGACTGGGATGACGAAGTGGTGTCTACAGCTCGGTTCAAG
GCATTTAGTGGGCAGAAATCTGATTGGGAACCCAGATACCTATTTTGGAGGGGTTTGATCCTCACAATTGCCCGCCAATTCAACTTCGTCTTCATTAAACCTTCT
GAAATAACGAATCAATGGTTTTCTCGAGGAGGGTTGGCCCCATTGTGCCTTGACCATGTTCTGCATCTAATGTATATTGAGGGCGACATTATAAGACGTAGTGAC
ATGCTGGATCCAAGGAGTGGCCAACTCTCCTACTTGTTTAAAAAACTAAGCAATTTGATGGGTGCATCTAAAAAGAACCCCGACAGTTTGCTTCGTGATGATTAT
GTAGTTCTTGCCTGTGTATTACAGGATAGAGCAGCTGAAGTTATCAACTGTTTATCTCTTAGTAATTGGACCTCGTCCTGCATTATTACAATGGTGAAGTTCCAA
AACATCTGTGGAGGACCTGATGAAGCGACTGTTATCTTGAGTTACTTGATTGGATGTGGTAAAGCAAGGTATCTCTCTAAGGAAAAAAAGGAACTTCTAGAGGGT
GTAAAGGTCTCTCTTTCGGCAACAACAGTTGCTGGCATCACAACTCTCGATTATGACATTTTGCACTTGATTTGGACAACAGAAAAGCTTCAGCAACAACTTGAT
GTGATTGACCAGCGCTACGATGTGTTGAAACAATCCGCACTGGCTTCTTTGAAGTCTGGAAACAAAAAAACTGCATTGAAACATGCAAGAGAGTTGAAGATCACC
ACAGAAAGTCGGGAAAAAGTTGCATCTCTCTTAAACAGAGTGGAGGAAGTCCTAAATGCTATTGCAGATGCCGAATCGACAAAAACGGTTTCTGAGGCTATTCAA
ATTGGTGCTCGAGTAATGAAAGAACACGAGGTTAATTGGGATCAACTCCAGCATAGTTTGCAGGAACTAGAAGCAAGCATTGATAGACAAAAACAAGTTGCAAGT
GCTATAGATTCAGCTCCATCTGGCTCAATTCCGGAAGATGAAGATATTGAGGAGGAGTTTAAGAAGCTCGAGTTGGAAGTAACAGCAGGCCAAAAACTCAATGCG
TCAACATCAGAATCTGGGGTTAATATTGCAGCTGGTGAAACAGTGGCTACAGTTTCCGACGATTCATTGAGTGCTGCGTTATCAAATCTAAAGCTTGTTGAAGAA
ACAGGAAAGGAGACAGCGAACCCGAAGTCGAATTTTAAGAGCAAGTCGAAACTTATGGAGCTTGGCATTTCTTAGGTGCATTATCATGGAGTAGTTTACAATGGC
AATGTCAGCCCAATGTAACCTTCTTCTTTTGGTTGCCCTCTGTACTTTGTCAGTAAAAGAAAGTGAAATTGACGTACAAAATTGCTACAAATTTGCTGCTCTTTG
TTCCTGAGTCAGCTATTGTATTACTGTTATTGGCGTTTGAAGTTGAAAATGTAAGATTGAGATTGTTGTATAAACTTCAAACCGTGATTAGTTTGAAACCTCCCT
TCTAAAAACAAAATGTGGTTAAATTACCAAGGGACGTGGAAATTTGTGTAATCGAAACGAGATTTCATTGATGGATGAATTGTAATAAGCCTGAATCGCAAATA
Protein sequence
MEKESKGSRVRDFIREKVPDWDDEVVSTARFKAFSGQKSDWEPRYLFWRGLILTIARQFNFVFIKPSEITNQWFSRGGLAPLCLDHVLHLMYIEGDIIRRSDMLD
PRSGQLSYLFKKLSNLMGASKKNPDSLLRDDYVVLACVLQDRAAEVINCLSLSNWTSSCIITMVKFQNICGGPDEATVILSYLIGCGKARYLSKEKKELLEGVKV
SLSATTVAGITTLDYDILHLIWTTEKLQQQLDVIDQRYDVLKQSALASLKSGNKKTALKHARELKITTESREKVASLLNRVEEVLNAIADAESTKTVSEAIQIGA
RVMKEHEVNWDQLQHSLQELEASIDRQKQVASAIDSAPSGSIPEDEDIEEEFKKLELEVTAGQKLNASTSESGVNIAAGETVATVSDDSLSAALSNLKLVEETGK
ETANPKSNFKSKSKLMELGIS