CuGenDBv2

Moc04g00930 (gene) of Bitter gourd (OHB3-1) v2 genome

Gene ID: Moc04g00930
Organism: Momordica charantia cv. OHB3-1 (Bitter gourd (OHB3-1) v2)
Description: clathrin coat assembly protein AP180-like
Genome location: chr4:649279..650574
RNA-Seq Expression: Moc04g00930
Synteny: Moc04g00930
Gene Ontology terms:
GO:0006900 - vesicle budding from membrane (biological process)
GO:0048268 - clathrin coat assembly (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domains:
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR014712 - ANTH domain superfamily
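The genome location above uses a `chromosome:start..end` notation. As a minimal sketch, the snippet below splits such a string into its parts and derives the span length. It assumes the coordinates are 1-based and inclusive, which the page does not state; `parse_location` is a hypothetical helper name, not part of any database tool.

```python
import re

def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end, length).

    Assumption: coordinates are 1-based and inclusive, so the span
    length is end - start + 1.
    """
    match = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if match is None:
        raise ValueError(f"unrecognized location string: {loc!r}")
    chrom = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    return chrom, start, end, end - start + 1

chrom, start, end, length = parse_location("chr4:649279..650574")
print(chrom, start, end, length)
```

Under the inclusive-coordinate assumption, the span length can be compared with the length of the CDS listed further down the record as a quick consistency check.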


Homology
GenBank top hits    e value    %identity
XP_022147825.1 uncharacterized protein LOC111016670, partial [Momordica charantia]    8.4e-179    100
Query:  IETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDRNSMGDIMSATGSWASPTNSVEQDGESYSDYQSEKQSQFGDSASMNGNENE
        IETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDRNSMGDIMSATGSWASPTNSVEQDGESYSDYQSEKQSQFGDSASMNGNENE
Subjt:  IETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDRNSMGDIMSATGSWASPTNSVEQDGESYSDYQSEKQSQFGDSASMNGNENE

Query:  QNLTPNFAFFDDGPQENVKPYQEQLHFGAPSGRLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYNNPFLQGADDLFAVPPTFTAEFSS
        QNLTPNFAFFDDGPQENVKPYQEQLHFGAPSGRLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYNNPFLQGADDLFAVPPTFTAEFSS
Subjt:  QNLTPNFAFFDDGPQENVKPYQEQLHFGAPSGRLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYNNPFLQGADDLFAVPPTFTAEFSS

Query:  PETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVSWRSSEKGNDRSFDGKTMNEESLLE
        PETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVSWRSSEKGNDRSFDGKTMNEESLLE
Subjt:  PETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVSWRSSEKGNDRSFDGKTMNEESLLE

Query:  QQKLWKEQQNKIIAKHLA
        QQKLWKEQQNKIIAKHLA
Subjt:  QQKLWKEQQNKIIAKHLA

XP_022952666.1 clathrin coat assembly protein AP180 [Cucurbita moschata]    3.8e-147    68.25
Query:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT
        MLIDRI +WQKLLDRAIATRPTGPAK NRLVL +LHAVV ESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCK+IGVGRT
Subjt:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT

Query:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYSD
        SEYPSVQQPSDELIETLQEFLKDQASFPCH +RSPP  V L    S    L+DDLEQSESSD+        NS G++MSATGSW SP NSVEQ+GE YSD
Subjt:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYSD

Query:  YQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSG-RLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSI-SPQQQYN
        YQSEKQS+ GD A      NE NL+PNFAFF+D    +  P+QE+   GA  G R GDWELVLAESAT+   +EWPDFFAPSIGDD F K   SPQ QYN
Subjt:  YQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSG-RLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSI-SPQQQYN

Query:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLN--FEAWGSTETVVANSTKESNLFSFNPAGEDDPFAV
        NPFLQ +++ FAVPPTF A     ETEIAPTFRA+N KE   AVAVAPTFR ET  F   N     F+ WG     + N +     F F+   E+DPFA 
Subjt:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLN--FEAWGSTETVVANSTKESNLFSFNPAGEDDPFAV

Query:  SWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH
        +W SS  GN+ S +   M++ESLL+QQKLWKEQQNKIIAKH
Subjt:  SWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH

XP_022969464.1 clathrin coat assembly protein AP180-like [Cucurbita maxima]    2.4e-149    68.48
Query:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT
        MLIDRI +WQKLLDRAIATRPTGPAK NRLVL +LHAVV ESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCK+IGVGRT
Subjt:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT

Query:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYSD
        SEYPSVQQPSDELIETLQEFLKDQASFPCH +RSPP  V L    S    L+DDLEQSESSD+        NS G++MSATGSW SP NSVEQDGE YSD
Subjt:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYSD

Query:  YQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSG-RLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSI-SPQQQYN
        YQSEKQS+ GD A      NE NL+PNFAFF+D    + KP+QE+  FGA  G R GDWELVLAESAT+   +EWPDFFAPSIGDD F K   SPQ QYN
Subjt:  YQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSG-RLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSI-SPQQQYN

Query:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCP--ANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAV
        NPFLQ +++ FAVPPTF A     ETEIAPTFRA+N KE   AV VAPTFR ET  F     N   F+ WG     + N +   + F F+   E+DPFA 
Subjt:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCP--ANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAV

Query:  SWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH
        +W S+  GN+ S +   M++ESLL+QQKLWKEQQNKIIAKH
Subjt:  SWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH

XP_023553928.1 clathrin coat assembly protein AP180-like [Cucurbita pepo subsp. pepo]    1.7e-147    68.25
Query:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT
        MLIDRI +WQKLLDRAIATRPTGPAK NRLVL +LHAVV ESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCK+IGVGRT
Subjt:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT

Query:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYSD
        SEYPSVQQPSDELIETLQEFLKDQASFPCH +RSPP  V L    S    L+DDLEQSESSD+        NS G++MSATGSW SP NSVEQDGE YSD
Subjt:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYSD

Query:  YQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSG-RLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSI-SPQQQYN
        YQSEKQS+ GD A      NE NL+PNFAFFDD    +  P+QE+  FGA  G R GDWELVLAESAT+   +EWPDFFAPSIGDD F K   SPQ QYN
Subjt:  YQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSG-RLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSI-SPQQQYN

Query:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLN--FEAWGSTETVVANSTKESNLFSFNPAGEDDPFAV
        NPFLQ +++ FAVPPTF A     ETEIAPTFRA+N KE   AV VAPTFR ET  F   N     F+ WG     + N +     F F+   E+DPFA 
Subjt:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLN--FEAWGSTETVVANSTKESNLFSFNPAGEDDPFAV

Query:  SWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH
        +W S+  GN+ S +   M++ESLL+QQKLWKE QNKIIAKH
Subjt:  SWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH

XP_038887470.1 clathrin coat assembly protein AP180 [Benincasa hispida]    6.7e-152    68.93
Query:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT
        MLIDRI++WQKLLDRAIATRPTGPAK NRLV ++LHAVV ESFDLYRDI+DGLALLLDSFFHLQYQSCVNAFQACVKA KQFEELGSFYDLCK+IGVGRT
Subjt:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT

Query:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYSD
        SEYPSVQQPSDELIETLQEFLKDQASFPCHG+RSPPPQ  LP      + L+DDLEQSESSD+        NS+GDIMSATGSW SP NSVEQDGE YSD
Subjt:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYSD

Query:  YQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSGRLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYNNP
        YQSEKQS+ GD A      NE NL+PNFAFF    +EN  P+QE      PS R GDWELVLAESAT   PQEWPDFF+PSIGDD F K  SPQ  YNNP
Subjt:  YQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSGRLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYNNP

Query:  FLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLN--FEAWGSTETVVANSTKESNLFSFNPAG-EDDPFAVS
        FLQ +DDL AVPPTF A     E EIAPTFRAQN KE    V+VAPTFRAET      N  +  FEAWG+ +            F F+  G E+DPFA  
Subjt:  FLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLN--FEAWGSTETVVANSTKESNLFSFNPAG-EDDPFAVS

Query:  WRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKHL
        W  +  GND +FD     +ESL++QQKLWKEQQNKIIAKHL
Subjt:  WRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKHL

TrEMBL top hits    e value    %identity
A0A1S3BN17 clathrin coat assembly protein AP180    1.5e-141    66.59
Query:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT
        MLIDRI++WQKLLDRAIATRPTGPAK NRLV ++LHAVV ESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCK+IGVGRT
Subjt:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT

Query:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIP-SILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYS
        SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPP  +      S+P + L+DDLEQSESSD+        NS+GDIMSATGSW SP NSVEQDGE YS
Subjt:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIP-SILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYS

Query:  DYQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFG-APSGRLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYN
        DYQSEKQS+ GD A      NE NL+PNFAFF    +EN  P++E+  FG APS R GDWELVLAESAT   P+EWPDFF+PSIGDD F K  SPQ  Y 
Subjt:  DYQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFG-APSGRLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYN

Query:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPAN-NLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVS
        NPFLQ +DDLF  PPTF A     E E+ PTFRA+N        +VAPTFRAET      N N  F+  G                       +DPFA  
Subjt:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPAN-NLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVS

Query:  WRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH
              GND+SFD     EESLL+QQKLWKEQQNKIIAKH
Subjt:  WRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH

A0A5D3C685 Clathrin coat assembly protein AP180    1.5e-141    66.59
Query:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT
        MLIDRI++WQKLLDRAIATRPTGPAK NRLV ++LHAVV ESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCK+IGVGRT
Subjt:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT

Query:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIP-SILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYS
        SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPP  +      S+P + L+DDLEQSESSD+        NS+GDIMSATGSW SP NSVEQDGE YS
Subjt:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIP-SILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYS

Query:  DYQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFG-APSGRLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYN
        DYQSEKQS+ GD A      NE NL+PNFAFF    +EN  P++E+  FG APS R GDWELVLAESAT   P+EWPDFF+PSIGDD F K  SPQ  Y 
Subjt:  DYQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFG-APSGRLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYN

Query:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPAN-NLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVS
        NPFLQ +DDLF  PPTF A     E E+ PTFRA+N        +VAPTFRAET      N N  F+  G                       +DPFA  
Subjt:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPAN-NLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVS

Query:  WRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH
              GND+SFD     EESLL+QQKLWKEQQNKIIAKH
Subjt:  WRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH

A0A6J1D176 uncharacterized protein LOC111016670    4.0e-179    100
Query:  IETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDRNSMGDIMSATGSWASPTNSVEQDGESYSDYQSEKQSQFGDSASMNGNENE
        IETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDRNSMGDIMSATGSWASPTNSVEQDGESYSDYQSEKQSQFGDSASMNGNENE
Subjt:  IETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDRNSMGDIMSATGSWASPTNSVEQDGESYSDYQSEKQSQFGDSASMNGNENE

Query:  QNLTPNFAFFDDGPQENVKPYQEQLHFGAPSGRLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYNNPFLQGADDLFAVPPTFTAEFSS
        QNLTPNFAFFDDGPQENVKPYQEQLHFGAPSGRLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYNNPFLQGADDLFAVPPTFTAEFSS
Subjt:  QNLTPNFAFFDDGPQENVKPYQEQLHFGAPSGRLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYNNPFLQGADDLFAVPPTFTAEFSS

Query:  PETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVSWRSSEKGNDRSFDGKTMNEESLLE
        PETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVSWRSSEKGNDRSFDGKTMNEESLLE
Subjt:  PETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVSWRSSEKGNDRSFDGKTMNEESLLE

Query:  QQKLWKEQQNKIIAKHLA
        QQKLWKEQQNKIIAKHLA
Subjt:  QQKLWKEQQNKIIAKHLA

A0A6J1GKV6 clathrin coat assembly protein AP180    1.8e-147    68.25
Query:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT
        MLIDRI +WQKLLDRAIATRPTGPAK NRLVL +LHAVV ESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCK+IGVGRT
Subjt:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT

Query:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYSD
        SEYPSVQQPSDELIETLQEFLKDQASFPCH +RSPP  V L    S    L+DDLEQSESSD+        NS G++MSATGSW SP NSVEQ+GE YSD
Subjt:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYSD

Query:  YQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSG-RLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSI-SPQQQYN
        YQSEKQS+ GD A      NE NL+PNFAFF+D    +  P+QE+   GA  G R GDWELVLAESAT+   +EWPDFFAPSIGDD F K   SPQ QYN
Subjt:  YQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSG-RLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSI-SPQQQYN

Query:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLN--FEAWGSTETVVANSTKESNLFSFNPAGEDDPFAV
        NPFLQ +++ FAVPPTF A     ETEIAPTFRA+N KE   AVAVAPTFR ET  F   N     F+ WG     + N +     F F+   E+DPFA 
Subjt:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCPANNLN--FEAWGSTETVVANSTKESNLFSFNPAGEDDPFAV

Query:  SWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH
        +W SS  GN+ S +   M++ESLL+QQKLWKEQQNKIIAKH
Subjt:  SWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH

A0A6J1I2N8 clathrin coat assembly protein AP180-like    1.1e-149    68.48
Query:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT
        MLIDRI +WQKLLDRAIATRPTGPAK NRLVL +LHAVV ESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCK+IGVGRT
Subjt:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT

Query:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYSD
        SEYPSVQQPSDELIETLQEFLKDQASFPCH +RSPP  V L    S    L+DDLEQSESSD+        NS G++MSATGSW SP NSVEQDGE YSD
Subjt:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDR--------NSMGDIMSATGSWASPTNSVEQDGESYSD

Query:  YQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSG-RLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSI-SPQQQYN
        YQSEKQS+ GD A      NE NL+PNFAFF+D    + KP+QE+  FGA  G R GDWELVLAESAT+   +EWPDFFAPSIGDD F K   SPQ QYN
Subjt:  YQSEKQSQFGDSASMNGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSG-RLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSI-SPQQQYN

Query:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCP--ANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAV
        NPFLQ +++ FAVPPTF A     ETEIAPTFRA+N KE   AV VAPTFR ET  F     N   F+ WG     + N +   + F F+   E+DPFA 
Subjt:  NPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRAETVPFCP--ANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAV

Query:  SWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH
        +W S+  GN+ S +   M++ESLL+QQKLWKEQQNKIIAKH
Subjt:  SWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKH

SwissProt top hits    e value    %identity
Q8GX47 Putative clathrin assembly protein At4g02650    6.7e-22    27.51
Query:  LIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTS
        + +R+ H Q+LLDR +A RPTG AK+NR+V+  ++ +V ESF LY +I++ + +L++ F  L     +  ++   + +KQF+EL  FY  CKN+ V R+S
Subjt:  LIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTS

Query:  EYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDRNSMGDIMSATGSWASPTNSVEQDGESYSDYQS--EKQS
        EYP +++ + + ++ + EF++D+++      +S        +K S  S   +   +    ++  +  I +            E+  E+  D +    +Q 
Subjt:  EYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDRNSMGDIMSATGSWASPTNSVEQDGESYSDYQS--EKQS

Query:  QFGDSASMNGNENEQNLTPN-------FAFFDD--GPQENVKPYQEQLHFGAPSGRLGDWELVLAESAT
        Q GD   +    +E  +T          A FD   G +    P  E     A +    DWE  L  SAT
Subjt:  QFGDSASMNGNENEQNLTPN-------FAFFDD--GPQENVKPYQEQLHFGAPSGRLGDWELVLAESAT

Q8LF20 Putative clathrin assembly protein At2g25430    7.2e-32    48.59
Query:  LIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTS
        +  ++ H Q+LLDR ++ RPTG AK++R++L  L+ VV ESF LY DI + LA+LLD FF ++Y  CV AF A   AAKQ +EL +FY+ CK  GV R+S
Subjt:  LIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTS

Query:  EYPSVQQPSDELIETLQEFLKDQASFPCHGNR----SPPPQV
        EYP VQ+ + +L+ETL+EF++D+A       R    +PPP V
Subjt:  EYPSVQQPSDELIETLQEFLKDQASFPCHGNR----SPPPQV

Q8S9J8 Probable clathrin assembly protein At4g32285    7.9e-31    50.81
Query:  LIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTS
        +  ++ H Q+LLDR ++ RPTG AK++R++L  ++ VV ESF LY DI + LA+LLD FF ++Y  CV AF A   AAKQ +EL +FY  CK+ GV R+S
Subjt:  LIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTS

Query:  EYPSVQQPSDELIETLQEFLKDQA
        EYP VQ+ + +L+ETL+EF++D+A
Subjt:  EYPSVQQPSDELIETLQEFLKDQA

Q9SA65 Putative clathrin assembly protein At1g03050    9.1e-27    29.28
Query:  RISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTSEYP
        RI H Q+LLDR +A RPTG A++NR+V+  L+ +V ESF +Y D+++ + +L++ F  L     +  +    + +KQFEEL  FY  CKN+G+ R+SEYP
Subjt:  RISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTSEYP

Query:  SVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDRNSMGDIMSATGSWASPTNSVEQDGESYSDYQSE-----KQS
         +++ + + ++ + EF++D+++   H  +S           S+ S   +D +++ + + N   + M+A  +   P    E D +   + + E     KQ 
Subjt:  SVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDRNSMGDIMSATGSWASPTNSVEQDGESYSDYQSE-----KQS

Query:  QFGDSASM---NGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSGRLGDWELVLAESAT
        + GD   +   NG E  Q          DGP  +    +    + A      DWE  L ++AT
Subjt:  QFGDSASM---NGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSGRLGDWELVLAESAT

Q9ZVN6 Clathrin coat assembly protein AP180    1.4e-59    37.14
Query:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT
        ML+D+I++WQKLLDRAIATRPTG AK+NRLV  +L+AV+ ESFDLYRDISDGLALLLDSFFHLQYQSC+NAFQACV+A+KQFEEL +FYDL K+IG+GRT
Subjt:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT

Query:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLP----AKDSIPSILSD----DLEQSE------SSDRNSMGDIMSAT-GSWASPTNSVEQ
        SEYPS+Q+ S EL+ETLQEFLKDQ+SFP      P P  FLP    +KDS  S   D     ++ SE      S    S+ D+MS T    +SP  S   
Subjt:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLP----AKDSIPSILSD----DLEQSE------SSDRNSMGDIMSAT-GSWASPTNSVEQ

Query:  DGESYSDYQSEKQSQFGDSASMNGNENEQNLTPNFAFFD---------------------------------------DGPQENVKPYQEQLHFGAPSGR
          E Y   + +      D+ S     N  +++ +    D                                       D P++ ++  +E+        R
Subjt:  DGESYSDYQSEKQSQFGDSASMNGNENEQNLTPNFAFFD---------------------------------------DGPQENVKPYQEQLHFGAPSGR

Query:  -LGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYNNPFLQG--ADDLFAVPPTFT-AEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRA
          G+W L L E+AT+   Q          G D+ + +        NPFL+   A    A  P  T    +    +  PTF+      +    +  PTF+A
Subjt:  -LGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYNNPFLQG--ADDLFAVPPTFT-AEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRA

Query:  -ETVPFCPANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVSWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKHLA
         ET+P        FE++G  ET                             SE G         +N++S+L++Q++W + Q KIIAKHL+
Subjt:  -ETVPFCPANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVSWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKHLA

Arabidopsis top hits    e value    %identity
AT1G03050.1 ENTH/ANTH/VHS superfamily protein    6.4e-28    29.28
Query:  RISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTSEYP
        RI H Q+LLDR +A RPTG A++NR+V+  L+ +V ESF +Y D+++ + +L++ F  L     +  +    + +KQFEEL  FY  CKN+G+ R+SEYP
Subjt:  RISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTSEYP

Query:  SVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDRNSMGDIMSATGSWASPTNSVEQDGESYSDYQSE-----KQS
         +++ + + ++ + EF++D+++   H  +S           S+ S   +D +++ + + N   + M+A  +   P    E D +   + + E     KQ 
Subjt:  SVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDRNSMGDIMSATGSWASPTNSVEQDGESYSDYQSE-----KQS

Query:  QFGDSASM---NGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSGRLGDWELVLAESAT
        + GD   +   NG E  Q          DGP  +    +    + A      DWE  L ++AT
Subjt:  QFGDSASM---NGNENEQNLTPNFAFFDDGPQENVKPYQEQLHFGAPSGRLGDWELVLAESAT

AT1G05020.1 ENTH/ANTH/VHS superfamily protein    9.9e-61    37.14
Query:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT
        ML+D+I++WQKLLDRAIATRPTG AK+NRLV  +L+AV+ ESFDLYRDISDGLALLLDSFFHLQYQSC+NAFQACV+A+KQFEEL +FYDL K+IG+GRT
Subjt:  MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRT

Query:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLP----AKDSIPSILSD----DLEQSE------SSDRNSMGDIMSAT-GSWASPTNSVEQ
        SEYPS+Q+ S EL+ETLQEFLKDQ+SFP      P P  FLP    +KDS  S   D     ++ SE      S    S+ D+MS T    +SP  S   
Subjt:  SEYPSVQQPSDELIETLQEFLKDQASFPCHGNRSPPPQVFLP----AKDSIPSILSD----DLEQSE------SSDRNSMGDIMSAT-GSWASPTNSVEQ

Query:  DGESYSDYQSEKQSQFGDSASMNGNENEQNLTPNFAFFD---------------------------------------DGPQENVKPYQEQLHFGAPSGR
          E Y   + +      D+ S     N  +++ +    D                                       D P++ ++  +E+        R
Subjt:  DGESYSDYQSEKQSQFGDSASMNGNENEQNLTPNFAFFD---------------------------------------DGPQENVKPYQEQLHFGAPSGR

Query:  -LGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYNNPFLQG--ADDLFAVPPTFT-AEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRA
          G+W L L E+AT+   Q          G D+ + +        NPFL+   A    A  P  T    +    +  PTF+      +    +  PTF+A
Subjt:  -LGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYNNPFLQG--ADDLFAVPPTFT-AEFSSPETEIAPTFRAQNFKEETVAVAVAPTFRA

Query:  -ETVPFCPANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVSWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKHLA
         ET+P        FE++G  ET                             SE G         +N++S+L++Q++W + Q KIIAKHL+
Subjt:  -ETVPFCPANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVSWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKHLA

AT2G25430.1 epsin N-terminal homology (ENTH) domain-containing protein / clathrin assembly protein-related    5.1e-33    48.59
Query:  LIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTS
        +  ++ H Q+LLDR ++ RPTG AK++R++L  L+ VV ESF LY DI + LA+LLD FF ++Y  CV AF A   AAKQ +EL +FY+ CK  GV R+S
Subjt:  LIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTS

Query:  EYPSVQQPSDELIETLQEFLKDQASFPCHGNR----SPPPQV
        EYP VQ+ + +L+ETL+EF++D+A       R    +PPP V
Subjt:  EYPSVQQPSDELIETLQEFLKDQASFPCHGNR----SPPPQV

AT4G32285.1 ENTH/ANTH/VHS superfamily protein    5.6e-32    50.81
Query:  LIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTS
        +  ++ H Q+LLDR ++ RPTG AK++R++L  ++ VV ESF LY DI + LA+LLD FF ++Y  CV AF A   AAKQ +EL +FY  CK+ GV R+S
Subjt:  LIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTS

Query:  EYPSVQQPSDELIETLQEFLKDQA
        EYP VQ+ + +L+ETL+EF++D+A
Subjt:  EYPSVQQPSDELIETLQEFLKDQA

AT4G32285.2 ENTH/ANTH/VHS superfamily protein    5.6e-32    50.81
Query:  LIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTS
        +  ++ H Q+LLDR ++ RPTG AK++R++L  ++ VV ESF LY DI + LA+LLD FF ++Y  CV AF A   AAKQ +EL +FY  CK+ GV R+S
Subjt:  LIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTS

Query:  EYPSVQQPSDELIETLQEFLKDQA
        EYP VQ+ + +L+ETL+EF++D+A
Subjt:  EYPSVQQPSDELIETLQEFLKDQA
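Each alignment block above has a `Query:` line, an unlabeled middle line, and a `Subjt:` line. In the usual BLAST-style convention, the middle line repeats a residue where both sequences are identical, shows `+` for a conservative substitution, and is blank otherwise. The sketch below counts identical columns in one block under that assumed convention; `percent_identity` is a hypothetical helper, and the value for a single block need not match the whole-alignment %identity reported in the hit tables.

```python
def percent_identity(query, midline):
    """Percent of alignment columns marked identical in the midline.

    Assumption: a letter in the midline that equals the query residue
    marks an identical column; '+' and spaces mark non-identical ones.
    Gap columns ('-') count toward the alignment length.
    """
    if len(query) != len(midline):
        raise ValueError("query and midline must be the same length")
    if not query:
        raise ValueError("empty alignment block")
    identical = sum(1 for q, m in zip(query, midline) if m.isalpha() and m == q)
    return 100.0 * identical / len(query)

# Final block of the AT4G32285.2 alignment, copied from the record above.
query = "EYPSVQQPSDELIETLQEFLKDQA"
midline = "EYP VQ+ + +L+ETL+EF++D+A"
print(round(percent_identity(query, midline), 2))
```

To score a full hit, the query and midline strings of all its blocks would be concatenated before calling the function, taking care to preserve the leading and trailing spaces of each midline.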


Sequences
CDS sequence
ATGTTGATCGATCGGATCTCCCACTGGCAGAAATTGCTCGACAGGGCCATTGCAACCAGGCCCACCGGACCCGCCAAATCCAACCGCTTGGTCCTCTATACCCTCCACGC
CGTCGTCCACGAGAGCTTCGACCTCTACCGGGACATCTCCGACGGCCTCGCTCTCCTCCTCGACAGCTTCTTCCATTTGCAGTACCAATCCTGCGTCAATGCGTTTCAGG
CTTGCGTTAAGGCGGCCAAGCAATTTGAGGAGCTTGGTTCGTTTTACGATTTGTGTAAGAACATCGGAGTTGGGAGAACCTCCGAGTATCCCAGCGTTCAGCAGCCATCC
GATGAACTGATTGAGACATTGCAGGAGTTTTTGAAAGATCAAGCTTCGTTCCCCTGTCACGGCAACCGCTCGCCGCCGCCGCAGGTGTTTCTCCCGGCCAAGGACTCCAT
TCCCTCTATACTCAGCGATGATCTCGAACAATCGGAATCGTCGGATAGAAATTCGATGGGGGATATTATGAGCGCGACGGGGAGCTGGGCGAGTCCGACTAATTCGGTGG
AGCAAGACGGAGAAAGCTATTCCGATTATCAATCGGAGAAGCAATCTCAGTTTGGGGATTCCGCGAGCATGAACGGAAACGAGAACGAACAGAATCTGACCCCTAATTTT
GCGTTCTTCGACGACGGACCACAAGAAAACGTAAAGCCCTATCAAGAGCAGCTCCATTTTGGCGCTCCGAGTGGGAGATTAGGCGATTGGGAACTCGTTCTGGCGGAGTC
CGCAACGGAGCCGCCGCCTCAAGAGTGGCCGGATTTCTTCGCGCCGTCCATCGGAGACGACGAGTTCGTGAAATCCATCTCGCCGCAGCAGCAGTACAACAACCCGTTCC
TTCAAGGTGCAGACGATTTATTCGCAGTTCCGCCGACGTTCACAGCAGAATTCTCCAGTCCAGAGACGGAAATCGCTCCGACATTTCGGGCGCAGAATTTCAAGGAGGAG
ACGGTCGCCGTGGCCGTAGCCCCAACTTTCCGCGCAGAAACGGTGCCATTTTGTCCGGCGAATAATCTGAATTTTGAGGCATGGGGCTCGACGGAGACGGTTGTTGCGAA
CTCAACGAAGGAGAGCAATCTGTTTTCATTCAATCCGGCCGGGGAAGACGATCCATTTGCAGTTTCATGGCGTAGCAGTGAAAAGGGAAACGATCGGAGTTTTGATGGGA
AGACGATGAACGAAGAAAGTTTGTTGGAGCAGCAGAAATTGTGGAAGGAGCAGCAAAACAAGATTATAGCGAAGCATTTAGCGTGA
mRNA sequence
ATGTTGATCGATCGGATCTCCCACTGGCAGAAATTGCTCGACAGGGCCATTGCAACCAGGCCCACCGGACCCGCCAAATCCAACCGCTTGGTCCTCTATACCCTCCACGC
CGTCGTCCACGAGAGCTTCGACCTCTACCGGGACATCTCCGACGGCCTCGCTCTCCTCCTCGACAGCTTCTTCCATTTGCAGTACCAATCCTGCGTCAATGCGTTTCAGG
CTTGCGTTAAGGCGGCCAAGCAATTTGAGGAGCTTGGTTCGTTTTACGATTTGTGTAAGAACATCGGAGTTGGGAGAACCTCCGAGTATCCCAGCGTTCAGCAGCCATCC
GATGAACTGATTGAGACATTGCAGGAGTTTTTGAAAGATCAAGCTTCGTTCCCCTGTCACGGCAACCGCTCGCCGCCGCCGCAGGTGTTTCTCCCGGCCAAGGACTCCAT
TCCCTCTATACTCAGCGATGATCTCGAACAATCGGAATCGTCGGATAGAAATTCGATGGGGGATATTATGAGCGCGACGGGGAGCTGGGCGAGTCCGACTAATTCGGTGG
AGCAAGACGGAGAAAGCTATTCCGATTATCAATCGGAGAAGCAATCTCAGTTTGGGGATTCCGCGAGCATGAACGGAAACGAGAACGAACAGAATCTGACCCCTAATTTT
GCGTTCTTCGACGACGGACCACAAGAAAACGTAAAGCCCTATCAAGAGCAGCTCCATTTTGGCGCTCCGAGTGGGAGATTAGGCGATTGGGAACTCGTTCTGGCGGAGTC
CGCAACGGAGCCGCCGCCTCAAGAGTGGCCGGATTTCTTCGCGCCGTCCATCGGAGACGACGAGTTCGTGAAATCCATCTCGCCGCAGCAGCAGTACAACAACCCGTTCC
TTCAAGGTGCAGACGATTTATTCGCAGTTCCGCCGACGTTCACAGCAGAATTCTCCAGTCCAGAGACGGAAATCGCTCCGACATTTCGGGCGCAGAATTTCAAGGAGGAG
ACGGTCGCCGTGGCCGTAGCCCCAACTTTCCGCGCAGAAACGGTGCCATTTTGTCCGGCGAATAATCTGAATTTTGAGGCATGGGGCTCGACGGAGACGGTTGTTGCGAA
CTCAACGAAGGAGAGCAATCTGTTTTCATTCAATCCGGCCGGGGAAGACGATCCATTTGCAGTTTCATGGCGTAGCAGTGAAAAGGGAAACGATCGGAGTTTTGATGGGA
AGACGATGAACGAAGAAAGTTTGTTGGAGCAGCAGAAATTGTGGAAGGAGCAGCAAAACAAGATTATAGCGAAGCATTTAGCGTGA
Protein sequence
MLIDRISHWQKLLDRAIATRPTGPAKSNRLVLYTLHAVVHESFDLYRDISDGLALLLDSFFHLQYQSCVNAFQACVKAAKQFEELGSFYDLCKNIGVGRTSEYPSVQQPS
DELIETLQEFLKDQASFPCHGNRSPPPQVFLPAKDSIPSILSDDLEQSESSDRNSMGDIMSATGSWASPTNSVEQDGESYSDYQSEKQSQFGDSASMNGNENEQNLTPNF
AFFDDGPQENVKPYQEQLHFGAPSGRLGDWELVLAESATEPPPQEWPDFFAPSIGDDEFVKSISPQQQYNNPFLQGADDLFAVPPTFTAEFSSPETEIAPTFRAQNFKEE
TVAVAVAPTFRAETVPFCPANNLNFEAWGSTETVVANSTKESNLFSFNPAGEDDPFAVSWRSSEKGNDRSFDGKTMNEESLLEQQKLWKEQQNKIIAKHLA
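The CDS and protein sequences above can be cross-checked by translating the CDS. The following is a minimal sketch that assumes the standard nuclear genetic code applies to this gene; `translate` and `CODON_TABLE` are illustrative names, and the two input fragments are the first 30 and last 15 nucleotides of the CDS listed above.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Unknown codons (for example those containing N) become 'X', and a
    trailing partial codon is ignored.
    """
    cds = cds.upper()
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

print(translate("ATGTTGATCGATCGGATCTCCCACTGGCAG"))  # start of the CDS
print(translate("AAGCATTTAGCGTGA"))                 # end of the CDS, including the stop codon
```

Joining the wrapped CDS lines into one string and passing it to `translate` should allow a full comparison against the protein sequence shown above.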