; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CmoCh15G009300 (gene) of Cucurbita moschata (Rifu) v1 genome

Gene IDCmoCh15G009300
OrganismCucurbita moschata Rifu (Cucurbita moschata (Rifu) v1)
DescriptionENTH domain-containing protein
Genome locationCmo_Chr15:4759839..4762228
RNA-Seq ExpressionCmoCh15G009300
SyntenyCmoCh15G009300
Gene Ontology termsGO:0006900 - vesicle budding from membrane (biological process)
GO:0048268 - clathrin coat assembly (biological process)
GO:0072583 - clathrin-dependent endocytosis (biological process)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005905 - clathrin-coated pit (cellular component)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0000149 - SNARE binding (molecular function)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0005546 - phosphatidylinositol-4,5-bisphosphate binding (molecular function)
GO:0032050 - clathrin heavy chain binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR013809 - ENTH domain
IPR014712 - ANTH domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAG6579186.1 putative clathrin assembly protein, partial [Cucurbita argyrosperma subsp. sororia]2.1e-19499.16Show/hide
Query:  MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS
        MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS
Subjt:  MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS

Query:  APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP
        APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP
Subjt:  APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP

Query:  AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF
        AGRAARSTAVR+AMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF
Subjt:  AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF

Query:  GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA
        GIMWQLTESSSSFTSFTNTSESSSFNSPARDETE KVAATRRNVVVGTPWAISHDS FA
Subjt:  GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA

XP_022938716.1 putative clathrin assembly protein At2g25430 [Cucurbita moschata]1.7e-196100Show/hide
Query:  MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS
        MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS
Subjt:  MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS

Query:  APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP
        APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP
Subjt:  APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP

Query:  AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF
        AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF
Subjt:  AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF

Query:  GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA
        GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA
Subjt:  GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA

XP_022994032.1 putative clathrin assembly protein At2g25430 [Cucurbita maxima]2.1e-18996.1Show/hide
Query:  MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS
        MMQ+RFKQVLTAVKENCSVGYAK++TAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS
Subjt:  MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS

Query:  APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP
        APQNSEFRLELLRSRA GFISLYQRHIREDEDYASFIRSYARLLNEALDSD FYSTEVPGVSSE +A+GTTSSRIKKIN+VIEISTQMQSLIDRVIDCRP
Subjt:  APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP

Query:  AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF
        AGRAARSTAVR+AMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF
Subjt:  AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF

Query:  GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA
        GIMWQLTESSSSFTSFTNTSESSSFNSPARDETE KVAAT RNVVVGTPW ISH SGFA
Subjt:  GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA

XP_023551316.1 putative clathrin assembly protein At2g25430 [Cucurbita pepo subsp. pepo]1.9e-19599.44Show/hide
Query:  MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS
        MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS
Subjt:  MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS

Query:  APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP
        APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP
Subjt:  APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP

Query:  AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF
        AGRAARSTAVR+AMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF
Subjt:  AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF

Query:  GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA
        GIMWQLTESSSSFTSFTNTSESSSFNSPARDETE KVAATRRNVVVGTPWAISHDSGFA
Subjt:  GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA

XP_038875351.1 putative clathrin assembly protein At4g02650 [Benincasa hispida]4.7e-12571.95Show/hide
Query:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA
        MQRRF++VLT VKENCSVGYAKIVTA G+S+V+LIVIKAT+  DSPL EKYVQELL IFAFSP S R+F+LSFSRRFRKTRCWRVGLKCLLLLHRL+QS 
Subjt:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA

Query:  PQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPA
          N+EFRL LLRSRANG IS +Q  IREDEDY+SFIRSYARLL+E+L+SDLFY T+ P  SS  EA GT SSRI +IN+VIEIS  MQ+LID+VIDC+P 
Subjt:  PQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPA

Query:  GRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSFG
        GRAA+S  VR+AMKHI+RESF CYQS+ R+I+S ED LLQLP+RSC AA+ +Y+KA +QADRL+ELYDWCK +EVCS+++FPDI RIPE+RI+AL +S G
Subjt:  GRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSFG

Query:  IMWQLTESSSSFTSFTNTSESSSFNSPA
         MWQ+TESSSS TS ++ S+SS+  SPA
Subjt:  IMWQLTESSSSFTSFTNTSESSSFNSPA

TrEMBL top hitse value%identityAlignment
A0A0A0KUR2 ENTH domain-containing protein2.3e-11466.47Show/hide
Query:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA
        MQ RF++ LTAVKENCSV YAKIVTA G+S+V+LIVIKAT+P DSPL EKYVQELLKIFAFSP S R FSLSFSRRFRK+ C  V LKCLLLLHRL+QS 
Subjt:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA

Query:  PQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPA
        P N+EFRL LLRSR+NG ISLY  H R+DEDY +FIRSYAR L+EAL+SDL Y T+    S    ++GT SSRI +IN+VIE +TQMQ++IDRVIDC+P 
Subjt:  PQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPA

Query:  GRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSFG
        GR ++S  VR+AMK+IIRESF CY SVCRD++SIED LLQLP+RS  AAI +Y+KAA+QA++L+ELYDWCK +EVCS Y+FPDI RIPESRIQ + ++  
Subjt:  GRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSFG

Query:  IMWQLTESSSSFTSFTNTSESSSFNSPARDETEK
         MW++TESSSS TS   +  S    +  R E EK
Subjt:  IMWQLTESSSSFTSFTNTSESSSFNSPARDETEK

A0A1S3CSQ2 putative clathrin assembly protein At4g026501.5e-11361.29Show/hide
Query:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA
        M+ RF++ LTAVKENCSV YAKIVTA G+S+V+LIVIKAT+P DSPL EKYVQELLKIFAFSP S R+FSLSFSRRFRK+ C  V LKCLLLLHRL+QS 
Subjt:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA

Query:  PQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPA
        P N EFRL LLRSR+NG ISL+Q H R DEDY SFIRSYAR L+EAL+SDL Y  + P  S   +++GT  SRI +IN+VIE +TQMQ++IDRVIDC+P 
Subjt:  PQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPA

Query:  GRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSFG
        GR  +S  VR+AMK+IIRESF CY S+CRD++SIED LLQLP+RS  AAI +Y+KAA+QA++L+ LYDWCK +EVCS Y+FPDI RIPESRIQ + ++  
Subjt:  GRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSFG

Query:  IMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFAKTGQEELLRKGWEN
         MW++TESSSS      +S S    SP         A  R+ VVV + W    ++G  K    EL  + WE+
Subjt:  IMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFAKTGQEELLRKGWEN

A0A6J1CZB0 putative clathrin assembly protein At1g030506.2e-12369.79Show/hide
Query:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA
        MQRRF++VLT VKENCSVGYAKIVTAGGFS+V+LIV+KAT+P DSPL EKYVQELLKIFAFSP S R FS+SFSRRFR TRCWRVGLKCLLLLHRL+QS 
Subjt:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA

Query:  PQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPA
         +N+EFR ELLR RA+G+I L+QR IR+DEDYASFIRSY+ LL+E+L+ DLFY    P  S + EA+GT SSRI +IN+ IEI +QMQSLIDRVIDCRPA
Subjt:  PQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPA

Query:  GRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSFG
        GRAARS A+R AMKHI+RESFICY+  CRDI SIED LLQLP+RSCAAAI +Y+KAAVQA++L+ELY WCK + VC+ Y+FPD+ RIPESRIQAL    G
Subjt:  GRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSFG

Query:  IMWQLTESSSSFTSFTNTSESSSFNSP--ARDETEKKVAAT
         MW+LTESSS          SS  +SP  A DE   +V  T
Subjt:  IMWQLTESSSSFTSFTNTSESSSFNSP--ARDETEKKVAAT

A0A6J1FDX7 putative clathrin assembly protein At2g254308.4e-197100Show/hide
Query:  MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS
        MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS
Subjt:  MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS

Query:  APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP
        APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP
Subjt:  APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP

Query:  AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF
        AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF
Subjt:  AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF

Query:  GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA
        GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA
Subjt:  GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA

A0A6J1JUK5 putative clathrin assembly protein At2g254301.0e-18996.1Show/hide
Query:  MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS
        MMQ+RFKQVLTAVKENCSVGYAK++TAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS
Subjt:  MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQS

Query:  APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP
        APQNSEFRLELLRSRA GFISLYQRHIREDEDYASFIRSYARLLNEALDSD FYSTEVPGVSSE +A+GTTSSRIKKIN+VIEISTQMQSLIDRVIDCRP
Subjt:  APQNSEFRLELLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRP

Query:  AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF
        AGRAARSTAVR+AMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF
Subjt:  AGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSF

Query:  GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA
        GIMWQLTESSSSFTSFTNTSESSSFNSPARDETE KVAAT RNVVVGTPW ISH SGFA
Subjt:  GIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRNVVVGTPWAISHDSGFA

SwissProt top hitse value%identityAlignment
Q8GX47 Putative clathrin assembly protein At4g026501.8e-2626.22Show/hide
Query:  MMQRRFKQVLTAVKENCSVGYAKI-VTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQ
        M   + K+ + AVK+  SVG AK+   +   + + + V+KAT   D P  +KY++E+L + ++S         + SRR  KT+ W V LK L+L+ RL+ 
Subjt:  MMQRRFKQVLTAVKENCSVGYAKI-VTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQ

Query:  SAPQNSEFRLELLRSRANGFISL--YQRHIREDE-DYASFIRSYARLLNEALDSDLFYSTEV---------PGVSSEAEAMGTTSSRIK------KINKV
           +  E  +     R    +++  ++   + D  DY++F+R+YA  L+E LD  +                G S E +    TS+ I+      K   V
Subjt:  SAPQNSEFRLELLRSRANGFISL--YQRHIREDE-DYASFIRSYARLLNEALDSDLFYSTEV---------PGVSSEAEAMGTTSSRIK------KINKV

Query:  IEISTQ--------MQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKF
         E+ T+        +Q L+DR + CRP G A  +  V +AM  I++ESF  Y ++   +  + +  ++L          ++ + + Q D L   Y WCK 
Subjt:  IEISTQ--------MQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKF

Query:  VEVCSLYQFPDIERIPESRIQALISSFGIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRN
        + V    ++P++E+I + ++        +M +     S+  +   T++SSS  S   +E E K    + N
Subjt:  VEVCSLYQFPDIERIPESRIQALISSFGIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRN

Q8LF20 Putative clathrin assembly protein At2g254302.0e-2525Show/hide
Query:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA
        M    ++ + AVK+  S+G AK V +    ++ + ++KATS  D P SEKY++E+L + + S         S SRR  KTR W V LK L+L+HRL+   
Subjt:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA

Query:  PQNSEFRLELLRSRANGFISLYQRHIREDE-----DYASFIRSYARLLNEALDSDLFYSTEVPGVSS---------------------------------
          +  F+ E+L S   G   L     R++      D+++F+R+YA  L++ L+  LF       V+S                                 
Subjt:  PQNSEFRLELLRSRANGFISLYQRHIREDE-----DYASFIRSYARLLNEALDSDLFYSTEVPGVSS---------------------------------

Query:  ----------EAEAMGTTSSRIKKINKVIEIS-----------------------------TQMQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFIC
                  +    G    R +    + E+                                +Q L+DR +  RP G A  S  + +A+  ++RESF  
Subjt:  ----------EAEAMGTTSSRIKKINKVIEIS-----------------------------TQMQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFIC

Query:  YQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQAL
        Y  +C  +  + D    + +  C  A   Y  AA Q D L   Y+WCK   V    ++P+++RI    ++ L
Subjt:  YQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQAL

Q8S9J8 Probable clathrin assembly protein At4g322857.2e-2825.85Show/hide
Query:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA
        M    ++ +  VK+  S+G AK V +    ++ + ++KATS  D   S+KY++E+L + + S         S SRR +KTR W V LK L+L+HRL+   
Subjt:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA

Query:  PQNSEFRLELLRSRANGFISLYQRHIREDE-----DYASFIRSYARLLNEALDSDLF--------------------------------------YST--
          +  F+ E+L +   G   L     R++      D+++F+R+YA  L++ L+  LF                                      Y T  
Subjt:  PQNSEFRLELLRSRANGFISLYQRHIREDE-----DYASFIRSYARLLNEALDSDLF--------------------------------------YST--

Query:  ------------EVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPH
                    +V  + +  E    T  R     ++      +Q L+DR + CRP G A  S  + +AM  +++ESF  Y  +C  +  + D    + +
Subjt:  ------------EVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPH

Query:  RSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQAL
          C  A   Y  AA Q D L   Y WCK   V    ++P+++RI    ++ L
Subjt:  RSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQAL

Q9LVD8 Putative clathrin assembly protein At5g572002.0e-2527.7Show/hide
Query:  FKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFA-FSPTSCRTFSL-SFSRRFRKTRCWRVGLKCLLLLHRLIQSAPQ
        F++   A+K+  +VG AK+ +   F ++++ ++KAT+  +SP  E++V+++    +   P +   + + + S+R  KTR W V +K L+++HR ++    
Subjt:  FKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFA-FSPTSCRTFSL-SFSRRFRKTRCWRVGLKCLLLLHRLIQSAPQ

Query:  NSEFRLELLRSRANGFISLYQRHIREDE-----DYASFIRSYARLLNEALD--SDLFYSTEVPGV-SSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRV
        +  FR ELL       I L   + ++D      D ++++R+YA  L E L+    L Y  E   +  +   A  T  +R+     ++E    +Q L+ R+
Subjt:  NSEFRLELLRSRANGFISLYQRHIREDE-----DYASFIRSYARLLNEALD--SDLFYSTEVPGV-SSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRV

Query:  IDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPES
        I C+P G A  +  ++ A+  +++ESF  Y ++   I ++ D   ++       A+ +Y++A  QA+ LAE YD+CK +E+   +QFP + + P S
Subjt:  IDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPES

Q9SA65 Putative clathrin assembly protein At1g030509.4e-2824.93Show/hide
Query:  MMQRRFKQVLTAVKENCSVGYAKI-VTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQ
        M   +FK+ + AVK+  SVG AK+   +   S +++ ++KAT   + P  EKY++E+L + ++S +       + SRR  KT+CW V LK L+L+ RL+ 
Subjt:  MMQRRFKQVLTAVKENCSVGYAKI-VTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQ

Query:  SAPQNSEFRLELLRSRANGFISLYQ-RHIREDE--DYASFIRSYARLLNEALDSDLFYSTEVPGV----------SSEAEAMGTTSSRIKKINKVIEIST
           Q  E  +     R    +++   R +      DY++F+R+YA  L+E LD  +       GV            +  A   +++ + +   + E+ T
Subjt:  SAPQNSEFRLELLRSRANGFISLYQ-RHIREDE--DYASFIRSYARLLNEALDSDLFYSTEVPGV----------SSEAEAMGTTSSRIKKINKVIEIST

Query:  Q--------MQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCS
        +        +Q L+DR + CRP G A  +  V +A+  I++ESF  Y  V   +  + +  ++L          ++ + + Q + L + Y WCK + +  
Subjt:  Q--------MQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCS

Query:  LYQFPDIERIPESRIQALISSFGIMWQLTESSSSFTSFTNTSESSSFNSPARDETEK
          ++P+IE+I + ++            + E     ++  +T +S S  S A ++ ++
Subjt:  LYQFPDIERIPESRIQALISSFGIMWQLTESSSSFTSFTNTSESSSFNSPARDETEK

Arabidopsis top hitse value%identityAlignment
AT1G03050.1 ENTH/ANTH/VHS superfamily protein6.7e-2924.93Show/hide
Query:  MMQRRFKQVLTAVKENCSVGYAKI-VTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQ
        M   +FK+ + AVK+  SVG AK+   +   S +++ ++KAT   + P  EKY++E+L + ++S +       + SRR  KT+CW V LK L+L+ RL+ 
Subjt:  MMQRRFKQVLTAVKENCSVGYAKI-VTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQ

Query:  SAPQNSEFRLELLRSRANGFISLYQ-RHIREDE--DYASFIRSYARLLNEALDSDLFYSTEVPGV----------SSEAEAMGTTSSRIKKINKVIEIST
           Q  E  +     R    +++   R +      DY++F+R+YA  L+E LD  +       GV            +  A   +++ + +   + E+ T
Subjt:  SAPQNSEFRLELLRSRANGFISLYQ-RHIREDE--DYASFIRSYARLLNEALDSDLFYSTEVPGV----------SSEAEAMGTTSSRIKKINKVIEIST

Query:  Q--------MQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCS
        +        +Q L+DR + CRP G A  +  V +A+  I++ESF  Y  V   +  + +  ++L          ++ + + Q + L + Y WCK + +  
Subjt:  Q--------MQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCS

Query:  LYQFPDIERIPESRIQALISSFGIMWQLTESSSSFTSFTNTSESSSFNSPARDETEK
          ++P+IE+I + ++            + E     ++  +T +S S  S A ++ ++
Subjt:  LYQFPDIERIPESRIQALISSFGIMWQLTESSSSFTSFTNTSESSSFNSPARDETEK

AT2G25430.1 epsin N-terminal homology (ENTH) domain-containing protein / clathrin assembly protein-related1.4e-2625Show/hide
Query:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA
        M    ++ + AVK+  S+G AK V +    ++ + ++KATS  D P SEKY++E+L + + S         S SRR  KTR W V LK L+L+HRL+   
Subjt:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA

Query:  PQNSEFRLELLRSRANGFISLYQRHIREDE-----DYASFIRSYARLLNEALDSDLFYSTEVPGVSS---------------------------------
          +  F+ E+L S   G   L     R++      D+++F+R+YA  L++ L+  LF       V+S                                 
Subjt:  PQNSEFRLELLRSRANGFISLYQRHIREDE-----DYASFIRSYARLLNEALDSDLFYSTEVPGVSS---------------------------------

Query:  ----------EAEAMGTTSSRIKKINKVIEIS-----------------------------TQMQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFIC
                  +    G    R +    + E+                                +Q L+DR +  RP G A  S  + +A+  ++RESF  
Subjt:  ----------EAEAMGTTSSRIKKINKVIEIS-----------------------------TQMQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFIC

Query:  YQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQAL
        Y  +C  +  + D    + +  C  A   Y  AA Q D L   Y+WCK   V    ++P+++RI    ++ L
Subjt:  YQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQAL

AT4G02650.1 ENTH/ANTH/VHS superfamily protein1.3e-2726.22Show/hide
Query:  MMQRRFKQVLTAVKENCSVGYAKI-VTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQ
        M   + K+ + AVK+  SVG AK+   +   + + + V+KAT   D P  +KY++E+L + ++S         + SRR  KT+ W V LK L+L+ RL+ 
Subjt:  MMQRRFKQVLTAVKENCSVGYAKI-VTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQ

Query:  SAPQNSEFRLELLRSRANGFISL--YQRHIREDE-DYASFIRSYARLLNEALDSDLFYSTEV---------PGVSSEAEAMGTTSSRIK------KINKV
           +  E  +     R    +++  ++   + D  DY++F+R+YA  L+E LD  +                G S E +    TS+ I+      K   V
Subjt:  SAPQNSEFRLELLRSRANGFISL--YQRHIREDE-DYASFIRSYARLLNEALDSDLFYSTEV---------PGVSSEAEAMGTTSSRIK------KINKV

Query:  IEISTQ--------MQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKF
         E+ T+        +Q L+DR + CRP G A  +  V +AM  I++ESF  Y ++   +  + +  ++L          ++ + + Q D L   Y WCK 
Subjt:  IEISTQ--------MQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKF

Query:  VEVCSLYQFPDIERIPESRIQALISSFGIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRN
        + V    ++P++E+I + ++        +M +     S+  +   T++SSS  S   +E E K    + N
Subjt:  VEVCSLYQFPDIERIPESRIQALISSFGIMWQLTESSSSFTSFTNTSESSSFNSPARDETEKKVAATRRN

AT4G32285.1 ENTH/ANTH/VHS superfamily protein5.1e-2925.85Show/hide
Query:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA
        M    ++ +  VK+  S+G AK V +    ++ + ++KATS  D   S+KY++E+L + + S         S SRR +KTR W V LK L+L+HRL+   
Subjt:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA

Query:  PQNSEFRLELLRSRANGFISLYQRHIREDE-----DYASFIRSYARLLNEALDSDLF--------------------------------------YST--
          +  F+ E+L +   G   L     R++      D+++F+R+YA  L++ L+  LF                                      Y T  
Subjt:  PQNSEFRLELLRSRANGFISLYQRHIREDE-----DYASFIRSYARLLNEALDSDLF--------------------------------------YST--

Query:  ------------EVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPH
                    +V  + +  E    T  R     ++      +Q L+DR + CRP G A  S  + +AM  +++ESF  Y  +C  +  + D    + +
Subjt:  ------------EVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPH

Query:  RSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQAL
          C  A   Y  AA Q D L   Y WCK   V    ++P+++RI    ++ L
Subjt:  RSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQAL

AT4G32285.2 ENTH/ANTH/VHS superfamily protein5.1e-2925.85Show/hide
Query:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA
        M    ++ +  VK+  S+G AK V +    ++ + ++KATS  D   S+KY++E+L + + S         S SRR +KTR W V LK L+L+HRL+   
Subjt:  MQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSA

Query:  PQNSEFRLELLRSRANGFISLYQRHIREDE-----DYASFIRSYARLLNEALDSDLF--------------------------------------YST--
          +  F+ E+L +   G   L     R++      D+++F+R+YA  L++ L+  LF                                      Y T  
Subjt:  PQNSEFRLELLRSRANGFISLYQRHIREDE-----DYASFIRSYARLLNEALDSDLF--------------------------------------YST--

Query:  ------------EVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPH
                    +V  + +  E    T  R     ++      +Q L+DR + CRP G A  S  + +AM  +++ESF  Y  +C  +  + D    + +
Subjt:  ------------EVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRESFICYQSVCRDINSIEDGLLQLPH

Query:  RSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQAL
          C  A   Y  AA Q D L   Y WCK   V    ++P+++RI    ++ L
Subjt:  RSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQAL


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGATGCAGAGGAGATTCAAGCAAGTTCTCACTGCCGTCAAAGAGAATTGTTCCGTCGGCTACGCCAAAATCGTCACAGCCGGAGGATTTTCCAACGTCAATCTCATAGT
CATCAAAGCCACCTCTCCAACTGACTCACCGTTGTCGGAGAAGTACGTTCAAGAGCTTCTCAAGATCTTCGCCTTCTCTCCGACGTCATGTCGCACCTTTTCTCTCAGCT
TTTCCCGTCGATTTCGAAAAACTCGCTGTTGGCGAGTTGGACTCAAATGTCTGCTTCTACTCCACAGATTGATCCAATCAGCCCCTCAGAACAGTGAATTTCGATTAGAG
CTTCTTCGCAGCCGGGCCAATGGATTCATTTCTCTATATCAGCGCCACATCCGAGAGGATGAAGATTATGCCTCTTTCATCAGATCCTACGCTCGGTTGCTTAATGAAGC
TCTGGATTCCGATTTGTTCTATAGCACCGAAGTACCGGGCGTTTCATCTGAAGCCGAAGCTATGGGAACAACTTCAAGTAGAATAAAAAAAATTAACAAAGTAATTGAAA
TATCGACACAGATGCAGAGCCTAATTGACAGAGTAATCGACTGCAGGCCGGCCGGGAGAGCGGCCCGAAGCACCGCAGTTCGAATGGCGATGAAGCACATAATACGTGAG
AGCTTCATTTGTTATCAGTCCGTCTGTCGGGATATCAATTCAATCGAAGACGGTCTTCTTCAATTGCCGCACCGGAGTTGCGCTGCAGCGATCCGAGTATACAGGAAGGC
GGCCGTTCAAGCAGATCGACTAGCGGAGCTTTACGATTGGTGCAAGTTCGTCGAAGTGTGTAGCCTGTATCAATTCCCCGATATCGAACGTATTCCGGAATCGCGGATCC
AAGCCCTAATATCATCCTTTGGCATTATGTGGCAGCTGACGGAATCGTCGTCTTCATTTACCTCTTTCACAAATACATCAGAATCGTCGTCGTTCAACTCTCCGGCAAGG
GATGAGACTGAGAAGAAAGTTGCGGCTACGCGAAGAAACGTTGTTGTGGGCACTCCATGGGCGATTTCTCACGACAGCGGATTCGCAAAAACAGGGCAGGAGGAACTGCT
AAGGAAGGGATGGGAAAATGCCCCAAGTTCACACTAA
mRNA sequenceShow/hide mRNA sequence
ATGATGCAGAGGAGATTCAAGCAAGTTCTCACTGCCGTCAAAGAGAATTGTTCCGTCGGCTACGCCAAAATCGTCACAGCCGGAGGATTTTCCAACGTCAATCTCATAGT
CATCAAAGCCACCTCTCCAACTGACTCACCGTTGTCGGAGAAGTACGTTCAAGAGCTTCTCAAGATCTTCGCCTTCTCTCCGACGTCATGTCGCACCTTTTCTCTCAGCT
TTTCCCGTCGATTTCGAAAAACTCGCTGTTGGCGAGTTGGACTCAAATGTCTGCTTCTACTCCACAGATTGATCCAATCAGCCCCTCAGAACAGTGAATTTCGATTAGAG
CTTCTTCGCAGCCGGGCCAATGGATTCATTTCTCTATATCAGCGCCACATCCGAGAGGATGAAGATTATGCCTCTTTCATCAGATCCTACGCTCGGTTGCTTAATGAAGC
TCTGGATTCCGATTTGTTCTATAGCACCGAAGTACCGGGCGTTTCATCTGAAGCCGAAGCTATGGGAACAACTTCAAGTAGAATAAAAAAAATTAACAAAGTAATTGAAA
TATCGACACAGATGCAGAGCCTAATTGACAGAGTAATCGACTGCAGGCCGGCCGGGAGAGCGGCCCGAAGCACCGCAGTTCGAATGGCGATGAAGCACATAATACGTGAG
AGCTTCATTTGTTATCAGTCCGTCTGTCGGGATATCAATTCAATCGAAGACGGTCTTCTTCAATTGCCGCACCGGAGTTGCGCTGCAGCGATCCGAGTATACAGGAAGGC
GGCCGTTCAAGCAGATCGACTAGCGGAGCTTTACGATTGGTGCAAGTTCGTCGAAGTGTGTAGCCTGTATCAATTCCCCGATATCGAACGTATTCCGGAATCGCGGATCC
AAGCCCTAATATCATCCTTTGGCATTATGTGGCAGCTGACGGAATCGTCGTCTTCATTTACCTCTTTCACAAATACATCAGAATCGTCGTCGTTCAACTCTCCGGCAAGG
GATGAGACTGAGAAGAAAGTTGCGGCTACGCGAAGAAACGTTGTTGTGGGCACTCCATGGGCGATTTCTCACGACAGCGGATTCGCAAAAACAGGGCAGGAGGAACTGCT
AAGGAAGGGATGGGAAAATGCCCCAAGTTCACACTAA
Protein sequenceShow/hide protein sequence
MMQRRFKQVLTAVKENCSVGYAKIVTAGGFSNVNLIVIKATSPTDSPLSEKYVQELLKIFAFSPTSCRTFSLSFSRRFRKTRCWRVGLKCLLLLHRLIQSAPQNSEFRLE
LLRSRANGFISLYQRHIREDEDYASFIRSYARLLNEALDSDLFYSTEVPGVSSEAEAMGTTSSRIKKINKVIEISTQMQSLIDRVIDCRPAGRAARSTAVRMAMKHIIRE
SFICYQSVCRDINSIEDGLLQLPHRSCAAAIRVYRKAAVQADRLAELYDWCKFVEVCSLYQFPDIERIPESRIQALISSFGIMWQLTESSSSFTSFTNTSESSSFNSPAR
DETEKKVAATRRNVVVGTPWAISHDSGFAKTGQEELLRKGWENAPSSH