; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017036 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017036
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionENTH domain-containing protein
Genome locationtig00153017:937401..938703
RNA-Seq ExpressionSgr017036
SyntenySgr017036
Gene Ontology termsGO:0016192 - vesicle-mediated transport (biological process)
GO:0048268 - clathrin coat assembly (biological process)
GO:0030136 - clathrin-coated vesicle (cellular component)
GO:0005545 - 1-phosphatidylinositol binding (molecular function)
GO:0030276 - clathrin binding (molecular function)
InterPro domainsIPR008942 - ENTH/VHS
IPR011417 - AP180 N-terminal homology (ANTH) domain
IPR014712 - ANTH domain superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046396.1 putative clathrin assembly protein [Cucumis melo var. makuwa]4.9e-9853.16Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV
        M+ RFRR LTAVKENCSV YAK+VT  G+SDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPS+R+FSLSFSR                   RLLQS+
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV

Query:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDS-GDEGIETTSRRINEIRRVIEILTQLQSFID--------
        P+N  FR  LLRSR+NG +SLH+CH R DED+ SFIRSYAR LDEALN DL Y  K PDDS   + I T   RINEI RVIE  TQ+Q+ ID        
Subjt:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDS-GDEGIETTSRRINEIRRVIEILTQLQSFID--------

Query:  ----------------------------RGYRFSRDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS
                                    R      D LLQLPYRS VAAIGIYKKAA+QA+QLS LY WCKL+ VC  YEFPD++RIPE+RIQ +EA+V 
Subjt:  ----------------------------RGYRFSRDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS

Query:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFT--WDPMSCELK
        +MW++TESSS+S  S+S  S+SP           A  +  VVV ++WE FE    P+        L+ELEE +    WEDLLEAS SFT  WD + CE  
Subjt:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFT--WDPMSCELK

Query:  SYDWDQNKEGSGTEKMQLYNPTPLNPF
             +N EG   E +     + LNPF
Subjt:  SYDWDQNKEGSGTEKMQLYNPTPLNPF

XP_008467093.1 PREDICTED: putative clathrin assembly protein At4g02650 [Cucumis melo]7.0e-9752.93Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV
        M+ RFRR LTAVKENCSV YAK+VT  G+SDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPS+R+FSLSFSR                   RLLQS+
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV

Query:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDS-GDEGIETTSRRINEIRRVIEILTQLQSFID--------
        P+N  FR  LLRSR+NG +SLH+CH R DED+ SFIRSYAR LDEALN DL Y  K PDDS   + I T   RINEI RVIE  TQ+Q+ ID        
Subjt:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDS-GDEGIETTSRRINEIRRVIEILTQLQSFID--------

Query:  ----------------------------RGYRFSRDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS
                                    R      D LLQLPYRS VAAI IYKKAA+QA+QLS LY WCKL+ VC  YEFPD++RIPE+RIQ +EA+V 
Subjt:  ----------------------------RGYRFSRDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS

Query:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFT--WDPMSCELK
        +MW++TESSS+S  S+S  S+SP           A  +  VVV ++WE FE    P+        L+ELEE +    WEDLLEAS SFT  WD       
Subjt:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFT--WDPMSCELK

Query:  SYDWDQNKEGSGTEKMQLYNPTPLNPF
        S  W++N EG   E +     + LNPF
Subjt:  SYDWDQNKEGSGTEKMQLYNPTPLNPF

XP_011656244.1 putative clathrin assembly protein At1g03050 [Cucumis sativus]1.1e-9752.94Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV
        MQ RFRR LTAVKENCSV YAK+VT  G+SDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPS+RAFSLSFSR                   RLLQS+
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV

Query:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDSG-DEGIETTSRRINEIRRVIEILTQLQSFIDRGY-----
        P+N+ FR  LLRSR+NG +SL+ CH R DED+ +FIRSYAR LDEALN DL Y  K  DDS     I T S RINEI RVIE  TQ+Q+ IDR       
Subjt:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDSG-DEGIETTSRRINEIRRVIEILTQLQSFIDRGY-----

Query:  -RFSR------------------------------DGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS
         R S+                              D LLQLPYRS VAAIGIYKKAA+QA+QLSELY WCKL+ VC  YEFPD++RIPE+RIQ +EA+V 
Subjt:  -RFSR------------------------------DGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS

Query:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFTWDPMSCELKSY
        +MW++TESSS+ST+S +S                   +   VV ++WE FE    P         L+ELEE +    WEDLLEAS SFT      E  S 
Subjt:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFTWDPMSCELKSY

Query:  DWDQNKEGSGTEKMQLYNPTPLNPF
        +W+ N EG   E +     + LNPF
Subjt:  DWDQNKEGSGTEKMQLYNPTPLNPF

XP_022146332.1 putative clathrin assembly protein At1g03050 [Momordica charantia]3.2e-12660.73Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV
        MQRRFRR+LT VKENCSVGYAK+VT GGFSDVDLIV+KATAPNDSPLPEKYVQELLKIFAFSPPSFRAFS+SFSR                   RLLQSV
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV

Query:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDSGDEGIETTSRRINEIRRVIEILTQLQSFIDR--------
         EN+ FR ELLR RA+GW+ LH+  IRDDED+ASFIRSY+ LLDE+LNCDLFY+A  PDDSGDE I T S RI+EI R IEIL+Q+QS IDR        
Subjt:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDSGDEGIETTSRRINEIRRVIEILTQLQSFIDR--------

Query:  ------GYRFS----------------------RDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSK
                RF+                       D LLQLPYRSC AAI IYKKAAVQA+QLSELYGWCK +GVC  YEFPDV RIPE+RIQALE  V +
Subjt:  ------GYRFS----------------------RDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSK

Query:  MWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIEL-----EEENQDSRWEDLLEASASFTWDPMSCE
        MW+LTESS     S SS S+SPP ANDE VN+V      V  +++WETFEG+D  E E RKEK LI+L     EEE     WEDLLEASA          
Subjt:  MWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIEL-----EEENQDSRWEDLLEASASFTWDPMSCE

Query:  LKSYDWDQNKEGSGTEKMQLYNPTPLNPFRHGCFFPTL
          S+ WDQN  G  T  +QLYNPT +NPF H CFFPTL
Subjt:  LKSYDWDQNKEGSGTEKMQLYNPTPLNPFRHGCFFPTL

XP_038875351.1 putative clathrin assembly protein At4g02650 [Benincasa hispida]2.7e-10453.33Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV
        MQRRFRR+LT VKENCSVGYAK+VT  G+SDVDLIVIKATA NDSPLPEKYVQELL IFAFSPPS+R+F+LSFSR                   RLLQSV
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV

Query:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDD-SGDEGIETTSRRINEIRRVIEILTQLQSFID--------
         +N+ FR  LLRSRANG +S H+  IR+DED++SFIRSYARLLDE+LN DLFY  K PDD SG+E   T S RINEI RVIEI   +Q+ ID        
Subjt:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDD-SGDEGIETTSRRINEIRRVIEILTQLQSFID--------

Query:  ----------------------------RGYRFSRDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS
                                    R    + D LLQLPYRSC+AA+ IYKKA +QAD+LSELY WCKL+ VC ++EFPD++RIPEARI+ALEASV 
Subjt:  ----------------------------RGYRFSRDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS

Query:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFT--WDPMSCELK
        +MWQ+TESSS+ T+S++S S                       ST      G++   KER + KPL+ELEE +    WEDLLEASASFT  WD       
Subjt:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFT--WDPMSCELK

Query:  SYDWDQNKEGSGTEKMQLYNPTPLNPFRHGCFFPT
           W+ N+E    E+M+ ++P PLNPF H  FFPT
Subjt:  SYDWDQNKEGSGTEKMQLYNPTPLNPFRHGCFFPT

TrEMBL top hitse value%identityAlignment
A0A0A0KUR2 ENTH domain-containing protein5.3e-9852.94Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV
        MQ RFRR LTAVKENCSV YAK+VT  G+SDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPS+RAFSLSFSR                   RLLQS+
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV

Query:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDSG-DEGIETTSRRINEIRRVIEILTQLQSFIDRGY-----
        P+N+ FR  LLRSR+NG +SL+ CH R DED+ +FIRSYAR LDEALN DL Y  K  DDS     I T S RINEI RVIE  TQ+Q+ IDR       
Subjt:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDSG-DEGIETTSRRINEIRRVIEILTQLQSFIDRGY-----

Query:  -RFSR------------------------------DGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS
         R S+                              D LLQLPYRS VAAIGIYKKAA+QA+QLSELY WCKL+ VC  YEFPD++RIPE+RIQ +EA+V 
Subjt:  -RFSR------------------------------DGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS

Query:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFTWDPMSCELKSY
        +MW++TESSS+ST+S +S                   +   VV ++WE FE    P         L+ELEE +    WEDLLEAS SFT      E  S 
Subjt:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFTWDPMSCELKSY

Query:  DWDQNKEGSGTEKMQLYNPTPLNPF
        +W+ N EG   E +     + LNPF
Subjt:  DWDQNKEGSGTEKMQLYNPTPLNPF

A0A1S3CSQ2 putative clathrin assembly protein At4g026503.4e-9752.93Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV
        M+ RFRR LTAVKENCSV YAK+VT  G+SDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPS+R+FSLSFSR                   RLLQS+
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV

Query:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDS-GDEGIETTSRRINEIRRVIEILTQLQSFID--------
        P+N  FR  LLRSR+NG +SLH+CH R DED+ SFIRSYAR LDEALN DL Y  K PDDS   + I T   RINEI RVIE  TQ+Q+ ID        
Subjt:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDS-GDEGIETTSRRINEIRRVIEILTQLQSFID--------

Query:  ----------------------------RGYRFSRDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS
                                    R      D LLQLPYRS VAAI IYKKAA+QA+QLS LY WCKL+ VC  YEFPD++RIPE+RIQ +EA+V 
Subjt:  ----------------------------RGYRFSRDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS

Query:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFT--WDPMSCELK
        +MW++TESSS+S  S+S  S+SP           A  +  VVV ++WE FE    P+        L+ELEE +    WEDLLEAS SFT  WD       
Subjt:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFT--WDPMSCELK

Query:  SYDWDQNKEGSGTEKMQLYNPTPLNPF
        S  W++N EG   E +     + LNPF
Subjt:  SYDWDQNKEGSGTEKMQLYNPTPLNPF

A0A5D3CT35 Putative clathrin assembly protein2.4e-9853.16Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV
        M+ RFRR LTAVKENCSV YAK+VT  G+SDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPS+R+FSLSFSR                   RLLQS+
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV

Query:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDS-GDEGIETTSRRINEIRRVIEILTQLQSFID--------
        P+N  FR  LLRSR+NG +SLH+CH R DED+ SFIRSYAR LDEALN DL Y  K PDDS   + I T   RINEI RVIE  TQ+Q+ ID        
Subjt:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDS-GDEGIETTSRRINEIRRVIEILTQLQSFID--------

Query:  ----------------------------RGYRFSRDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS
                                    R      D LLQLPYRS VAAIGIYKKAA+QA+QLS LY WCKL+ VC  YEFPD++RIPE+RIQ +EA+V 
Subjt:  ----------------------------RGYRFSRDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS

Query:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFT--WDPMSCELK
        +MW++TESSS+S  S+S  S+SP           A  +  VVV ++WE FE    P+        L+ELEE +    WEDLLEAS SFT  WD + CE  
Subjt:  KMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASASFT--WDPMSCELK

Query:  SYDWDQNKEGSGTEKMQLYNPTPLNPF
             +N EG   E +     + LNPF
Subjt:  SYDWDQNKEGSGTEKMQLYNPTPLNPF

A0A6J1CZB0 putative clathrin assembly protein At1g030501.6e-12660.73Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV
        MQRRFRR+LT VKENCSVGYAK+VT GGFSDVDLIV+KATAPNDSPLPEKYVQELLKIFAFSPPSFRAFS+SFSR                   RLLQSV
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV

Query:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDSGDEGIETTSRRINEIRRVIEILTQLQSFIDR--------
         EN+ FR ELLR RA+GW+ LH+  IRDDED+ASFIRSY+ LLDE+LNCDLFY+A  PDDSGDE I T S RI+EI R IEIL+Q+QS IDR        
Subjt:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDSGDEGIETTSRRINEIRRVIEILTQLQSFIDR--------

Query:  ------GYRFS----------------------RDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSK
                RF+                       D LLQLPYRSC AAI IYKKAAVQA+QLSELYGWCK +GVC  YEFPDV RIPE+RIQALE  V +
Subjt:  ------GYRFS----------------------RDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSK

Query:  MWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIEL-----EEENQDSRWEDLLEASASFTWDPMSCE
        MW+LTESS     S SS S+SPP ANDE VN+V      V  +++WETFEG+D  E E RKEK LI+L     EEE     WEDLLEASA          
Subjt:  MWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIEL-----EEENQDSRWEDLLEASASFTWDPMSCE

Query:  LKSYDWDQNKEGSGTEKMQLYNPTPLNPFRHGCFFPTL
          S+ WDQN  G  T  +QLYNPT +NPF H CFFPTL
Subjt:  LKSYDWDQNKEGSGTEKMQLYNPTPLNPFRHGCFFPTL

A0A6J1JUK5 putative clathrin assembly protein At2g254304.6e-9455.97Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV
        MQ+RF+++LTAVKENCSVGYAK++T GGFS+V+LIVIKAT+P DSPL EKYVQELLKIFAFSP S R FSLSFSR                   RL+QS 
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSR-------------------RLLQSV

Query:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDSGD-EGIETTSRRINEIRRVIEILTQLQSFIDR------G
        P+NS FR ELLRSRA G++SL++ HIR+DED+ASFIRSYARLL+EAL+ D FY+ + P  S + + I TTS RI +I RVIEI TQ+QS IDR       
Subjt:  PENSAFRFELLRSRANGWMSLHRCHIRDDEDFASFIRSYARLLDEALNCDLFYNAKPPDDSGD-EGIETTSRRINEIRRVIEILTQLQSFIDR------G

Query:  YRFSR------------------------------DGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS
         R +R                              DGLLQLP+RSC AAI +Y+KAAVQAD+L+ELY WCK + VC LY+FPD++RIPE+RIQAL +S  
Subjt:  YRFSR------------------------------DGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVS

Query:  KMWQLTESSS--TSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQD
         MWQLTESSS  TS T+TS SS     A DE  NKVAA   NVVV T WE    + F        KPLIELE E +D
Subjt:  KMWQLTESSS--TSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQD

SwissProt top hitse value%identityAlignment
Q8GX47 Putative clathrin assembly protein At4g026502.3e-1021.07Show/hide
Query:  RFRRLLTAVKENCSVGYAKVVTVGG----FSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VP
        + +R + AVK+  SVG AK   VGG     +++++ V+KAT  +D P  +KY++E+L + ++S     A   + SRRL ++                   
Subjt:  RFRRLLTAVKENCSVGYAKVVTVGG----FSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VP

Query:  ENSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDL---------FYNAKPPDDSGDE-------------GIETTSRRIN
         + A+  E+  +   G   L+    RD       D+++F+R+YA  LDE L+  +                 DSG+E              I   S+ + 
Subjt:  ENSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDL---------FYNAKPPDDSGDE-------------GIETTSRRIN

Query:  EIR--RVIEILTQLQSFIDRGYRFSRDG------------------------------------LLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLI
        E++  ++   +  LQ  +DR       G                                     ++L     +    I+ + + Q D+L   YGWCK +
Subjt:  EIR--RVIEILTQLQSFIDRGYRFSRDG------------------------------------LLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLI

Query:  GVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEEN
         V    E+P++++I + ++  ++       +     S     T+ SS    + ++E  +K   +Q N       +     +  E+E  +EK   + + E 
Subjt:  GVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEEN

Query:  QDSRWE---DLLE
          SR +   DLL+
Subjt:  QDSRWE---DLLE

Q8LF20 Putative clathrin assembly protein At2g254301.8e-1021.88Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VPE
        M    R+ + AVK+  S+G AKV +     D+++ ++KAT+ +D P  EKY++E+L + + S     A   S SRRL ++                    
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VPE

Query:  NSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDLF------------------------------YNAKPP---------
        +  F+ E+L S   G   L+    RD+      D ++F+R+YA  LD+ L   LF                              + + PP         
Subjt:  NSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDLF------------------------------YNAKPP---------

Query:  --------DDSGDEGIETTSRRINEI--------------------------RRVIEILTQLQSFIDRGYRFSRDGL-----------------------
                D++G  G+   SR   ++                           R+   +  LQ  +DR       GL                       
Subjt:  --------DDSGDEGIETTSRRINEI--------------------------RRVIEILTQLQSFIDRGYRFSRDGL-----------------------

Query:  -------------LQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPS
                       + Y  CV A   Y  AA Q D+L   Y WCK  GV    E+P+V RI    ++ LE  V       + +    +      E+PP 
Subjt:  -------------LQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPS

Query:  ANDE-----PVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKP-----LIELEEE
          +E      +N++ A+          E +     PE E + EKP     L+ L E+
Subjt:  ANDE-----PVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKP-----LIELEEE

Q8S9J8 Probable clathrin assembly protein At4g322858.9e-1021.94Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VPE
        M    R+ +  VK+  S+G AKV +     D+++ ++KAT+ +D    +KY++E+L + + S     A   S SRRL ++                    
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VPE

Query:  NSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDLF----------------------------YNAKPP-----DDSGDE
        +  F+ E+L +   G   L+    RD+      D ++F+R+YA  LD+ L   LF                            + + PP     +     
Subjt:  NSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDLF----------------------------YNAKPP-----DDSGDE

Query:  GIETTSRR---INEI-----------------RRVIEILTQLQSFIDRGYRFSRDGL------------------------------------LQLPYRS
        G+   SR    +NEI                  R+   +  LQ  +DR       GL                                      + Y  
Subjt:  GIETTSRR---INEI-----------------RRVIEILTQLQSFIDRGYRFSRDGL------------------------------------LQLPYRS

Query:  CVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAV
        CV A   Y  AA Q D+L   Y WCK  GV    E+P+V RI    ++ LE  V    +  +S        +  + +PP      +N++ A+
Subjt:  CVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAV

Q9LVD8 Putative clathrin assembly protein At5g572001.1e-0724.02Show/hide
Query:  FRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFA-FSPPSFRAFSL-SFSRRLLQS-----------------VPENS
        FR+   A+K+  +VG AKV +   F D+D+ ++KAT   +SP  E++V+++    +   P +  A+ + + S+RL ++                    + 
Subjt:  FRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFA-FSPPSFRAFSL-SFSRRLLQS-----------------VPENS

Query:  AFRFELLRSRANGWMSLHRCHI------RDDE-----DFASFIRSYARLLDEALNC--DLFYNAKP---PDDSGDEGIETTSRRINEIRRVIEILTQLQS
         FR ELL          HR HI      +DD      D ++++R+YA  L+E L C   L Y+ +    P  SG    +T   R+     ++E L  LQ 
Subjt:  AFRFELLRSRANGWMSLHRCHI------RDDE-----DFASFIRSYARLLDEALNC--DLFYNAKP---PDDSGDEGIETTSRRINEIRRVIEILTQLQS

Query:  FIDR-------GYRFS----------------------RDGLL-------QLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPE
         + R       G  +S                       DG++       ++     V A+ IYK+A  QA+ L+E Y +CK + +   ++FP + + P 
Subjt:  FIDR-------GYRFS----------------------RDGLL-------QLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPE

Query:  ARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEE
        + +  +E  +    +  +S S          E      +E   +  A + N   +T+      ND P  E  +E+P  E+E E
Subjt:  ARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEE

Q9SA65 Putative clathrin assembly protein At1g030504.9e-0828.86Show/hide
Query:  RFRRLLTAVKENCSVGYAKV-VTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQSV-----------------PENS
        +F+R + AVK+  SVG AKV       S++D+ ++KAT   + P  EKY++E+L + ++S     A   + SRRL ++                    + 
Subjt:  RFRRLLTAVKENCSVGYAKV-VTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQSV-----------------PENS

Query:  AFRFELLRSRANGWMSLHRCHIRD-----DEDFASFIRSYARLLDEALN
        A+  E+  +   G   L+    RD       D+++F+R+YA  LDE L+
Subjt:  AFRFELLRSRANGWMSLHRCHIRD-----DEDFASFIRSYARLLDEALN

Arabidopsis top hitse value%identityAlignment
AT1G03050.1 ENTH/ANTH/VHS superfamily protein3.5e-0928.86Show/hide
Query:  RFRRLLTAVKENCSVGYAKV-VTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQSV-----------------PENS
        +F+R + AVK+  SVG AKV       S++D+ ++KAT   + P  EKY++E+L + ++S     A   + SRRL ++                    + 
Subjt:  RFRRLLTAVKENCSVGYAKV-VTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQSV-----------------PENS

Query:  AFRFELLRSRANGWMSLHRCHIRD-----DEDFASFIRSYARLLDEALN
        A+  E+  +   G   L+    RD       D+++F+R+YA  LDE L+
Subjt:  AFRFELLRSRANGWMSLHRCHIRD-----DEDFASFIRSYARLLDEALN

AT2G25430.1 epsin N-terminal homology (ENTH) domain-containing protein / clathrin assembly protein-related1.3e-1121.88Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VPE
        M    R+ + AVK+  S+G AKV +     D+++ ++KAT+ +D P  EKY++E+L + + S     A   S SRRL ++                    
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VPE

Query:  NSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDLF------------------------------YNAKPP---------
        +  F+ E+L S   G   L+    RD+      D ++F+R+YA  LD+ L   LF                              + + PP         
Subjt:  NSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDLF------------------------------YNAKPP---------

Query:  --------DDSGDEGIETTSRRINEI--------------------------RRVIEILTQLQSFIDRGYRFSRDGL-----------------------
                D++G  G+   SR   ++                           R+   +  LQ  +DR       GL                       
Subjt:  --------DDSGDEGIETTSRRINEI--------------------------RRVIEILTQLQSFIDRGYRFSRDGL-----------------------

Query:  -------------LQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPS
                       + Y  CV A   Y  AA Q D+L   Y WCK  GV    E+P+V RI    ++ LE  V       + +    +      E+PP 
Subjt:  -------------LQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPS

Query:  ANDE-----PVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKP-----LIELEEE
          +E      +N++ A+          E +     PE E + EKP     L+ L E+
Subjt:  ANDE-----PVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKP-----LIELEEE

AT4G02650.1 ENTH/ANTH/VHS superfamily protein1.7e-1121.07Show/hide
Query:  RFRRLLTAVKENCSVGYAKVVTVGG----FSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VP
        + +R + AVK+  SVG AK   VGG     +++++ V+KAT  +D P  +KY++E+L + ++S     A   + SRRL ++                   
Subjt:  RFRRLLTAVKENCSVGYAKVVTVGG----FSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VP

Query:  ENSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDL---------FYNAKPPDDSGDE-------------GIETTSRRIN
         + A+  E+  +   G   L+    RD       D+++F+R+YA  LDE L+  +                 DSG+E              I   S+ + 
Subjt:  ENSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDL---------FYNAKPPDDSGDE-------------GIETTSRRIN

Query:  EIR--RVIEILTQLQSFIDRGYRFSRDG------------------------------------LLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLI
        E++  ++   +  LQ  +DR       G                                     ++L     +    I+ + + Q D+L   YGWCK +
Subjt:  EIR--RVIEILTQLQSFIDRGYRFSRDG------------------------------------LLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLI

Query:  GVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEEN
         V    E+P++++I + ++  ++       +     S     T+ SS    + ++E  +K   +Q N       +     +  E+E  +EK   + + E 
Subjt:  GVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEEN

Query:  QDSRWE---DLLE
          SR +   DLL+
Subjt:  QDSRWE---DLLE

AT4G32285.1 ENTH/ANTH/VHS superfamily protein6.3e-1121.94Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VPE
        M    R+ +  VK+  S+G AKV +     D+++ ++KAT+ +D    +KY++E+L + + S     A   S SRRL ++                    
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VPE

Query:  NSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDLF----------------------------YNAKPP-----DDSGDE
        +  F+ E+L +   G   L+    RD+      D ++F+R+YA  LD+ L   LF                            + + PP     +     
Subjt:  NSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDLF----------------------------YNAKPP-----DDSGDE

Query:  GIETTSRR---INEI-----------------RRVIEILTQLQSFIDRGYRFSRDGL------------------------------------LQLPYRS
        G+   SR    +NEI                  R+   +  LQ  +DR       GL                                      + Y  
Subjt:  GIETTSRR---INEI-----------------RRVIEILTQLQSFIDRGYRFSRDGL------------------------------------LQLPYRS

Query:  CVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAV
        CV A   Y  AA Q D+L   Y WCK  GV    E+P+V RI    ++ LE  V    +  +S        +  + +PP      +N++ A+
Subjt:  CVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAV

AT4G32285.2 ENTH/ANTH/VHS superfamily protein6.3e-1121.94Show/hide
Query:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VPE
        M    R+ +  VK+  S+G AKV +     D+++ ++KAT+ +D    +KY++E+L + + S     A   S SRRL ++                    
Subjt:  MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQS-----------------VPE

Query:  NSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDLF----------------------------YNAKPP-----DDSGDE
        +  F+ E+L +   G   L+    RD+      D ++F+R+YA  LD+ L   LF                            + + PP     +     
Subjt:  NSAFRFELLRSRANGWMSLHRCHIRDDE-----DFASFIRSYARLLDEALNCDLF----------------------------YNAKPP-----DDSGDE

Query:  GIETTSRR---INEI-----------------RRVIEILTQLQSFIDRGYRFSRDGL------------------------------------LQLPYRS
        G+   SR    +NEI                  R+   +  LQ  +DR       GL                                      + Y  
Subjt:  GIETTSRR---INEI-----------------RRVIEILTQLQSFIDRGYRFSRDGL------------------------------------LQLPYRS

Query:  CVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAV
        CV A   Y  AA Q D+L   Y WCK  GV    E+P+V RI    ++ LE  V    +  +S        +  + +PP      +N++ A+
Subjt:  CVAAIGIYKKAAVQADQLSELYGWCKLIGVCGLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGCAGAGGAGATTCCGGCGACTTCTCACCGCCGTCAAAGAGAATTGTTCCGTCGGCTACGCCAAAGTCGTCACCGTCGGAGGATTTTCCGACGTCGATCTCATAGTCAT
CAAAGCTACTGCTCCCAATGACTCACCGTTGCCGGAGAAGTACGTTCAGGAGCTCCTCAAGATCTTCGCCTTCTCTCCGCCGTCGTTTCGAGCCTTTTCGCTCAGCTTTT
CCCGCCGATTGCTCCAATCAGTCCCCGAGAACAGTGCATTTCGATTTGAGCTTCTTCGCAGCCGAGCCAATGGCTGGATGTCTCTCCATCGGTGCCACATCCGAGACGAC
GAAGATTTTGCTTCTTTTATCAGATCCTACGCTCGGTTGCTTGATGAAGCTCTGAATTGTGATTTGTTCTATAACGCCAAACCACCGGACGATTCTGGAGACGAAGGGAT
CGAAACAACATCGAGAAGAATCAACGAAATTAGGAGAGTGATTGAAATTTTGACACAGCTACAGAGCTTCATTGACAGAGGATATCGTTTCAGTCGAGACGGCCTTCTTC
AACTGCCGTACCGGAGTTGCGTTGCAGCAATTGGAATATACAAGAAAGCCGCCGTTCAAGCAGATCAACTGTCGGAGCTCTACGGTTGGTGCAAGCTAATCGGAGTTTGC
GGGCTGTATGAATTCCCCGACGTCGATCGCATACCGGAGGCACGGATCCAAGCTCTAGAAGCATCCGTCAGCAAAATGTGGCAGCTGACGGAATCATCTTCGACCTCCAC
GACATCAACATCATCGTCATCGGAGTCTCCGCCGTCGGCGAACGATGAGCCTGTGAATAAAGTTGCAGCTGTGCAAACGAACGTAGTTGTCAGCACGCAGTGGGAAACTT
TTGAGGGCAATGATTTCCCAGAGAAGGAGAGGAGAAAGGAGAAGCCATTGATTGAGCTAGAAGAAGAGAACCAAGACAGTAGGTGGGAGGATTTGCTTGAAGCTTCTGCT
AGCTTTACATGGGATCCAATGAGCTGTGAATTGAAATCTTATGATTGGGACCAAAATAAAGAAGGATCAGGCACTGAGAAAATGCAACTGTACAATCCCACTCCTCTTAA
CCCATTTCGCCATGGTTGTTTCTTTCCAACACTTCAATAA
mRNA sequenceShow/hide mRNA sequence
ATGCAGAGGAGATTCCGGCGACTTCTCACCGCCGTCAAAGAGAATTGTTCCGTCGGCTACGCCAAAGTCGTCACCGTCGGAGGATTTTCCGACGTCGATCTCATAGTCAT
CAAAGCTACTGCTCCCAATGACTCACCGTTGCCGGAGAAGTACGTTCAGGAGCTCCTCAAGATCTTCGCCTTCTCTCCGCCGTCGTTTCGAGCCTTTTCGCTCAGCTTTT
CCCGCCGATTGCTCCAATCAGTCCCCGAGAACAGTGCATTTCGATTTGAGCTTCTTCGCAGCCGAGCCAATGGCTGGATGTCTCTCCATCGGTGCCACATCCGAGACGAC
GAAGATTTTGCTTCTTTTATCAGATCCTACGCTCGGTTGCTTGATGAAGCTCTGAATTGTGATTTGTTCTATAACGCCAAACCACCGGACGATTCTGGAGACGAAGGGAT
CGAAACAACATCGAGAAGAATCAACGAAATTAGGAGAGTGATTGAAATTTTGACACAGCTACAGAGCTTCATTGACAGAGGATATCGTTTCAGTCGAGACGGCCTTCTTC
AACTGCCGTACCGGAGTTGCGTTGCAGCAATTGGAATATACAAGAAAGCCGCCGTTCAAGCAGATCAACTGTCGGAGCTCTACGGTTGGTGCAAGCTAATCGGAGTTTGC
GGGCTGTATGAATTCCCCGACGTCGATCGCATACCGGAGGCACGGATCCAAGCTCTAGAAGCATCCGTCAGCAAAATGTGGCAGCTGACGGAATCATCTTCGACCTCCAC
GACATCAACATCATCGTCATCGGAGTCTCCGCCGTCGGCGAACGATGAGCCTGTGAATAAAGTTGCAGCTGTGCAAACGAACGTAGTTGTCAGCACGCAGTGGGAAACTT
TTGAGGGCAATGATTTCCCAGAGAAGGAGAGGAGAAAGGAGAAGCCATTGATTGAGCTAGAAGAAGAGAACCAAGACAGTAGGTGGGAGGATTTGCTTGAAGCTTCTGCT
AGCTTTACATGGGATCCAATGAGCTGTGAATTGAAATCTTATGATTGGGACCAAAATAAAGAAGGATCAGGCACTGAGAAAATGCAACTGTACAATCCCACTCCTCTTAA
CCCATTTCGCCATGGTTGTTTCTTTCCAACACTTCAATAA
Protein sequenceShow/hide protein sequence
MQRRFRRLLTAVKENCSVGYAKVVTVGGFSDVDLIVIKATAPNDSPLPEKYVQELLKIFAFSPPSFRAFSLSFSRRLLQSVPENSAFRFELLRSRANGWMSLHRCHIRDD
EDFASFIRSYARLLDEALNCDLFYNAKPPDDSGDEGIETTSRRINEIRRVIEILTQLQSFIDRGYRFSRDGLLQLPYRSCVAAIGIYKKAAVQADQLSELYGWCKLIGVC
GLYEFPDVDRIPEARIQALEASVSKMWQLTESSSTSTTSTSSSSESPPSANDEPVNKVAAVQTNVVVSTQWETFEGNDFPEKERRKEKPLIELEEENQDSRWEDLLEASA
SFTWDPMSCELKSYDWDQNKEGSGTEKMQLYNPTPLNPFRHGCFFPTLQ