CuGenDBv2

Carg18313 (gene) of Silver-seed gourd (SMH-JMG-627) v2 genome

Gene ID: Carg18313
Organism: Cucurbita argyrosperma subsp. argyrosperma cv. SMH-JMG-627 (Silver-seed gourd (SMH-JMG-627) v2)
Description: ENTH domain-containing protein
Genome location: Carg_Chr04:1115143..1116086
RNA-Seq Expression: Carg18313
Synteny: Carg18313
Gene Ontology terms:
GO:0006897 - endocytosis (biological process)
GO:0005768 - endosome (cellular component)
GO:0005794 - Golgi apparatus (cellular component)
GO:0005886 - plasma membrane (cellular component)
GO:0030125 - clathrin vesicle coat (cellular component)
GO:0005543 - phospholipid binding (molecular function)
GO:0030276 - clathrin binding (molecular function)
InterPro domains:
IPR008942 - ENTH/VHS
IPR013809 - ENTH domain


Homology
GenBank top hits (e value, % identity, alignment)
KAG6600056.1 hypothetical protein SDJN03_05289, partial [Cucurbita argyrosperma subsp. sororia] (e value: 1.2e-123; identity: 97.02%)
Query:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL
        MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL
Subjt:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL

Query:  THGPYSFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSHESHFDRFNRKQHEYEYNGTRNLS
        THGPYSFAKEFANDRAVLREMEGFHFVDDKG      VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSHESHFDRFNRKQHEYEYNGTR LS
Subjt:  THGPYSFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSHESHFDRFNRKQHEYEYNGTRNLS

Query:  LIETRQGSGDWKGVGKWEEAADWEEVSQSLISSVL
        LIETRQGSGDWKGVGKWEEAADWEEVSQSLISSVL
Subjt:  LIETRQGSGDWKGVGKWEEAADWEEVSQSLISSVL

KAG7030727.1 ENT3-4, partial [Cucurbita argyrosperma subsp. argyrosperma] (e value: 2.6e-126; identity: 100%)
Query:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL
        MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL
Subjt:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL

Query:  THGPYSFAKEFANDRAVLREMEGFHFVDDKGVRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSHESHFDRFNRKQHEYEYNGTRNLSLIETRQ
        THGPYSFAKEFANDRAVLREMEGFHFVDDKGVRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSHESHFDRFNRKQHEYEYNGTRNLSLIETRQ
Subjt:  THGPYSFAKEFANDRAVLREMEGFHFVDDKGVRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSHESHFDRFNRKQHEYEYNGTRNLSLIETRQ

Query:  GSGDWKGVGKWEEAADWEEVSQSLISSVL
        GSGDWKGVGKWEEAADWEEVSQSLISSVL
Subjt:  GSGDWKGVGKWEEAADWEEVSQSLISSVL

XP_004147716.1 epsin-2 [Cucumis sativus] (e value: 2.0e-62; identity: 64.73%)
Query:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL
        ME +YF ELKK+A  F K ++K+ARLALTDVT AQLLTEEATSGNP PPDSP+MREITKA+FEVD+F+RIVEILHKRLE+F+ ++WR SYNA+IL+EH L
Subjt:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL

Query:  THGPYSFAKEFA--NDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH----ESHFDRFNRKQHEYEYN
        THGP SF +EF   N++ VL EM+GFHFVD KG      VRKLS RVLKLLE+E+FL QERIKARNLTRGI GFG+ +     ES   RFN   H    +
Subjt:  THGPYSFAKEFA--NDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH----ESHFDRFNRKQHEYEYN

Query:  GTRNLSL
          R ++L
Subjt:  GTRNLSL

XP_008461662.1 PREDICTED: epsin-2-like [Cucumis melo] (e value: 6.9e-63; identity: 60.75%)
Query:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL
        ME +YF ELKK+A FF K ++K+ARLALTDVT AQLLTEEATSGNPWPPDSP+MREIT+A+FEVD+F+RIVEILHKRLE+F+ ++WR SYNA+IL+EH L
Subjt:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL

Query:  THGPYSFAKEFAND--RAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH------ESHFDRFNRKQHEYE
        THGP SF +EF +D  + VL EM+GFHFVD KG      VRKLS RV+KLLE+++FL QERIKARNL RGI GFG+ +       +S   R N  +H + 
Subjt:  THGPYSFAKEFAND--RAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH------ESHFDRFNRKQHEYE

Query:  YNGTRNLSLIETRQ
         +  R +++ E  +
Subjt:  YNGTRNLSLIETRQ

XP_023542259.1 epsin-3-like [Cucurbita pepo subsp. pepo] (e value: 1.5e-113; identity: 90.12%)
Query:  MEHSYF-HELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHA
        ME SYF HELKKKACFFLKEH+KI RLALTDVTHAQLLTEEA SGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAR+WRASYNAVILVEHA
Subjt:  MEHSYF-HELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHA

Query:  LTHGPYSFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH-------ESHFDRFNRKQHEYE
        LTHGPYSFAKEFANDRAVLREMEGFHFVDDKG      VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFS        ESH +RFNRKQHE E
Subjt:  LTHGPYSFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH-------ESHFDRFNRKQHEYE

Query:  YNGTRNLSLIETRQGSGDWKGVGKWEEAADWEEVSQSLISSVL
        YNGTRNLSLIETRQGSGDWKGV KWEEAADWEEVSQSLISSVL
Subjt:  YNGTRNLSLIETRQGSGDWKGVGKWEEAADWEEVSQSLISSVL

TrEMBL top hits (e value, % identity, alignment)
A0A0A0KNV8 ENTH domain-containing protein (e value: 9.7e-63; identity: 64.73%)
Query:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL
        ME +YF ELKK+A  F K ++K+ARLALTDVT AQLLTEEATSGNP PPDSP+MREITKA+FEVD+F+RIVEILHKRLE+F+ ++WR SYNA+IL+EH L
Subjt:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL

Query:  THGPYSFAKEFA--NDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH----ESHFDRFNRKQHEYEYN
        THGP SF +EF   N++ VL EM+GFHFVD KG      VRKLS RVLKLLE+E+FL QERIKARNLTRGI GFG+ +     ES   RFN   H    +
Subjt:  THGPYSFAKEFA--NDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH----ESHFDRFNRKQHEYEYN

Query:  GTRNLSL
          R ++L
Subjt:  GTRNLSL

A0A1S3CFQ3 epsin-2-like (e value: 3.3e-63; identity: 60.75%)
Query:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL
        ME +YF ELKK+A FF K ++K+ARLALTDVT AQLLTEEATSGNPWPPDSP+MREIT+A+FEVD+F+RIVEILHKRLE+F+ ++WR SYNA+IL+EH L
Subjt:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL

Query:  THGPYSFAKEFAND--RAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH------ESHFDRFNRKQHEYE
        THGP SF +EF +D  + VL EM+GFHFVD KG      VRKLS RV+KLLE+++FL QERIKARNL RGI GFG+ +       +S   R N  +H + 
Subjt:  THGPYSFAKEFAND--RAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH------ESHFDRFNRKQHEYE

Query:  YNGTRNLSLIETRQ
         +  R +++ E  +
Subjt:  YNGTRNLSLIETRQ

A0A2N9ES11 ENTH domain-containing protein (e value: 2.4e-53; identity: 61.45%)
Query:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL
        M  S FHELKK+A FF KE +K ARLALTDVT AQL+TEEATSGNPW PD+P++  I+KA+FE++++ RIVEILHKR  +FE + WR SYN++I++EH L
Subjt:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL

Query:  THGPYSFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSHES
        THGP S A EF +D+ V+ EMEGF ++D+KG      VRK S R+LKLLE+   L +ER +AR LTR IQGFGSFSH S
Subjt:  THGPYSFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSHES

A0A6J1CBF1 epsin-3-like isoform X2 (e value: 2.0e-60; identity: 62.3%)
Query:  EHSYFHE-LKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL
        E++ FH  LKK+A FF K   K+ARLA TDVT AQLLTEEATSGNPWPPD+P+MR IT+A+FEV++F+RIVEILH RL++F+A++WRA YNA+IL+EH L
Subjt:  EHSYFHE-LKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL

Query:  THGPYSFAKEFANDRAVLREMEGFHFVDDKGVRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH-------ESHFDRFNRKQHE
        THGP SFA+EF +D  + + M+GFHF+D+KGVRKLSARV+KLLE+ +FL +ER + RNL+RGIQGFG+FSH       +SH  ++ R   E
Subjt:  THGPYSFAKEFANDRAVLREMEGFHFVDDKGVRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH-------ESHFDRFNRKQHE

A0A6J1CC13 epsin-3-like isoform X1 (e value: 1.9e-58; identity: 60.41%)
Query:  EHSYFHE-LKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL
        E++ FH  LKK+A FF K   K+ARLA TDVT AQLLTEEATSGNPWPPD+P+MR IT+A+FEV++F+RIVEILH RL++F+A++WRA YNA+IL+EH L
Subjt:  EHSYFHE-LKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL

Query:  THGPYSFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH-------ESHFDRFNRKQHE
        THGP SFA+EF +D  + + M+GFHF+D+KG      VRKLSARV+KLLE+ +FL +ER + RNL+RGIQGFG+FSH       +SH  ++ R   E
Subjt:  THGPYSFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSH-------ESHFDRFNRKQHE

SwissProt top hits (e value, % identity, alignment)
O88339 Epsin-1 (e value: 4.1e-10; identity: 29.77%)
Query:  THAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHALTHGPYSFAKEFANDRAVLREMEGFHFVDDKG
        + A++   EATS +PW P S  M EI   ++ V  F  I+ ++ KRL     + WR  Y A+ L+E+ +  G    +++   +   ++ ++ F +VD  G
Subjt:  THAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHALTHGPYSFAKEFANDRAVLREMEGFHFVDDKG

Query:  ------VRKLSARVLKLLEEEDFLIQERIKA
              VR+ + +++ LL +ED L +ER  A
Subjt:  ------VRKLSARVLKLLEEEDFLIQERIKA

P47160 Epsin-3 (e value: 1.3e-11; identity: 25.15%)
Query:  LKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHALTHGPYSFA
        L   + +  K++ + A+  + + T  +    EAT+  PW   S  M +I++ ++   E   I+ ++ +R       EWR  Y A+ L+++ + HG   F 
Subjt:  LKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHALTHGPYSFA

Query:  KEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQG
         +  N   ++R +E FH++D +G      VR     +++LL +++ +  ER KAR   +  +G
Subjt:  KEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQG

Q80VP1 Epsin-1 (e value: 4.1e-10; identity: 29.77%)
Query:  THAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHALTHGPYSFAKEFANDRAVLREMEGFHFVDDKG
        + A++   EATS +PW P S  M EI   ++ V  F  I+ ++ KRL     + WR  Y A+ L+E+ +  G    +++   +   ++ ++ F +VD  G
Subjt:  THAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHALTHGPYSFAKEFANDRAVLREMEGFHFVDDKG

Query:  ------VRKLSARVLKLLEEEDFLIQERIKA
              VR+ + +++ LL +ED L +ER  A
Subjt:  ------VRKLSARVLKLLEEEDFLIQERIKA

Q8CHU3 Epsin-2 (e value: 3.1e-10; identity: 28.15%)
Query:  LTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHALTHGPYSFAKEFANDRAVLREMEGFHFV
        + + + A++   EATS +PW P S  M EI   ++ V  F  I+ ++ KRL     + WR  Y A+ L+++ +  G    A++   +   ++ ++ F ++
Subjt:  LTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHALTHGPYSFAKEFANDRAVLREMEGFHFV

Query:  DDKG------VRKLSARVLKLLEEEDFLIQERIKA
        D  G      VR+ S +++ LL++E+ L  ER++A
Subjt:  DDKG------VRKLSARVLKLLEEEDFLIQERIKA

Q9Y6I3 Epsin-1 (e value: 4.1e-10; identity: 29.77%)
Query:  THAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHALTHGPYSFAKEFANDRAVLREMEGFHFVDDKG
        + A++   EATS +PW P S  M EI   ++ V  F  I+ ++ KRL     + WR  Y A+ L+E+ +  G    +++   +   ++ ++ F +VD  G
Subjt:  THAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHALTHGPYSFAKEFANDRAVLREMEGFHFVDDKG

Query:  ------VRKLSARVLKLLEEEDFLIQERIKA
              VR+ + +++ LL +ED L +ER  A
Subjt:  ------VRKLSARVLKLLEEEDFLIQERIKA

Arabidopsis top hits (e value, % identity, alignment)
AT1G08670.1 ENTH/VHS family protein (e value: 3.2e-34; identity: 43.01%)
Query:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL
        M++   HELKK+A FF+KE +K ARLA+TDVT  +LLTEE T  +    DS SM  IT+ SFEVD+F RIV+IL +R+  F+ +EWR   N + ++ H L
Subjt:  MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHAL

Query:  THGPYSFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRG-IQGFGSFSHESHFDRFN
         +GP S   EF ++RA++ +     ++D++G      VR ++ +VL+LLE++ FL  ER + R  + G I GFG+ S   H +  N
Subjt:  THGPYSFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRG-IQGFGSFSHESHFDRFN

AT3G23350.1 ENTH/VHS family protein (e value: 2.0e-28; identity: 37.79%)
Query:  YFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRL--ERFEAREWRASYNAVILVEHALTH
        +F + KK+A  F+++   +ARL LTDVT A+LL EE T+G+P  PD+ +M +I +ASF+  E++RIV++LH+++  +  E + WR +Y A++L+E  L H
Subjt:  YFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRL--ERFEAREWRASYNAVILVEHALTH

Query:  GPYSFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGS
        GP     +F  D    R +  F +VD+ G      V+K + ++  LL  ++ L + R+KA  +T  I GFG+
Subjt:  GPYSFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGS

AT3G46540.1 ENTH/VHS family protein (e value: 3.6e-46; identity: 54.02%)
Query:  FHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHALTHGPY
        F ELKK+A FF KE LK ARLALTDVT  QL+TEEAT G    P++ ++  I+KA+FE +++  IVE+LHKRL +F+ R WR +YN++I+VEH LTHGP 
Subjt:  FHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHALTHGPY

Query:  SFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSHES
        S + EF  D  V+ +M+ F  +D+KG      VRK + +VLKLLE+ + L +ER +AR L+RGIQGFGSF+H+S
Subjt:  SFAKEFANDRAVLREMEGFHFVDDKG------VRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSHES


Sequences
CDS sequence
ATGGAACACTCTTATTTCCATGAATTGAAGAAGAAAGCTTGTTTCTTCCTCAAAGAACATCTCAAAATCGCTCGTTTGGCTCTCACTGATGTTACTCATGCACAATTGTT
AACAGAAGAAGCAACGAGTGGGAATCCATGGCCACCGGATTCACCGAGCATGAGAGAGATAACAAAGGCAAGTTTTGAAGTTGATGAGTTTTACAGAATTGTGGAGATTC
TTCACAAGAGACTGGAGAGATTTGAGGCAAGGGAATGGAGAGCATCGTATAATGCAGTGATATTGGTAGAACATGCTCTAACACATGGCCCATACAGCTTTGCAAAGGAA
TTTGCAAATGATAGGGCTGTTTTGAGGGAGATGGAGGGCTTCCACTTTGTTGATGACAAAGGTGTGAGAAAATTAAGTGCCAGAGTTCTCAAACTTCTGGAAGAAGAAGA
TTTTCTAATACAGGAGAGAATCAAAGCTCGTAATCTTACACGTGGAATCCAAGGATTCGGAAGCTTCAGCCATGAATCACATTTCGACAGATTCAATCGGAAGCAACACG
AGTATGAGTATAATGGAACTAGAAACCTCTCCTTAATAGAGACGAGACAAGGGAGCGGGGATTGGAAAGGAGTCGGAAAATGGGAGGAGGCAGCTGATTGGGAGGAAGTT
TCCCAATCTCTAATTTCTTCTGTATTATAA
mRNA sequence
ATGGAACACTCTTATTTCCATGAATTGAAGAAGAAAGCTTGTTTCTTCCTCAAAGAACATCTCAAAATCGCTCGTTTGGCTCTCACTGATGTTACTCATGCACAATTGTT
AACAGAAGAAGCAACGAGTGGGAATCCATGGCCACCGGATTCACCGAGCATGAGAGAGATAACAAAGGCAAGTTTTGAAGTTGATGAGTTTTACAGAATTGTGGAGATTC
TTCACAAGAGACTGGAGAGATTTGAGGCAAGGGAATGGAGAGCATCGTATAATGCAGTGATATTGGTAGAACATGCTCTAACACATGGCCCATACAGCTTTGCAAAGGAA
TTTGCAAATGATAGGGCTGTTTTGAGGGAGATGGAGGGCTTCCACTTTGTTGATGACAAAGGTGTGAGAAAATTAAGTGCCAGAGTTCTCAAACTTCTGGAAGAAGAAGA
TTTTCTAATACAGGAGAGAATCAAAGCTCGTAATCTTACACGTGGAATCCAAGGATTCGGAAGCTTCAGCCATGAATCACATTTCGACAGATTCAATCGGAAGCAACACG
AGTATGAGTATAATGGAACTAGAAACCTCTCCTTAATAGAGACGAGACAAGGGAGCGGGGATTGGAAAGGAGTCGGAAAATGGGAGGAGGCAGCTGATTGGGAGGAAGTT
TCCCAATCTCTAATTTCTTCTGTATTATAA
Protein sequence
MEHSYFHELKKKACFFLKEHLKIARLALTDVTHAQLLTEEATSGNPWPPDSPSMREITKASFEVDEFYRIVEILHKRLERFEAREWRASYNAVILVEHALTHGPYSFAKE
FANDRAVLREMEGFHFVDDKGVRKLSARVLKLLEEEDFLIQERIKARNLTRGIQGFGSFSHESHFDRFNRKQHEYEYNGTRNLSLIETRQGSGDWKGVGKWEEAADWEEV
SQSLISSVL
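
The CDS and protein sequences listed above can be checked against each other by translating the CDS with the standard genetic code. The following is a minimal sketch, not part of the database record: the function name `translate` and the variables `cds_start` and `cds_end` are hypothetical, the standard nuclear code (NCBI translation table 1) is assumed, and only the first 60 and last 30 nucleotides of the CDS are used to keep the example short.

```python
# Standard genetic code (NCBI translation table 1), bases ordered T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(cds: str) -> str:
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()  # drop line breaks and whitespace
    protein = []
    for pos in range(0, len(cds) - 2, 3):
        amino_acid = CODON_TABLE.get(cds[pos:pos + 3], "X")  # X = unknown codon
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# Excerpts copied from the CDS sequence of Carg18313 (assumed to be in frame).
cds_start = "ATGGAACACTCTTATTTCCATGAATTGAAGAAGAAAGCTTGTTTCTTCCTCAAAGAACAT"
cds_end = "TCCCAATCTCTAATTTCTTCTGTATTATAA"

print(translate(cds_start))  # compare with the start of the protein sequence
print(translate(cds_end))    # compare with the end of the protein sequence
```

Passing the full CDS to `translate` in the same way should reproduce the complete protein sequence, provided the annotation uses the standard code and the CDS starts in frame.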