; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr017844 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr017844
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Descriptionzinc_ribbon_10 domain-containing protein
Genome locationtig00153056:628433..632614
RNA-Seq ExpressionSgr017844
SyntenySgr017844
Gene Ontology termsGO:0071786 - endoplasmic reticulum tubular network organization (biological process)
GO:0016021 - integral component of membrane (cellular component)
GO:0071782 - endoplasmic reticulum tubular network (cellular component)
InterPro domainsIPR019273 - Lunapark domain
IPR040115 - Lunapark family


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022135651.1 uncharacterized protein At2g24330 isoform X1 [Momordica charantia]6.3e-19285.85Show/hide
Query:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM
        MAEEKA GEGEKKDT ADVAGKKKSRG  SRLW+ IF VRGDDFEKRLQHISKEEAAV+AR+KRRSLTWRRMARNLIV SVIFEV+AVCYAIITTR  D+
Subjt:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM

Query:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG
        NW+MRAFRVLP+FLLPAL+ LAY+TFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSG+KVHLG
Subjt:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG

Query:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY
        DE N+NVPSGKSNDVE+VQ   GLRNRKQG  RS SAG ASLHHP+E+T  P VSE PH  E+NQLVV+HYNPQGPAVNDGGWLARIAALLVGEDPTQSY
Subjt:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY

Query:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE
        ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEE  SSHGT G  S RAG+SSDP+NTAGS IDSVLTSNV AEAISE+QDTV+  E+ENLV E
Subjt:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE

Query:  GEKVRTAETT
        GEKV TA TT
Subjt:  GEKVRTAETT

XP_022135665.1 uncharacterized protein At2g24330 isoform X2 [Momordica charantia]2.6e-19085.89Show/hide
Query:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM
        MAEEKA GEGEKKDT ADVAGKKKSRG  SRLW+ IF VRGDDFEKRLQHISKEEAAV+AR+KRRSLTWRRMARNLIV SVIFEV+AVCYAIITTR  D+
Subjt:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM

Query:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG
        NW+MRAFRVLP+FLLPAL+ LAY+TFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSG+KVHLG
Subjt:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG

Query:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY
        DE N+NVPSGKSNDVE+VQ   GLRNRKQG  RS SAG ASLHHP+E+T  P VSE PH  E+NQLVV+HYNPQGPAVNDGGWLARIAALLVGEDPTQSY
Subjt:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY

Query:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE
        ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEE  SSHGT G  S RAG+SSDP+NTAGS IDSVLTSNV AEAISE+QDTV+  E+ENLV E
Subjt:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE

Query:  GEKV
        GEK+
Subjt:  GEKV

XP_023524231.1 uncharacterized protein At2g24330-like [Cucurbita pepo subsp. pepo]3.8e-18985.44Show/hide
Query:  MAEEK-ATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGED
        MAEEK A GEGEKKDT+ADVAGKKKS+GIFSRLWN IFRVRGDDFEKRLQHISKEEAAVLAR+KRRSLTWRRMARNLI+FSVIFE+IA+CYAIITTR  D
Subjt:  MAEEK-ATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGED

Query:  MNWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHL
        MNW+MRAFRVLP+FLLPALSTLAY+TFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHL
Subjt:  MNWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHL

Query:  GDEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQS
        GDE+NLNVPSGKSNDVEFV   GGLRNRKQGH+RS SAGS SLHH DED  R AV EG   SE NQLVV+HYNPQGPAVNDGGWLARIAALLVGEDP QS
Subjt:  GDEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQS

Query:  YALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLV-
        YALICGNCHMHNGLVK+EDFPFITYYCPHCHALNRSNQ +E VS +G+ G+ SVRA N+SD  NT  S ID VL SN+H E ISE QDTVEE ESEN+V 
Subjt:  YALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLV-

Query:  REGEKVRTAETT
         E EKVRTAETT
Subjt:  REGEKVRTAETT

XP_038898924.1 uncharacterized protein At2g24330 isoform X1 [Benincasa hispida]7.6e-19086.35Show/hide
Query:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM
        MAEEKA GEGEKKDTVADV GKKKSRG FSRLWN IFRVRGDDFEKRLQHISKEEAAVLAR+KRRSLTWRRMARNLI+FSV+FE+IAVCYAIITTR  DM
Subjt:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM

Query:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG
        NW+MRAFRVLP+FLLPALSTLAY+TFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKV LG
Subjt:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG

Query:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY
        DE+N NVPSGKSNDVEFVQA GGLRNRKQGH+RS SAGSASLHHPDED  R AVSEGPH SEHNQLVV+HYNPQGPAVNDGGWLARIAALLVGEDPTQSY
Subjt:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY

Query:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE
        ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEE VS HG+  +GS+RA ++SDP          VL++NV  E ISEIQ+TV++ ESENLV E
Subjt:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE

Query:  GEK
         EK
Subjt:  GEK

XP_038898925.1 uncharacterized protein At2g24330 isoform X2 [Benincasa hispida]1.3e-19286.34Show/hide
Query:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM
        MAEEKA GEGEKKDTVADV GKKKSRG FSRLWN IFRVRGDDFEKRLQHISKEEAAVLAR+KRRSLTWRRMARNLI+FSV+FE+IAVCYAIITTR  DM
Subjt:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM

Query:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG
        NW+MRAFRVLP+FLLPALSTLAY+TFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKV LG
Subjt:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG

Query:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY
        DE+N NVPSGKSNDVEFVQA GGLRNRKQGH+RS SAGSASLHHPDED  R AVSEGPH SEHNQLVV+HYNPQGPAVNDGGWLARIAALLVGEDPTQSY
Subjt:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY

Query:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE
        ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEE VS HG+  +GS+RA ++SDP          VL++NV  E ISEIQ+TV++ ESENLV E
Subjt:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE

Query:  GEKVRTAETT
         EKVRTAE T
Subjt:  GEKVRTAETT

TrEMBL top hitse value%identityAlignment
A0A1S3CQZ7 uncharacterized protein At2g243302.4e-18984.39Show/hide
Query:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM
        MAEEKA GEGEKKD VADV GKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLAR+KRRSLTWRRMARNLI+FSV+FE++AVCYAIITTR  D+
Subjt:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM

Query:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG
        NW+MRAFRVLP+FLLPALSTLAY+TFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASK+GADSGLKV LG
Subjt:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG

Query:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY
        DE+N NVPSGKSNDVE VQA GGLRNRKQGHTRS S GSASLHH DED  RPAVS+ P+ SEHNQLVV HYNPQGPAVNDGGWLARIAALLVGEDPTQSY
Subjt:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY

Query:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE
        ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEE +S HG+  AGS++  ++SDP          +LTSNV AE ISEIQ+ +E+ ES NLV E
Subjt:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE

Query:  GEKVRTAETT
         EKV TAETT
Subjt:  GEKVRTAETT

A0A5D3E5H7 zinc_ribbon_10 domain-containing protein2.4e-18984.39Show/hide
Query:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM
        MAEEKA GEGEKKD VADV GKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLAR+KRRSLTWRRMARNLI+FSV+FE++AVCYAIITTR  D+
Subjt:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM

Query:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG
        NW+MRAFRVLP+FLLPALSTLAY+TFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASK+GADSGLKV LG
Subjt:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG

Query:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY
        DE+N NVPSGKSNDVE VQA GGLRNRKQGHTRS S GSASLHH DED  RPAVS+ P+ SEHNQLVV HYNPQGPAVNDGGWLARIAALLVGEDPTQSY
Subjt:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY

Query:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE
        ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEE +S HG+  AGS++  ++SDP          +LTSNV AE ISEIQ+ +E+ ES NLV E
Subjt:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE

Query:  GEKVRTAETT
         EKV TAETT
Subjt:  GEKVRTAETT

A0A6J1C1N4 uncharacterized protein At2g24330 isoform X21.3e-19085.89Show/hide
Query:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM
        MAEEKA GEGEKKDT ADVAGKKKSRG  SRLW+ IF VRGDDFEKRLQHISKEEAAV+AR+KRRSLTWRRMARNLIV SVIFEV+AVCYAIITTR  D+
Subjt:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM

Query:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG
        NW+MRAFRVLP+FLLPAL+ LAY+TFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSG+KVHLG
Subjt:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG

Query:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY
        DE N+NVPSGKSNDVE+VQ   GLRNRKQG  RS SAG ASLHHP+E+T  P VSE PH  E+NQLVV+HYNPQGPAVNDGGWLARIAALLVGEDPTQSY
Subjt:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY

Query:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE
        ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEE  SSHGT G  S RAG+SSDP+NTAGS IDSVLTSNV AEAISE+QDTV+  E+ENLV E
Subjt:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE

Query:  GEKV
        GEK+
Subjt:  GEKV

A0A6J1C231 uncharacterized protein At2g24330 isoform X13.0e-19285.85Show/hide
Query:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM
        MAEEKA GEGEKKDT ADVAGKKKSRG  SRLW+ IF VRGDDFEKRLQHISKEEAAV+AR+KRRSLTWRRMARNLIV SVIFEV+AVCYAIITTR  D+
Subjt:  MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDM

Query:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG
        NW+MRAFRVLP+FLLPAL+ LAY+TFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSG+KVHLG
Subjt:  NWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLG

Query:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY
        DE N+NVPSGKSNDVE+VQ   GLRNRKQG  RS SAG ASLHHP+E+T  P VSE PH  E+NQLVV+HYNPQGPAVNDGGWLARIAALLVGEDPTQSY
Subjt:  DEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSY

Query:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE
        ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEE  SSHGT G  S RAG+SSDP+NTAGS IDSVLTSNV AEAISE+QDTV+  E+ENLV E
Subjt:  ALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVRE

Query:  GEKVRTAETT
        GEKV TA TT
Subjt:  GEKVRTAETT

A0A6J1FRL3 uncharacterized protein At2g24330-like2.2e-18784.95Show/hide
Query:  MAEEK-ATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGED
        MAEEK A GEGEKKDTVADVAGKKKS+GIFSRLWN IFRVRGDDFEKRLQHISKEEAAVLAR+KRRSLTWRRMARNLI+FSVIFE+IAVCYAIITTR  D
Subjt:  MAEEK-ATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGED

Query:  MNWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHL
        MNW+MRAFRVLP+FLLPALSTLAY+TFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHL
Subjt:  MNWRMRAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHL

Query:  GDEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQS
        GDE+NLNVPSGKSNDVEFV   GGLRNRKQGH+RS SAGS SLHH DED  R AV EG H SE NQLVV+HYNPQGPAVNDGGWLARIAALLVGEDP QS
Subjt:  GDEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQS

Query:  YALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLV-
        YALICGNCHMHNGLVK+EDFPFITYYCPHCHALNRSNQ +E VS +G+ G+ SVR  N+SD  NT  S ID VL SN H E ISE QD VEE E E++V 
Subjt:  YALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELVSSHGT-GAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLV-

Query:  REGEKVRTAETT
         E EKVRTA+TT
Subjt:  REGEKVRTAETT

SwissProt top hitse value%identityAlignment
Q1KKR9 Endoplasmic reticulum junction formation protein lunapark-B1.0e-1126.35Show/hide
Query:  KRLQHISKE-EAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRMRAFRVLPIFLLPALSTL--AYSTFLSFTRMCDRKDQKTLE
        ++L++I KE +     R K + L    + R L+  S ++ +I++   ++    +   W +R    LP F+ P L      +  FL F++  +R + K LE
Subjt:  KRLQHISKE-EAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRMRAFRVLPIFLLPALSTL--AYSTFLSFTRMCDRKDQKTLE

Query:  RLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAK-AAAATVLASKLGADSGLKVHLGDEANLNVPSGK---------------SNDVEFV--QASGGL
         L+A ++  ++E+ E   Y   + +++R+DPD   K    AT +  ++   +G ++     A  ++P G                   VE V   A GG 
Subjt:  RLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAK-AAAATVLASKLGADSGLKVHLGDEANLNVPSGK---------------SNDVEFV--QASGGL

Query:  RNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAV------NDGGWLARIAALLVGEDPTQSYALICGNCHMHNGLVKKED
                RSG A S  +      T R   S  P    H         P GP +       D G + R+   LVG+ P   YALIC  C  HNG+  KE+
Subjt:  RNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAV------NDGGWLARIAALLVGEDPTQSYALICGNCHMHNGLVKKED

Query:  FPFITYYCPHCHALN
        F ++ + C +C+ LN
Subjt:  FPFITYYCPHCHALN

Q5R891 Endoplasmic reticulum junction formation protein lunapark3.5e-1225.84Show/hide
Query:  GIFSRLWNSIFRVRGDDFEKRLQHISKE-EAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRMRAFRVLPIFLLPALSTLAYST
        G+FSR    +  V      + L+ I KE +A    R K + L   R+ R ++  SV++  +  C  +      D  +  R    LP F  P +     + 
Subjt:  GIFSRLWNSIFRVRGDDFEKRLQHISKE-EAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRMRAFRVLPIFLLPALSTLAYST

Query:  FLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPD-PAAKAAAATVLASKLGADSGLKVHLGDEANLNV-PSGKSNDVEFVQASGG
         + F      ++ + L+ L+++R+  ++E+ EK  Y   + +++R+DPD   AK          + A  G ++     A  N+ P+  S +    Q    
Subjt:  FLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPD-PAAKAAAATVLASKLGADSGLKVHLGDEANLNV-PSGKSNDVEFVQASGG

Query:  LRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSE--------GPHTSEHNQLVVNHYNPQGPAV------NDGGWLARIAALLVGEDPTQSYALICGNCHM
              G  +  SA       P E T  PA+S          P TS          +P GP +       + G L RI   LVG+ P   YALIC  C  
Subjt:  LRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSE--------GPHTSEHNQLVVNHYNPQGPAV------NDGGWLARIAALLVGEDPTQSYALICGNCHM

Query:  HNGLVKKEDFPFITYYCPHCHALNRSNQS
        HNG+  KE+F +I + C +C  LN + ++
Subjt:  HNGLVKKEDFPFITYYCPHCHALNRSNQS

Q6PFM4 Endoplasmic reticulum junction formation protein lunapark-B2.7e-1224.92Show/hide
Query:  RMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRMRAFRVLPIFLLPALSTLAYSTFL-SFTRMCDRKDQKTLERLRAERQAKIDELKEKT
        R K + L    + R L   S ++ +  +C   +        W  R    LP+   PAL  L     +  F++  +R + K LE L+ +++  ++E+ E  
Subjt:  RMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRMRAFRVLPIFLLPALSTLAYSTFL-SFTRMCDRKDQKTLERLRAERQAKIDELKEKT

Query:  NYYITQQLIQRYDPDPAAKA-AAATVLASKLGADSGLKVHLGDEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGP
         Y   + +++R+DP+   KA A AT +   +    G ++     A +  P      +     +  LR    G    G AGSAS          PA +   
Subjt:  NYYITQQLIQRYDPDPAAKA-AAATVLASKLGADSGLKVHLGDEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGP

Query:  HTSEHNQLVVNHYNPQGPAVNDG-----------------GWLARIAALLVGEDPTQSYALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQS
         T +  Q++    NP  P    G                 G + R+   LVG+ P   YALIC  C  HNG+  KE+F F+ + C +C+ +N + ++
Subjt:  HTSEHNQLVVNHYNPQGPAVNDG-----------------GWLARIAALLVGEDPTQSYALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQS

Q7TQ95 Endoplasmic reticulum junction formation protein lunapark3.9e-1123.75Show/hide
Query:  GIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRMRAFRVLPIFLLPALSTLAYSTF
        G+FSR     +R +    E  L++I KE  A+    ++     +     LI++S I  +       +    ++   R+     LP F  P +     +  
Subjt:  GIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRMRAFRVLPIFLLPALSTLAYSTF

Query:  LSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPD-PAAKAAAATVLASKLGADSGLKVHLGDEANLNVPSGKSNDVEFVQASGGLR
        + F      ++ + L+ L+++++  ++E+ EK  Y   + +++R+DPD   AK        + + A  G ++     A  N+            AS    
Subjt:  LSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPD-PAAKAAAATVLASKLGADSGLKVHLGDEANLNVPSGKSNDVEFVQASGGLR

Query:  NRKQGHTRSGSAGSASL-HHPDEDTCRPAVSE---GPHTSEHNQLVVNHYNPQGPAV------NDGGWLARIAALLVGEDPTQSYALICGNCHMHNGLVK
           QG    G A  AS    P E T  PA+      P TS          +P GP +       + G L RI   LVG+ P   YALIC  C  HNG+  
Subjt:  NRKQGHTRSGSAGSASL-HHPDEDTCRPAVSE---GPHTSEHNQLVVNHYNPQGPAV------NDGGWLARIAALLVGEDPTQSYALICGNCHMHNGLVK

Query:  KEDFPFITYYCPHCHALNRSN----QSEELVSSHGTGAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAE--SENLVREGEKVRTAETT
        KE+F +I + C +C  LN +     Q+  L         +V   +S+ P           L  +V +     I+D++EE +    +  +  +K+   E T
Subjt:  KEDFPFITYYCPHCHALNRSN----QSEELVSSHGTGAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAE--SENLVREGEKVRTAETT

Q9ZQ34 Uncharacterized protein At2g243302.4e-11764.69Show/hide
Query:  GEKKDT--VADVAGKKKS---RGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRM
        GEK D+  V   +G+KK+   +G+FSRLWN+IFRVRGDDFEKRL++ISKEEA V  RMKRRS+T R   RNLI FSV FEVIAV YAI+TTR ED++W++
Subjt:  GEKKDT--VADVAGKKKS---RGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRM

Query:  RAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLGDEAN
        R+FR+LP+FLLPA++ L YS+ + F RMCDR+DQ TLE+L+AE   KI+ELKE+TNYYITQQLIQRYDPDPAAKAAAATVLASKLGA+SGLKV +GDE+ 
Subjt:  RAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLGDEAN

Query:  LNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEG-PHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSYALI
        L   +GK+N     + SGGLRNRKQ +TR  SA +  +HH D ++     SE    T ++ Q+V  HYNPQ  A +DG W++RIAALLVGEDP+QSYALI
Subjt:  LNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEG-PHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSYALI

Query:  CGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEE
        CGNC MHNGL +KEDFP+ITYYCPHC ALN+   SEE
Subjt:  CGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEE

Arabidopsis top hitse value%identityAlignment
AT2G24330.1 Protein of unknown function (DUF2296)1.7e-11864.69Show/hide
Query:  GEKKDT--VADVAGKKKS---RGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRM
        GEK D+  V   +G+KK+   +G+FSRLWN+IFRVRGDDFEKRL++ISKEEA V  RMKRRS+T R   RNLI FSV FEVIAV YAI+TTR ED++W++
Subjt:  GEKKDT--VADVAGKKKS---RGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRM

Query:  RAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLGDEAN
        R+FR+LP+FLLPA++ L YS+ + F RMCDR+DQ TLE+L+AE   KI+ELKE+TNYYITQQLIQRYDPDPAAKAAAATVLASKLGA+SGLKV +GDE+ 
Subjt:  RAFRVLPIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLGDEAN

Query:  LNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEG-PHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSYALI
        L   +GK+N     + SGGLRNRKQ +TR  SA +  +HH D ++     SE    T ++ Q+V  HYNPQ  A +DG W++RIAALLVGEDP+QSYALI
Subjt:  LNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEG-PHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSYALI

Query:  CGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEE
        CGNC MHNGL +KEDFP+ITYYCPHC ALN+   SEE
Subjt:  CGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEE

AT4G31080.1 Protein of unknown function (DUF2296)2.0e-12770.59Show/hide
Query:  KKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRMRAFRVLPIFLLPALSTL
        KKK  G FSRLWN IFRVRGDDFEKRLQ+IS+EEA VL+RMKRRS++WR++ RNLIV SV+FE+IAV YAI+TTR ED++WRMR+FR+LP+F+LPA+S L
Subjt:  KKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRMRAFRVLPIFLLPALSTL

Query:  AYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLGDEANLNVPSGKSNDVEFVQAS
        AYS+ +SF++M DR+DQKTLE+LRAER AKI+ELKE+TNYY TQQLIQRYDPDPAAKAAAATVLASKLGADSGLKV+LGDE+ L+  SGKSND+E V  S
Subjt:  AYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLGDEANLNVPSGKSNDVEFVQAS

Query:  GGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEG-PHTSEHN-QLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSYALICGNCHMHNGLVKKEDF
         GLRNR+Q +TR   +GS S HH D+++     SE  P T+E N Q++V HY+PQG A +DG W++RIAALLVGEDPTQSYALICGNC MHNGL +KEDF
Subjt:  GGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEG-PHTSEHN-QLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSYALICGNCHMHNGLVKKEDF

Query:  PFITYYCPHCHALNRSNQSEELV
         +ITYYCPHC+ALN+   SEE V
Subjt:  PFITYYCPHCHALNRSNQSEELV

AT4G31080.2 Protein of unknown function (DUF2296)3.3e-12263.87Show/hide
Query:  KKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRMRAFRVLPIFLLPALSTL
        KKK  G FSRLWN IFRVRGDDFEKRLQ+IS+EEA VL+RMKRRS++WR++ RNLIV SV+FE+IAV YAI+TTR ED++WRMR+FR+LP+F+LPA+S L
Subjt:  KKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRMRAFRVLPIFLLPALSTL

Query:  AYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLI----------------------------------QRYDPDPAAKAAAATVLAS
        AYS+ +SF++M DR+DQKTLE+LRAER AKI+ELKE+TNYY TQQLI                                  QRYDPDPAAKAAAATVLAS
Subjt:  AYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLI----------------------------------QRYDPDPAAKAAAATVLAS

Query:  KLGADSGLKVHLGDEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEG-PHTSEHN-QLVVNHYNPQGPAVNDGGWLA
        KLGADSGLKV+LGDE+ L+  SGKSND+E V  S GLRNR+Q +TR   +GS S HH D+++     SE  P T+E N Q++V HY+PQG A +DG W++
Subjt:  KLGADSGLKVHLGDEANLNVPSGKSNDVEFVQASGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEG-PHTSEHN-QLVVNHYNPQGPAVNDGGWLA

Query:  RIAALLVGEDPTQSYALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELV
        RIAALLVGEDPTQSYALICGNC MHNGL +KEDF +ITYYCPHC+ALN+   SEE V
Subjt:  RIAALLVGEDPTQSYALICGNCHMHNGLVKKEDFPFITYYCPHCHALNRSNQSEELV


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTGAGGAAAAAGCCACTGGAGAAGGTGAAAAGAAGGATACTGTTGCTGATGTTGCCGGGAAGAAGAAATCCCGGGGGATTTTCTCGCGGCTTTGGAATAGTATCTT
CCGAGTACGTGGTGACGATTTCGAGAAGAGGCTTCAACACATTTCTAAGGAAGAAGCCGCTGTCCTTGCTAGGATGAAGCGTAGATCCCTAACCTGGCGGCGAATGGCCA
GAAATCTTATTGTATTTTCTGTCATTTTTGAGGTTATTGCAGTTTGTTATGCTATCATAACTACAAGAGGTGAAGATATGAACTGGAGGATGAGAGCTTTTAGAGTTTTA
CCGATCTTTCTTTTACCTGCTTTATCAACTCTGGCTTATTCTACATTTCTTAGCTTCACAAGAATGTGTGATCGCAAGGACCAGAAAACTCTTGAAAGGCTGCGAGCTGA
GAGGCAAGCTAAAATTGATGAACTAAAGGAGAAAACAAATTATTATATAACTCAGCAGCTCATTCAGAGATATGATCCTGACCCTGCTGCAAAGGCAGCCGCTGCAACTG
TATTGGCTTCCAAGTTGGGAGCAGATTCAGGCTTAAAGGTTCATTTGGGCGATGAAGCCAATCTTAATGTCCCATCAGGGAAGAGCAATGATGTTGAATTTGTTCAAGCT
AGTGGTGGACTTCGGAATAGAAAACAAGGTCACACCCGTTCCGGTAGCGCAGGTAGTGCTTCATTGCATCATCCTGATGAAGATACGTGTCGGCCTGCAGTATCCGAGGG
TCCTCATACTTCTGAGCACAATCAGTTGGTTGTCAATCATTACAATCCCCAAGGACCAGCTGTAAATGATGGAGGATGGTTAGCTCGAATCGCTGCACTGCTCGTTGGAG
AGGATCCAACACAGTCTTATGCACTCATCTGTGGCAACTGCCATATGCACAATGGGCTTGTCAAGAAGGAAGATTTCCCATTCATTACTTACTACTGCCCACATTGTCAT
GCCCTGAATCGGTCCAATCAATCGGAGGAACTAGTTTCTAGCCATGGTACTGGTGCTGGTTCTGTGAGAGCGGGAAACAGTAGTGACCCAGCAAATACTGCTGGATCTGC
TATTGATAGTGTACTCACCAGCAATGTCCATGCAGAAGCAATTTCTGAGATTCAGGACACCGTCGAAGAAGCGGAATCTGAAAACTTAGTTAGGGAAGGAGAAAAGGTAC
GCACGGCAGAAACGACATAA
mRNA sequenceShow/hide mRNA sequence
ATGGCTGAGGAAAAAGCCACTGGAGAAGGTGAAAAGAAGGATACTGTTGCTGATGTTGCCGGGAAGAAGAAATCCCGGGGGATTTTCTCGCGGCTTTGGAATAGTATCTT
CCGAGTACGTGGTGACGATTTCGAGAAGAGGCTTCAACACATTTCTAAGGAAGAAGCCGCTGTCCTTGCTAGGATGAAGCGTAGATCCCTAACCTGGCGGCGAATGGCCA
GAAATCTTATTGTATTTTCTGTCATTTTTGAGGTTATTGCAGTTTGTTATGCTATCATAACTACAAGAGGTGAAGATATGAACTGGAGGATGAGAGCTTTTAGAGTTTTA
CCGATCTTTCTTTTACCTGCTTTATCAACTCTGGCTTATTCTACATTTCTTAGCTTCACAAGAATGTGTGATCGCAAGGACCAGAAAACTCTTGAAAGGCTGCGAGCTGA
GAGGCAAGCTAAAATTGATGAACTAAAGGAGAAAACAAATTATTATATAACTCAGCAGCTCATTCAGAGATATGATCCTGACCCTGCTGCAAAGGCAGCCGCTGCAACTG
TATTGGCTTCCAAGTTGGGAGCAGATTCAGGCTTAAAGGTTCATTTGGGCGATGAAGCCAATCTTAATGTCCCATCAGGGAAGAGCAATGATGTTGAATTTGTTCAAGCT
AGTGGTGGACTTCGGAATAGAAAACAAGGTCACACCCGTTCCGGTAGCGCAGGTAGTGCTTCATTGCATCATCCTGATGAAGATACGTGTCGGCCTGCAGTATCCGAGGG
TCCTCATACTTCTGAGCACAATCAGTTGGTTGTCAATCATTACAATCCCCAAGGACCAGCTGTAAATGATGGAGGATGGTTAGCTCGAATCGCTGCACTGCTCGTTGGAG
AGGATCCAACACAGTCTTATGCACTCATCTGTGGCAACTGCCATATGCACAATGGGCTTGTCAAGAAGGAAGATTTCCCATTCATTACTTACTACTGCCCACATTGTCAT
GCCCTGAATCGGTCCAATCAATCGGAGGAACTAGTTTCTAGCCATGGTACTGGTGCTGGTTCTGTGAGAGCGGGAAACAGTAGTGACCCAGCAAATACTGCTGGATCTGC
TATTGATAGTGTACTCACCAGCAATGTCCATGCAGAAGCAATTTCTGAGATTCAGGACACCGTCGAAGAAGCGGAATCTGAAAACTTAGTTAGGGAAGGAGAAAAGGTAC
GCACGGCAGAAACGACATAA
Protein sequenceShow/hide protein sequence
MAEEKATGEGEKKDTVADVAGKKKSRGIFSRLWNSIFRVRGDDFEKRLQHISKEEAAVLARMKRRSLTWRRMARNLIVFSVIFEVIAVCYAIITTRGEDMNWRMRAFRVL
PIFLLPALSTLAYSTFLSFTRMCDRKDQKTLERLRAERQAKIDELKEKTNYYITQQLIQRYDPDPAAKAAAATVLASKLGADSGLKVHLGDEANLNVPSGKSNDVEFVQA
SGGLRNRKQGHTRSGSAGSASLHHPDEDTCRPAVSEGPHTSEHNQLVVNHYNPQGPAVNDGGWLARIAALLVGEDPTQSYALICGNCHMHNGLVKKEDFPFITYYCPHCH
ALNRSNQSEELVSSHGTGAGSVRAGNSSDPANTAGSAIDSVLTSNVHAEAISEIQDTVEEAESENLVREGEKVRTAETT