CuGenDBv2

Spg030895 (gene) of Sponge gourd (cylindrica) v1 genome

Gene ID: Spg030895
Organism: Luffa cylindrica (Sponge gourd (cylindrica) v1)
Description: ASCH domain-containing protein
Genome location: scaffold11:30884972..30889215
RNA-Seq Expression: Spg030895
Synteny: Spg030895
Gene Ontology terms: GO:0016021 - integral component of membrane (cellular component)
InterPro domains: IPR007374 - ASCH domain
                  IPR015947 - PUA-like superfamily
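The genome location field follows a `seqid:start..end` pattern. A minimal Python sketch for splitting it into parts is shown here. The helper names `Region` and `parse_location` are hypothetical, and the code assumes the coordinates are 1-based and inclusive, which this record does not state.

```python
import re
from typing import NamedTuple


class Region(NamedTuple):
    """A genomic interval; coordinates are ASSUMED 1-based and inclusive."""
    seqid: str
    start: int
    end: int

    @property
    def length(self) -> int:
        # Inclusive coordinates: both endpoints count toward the span.
        return self.end - self.start + 1


def parse_location(text: str) -> Region:
    """Parse a 'seqid:start..end' string such as the one in this record."""
    match = re.fullmatch(r"(\S+):(\d+)\.\.(\d+)", text.strip())
    if match is None:
        raise ValueError(f"unrecognised location string: {text!r}")
    return Region(match.group(1), int(match.group(2)), int(match.group(3)))


region = parse_location("scaffold11:30884972..30889215")
print(region.seqid, region.start, region.end, region.length)
# scaffold11 30884972 30889215 4244
```

Under the inclusive-coordinate assumption, the gene spans 4244 bp on scaffold11.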


Homology
GenBank top hits (e value, % identity, alignment)
KAG7018417.1 hypothetical protein SDJN02_20285 [Cucurbita argyrosperma subsp. argyrosperma] | e value: 3.3e-100 | % identity: 81.09
Query:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK
        +MQR+PSSPGSS VEL DCLEELLKFTLQ H+DG   HDLGLSA+ C HLLNDDLPR NLDRPD    +S+LY DLA  L KSVS+A CGSL  D+L+DK
Subjt:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK

Query:  EEWNELITQGGAELVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTDGV
        E+  ELITQGGAELVLKT NFELHVQEPFFTQLKDGLK VEGRCA GDY RIQPGALILFNKCLLFEVQDV +YPSFSAMLE E LDKVLPGVKTLTDGV
Subjt:  EEWNELITQGGAELVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTDGV

Query:  QIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        QIYR FYSEEKE SNGVLGIHVKKS VQP +ILSRIIS
Subjt:  QIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

XP_022980139.1 uncharacterized protein LOC111479615 [Cucurbita maxima] | e value: 4.9e-104 | % identity: 84.17
Query:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK
        +MQR+PSSPGSSPVEL DCLEELLKFTLQ H+DG   HDLGLSA+ CSHLLNDDLPRPNLDRPD    VS+LYKDLA  L KSVS+A CGSL  DDL+DK
Subjt:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK

Query:  EEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD
        E+  ELITQGGAEL  VLKTVNFELHVQEPFFTQLKDGLKTVEGRCA GDY RIQPGALILFNKCLLFEVQDV +YPSFSAMLE E LDKVLPGVKTLTD
Subjt:  EEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD

Query:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        GVQIYR FYSEEKE SNGVLGIHVKKSVVQP +ILSRIIS
Subjt:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

XP_023527933.1 uncharacterized protein LOC111791000 [Cucurbita pepo subsp. pepo] | e value: 1.7e-101 | % identity: 82.5
Query:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK
        +MQR+PSSPGSSPVEL DCLEELLKFTLQ H+DG   HDLGLSA+ CSHLLNDDLPR NLDRPD    VS+LY DLA  L KSVS+A C SL  DDL+DK
Subjt:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK

Query:  EEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD
        E+  ELITQGGAEL  VLKTVNFELHVQEPFFTQLKDGLKTVEGRCA GDY RIQPGALILFNKCLLFEVQDV +YPSFSAMLE E LDKVLPGVKTLTD
Subjt:  EEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD

Query:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        GVQIYR FYSEEKE SNGVLGIHVKKSVVQP ++LSRIIS
Subjt:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

XP_038879095.1 uncharacterized protein LOC120071108 isoform X1 [Benincasa hispida] | e value: 2.5e-100 | % identity: 76.68
Query:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDR-------------PDSLSEVSRLYKDLALALCKSVSEAS
        M+RQPSSPGSSPVEL DCLEELL+FTLQ H+DG   HDLGL  D CSHLLN +LP PNLDR              +SLS+VSRLYKDL  +L KSVS+ S
Subjt:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDR-------------PDSLSEVSRLYKDLALALCKSVSEAS

Query:  CGSLDDDDLKDKEEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGL
        CGSL  DDL+DKEE NELI QGGAEL  VLKTVNFELHV EPFFTQL+DGLK VEGRCA GDYNRIQPGALILFNKCLLFEVQDV +YPSFSAML+ E L
Subjt:  CGSLDDDDLKDKEEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGL

Query:  DKVLPGVKTLTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISV
        DKVLPG+KTL DG+QIYRKFYSEEKE+SNGVLGIHVKKSV QPY++LSRIISV
Subjt:  DKVLPGVKTLTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISV

XP_038879102.1 uncharacterized protein LOC120071108 isoform X2 [Benincasa hispida] | e value: 2.5e-100 | % identity: 80
Query:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDKE
        M+RQPSSPGSSPVEL DCLEELL+FTLQ H+DG   HDLGL  D CSHLLN +LP PNLDR D    VSRLYKDL  +L KSVS+ SCGSL  DDL+DKE
Subjt:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDKE

Query:  EWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTDG
        E NELI QGGAEL  VLKTVNFELHV EPFFTQL+DGLK VEGRCA GDYNRIQPGALILFNKCLLFEVQDV +YPSFSAML+ E LDKVLPG+KTL DG
Subjt:  EWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTDG

Query:  VQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISV
        +QIYRKFYSEEKE+SNGVLGIHVKKSV QPY++LSRIISV
Subjt:  VQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISV

TrEMBL top hits (e value, % identity, alignment)
A0A6J1CWC5 uncharacterized protein LOC111015172 isoform X7 | e value: 2.6e-95 | % identity: 76.23
Query:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDD----DDL
        MQRQ  SPG+SPVEL DC+EELLKFTLQ H+DG   HDL LSA+ CS LL DD P  N DRP    E SRLYK+LALA+ KSVS+ SCGS D+    DDL
Subjt:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDD----DDL

Query:  KDKEEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKT
        ++KEEW+ELITQGGAEL  VLKTVN+ELHVQEPFFTQ+K  LKTVEGRCA GDYNR+QPG LILFNKCLL EVQDV +Y SFSAMLE EGLDKVLPGVKT
Subjt:  KDKEEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKT

Query:  LTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISV
        L DGVQ+YRKFYSEEKELSNGVLGIHVKKSV QP+I+LSRIIS+
Subjt:  LTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISV

A0A6J1CWY4 uncharacterized protein LOC111015172 isoform X6 | e value: 2.6e-95 | % identity: 76.54
Query:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDD----DDL
        MQRQ  SPG+SPVEL DC+EELLKFTLQ H+DG   HDL LSA+ CS LL DD P  N DRP    E SRLYK+LALA+ KSVS+ SCGS D+    DDL
Subjt:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDD----DDL

Query:  KDKEEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKT
        ++KEEW+ELITQGGAEL  VLKTVN+ELHVQEPFFTQ+K  LKTVEGRCA GDYNR+QPG LILFNKCLL EVQDV +Y SFSAMLE EGLDKVLPGVKT
Subjt:  KDKEEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKT

Query:  LTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        L DGVQ+YRKFYSEEKELSNGVLGIHVKKSV QP+I+LSRIIS
Subjt:  LTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

A0A6J1CXS2 uncharacterized protein LOC111015172 isoform X9 | e value: 2.6e-95 | % identity: 76.54
Query:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDD----DDL
        MQRQ  SPG+SPVEL DC+EELLKFTLQ H+DG   HDL LSA+ CS LL DD P  N DRP    E SRLYK+LALA+ KSVS+ SCGS D+    DDL
Subjt:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDD----DDL

Query:  KDKEEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKT
        ++KEEW+ELITQGGAEL  VLKTVN+ELHVQEPFFTQ+K  LKTVEGRCA GDYNR+QPG LILFNKCLL EVQDV +Y SFSAMLE EGLDKVLPGVKT
Subjt:  KDKEEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKT

Query:  LTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        L DGVQ+YRKFYSEEKELSNGVLGIHVKKSV QP+I+LSRIIS
Subjt:  LTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

A0A6J1GV12 uncharacterized protein LOC111457433 | e value: 4.6e-100 | % identity: 81.25
Query:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK
        +MQR+PSSPGSS VEL DCLEELLKFTLQ H+DG   HDLGLSA+ C HLLNDDLPR NLDRPD    +S+LY DLA  L KSVS+A CGSL  D+L+DK
Subjt:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK

Query:  EEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD
        E+  ELITQGGAEL  VLKTVNFELHVQEPFFTQLKDGLKTVEGRCA GDY RIQPGALILFNKCLLFEVQDV +YPSFSAMLE E LDKVLPGVKTLTD
Subjt:  EEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD

Query:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        GVQIYR FYSEEKE SNGVLGIHVKKS VQP +ILSRIIS
Subjt:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

A0A6J1IYG9 uncharacterized protein LOC111479615 | e value: 2.4e-104 | % identity: 84.17
Query:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK
        +MQR+PSSPGSSPVEL DCLEELLKFTLQ H+DG   HDLGLSA+ CSHLLNDDLPRPNLDRPD    VS+LYKDLA  L KSVS+A CGSL  DDL+DK
Subjt:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK

Query:  EEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD
        E+  ELITQGGAEL  VLKTVNFELHVQEPFFTQLKDGLKTVEGRCA GDY RIQPGALILFNKCLLFEVQDV +YPSFSAMLE E LDKVLPGVKTLTD
Subjt:  EEWNELITQGGAEL--VLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD

Query:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        GVQIYR FYSEEKE SNGVLGIHVKKSVVQP +ILSRIIS
Subjt:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

SwissProt top hits (e value, % identity, alignment)
No hits found
Arabidopsis top hits (e value, % identity, alignment)
AT2G43465.1 RNA-binding ASCH domain protein | e value: 3.2e-45 | % identity: 42
Query:  VELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEV----------SRLYKDLALALCKSVSEAS-CGSLDDDD------
        +++ DCL+E++KFTL + V+     D+GL+ + CS LL  +    + +R +S S              LYK LAL L KS+   S CG+ +         
Subjt:  VELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEV----------SRLYKDLALALCKSVSEAS-CGSLDDDD------

Query:  -LKDKE-EWNELITQGGAELV--LKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRI-QPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLP
         LK+KE EW++LI Q G+ELV  LK V  EL VQEP F+ +KDG+KTVE RC   +Y+RI + G++++ NKCL+FEV ++ +Y SF  +L+ E  +KV P
Subjt:  -LKDKE-EWNELITQGGAELV--LKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRI-QPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLP

Query:  GVKTLTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISVVS
        G KT+ +G+Q++RK Y  ++E  NGV+ IH+ KSV QP + L+ I+S +S
Subjt:  GVKTLTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISVVS

AT3G03320.1 RNA-binding ASCH domain protein | e value: 1.9e-58 | % identity: 52.32
Query:  QPSSPGSSPVELEDCLEELLKFTLQFHV-DGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDKEEW
        QP SPG+  V+L +C+E LL+F+L+ H+ +  P  DL L+ D C HLL         +  DS +E S +YK LA AL      + C + + D   + E++
Subjt:  QPSSPGSSPVELEDCLEELLKFTLQFHV-DGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDKEEW

Query:  NELITQGGAELV--LKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTDGVQ
        ++LI   G +L+  LK VNFELHVQEP+FTQLKDGLKTVEGRCA GDY RI  G  +LFNKCLL EVQDV RY SFS ML+ EGL KVLPGV+++ +GVQ
Subjt:  NELITQGGAELV--LKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTDGVQ

Query:  IYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        +YR FYSEEKE  NGV+ I V K   QP   L+ ++S
Subjt:  IYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
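In the alignment blocks above, the unlabeled middle line marks each column: a letter is an identical residue, `+` is a conservative substitution, and a space is a mismatch or gap. A rough Python sketch for tallying one block from that middle line follows. The function name `midline_stats` is hypothetical, and the result describes a single block only, so it will generally differ from the whole-alignment % identity reported per hit.

```python
def midline_stats(query: str, midline: str) -> tuple[int, int, int]:
    """Return (identities, positives, columns) for one alignment block.

    `query` is the text after 'Query:' and `midline` is the line under it.
    Trailing spaces in the midline are often lost in plain-text copies, so
    the midline is padded back to the query length before counting.
    """
    midline = midline.ljust(len(query))
    identities = sum(ch.isalpha() for ch in midline)   # letters = identical residues
    positives = identities + midline.count("+")        # '+' = conservative substitution
    return identities, positives, len(midline)


# Last block of the first GenBank hit (KAG7018417.1), copied from this record.
query = "QIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS"
midline = "QIYR FYSEEKE SNGVLGIHVKKS VQP +ILSRIIS"

identities, positives, columns = midline_stats(query, midline)
print(identities, positives, columns)
# 33 34 38
```

Summing identities and columns over every block of a hit, then dividing, gives a figure comparable to the % identity column, assuming that column is computed over alignment columns including gaps.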


Sequences
CDS sequence
ATGTACTTACTTCCCCCCAAAAGACGGAAGAAAGAAATGCAGCGGCAGCCATCTTCACCTGGATCATCGCCCGTTGAGCTCGAAGACTGCCTAGAAGAGCTTCTCAAGTT
CACGCTCCAATTTCACGTAGATGGAGACCCAGTGCATGATCTAGGGTTGTCCGCAGACTCATGCTCCCACCTCCTGAATGACGATCTTCCTCGACCCAATTTGGATCGTC
CTGATTCTCTATCAGAAGTTTCCAGATTGTATAAGGATCTAGCGCTGGCTCTTTGTAAGTCAGTCTCTGAAGCATCATGTGGGTCATTAGACGATGATGATTTGAAGGAC
AAAGAAGAATGGAATGAGTTGATTACCCAAGGAGGAGCTGAGTTAGTTCTAAAGACAGTAAACTTTGAGCTTCATGTACAGGAGCCATTCTTTACTCAGCTGAAAGATGG
CCTAAAGACAGTGGAAGGAAGATGTGCTACTGGAGATTACAATCGAATTCAGCCTGGAGCCTTGATACTTTTCAATAAATGTTTGTTGTTTGAGGTTCAGGATGTATGTC
GATACCCTTCATTTTCTGCAATGTTGGAAACAGAGGGTCTTGATAAAGTTCTTCCTGGAGTAAAAACCTTAACTGATGGTGTCCAAATATACAGGAAGTTCTACTCTGAA
GAGAAAGAACTGTCCAATGGCGTCCTTGGGATCCATGTCAAAAAATCTGTTGTCCAGCCATACATTATTTTGTCCAGAATTATATCTGTTGTCAGTGGCGGCTGGTCGGC
GGCGGCCAGTATTAGGAGGTCGACGATGGCGGCCGGTAGTCAGAGGTCAACGGTGGCGATCGGTAGTCGGAGGTCGAAGGTGGCGGCTGGTCGGTGGAGGTCGGTCAGTG
GAGGTGGGTTGCGGCCGGTTAGCGGAGGTCGACGGCACCCGAAAGTCGAAGGTCAACGGTGA
mRNA sequence
ATGTACTTACTTCCCCCCAAAAGACGGAAGAAAGAAATGCAGCGGCAGCCATCTTCACCTGGATCATCGCCCGTTGAGCTCGAAGACTGCCTAGAAGAGCTTCTCAAGTT
CACGCTCCAATTTCACGTAGATGGAGACCCAGTGCATGATCTAGGGTTGTCCGCAGACTCATGCTCCCACCTCCTGAATGACGATCTTCCTCGACCCAATTTGGATCGTC
CTGATTCTCTATCAGAAGTTTCCAGATTGTATAAGGATCTAGCGCTGGCTCTTTGTAAGTCAGTCTCTGAAGCATCATGTGGGTCATTAGACGATGATGATTTGAAGGAC
AAAGAAGAATGGAATGAGTTGATTACCCAAGGAGGAGCTGAGTTAGTTCTAAAGACAGTAAACTTTGAGCTTCATGTACAGGAGCCATTCTTTACTCAGCTGAAAGATGG
CCTAAAGACAGTGGAAGGAAGATGTGCTACTGGAGATTACAATCGAATTCAGCCTGGAGCCTTGATACTTTTCAATAAATGTTTGTTGTTTGAGGTTCAGGATGTATGTC
GATACCCTTCATTTTCTGCAATGTTGGAAACAGAGGGTCTTGATAAAGTTCTTCCTGGAGTAAAAACCTTAACTGATGGTGTCCAAATATACAGGAAGTTCTACTCTGAA
GAGAAAGAACTGTCCAATGGCGTCCTTGGGATCCATGTCAAAAAATCTGTTGTCCAGCCATACATTATTTTGTCCAGAATTATATCTGTTGTCAGTGGCGGCTGGTCGGC
GGCGGCCAGTATTAGGAGGTCGACGATGGCGGCCGGTAGTCAGAGGTCAACGGTGGCGATCGGTAGTCGGAGGTCGAAGGTGGCGGCTGGTCGGTGGAGGTCGGTCAGTG
GAGGTGGGTTGCGGCCGGTTAGCGGAGGTCGACGGCACCCGAAAGTCGAAGGTCAACGGTGA
Protein sequence
MYLLPPKRRKKEMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKD
KEEWNELITQGGAELVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTDGVQIYRKFYSE
EKELSNGVLGIHVKKSVVQPYIILSRIISVVSGGWSAAASIRRSTMAAGSQRSTVAIGSRRSKVAAGRWRSVSGGGLRPVSGGRRHPKVEGQR
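The protein sequence can be checked against the CDS by translating it codon by codon. The sketch below assumes the standard genetic code (NCBI translation table 1), which this record does not state explicitly. The names `CODON_TABLE` and `translate` are illustrative, not part of any database tool.

```python
from itertools import product

# Standard genetic code (NCBI translation table 1), with codons enumerated
# in TCAG order: TTT, TTC, TTA, TTG, TCT, ...
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()          # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for ambiguous codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 18 codons of the CDS sequence listed in this record.
cds_start = "ATGTACTTACTTCCCCCCAAAAGACGGAAGAAAGAAATGCAGCGGCAGCCATCT"
print(translate(cds_start))
# MYLLPPKRRKKEMQRQPS
```

The output matches the first 18 residues of the protein sequence. Passing the full CDS, with its line breaks left in, should reproduce the full protein up to the terminal TGA stop codon.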