; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Lcy10g007700 (gene) of Sponge gourd (P93075) v1 genome

Gene IDLcy10g007700
OrganismLuffa cylindrica cv. P93075 (Sponge gourd (P93075) v1)
DescriptionASCH domain-containing protein
Genome locationChr10:32528963..32532938
RNA-Seq ExpressionLcy10g007700
SyntenyLcy10g007700
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR007374 - ASCH domain
IPR015947 - PUA-like superfamily


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_022980139.1 uncharacterized protein LOC111479615 [Cucurbita maxima]2.0e-10685Show/hide
Query:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK
        +MQR+PSSPGSSPVEL DCLEELLKFTLQ H+DG   HDLGLSA+ CSHLLNDDLPRPNLDRPD    VS+LYKDLA  L KSVS+A CGSL  DDL+DK
Subjt:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK

Query:  EEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD
        E+  ELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCA GDY RIQPGALILFNKCLLFEVQDV +YPSFSAMLE E LDKVLPGVKTLTD
Subjt:  EEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD

Query:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        GVQIYR FYSEEKE SNGVLGIHVKKSVVQP +ILSRIIS
Subjt:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

XP_023527933.1 uncharacterized protein LOC111791000 [Cucurbita pepo subsp. pepo]7.1e-10483.33Show/hide
Query:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK
        +MQR+PSSPGSSPVEL DCLEELLKFTLQ H+DG   HDLGLSA+ CSHLLNDDLPR NLDRPD    VS+LY DLA  L KSVS+A C SL  DDL+DK
Subjt:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK

Query:  EEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD
        E+  ELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCA GDY RIQPGALILFNKCLLFEVQDV +YPSFSAMLE E LDKVLPGVKTLTD
Subjt:  EEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD

Query:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        GVQIYR FYSEEKE SNGVLGIHVKKSVVQP ++LSRIIS
Subjt:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

XP_038879095.1 uncharacterized protein LOC120071108 isoform X1 [Benincasa hispida]1.5e-10677.27Show/hide
Query:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDR-------------PDSLSEVSRLYKDLALALCKSVSEAS
        M+RQPSSPGSSPVEL DCLEELL+FTLQ H+DG   HDLGL  D CSHLLN +LP PNLDR              +SLS+VSRLYKDL  +L KSVS+ S
Subjt:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDR-------------PDSLSEVSRLYKDLALALCKSVSEAS

Query:  CGSLDDDDLKDKEEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGL
        CGSL  DDL+DKEE NELI QGGAELVNVLKTVNFELHV EPFFTQL+DGLK VEGRCA GDYNRIQPGALILFNKCLLFEVQDV +YPSFSAML+ E L
Subjt:  CGSLDDDDLKDKEEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGL

Query:  DKVLPGVKTLTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISVSSLASLIFFTY
        DKVLPG+KTL DG+QIYRKFYSEEKE+SNGVLGIHVKKSV QPY++LSRIISVS LASL+F TY
Subjt:  DKVLPGVKTLTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISVSSLASLIFFTY

XP_038879102.1 uncharacterized protein LOC120071108 isoform X2 [Benincasa hispida]1.5e-10680.48Show/hide
Query:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDKE
        M+RQPSSPGSSPVEL DCLEELL+FTLQ H+DG   HDLGL  D CSHLLN +LP PNLDR D    VSRLYKDL  +L KSVS+ SCGSL  DDL+DKE
Subjt:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDKE

Query:  EWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTDG
        E NELI QGGAELVNVLKTVNFELHV EPFFTQL+DGLK VEGRCA GDYNRIQPGALILFNKCLLFEVQDV +YPSFSAML+ E LDKVLPG+KTL DG
Subjt:  EWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTDG

Query:  VQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISVSSLASLIFFTY
        +QIYRKFYSEEKE+SNGVLGIHVKKSV QPY++LSRIISVS LASL+F TY
Subjt:  VQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISVSSLASLIFFTY

XP_038879108.1 uncharacterized protein LOC120071108 isoform X3 [Benincasa hispida]2.3e-10277.38Show/hide
Query:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDR-------------PDSLSEVSRLYKDLALALCKSVSEAS
        M+RQPSSPGSSPVEL DCLEELL+FTLQ H+DG   HDLGL  D CSHLLN +LP PNLDR              +SLS+VSRLYKDL  +L KSVS+ S
Subjt:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDR-------------PDSLSEVSRLYKDLALALCKSVSEAS

Query:  CGSLDDDDLKDKEEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGL
        CGSL  DDL+DKEE NELI QGGAELVNVLKTVNFELHV EPFFTQL+DGLK VEGRCA GDYNRIQPGALILFNKCLLFEVQDV +YPSFSAML+ E L
Subjt:  CGSLDDDDLKDKEEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGL

Query:  DKVLPGVKTLTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        DKVLPG+KTL DG+QIYRKFYSEEKE+SNGVLGIHVKKSV QPY++LSRIIS
Subjt:  DKVLPGVKTLTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

TrEMBL top hitse value%identityAlignment
A0A6J1CWC0 uncharacterized protein LOC111015172 isoform X21.8e-9776.33Show/hide
Query:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDD----DDL
        MQRQ  SPG+SPVEL DC+EELLKFTLQ H+DG   HDL LSA+ CS LL DD P  N DRP    E SRLYK+LALA+ KSVS+ SCGS D+    DDL
Subjt:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDD----DDL

Query:  KDKEEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKT
        ++KEEW+ELITQGGAELV VLKTVN+ELHVQEPFFTQ+K  LKTVEGRCA GDYNR+QPG LILFNKCLL EVQDV +Y SFSAMLE EGLDKVLPGVKT
Subjt:  KDKEEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKT

Query:  LTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISVS
        L DGVQ+YRKFYSEEKELSNGVLGIHVKKSV QP+I+LSRIIS++
Subjt:  LTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISVS

A0A6J1CWY0 uncharacterized protein LOC111015172 isoform X11.8e-9776.33Show/hide
Query:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDD----DDL
        MQRQ  SPG+SPVEL DC+EELLKFTLQ H+DG   HDL LSA+ CS LL DD P  N DRP    E SRLYK+LALA+ KSVS+ SCGS D+    DDL
Subjt:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDD----DDL

Query:  KDKEEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKT
        ++KEEW+ELITQGGAELV VLKTVN+ELHVQEPFFTQ+K  LKTVEGRCA GDYNR+QPG LILFNKCLL EVQDV +Y SFSAMLE EGLDKVLPGVKT
Subjt:  KDKEEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKT

Query:  LTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISVS
        L DGVQ+YRKFYSEEKELSNGVLGIHVKKSV QP+I+LSRIIS++
Subjt:  LTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIISVS

A0A6J1CXS2 uncharacterized protein LOC111015172 isoform X94.1e-9776.95Show/hide
Query:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDD----DDL
        MQRQ  SPG+SPVEL DC+EELLKFTLQ H+DG   HDL LSA+ CS LL DD P  N DRP    E SRLYK+LALA+ KSVS+ SCGS D+    DDL
Subjt:  MQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDD----DDL

Query:  KDKEEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKT
        ++KEEW+ELITQGGAELV VLKTVN+ELHVQEPFFTQ+K  LKTVEGRCA GDYNR+QPG LILFNKCLL EVQDV +Y SFSAMLE EGLDKVLPGVKT
Subjt:  KDKEEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKT

Query:  LTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        L DGVQ+YRKFYSEEKELSNGVLGIHVKKSV QP+I+LSRIIS
Subjt:  LTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

A0A6J1GV12 uncharacterized protein LOC1114574331.9e-10282.08Show/hide
Query:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK
        +MQR+PSSPGSS VEL DCLEELLKFTLQ H+DG   HDLGLSA+ C HLLNDDLPR NLDRPD    +S+LY DLA  L KSVS+A CGSL  D+L+DK
Subjt:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK

Query:  EEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD
        E+  ELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCA GDY RIQPGALILFNKCLLFEVQDV +YPSFSAMLE E LDKVLPGVKTLTD
Subjt:  EEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD

Query:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        GVQIYR FYSEEKE SNGVLGIHVKKS VQP +ILSRIIS
Subjt:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

A0A6J1IYG9 uncharacterized protein LOC1114796159.7e-10785Show/hide
Query:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK
        +MQR+PSSPGSSPVEL DCLEELLKFTLQ H+DG   HDLGLSA+ CSHLLNDDLPRPNLDRPD    VS+LYKDLA  L KSVS+A CGSL  DDL+DK
Subjt:  EMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDK

Query:  EEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD
        E+  ELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCA GDY RIQPGALILFNKCLLFEVQDV +YPSFSAMLE E LDKVLPGVKTLTD
Subjt:  EEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTD

Query:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        GVQIYR FYSEEKE SNGVLGIHVKKSVVQP +ILSRIIS
Subjt:  GVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT2G43465.1 RNA-binding ASCH domain protein3.8e-4742.51Show/hide
Query:  VELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEV----------SRLYKDLALALCKSVSEAS-CGSLDDDD------
        +++ DCL+E++KFTL + V+     D+GL+ + CS LL  +    + +R +S S              LYK LAL L KS+   S CG+ +         
Subjt:  VELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEV----------SRLYKDLALALCKSVSEAS-CGSLDDDD------

Query:  -LKDKE-EWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRI-QPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLP
         LK+KE EW++LI Q G+ELVN LK V  EL VQEP F+ +KDG+KTVE RC   +Y+RI + G++++ NKCL+FEV ++ +Y SF  +L+ E  +KV P
Subjt:  -LKDKE-EWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRI-QPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLP

Query:  GVKTLTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        G KT+ +G+Q++RK Y  ++E  NGV+ IH+ KSV QP + L+ I+S
Subjt:  GVKTLTDGVQIYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS

AT3G03320.1 RNA-binding ASCH domain protein1.4e-6052.74Show/hide
Query:  QPSSPGSSPVELEDCLEELLKFTLQFHV-DGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDKEEW
        QP SPG+  V+L +C+E LL+F+L+ H+ +  P  DL L+ D C HLL         +  DS +E S +YK LA AL      + C + + D   + E++
Subjt:  QPSSPGSSPVELEDCLEELLKFTLQFHV-DGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKDKEEW

Query:  NELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTDGVQ
        ++LI   G +L+N+LK VNFELHVQEP+FTQLKDGLKTVEGRCA GDY RI  G  +LFNKCLL EVQDV RY SFS ML+ EGL KVLPGV+++ +GVQ
Subjt:  NELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTDGVQ

Query:  IYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS
        +YR FYSEEKE  NGV+ I V K   QP   L+ ++S
Subjt:  IYRKFYSEEKELSNGVLGIHVKKSVVQPYIILSRIIS


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGTACTTACTTCCCCCCAAAAGACGGAAGAAAGAAATGCAGCGGCAGCCATCTTCACCTGGATCATCGCCCGTTGAGCTCGAAGACTGCCTAGAAGAGCTTCTCAAGTT
CACGCTCCAATTTCACGTAGATGGAGACCCAGTGCATGATCTAGGGTTGTCCGCAGACTCATGCTCCCACCTCCTGAATGACGATCTTCCTCGACCCAATTTGGATCGTC
CTGATTCTCTATCAGAAGTTTCCAGATTGTATAAGGATCTAGCGCTGGCTCTTTGTAAGTCAGTCTCTGAAGCATCATGTGGGTCATTAGACGATGATGATTTGAAGGAC
AAAGAAGAATGGAATGAGTTGATTACCCAAGGAGGAGCTGAGTTAGTAAATGTTCTAAAGACAGTAAACTTTGAGCTTCATGTACAGGAGCCATTCTTTACTCAGCTGAA
AGATGGCCTAAAGACAGTGGAAGGAAGATGTGCTACTGGAGATTACAATCGAATTCAGCCTGGAGCCTTGATACTTTTCAATAAATGTTTGTTGTTTGAGGTTCAGGATG
TATGTCGATACCCTTCATTTTCTGCAATGTTGGAAACAGAGGGTCTTGATAAAGTTCTTCCTGGAGTAAAAACCTTAACTGATGGTGTCCAAATATACAGGAAGTTCTAC
TCTGAAGAGAAAGAACTGTCCAATGGCGTCCTTGGGATCCATGTCAAAAAATCTGTTGTCCAGCCATACATTATTTTGTCCAGAATTATATCTGTGAGTTCGCTTGCTTC
CCTAATTTTCTTTACCTACTTTCTTTATTGGTAA
mRNA sequenceShow/hide mRNA sequence
CTCCGATCTTGTCAATACCCTCTCTTGTACCTGTATGTACTTACTTCCCCCCAAAAGACGGAAGAAAGAAATGCAGCGGCAGCCATCTTCACCTGGATCATCGCCCGTTG
AGCTCGAAGACTGCCTAGAAGAGCTTCTCAAGTTCACGCTCCAATTTCACGTAGATGGAGACCCAGTGCATGATCTAGGGTTGTCCGCAGACTCATGCTCCCACCTCCTG
AATGACGATCTTCCTCGACCCAATTTGGATCGTCCTGATTCTCTATCAGAAGTTTCCAGATTGTATAAGGATCTAGCGCTGGCTCTTTGTAAGTCAGTCTCTGAAGCATC
ATGTGGGTCATTAGACGATGATGATTTGAAGGACAAAGAAGAATGGAATGAGTTGATTACCCAAGGAGGAGCTGAGTTAGTAAATGTTCTAAAGACAGTAAACTTTGAGC
TTCATGTACAGGAGCCATTCTTTACTCAGCTGAAAGATGGCCTAAAGACAGTGGAAGGAAGATGTGCTACTGGAGATTACAATCGAATTCAGCCTGGAGCCTTGATACTT
TTCAATAAATGTTTGTTGTTTGAGGTTCAGGATGTATGTCGATACCCTTCATTTTCTGCAATGTTGGAAACAGAGGGTCTTGATAAAGTTCTTCCTGGAGTAAAAACCTT
AACTGATGGTGTCCAAATATACAGGAAGTTCTACTCTGAAGAGAAAGAACTGTCCAATGGCGTCCTTGGGATCCATGTCAAAAAATCTGTTGTCCAGCCATACATTATTT
TGTCCAGAATTATATCTGTGAGTTCGCTTGCTTCCCTAATTTTCTTTACCTACTTTCTTTATTGGTAAGGCCAACGTAACAACAAGAGAAACTCTTAGGGTTACAGATAA
TTTTTAGCAGAAAAAGTGATTTCGATTATTAAAATTACTTCTCAAAATTTTCACAATGTTGCACTAAACAATCGAAATGATTATGATAGTTTTAAAGTCACTTTAAGTGC
ATTAGAAGAAGCTATTGGCTTGTGCTTCTTCAAACGTAAATAATTGCAGTTAAAAGCGGCAATGGAGCCCTGAGAGGAAGCTCCAAAGAAGCCAAGTCTAGAACATAAGC
TGAGCCCGACCTACTAGGCCTAGCGTGGCTTGAAGCCCAATCTCAGCCCAGTCACTTAAGGCCTTGAGCTGACGGCTCACCAAGGGTCAAGTGAAAACTCTAAGGGCTCG
TTTGATAACGTTCTCGTTTCTGGTTTCTTGTTTCATGTTTCTTGTTTCTTGTTTCTTAAGAACTAAAAACAGAAACATGTTTGACAACTATTTCTGTTTCTCGTTTCTTT
AAAAAAAGAAACAAAAACAGATATGTGTTTGATAACTGTTTCTCGTTTTTTGATTTTTAGTTGAAATAAAG
Protein sequenceShow/hide protein sequence
MYLLPPKRRKKEMQRQPSSPGSSPVELEDCLEELLKFTLQFHVDGDPVHDLGLSADSCSHLLNDDLPRPNLDRPDSLSEVSRLYKDLALALCKSVSEASCGSLDDDDLKD
KEEWNELITQGGAELVNVLKTVNFELHVQEPFFTQLKDGLKTVEGRCATGDYNRIQPGALILFNKCLLFEVQDVCRYPSFSAMLETEGLDKVLPGVKTLTDGVQIYRKFY
SEEKELSNGVLGIHVKKSVVQPYIILSRIISVSSLASLIFFTYFLYW