; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

Sgr027035 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene IDSgr027035
OrganismSiraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
DescriptionMembrane fusion protein Use1
Genome locationtig00153047:3343767..3349922
RNA-Seq ExpressionSgr027035
SyntenySgr027035
Gene Ontology termsGO:0015031 - protein transport (biological process)
GO:0005789 - endoplasmic reticulum membrane (cellular component)
GO:0016021 - integral component of membrane (cellular component)
InterPro domainsIPR019150 - Vesicle transport protein, Use1


Homology Show/hide homology
GenBank top hitse value%identityAlignment
XP_004134064.1 uncharacterized protein LOC101209335 [Cucumis sativus]1.1e-9890.95Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKL VPLPDV+ESSEPSTSTSV E SS+AE +
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE

Query:  INTLSSP-GLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTG
        +NT SSP GLRRRF  SS+V+DRSHGT+++DSSAPVKLDAAAI+HIEKHRKLQEDLTDEMVGLAKQLKESSL+MSKSLENTEKILDSTEKAVEDSLATTG
Subjt:  INTLSSP-GLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTG

Query:  RVNKRAVEIYSESSKTSCFTW
        RVNKRAV+IYSESSKTSCFTW
Subjt:  RVNKRAVEIYSESSKTSCFTW

XP_008438519.1 PREDICTED: uncharacterized protein LOC103483591 [Cucumis melo]1.3e-9992.31Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKL VPLPDV+ESSEPSTSTSV E SSVAEG+
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE

Query:  INTLSSP-GLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTG
        +NT SSP GLRRRF  SS V+DRSHGT++EDSSAPVKLDAAAI+HIEKHRKLQEDLTDEMVGLAKQLKESSL+MSKSLENTEKILDSTEKAVEDSLATTG
Subjt:  INTLSSP-GLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTG

Query:  RVNKRAVEIYSESSKTSCFTW
        RVNKRAV+IYSESSKTSCFTW
Subjt:  RVNKRAVEIYSESSKTSCFTW

XP_022135542.1 uncharacterized protein LOC111007470 [Momordica charantia]4.9e-10495.45Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE
        M LSKTEINLKRLLATAP QKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDV+ESSEPSTS SV EFSSV EGE
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE

Query:  INTLSSPGLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTGR
        +NTLSSPGLRRRFQPSSVVDDRSHGT++EDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSL+MSKSLENTEKILDSTEKAVEDSLATTGR
Subjt:  INTLSSPGLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTGR

Query:  VNKRAVEIYSESSKTSCFTW
        VNKRAVEIYSESSKTSCFTW
Subjt:  VNKRAVEIYSESSKTSCFTW

XP_022921329.1 uncharacterized protein LOC111429634 isoform X1 [Cucurbita moschata]9.0e-9890.99Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTP+G+SRVSKALLGDY+EKIEAIASKLAVPLPD +ESSEPSTSTSV EFSSVAEG+
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE

Query:  INT-LSSPGLRRRFQP-SSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATT
        IN   S PGLRRRF P SSVV+DRSHGT++EDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLA+QLKESSL+MSKSLE+TEKILDSTEKAVEDSLATT
Subjt:  INT-LSSPGLRRRFQP-SSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATT

Query:  GRVNKRAVEIYSESSKTSCFTW
        GRVNKRAV+IYSESSKTSCFTW
Subjt:  GRVNKRAVEIYSESSKTSCFTW

XP_038879132.1 uncharacterized protein LOC120071129 isoform X3 [Benincasa hispida]2.8e-9991.86Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDV+ESSEPSTSTS  E SSVAEG+
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE

Query:  INTLSSP-GLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTG
        +NT SSP GLRRRF  SS+V+DRSHGT++EDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSL+MSKSLE+TEKILDSTEKAVEDSLATTG
Subjt:  INTLSSP-GLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTG

Query:  RVNKRAVEIYSESSKTSCFTW
        +VNKRAV+IYSESSKTSCFTW
Subjt:  RVNKRAVEIYSESSKTSCFTW

TrEMBL top hitse value%identityAlignment
A0A1S3AWP3 uncharacterized protein LOC1034835916.1e-10092.31Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKL VPLPDV+ESSEPSTSTSV E SSVAEG+
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE

Query:  INTLSSP-GLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTG
        +NT SSP GLRRRF  SS V+DRSHGT++EDSSAPVKLDAAAI+HIEKHRKLQEDLTDEMVGLAKQLKESSL+MSKSLENTEKILDSTEKAVEDSLATTG
Subjt:  INTLSSP-GLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTG

Query:  RVNKRAVEIYSESSKTSCFTW
        RVNKRAV+IYSESSKTSCFTW
Subjt:  RVNKRAVEIYSESSKTSCFTW

A0A5A7U6J9 Cation exchanger family protein6.1e-10092.31Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKL VPLPDV+ESSEPSTSTSV E SSVAEG+
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE

Query:  INTLSSP-GLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTG
        +NT SSP GLRRRF  SS V+DRSHGT++EDSSAPVKLDAAAI+HIEKHRKLQEDLTDEMVGLAKQLKESSL+MSKSLENTEKILDSTEKAVEDSLATTG
Subjt:  INTLSSP-GLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTG

Query:  RVNKRAVEIYSESSKTSCFTW
        RVNKRAV+IYSESSKTSCFTW
Subjt:  RVNKRAVEIYSESSKTSCFTW

A0A6J1C2Z7 uncharacterized protein LOC1110074702.4e-10495.45Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE
        M LSKTEINLKRLLATAP QKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDV+ESSEPSTS SV EFSSV EGE
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE

Query:  INTLSSPGLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTGR
        +NTLSSPGLRRRFQPSSVVDDRSHGT++EDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSL+MSKSLENTEKILDSTEKAVEDSLATTGR
Subjt:  INTLSSPGLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTGR

Query:  VNKRAVEIYSESSKTSCFTW
        VNKRAVEIYSESSKTSCFTW
Subjt:  VNKRAVEIYSESSKTSCFTW

A0A6J1E043 uncharacterized protein LOC111429634 isoform X14.4e-9890.99Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE
        MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTP+G+SRVSKALLGDY+EKIEAIASKLAVPLPD +ESSEPSTSTSV EFSSVAEG+
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE

Query:  INT-LSSPGLRRRFQP-SSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATT
        IN   S PGLRRRF P SSVV+DRSHGT++EDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLA+QLKESSL+MSKSLE+TEKILDSTEKAVEDSLATT
Subjt:  INT-LSSPGLRRRFQP-SSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATT

Query:  GRVNKRAVEIYSESSKTSCFTW
        GRVNKRAV+IYSESSKTSCFTW
Subjt:  GRVNKRAVEIYSESSKTSCFTW

A0A6J1JGJ9 uncharacterized protein LOC111484922 isoform X12.8e-9790.99Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE
        MGLSKTEINLKRLLATA HQKDQAKLIHYVTTLREQLEQLAEEKTP+G+SRVSKALLGDY+EKIEAIASKLAVPLPD +ESSEPSTSTSV EFSSVAEGE
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE

Query:  INT-LSSPGLRRRFQP-SSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATT
        IN   S PGLRRRF P SSVV+DRSHGT++EDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLA+QLKESSL+MSKSLE+TEKILDSTEKAVEDSLATT
Subjt:  INT-LSSPGLRRRFQP-SSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATT

Query:  GRVNKRAVEIYSESSKTSCFTW
        GRVNKRAV+IYSESSKTSCFTW
Subjt:  GRVNKRAVEIYSESSKTSCFTW

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G54110.1 Membrane fusion protein Use11.1e-5655.45Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE
        MG+ KTEIN  RLL+ AP+Q++Q+KL+HYV TLREQLEQL+EEKT EGL RV+ A + +Y EKIEA+ S++   +P  + S E     S  + S   E +
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE

Query:  INTLSSPGLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTGR
          T +SP LRRR  P+S  +     + + D S P+KLD AA + + K RKLQEDLTDEMV LA+QLKE S M+S+S++NTEKILDSTE+A+E SLA+TG 
Subjt:  INTLSSPGLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTGR

Query:  VNKRAVEIYSESSKTSCFTW
           RA +IYSESSKTSCF W
Subjt:  VNKRAVEIYSESSKTSCFTW

AT3G55600.1 Membrane fusion protein Use13.4e-6360.45Show/hide
Query:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE
        MG+SKTEINL+RLL+ AP+Q++Q+KL+HYV TLREQLEQL+EEKTPEGL RV+KA + +Y EKIEA+ASK+A   P+ + S EP    S    S   E E
Subjt:  MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGE

Query:  INTLSSPGLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTGR
          + +SP LRRR  P+S   ++S    + DSS P+KLD AA +HI+KHRKLQEDLTDEMV LA+QLKE S  +S+S++NTEKILDSTE+A+E SLA+TG 
Subjt:  INTLSSPGLRRRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTGR

Query:  VNKRAVEIYSESSKTSCFTW
           RA +IYS+SSKTSCF W
Subjt:  VNKRAVEIYSESSKTSCFTW


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGGTTTAAGTAAAACTGAAATCAACTTGAAGCGGCTGCTGGCAACTGCCCCTCATCAAAAGGATCAGGCCAAACTAATACACTATGTTACTACTTTAAGAGAACAGCT
GGAGCAACTTGCAGAAGAGAAAACCCCGGAAGGCTTATCGAGAGTTTCAAAGGCTTTGCTTGGCGATTATTCAGAGAAGATAGAGGCCATTGCTTCGAAATTAGCTGTTC
CTTTGCCTGATGTGCAAGAGTCTTCTGAGCCCTCTACAAGTACTTCTGTTGTGGAATTTTCTTCTGTAGCAGAAGGAGAAATTAACACCCTTTCATCTCCAGGACTAAGG
AGGAGATTTCAGCCTTCCTCTGTTGTGGATGATAGATCTCATGGCACCATGGAAGAAGACTCCTCAGCACCTGTCAAGTTGGATGCTGCAGCAATATCCCACATCGAGAA
ACACAGGAAGCTTCAAGAGGACCTGACTGATGAGATGGTTGGGTTAGCGAAGCAACTGAAAGAGAGCAGTCTGATGATGAGCAAATCTTTGGAGAACACTGAAAAGATAC
TGGATTCCACAGAGAAGGCTGTTGAGGATAGCTTGGCAACCACTGGCCGGGTCAACAAACGTGCTGTAGAGATCTACTCGGAGAGCTCGAAAACTTCGTGCTTCACTTGG
GCGTTTGAAGCTCAATATACACTGAACTCAGATCTTGAGGAGCTGCAAAAAATGGCGGCGGTGGGAAGTTCAATTTTCTTCGTGGAGGCCGAGGTTTCTTGGGCGCCGGA
GGACACACCATGCTCGACTTCAGAAGGCTACCGGCCGACGTCGGAGTCTGGCATTCTTCTTCAGTATCCCTTTCCTCCGCCGCCAGCTTGTCGGACGAATCTTGGGGTTG
GGGTCGGCCGTGGCCGACCAGACCACCTGATCTGCCGGCGACCTGAAGTGTTCTTAACGGCGTAA
mRNA sequenceShow/hide mRNA sequence
ATGGGTTTAAGTAAAACTGAAATCAACTTGAAGCGGCTGCTGGCAACTGCCCCTCATCAAAAGGATCAGGCCAAACTAATACACTATGTTACTACTTTAAGAGAACAGCT
GGAGCAACTTGCAGAAGAGAAAACCCCGGAAGGCTTATCGAGAGTTTCAAAGGCTTTGCTTGGCGATTATTCAGAGAAGATAGAGGCCATTGCTTCGAAATTAGCTGTTC
CTTTGCCTGATGTGCAAGAGTCTTCTGAGCCCTCTACAAGTACTTCTGTTGTGGAATTTTCTTCTGTAGCAGAAGGAGAAATTAACACCCTTTCATCTCCAGGACTAAGG
AGGAGATTTCAGCCTTCCTCTGTTGTGGATGATAGATCTCATGGCACCATGGAAGAAGACTCCTCAGCACCTGTCAAGTTGGATGCTGCAGCAATATCCCACATCGAGAA
ACACAGGAAGCTTCAAGAGGACCTGACTGATGAGATGGTTGGGTTAGCGAAGCAACTGAAAGAGAGCAGTCTGATGATGAGCAAATCTTTGGAGAACACTGAAAAGATAC
TGGATTCCACAGAGAAGGCTGTTGAGGATAGCTTGGCAACCACTGGCCGGGTCAACAAACGTGCTGTAGAGATCTACTCGGAGAGCTCGAAAACTTCGTGCTTCACTTGG
GCGTTTGAAGCTCAATATACACTGAACTCAGATCTTGAGGAGCTGCAAAAAATGGCGGCGGTGGGAAGTTCAATTTTCTTCGTGGAGGCCGAGGTTTCTTGGGCGCCGGA
GGACACACCATGCTCGACTTCAGAAGGCTACCGGCCGACGTCGGAGTCTGGCATTCTTCTTCAGTATCCCTTTCCTCCGCCGCCAGCTTGTCGGACGAATCTTGGGGTTG
GGGTCGGCCGTGGCCGACCAGACCACCTGATCTGCCGGCGACCTGAAGTGTTCTTAACGGCGTAA
Protein sequenceShow/hide protein sequence
MGLSKTEINLKRLLATAPHQKDQAKLIHYVTTLREQLEQLAEEKTPEGLSRVSKALLGDYSEKIEAIASKLAVPLPDVQESSEPSTSTSVVEFSSVAEGEINTLSSPGLR
RRFQPSSVVDDRSHGTMEEDSSAPVKLDAAAISHIEKHRKLQEDLTDEMVGLAKQLKESSLMMSKSLENTEKILDSTEKAVEDSLATTGRVNKRAVEIYSESSKTSCFTW
AFEAQYTLNSDLEELQKMAAVGSSIFFVEAEVSWAPEDTPCSTSEGYRPTSESGILLQYPFPPPPACRTNLGVGVGRGRPDHLICRRPEVFLTA