CuGenDBv2

MC08g1259 (gene) of Bitter gourd (Dali-11) v1 genome

Gene ID: MC08g1259
Organism: Momordica charantia cv. Dali-11 (Bitter gourd (Dali-11) v1)
Description: surfeit locus protein 2-like
Genome location: MC08:10916688..10918937
RNA-Seq Expression: MC08g1259
Synteny: MC08g1259
Gene Ontology terms: NA
InterPro domains: IPR008833 - Surfeit locus protein 2
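
The genome location field above follows a `chromosome:start..end` pattern. As a minimal sketch of how it could be read programmatically, the snippet below splits the field and computes the span length. The helper name `parse_location` is hypothetical, and the length assumes 1-based inclusive coordinates, which the page itself does not state.

```python
import re

def parse_location(loc):
    """Split 'chrom:start..end' into (chrom, start, end, length)."""
    m = re.fullmatch(r"(\w+):(\d+)\.\.(\d+)", loc)
    if m is None:
        raise ValueError(f"unrecognized location: {loc!r}")
    chrom, start, end = m.group(1), int(m.group(2)), int(m.group(3))
    # Assumption: coordinates are 1-based and inclusive, so the span
    # covers end - start + 1 bases.
    return chrom, start, end, end - start + 1

print(parse_location("MC08:10916688..10918937"))
# -> ('MC08', 10916688, 10918937, 2250)
```

Under that assumption the gene region spans 2250 bp, which is longer than the mRNA listed further down and so would be consistent with the presence of introns.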


Homology
GenBank top hits (e value, % identity, alignment)
XP_004141117.1 surfeit locus protein 2 [Cucumis sativus] | e value: 1.86e-111 | % identity: 82.78
Query:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        M+TS AEE A   VEG DLLGQPTFTEL+NGRFRCVETGHEL+ KDKD+YSRTKRCR+GLID ALS RKAPLNMF+QDPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKE-SAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKE EKE  AKSGEQQ+KKKAAKA K S+ENS KKKKKE E+T SEA+E NG +++EDAFWMPPVGQRWD DNGGDRW SGSDSE
Subjt:  IWKHINGKRFLNKLEQKELEKE-SAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIAM
        HES+K+IAM
Subjt:  HESEKVIAM

XP_022152129.1 surfeit locus protein 2-like [Momordica charantia] | e value: 1.05e-139 | % identity: 99.03
Query:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW
        MATSR EERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKD YSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW
Subjt:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHES
        KHINGKRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHES
Subjt:  KHINGKRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHES

Query:  EKVIAM
        EKVIAM
Subjt:  EKVIAM

XP_023526152.1 surfeit locus protein 2-like isoform X1 [Cucurbita pepo subsp. pepo] | e value: 3.49e-111 | % identity: 83.73
Query:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        MATS  EERA   VEGADLLG+PTF EL+NGRFRCVETGHE+L KDKD+YSRTKRCR+GLIDFALSHRKAPLNMF+ DPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKELEK+S AKSGEQ+ KKKAAKA K STENS KK KKELEK  SEARE NGG+D+ED FWMPP GQRWD DNGGDRW S SDSE
Subjt:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIAM
        HESEK+ AM
Subjt:  HESEKVIAM

XP_023526153.1 uncharacterized protein LOC111789715 isoform X2 [Cucurbita pepo subsp. pepo] | e value: 5.94e-112 | % identity: 83.73
Query:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        MATS  EERA   VEGADLLG+PTF EL+NGRFRCVETGHE+L KDKD+YSRTKRCR+GLIDFALSHRKAPLNMF+ DPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKELEK+S AKSGEQ+ KKKAAKA K STENS KK KKELEK  SEARE NGG+D+ED FWMPP GQRWD DNGGDRW S SDSE
Subjt:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIAM
        HESEK+ AM
Subjt:  HESEKVIAM

XP_038903228.1 surfeit locus protein 2-like isoform X1 [Benincasa hispida] | e value: 4.28e-112 | % identity: 84.21
Query:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        MATS AEERA   VEGADLLGQPTFTEL+NGRFRCVETGHE+L KDKD+YSRTKRCR+GLIDFALSHRKAPLNMF+QDPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKELEK S AKSGEQQ KKKAAKA K S ENS KKKKKE E+T SEA++ NG +D EDAFWMPP+GQRWD D+GGDRW SGSDSE
Subjt:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIAM
        HE +K+IAM
Subjt:  HESEKVIAM

TrEMBL top hits (e value, % identity, alignment)
A0A0A0LD47 Uncharacterized protein | e value: 9.00e-112 | % identity: 82.78
Query:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        M+TS AEE A   VEG DLLGQPTFTEL+NGRFRCVETGHEL+ KDKD+YSRTKRCR+GLID ALS RKAPLNMF+QDPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKE-SAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKE EKE  AKSGEQQ+KKKAAKA K S+ENS KKKKKE E+T SEA+E NG +++EDAFWMPPVGQRWD DNGGDRW SGSDSE
Subjt:  IWKHINGKRFLNKLEQKELEKE-SAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIAM
        HES+K+IAM
Subjt:  HESEKVIAM

A0A6J1DF42 surfeit locus protein 2-like | e value: 5.09e-140 | % identity: 99.03
Query:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW
        MATSR EERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKD YSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW
Subjt:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHES
        KHINGKRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHES
Subjt:  KHINGKRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHES

Query:  EKVIAM
        EKVIAM
Subjt:  EKVIAM

A0A6J1F4F2 uncharacterized protein LOC111442070 | e value: 2.80e-110 | % identity: 82.78
Query:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        MAT   EERA   VEGADLLGQPTF EL+NGRFRCVETGHE+L KDKD+YSRTKRCR+GLIDFALSHRKAPLNMF+ DPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKELEK+S AKSGEQ+ KKKAAKA K STENS KK KKELEK  SEARE NGG+D+ED FWMPP GQRWD DNGGDRW S SDSE
Subjt:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIAM
        HES+K+ A+
Subjt:  HESEKVIAM

A0A6J1J004 surfeit locus protein 2 isoform X2 | e value: 6.50e-109 | % identity: 82.21
Query:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        MAT   EERA   VEGADLLG+PTF EL+NGRFRCVETGHE+L KDKD+YSRTKRCR+GLIDFALSHRKAPLNMF+ DPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKE EK+S AKSGEQ+ KKKAAKA K STENS KK KKELEK  SEARE NGG+D+ED FWMPP GQRWD DNGGDRW S SDSE
Subjt:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIA
        HES+K+ A
Subjt:  HESEKVIA

A0A6J1J5G4 uncharacterized protein LOC111481480 isoform X3 | e value: 4.61e-109 | % identity: 82.21
Query:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH
        MAT   EERA   VEGADLLG+PTF EL+NGRFRCVETGHE+L KDKD+YSRTKRCR+GLIDFALSHRKAPLNMF+ DPLSRSKLKCKLTGDTINKTEEH
Subjt:  MATSRAEERA--NVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEH

Query:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE
        IWKHINGKRFLNKLEQKE EK+S AKSGEQ+ KKKAAKA K STENS KK KKELEK  SEARE NGG+D+ED FWMPP GQRWD DNGGDRW S SDSE
Subjt:  IWKHINGKRFLNKLEQKELEKES-AKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSE

Query:  HESEKVIA
        HES+K+ A
Subjt:  HESEKVIA

SwissProt top hits (e value, % identity, alignment)
No hits found

Arabidopsis top hits (e value, % identity, alignment)
AT5G14440.1 Surfeit locus protein 2 (SURF2) | e value: 6.0e-58 | % identity: 57.94
Query:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW
        MA +  E     EGADLLG+P + +LENGRF+CV+TGHELL KDK  YS++KRCR+GLID+ALSH K PLN+F+QDP +RSKLKCKLTGDT+NKTEEHIW
Subjt:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKES----AKSGEQQAKKKAA----KASKASTENSNKKKKKELEKTTS---EAREPNGGND----SEDAFWMPPVGQRWDCDN
        KHI G+RFLN+LE+KE EKES    A+ GE  AK+       K  K    N  KK KK +EK  +    A E    ND     E  FWMPP G+RWD D+
Subjt:  KHINGKRFLNKLEQKELEKES----AKSGEQQAKKKAA----KASKASTENSNKKKKKELEKTTS---EAREPNGGND----SEDAFWMPPVGQRWDCDN

Query:  GGDRWVSGSDSEHE
        G DRW S SDS+ E
Subjt:  GGDRWVSGSDSEHE

AT5G14440.2 Surfeit locus protein 2 (SURF2) | e value: 6.0e-58 | % identity: 57.94
Query:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW
        MA +  E     EGADLLG+P + +LENGRF+CV+TGHELL KDK  YS++KRCR+GLID+ALSH K PLN+F+QDP +RSKLKCKLTGDT+NKTEEHIW
Subjt:  MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIW

Query:  KHINGKRFLNKLEQKELEKES----AKSGEQQAKKKAA----KASKASTENSNKKKKKELEKTTS---EAREPNGGND----SEDAFWMPPVGQRWDCDN
        KHI G+RFLN+LE+KE EKES    A+ GE  AK+       K  K    N  KK KK +EK  +    A E    ND     E  FWMPP G+RWD D+
Subjt:  KHINGKRFLNKLEQKELEKES----AKSGEQQAKKKAA----KASKASTENSNKKKKKELEKTTS---EAREPNGGND----SEDAFWMPPVGQRWDCDN

Query:  GGDRWVSGSDSEHE
        G DRW S SDS+ E
Subjt:  GGDRWVSGSDSEHE

AT5G40570.1 Surfeit locus protein 2 (SURF2) | e value: 2.6e-37 | % identity: 50
Query:  EGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIWKHINGKRFLNKL
        EG  L G PTF +L NGR RCVETGHE+L  D ++Y+R KRCR+GLI+ ALS  K PLNMF Q PLSRSKL CKLTGDT+NK EEHIWKH+NGKRFL+KL
Subjt:  EGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIWKHINGKRFLNKL

Query:  EQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGD
        EQ  +E+ +  SG  +            T+ +N +  KE           + G++  D FWMP      + D   D
Subjt:  EQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGD

AT5G40570.2 Surfeit locus protein 2 (SURF2) | e value: 3.2e-35 | % identity: 48.09
Query:  EGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSR-------SKLKCKLTGDTINKTEEHIWKHING
        EG  L G PTF +L NGR RCVETGHE+L  D ++Y+R KRCR+GLI+ ALS  K PLNMF Q PLSR       SKL CKLTGDT+NK EEHIWKH+NG
Subjt:  EGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSR-------SKLKCKLTGDTINKTEEHIWKHING

Query:  KRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGD
        KRFL+KLEQ  +E+ +  SG  +            T+ +N +  KE           + G++  D FWMP      + D   D
Subjt:  KRFLNKLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGD
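
The % identity reported for each hit can be roughly cross-checked against the middle line of an alignment block, where a letter marks an identical residue, `+` a conservative substitution, and a space a mismatch or gap. The sketch below is only an illustration under the assumption that identity is counted as identical columns divided by aligned columns; the page does not state how its values were computed, and the function name `percent_identity` is hypothetical. It handles one alignment segment at a time.

```python
def percent_identity(query, midline):
    """Percent of aligned columns whose midline character is a letter."""
    # Trailing spaces are often lost when a page is copied as text,
    # so pad the midline back to the length of the query segment.
    midline = midline.ljust(len(query))
    identical = sum(ch.isalpha() for ch in midline)
    return 100.0 * identical / len(query)

# Last segment of the Cucumis sativus hit: 7 of 9 columns are identical.
print(round(percent_identity("HESEKVIAM", "HES+K+IAM"), 2))
```

To approximate the whole-hit figure, the identical and aligned column counts would need to be summed over all segments of that hit before dividing.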


Sequences
CDS sequence
ATGGCGACGAGTAGAGCCGAGGAGAGGGCGAATGTGGAAGGGGCCGATCTTCTGGGGCAACCGACCTTCACAGAGCTCGAAAATGGCCGATTCCGCTGCGTGGAGACCGG
CCACGAACTCCTCCCCAAGGATAAGGACACCTACTCTCGGACCAAGCGCTGCCGCATCGGCCTCATCGACTTCGCTTTGTCCCATCGAAAAGCTCCTCTCAATATGTTCG
ACCAGGATCCTCTCTCTCGTTCGAAGTTAAAATGCAAACTGACTGGTGATACCATCAACAAAACAGAGGAACATATATGGAAGCACATTAATGGCAAGCGTTTCCTCAAC
AAATTAGAGCAAAAGGAGTTAGAGAAAGAGTCGGCTAAATCAGGAGAGCAGCAAGCCAAGAAGAAGGCTGCTAAGGCTTCAAAGGCAAGTACAGAAAATTCAAATAAGAA
GAAGAAGAAGGAACTGGAGAAGACTACTTCTGAAGCAAGGGAACCCAATGGGGGGAATGACTCAGAGGACGCCTTCTGGATGCCTCCGGTAGGGCAGCGTTGGGATTGTG
ACAATGGAGGAGACCGATGGGTTTCAGGATCAGATTCGGAGCATGAAAGTGAGAAGGTCATTGCAATGGTTTGTTTGGGGTTAATGGTTGTGTCAGGGGGACCTTGGGGG
TCCTTTCTTGGTCTCGGTTTCTGGCTTGAGGGAGTTGGTTTTGCTCTGGGTTGCTTGTTTGGCTGTTCGGATATCTCCTACTATGCTTTTAAAAGGCAA
mRNA sequence
CCCAACATCTATTAGTTCAATCAACCAACATTTTCAAAATCATTTTGAAGCCGTATACCCATTTCGGCCTGACTCCCGGTTCTCTCTTAAATAAAGCTATGAAAAAACGT
GAGTACATCGACATTTAAAACTTTAGATCCAATAAACCGTTTGAAAATATCCAAAACAAATAAGAGTGGAAATTGCATTTTAACCGATAAATAAATAAACAACCCTCATA
TTGTCCAAAACCCCCGAAGCCCGCCGACATTTGTCCCATGGCGACGAGTAGAGCCGAGGAGAGGGCGAATGTGGAAGGGGCCGATCTTCTGGGGCAACCGACCTTCACAG
AGCTCGAAAATGGCCGATTCCGCTGCGTGGAGACCGGCCACGAACTCCTCCCCAAGGATAAGGACACCTACTCTCGGACCAAGCGCTGCCGCATCGGCCTCATCGACTTC
GCTTTGTCCCATCGAAAAGCTCCTCTCAATATGTTCGACCAGGATCCTCTCTCTCGTTCGAAGTTAAAATGCAAACTGACTGGTGATACCATCAACAAAACAGAGGAACA
TATATGGAAGCACATTAATGGCAAGCGTTTCCTCAACAAATTAGAGCAAAAGGAGTTAGAGAAAGAGTCGGCTAAATCAGGAGAGCAGCAAGCCAAGAAGAAGGCTGCTA
AGGCTTCAAAGGCAAGTACAGAAAATTCAAATAAGAAGAAGAAGAAGGAACTGGAGAAGACTACTTCTGAAGCAAGGGAACCCAATGGGGGGAATGACTCAGAGGACGCC
TTCTGGATGCCTCCGGTAGGGCAGCGTTGGGATTGTGACAATGGAGGAGACCGATGGGTTTCAGGATCAGATTCGGAGCATGAAAGTGAGAAGGTCATTGCAATGGTTTG
TTTGGGGTTAATGGTTGTGTCAGGGGGACCTTGGGGGTCCTTTCTTGGTCTCGGTTTCTGGCTTGAGGGAGTTGGTTTTGCTCTGGGTTGCTTGTTTGGCTGTTCGGATA
TCTCCTACTATGCTTTTAAAAGGCAA
Protein sequence
MATSRAEERANVEGADLLGQPTFTELENGRFRCVETGHELLPKDKDTYSRTKRCRIGLIDFALSHRKAPLNMFDQDPLSRSKLKCKLTGDTINKTEEHIWKHINGKRFLN
KLEQKELEKESAKSGEQQAKKKAAKASKASTENSNKKKKKELEKTTSEAREPNGGNDSEDAFWMPPVGQRWDCDNGGDRWVSGSDSEHESEKVIAMVCLGLMVVSGGPWG
SFLGLGFWLEGVGFALGCLFGCSDISYYAFKRQ
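
The relationship between the CDS and protein sequences above can be checked by translating the CDS codon by codon. The following is a minimal sketch using the standard genetic code (assumed here; the page does not name a translation table), with hypothetical names `CODON_TABLE` and `translate`. Only the first 45 bases of the CDS are used in the example call to keep it short.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence; unknown codons become 'X'."""
    cds = cds.upper()
    usable = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    return "".join(CODON_TABLE.get(cds[i:i + 3], "X")
                   for i in range(0, usable, 3))

# First 45 bases of the CDS sequence listed above.
print(translate("ATGGCGACGAGTAGAGCCGAGGAGAGGGCGAATGTGGAAGGGGCC"))
# -> MATSRAEERANVEGA
```

The result matches the first 15 residues of the protein sequence; passing the full CDS (with line breaks removed) would allow the same comparison over all residues.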