CuGenDBv2

Sgr019859 (gene) of Monk fruit (Qingpiguo) v1 genome

Gene ID: Sgr019859
Organism: Siraitia grosvenorii cv. Qingpiguo (Monk fruit (Qingpiguo) v1)
Description: Protein of unknown function (DUF1685)
Genome location: tig00153424:140062..152608
RNA-Seq Expression: Sgr019859
Synteny: Sgr019859
Gene Ontology terms: NA
InterPro domains: IPR012881 - Protein of unknown function DUF1685
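
As an illustration of how the genome location string above can be read, here is a minimal Python sketch that splits it into contig, start, and end, and reports the span. The helper names (`GenomeLocation`, `parse_location`) are hypothetical and not part of CuGenDB, and the span calculation assumes the coordinates are 1-based and inclusive, which this page does not state.

```python
import re
from typing import NamedTuple


class GenomeLocation(NamedTuple):
    """Hypothetical container for a 'contig:start..end' location."""
    contig: str
    start: int
    end: int

    @property
    def span(self) -> int:
        # Assumption: coordinates are 1-based and inclusive.
        return self.end - self.start + 1


def parse_location(text: str) -> GenomeLocation:
    """Parse a location string such as 'tig00153424:140062..152608'."""
    match = re.fullmatch(r"\s*([^:\s]+):(\d+)\.\.(\d+)\s*", text)
    if match is None:
        raise ValueError(f"unrecognized location: {text!r}")
    contig = match.group(1)
    start, end = int(match.group(2)), int(match.group(3))
    if start > end:
        raise ValueError("start coordinate is greater than end coordinate")
    return GenomeLocation(contig, start, end)


loc = parse_location("tig00153424:140062..152608")
print(loc.contig, loc.start, loc.end, loc.span)
```

Under the inclusive-coordinate assumption, the gene region on contig tig00153424 spans 12547 positions.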


Homology
GenBank top hits (hit | e value | % identity, followed by the alignment)
XP_004142530.1 uncharacterized protein LOC101212719 [Cucumis sativus] | 4.2e-73 | 89.82
Query:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA
        MDKKH QR+PS+LSSSSSCPSL ESEDELKHMPLAPP+LKNKKRLSKQLSMCETPRDLAWEKRRRQMLRP   RNG TDRDDLTDEDWNELKG IELGFA
Subjt:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA

Query:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTAS-TSSLGGRSSSFGSPKSESDTWRVCSPG
        FNEE GHKLCGTLPALDLYFAVNRQLSPSPVSTPQS+AS +SSLGGRSSSF SP+SE DTWRVCSPG
Subjt:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTAS-TSSLGGRSSSFGSPKSESDTWRVCSPG

XP_008462705.1 PREDICTED: uncharacterized protein LOC103501007 [Cucumis melo] | 8.1e-77 | 92.17
Query:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA
        MDKKH QRMPS+LSSSSSCPSL ES+DELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRP   RNG TDRDDLTDEDWNELKGCIELGFA
Subjt:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA

Query:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG
        FNEE GHKLCGTLPALDLYFAVNRQLSPSPVSTPQS+ASTSSLGGRSSSF SP+SE DTWRVCSPG
Subjt:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG

XP_022155108.1 uncharacterized protein LOC111022242 [Momordica charantia] | 5.6e-78 | 93.41
Query:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA
        MD++  QRM STLSSS SCPSLRESEDELKHM LAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA
Subjt:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA

Query:  FNEEDG-HKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG
        FNEEDG HKLCGTLPALDLYFAVNRQLSPSPVSTPQS+ASTSSLGGRSSSF SPKS+SDTWRVCSPG
Subjt:  FNEEDG-HKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG

XP_023533045.1 uncharacterized protein LOC111795047 [Cucurbita pepo subsp. pepo] | 1.4e-73 | 89.16
Query:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA
        MDKK  QRMPS+LSSSSSCPSL ESEDELKHMPLAPPR KNKKRLSKQLSMCETPRDLAWEKRRRQMLRP   RN  TDRDDLTDEDWNELKGCIELGFA
Subjt:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA

Query:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG
        FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTP S+ S+SSLG RSSSF SPKSES+TW+VCSPG
Subjt:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG

XP_038878789.1 uncharacterized protein LOC120070939 [Benincasa hispida] | 2.4e-76 | 91.57
Query:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA
        MDKKH QRMPS+LSSSSSCPS  ESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRP   RNG TDRDDLTDEDWNELKGCIELGFA
Subjt:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA

Query:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG
        FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQS+ STSSLG RSSSF SP+SE DTWRVCSPG
Subjt:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG

TrEMBL top hits (hit | e value | % identity, followed by the alignment)
A0A0A0LY38 Uncharacterized protein | 2.0e-73 | 89.82
Query:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA
        MDKKH QR+PS+LSSSSSCPSL ESEDELKHMPLAPP+LKNKKRLSKQLSMCETPRDLAWEKRRRQMLRP   RNG TDRDDLTDEDWNELKG IELGFA
Subjt:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA

Query:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTAS-TSSLGGRSSSFGSPKSESDTWRVCSPG
        FNEE GHKLCGTLPALDLYFAVNRQLSPSPVSTPQS+AS +SSLGGRSSSF SP+SE DTWRVCSPG
Subjt:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTAS-TSSLGGRSSSFGSPKSESDTWRVCSPG

A0A1S3CHL5 uncharacterized protein LOC103501007 | 3.9e-77 | 92.17
Query:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA
        MDKKH QRMPS+LSSSSSCPSL ES+DELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRP   RNG TDRDDLTDEDWNELKGCIELGFA
Subjt:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA

Query:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG
        FNEE GHKLCGTLPALDLYFAVNRQLSPSPVSTPQS+ASTSSLGGRSSSF SP+SE DTWRVCSPG
Subjt:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG

A0A5A7V807 Ankyrin-2-like | 3.4e-73 | 92.41
Query:  MPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFAFNEEDGHK
        MPS+LSSSSSCPSL ES+DELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRP   RNG TDRDDLTDEDWNELKGCIELGFAFNEE GHK
Subjt:  MPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFAFNEEDGHK

Query:  LCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG
        LCGTLPALDLYFAVNRQLSPSPVSTPQS+ASTSSLGGRSSSF SP+SE DTWRVCSPG
Subjt:  LCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG

A0A6J1DM37 uncharacterized protein LOC111022242 | 2.7e-78 | 93.41
Query:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA
        MD++  QRM STLSSS SCPSLRESEDELKHM LAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA
Subjt:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA

Query:  FNEEDG-HKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG
        FNEEDG HKLCGTLPALDLYFAVNRQLSPSPVSTPQS+ASTSSLGGRSSSF SPKS+SDTWRVCSPG
Subjt:  FNEEDG-HKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG

A0A6J1JGY3 uncharacterized protein LOC111485003 | 1.3e-72 | 88.55
Query:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA
        MDKK  QRM S+LSSSSSCPSL ESEDELKHMPLAPPR KNKKRLSKQLSMCETPRDLAWEKRRRQMLRP   RN  TDRDDLTDEDWNELKGCIELGFA
Subjt:  MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFA

Query:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG
        F EEDGHKLCGTLPALDLYFAVNRQLSPSPVSTP S+ S+SSLG RSSSF SPKSES+TWRVCSPG
Subjt:  FNEEDGHKLCGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPG

SwissProt top hits (hit | e value | % identity, followed by the alignment)
No hits found
Arabidopsis top hits (hit | e value | % identity, followed by the alignment)
AT1G08790.1 Protein of unknown function (DUF1685) | 3.2e-39 | 63.52
Query:  SSSSCPSLRESE-DELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFAFNEEDGHKLCGTL
        S+SS  S  +SE +EL+ MPL PP+ K KKRLSKQLSM ET RD+AWE+RRRQML    K N     DDLTDED +ELKG IELGF FNEE G  L  TL
Subjt:  SSSSCPSLRESE-DELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFAFNEEDGHKLCGTL

Query:  PALDLYFAVNRQLSPSPVSTPQSTASTS-----SLGGRSSSFGSPKSESDTWRVCSPGN
        PALDLYFAV RQ+  SPVSTP S  S+S     SLG RSSSFGSP S+SD+ +V SPG+
Subjt:  PALDLYFAVNRQLSPSPVSTPQSTASTS-----SLGGRSSSFGSPKSESDTWRVCSPGN

AT2G43340.1 Protein of unknown function (DUF1685) | 7.7e-09 | 31.65
Query:  SSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTD----RDDLTDEDWNELKGCIELGFAFNEEDGHKL
        SS SSC      E+E+ +        K  K+L K              K+   +L      + V D       LTD+D  ELKGC++LGF FN E+  +L
Subjt:  SSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTD----RDDLTDEDWNELKGCIELGFAFNEEDGHKL

Query:  CGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPGN
        C TLPAL+L ++++++     +       S+SS   +SS   SP S   +W++ SPG+
Subjt:  CGTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPGN

AT3G04700.1 Protein of unknown function (DUF1685) | 9.4e-31 | 56.62
Query:  KNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKR---NGVTD----RDDLTDEDWNELKGCIELGFAFNEEDGHKLCGTLPALDLYFAVNRQLSPSP-V
        K K RLSKQLSMCETPRD+AWE+RRRQM+  + K+    G +D    + +LTDED NELKG IELGF FNEE G KLC TLPALDLYFAVNRQLSP P  
Subjt:  KNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKR---NGVTD----RDDLTDEDWNELKGCIELGFAFNEEDGHKLCGTLPALDLYFAVNRQLSPSP-V

Query:  STPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPGN
        S+ +S+++++S    S      K++SD+ ++  PG+
Subjt:  STPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPGN

AT3G04710.3 ankyrin repeat family protein | 9.4e-31 | 56.62
Query:  KNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKR---NGVTD----RDDLTDEDWNELKGCIELGFAFNEEDGHKLCGTLPALDLYFAVNRQLSPSP-V
        K K RLSKQLSMCETPRD+AWE+RRRQM+  + K+    G +D    + +LTDED NELKG IELGF FNEE G KLC TLPALDLYFAVNRQLSP P  
Subjt:  KNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKR---NGVTD----RDDLTDEDWNELKGCIELGFAFNEEDGHKLCGTLPALDLYFAVNRQLSPSP-V

Query:  STPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPGN
        S+ +S+++++S    S      K++SD+ ++  PG+
Subjt:  STPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPGN

AT5G28690.1 Protein of unknown function (DUF1685) | 1.4e-34 | 59.15
Query:  TLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLR-PERKRNGVTDRD----DLTDEDWNELKGCIELGFAFNEEDG
        TL  SSS PSL ES  ++K   +AP   K K+RLSKQLSM ETPRD+AWEKRRRQML+  E+K+  V++ D    DLTDED  ELKG IELGF F+EE G
Subjt:  TLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLR-PERKRNGVTDRD----DLTDEDWNELKGCIELGFAFNEEDG

Query:  HKLCGTLPALDLYFAVNRQLS--PSPVSTPQSTASTSSLGGRSSSFG-SPKSESDTWRVCSPGN
         KLC TLPALDLYFAVNRQLS  PSP S+     S SS    SSS   SPK++SD+ ++  PG+
Subjt:  HKLCGTLPALDLYFAVNRQLS--PSPVSTPQSTASTSSLGGRSSSFG-SPKSESDTWRVCSPGN


Sequences
CDS sequence
ATGGATAAGAAACACTCGCAGAGGATGCCTTCGACGCTGTCGTCATCGTCGTCGTGCCCGTCGCTGAGGGAGTCGGAGGACGAGCTGAAGCACATGCCATTGGCGCCACC
GCGGCTGAAGAACAAGAAGCGTTTGTCGAAGCAGCTTTCGATGTGCGAGACGCCGCGAGACCTAGCTTGGGAGAAGCGGCGGCGACAGATGCTGCGACCGGAGCGGAAAA
GGAACGGCGTGACGGACAGGGACGACCTGACGGATGAGGACTGGAATGAGCTGAAAGGGTGCATTGAGCTAGGGTTTGCATTCAATGAGGAAGATGGGCACAAGCTGTGC
GGGACATTGCCGGCGCTTGACCTTTACTTCGCCGTCAACCGGCAACTCTCGCCGAGCCCGGTGTCGACGCCGCAGAGCACCGCCTCCACGTCTTCTCTCGGCGGAAGGTC
TTCATCTTTTGGGAGCCCCAAGAGCGAGTCTGATACATGGAGGGTTTGCAGCCCAGGAAATCTTGGAGGCGCAGAGCAGAACACACGCAGCAAAGGAGATCTTCATGCAA
GATCTGCTTTGAGACTTTATTCTGATGACGAGAACGACAAGATTGACAAAGCTGAAGGTGAGATTGATGAATGGGGCTTTTTAGAAGCCAAGTCTGACTATTTCACCCTT
CATTCCGCCGATCACTTTGATTGCCCGAGATTTCTGATGGTCGGCTCGACCAAGCTCCCATCGACCGAGGTCGCCTCGGCCTCCAGGCTTCTTCGATCAGTAGACCCCAC
TCTTGAGTTGGGGTTGTCGAGATCGTTCGTCAAGGCATTGAGCGAGTTGATCCAACTTTGGGAGTATCGTAAGTGGTTAGATCGTTTAGGTACGATGTGGTCTTTCCAAA
GTGAAGCTTTGCCTGAGGCTGTATTTTTCGAGATCACATCATGTACAATATGGTCTTCCCAAGGACAGGCTTTGCCCGAGATGGTACTTCTCGAGATCACATCATGTACG
ATGTGGTCTTCCCAAGGTGAGGCTTGGCCCGAGGCTGTACTTCTCGAGATGAAGCTTTGCCTGAGATGGCATATCATTCATCAGGTGTTCCTTCTCGAGGGTGAAGCTTT
GCCTGAGGTGGTTCTTCCTGAGGTGGCATGTCACGTCCCATCATTAGGTGCTTCTTTCCGAGGTAGCACTGATGATGAGGCGACCACTTTTGATCTGTGCTCTGAAGAAG
GGGACTCTAAAGAAAATCGTCAGGCTGAGGCTCGAAATATGCCTTCCTCTGCTTGTTTCAATCCTACACTTGTATCAAAGGCTAACCTGCTAGCTCAAAGACTCACTTTT
CGTACCGACGTGGAAGACGAAATCTCCACAGAAGGTAACTCTCAGATTTCACTCCTCCAGCAAGAGCGTCTTAGGGCCATGGGAATCAATCCAAATTCTGTCAACCCTCC
CCCTTCAGACGATGCCACAGTTACTGATTTGGTCGAAGATGAGCCCCTTGCGCAACATTAA
mRNA sequence
ATGGATAAGAAACACTCGCAGAGGATGCCTTCGACGCTGTCGTCATCGTCGTCGTGCCCGTCGCTGAGGGAGTCGGAGGACGAGCTGAAGCACATGCCATTGGCGCCACC
GCGGCTGAAGAACAAGAAGCGTTTGTCGAAGCAGCTTTCGATGTGCGAGACGCCGCGAGACCTAGCTTGGGAGAAGCGGCGGCGACAGATGCTGCGACCGGAGCGGAAAA
GGAACGGCGTGACGGACAGGGACGACCTGACGGATGAGGACTGGAATGAGCTGAAAGGGTGCATTGAGCTAGGGTTTGCATTCAATGAGGAAGATGGGCACAAGCTGTGC
GGGACATTGCCGGCGCTTGACCTTTACTTCGCCGTCAACCGGCAACTCTCGCCGAGCCCGGTGTCGACGCCGCAGAGCACCGCCTCCACGTCTTCTCTCGGCGGAAGGTC
TTCATCTTTTGGGAGCCCCAAGAGCGAGTCTGATACATGGAGGGTTTGCAGCCCAGGAAATCTTGGAGGCGCAGAGCAGAACACACGCAGCAAAGGAGATCTTCATGCAA
GATCTGCTTTGAGACTTTATTCTGATGACGAGAACGACAAGATTGACAAAGCTGAAGGTGAGATTGATGAATGGGGCTTTTTAGAAGCCAAGTCTGACTATTTCACCCTT
CATTCCGCCGATCACTTTGATTGCCCGAGATTTCTGATGGTCGGCTCGACCAAGCTCCCATCGACCGAGGTCGCCTCGGCCTCCAGGCTTCTTCGATCAGTAGACCCCAC
TCTTGAGTTGGGGTTGTCGAGATCGTTCGTCAAGGCATTGAGCGAGTTGATCCAACTTTGGGAGTATCGTAAGTGGTTAGATCGTTTAGGTACGATGTGGTCTTTCCAAA
GTGAAGCTTTGCCTGAGGCTGTATTTTTCGAGATCACATCATGTACAATATGGTCTTCCCAAGGACAGGCTTTGCCCGAGATGGTACTTCTCGAGATCACATCATGTACG
ATGTGGTCTTCCCAAGGTGAGGCTTGGCCCGAGGCTGTACTTCTCGAGATGAAGCTTTGCCTGAGATGGCATATCATTCATCAGGTGTTCCTTCTCGAGGGTGAAGCTTT
GCCTGAGGTGGTTCTTCCTGAGGTGGCATGTCACGTCCCATCATTAGGTGCTTCTTTCCGAGGTAGCACTGATGATGAGGCGACCACTTTTGATCTGTGCTCTGAAGAAG
GGGACTCTAAAGAAAATCGTCAGGCTGAGGCTCGAAATATGCCTTCCTCTGCTTGTTTCAATCCTACACTTGTATCAAAGGCTAACCTGCTAGCTCAAAGACTCACTTTT
CGTACCGACGTGGAAGACGAAATCTCCACAGAAGGTAACTCTCAGATTTCACTCCTCCAGCAAGAGCGTCTTAGGGCCATGGGAATCAATCCAAATTCTGTCAACCCTCC
CCCTTCAGACGATGCCACAGTTACTGATTTGGTCGAAGATGAGCCCCTTGCGCAACATTAA
Protein sequence
MDKKHSQRMPSTLSSSSSCPSLRESEDELKHMPLAPPRLKNKKRLSKQLSMCETPRDLAWEKRRRQMLRPERKRNGVTDRDDLTDEDWNELKGCIELGFAFNEEDGHKLC
GTLPALDLYFAVNRQLSPSPVSTPQSTASTSSLGGRSSSFGSPKSESDTWRVCSPGNLGGAEQNTRSKGDLHARSALRLYSDDENDKIDKAEGEIDEWGFLEAKSDYFTL
HSADHFDCPRFLMVGSTKLPSTEVASASRLLRSVDPTLELGLSRSFVKALSELIQLWEYRKWLDRLGTMWSFQSEALPEAVFFEITSCTIWSSQGQALPEMVLLEITSCT
MWSSQGEAWPEAVLLEMKLCLRWHIIHQVFLLEGEALPEVVLPEVACHVPSLGASFRGSTDDEATTFDLCSEEGDSKENRQAEARNMPSSACFNPTLVSKANLLAQRLTF
RTDVEDEISTEGNSQISLLQQERLRAMGINPNSVNPPPSDDATVTDLVEDEPLAQH
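
The relationship between the CDS and protein sequences above can be checked with a short translation sketch. The Python below is illustrative only: the `translate` helper is hypothetical, it assumes the standard genetic code (the page does not say which translation table was used), and it is applied here only to the first 30 nucleotides of the CDS rather than the full record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
# Assumption: the record was translated with the standard table.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = cds.upper().replace("\n", "").replace(" ", "")
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        # Codons containing characters other than T, C, A, G become 'X'.
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 nucleotides of the CDS sequence listed on this page.
cds_start = "ATGGATAAGAAACACTCGCAGAGGATGCCT"
print(translate(cds_start))
```

The translated fragment can be compared against the start of the protein sequence listed above; the same function could be run on the complete CDS after joining its lines.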