; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; ; CuGenDBv2

CSPI07G18300 (gene) of Cucumber (PI 183967) v1 genome

Gene IDCSPI07G18300
OrganismCucumis sativus L. var. sativus cv. PI 183967 (Cucumber (PI 183967) v1)
DescriptionUnknown protein
Genome locationChr7:16374811..16379086
RNA-Seq ExpressionCSPI07G18300
SyntenyCSPI07G18300
Gene Ontology termsGO:0016021 - integral component of membrane (cellular component)
InterPro domainsNA


Homology Show/hide homology
GenBank top hitse value%identityAlignment
KAA0046709.1 putative transmembrane protein [Cucumis melo var. makuwa]4.8e-10993.42Show/hide
Query:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK
        MALRERDL FDLENGGKI+EE GSGEPSSTKRDVKN WSRLTEDSLLKDERAIASNSNFAN++ D+IADESLGLLIDKNLEGED HEVF H+EK+NARGK
Subjt:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
        HKNKKKA KPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS

Query:  GFISVQYIKSFPPNESNVSNPPSFNSAA
        GFISVQYIKSFPP+ES+VSNPPSFNSAA
Subjt:  GFISVQYIKSFPPNESNVSNPPSFNSAA

TYK18245.1 putative transmembrane protein [Cucumis melo var. makuwa]7.0e-10892.98Show/hide
Query:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK
        MALRERDL FDLENGGKI+EE GSGEPSSTKRDVKN WSRLTEDSLLKDERAIASNSNFAN++ D+IADESLGLLIDKNLEGED HEVF H+EK+NARGK
Subjt:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
        HKNKKKA KPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS

Query:  GFISVQYIKSFPPNESNVSNPPSFNSAA
        GFISVQYIKSFPP+ES+VSN PSFNSAA
Subjt:  GFISVQYIKSFPPNESNVSNPPSFNSAA

XP_008451449.1 PREDICTED: uncharacterized protein LOC103492741 [Cucumis melo]1.8e-10892.98Show/hide
Query:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK
        MALRERDL FDLENGGKI+EE GSGEPSSTKRDVKN WSRLTEDSLLKDERAIASNSNFAN++ D+IADESLGLLI+KNLEGED HEVF H+EK+NARGK
Subjt:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
        HKNKKKA KPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS

Query:  GFISVQYIKSFPPNESNVSNPPSFNSAA
        GFISVQYIKSFPP+ES+VSNPPSFNSAA
Subjt:  GFISVQYIKSFPPNESNVSNPPSFNSAA

XP_031744965.1 uncharacterized protein LOC101218752 [Cucumis sativus]6.3e-117100Show/hide
Query:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK
        MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK
Subjt:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
        HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS

Query:  GFISVQYIKSFPPNESNVSNPPSFNSAA
        GFISVQYIKSFPPNESNVSNPPSFNSAA
Subjt:  GFISVQYIKSFPPNESNVSNPPSFNSAA

XP_038898210.1 uncharacterized protein LOC120085950 [Benincasa hispida]7.2e-10589.04Show/hide
Query:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK
        MALRERDL  DLE+GGKI++EVGS EPSS KRDVKN WSRLT+DSLLKDER +ASNSNFANS+ ++IADE++ +LIDKNLEGED HEVFVH+EKNNARGK
Subjt:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
        HKNKKKAPKPPRPPKGPSLDAADRMMVKE+AVLAMKKRARAERMKALKKAKAEKTSSFNSCIPA+IITFLFFLVIIIQGIS RSSSILQGSPEPAVGGSS
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS

Query:  GFISVQYIKSFPPNESNVSNPPSFNSAA
        GFISVQYIKSFPPNESN+SNPPSFNSAA
Subjt:  GFISVQYIKSFPPNESNVSNPPSFNSAA

TrEMBL top hitse value%identityAlignment
A0A0A0K5U6 Uncharacterized protein4.1e-11493.44Show/hide
Query:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK
        MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK
Subjt:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQ----------------GISPRS
        HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQ                GISPRS
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQ----------------GISPRS

Query:  SSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPPSFNSAA
        SSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPPSFNSAA
Subjt:  SSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSNPPSFNSAA

A0A1S4DYQ5 uncharacterized protein LOC1034927418.9e-10992.98Show/hide
Query:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK
        MALRERDL FDLENGGKI+EE GSGEPSSTKRDVKN WSRLTEDSLLKDERAIASNSNFAN++ D+IADESLGLLI+KNLEGED HEVF H+EK+NARGK
Subjt:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
        HKNKKKA KPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS

Query:  GFISVQYIKSFPPNESNVSNPPSFNSAA
        GFISVQYIKSFPP+ES+VSNPPSFNSAA
Subjt:  GFISVQYIKSFPPNESNVSNPPSFNSAA

A0A5A7TZQ1 Putative transmembrane protein2.3e-10993.42Show/hide
Query:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK
        MALRERDL FDLENGGKI+EE GSGEPSSTKRDVKN WSRLTEDSLLKDERAIASNSNFAN++ D+IADESLGLLIDKNLEGED HEVF H+EK+NARGK
Subjt:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
        HKNKKKA KPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS

Query:  GFISVQYIKSFPPNESNVSNPPSFNSAA
        GFISVQYIKSFPP+ES+VSNPPSFNSAA
Subjt:  GFISVQYIKSFPPNESNVSNPPSFNSAA

A0A5D3D3X4 Putative transmembrane protein3.4e-10892.98Show/hide
Query:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK
        MALRERDL FDLENGGKI+EE GSGEPSSTKRDVKN WSRLTEDSLLKDERAIASNSNFAN++ D+IADESLGLLIDKNLEGED HEVF H+EK+NARGK
Subjt:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
        HKNKKKA KPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS

Query:  GFISVQYIKSFPPNESNVSNPPSFNSAA
        GFISVQYIKSFPP+ES+VSN PSFNSAA
Subjt:  GFISVQYIKSFPPNESNVSNPPSFNSAA

A0A6J1JVV1 uncharacterized protein LOC1114883452.1e-9481.58Show/hide
Query:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK
        MALRERDL FDLE+GG+I EEVGS EPSS KRDVKN W+RLTED+LLKDE  +ASNSNFANS+ DV+AD ++ LLIDKNLEGED  E F HVEK NARGK
Subjt:  MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGK

Query:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS
        HKNKKKA KPPRPPKGPSLDAADR +VKE+AV+AMKKRAR ERMKAL+K+KAEKTSSFNSCIPA+IITFLFFLVII+QGIS RSS +LQGSPEPAV GSS
Subjt:  HKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSS

Query:  GFISVQYIKSFPPNESNVSNPPSFNSAA
        GFISVQYIKSFPPNESN++NPP  NSAA
Subjt:  GFISVQYIKSFPPNESNVSNPPSFNSAA

SwissProt top hitse value%identityAlignment
No hits found
Arabidopsis top hitse value%identityAlignment
AT1G02380.1 unknown protein3.4e-2038.76Show/hide
Query:  IEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFAN-SITDVIADE-SLGLLIDKNLEGEDVHEVFVHVEKNNARGKHKNKKKAPKPPRPPKG
        ++++ SGE    + D++   S +T++S           S  AN  +++ IAD+ S  L+ D+N   E   +     EK    GK K  +KA KPPRPPKG
Subjt:  IEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFAN-SITDVIADE-SLGLLIDKNLEGEDVHEVFVHVEKNNARGKHKNKKKAPKPPRPPKG

Query:  PSLDAADRMMVKELAVLAMKKRARAERM-KALKKAKAEKTSSFNSCIP--ALIITFLFFLVIIIQGISPRSSSI-LQGSPEPAVGGSSGFISVQYIKSFP
        PSL   DR +++++  LAM+KRAR ERM K+LK+ KA KTS  + CI   ++IIT +FF  ++ QG S  SSS+    SP P V  ++  ISVQ+   F 
Subjt:  PSLDAADRMMVKELAVLAMKKRARAERM-KALKKAKAEKTSSFNSCIP--ALIITFLFFLVIIIQGISPRSSSI-LQGSPEPAVGGSSGFISVQYIKSFP

Query:  PNESNVSNP
        P E    +P
Subjt:  PNESNVSNP

AT3G17120.1 unknown protein5.9e-2051.33Show/hide
Query:  KHKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSC---IPALIITFLFFLVIIIQGISPRSSSILQGSPEPAV
        K K KK A KPPRPP+GPSLDAAD+ +++E+A LAM KRAR ERM+ALKK++A K +S  S    + A + T +FF V++ QG+SPR++    G     V
Subjt:  KHKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSC---IPALIITFLFFLVIIIQGISPRSSSILQGSPEPAV

Query:  GG--SSGFISVQY
         G  + GF+SVQY
Subjt:  GG--SSGFISVQY

AT3G17120.2 unknown protein5.9e-2051.33Show/hide
Query:  KHKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSC---IPALIITFLFFLVIIIQGISPRSSSILQGSPEPAV
        K K KK A KPPRPP+GPSLDAAD+ +++E+A LAM KRAR ERM+ALKK++A K +S  S    + A + T +FF V++ QG+SPR++    G     V
Subjt:  KHKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSC---IPALIITFLFFLVIIIQGISPRSSSILQGSPEPAV

Query:  GG--SSGFISVQY
         G  + GF+SVQY
Subjt:  GG--SSGFISVQY

AT4G01960.1 unknown protein5.5e-1845.3Show/hide
Query:  KHKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSI-LQGSPEPAVGG
        K K  +K  KPPRPPKGP L A D+ +++E+  LAM+KRAR ERMK L++ KA K+SS  S I A+I+T +FF+ +I QG    ++S+    SP P    
Subjt:  KHKNKKKAPKPPRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSI-LQGSPEPAVGG

Query:  SSGFISVQYIKSFPPNE
        ++  +SVQ+   F P E
Subjt:  SSGFISVQYIKSFPPNE


Sequences Show/hide sequences
CDS sequenceShow/hide CDS sequence
ATGGCTTTAAGAGAAAGGGATCTCATATTTGATCTCGAAAATGGGGGGAAGATCATTGAAGAGGTTGGAAGTGGGGAACCGAGTTCAACCAAAAGAGATGTAAAGAATTT
TTGGAGTAGGTTGACAGAGGATTCACTGCTGAAAGATGAGCGAGCCATAGCCTCAAACAGTAATTTTGCTAATTCTATTACCGACGTCATTGCTGATGAGAGCTTAGGAT
TGTTGATAGATAAGAACTTGGAAGGAGAAGATGTTCATGAAGTTTTTGTGCACGTGGAGAAAAATAATGCTAGAGGGAAGCATAAGAATAAGAAAAAGGCTCCGAAGCCA
CCACGGCCTCCCAAAGGTCCTTCGCTTGATGCTGCGGACAGAATGATGGTGAAGGAACTTGCAGTGCTTGCTATGAAAAAACGTGCAAGAGCTGAGAGAATGAAAGCATT
GAAGAAGGCAAAAGCAGAGAAAACATCTTCTTTCAATAGTTGCATACCTGCCTTGATTATCACATTCCTCTTCTTCCTTGTCATTATCATTCAAGGAATAAGCCCCAGAA
GCAGTTCGATATTGCAGGGGTCACCTGAACCTGCTGTTGGTGGTAGTAGTGGGTTCATTTCTGTTCAGTACATTAAAAGCTTTCCCCCAAATGAAAGCAATGTATCCAAT
CCCCCCTCGTTTAATTCTGCTGCATAG
mRNA sequenceShow/hide mRNA sequence
ATTAAACTATATTATTCTTTTTCCAATAATGCTTTGTTTTAAATTTCAGATGATCAGAAGAAAAATATCGAAATCGATTTGGAATTTGTCGGTCTGAGTAACTCATGGGT
TAGATTCTTTGGAGGTCCCTTTGATGTCCTTGAGATTGACTTTGAGAAATTTGCATTGGGATTGAACATTCATCTTCAATTGTGGTGTTCGACTTCCCTTTTCTCTATTC
TTTTGTGGTGTGAAGTGAATTTTGGCTTTTATAGTCAAAAGGCTATGAGATTGAGTACATGGCTTTAAGAGAAAGGGATCTCATATTTGATCTCGAAAATGGGGGGAAGA
TCATTGAAGAGGTTGGAAGTGGGGAACCGAGTTCAACCAAAAGAGATGTAAAGAATTTTTGGAGTAGGTTGACAGAGGATTCACTGCTGAAAGATGAGCGAGCCATAGCC
TCAAACAGTAATTTTGCTAATTCTATTACCGACGTCATTGCTGATGAGAGCTTAGGATTGTTGATAGATAAGAACTTGGAAGGAGAAGATGTTCATGAAGTTTTTGTGCA
CGTGGAGAAAAATAATGCTAGAGGGAAGCATAAGAATAAGAAAAAGGCTCCGAAGCCACCACGGCCTCCCAAAGGTCCTTCGCTTGATGCTGCGGACAGAATGATGGTGA
AGGAACTTGCAGTGCTTGCTATGAAAAAACGTGCAAGAGCTGAGAGAATGAAAGCATTGAAGAAGGCAAAAGCAGAGAAAACATCTTCTTTCAATAGTTGCATACCTGCC
TTGATTATCACATTCCTCTTCTTCCTTGTCATTATCATTCAAGGAATAAGCCCCAGAAGCAGTTCGATATTGCAGGGGTCACCTGAACCTGCTGTTGGTGGTAGTAGTGG
GTTCATTTCTGTTCAGTACATTAAAAGCTTTCCCCCAAATGAAAGCAATGTATCCAATCCCCCCTCGTTTAATTCTGCTGCATAGCTCGTTACTGATTTTGTTCCTCTAG
AAGGCAAAAGGGTGGCTGGTTTGAGGAGTTTTTTTTGAAGTTGAGATCCTAGGTGCCAGCATGGTGGCTCGTATGGCCTTGATTTTTGAATAGCTATTTTAATATGCAGT
TGCGTTTGACTGTTTTTGTACATAAATACGAAGAATGTCTTTTTTTATCTTTCTTTCTTTTTGTGATCAAATGCTTTTTGTATTTTAACTTTATGTTAATAATTTATATA
AAAAAGTTCATGGTGACGTTCAAGGATGTCAGTAGCTGAACCTTCTTAAAGGCTAAGGAGAAAATCCTGCTTGCTTCCAGGCTTTCTTTCGTTGAATTGAATTATGAACT
TTTTCCTTTTTGATTTTGATAACAAACCTGCAGAATTATGATGATATGATTGGCTGATGCTTAACCATTTGCACCTTTGATTGGCTGCTGTTAAACAACATCCAATCTGT
TCAAGAACAAACAGCAATATCTTTACAGATACATGTACGAAAGCATCCCATGTCGACCTTTTCATTTCGAGTCGCAGGTTCATCGTTGCAGCAAATGACTGACATGAAAG
GGTATAGCCAAGATCACCCTCGTAGAAATGTTAGTGAATGGTAAAGAAAATTTTGCTCGCAGTTTAAAGGAATGTCTACCTGCCTGCAATGTGTTGATTTGTAACCACTA
CGAAGATCACACTTTTTTGTCGAAGGAGTACAAGGTTGCAATGCCATGGATGTAGCTCGGGAGGTCGAGGCCTTGCAACGGAGCCATTCAGTGAAAGGTATAGGGATAGG
CATTGACATCATGGGACCCTGCTTTGCTGAAATCTCTGCTTGTATGATCATCTGTGGGTTGGATAACAAAATGGTATTTGTCCAATTTGGTTCTATTCAAGAGATGGGGC
AATTTCTTTCGCAATTAATTGAATATTGTGGTACCCAAAGCAACACGCAACGCACCAACGTGAGTAGAATTTTGAGCACGTGTTTTGGAATTTTCTCTCTGTCACAACTT
CATTGAAAATTTGAACTATTTTCTCATGGTGAGGGACTAAAATGTCAAAATTTTAAATTTAAGAACTAAATAGAAACATAGAGACGTTTTTTGTACAAATCATACAATTC
TTCTTAGGGAAATAGATAAAAAGAAAGGTATAAGAAGAAGTATTTTAGACCATTCAAATCA
Protein sequenceShow/hide protein sequence
MALRERDLIFDLENGGKIIEEVGSGEPSSTKRDVKNFWSRLTEDSLLKDERAIASNSNFANSITDVIADESLGLLIDKNLEGEDVHEVFVHVEKNNARGKHKNKKKAPKP
PRPPKGPSLDAADRMMVKELAVLAMKKRARAERMKALKKAKAEKTSSFNSCIPALIITFLFFLVIIIQGISPRSSSILQGSPEPAVGGSSGFISVQYIKSFPPNESNVSN
PPSFNSAA